BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>047793
IAASQVTSRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNK
PYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTP
IKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFK
FIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAV
SIDASGSAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYI
RMKRDIDAKEGLCGIAMDSSYPTA
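
The query record above is plain FASTA, so its length can be recomputed and compared with the "(324 letters)" figure printed in the raw report. The following is a minimal Python sketch of that check; the helper name `read_fasta` is hypothetical and is not part of any BLAST distribution.

```python
def read_fasta(text):
    """Return a list of (header, sequence) pairs from FASTA-formatted text.

    Hypothetical helper for illustration only.
    """
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)  # sequence lines are concatenated
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


if __name__ == "__main__":
    demo = ">demo\nACDE\nFGH\n"  # toy record, not the query above
    for name, seq in read_fasta(demo):
        print(name, len(seq))
```

Pasting the `>047793` record into `read_fasta` and taking `len()` of the sequence should reproduce the letter count reported under `Query=`.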

High Scoring Gene Products

Symbol, full name Information P value
SAG12
senescence-associated gene 12
protein from Arabidopsis thaliana 1.8e-96
AT2G27420 protein from Arabidopsis thaliana 4.2e-95
CEP1
cysteine endopeptidase 1
protein from Arabidopsis thaliana 3.0e-92
AT3G49340 protein from Arabidopsis thaliana 3.0e-92
AT2G34080 protein from Arabidopsis thaliana 8.6e-88
AT1G29090 protein from Arabidopsis thaliana 1.3e-86
AT3G19390 protein from Arabidopsis thaliana 3.3e-86
RD21B
responsive to dehydration 21B
protein from Arabidopsis thaliana 5.4e-86
RD21A
responsive to dehydration 21A
protein from Arabidopsis thaliana 3.0e-85
XCP1
xylem cysteine peptidase 1
protein from Arabidopsis thaliana 6.2e-85
CEP3
cysteine endopeptidase 3
protein from Arabidopsis thaliana 1.9e-83
XCP2
AT1G20850
protein from Arabidopsis thaliana 5.0e-83
AT3G19400 protein from Arabidopsis thaliana 1.3e-82
XBCP3
xylem bark cysteine peptidase 3
protein from Arabidopsis thaliana 3.2e-81
AT1G29080 protein from Arabidopsis thaliana 1.1e-80
AT1G06260 protein from Arabidopsis thaliana 2.9e-80
CP1
cysteine protease 1
protein from Arabidopsis thaliana 2.7e-77
AT4G23520 protein from Arabidopsis thaliana 1.1e-76
CP2
cysteine protease 2
protein from Arabidopsis thaliana 1.0e-75
AT1G29110 protein from Arabidopsis thaliana 1.7e-68
Ctsll3
cathepsin L-like 3
gene from Rattus norvegicus 2.1e-68
Ctsl
cathepsin L
protein from Mus musculus 5.8e-66
AT3G43960 protein from Arabidopsis thaliana 9.4e-66
RGD1308751
similar to Cathepsin L precursor (Major excreted protein) (MEP)
gene from Rattus norvegicus 1.5e-65
CTSL2
Uncharacterized protein
protein from Gallus gallus 2.0e-65
Ctsl1
cathepsin L1
gene from Rattus norvegicus 3.2e-65
ctsl.1
cathepsin L.1
gene_product from Danio rerio 3.2e-65
CTSL2
Cathepsin L2
protein from Homo sapiens 5.2e-65
CTSL1
Cathepsin L1
protein from Bos taurus 1.4e-64
cprE
cysteine proteinase 5
gene from Dictyostelium discoideum 1.4e-64
cprB
cysteine proteinase 2
gene from Dictyostelium discoideum 2.9e-64
CTSL1
Cathepsin L1
protein from Canis lupus familiaris 7.6e-64
zgc:174855 gene_product from Danio rerio 9.7e-64
CTSL2
Cathepsin L2
protein from Bos taurus 1.2e-63
Cp1
Cysteine proteinase-1
protein from Drosophila melanogaster 1.6e-63
wu:fb37b09 gene_product from Danio rerio 3.3e-63
ctsl1a
cathepsin L, 1 a
gene_product from Danio rerio 6.9e-63
zgc:174153 gene_product from Danio rerio 8.8e-63
CTSL1
CTSL1 protein
protein from Bos taurus 1.1e-62
CTSL1
Cathepsin L1
protein from Sus scrofa 1.1e-62
CTSL1
Cathepsin L1
protein from Homo sapiens 3.0e-62
ctsl1b
cathepsin L, 1 b
gene_product from Danio rerio 3.0e-62
Ssc.54235
Uncharacterized protein
protein from Sus scrofa 3.8e-62
ctsll
cathepsin L, like
gene_product from Danio rerio 3.8e-62
Cys
Crustapain
protein from Pandalus borealis 7.9e-62
P83654
Ervatamin-C
protein from Tabernaemontana divaricata 1.6e-61
ctssb.2
cathepsin S, b.2
gene_product from Danio rerio 2.7e-61
Ctss
cathepsin S
protein from Mus musculus 5.5e-61
CTSS
Uncharacterized protein
protein from Sus scrofa 7.1e-61
CTSS
Cathepsin S
protein from Bos taurus 9.0e-61
P83443
Macrodontain-1
protein from Pseudananas sagenarius 1.5e-60
cprC
cysteine proteinase 3
gene from Dictyostelium discoideum 5.0e-60
DDB_G0272298 gene from Dictyostelium discoideum 5.0e-60
cpl-1 gene from Caenorhabditis elegans 1.0e-59
CTSS
Cathepsin S
protein from Canis lupus familiaris 2.2e-59
CTSL1
Cathepsin L1
protein from Canis lupus familiaris 2.2e-59
CTSS
Cathepsin S
protein from Canis lupus familiaris 4.5e-59
Ctsk
cathepsin K
gene from Rattus norvegicus 7.3e-59
CTSS
Cathepsin S
protein from Homo sapiens 1.5e-58
ctsk
cathepsin K
gene_product from Danio rerio 3.2e-58
Ctsk
cathepsin K
protein from Mus musculus 2.2e-57
CTSK
Cathepsin K
protein from Homo sapiens 2.8e-57
CTSK
Cathepsin K
protein from Sus scrofa 4.6e-57
Testin
testin gene
gene from Rattus norvegicus 4.6e-57
CTSK
Cathepsin K
protein from Canis lupus familiaris 5.9e-57
CTSK
Cathepsin K
protein from Canis lupus familiaris 5.9e-57
MGC114246
similar to cathepsin R
gene from Rattus norvegicus 1.2e-56
CTSK
Cathepsin K
protein from Bos taurus 2.0e-56
LOC420160
Uncharacterized protein
protein from Gallus gallus 2.5e-56
4930486L24Rik
RIKEN cDNA 4930486L24 gene
protein from Mus musculus 2.5e-56
cfaD
peptidase C1A family protein
gene from Dictyostelium discoideum 4.2e-56
F1NHB8
Uncharacterized protein
protein from Gallus gallus 5.3e-56
cprH
cysteine proteinase 8
gene from Dictyostelium discoideum 8.6e-56
Cat-1
Cathepsin L-like proteinase
protein from Fasciola hepatica 8.6e-56
AT3G45310 protein from Arabidopsis thaliana 1.1e-55
Ctsr
cathepsin R
protein from Mus musculus 1.4e-55
CTSL
Cathepsin L1
protein from Ovis aries 1.8e-55
Ctsq
cathepsin Q
gene from Rattus norvegicus 3.7e-55
Ctsj
cathepsin J
gene from Rattus norvegicus 3.7e-55
ctssb.1
cathepsin S, b.1
gene_product from Danio rerio 4.8e-55
CTSK
Cathepsin K
protein from Gallus gallus 6.1e-55
zgc:110239 gene_product from Danio rerio 3.4e-54
Ctsm
cathepsin M
protein from Mus musculus 4.3e-54
ctssa
cathepsin S, a
gene_product from Danio rerio 5.5e-54
DDB_G0291191
cysteine protease
gene from Dictyostelium discoideum 7.0e-54
Cts7
cathepsin 7
protein from Mus musculus 1.1e-53
CTSH
Uncharacterized protein
protein from Ailuropoda melanoleuca 2.4e-53
ctsh
cathepsin H
gene_product from Danio rerio 2.4e-53
CTSH
Uncharacterized protein
protein from Macaca mulatta 3.0e-53
26-29-p
26-29kD-proteinase
protein from Drosophila melanogaster 3.8e-53
CTSL2
Uncharacterized protein
protein from Gallus gallus 3.8e-53
Ctss
cathepsin S
gene from Rattus norvegicus 3.8e-53
ALP
aleurain-like protease
protein from Arabidopsis thaliana 3.8e-53
CTSL1
Cathepsin L1
protein from Gallus gallus 4.9e-53
CTSH
Uncharacterized protein
protein from Nomascus leucogenys 4.9e-53
Ctsj
cathepsin J
protein from Mus musculus 4.9e-53
Cts7
cathepsin 7
gene from Rattus norvegicus 6.3e-53
CTSH
Uncharacterized protein
protein from Callithrix jacchus 8.0e-53
CTSH
Uncharacterized protein
protein from Callithrix jacchus 1.0e-52
D3ZZR3
Uncharacterized protein
protein from Rattus norvegicus 1.3e-52


Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  047793
        (324 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2152445 - symbol:SAG12 "senescence-associated ...   959  1.8e-96   1
TAIR|locus:2038588 - symbol:AT2G27420 species:3702 "Arabi...   946  4.2e-95   1
TAIR|locus:2157712 - symbol:CEP1 "cysteine endopeptidase ...   919  3.0e-92   1
TAIR|locus:2082881 - symbol:AT3G49340 species:3702 "Arabi...   919  3.0e-92   1
TAIR|locus:2055440 - symbol:AT2G34080 species:3702 "Arabi...   877  8.6e-88   1
TAIR|locus:2029924 - symbol:AT1G29090 species:3702 "Arabi...   866  1.3e-86   1
TAIR|locus:2090614 - symbol:AT3G19390 species:3702 "Arabi...   862  3.3e-86   1
TAIR|locus:2167821 - symbol:RD21B "responsive to dehydrat...   860  5.4e-86   1
TAIR|locus:2825832 - symbol:RD21A "responsive to dehydrat...   853  3.0e-85   1
TAIR|locus:2122113 - symbol:XCP1 "xylem cysteine peptidas...   850  6.2e-85   1
TAIR|locus:505006391 - symbol:CEP3 "cysteine endopeptidas...   836  1.9e-83   1
TAIR|locus:2030427 - symbol:XCP2 "xylem cysteine peptidas...   832  5.0e-83   1
TAIR|locus:2090629 - symbol:AT3G19400 species:3702 "Arabi...   828  1.3e-82   1
TAIR|locus:2024362 - symbol:XBCP3 "xylem bark cysteine pe...   815  3.2e-81   1
TAIR|locus:2029934 - symbol:AT1G29080 species:3702 "Arabi...   810  1.1e-80   1
TAIR|locus:2038515 - symbol:AT1G06260 species:3702 "Arabi...   806  2.9e-80   1
TAIR|locus:2128243 - symbol:AT4G11310 species:3702 "Arabi...   778  2.7e-77   1
TAIR|locus:2117979 - symbol:AT4G23520 species:3702 "Arabi...   772  1.1e-76   1
TAIR|locus:2128253 - symbol:AT4G11320 species:3702 "Arabi...   763  1.0e-75   1
TAIR|locus:2030027 - symbol:AT1G29110 species:3702 "Arabi...   695  1.7e-68   1
RGD|1560071 - symbol:Ctsll3 "cathepsin L-like 3" species:...   694  2.1e-68   1
MGI|MGI:88564 - symbol:Ctsl "cathepsin L" species:10090 "...   671  5.8e-66   1
TAIR|locus:2097104 - symbol:AT3G43960 species:3702 "Arabi...   669  9.4e-66   1
RGD|1308751 - symbol:RGD1308751 "similar to Cathepsin L p...   667  1.5e-65   1
UNIPROTKB|F1NYJ1 - symbol:CTSL2 "Uncharacterized protein"...   666  2.0e-65   1
RGD|2448 - symbol:Ctsl1 "cathepsin L1" species:10116 "Rat...   664  3.2e-65   1
ZFIN|ZDB-GENE-040718-61 - symbol:ctsl.1 "cathepsin L.1" s...   664  3.2e-65   1
UNIPROTKB|O60911 - symbol:CTSL2 "Cathepsin L2" species:96...   662  5.2e-65   1
UNIPROTKB|P25975 - symbol:CTSL1 "Cathepsin L1" species:99...   658  1.4e-64   1
DICTYBASE|DDB_G0272815 - symbol:cprE "cysteine proteinase...   537  1.4e-64   2
DICTYBASE|DDB_G0279799 - symbol:cprB "cysteine proteinase...   534  2.9e-64   2
UNIPROTKB|Q9GL24 - symbol:CTSL1 "Cathepsin L1" species:96...   651  7.6e-64   1
ZFIN|ZDB-GENE-071004-74 - symbol:zgc:174855 "zgc:174855" ...   650  9.7e-64   1
UNIPROTKB|Q5E998 - symbol:CTSL2 "Cathepsin L2" species:99...   649  1.2e-63   1
FB|FBgn0013770 - symbol:Cp1 "Cysteine proteinase-1" speci...   648  1.6e-63   1
ZFIN|ZDB-GENE-030131-572 - symbol:wu:fb37b09 "wu:fb37b09"...   645  3.3e-63   1
ZFIN|ZDB-GENE-030131-106 - symbol:ctsl1a "cathepsin L, 1 ...   642  6.9e-63   1
ZFIN|ZDB-GENE-080215-7 - symbol:zgc:174153 "zgc:174153" s...   641  8.8e-63   1
UNIPROTKB|A4IFS7 - symbol:CTSL1 "CTSL1 protein" species:9...   640  1.1e-62   1
UNIPROTKB|Q28944 - symbol:CTSL1 "Cathepsin L1" species:98...   640  1.1e-62   1
UNIPROTKB|P07711 - symbol:CTSL1 "Cathepsin L1" species:96...   636  3.0e-62   1
ZFIN|ZDB-GENE-980526-285 - symbol:ctsl1b "cathepsin L, 1 ...   636  3.0e-62   1
UNIPROTKB|F1S4J6 - symbol:Ssc.54235 "Cathepsin L1" specie...   635  3.8e-62   1
ZFIN|ZDB-GENE-041010-76 - symbol:ctsll "cathepsin L, like...   635  3.8e-62   1
UNIPROTKB|Q86GF7 - symbol:Cys "Crustapain" species:6703 "...   632  7.9e-62   1
UNIPROTKB|P83654 - symbol:P83654 "Ervatamin-C" species:52...   629  1.6e-61   1
ZFIN|ZDB-GENE-050626-55 - symbol:ctssb.2 "cathepsin S, b....   627  2.7e-61   1
MGI|MGI:107341 - symbol:Ctss "cathepsin S" species:10090 ...   624  5.5e-61   1
UNIPROTKB|F1SS93 - symbol:CTSS "Uncharacterized protein" ...   623  7.1e-61   1
UNIPROTKB|P25326 - symbol:CTSS "Cathepsin S" species:9913...   622  9.0e-61   1
UNIPROTKB|P83443 - symbol:P83443 "Macrodontain-1" species...   620  1.5e-60   1
DICTYBASE|DDB_G0283867 - symbol:cprC "cysteine proteinase...   615  5.0e-60   1
DICTYBASE|DDB_G0272298 - symbol:DDB_G0272298 species:4468...   615  5.0e-60   1
WB|WBGene00000776 - symbol:cpl-1 species:6239 "Caenorhabd...   612  1.0e-59   1
UNIPROTKB|F1PAK0 - symbol:CTSS "Cathepsin S" species:9615...   609  2.2e-59   1
UNIPROTKB|F1PMM9 - symbol:CTSL1 "Cathepsin L1" species:96...   609  2.2e-59   1
UNIPROTKB|Q8HY81 - symbol:CTSS "Cathepsin S" species:9615...   606  4.5e-59   1
RGD|61810 - symbol:Ctsk "cathepsin K" species:10116 "Ratt...   604  7.3e-59   1
UNIPROTKB|P25774 - symbol:CTSS "Cathepsin S" species:9606...   601  1.5e-58   1
ZFIN|ZDB-GENE-001205-4 - symbol:ctsk "cathepsin K" specie...   598  3.2e-58   1
MGI|MGI:107823 - symbol:Ctsk "cathepsin K" species:10090 ...   590  2.2e-57   1
UNIPROTKB|P43235 - symbol:CTSK "Cathepsin K" species:9606...   589  2.8e-57   1
UNIPROTKB|Q9GLE3 - symbol:CTSK "Cathepsin K" species:9823...   587  4.6e-57   1
RGD|708447 - symbol:Testin "testin gene" species:10116 "R...   587  4.6e-57   1
UNIPROTKB|G1K2A7 - symbol:CTSK "Cathepsin K" species:9615...   586  5.9e-57   1
UNIPROTKB|Q3ZKN1 - symbol:CTSK "Cathepsin K" species:9615...   586  5.9e-57   1
RGD|1562210 - symbol:MGC114246 "similar to cathepsin R" s...   583  1.2e-56   1
UNIPROTKB|Q5E968 - symbol:CTSK "Cathepsin K" species:9913...   581  2.0e-56   1
UNIPROTKB|F1NZ37 - symbol:LOC420160 "Uncharacterized prot...   580  2.5e-56   1
MGI|MGI:1922258 - symbol:4930486L24Rik "RIKEN cDNA 493048...   580  2.5e-56   1
DICTYBASE|DDB_G0281605 - symbol:cfaD "peptidase C1A famil...   578  4.2e-56   1
UNIPROTKB|F1NHB8 - symbol:F1NHB8 "Uncharacterized protein...   577  5.3e-56   1
DICTYBASE|DDB_G0278401 - symbol:cprH "cysteine proteinase...   575  8.6e-56   1
UNIPROTKB|Q24940 - symbol:Cat-1 "Cathepsin L-like protein...   575  8.6e-56   1
TAIR|locus:2078312 - symbol:AT3G45310 species:3702 "Arabi...   574  1.1e-55   1
MGI|MGI:1861723 - symbol:Ctsr "cathepsin R" species:10090...   573  1.4e-55   1
UNIPROTKB|Q10991 - symbol:CTSL "Cathepsin L1" species:994...   572  1.8e-55   1
RGD|631421 - symbol:Ctsq "cathepsin Q" species:10116 "Rat...   569  3.7e-55   1
RGD|69241 - symbol:Ctsj "cathepsin J" species:10116 "Ratt...   569  3.7e-55   1
ZFIN|ZDB-GENE-050522-559 - symbol:ctssb.1 "cathepsin S, b...   568  4.8e-55   1
UNIPROTKB|Q90686 - symbol:CTSK "Cathepsin K" species:9031...   567  6.1e-55   1
ZFIN|ZDB-GENE-050417-107 - symbol:zgc:110239 "zgc:110239"...   560  3.4e-54   1
MGI|MGI:1927229 - symbol:Ctsm "cathepsin M" species:10090...   559  4.3e-54   1
ZFIN|ZDB-GENE-040426-1583 - symbol:ctssa "cathepsin S, a"...   558  5.5e-54   1
DICTYBASE|DDB_G0291191 - symbol:DDB_G0291191 "cysteine pr...   557  7.0e-54   1
MGI|MGI:1860262 - symbol:Cts7 "cathepsin 7" species:10090...   555  1.1e-53   1
UNIPROTKB|G1M0X4 - symbol:CTSH "Uncharacterized protein" ...   552  2.4e-53   1
ZFIN|ZDB-GENE-030131-3539 - symbol:ctsh "cathepsin H" spe...   552  2.4e-53   1
UNIPROTKB|F6R7P5 - symbol:CTSH "Uncharacterized protein" ...   551  3.0e-53   1
FB|FBgn0250848 - symbol:26-29-p "26-29kD-proteinase" spec...   550  3.8e-53   1
UNIPROTKB|F1NEC8 - symbol:CTSL2 "Uncharacterized protein"...   550  3.8e-53   1
RGD|621513 - symbol:Ctss "cathepsin S" species:10116 "Rat...   550  3.8e-53   1
TAIR|locus:2175088 - symbol:ALP "aleurain-like protease" ...   550  3.8e-53   1
UNIPROTKB|P09648 - symbol:CTSL1 "Cathepsin L1" species:90...   549  4.9e-53   1
UNIPROTKB|G1RBY1 - symbol:CTSH "Uncharacterized protein" ...   549  4.9e-53   1
MGI|MGI:1349426 - symbol:Ctsj "cathepsin J" species:10090...   549  4.9e-53   1
RGD|1309226 - symbol:Cts7 "cathepsin 7" species:10116 "Ra...   548  6.3e-53   1
UNIPROTKB|F7BRD4 - symbol:CTSH "Uncharacterized protein" ...   547  8.0e-53   1
UNIPROTKB|F7B939 - symbol:CTSH "Uncharacterized protein" ...   546  1.0e-52   1
UNIPROTKB|D3ZZR3 - symbol:D3ZZR3 "Uncharacterized protein...   545  1.3e-52   1

WARNING:  Descriptions of 195 database sequences were not reported due to the
          limiting value of parameter V = 100.
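
Each line of the hit table above ends with three numeric columns: the high score, the smallest sum probability P(N), and N. The following is a minimal Python sketch for pulling those fields out of one line. The function name `parse_hit_line` and the regular expression are assumptions made for illustration; they are fitted to the layout shown in this report, not taken from a documented format specification.

```python
import re

# Assumed layout: "<database id> - <truncated description>  <score>  <P(N)>  <N>"
HIT_LINE = re.compile(
    r'^(?P<id>\S+) - (?P<desc>.*?)\s+'
    r'(?P<score>\d+)\s+(?P<p>[\d.]+(?:e[+-]?\d+)?)\s+(?P<n>\d+)\s*$'
)


def parse_hit_line(line):
    """Parse one hit-table line; return a dict, or None if it does not match."""
    m = HIT_LINE.match(line)
    if m is None:
        return None
    return {
        "id": m.group("id"),
        "description": m.group("desc"),
        "score": int(m.group("score")),
        "p_value": float(m.group("p")),
        "n": int(m.group("n")),
    }


if __name__ == "__main__":
    line = ('TAIR|locus:2152445 - symbol:SAG12 "senescence-associated ...'
            '   959  1.8e-96   1')
    print(parse_hit_line(line))
```

Lines that are not hit rows (headers, the `WARNING` text, blank lines) simply return `None`, so the function can be mapped over the whole report and the results filtered.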


>TAIR|locus:2152445 [details] [associations]
            symbol:SAG12 "senescence-associated gene 12" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
            [GO:0007568 "aging" evidence=IEP;TAS] [GO:0010150 "leaf senescence"
            evidence=IEP;TAS] [GO:0010282 "senescence-associated vacuole"
            evidence=IDA] [GO:0009817 "defense response to fungus, incompatible
            interaction" evidence=IEP] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:CP002688 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0010150 GO:GO:0009817 EMBL:AB016870
            HSSP:O65039 OMA:NDEQALM EMBL:AF370131 EMBL:AY040073 IPI:IPI00544181
            RefSeq:NP_568651.1 UniGene:At.75256 UniGene:At.7710
            ProteinModelPortal:Q9FJ47 SMR:Q9FJ47 IntAct:Q9FJ47 STRING:Q9FJ47
            MEROPS:C01.117 PRIDE:Q9FJ47 ProMEX:Q9FJ47 EnsemblPlants:AT5G45890.1
            GeneID:834629 KEGG:ath:AT5G45890 TAIR:At5g45890 InParanoid:Q9FJ47
            PhylomeDB:Q9FJ47 ProtClustDB:CLSN2917735 ArrayExpress:Q9FJ47
            Genevestigator:Q9FJ47 GO:GO:0010282 Uniprot:Q9FJ47
        Length = 346

 Score = 959 (342.6 bits), Expect = 1.8e-96, P = 1.8e-96
 Identities = 177/323 (54%), Positives = 234/323 (72%)

Query:     8 SRKLQ-EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAA-GNKPYKLS 65
             SR L  E  + ++H +WM+K+G+VY + +E+  R+ +FK+NVE IE LN+    + +KL+
Sbjct:    25 SRPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLA 84

Query:    66 INEFADQTNQEFKAFRNGYRRPDGLTSRKGTS---FKYENVID--VPATMDWRKNGAVTP 120
             +N+FAD TN EF++   G++    L+S+  T    F+Y+NV    +P ++DWRK GAVTP
Sbjct:    85 VNQFADLTNDEFRSMYTGFKGVSALSSQSQTKMSPFRYQNVSSGALPVSVDWRKKGAVTP 144

Query:   121 IKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFK 180
             IKNQG CG CWAFSAVAA EG TQ+  GKLISLSEQ+LV CDT+  D GCEGG M+ AF+
Sbjct:   145 IKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN--DFGCEGGLMDTAFE 202

Query:   181 FIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAV 240
              I    G+TTE+NYPY+  D TCN          I GYE VP N E+AL+KAVA+QPV+V
Sbjct:   203 HIKATGGLTTESNYPYKGEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAHQPVSV 262

Query:   241 SIDASGSAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYI 300
              I+  G  FQFYSSGVFTG+C T LDH VTA+GYG + NG+KYW++KNSWGT WGE GY+
Sbjct:   263 GIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGYM 322

Query:   301 RMKRDIDAKEGLCGIAMDSSYPT 323
             R+++D+  K+GLCG+AM +SYPT
Sbjct:   323 RIQKDVKDKQGLCGLAMKASYPT 345


>TAIR|locus:2038588 [details] [associations]
            symbol:AT2G27420 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002685
            GenomeReviews:CT485783_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC006232
            MEROPS:I29.003 OMA:EEFRATH HOGENOM:HOG000230773 HSSP:P53634
            ProtClustDB:CLSN2688476 EMBL:AY064033 EMBL:AY096388 IPI:IPI00539752
            PIR:F84672 RefSeq:NP_565649.1 UniGene:At.27094
            ProteinModelPortal:Q9ZQH7 SMR:Q9ZQH7 PRIDE:Q9ZQH7
            EnsemblPlants:AT2G27420.1 GeneID:817287 KEGG:ath:AT2G27420
            TAIR:At2g27420 InParanoid:Q9ZQH7 PhylomeDB:Q9ZQH7
            ArrayExpress:Q9ZQH7 Genevestigator:Q9ZQH7 Uniprot:Q9ZQH7
        Length = 348

 Score = 946 (338.1 bits), Expect = 4.2e-95, P = 4.2e-95
 Identities = 181/332 (54%), Positives = 233/332 (70%)

Query:     4 SQVTSR-KLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPY 62
             S  TSR  L EAS  EKHEQWM+++ +VY +  EK  RF IFK N+EF+++ N      Y
Sbjct:    18 SLATSRGSLFEASAIEKHEQWMARFNRVYSDETEKRNRFNIFKKNLEFVQNFNMNNKITY 77

Query:    63 KLSINEFADQTNQEFKAFRNGYRRPDGLT------SRKGT-SFKYENVIDVPATMDWRKN 115
             K+ INEF+D T++EF+A   G   P+ +T      S K T  F+Y NV D   +MDWR+ 
Sbjct:    78 KVDINEFSDLTDEEFRATHTGLVVPEAITRISTLSSGKNTVPFRYGNVSDNGESMDWRQE 137

Query:   116 GAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEM 175
             GAVTP+K QG CG CWAFSAVAA EGIT++T G+L+SLSEQ+L+ CD    + GC GG M
Sbjct:   138 GAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQLLDCDRD-YNQGCRGGIM 196

Query:   176 EDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEAS---HVAKIKGYETVPANSEEALLKA 232
               AF++II N GITTE NYPYQ    TC+ +   S     A I GYETVP N+EEALL+A
Sbjct:   197 SKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATISGYETVPMNNEEALLQA 256

Query:   233 VANQPVAVSIDASGSAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGT 292
             V+ QPV+V I+ +G+AF+ YS GVF G+CGT+L H VT VGYG +  GTKYW+VKNSWG 
Sbjct:   257 VSQQPVSVGIEGTGAAFRHYSGGVFNGECGTDLHHAVTIVGYGMSEEGTKYWVVKNSWGE 316

Query:   293 SWGEEGYIRMKRDIDAKEGLCGIAMDSSYPTA 324
             +WGE GY+R+KRD+DA +G+CG+A+ + YP A
Sbjct:   317 TWGENGYMRIKRDVDAPQGMCGLAILAFYPLA 348


>TAIR|locus:2157712 [details] [associations]
            symbol:CEP1 "cysteine endopeptidase 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005783 EMBL:CP002688
            GenomeReviews:BA000015_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 PROSITE:PS00014
            EMBL:AB024031 MEROPS:I29.003 EMBL:HM367092 EMBL:AY091087
            IPI:IPI00516991 RefSeq:NP_568722.1 UniGene:At.7918 HSSP:O65039
            ProteinModelPortal:Q9FGR9 SMR:Q9FGR9 PaxDb:Q9FGR9 PRIDE:Q9FGR9
            EnsemblPlants:AT5G50260.1 GeneID:835091 KEGG:ath:AT5G50260
            TAIR:At5g50260 HOGENOM:HOG000230773 InParanoid:Q9FGR9 KO:K16292
            OMA:WHSKKYH PhylomeDB:Q9FGR9 ProtClustDB:CLSN2689970
            Genevestigator:Q9FGR9 Uniprot:Q9FGR9
        Length = 361

 Score = 919 (328.6 bits), Expect = 3.0e-92, P = 3.0e-92
 Identities = 183/316 (57%), Positives = 224/316 (70%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQ 72
             E SL E +E+W S +  V ++ EEK KRF +FK NV+ I   N   +K YKL +N+F D 
Sbjct:    31 ENSLWELYERWRSHH-TVARSLEEKAKRFNVFKHNVKHIHETNKK-DKSYKLKLNKFGDM 88

Query:    73 TNQEFKAFRNG-----YRRPDGLTSRKGT-SFKYENVIDVPATMDWRKNGAVTPIKNQGP 126
             T++EF+    G     +R   G   +K T SF Y NV  +P ++DWRKNGAVTP+KNQG 
Sbjct:    89 TSEEFRRTYAGSNIKHHRMFQG--EKKATKSFMYANVNTLPTSVDWRKNGAVTPVKNQGQ 146

Query:   127 CGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHND 186
             CGSCWAFS V A EGI Q+ T KL SLSEQELV CDT+  + GC GG M+ AF+FI    
Sbjct:   147 CGSCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCDTNQ-NQGCNGGLMDLAFEFIKEKG 205

Query:   187 GITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASG 246
             G+T+E  YPY+A D TC+   E + V  I G+E VP NSE+ L+KAVANQPV+V+IDA G
Sbjct:   206 GLTSELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGG 265

Query:   247 SAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDI 306
             S FQFYS GVFTG CGTEL+HGV  VGYG T +GTKYW+VKNSWG  WGE+GYIRM+R I
Sbjct:   266 SDFQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGI 325

Query:   307 DAKEGLCGIAMDSSYP 322
               KEGLCGIAM++SYP
Sbjct:   326 RHKEGLCGIAMEASYP 341


>TAIR|locus:2082881 [details] [associations]
            symbol:AT3G49340 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002686
            GenomeReviews:BA000014_GR EMBL:AC012329 EMBL:AL132956
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 MEROPS:I29.003 HOGENOM:HOG000230773 HSSP:P07711
            KO:K01376 IPI:IPI00520642 PIR:T45839 RefSeq:NP_566920.1
            UniGene:At.53854 ProteinModelPortal:Q9SG15 SMR:Q9SG15
            EnsemblPlants:AT3G49340.1 GeneID:824096 KEGG:ath:AT3G49340
            TAIR:At3g49340 InParanoid:Q9SG15 OMA:PQNDEEA PhylomeDB:Q9SG15
            ProtClustDB:CLSN2688476 Genevestigator:Q9SG15 Uniprot:Q9SG15
        Length = 341

 Score = 919 (328.6 bits), Expect = 3.0e-92, P = 3.0e-92
 Identities = 177/328 (53%), Positives = 228/328 (69%)

Query:     4 SQVTSRK-LQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPY 62
             S VTSR  L EAS  EKHEQWMS++ +VY +  EK  RF IF +N++F+ES+N   NK Y
Sbjct:    18 SGVTSRGGLFEASAVEKHEQWMSRFNRVYSDDSEKTSRFEIFTNNLKFVESINMNTNKTY 77

Query:    63 KLSINEFADQTNQEFKAFRNGYRRPDGLT------SRKGTSFKYENVIDVPATMDWRKNG 116
              L +NEF+D T++EFKA   G   P+G+T      S +  SF+YENV +   +MDW + G
Sbjct:    78 TLDVNEFSDLTDEEFKARYTGLVVPEGMTRISTTDSHETVSFRYENVGETGESMDWIQEG 137

Query:   117 AVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEME 176
             AVT +K+Q  CG CWAFSAVAA EG+T++  G+L+SLSEQ+L+ C T   ++GC GG M 
Sbjct:   138 AVTSVKHQQQCGCCWAFSAVAAVEGMTKIANGELVSLSEQQLLDCSTE--NNGCGGGIMW 195

Query:   177 DAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ 236
              AF +I  N GITTE NYPYQ    TC   + A+  A I GYETVP N EEALLKAV+ Q
Sbjct:   196 KAFDYIKENQGITTEDNYPYQGAQQTCESNHLAA--ATISGYETVPQNDEEALLKAVSQQ 253

Query:   237 PVAVSIDASGSAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGE 296
             PV+V+I+ SG  F  YS G+F G+CGT+L H VT VGYG +  G KYWL+KNSWG SWGE
Sbjct:   254 PVSVAIEGSGYEFIHYSGGIFNGECGTQLTHAVTIVGYGVSEEGIKYWLLKNSWGESWGE 313

Query:   297 EGYIRMKRDIDAKEGLCGIAMDSSYPTA 324
              GY+R+ RD+D+ +G+CG+A  + YP A
Sbjct:   314 NGYMRIMRDVDSPQGMCGLASLAYYPVA 341


>TAIR|locus:2055440 [details] [associations]
            symbol:AT2G34080 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002685 GenomeReviews:CT485783_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 MEROPS:I29.003 EMBL:AC002341
            HOGENOM:HOG000230773 HSSP:P53634 IPI:IPI00530325 PIR:B84752
            RefSeq:NP_565780.1 UniGene:At.28613 UniGene:At.37859
            ProteinModelPortal:O22961 SMR:O22961 EnsemblPlants:AT2G34080.1
            GeneID:817969 KEGG:ath:AT2G34080 TAIR:At2g34080 InParanoid:O22961
            OMA:SENDYSY PhylomeDB:O22961 ProtClustDB:CLSN2688064
            ArrayExpress:O22961 Genevestigator:O22961 Uniprot:O22961
        Length = 345

 Score = 877 (313.8 bits), Expect = 8.6e-88, P = 8.6e-88
 Identities = 169/331 (51%), Positives = 226/331 (68%)

Query:     4 SQVTSRKL--QEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKP 61
             SQ TSR +  +E S+ +KHEQWM+++ + Y++  EK  R  +FK N++FIE+ N  GNK 
Sbjct:    21 SQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDVFKKNLKFIENFNKKGNKS 80

Query:    62 YKLSINEFADQTNQEFKAFRNGYRRPDGLTS-------RKGTSFKYENVID-VPATMDWR 113
             YKL +NEFAD TN+EF A   G +   GLT         K  S +  NV D V  + DWR
Sbjct:    81 YKLGVNEFADWTNEEFLAIHTGLK---GLTEVSPSKVVAKTISSQTWNVSDMVVESKDWR 137

Query:   114 KNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGG 173
               GAVTP+K QG CG CWAFSAVAA EG+ ++  G L+SLSEQ+L+ CD    D GC+GG
Sbjct:   138 AEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCDRE-YDRGCDGG 196

Query:   174 EMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAV 233
              M DAF +++ N GI +E +Y YQ  DG C ++N A   A+I G++TVP+N+E ALL+AV
Sbjct:   197 IMSDAFNYVVQNRGIASENDYSYQGSDGGC-RSN-ARPAARISGFQTVPSNNERALLEAV 254

Query:   234 ANQPVAVSIDASGSAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTS 293
             + QPV+VS+DA+G  F  YS GV+ G CGT  +H VT VGYG + +GTKYWL KNSWG +
Sbjct:   255 SRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGTKYWLAKNSWGET 314

Query:   294 WGEEGYIRMKRDIDAKEGLCGIAMDSSYPTA 324
             WGE+GYIR++RD+   +G+CG+A  + YP A
Sbjct:   315 WGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345


>TAIR|locus:2029924 [details] [associations]
            symbol:AT1G29090 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002684 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            HOGENOM:HOG000230773 HSSP:P53634 ProtClustDB:CLSN2688064
            EMBL:BT004146 IPI:IPI00545702 RefSeq:NP_564321.2 UniGene:At.40814
            ProteinModelPortal:Q84W75 SMR:Q84W75 MEROPS:C01.A15
            EnsemblPlants:AT1G29090.1 GeneID:839784 KEGG:ath:AT1G29090
            TAIR:At1g29090 InParanoid:Q84W75 OMA:SIRGHED PhylomeDB:Q84W75
            ArrayExpress:Q84W75 Genevestigator:Q84W75 Uniprot:Q84W75
        Length = 355

 Score = 866 (309.9 bits), Expect = 1.3e-86, P = 1.3e-86
 Identities = 167/333 (50%), Positives = 224/333 (67%)

Query:     1 IAASQVTSR-KLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGN 59
             +  SQ TSR    E  ++E H+QWM+++ +VY +  EK+ RF +FK N++FIE  N  G+
Sbjct:    27 LKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGD 86

Query:    60 KPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGT-----SFKYENVIDVPA--TMDW 112
             + YKL +NEFAD T +EF A   G +  +G+ S +       S+ + NV DV    T DW
Sbjct:    87 RTYKLGVNEFADWTREEFIATHTGLKGVNGIPSSEFVDEMIPSWNW-NVSDVAGRETKDW 145

Query:   113 RKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEG 172
             R  GAVTP+K QG CG CWAFS+VAA EG+T++    L+SLSEQ+L+ CD    D+GC G
Sbjct:   146 RYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRER-DNGCNG 204

Query:   173 GEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKA 232
             G M DAF +II N GI +EA+YPYQA +GTC    + S  A I+G++TVP+N+E ALL+A
Sbjct:   205 GIMSDAFSYIIKNRGIASEASYPYQAAEGTCRYNGKPS--AWIRGFQTVPSNNERALLEA 262

Query:   233 VANQPVAVSIDASGSAFQFYSSGVFTGD-CGTELDHGVTAVGYGATANGTKYWLVKNSWG 291
             V+ QPV+VSIDA G  F  YS GV+    CGT ++H VT VGYG +  G KYWL KNSWG
Sbjct:   263 VSKQPVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWG 322

Query:   292 TSWGEEGYIRMKRDIDAKEGLCGIAMDSSYPTA 324
              +WGE GYIR++RD+   +G+CG+A  + YP A
Sbjct:   323 ETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 355


>TAIR|locus:2090614 [details] [associations]
            symbol:AT3G19390 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA;ISS] [GO:0000041 "transition metal ion
            transport" evidence=RCA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:CP002686 GenomeReviews:BA000014_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AB025624
            MEROPS:I29.003 OMA:KAMDQKC HSSP:O65039 HOGENOM:HOG000230773
            InterPro:IPR000118 Pfam:PF00396 SMART:SM00277 EMBL:AY062725
            EMBL:AY093350 IPI:IPI00520189 RefSeq:NP_566633.1 UniGene:At.27473
            ProteinModelPortal:Q9LT78 SMR:Q9LT78 IntAct:Q9LT78 STRING:Q9LT78
            PaxDb:Q9LT78 PRIDE:Q9LT78 EnsemblPlants:AT3G19390.1 GeneID:821473
            KEGG:ath:AT3G19390 TAIR:At3g19390 InParanoid:Q9LT78
            PhylomeDB:Q9LT78 ProtClustDB:CLSN2917188 Genevestigator:Q9LT78
            Uniprot:Q9LT78
        Length = 452

 Score = 862 (308.5 bits), Expect = 3.3e-86, P = 3.3e-86
 Identities = 164/318 (51%), Positives = 212/318 (66%)

Query:     7 TSRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSI 66
             T     EA     +E+W+ +  K Y    EKE+RF IFKDN++F+E  ++  N+ Y++ +
Sbjct:    30 TETTRNEAEARRMYERWLVENRKNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGL 89

Query:    67 NEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGP 126
               FAD TN EF+A     +        KG  + Y+    +P  +DWR  GAV P+K+QG 
Sbjct:    90 TRFADLTNDEFRAIYLRSKMERTRVPVKGEKYLYKVGDSLPDAIDWRAKGAVNPVKDQGS 149

Query:   127 CGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHND 186
             CGSCWAFSA+ A EGI Q+ TG+LISLSEQELV CDTS  D GC GG M+ AFKFII N 
Sbjct:   150 CGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCDTSYND-GCGGGLMDYAFKFIIENG 208

Query:   187 GITTEANYPYQAVD-GTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDAS 245
             GI TE +YPY A D   CN   + + V  I GYE VP N E++L KA+ANQP++V+I+A 
Sbjct:   209 GIDTEEDYPYIATDVNVCNSDKKNTRVVTIDGYEDVPQNDEKSLKKALANQPISVAIEAG 268

Query:   246 GSAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRD 305
             G AFQ Y+SGVFTG CGT LDHGV AVGYG+   G  YW+V+NSWG++WGE GY +++R+
Sbjct:   269 GRAFQLYTSGVFTGTCGTSLDHGVVAVGYGSEG-GQDYWIVRNSWGSNWGESGYFKLERN 327

Query:   306 IDAKEGLCGIAMDSSYPT 323
             I    G CG+AM +SYPT
Sbjct:   328 IKESSGKCGVAMMASYPT 345


>TAIR|locus:2167821
            symbol:RD21B "responsive to dehydration 21B" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS] [GO:0005773
            "vacuole" evidence=IDA] [GO:0009651 "response to salt stress"
            evidence=IEP] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005829 "cytosol" evidence=IDA] [GO:0052541 "plant-type cell
            wall cellulose metabolic process" evidence=RCA] [GO:0052546 "cell
            wall pectin metabolic process" evidence=RCA] [GO:0005783
            "endoplasmic reticulum" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005783 GO:GO:0005829 EMBL:CP002688
            GO:GO:0005773 GO:GO:0009651 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AB008267 HSSP:O65039
            HOGENOM:HOG000230773 KO:K01376 InterPro:IPR000118 Pfam:PF00396
            SMART:SM00277 ProtClustDB:CLSN2688498 EMBL:AY062608 EMBL:AY114661
            IPI:IPI00520971 RefSeq:NP_568620.1 UniGene:At.24130 SMR:Q9FMH8
            IntAct:Q9FMH8 STRING:Q9FMH8 MEROPS:C01.A12
            EnsemblPlants:AT5G43060.1 GeneID:834321 KEGG:ath:AT5G43060
            TAIR:At5g43060 InParanoid:Q9FMH8 OMA:ENSEASL Genevestigator:Q9FMH8
            Uniprot:Q9FMH8
        Length = 463

 Score = 860 (307.8 bits), Expect = 5.4e-86, P = 5.4e-86
 Identities = 172/322 (53%), Positives = 222/322 (68%)

Query:     7 TSRKLQEASLSEKHEQWMSKYGKVYKNPE----EKEKRFRIFKDNVEFIESLNAAGNKPY 62
             TSR   ++ +   +E WM ++GK   N      EK++RF IFKDN+ FI+  N   N  Y
Sbjct:    39 TSRS--DSEVERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTK-NLSY 95

Query:    63 KLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYE-NVID-VPATMDWRKNGAVTP 120
             KL +  FAD TN+E+++   G  +P     +  TS +Y+  V D +P ++DWRK GAV  
Sbjct:    96 KLGLTRFADLTNEEYRSMYLG-AKPTKRVLK--TSDRYQARVGDALPDSVDWRKEGAVAD 152

Query:   121 IKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFK 180
             +K+QG CGSCWAFS + A EGI ++ TG LISLSEQELV CDTS  + GC GG M+ AF+
Sbjct:   153 VKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFE 211

Query:   181 FIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAV 240
             FII N GI TEA+YPY+A DG C++  + + V  I  YE VP NSE +L KA+A+QP++V
Sbjct:   212 FIIKNGGIDTEADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISV 271

Query:   241 SIDASGSAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYI 300
             +I+A G AFQ YSSGVF G CGTELDHGV AVGYG T NG  YW+V+NSWG  WGE GYI
Sbjct:   272 AIEAGGRAFQLYSSGVFDGLCGTELDHGVVAVGYG-TENGKDYWIVRNSWGNRWGESGYI 330

Query:   301 RMKRDIDAKEGLCGIAMDSSYP 322
             +M R+I+A  G CGIAM++SYP
Sbjct:   331 KMARNIEAPTGKCGIAMEASYP 352


>TAIR|locus:2825832
            symbol:RD21A "responsive to dehydration 21A" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;IMP]
            [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISS;IDA;IMP] [GO:0009507 "chloroplast" evidence=IDA]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0008233 "peptidase
            activity" evidence=IDA] [GO:0048046 "apoplast" evidence=IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005829 "cytosol"
            evidence=RCA] [GO:0009506 "plasmodesma" evidence=IDA] [GO:0050832
            "defense response to fungus" evidence=IMP] [GO:0006096 "glycolysis"
            evidence=RCA] [GO:0006833 "water transport" evidence=RCA]
            [GO:0006972 "hyperosmotic response" evidence=RCA] [GO:0007030
            "Golgi organization" evidence=RCA] [GO:0009266 "response to
            temperature stimulus" evidence=RCA] [GO:0009651 "response to salt
            stress" evidence=RCA] [GO:0015996 "chlorophyll catabolic process"
            evidence=RCA] [GO:0042744 "hydrogen peroxide catabolic process"
            evidence=RCA] [GO:0046686 "response to cadmium ion" evidence=RCA]
            [GO:0009414 "response to water deprivation" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0009506 GO:GO:0009507 GO:GO:0005773
            GO:GO:0050832 GO:GO:0048046 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC083835
            HOGENOM:HOG000230773 KO:K01376 InterPro:IPR000118 Pfam:PF00396
            SMART:SM00277 UniGene:At.43549 EMBL:D13043 EMBL:AY072130
            EMBL:AY133781 IPI:IPI00530094 PIR:JN0719 RefSeq:NP_564497.1
            UniGene:At.47599 UniGene:At.71705 ProteinModelPortal:P43297
            SMR:P43297 IntAct:P43297 STRING:P43297 MEROPS:C01.064 PaxDb:P43297
            PRIDE:P43297 ProMEX:P43297 EnsemblPlants:AT1G47128.1 GeneID:841122
            KEGG:ath:AT1G47128 TAIR:At1g47128 InParanoid:P43297 OMA:EAWLVKH
            PhylomeDB:P43297 ProtClustDB:CLSN2688498 Genevestigator:P43297
            GermOnline:AT1G47128 Uniprot:P43297
        Length = 462

 Score = 853 (305.3 bits), Expect = 3.0e-85, P = 3.0e-85
 Identities = 161/314 (51%), Positives = 214/314 (68%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPE--EKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFA 70
             EA +   +E W+ K+GK        EK++RF IFKDN+ F++  N   N  Y+L +  FA
Sbjct:    43 EAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEK-NLSYRLGLTRFA 101

Query:    71 DQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVI--DVPATMDWRKNGAVTPIKNQGPCG 128
             D TN E+++   G +        + TS +YE  +  ++P ++DWRK GAV  +K+QG CG
Sbjct:   102 DLTNDEYRSKYLGAKMEK--KGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCG 159

Query:   129 SCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGI 188
             SCWAFS + A EGI Q+ TG LI+LSEQELV CDTS  + GC GG M+ AF+FII N GI
Sbjct:   160 SCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGGI 218

Query:   189 TTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSA 248
              T+ +YPY+ VDGTC++  + + V  I  YE VP  SEE+L KAVA+QP++++I+A G A
Sbjct:   219 DTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRA 278

Query:   249 FQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDA 308
             FQ Y SG+F G CGT+LDHGV AVGYG T NG  YW+V+NSWG SWGE GY+RM R+I +
Sbjct:   279 FQLYDSGIFDGSCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGKSWGESGYLRMARNIAS 337

Query:   309 KEGLCGIAMDSSYP 322
               G CGIA++ SYP
Sbjct:   338 SSGKCGIAIEPSYP 351


>TAIR|locus:2122113
            symbol:XCP1 "xylem cysteine peptidase 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0000325 "plant-type vacuole" evidence=IDA] [GO:0005634
            "nucleus" evidence=IDA] [GO:0010623 "developmental programmed cell
            death" evidence=IMP] [GO:0010413 "glucuronoxylan metabolic process"
            evidence=RCA] [GO:0045492 "xylan biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005886
            GO:GO:0005634 EMBL:CP002687 GenomeReviews:CT486007_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0000325
            EMBL:AL022604 EMBL:AL161587 GO:GO:0010623 MEROPS:I29.003
            HOGENOM:HOG000230773 EMBL:AF191027 EMBL:AK117394 EMBL:BT005179
            IPI:IPI00532220 PIR:T06122 RefSeq:NP_567983.1 UniGene:At.2280
            UniGene:At.67622 ProteinModelPortal:O65493 SMR:O65493 STRING:O65493
            PaxDb:O65493 PRIDE:O65493 EnsemblPlants:AT4G35350.1 GeneID:829688
            KEGG:ath:AT4G35350 GeneFarm:5033 TAIR:At4g35350 InParanoid:O65493
            KO:K16290 OMA:FEVFREN PhylomeDB:O65493 ProtClustDB:CLSN2689772
            Genevestigator:O65493 Uniprot:O65493
        Length = 355

 Score = 850 (304.3 bits), Expect = 6.2e-85, P = 6.2e-85
 Identities = 162/309 (52%), Positives = 212/309 (68%)

Query:    16 LSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQ 75
             L E  E WMS++ K YK+ EEK  RF +F++N+  I+  N   N  Y L +NEFAD T++
Sbjct:    47 LLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINS-YWLGLNEFADLTHE 105

Query:    76 EFKAFRNGYRRPDGLTSRK-GTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAFS 134
             EFK    G  +P     R+   +F+Y ++ D+P ++DWRK GAV P+K+QG CGSCWAFS
Sbjct:   106 EFKGRYLGLAKPQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFS 165

Query:   135 AVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANY 194
              VAA EGI Q+TTG L SLSEQEL+ CDT+  + GC GG M+ AF++II   G+  E +Y
Sbjct:   166 TVAAVEGINQITTGNLSSLSEQELIDCDTT-FNSGCNGGLMDYAFQYIISTGGLHKEDDY 224

Query:   195 PYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSS 254
             PY   +G C +  E      I GYE VP N +E+L+KA+A+QPV+V+I+ASG  FQFY  
Sbjct:   225 PYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYKG 284

Query:   255 GVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCG 314
             GVF G CGT+LDHGV AVGYG++  G+ Y +VKNSWG  WGE+G+IRMKR+    EGLCG
Sbjct:   285 GVFNGKCGTDLDHGVAAVGYGSS-KGSDYVIVKNSWGPRWGEKGFIRMKRNTGKPEGLCG 343

Query:   315 IAMDSSYPT 323
             I   +SYPT
Sbjct:   344 INKMASYPT 352


>TAIR|locus:505006391
            symbol:CEP3 "cysteine endopeptidase 3" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005783 EMBL:CP002686
            GenomeReviews:BA000014_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 PROSITE:PS00014
            EMBL:AL049659 HSSP:O65039 HOGENOM:HOG000230773 KO:K16292
            EMBL:AK119026 IPI:IPI00525150 PIR:T06707 RefSeq:NP_566901.1
            UniGene:At.3162 ProteinModelPortal:Q9STL5 SMR:Q9STL5 MEROPS:C01.A02
            PRIDE:Q9STL5 EnsemblPlants:AT3G48350.1 GeneID:823993
            KEGG:ath:AT3G48350 TAIR:At3g48350 InParanoid:Q9STL5 OMA:DITHHEF
            PhylomeDB:Q9STL5 ProtClustDB:CLSN2917387 Genevestigator:Q9STL5
            Uniprot:Q9STL5
        Length = 364

 Score = 836 (299.3 bits), Expect = 1.9e-83, P = 1.9e-83
 Identities = 167/317 (52%), Positives = 208/317 (65%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQ 72
             E ++ + +E+W   +  V +   E  KRF +F+ NV  +   N   NKPYKL IN FAD 
Sbjct:    31 EENVWKLYERWRGHHS-VSRASHEAIKRFNVFRHNVLHVHRTNKK-NKPYKLKINRFADI 88

Query:    73 TNQEFKAFRNG-----YRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPC 127
             T+ EF++   G     +R   G   R    F YENV  VP+++DWR+ GAVT +KNQ  C
Sbjct:    89 THHEFRSSYAGSNVKHHRMLRG-PKRGSGGFMYENVTRVPSSVDWREKGAVTEVKNQQDC 147

Query:   128 GSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDG 187
             GSCWAFS VAA EGI ++ T KL+SLSEQELV CDT   + GC GG ME AF+FI +N G
Sbjct:   148 GSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEE-NQGCAGGLMEPAFEFIKNNGG 206

Query:   188 ITTEANYPYQAVDGT-CNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASG 246
             I TE  YPY + D   C   +       I G+E VP N EE LLKAVA+QPV+V+IDA  
Sbjct:   207 IKTEETYPYDSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAVAHQPVSVAIDAGS 266

Query:   247 SAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDI 306
             S FQ YS GVF G+CGT+L+HGV  VGYG T NGTKYW+V+NSWG  WGE GY+R++R I
Sbjct:   267 SDFQLYSEGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIERGI 326

Query:   307 DAKEGLCGIAMDSSYPT 323
                EG CGIAM++SYPT
Sbjct:   327 SENEGRCGIAMEASYPT 343


>TAIR|locus:2030427
            symbol:XCP2 "xylem cysteine peptidase 2" species:3702
            "Arabidopsis thaliana" [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009507 "chloroplast" evidence=ISM] [GO:0008233 "peptidase
            activity" evidence=ISS] [GO:0005618 "cell wall" evidence=IDA]
            [GO:0010623 "developmental programmed cell death" evidence=IMP]
            [GO:0010075 "regulation of meristem growth" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0005886 GO:GO:0005618 GO:GO:0005773
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AC069251 EMBL:AC007369 GO:GO:0010623
            OMA:YKEIPEG HOGENOM:HOG000230773 KO:K16290 EMBL:AF191028
            EMBL:BT004822 IPI:IPI00526722 PIR:A86341 RefSeq:NP_564126.1
            UniGene:At.21316 ProteinModelPortal:Q9LM66 SMR:Q9LM66 IntAct:Q9LM66
            STRING:Q9LM66 MEROPS:C01.120 PaxDb:Q9LM66 PRIDE:Q9LM66
            ProMEX:Q9LM66 EnsemblPlants:AT1G20850.1 GeneID:838677
            KEGG:ath:AT1G20850 GeneFarm:5034 TAIR:At1g20850 InParanoid:Q9LM66
            PhylomeDB:Q9LM66 ProtClustDB:CLSN2917031 Genevestigator:Q9LM66
            GermOnline:AT1G20850 Uniprot:Q9LM66
        Length = 356

 Score = 832 (297.9 bits), Expect = 5.0e-83, P = 5.0e-83
 Identities = 160/312 (51%), Positives = 210/312 (67%)

Query:    16 LSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQ 75
             L E  E W+S + K Y+  EEK  RF +FKDN++ I+  N  G K Y L +NEFAD +++
Sbjct:    47 LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFADLSHE 105

Query:    76 EFKAFRNGYR----RPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCW 131
             EFK    G +    R D    R    F Y +V  VP ++DWRK GAV  +KNQG CGSCW
Sbjct:   106 EFKKMYLGLKTDIVRRD--EERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCW 163

Query:   132 AFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTE 191
             AFS VAA EGI ++ TG L +LSEQEL+ CDT+  ++GC GG M+ AF++I+ N G+  E
Sbjct:   164 AFSTVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGGLRKE 222

Query:   192 ANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQF 251
              +YPY   +GTC    + S    I G++ VP N E++LLKA+A+QP++V+IDASG  FQF
Sbjct:   223 EDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQF 282

Query:   252 YSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEG 311
             YS GVF G CG +LDHGV AVGYG++  G+ Y +VKNSWG  WGE+GYIR+KR+    EG
Sbjct:   283 YSGGVFDGRCGVDLDHGVAAVGYGSS-KGSDYIIVKNSWGPKWGEKGYIRLKRNTGKPEG 341

Query:   312 LCGIAMDSSYPT 323
             LCGI   +S+PT
Sbjct:   342 LCGINKMASFPT 353


>TAIR|locus:2090629
            symbol:AT3G19400 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0019344 "cysteine biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            EMBL:CP002686 GenomeReviews:BA000014_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AB025624
            MEROPS:I29.003 HOGENOM:HOG000230773 EMBL:AK118509 IPI:IPI00543468
            RefSeq:NP_566634.2 UniGene:At.38409 ProteinModelPortal:Q9LT77
            SMR:Q9LT77 PaxDb:Q9LT77 PRIDE:Q9LT77 EnsemblPlants:AT3G19400.1
            GeneID:821474 KEGG:ath:AT3G19400 TAIR:At3g19400 InParanoid:Q9LT77
            OMA:IGEHERR ProtClustDB:CLSN2679975 Genevestigator:Q9LT77
            Uniprot:Q9LT77
        Length = 362

 Score = 828 (296.5 bits), Expect = 1.3e-82, P = 1.3e-82
 Identities = 159/319 (49%), Positives = 214/319 (67%)

Query:     7 TSRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSI 66
             T  +  E  +   +EQW+ +  K Y    EKE+RF+IFKDN++F++  N+  ++ +++ +
Sbjct:    31 TEIERNETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGL 90

Query:    67 NEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGP 126
               FAD TN+EF+A     +      S K   + Y+    +P  +DWR NGAV  +K+QG 
Sbjct:    91 TRFADLTNEEFRAIYLRKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGN 150

Query:   127 CGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHND 186
             CGSCWAFSAV A EGI Q+TTG+LISLSEQELV CD   V+ GC+GG M  AF+FI+ N 
Sbjct:   151 CGSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNG 210

Query:   187 GITTEANYPYQAVD-GTCNKT-NEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDA 244
             GI T+ +YPY A D G CN   N  + V  I GYE VP + E++L KAVA+QPV+V+I+A
Sbjct:   211 GIETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEA 270

Query:   245 SGSAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKR 304
             S  AFQ Y SGV TG CG  LDHGV  VGYG+T+ G  YW+++NSWG +WG+ GY++++R
Sbjct:   271 SSQAFQLYKSGVMTGTCGISLDHGVVVVGYGSTS-GEDYWIIRNSWGLNWGDSGYVKLQR 329

Query:   305 DIDAKEGLCGIAMDSSYPT 323
             +ID   G CGIAM  SYPT
Sbjct:   330 NIDDPFGKCGIAMMPSYPT 348


>TAIR|locus:2024362
            symbol:XBCP3 "xylem bark cysteine peptidase 3"
            species:3702 "Arabidopsis thaliana" [GO:0005576 "extracellular
            region" evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0005783 "endoplasmic
            reticulum" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005783 EMBL:CP002684 GO:GO:0005773 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 MEROPS:I29.003
            HOGENOM:HOG000230773 InterPro:IPR000118 Pfam:PF00396 SMART:SM00277
            UniGene:At.10233 OMA:CEIESAV EMBL:BT026490 EMBL:AK226753
            IPI:IPI00536687 RefSeq:NP_563855.1 ProteinModelPortal:Q0WVJ5
            SMR:Q0WVJ5 PRIDE:Q0WVJ5 EnsemblPlants:AT1G09850.1 GeneID:837517
            KEGG:ath:AT1G09850 TAIR:At1g09850 InParanoid:Q0WVJ5
            PhylomeDB:Q0WVJ5 ProtClustDB:CLSN2687747 Genevestigator:Q0WVJ5
            Uniprot:Q0WVJ5
        Length = 437

 Score = 815 (292.0 bits), Expect = 3.2e-81, P = 3.2e-81
 Identities = 158/309 (51%), Positives = 204/309 (66%)

Query:    16 LSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQ 75
             +SE  + W  K+GK Y + EE+++R +IFKDN +F+   N   N  Y LS+N FAD T+ 
Sbjct:    28 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 87

Query:    76 EFKAFRNGYR--RPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAF 133
             EFKA R G     P  + + KG S      + VP ++DWRK GAVT +K+QG CG+CW+F
Sbjct:    88 EFKASRLGLSVSAPSVIMASKGQSLG--GSVKVPDSVDWRKKGAVTNVKDQGSCGACWSF 145

Query:   134 SAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEAN 193
             SA  A EGI Q+ TG LISLSEQEL+ CD S  + GC GG M+ AF+F+I N GI TE +
Sbjct:   146 SATGAMEGINQIVTGDLISLSEQELIDCDKS-YNAGCNGGLMDYAFEFVIKNHGIDTEKD 204

Query:   194 YPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYS 253
             YPYQ  DGTC K      V  I  Y  V +N E+AL++AVA QPV+V I  S  AFQ YS
Sbjct:   205 YPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYS 264

Query:   254 SGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLC 313
             SG+F+G C T LDH V  VGYG+  NG  YW+VKNSWG SWG +G++ M+R+ +  +G+C
Sbjct:   265 SGIFSGPCSTSLDHAVLIVGYGSQ-NGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVC 323

Query:   314 GIAMDSSYP 322
             GI M +SYP
Sbjct:   324 GINMLASYP 332


>TAIR|locus:2029934
            symbol:AT1G29080 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002684 GenomeReviews:CT485782_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AC021043 MEROPS:I29.003 HOGENOM:HOG000230773
            HSSP:P53634 ProtClustDB:CLSN2688064 EMBL:DQ056468 IPI:IPI00521747
            PIR:C86413 RefSeq:NP_564320.1 UniGene:At.51814
            ProteinModelPortal:Q9LP39 SMR:Q9LP39 EnsemblPlants:AT1G29080.1
            GeneID:839783 KEGG:ath:AT1G29080 TAIR:At1g29080 InParanoid:Q9LP39
            OMA:KTWGENG PhylomeDB:Q9LP39 Genevestigator:Q9LP39 Uniprot:Q9LP39
        Length = 346

 Score = 810 (290.2 bits), Expect = 1.1e-80, P = 1.1e-80
 Identities = 157/330 (47%), Positives = 222/330 (67%)

Query:     4 SQVTSRKL--QEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKP 61
             S+ TSR    + +S+ + H+QWM ++ +VY +  EK+ R ++  +N++FIES N  GN+ 
Sbjct:    21 SEATSRVALYKPSSIVDYHQQWMIQFSRVYDDEFEKQLRLQVLTENLKFIESFNNMGNQS 80

Query:    62 YKLSINEFADQTNQEFKAFRNGYR-----RPDGLTSRKGTSFKYENVIDVPAT-MDWRKN 115
             YKL +NEF D T +EF A   G R      P  + +    ++ +  V DV  T  DWR  
Sbjct:    81 YKLGVNEFTDWTKEEFLATYTGLRGVNVTSPFEVVNETKPAWNW-TVSDVLGTNKDWRNE 139

Query:   116 GAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEM 175
             GAVTP+K+QG CG CWAFSA+AA EG+T++  G LISLSEQ+L+ C T   ++GC+GG  
Sbjct:   140 GAVTPVKSQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDC-TREQNNGCKGGTF 198

Query:   176 EDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN 235
              +AF +II + GI++E  YPYQ  +G C ++N A     I+G+E VP+N+E ALL+AV+ 
Sbjct:   199 VNAFNYIIKHRGISSENEYPYQVKEGPC-RSN-ARPAILIRGFENVPSNNERALLEAVSR 256

Query:   236 QPVAVSIDASGSAFQFYSSGVFTG-DCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSW 294
             QPVAV+IDAS + F  YS GV+   +CGT ++H VT VGYG +  G KYWL KNSWG +W
Sbjct:   257 QPVAVAIDASEAGFVHYSGGVYNARNCGTSVNHAVTLVGYGTSPEGMKYWLAKNSWGKTW 316

Query:   295 GEEGYIRMKRDIDAKEGLCGIAMDSSYPTA 324
             GE GYIR++RD++  +G+CG+A  +SYP A
Sbjct:   317 GENGYIRIRRDVEWPQGMCGVAQYASYPVA 346


>TAIR|locus:2038515
            symbol:AT1G06260 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0048046 "apoplast"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0048046 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC025290
            MEROPS:I29.003 HSSP:O65039 HOGENOM:HOG000230773 OMA:METAFEF
            IPI:IPI00525965 PIR:D86198 RefSeq:NP_563764.1 UniGene:At.24617
            ProteinModelPortal:Q9LNC1 SMR:Q9LNC1 PaxDb:Q9LNC1 PRIDE:Q9LNC1
            EnsemblPlants:AT1G06260.1 GeneID:837137 KEGG:ath:AT1G06260
            TAIR:At1g06260 InParanoid:Q9LNC1 PhylomeDB:Q9LNC1
            ProtClustDB:CLSN2916975 Genevestigator:Q9LNC1 Uniprot:Q9LNC1
        Length = 343

 Score = 806 (288.8 bits), Expect = 2.9e-80, P = 2.9e-80
 Identities = 156/308 (50%), Positives = 202/308 (65%)

Query:    15 SLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTN 74
             +L ++ E+W+  + K+Y   +E   RF I++ NV+ I+ +N+  + P+KL+ N FAD TN
Sbjct:    38 TLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSL-HLPFKLTDNRFADMTN 96

Query:    75 QEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAFS 134
              EFKA   G      L   K      +   +VP  +DWR  GAVTPI+NQG CG CWAFS
Sbjct:    97 SEFKAHFLGLNT-SSLRLHKKQRPVCDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFS 155

Query:   135 AVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANY 194
             AVAA EGI ++ TG L+SLSEQ+L+ CD    + GC GG ME AF+FI  N G+ TE +Y
Sbjct:   156 AVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKTNGGLATETDY 215

Query:   195 PYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSS 254
             PY  ++GTC++    + V  I+GY+ V A +E +L  A A QPV+V IDA G  FQ YSS
Sbjct:   216 PYTGIEGTCDQEKSKNKVVTIQGYQKV-AQNEASLQIAAAQQPVSVGIDAGGFIFQLYSS 274

Query:   255 GVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCG 314
             GVFT  CGT L+HGVT VGYG   +  KYW+VKNSWGT WGEEGYIRM+R +    G CG
Sbjct:   275 GVFTNYCGTNLNHGVTVVGYGVEGD-QKYWIVKNSWGTGWGEEGYIRMERGVSEDTGKCG 333

Query:   315 IAMDSSYP 322
             IAM +SYP
Sbjct:   334 IAMMASYP 341


>TAIR|locus:2128243
            symbol:AT4G11310 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005618 "cell wall"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 GO:GO:0005618 EMBL:CP002687
            GenomeReviews:CT486007_GR EMBL:AL096882 EMBL:AL161531
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            HOGENOM:HOG000230773 KO:K01376 EMBL:AY093066 EMBL:BT000099
            IPI:IPI00520496 PIR:T13022 RefSeq:NP_567376.1 UniGene:At.43189
            ProteinModelPortal:Q9SUT0 SMR:Q9SUT0 IntAct:Q9SUT0 STRING:Q9SUT0
            MEROPS:C01.A20 PaxDb:Q9SUT0 PRIDE:Q9SUT0 EnsemblPlants:AT4G11310.1
            GeneID:826733 KEGG:ath:AT4G11310 TAIR:At4g11310 InParanoid:Q9SUT0
            OMA:EVCHGAD PhylomeDB:Q9SUT0 ProtClustDB:CLSN2689395
            Genevestigator:Q9SUT0 GermOnline:AT4G11310 Uniprot:Q9SUT0
        Length = 364

 Score = 778 (278.9 bits), Expect = 2.7e-77, P = 2.7e-77
 Identities = 156/314 (49%), Positives = 210/314 (66%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQ 72
             EASL    E WM K+GKVY +  EKE+R  IF+DN+ FI + NA  N  Y+L +  FAD 
Sbjct:    44 EASLI--FESWMVKHGKVYGSVAEKERRLTIFEDNLRFINNRNAE-NLSYRLGLTGFADL 100

Query:    73 TNQEFKAFRNGYR-RPDGLTSRKGTSFKYENVID--VPATMDWRKNGAVTPIKNQGPCGS 129
             +  E+K   +G   RP        +S +Y+   D  +P ++DWR  GAVT +K+QG C S
Sbjct:   101 SLHEYKEVCHGADPRPPRNHVFMTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRS 160

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CWAFS V A EG+ ++ TG+L++LSEQ+L++C+    ++GC GG++E A++FI+ N G+ 
Sbjct:   161 CWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKE--NNGCGGGKLETAYEFIMKNGGLG 218

Query:   190 TEANYPYQAVDGTCN-KTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSA 248
             T+ +YPY+AV+G C+ +  E +    I GYE +PAN E AL+KAVA+QPV   ID+S   
Sbjct:   219 TDNDYPYKAVNGVCDGRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSRE 278

Query:   249 FQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDA 308
             FQ Y SGVF G CGT L+HGV  VGYG T NG  YWLVKNS G +WGE GY++M R+I  
Sbjct:   279 FQLYESGVFDGSCGTNLNHGVVVVGYG-TENGRDYWLVKNSRGITWGEAGYMKMARNIAN 337

Query:   309 KEGLCGIAMDSSYP 322
               GLCGIAM +SYP
Sbjct:   338 PRGLCGIAMRASYP 351


>TAIR|locus:2117979
            symbol:AT4G23520 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002687 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            KO:K01376 IPI:IPI00527171 RefSeq:NP_567686.2 UniGene:At.32421
            ProteinModelPortal:F4JNL3 SMR:F4JNL3 MEROPS:C01.A22 PRIDE:F4JNL3
            EnsemblPlants:AT4G23520.1 GeneID:828452 KEGG:ath:AT4G23520
            OMA:PANDEIS ArrayExpress:F4JNL3 Uniprot:F4JNL3
        Length = 356

 Score = 772 (276.8 bits), Expect = 1.1e-76, P = 1.1e-76
 Identities = 151/307 (49%), Positives = 207/307 (67%)

Query:    21 EQWMSKYGKVYKNP-EEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKA 79
             + WMSK+GK Y N   EKE+RF+ FKDN+ FI+  NA  N  Y+L +  FAD T QE++ 
Sbjct:    48 QMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHNAK-NLSYQLGLTRFADLTVQEYRD 106

Query:    80 FRNGYRRPDGLTSRKGTSFKYENVI--DVPATMDWRKNGAVTPIKNQGPCGSCWAFSAVA 137
                G  +P     +  TS +Y  +    +P ++DWR+ GAV+ IK+QG C SCWAFS VA
Sbjct:   107 LFPGSPKPKQRNLK--TSRRYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVA 164

Query:   138 ATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEG-GEMEDAFKFIIHNDGITTEANYPY 196
             A EG+ ++ TG+LISLSEQELV C+   V++GC G G M+ AF+F+I+N+G+ +E +YPY
Sbjct:   165 AVEGLNKIVTGELISLSEQELVDCNL--VNNGCYGSGLMDTAFQFLINNNGLDSEKDYPY 222

Query:   197 QAVDGTCNKTNEASH-VAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSG 255
             Q   G+CN+    S+ V  I  YE VPAN E +L KAVA+QPV+V +D     F  Y S 
Sbjct:   223 QGTQGSCNRKQSTSNKVITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSC 282

Query:   256 VFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGI 315
             ++ G CGT LDH +  VGYG+  NG  YW+V+NSWGT+WG+ GYI++ R+ +  +GLCGI
Sbjct:   283 IYNGPCGTNLDHALVIVGYGSE-NGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGI 341

Query:   316 AMDSSYP 322
             AM +SYP
Sbjct:   342 AMLASYP 348


>TAIR|locus:2128253
            symbol:AT4G11320 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002687 GenomeReviews:CT486007_GR EMBL:AL096882 EMBL:AL161531
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 OMA:ICHGADP
            HOGENOM:HOG000230773 KO:K01376 ProtClustDB:CLSN2689395
            EMBL:AY035055 EMBL:AY051062 IPI:IPI00520480 PIR:T13023
            RefSeq:NP_567377.1 UniGene:At.25206 ProteinModelPortal:Q9SUS9
            SMR:Q9SUS9 STRING:Q9SUS9 MEROPS:C01.A21 PaxDb:Q9SUS9 PRIDE:Q9SUS9
            EnsemblPlants:AT4G11320.1 GeneID:826734 KEGG:ath:AT4G11320
            TAIR:At4g11320 InParanoid:Q9SUS9 PhylomeDB:Q9SUS9
            Genevestigator:Q9SUS9 GermOnline:AT4G11320 Uniprot:Q9SUS9
        Length = 371

 Score = 763 (273.6 bits), Expect = 1.0e-75, P = 1.0e-75
 Identities = 153/314 (48%), Positives = 210/314 (66%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQ 72
             EA+L    E WM K+GKVY +  EKE+R  IF+DN+ FI + NA  N  Y+L +N FAD 
Sbjct:    51 EATLM--FESWMVKHGKVYDSVAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADL 107

Query:    73 TNQEFKAFRNGYR-RPDGLTSRKGTSFKYENVI-DV-PATMDWRKNGAVTPIKNQGPCGS 129
             +  E+    +G   RP        +S +Y+    DV P ++DWR  GAVT +K+QG C S
Sbjct:   108 SLHEYGEICHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRS 167

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CWAFS V A EG+ ++ TG+L++LSEQ+L++C+    ++GC GG++E A++FI++N G+ 
Sbjct:   168 CWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKE--NNGCGGGKVETAYEFIMNNGGLG 225

Query:   190 TEANYPYQAVDGTCN-KTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSA 248
             T+ +YPY+A++G C  +  E +    I GYE +PAN E AL+KAVA+QPV   +D+S   
Sbjct:   226 TDNDYPYKALNGVCEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSRE 285

Query:   249 FQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDA 308
             FQ Y SGVF G CGT L+HGV  VGYG T NG  YW+VKNS G +WGE GY++M R+I  
Sbjct:   286 FQLYESGVFDGTCGTNLNHGVVVVGYG-TENGRDYWIVKNSRGDTWGEAGYMKMARNIAN 344

Query:   309 KEGLCGIAMDSSYP 322
               GLCGIAM +SYP
Sbjct:   345 PRGLCGIAMRASYP 358


>TAIR|locus:2030027
            symbol:AT1G29110 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            EMBL:CP002684 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            IPI:IPI00544534 RefSeq:NP_564322.1 UniGene:At.51816
            ProteinModelPortal:F4HZW2 SMR:F4HZW2 EnsemblPlants:AT1G29110.1
            GeneID:839786 KEGG:ath:AT1G29110 OMA:SCRANAR Uniprot:F4HZW2
        Length = 334

 Score = 695 (249.7 bits), Expect = 1.7e-68, P = 1.7e-68
 Identities = 138/320 (43%), Positives = 200/320 (62%)

Query:    11 LQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFA 70
             L E S+ + H+QWM+++ +VYK+  EKE R ++FK N++FIE+ N  GN+ Y L +NEF 
Sbjct:    29 LNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKVFKKNLKFIENFNNMGNQSYTLGVNEFT 88

Query:    71 DQTNQEFKAFRNGYR-RPDGLTS--RKGTSFKYENVIDVPA---TMDWRKNGAVTPIKNQ 124
             D   +EF A   G R     L+    K    +  N+ D+     + DWR  GAVTP+K Q
Sbjct:    89 DWKTEEFLATHTGLRVNVTSLSELFNKTKPSRNWNMSDIDMEDESKDWRDEGAVTPVKYQ 148

Query:   125 GPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIH 184
             G C              +T+++   L++LSEQ+L+ CD    + GC GGE E+AFK+II 
Sbjct:   149 GACR-------------LTKISGKNLLTLSEQQLIDCDIEK-NGGCNGGEFEEAFKYIIK 194

Query:   185 NDGITTEANYPYQAVDGTCN-KTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSID 243
             N G++ E  YPYQ    +C      A H  +I+G++ VP+++E ALL+AV  QPV+V ID
Sbjct:   195 NGGVSLETEYPYQVKKESCRANARRAPHT-QIRGFQMVPSHNERALLEAVRRQPVSVLID 253

Query:   244 ASGSAFQFYSSGVFTG-DCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRM 302
             A   +F  Y  GV+ G DCGT+++H VT VGYG T +G  YW++KNSWG SWGE GY+R+
Sbjct:   254 ARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYG-TMSGLNYWVLKNSWGESWGENGYMRI 312

Query:   303 KRDIDAKEGLCGIAMDSSYP 322
             +RD++  +G+CGIA  ++YP
Sbjct:   313 RRDVEWPQGMCGIAQVAAYP 332


>RGD|1560071
            symbol:Ctsll3 "cathepsin L-like 3" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1560071 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:CH474032 IPI:IPI00560469 RefSeq:XP_001065834.2
            RefSeq:XP_573976.3 UniGene:Rn.104851 MEROPS:C01.107
            Ensembl:ENSRNOT00000061398 GeneID:498691 KEGG:rno:498691
            UCSC:RGD:1560071 CTD:70202 OMA:NCGIASD OrthoDB:EOG4HDSTZ
            NextBio:700548 Uniprot:D3ZJV2
        Length = 330

 Score = 694 (249.4 bits), Expect = 2.1e-68, P = 2.1e-68
 Identities = 141/328 (42%), Positives = 199/328 (60%)

Query:     1 IAASQVTSRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNA---A 57
             +    +++    + S     E+W +K+GK Y   EE +KR  ++++N++ I   N     
Sbjct:    10 LCLGMISAAPTHDPSFDTVWEEWKTKHGKTYNTNEEGQKR-AVWENNMKMINLHNEDYLK 68

Query:    58 GNKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA 117
             G   + L +N F D TN EF+    G++   G  ++    F    + DVP T+DWRK+G 
Sbjct:    69 GKHGFSLEMNAFGDLTNTEFRELMTGFQ---GQKTKMMKVFPEPFLGDVPKTVDWRKHGY 125

Query:   118 VTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMED 177
             VTP+KNQGPCGSCWAFSAV + EG     TGKL+ LSEQ LV C  S  + GC+GG  + 
Sbjct:   126 VTPVKNQGPCGSCWAFSAVGSLEGQVFRKTGKLVPLSEQNLVDCSWSHGNKGCDGGLPDF 185

Query:   178 AFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ- 236
             AF+++  N G+ T  +YPY+A++GTC + N     AK+ G+ ++P  SE AL+KAVA   
Sbjct:   186 AFQYVKDNGGLDTSVSYPYEALNGTC-RYNPKYSAAKVVGFMSIPP-SENALMKAVATVG 243

Query:   237 PVAVSIDASGSAFQFYSSGVF-TGDCG-TELDHGVTAVGYGATANGTKYWLVKNSWGTSW 294
             P++V ID    +FQFY  G++   DC  T L+H V  VGYG  ++G KYWLVKNSWG  W
Sbjct:   244 PISVGIDIKHKSFQFYKGGMYYEPDCSSTNLNHAVLVVGYGEESDGRKYWLVKNSWGRDW 303

Query:   295 GEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
             G +GYI+M +D +     CGIA D+SYP
Sbjct:   304 GMDGYIKMAKDWNNN---CGIASDASYP 328


>MGI|MGI:88564
            symbol:Ctsl "cathepsin L" species:10090 "Mus musculus"
            [GO:0004177 "aminopeptidase activity" evidence=ISO] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISO] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005730 "nucleolus"
            evidence=NAS] [GO:0005737 "cytoplasm" evidence=ISO] [GO:0005764
            "lysosome" evidence=ISO] [GO:0005773 "vacuole" evidence=ISO]
            [GO:0005902 "microvillus" evidence=ISO] [GO:0006508 "proteolysis"
            evidence=ISO;IDA] [GO:0007154 "cell communication" evidence=ISO]
            [GO:0008152 "metabolic process" evidence=TAS] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISO;TAS] [GO:0009897 "external side of
            plasma membrane" evidence=ISO] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0030141 "secretory granule" evidence=ISO]
            [GO:0030984 "kininogen binding" evidence=ISO] [GO:0032403 "protein
            complex binding" evidence=ISO] [GO:0042277 "peptide binding"
            evidence=ISO] [GO:0042393 "histone binding" evidence=ISO;NAS]
            [GO:0043005 "neuron projection" evidence=ISO] [GO:0043204
            "perikaryon" evidence=ISO] [GO:0045177 "apical part of cell"
            evidence=ISO] [GO:0048863 "stem cell differentiation" evidence=NAS]
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IMP] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:88564 GO:GO:0005730 GO:GO:0009897 GO:GO:0034698
            GO:GO:0043204 GO:GO:0009749 GO:GO:0030141 GO:GO:0048863
            GO:GO:0051384 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045177 GO:GO:0043005
            GO:GO:0007283 GO:GO:0004177 GO:GO:0005764 GO:GO:0042277
            GO:GO:0009267 GO:GO:0021675 GO:GO:0042393 GO:GO:0005902
            GO:GO:0010259 GO:GO:0004197 GO:GO:0048102 GO:GO:0046697
            HOVERGEN:HBG011513 KO:K01365 OMA:EEFRATH OrthoDB:EOG48PMKF
            MEROPS:C01.032 BRENDA:3.4.22.15 ChiTaRS:CTSL1 EMBL:X06086
            EMBL:J02583 EMBL:M20495 EMBL:AF121837 EMBL:AF121838 EMBL:AF121839
            EMBL:BC068163 EMBL:X04392 IPI:IPI00128154 PIR:S01177
            RefSeq:NP_034114.1 UniGene:Mm.930 PDB:1MVV PDBsum:1MVV
            ProteinModelPortal:P06797 SMR:P06797 STRING:P06797
            PhosphoSite:P06797 PaxDb:P06797 PRIDE:P06797
            Ensembl:ENSMUST00000021933 GeneID:13039 KEGG:mmu:13039 CTD:13039
            InParanoid:P06797 BioCyc:MetaCyc:MONOMER-14812 BindingDB:P06797
            ChEMBL:CHEMBL5291 NextBio:282928 Bgee:P06797 CleanEx:MM_CTSL
            Genevestigator:P06797 GermOnline:ENSMUSG00000021477 GO:GO:0060008
            Uniprot:P06797
        Length = 334

 Score = 671 (241.3 bits), Expect = 5.8e-66, P = 5.8e-66
 Identities = 141/322 (43%), Positives = 200/322 (62%)

Query:    10 KLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAA---GNKPYKLSI 66
             K  +   +E H QW S + ++Y   EE+ +R  I++ N+  I+  N     G   + + +
Sbjct:    20 KFDQTFSAEWH-QWKSTHRRLYGTNEEEWRR-AIWEKNMRMIQLHNGEYSNGQHGFSMEM 77

Query:    67 NEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGP 126
             N F D TN+EF+   NGYR       +KG  F+   ++ +P ++DWR+ G VTP+KNQG 
Sbjct:    78 NAFGDMTNEEFRQVVNGYRHQK---HKKGRLFQEPLMLKIPKSVDWREKGCVTPVKNQGQ 134

Query:   127 CGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHND 186
             CGSCWAFSA    EG   L TGKLISLSEQ LV C  +  + GC GG M+ AF++I  N 
Sbjct:   135 CGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENG 194

Query:   187 GITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDAS 245
             G+ +E +YPY+A DG+C    E + VA   G+  +P   E+AL+KAVA   P++V++DAS
Sbjct:   195 GLDSEESYPYEAKDGSCKYRAEFA-VANDTGFVDIP-QQEKALMKAVATVGPISVAMDAS 252

Query:   246 GSAFQFYSSGVF-TGDCGTE-LDHGVTAVGYG---ATANGTKYWLVKNSWGTSWGEEGYI 300
               + QFYSSG++   +C ++ LDHGV  VGYG     +N  KYWLVKNSWG+ WG EGYI
Sbjct:   253 HPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYI 312

Query:   301 RMKRDIDAKEGLCGIAMDSSYP 322
             ++ +D   ++  CG+A  +SYP
Sbjct:   313 KIAKD---RDNHCGLATAASYP 331


>TAIR|locus:2097104
            symbol:AT3G43960 species:3702 "Arabidopsis thaliana"
            [GO:0005886 "plasma membrane" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0031225 "anchored to
            membrane" evidence=TAS] [GO:0048767 "root hair elongation"
            evidence=IMP] [GO:0016132 "brassinosteroid biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:CP002686
            GenomeReviews:BA000014_GR GO:GO:0031225 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0048767 MEROPS:I29.003 HOGENOM:HOG000230773
            EMBL:AL163975 EMBL:AK118634 IPI:IPI00526842 PIR:T48950
            RefSeq:NP_566867.1 UniGene:At.43352 ProteinModelPortal:Q9LXW3
            SMR:Q9LXW3 STRING:Q9LXW3 PaxDb:Q9LXW3 PRIDE:Q9LXW3
            EnsemblPlants:AT3G43960.1 GeneID:823513 KEGG:ath:AT3G43960
            TAIR:At3g43960 eggNOG:NOG286334 InParanoid:Q9LXW3 KO:K01376
            OMA:MAISFRT PhylomeDB:Q9LXW3 ProtClustDB:CLSN2917367
            Genevestigator:Q9LXW3 GermOnline:AT3G43960 Uniprot:Q9LXW3
        Length = 376

 Score = 669 (240.6 bits), Expect = 9.4e-66, P = 9.4e-66
 Identities = 139/328 (42%), Positives = 193/328 (58%)

Query:     1 IAASQVTSRKLQ--EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAG 58
             I+   VT+ + Q  E  +   +EQW+ + GK Y    EKE+RF+IFKDN++ IE  N+  
Sbjct:    20 ISLGVVTATESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDP 79

Query:    59 NKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAV 118
             N+ Y+  +N+F+D T  EF+A   G +      S     ++Y+    +P  +DWR+ GAV
Sbjct:    80 NRSYERGLNKFSDLTADEFQASYLGGKMEKKSLSDVAERYQYKEGDVLPDEVDWRERGAV 139

Query:   119 TP-IKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMED 177
              P +K QG CGSCWAF+A  A EGI Q+TTG+L+SLSEQEL+ CD    + GC GG    
Sbjct:   140 VPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVW 199

Query:   178 AFKFIIHNDGITTEANYPYQAVDGTCNKTNE--ASHVAKIKGYETVPANSEEALLKAVAN 235
             AF+FI  N GI ++  Y Y   D    K  E   + V  I G+E VP N E +L KAVA 
Sbjct:   200 AFEFIKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAY 259

Query:   236 QPVAVSIDASGSAFQFYSSGVFTGDCGTEL-DHGVTAVGYGATANGTKYWLVKNSWGTSW 294
             QP++V I A+  +   Y SGV+ G C     DH V  VGYG +++   YWL++NSWG  W
Sbjct:   260 QPISVMISAANMSD--YKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEW 317

Query:   295 GEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
             GE GY+R++R+     G C +A+   YP
Sbjct:   318 GEGGYLRLQRNFHEPTGKCAVAVAPVYP 345


>RGD|1308751
            symbol:RGD1308751 "similar to Cathepsin L precursor (Major
            excreted protein) (MEP)" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1308751 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:CH474032 IPI:IPI00365697 RefSeq:XP_001065885.2
            RefSeq:XP_225137.5 MEROPS:C01.069 Ensembl:ENSRNOT00000061391
            GeneID:290981 KEGG:rno:290981 UCSC:RGD:1308751 CTD:290981
            OMA:ESYAYEA OrthoDB:EOG42823G NextBio:631921 Uniprot:D3ZKC3
        Length = 330

 Score = 667 (239.9 bits), Expect = 1.5e-65, P = 1.5e-65
 Identities = 138/329 (41%), Positives = 198/329 (60%)

Query:     1 IAASQVTSRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNA---A 57
             +    +++    + S     E+W +K+GK Y   EE +KR  ++++N++ I   N     
Sbjct:    10 LCLGMISAAPTHDPSFDTVWEEWKTKHGKTYNTNEEGQKR-AVWENNMKMINLHNEDYLK 68

Query:    58 GNKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA 117
             G   + L +N F D TN EF+    G++    +  ++ T F+   + D+P ++DWR++G 
Sbjct:    69 GKHGFSLEMNAFGDLTNTEFRELMTGFQ---SMGPKETTIFREPFLGDIPKSLDWREHGY 125

Query:   118 VTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMED 177
             VTP+KNQG CGSCWAFSAV + EG     TGKL+SLSEQ LV C  S  + GC GG ME 
Sbjct:   126 VTPVKNQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDCSWSYGNLGCNGGLMEF 185

Query:   178 AFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ- 236
             AF+++  N G+ T  +Y Y+A DG C + N     A + G+  VP  SE+ L+ AVA+  
Sbjct:   186 AFQYVKENRGLDTGESYAYEAQDGLC-RYNPKYSAANVTGFVKVPL-SEDDLMSAVASVG 243

Query:   237 PVAVSIDASGSAFQFYSSGVF-TGDCG-TELDHGVTAVGYGATANGTKYWLVKNSWGTSW 294
             PV+V ID+   +F+FYS G++   DC  TE+DH V  VGYG  ++G KYWLVKNSWG  W
Sbjct:   244 PVSVGIDSHHQSFRFYSGGMYYEPDCSSTEMDHAVLVVGYGEESDGGKYWLVKNSWGEDW 303

Query:   295 GEEGYIRMKRDIDAKEGLCGIAMDSSYPT 323
             G +GYI+M +D   +   CGIA  + YPT
Sbjct:   304 GMDGYIKMAKD---QNNNCGIATYAIYPT 329


>UNIPROTKB|F1NYJ1
            symbol:CTSL2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 IPI:IPI00602255
            OMA:DITHHEF EMBL:AADN02067812 Ensembl:ENSGALT00000020588
            ArrayExpress:F1NYJ1 Uniprot:F1NYJ1
        Length = 339

 Score = 666 (239.5 bits), Expect = 2.0e-65, P = 2.0e-65
 Identities = 137/315 (43%), Positives = 189/315 (60%)

Query:    18 EKHEQ-WMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEFADQT 73
             + H Q W S + K Y   EE  +R  +++ N++ IE  N   + G   YKL +N+F D T
Sbjct:    27 DSHWQLWKSWHSKDYHEREESWRRV-VWEKNLKMIELHNLDHSLGKHSYKLGMNQFGDMT 85

Query:    74 NQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAF 133
              +EF+   NGY+        +G+ F   + ++ P ++DWR+ G VTP+K+QG CGSCWAF
Sbjct:    86 AEEFRQLMNGYKHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAF 145

Query:   134 SAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEAN 193
             S   A EG     TGKL+SLSEQ LV C     + GC GG M+ AF+++  N GI +E +
Sbjct:   146 STTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEES 205

Query:   194 YPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSAFQFY 252
             YPY A D    +     + A   G+  +P   E AL+KAVA+  PV+V+IDA  S+FQFY
Sbjct:   206 YPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGHSSFQFY 265

Query:   253 SSGVF-TGDCGTE-LDHGVTAVGYG---ATANGTKYWLVKNSWGTSWGEEGYIRMKRDID 307
              SG++   DC +E LDHGV  VGYG      +G KYW+VKNSWG  WG++GYI M +D  
Sbjct:   266 QSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKD-- 323

Query:   308 AKEGLCGIAMDSSYP 322
              ++  CGIA  +SYP
Sbjct:   324 -RKNHCGIATAASYP 337


>RGD|2448
            symbol:Ctsl1 "cathepsin L1" species:10116 "Rattus norvegicus"
          [GO:0002250 "adaptive immune response" evidence=ISO] [GO:0004177
          "aminopeptidase activity" evidence=IDA] [GO:0004197 "cysteine-type
          endopeptidase activity" evidence=ISO;IDA] [GO:0005576 "extracellular
          region" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IDA]
          [GO:0005764 "lysosome" evidence=ISO;IDA] [GO:0005773 "vacuole"
          evidence=IDA] [GO:0005902 "microvillus" evidence=IDA] [GO:0006508
          "proteolysis" evidence=IEP;ISO] [GO:0007154 "cell communication"
          evidence=IDA] [GO:0007283 "spermatogenesis" evidence=IEP] [GO:0008234
          "cysteine-type peptidase activity" evidence=ISO] [GO:0008584 "male
          gonad development" evidence=IEP] [GO:0009267 "cellular response to
          starvation" evidence=IEP] [GO:0009749 "response to glucose stimulus"
          evidence=IEP] [GO:0009897 "external side of plasma membrane"
          evidence=IDA] [GO:0010259 "multicellular organismal aging"
          evidence=IEP] [GO:0014070 "response to organic cyclic compound"
          evidence=IEP] [GO:0021675 "nerve development" evidence=IEP]
          [GO:0030984 "kininogen binding" evidence=IPI] [GO:0032403 "protein
          complex binding" evidence=IPI] [GO:0034698 "response to gonadotropin
          stimulus" evidence=IEP] [GO:0042277 "peptide binding" evidence=IDA]
          [GO:0042393 "histone binding" evidence=ISO] [GO:0043005 "neuron
          projection" evidence=IDA] [GO:0043204 "perikaryon" evidence=IDA]
          [GO:0046697 "decidualization" evidence=IEP] [GO:0048102 "autophagic
          cell death" evidence=IEP] [GO:0051384 "response to glucocorticoid
          stimulus" evidence=IEP] [GO:0060008 "Sertoli cell differentiation"
          evidence=IEP] [GO:0097067 "cellular response to thyroid hormone
          stimulus" evidence=ISO] [GO:0030141 "secretory granule" evidence=IDA]
          [GO:0045177 "apical part of cell" evidence=IDA] [GO:0060441
          "epithelial tube branching involved in lung morphogenesis"
          evidence=ISO] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
          PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:Y00697 RGD:2448
          GO:GO:0005576 GO:GO:0009897 GO:GO:0034698 GO:GO:0043204 GO:GO:0009749
          GO:GO:0051384 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
          InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
          PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
          PROSITE:PS00139 PROSITE:PS00639 GO:GO:0043005 GO:GO:0007283
          GO:GO:0004177 GO:GO:0005764 GO:GO:0042277 GO:GO:0009267 GO:GO:0021675
          GO:GO:0005902 GO:GO:0010259 GO:GO:0004197 GO:GO:0048102 GO:GO:0046697
          GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 KO:K01365
          OrthoDB:EOG48PMKF MEROPS:C01.032 OMA:FDQNLDT CTD:1514
          BRENDA:3.4.22.15 GO:GO:0060008 EMBL:AF025476 EMBL:BC063175
          EMBL:S85184 IPI:IPI00326070 PIR:S07098 RefSeq:NP_037288.1
          UniGene:Rn.1294 ProteinModelPortal:P07154 SMR:P07154 IntAct:P07154
          STRING:P07154 PhosphoSite:P07154 PRIDE:P07154
          Ensembl:ENSRNOT00000025462 GeneID:25697 KEGG:rno:25697 UCSC:RGD:2448
          InParanoid:P07154 SABIO-RK:P07154 BindingDB:P07154 ChEMBL:CHEMBL2305
          NextBio:607715 Genevestigator:P07154 GermOnline:ENSRNOG00000018566
          Uniprot:P07154
        Length = 334

 Score = 664 (238.8 bits), Expect = 3.2e-65, P = 3.2e-65
 Identities = 137/319 (42%), Positives = 196/319 (61%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAA---GNKPYKLSINEF 69
             + + + +  QW S + ++Y   EE+ +R  +++ N+  I+  N     G   + + +N F
Sbjct:    22 DQTFNAQWHQWKSTHRRLYGTNEEEWRR-AVWEKNMRMIQLHNGEYSNGKHGFTMEMNAF 80

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
              D TN+EF+   NGYR       +KG  F+   ++ +P T+DWR+ G VTP+KNQG CGS
Sbjct:    81 GDMTNEEFRQIVNGYRHQK---HKKGRLFQEPLMLQIPKTVDWREKGCVTPVKNQGQCGS 137

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CWAFSA    EG   L TGKLISLSEQ LV C     + GC GG M+ AF++I  N G+ 
Sbjct:   138 CWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLD 197

Query:   190 TEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSA 248
             +E +YPY+A DG+C    E + VA   G+  +P   E+AL+KAVA   P++V++DAS  +
Sbjct:   198 SEESYPYEAKDGSCKYRAEYA-VANDTGFVDIP-QQEKALMKAVATVGPISVAMDASHPS 255

Query:   249 FQFYSSGVF-TGDCGT-ELDHGVTAVGYG---ATANGTKYWLVKNSWGTSWGEEGYIRMK 303
              QFYSSG++   +C + +LDHGV  VGYG     +N  KYWLVKNSWG  WG +GYI++ 
Sbjct:   256 LQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIA 315

Query:   304 RDIDAKEGLCGIAMDSSYP 322
             +D   +   CG+A  +SYP
Sbjct:   316 KD---RNNHCGLATAASYP 331


>ZFIN|ZDB-GENE-040718-61
            symbol:ctsl.1 "cathepsin L.1" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-040718-61
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            GeneTree:ENSGT00660000095458 MEROPS:C01.092 EMBL:FP015965
            EMBL:BC075887 IPI:IPI00513499 RefSeq:NP_001002368.1
            UniGene:Dr.85174 SMR:Q6DHT0 Ensembl:ENSDART00000017756
            GeneID:436641 KEGG:dre:436641 CTD:436641 InParanoid:Q6DHT0
            OMA:GGQMENA OrthoDB:EOG41ZFB9 NextBio:20831086 Uniprot:Q6DHT0
        Length = 334

 Score = 664 (238.8 bits), Expect = 3.2e-65, P = 3.2e-65
 Identities = 146/332 (43%), Positives = 201/332 (60%)

Query:     1 IAASQVTSRKLQEASLSE-KHEQWMSKYGKVYKNPEEKEKRFRIFKDN--VEFIESLNA- 56
             +AA+ +        SL + +   W  K+GK Y++ EE+  R   +  N  +  + ++ A 
Sbjct:     6 VAAAFLAVASAASLSLEDMEFHAWKLKFGKSYRSAEEESHRQLTWLTNRKLVLVHNMMAD 65

Query:    57 AGNKPYKLSINEFADQTNQEFK--AFRNGYRRPDGLTSRKG-TSFKYENVIDVPATMDWR 113
              G K Y+L +  FAD +N+E++   FR      +   +R G T F+      VP T+DWR
Sbjct:    66 QGLKSYRLGMTYFADMSNEEYRQLVFRGCLGSMNNTKARGGSTFFRLRKAAVVPDTVDWR 125

Query:   114 KNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGG 173
               G VT IK+Q  CGSCWAFSA  + EG T   TGKL+SLSEQ+LV C  S  ++GC+GG
Sbjct:   126 DKGYVTDIKDQKQCGSCWAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGSYGNYGCDGG 185

Query:   174 EMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAV 233
              M+ AF++I  N G+ TE +YPY+A DG C + N ++  A   GY  + +  E AL +AV
Sbjct:   186 LMDQAFQYIEANKGLDTEDSYPYEAQDGEC-RFNPSTVGASCTGYVDIASGDESALQEAV 244

Query:   234 AN-QPVAVSIDASGSAFQFYSSGVFTG-DCGT-ELDHGVTAVGYGATANGTKYWLVKNSW 290
             A   P++V+IDA  S+FQ YSSGV+   DC + ELDHGV AVGYG++ NG  YW+VKNSW
Sbjct:   245 ATIGPISVAIDAGHSSFQLYSSGVYNEPDCSSSELDHGVLAVGYGSS-NGDDYWIVKNSW 303

Query:   291 GTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
             G  WG +GYI M R+   K   CGIA  +SYP
Sbjct:   304 GLDWGVQGYILMSRN---KSNQCGIATAASYP 332


>UNIPROTKB|O60911
            symbol:CTSL2 "Cathepsin L2" species:9606 "Homo sapiens"
            [GO:0004177 "aminopeptidase activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA] [GO:0005902
            "microvillus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0007283 "spermatogenesis" evidence=IEA] [GO:0009267 "cellular
            response to starvation" evidence=IEA] [GO:0009749 "response to
            glucose stimulus" evidence=IEA] [GO:0009897 "external side of
            plasma membrane" evidence=IEA] [GO:0010259 "multicellular
            organismal aging" evidence=IEA] [GO:0021675 "nerve development"
            evidence=IEA] [GO:0030984 "kininogen binding" evidence=IEA]
            [GO:0032403 "protein complex binding" evidence=IEA] [GO:0034698
            "response to gonadotropin stimulus" evidence=IEA] [GO:0042277
            "peptide binding" evidence=IEA] [GO:0043005 "neuron projection"
            evidence=IEA] [GO:0043204 "perikaryon" evidence=IEA] [GO:0046697
            "decidualization" evidence=IEA] [GO:0048102 "autophagic cell death"
            evidence=IEA] [GO:0051384 "response to glucocorticoid stimulus"
            evidence=IEA] [GO:0060008 "Sertoli cell differentiation"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=TAS] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_118779 Reactome:REACT_6900
            GO:GO:0009897 GO:GO:0019886 GO:GO:0034698 GO:GO:0043204
            GO:GO:0009749 GO:GO:0030141 GO:GO:0051384 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0045177 GO:GO:0043005 GO:GO:0007283
            GO:GO:0004177 GO:GO:0042277 GO:GO:0009267 GO:GO:0021675
            GO:GO:0043202 GO:GO:0005902 GO:GO:0010259 GO:GO:0004197
            GO:GO:0048102 GO:GO:0046697 HOVERGEN:HBG011513 CTD:1515
            OrthoDB:EOG48PMKF OMA:FDQNLDT GO:GO:0060008 EMBL:Y14734
            EMBL:AB001928 EMBL:AF070448 EMBL:AB019534 EMBL:AY358641
            EMBL:AL445670 EMBL:BC023504 EMBL:BC110512 IPI:IPI00000013
            RefSeq:NP_001188504.1 RefSeq:NP_001324.2 UniGene:Hs.610096 PDB:1FH0
            PDB:3H6S PDB:3KFQ PDBsum:1FH0 PDBsum:3H6S PDBsum:3KFQ
            ProteinModelPortal:O60911 SMR:O60911 IntAct:O60911 STRING:O60911
            MEROPS:I29.010 PhosphoSite:O60911 PaxDb:O60911 PeptideAtlas:O60911
            PRIDE:O60911 Ensembl:ENST00000259470 Ensembl:ENST00000538255
            GeneID:1515 KEGG:hsa:1515 UCSC:uc004awt.3 GeneCards:GC09M099794
            HGNC:HGNC:2538 HPA:CAB017112 MIM:603308 neXtProt:NX_O60911
            PharmGKB:PA27036 InParanoid:O60911 KO:K01375 PhylomeDB:O60911
            BRENDA:3.4.22.43 SABIO-RK:O60911 BindingDB:O60911 ChEMBL:CHEMBL3272
            ChiTaRS:CTSL2 EvolutionaryTrace:O60911 GenomeRNAi:1515 NextBio:6277
            Bgee:O60911 CleanEx:HS_CTSL2 Genevestigator:O60911
            GermOnline:ENSG00000136943 Uniprot:O60911
        Length = 334

 Score = 662 (238.1 bits), Expect = 5.2e-65, P = 5.2e-65
 Identities = 140/319 (43%), Positives = 192/319 (60%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEF 69
             + +L  K  QW + + ++Y   EE  +R  +++ N++ IE  N   + G   + +++N F
Sbjct:    22 DQNLDTKWYQWKATHRRLYGANEEGWRR-AVWEKNMKMIELHNGEYSQGKHGFTMAMNAF 80

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
              D TN+EF+     +R       RKG  F+    +D+P ++DWRK G VTP+KNQ  CGS
Sbjct:    81 GDMTNEEFRQMMGCFRNQK---FRKGKVFREPLFLDLPKSVDWRKKGYVTPVKNQKQCGS 137

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CWAFSA  A EG     TGKL+SLSEQ LV C     + GC GG M  AF+++  N G+ 
Sbjct:   138 CWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLD 197

Query:   190 TEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSA 248
             +E +YPY AVD  C    E S VA   G+  V    E+AL+KAVA   P++V++DA  S+
Sbjct:   198 SEESYPYVAVDEICKYRPENS-VANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSS 256

Query:   249 FQFYSSGV-FTGDCGTE-LDHGVTAVGYG---ATANGTKYWLVKNSWGTSWGEEGYIRMK 303
             FQFY SG+ F  DC ++ LDHGV  VGYG   A +N +KYWLVKNSWG  WG  GY+++ 
Sbjct:   257 FQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIA 316

Query:   304 RDIDAKEGLCGIAMDSSYP 322
             +D   K   CGIA  +SYP
Sbjct:   317 KD---KNNHCGIATAASYP 332


>UNIPROTKB|P25975
            symbol:CTSL1 "Cathepsin L1" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 EMBL:X91755 EMBL:BC102312 EMBL:AB017648
            IPI:IPI00687440 PIR:S15845 RefSeq:NP_776457.1 UniGene:Bt.3987
            ProteinModelPortal:P25975 SMR:P25975 STRING:P25975
            Ensembl:ENSBTAT00000022710 Ensembl:ENSBTAT00000036427 GeneID:281108
            KEGG:bta:281108 CTD:1515 InParanoid:P25975 KO:K01365 OMA:EEFRATH
            OrthoDB:EOG48PMKF BindingDB:P25975 ChEMBL:CHEMBL2113
            NextBio:20805179 ArrayExpress:P25975 Uniprot:P25975
        Length = 334

 Score = 658 (236.7 bits), Expect = 1.4e-64, P = 1.4e-64
 Identities = 136/321 (42%), Positives = 197/321 (61%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEF 69
             + +L     QW + + ++Y   EE+ +R  +++ N + I+  N   + G   +++++N F
Sbjct:    22 DPNLDAHWHQWKATHRRLYGMNEEEWRR-AVWEKNKKIIDLHNQEYSEGKHGFRMAMNAF 80

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
              D TN+EF+   NG++       +KG  F    ++DVP ++DW K G VTP+KNQG CGS
Sbjct:    81 GDMTNEEFRQVMNGFQNQK---HKKGKLFHEPLLVDVPKSVDWTKKGYVTPVKNQGQCGS 137

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CWAFSA  A EG     TGKL+SLSEQ LV C  +  + GC GG M++AF++I  N G+ 
Sbjct:   138 CWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGGLD 197

Query:   190 TEANYPYQAVD-GTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGS 247
             +E +YPY A D  +CN   E S  A   G+  +P   E+AL+KAVA   P++V+IDA  +
Sbjct:   198 SEESYPYLATDTNSCNYKPECS-AANDTGFVDIPQR-EKALMKAVATVGPISVAIDAGHT 255

Query:   248 AFQFYSSGVFTG-DCGT-ELDHGVTAVGYG---ATANGTKYWLVKNSWGTSWGEEGYIRM 302
             +FQFY SG++   DC + +LDHGV  VGYG     +N  K+W+VKNSWG  WG  GY++M
Sbjct:   256 SFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGWNGYVKM 315

Query:   303 KRDIDAKEGLCGIAMDSSYPT 323
              +D   +   CGIA  +SYPT
Sbjct:   316 AKD---QNNHCGIATAASYPT 333


>DICTYBASE|DDB_G0272815
            symbol:cprE "cysteine proteinase 5" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0272815 GO:GO:0005615
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GenomeReviews:CM000151_GR GO:GO:0005764
            EMBL:AAFI02000008 MEROPS:I29.003 KO:K01376 EMBL:L36205
            RefSeq:XP_644977.1 ProteinModelPortal:P54640 SMR:P54640
            PRIDE:P54640 EnsemblProtists:DDB0185092 GeneID:8618654
            KEGG:ddi:DDB_G0272815 OMA:METAFEF ProtClustDB:CLSZ2430780
            Uniprot:P54640
        Length = 344

 Score = 537 (194.1 bits), Expect = 1.4e-64, Sum P(2) = 1.4e-64
 Identities = 119/276 (43%), Positives = 170/276 (61%)

Query:     6 VTSRKLQEASLSEKHE--QWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYK 63
             V + K Q + L  ++    WM  + K Y + EE   R+ IFK N+++++  N+ G++   
Sbjct:    14 VATAKQQFSELQYRNAFTDWMITHQKSYTS-EEFGARYNIFKANMDYVQQWNSKGSETV- 71

Query:    64 LSINEFADQTNQEFKAFRNGYRRPD-GLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIK 122
             L +N FAD TN+E+   RN Y       +S  GT  +        A+ DWR  GAVTP+K
Sbjct:    72 LGLNNFADITNEEY---RNTYLGTKFDASSLIGTQEEKVFTTSSAASKDWRSEGAVTPVK 128

Query:   123 NQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFI 182
             NQG CG CW+FS   +TEG    + G+L+SLSEQ L+ C T   + GC+GG M  AF++I
Sbjct:   129 NQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCSTE--NSGCDGGLMTYAFEYI 186

Query:   183 IHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSI 242
             I+N+GI TE++YPY+A +G C   +E S  A +  Y+TV A SE +L  AV   PV+V+I
Sbjct:   187 INNNGIDTESSYPYKAENGKCEYKSENSG-ATLSSYKTVTAGSESSLESAVNVNPVSVAI 245

Query:   243 DASGSAFQFYSSGVF-TGDCGTE-LDHGVTAVGYGA 276
             DAS  +FQ Y+SG++   +C +E LDHGV AVGYG+
Sbjct:   246 DASHQSFQLYTSGIYYEPECSSENLDHGVLAVGYGS 281

 Score = 139 (54.0 bits), Expect = 1.4e-64, Sum P(2) = 1.4e-64
 Identities = 25/47 (53%), Positives = 33/47 (70%)

Query:   276 ATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
             + ++  +YW+VKNSWGTSWG EGYI M R+ D     CGIA  +S+P
Sbjct:   299 SASSSNEYWIVKNSWGTSWGIEGYILMSRNRDNN---CGIASSASFP 342


>DICTYBASE|DDB_G0279799
            symbol:cprB "cysteine proteinase 2" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0279799 GenomeReviews:CM000152_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            MEROPS:I29.003 KO:K01365 EMBL:AAFI02000033 EMBL:M16039 EMBL:X03344
            PIR:A25439 RefSeq:XP_641494.1 ProteinModelPortal:P04989 SMR:P04989
            EnsemblProtists:DDB0214998 GeneID:8622234 KEGG:ddi:DDB_G0279799
            OMA:YVNITAG Uniprot:P04989
        Length = 376

 Score = 534 (193.0 bits), Expect = 2.9e-64, Sum P(2) = 2.9e-64
 Identities = 112/271 (41%), Positives = 158/271 (58%)

Query:     9 RKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINE 68
             R+  E+       +W  K+ + Y +  E   R+ IFK N++++++ N+ G+    L +N 
Sbjct:    25 RRFSESQYRTAFTEWTLKFNRQYSS-SEFSNRYSIFKSNMDYVDNWNSKGDSQTVLGLNN 83

Query:    69 FADQTNQEFKAFRNGYR-RPDGLTSRKGTS-FKYENVIDVPATMDWRKNGAVTPIKNQGP 126
             FAD TN+E++    G R          G      E++   P ++DWR   AVTPIK+QG 
Sbjct:    84 FADITNEEYRKTYLGTRVNAHSYNGYDGREVLNVEDLQTNPKSIDWRTKNAVTPIKDQGQ 143

Query:   127 CGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHND 186
             CGSCW+FS   +TEG   L T KL+SLSEQ LV C     + GC+GG M +AF +II N 
Sbjct:   144 CGSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAFDYIIKNK 203

Query:   187 GITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASG 246
             GI TE++YPY A  G+    N++   A IKGY  + A SE +L     + PV+V+IDAS 
Sbjct:   204 GIDTESSYPYTAETGSTCLFNKSDIGATIKGYVNITAGSEISLENGAQHGPVSVAIDASH 263

Query:   247 SAFQFYSSGVF-TGDCG-TELDHGVTAVGYG 275
             ++FQ Y+SG++    C  TELDHGV  VGYG
Sbjct:   264 NSFQLYTSGIYYEPKCSPTELDHGVLVVGYG 294

 Score = 139 (54.0 bits), Expect = 2.9e-64, Sum P(2) = 2.9e-64
 Identities = 26/42 (61%), Positives = 31/42 (73%)

Query:   283 YWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYPTA 324
             YW+VKNSWGTSWG +GYI M +D   ++  CGIA  SSYP A
Sbjct:   338 YWIVKNSWGTSWGIKGYILMSKD---RKNNCGIASVSSYPLA 376


>UNIPROTKB|Q9GL24
            symbol:CTSL1 "Cathepsin L1" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 CTD:1515 KO:K01365
            OrthoDB:EOG48PMKF EMBL:AJ279008 RefSeq:NP_001239115.1
            UniGene:Cfa.3571 ProteinModelPortal:Q9GL24 SMR:Q9GL24
            MEROPS:C01.032 Ensembl:ENSCAFT00000001770
            Ensembl:ENSCAFT00000023837 GeneID:100684364 KEGG:cfa:100684364
            InParanoid:Q9GL24 OMA:FDQNLDT NextBio:20817211 Uniprot:Q9GL24
        Length = 333

 Score = 651 (234.2 bits), Expect = 7.6e-64, P = 7.6e-64
 Identities = 136/320 (42%), Positives = 196/320 (61%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEF 69
             + SL+ +  QW + + ++Y   EE  +R  +++ N++ IE  N   + G   + +++N F
Sbjct:    22 DQSLNAQWYQWKATHRRLYGMNEEGWRR-AVWEKNMKMIELHNREYSQGKHGFTMAMNAF 80

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
              D TN+EF+   NG++       +KG  F+     ++P ++DWR+ G VTP+KNQG CGS
Sbjct:    81 GDMTNEEFRQVMNGFQNQK---HKKGKMFQEPLFAEIPKSVDWREKGYVTPVKNQGQCGS 137

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CWAFSA  A EG     TGKL+SLSEQ LV C  +  + GC GG M++AF+++  N G+ 
Sbjct:   138 CWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAFRYVKDNGGLD 197

Query:   190 TEANYPYQAVDG-TCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGS 247
             +E +YPY   D  TCN   E S  A   G+  +P   E+AL+KAVA   P++V+IDA   
Sbjct:   198 SEESYPYLGRDTETCNYKPECS-AANDTGFVDLPQR-EKALMKAVATLGPISVAIDAGHQ 255

Query:   248 AFQFYSSGV-FTGDCGT-ELDHGVTAVGYG--ATANGTKYWLVKNSWGTSWGEEGYIRMK 303
             +FQFY SG+ F  DC + +LDHGV  VGYG   T +  K+W+VKNSWG  WG  GY++M 
Sbjct:   256 SFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKFWIVKNSWGPEWGWNGYVKMA 315

Query:   304 RDIDAKEGLCGIAMDSSYPT 323
             +D   +   CGIA  +SYPT
Sbjct:   316 KD---QNNHCGIATAASYPT 332


>ZFIN|ZDB-GENE-071004-74
            symbol:zgc:174855 "zgc:174855" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-071004-74
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 MEROPS:C01.032 EMBL:BX000534 EMBL:BC152282
            IPI:IPI00773140 RefSeq:NP_001096592.1 UniGene:Dr.104905 SMR:A7MCR6
            STRING:A7MCR6 Ensembl:ENSDART00000109968 GeneID:569326
            KEGG:dre:569326 NextBio:20889622 Uniprot:A7MCR6
        Length = 335

 Score = 650 (233.9 bits), Expect = 9.7e-64, P = 9.7e-64
 Identities = 135/318 (42%), Positives = 188/318 (59%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEF 69
             +  L +    W S++GK Y    E  +R  I+++N+  IE  N   + GN  +K+ +N+F
Sbjct:    21 DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSLGNHTFKMGMNQF 79

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
              D TN+EF+   NGY++    TS KG  F   +    P  +DWR+ G VTP+K+Q  CGS
Sbjct:    80 GDMTNEEFRQAMNGYKQDPNRTS-KGALFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGS 138

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CW+FS+  A EG     TGKLIS+SEQ LV C     + GC GG M+ AF+++  N G+ 
Sbjct:   139 CWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQAFQYVKENKGLD 198

Query:   190 TEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSA 248
             +E +YPY A D    + +   +VAKI G+  +P  +E AL+ AVA   PV+V+IDAS  +
Sbjct:   199 SEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGNELALMNAVAAVGPVSVAIDASHQS 258

Query:   249 FQFYSSGVF-TGDCGTELDHGVTAVGYG---ATANGTKYWLVKNSWGTSWGEEGYIRMKR 304
              QFY SG++    C + LDH V  VGYG   A   G +YW+VKNSW   WG++GYI M +
Sbjct:   259 LQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAK 318

Query:   305 DIDAKEGLCGIAMDSSYP 322
             D   K   CGIA  +SYP
Sbjct:   319 D---KNNHCGIATMASYP 333


>UNIPROTKB|Q5E998
            symbol:CTSL2 "Cathepsin L2" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            HOVERGEN:HBG011513 UniGene:Bt.3987 MEROPS:C01.032 EMBL:BT021022
            IPI:IPI00711962 ProteinModelPortal:Q5E998 SMR:Q5E998 STRING:Q5E998
            InParanoid:Q5E998 Uniprot:Q5E998
        Length = 334

 Score = 649 (233.5 bits), Expect = 1.2e-63, P = 1.2e-63
 Identities = 135/321 (42%), Positives = 196/321 (61%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEF 69
             + +L     QW + + ++Y   EE+ +R  +++ N + I+  N   + G   +++++N F
Sbjct:    22 DPNLDAHWHQWKATHRRLYGMNEEEWRR-AVWEKNKKIIDLHNQEYSEGKHGFRMAMNAF 80

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
              D TN+EF+   NG++       +KG  F    ++DVP ++DW K G VTP+KNQG CGS
Sbjct:    81 GDMTNEEFRQVMNGFQNQK---HKKGKLFHEPLLVDVPKSVDWTKKGYVTPVKNQGQCGS 137

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CWAFSA  A EG     TGKL+SLSEQ LV C  +  + GC GG M++AF++I  N  + 
Sbjct:   138 CWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGCLD 197

Query:   190 TEANYPYQAVD-GTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGS 247
             +E +YPY A D  +CN   E S  A   G+  +P   E+AL+KAVA   P++V+IDA  +
Sbjct:   198 SEESYPYLATDTNSCNYKPECS-AANDTGFVDIPQR-EKALMKAVATVGPISVAIDAGHT 255

Query:   248 AFQFYSSGVFTG-DCGT-ELDHGVTAVGYG---ATANGTKYWLVKNSWGTSWGEEGYIRM 302
             +FQFY SG++   DC + +LDHGV  VGYG     +N  K+W+VKNSWG  WG  GY++M
Sbjct:   256 SFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGWNGYVKM 315

Query:   303 KRDIDAKEGLCGIAMDSSYPT 323
              +D   +   CGIA  +SYPT
Sbjct:   316 AKD---QNNHCGIATAASYPT 333


>FB|FBgn0013770
            symbol:Cp1 "Cysteine proteinase-1" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS;NAS] [GO:0030163 "protein catabolic process"
            evidence=NAS] [GO:0005764 "lysosome" evidence=NAS] [GO:0048102
            "autophagic cell death" evidence=IEP] [GO:0035071 "salivary gland
            cell autophagic cell death" evidence=IEP] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0045169 "fusome" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:AE013599 GO:GO:0007586 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0035071 GO:GO:0045169 GeneTree:ENSGT00660000095458 KO:K01365
            EMBL:U75652 EMBL:AF012089 EMBL:BT016071 EMBL:D31970
            RefSeq:NP_523735.2 RefSeq:NP_725347.1 RefSeq:NP_725348.1
            UniGene:Dm.7400 ProteinModelPortal:Q95029 SMR:Q95029 IntAct:Q95029
            MINT:MINT-814156 STRING:Q95029 MEROPS:C01.092 PaxDb:Q95029
            EnsemblMetazoa:FBtr0087593 GeneID:36546 KEGG:dme:Dmel_CG6692
            CTD:36546 FlyBase:FBgn0013770 InParanoid:Q95029 OMA:ICHGADP
            OrthoDB:EOG46M91C PhylomeDB:Q95029 GenomeRNAi:36546 NextBio:799136
            Bgee:Q95029 GermOnline:CG6692 Uniprot:Q95029
        Length = 371

 Score = 648 (233.2 bits), Expect = 1.6e-63, P = 1.6e-63
 Identities = 138/319 (43%), Positives = 197/319 (61%)

Query:    21 EQWMS---KYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEFADQTN 74
             E+W +   ++ K Y++  E+  R +IF +N   I   N   A G   +KL++N++AD  +
Sbjct:    57 EEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLH 116

Query:    75 QEFKAFRNGY--------RRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGP 126
              EF+   NG+        R  D   S KG +F     + +P ++DWR  GAVT +K+QG 
Sbjct:   117 HEFRQLMNGFNYTLHKQLRAAD--ESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 174

Query:   127 CGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHND 186
             CGSCWAFS+  A EG     +G L+SLSEQ LV C T   ++GC GG M++AF++I  N 
Sbjct:   175 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 234

Query:   187 GITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDAS 245
             GI TE +YPY+A+D +C+  N+ +  A  +G+  +P   E+ + +AVA   PV+V+IDAS
Sbjct:   235 GIDTEKSYPYEAIDDSCH-FNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 293

Query:   246 GSAFQFYSSGVFTG-DCGTE-LDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMK 303
               +FQFYS GV+    C  + LDHGV  VG+G   +G  YWLVKNSWGT+WG++G+I+M 
Sbjct:   294 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKML 353

Query:   304 RDIDAKEGLCGIAMDSSYP 322
             R+   KE  CGIA  SSYP
Sbjct:   354 RN---KENQCGIASASSYP 369


>ZFIN|ZDB-GENE-030131-572
            symbol:wu:fb37b09 "wu:fb37b09" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030131-572 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:BX465840 IPI:IPI00866294 RefSeq:XP_001923796.1
            UniGene:Dr.25683 PRIDE:E9QBE2 Ensembl:ENSDART00000133962
            GeneID:321853 KEGG:dre:321853 NextBio:20807556 Uniprot:E9QBE2
        Length = 335

 Score = 645 (232.1 bits), Expect = 3.3e-63, P = 3.3e-63
 Identities = 134/318 (42%), Positives = 187/318 (58%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEF 69
             +  L +    W S++GK Y    E  +R  I+++N+  IE  N   + GN  +K+ +N+F
Sbjct:    21 DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSLGNHTFKMGMNQF 79

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
              D TN+EF+   NGY+     TS+ G  F        P  +DWR+ G VTP+K+Q  CGS
Sbjct:    80 GDMTNEEFRQAMNGYKHDPNRTSQ-GPLFMEPKFFAAPQQVDWRQRGYVTPVKDQKQCGS 138

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CW+FS+  A EG     TGKLIS+SEQ LV C     + GC GG M+ AF+++  N G+ 
Sbjct:   139 CWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHGNQGCNGGLMDQAFQYVKENKGLD 198

Query:   190 TEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSA 248
             +E +YPY A D    + +   +VAKI G+  +P  +E AL+ AVA   PV+V+IDAS  +
Sbjct:   199 SEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGNELALMNAVAAVGPVSVAIDASHQS 258

Query:   249 FQFYSSGVF-TGDCGTELDHGVTAVGYG---ATANGTKYWLVKNSWGTSWGEEGYIRMKR 304
              QFY SG++    C ++LDH V  VGYG   A   G +YW+VKNSW   WG++GYI M +
Sbjct:   259 LQFYQSGIYYERACTSQLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAK 318

Query:   305 DIDAKEGLCGIAMDSSYP 322
             D   K   CGIA  +SYP
Sbjct:   319 D---KNNHCGIATMASYP 333


>ZFIN|ZDB-GENE-030131-106
            symbol:ctsl1a "cathepsin L, 1 a" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030131-106 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 HSSP:P43235
            KO:K01365 EMBL:BC066490 IPI:IPI00495935 RefSeq:NP_997749.1
            UniGene:Dr.104499 ProteinModelPortal:Q6NYR5 SMR:Q6NYR5
            MEROPS:C01.074 PRIDE:Q6NYR5 GeneID:321453 KEGG:dre:321453
            CTD:321453 InParanoid:Q6NYR5 NextBio:20807387 ArrayExpress:Q6NYR5
            Bgee:Q6NYR5 Uniprot:Q6NYR5
        Length = 337

 Score = 642 (231.1 bits), Expect = 6.9e-63, P = 6.9e-63
 Identities = 135/331 (40%), Positives = 193/331 (58%)

Query:     1 IAASQVTSRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AA 57
             +  S V +    +  L++  +QW   + K Y   EE  +R  I++ N++ IE  N   + 
Sbjct:    10 LCLSAVFAAPTLDQQLNDHWDQWKKWHSKKYHATEEGWRRV-IWEKNLKKIEMHNLEHSM 68

Query:    58 GNKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA 117
             G   Y+L +N F D T++EF+   NG++       R G+ F   N I+VP  +DWR+ G 
Sbjct:    69 GIHTYRLGMNHFGDMTHEEFRQVMNGFKHKKDRRFR-GSLFMEPNFIEVPNKLDWREKGY 127

Query:   118 VTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMED 177
             VTP+K+QG CGSCWAFS   A EG     TGKL+SLSEQ LV C     + GC GG M+ 
Sbjct:   128 VTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQ 187

Query:   178 AFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ- 236
             AF+++   +G+ +E +YPY   D      +  +  A   G+  +P+  E AL+KA+A   
Sbjct:   188 AFQYVKDQNGLDSEESYPYLGTDDQPCHFDPKNSAANDTGFVDIPSGKERALMKAIAAVG 247

Query:   237 PVAVSIDASGSAFQFYSSGVF-TGDCGTE-LDHGVTAVGYG---ATANGTKYWLVKNSWG 291
             PV+V+IDA   +FQFY SG++   +C +E LDHGV AVGYG      +G KYW+VKNSW 
Sbjct:   248 PVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVDGKKYWIVKNSWS 307

Query:   292 TSWGEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
              +WG++GYI M +D   +   CGIA  +SYP
Sbjct:   308 ENWGDKGYIYMAKD---RHNHCGIATAASYP 335


>ZFIN|ZDB-GENE-080215-7
            symbol:zgc:174153 "zgc:174153" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-080215-7
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 EMBL:BX000534 EMBL:BX322603
            IPI:IPI00483644 Ensembl:ENSDART00000113654 OMA:ITLCISA Bgee:F1R8Y0
            Uniprot:F1R8Y0
        Length = 336

 Score = 641 (230.7 bits), Expect = 8.8e-63, P = 8.8e-63
 Identities = 133/319 (41%), Positives = 188/319 (58%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEF 69
             +  L +    W S++GK Y    E  +R  I+++N+  IE  N   + GN  +K+ +N+F
Sbjct:    21 DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSYGNHTFKMGMNQF 79

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
              D TN+EF+   NGY+     TS+ G  F   +    P  +DWR+ G VTP+K+Q  CGS
Sbjct:    80 GDMTNEEFRQAMNGYKHDPNQTSQ-GPLFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGS 138

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CW+FS+  A EG     TGKLIS+SEQ LV C     + GC GG M+ AF+++  N G+ 
Sbjct:   139 CWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGLMDQAFQYVKENKGLD 198

Query:   190 TEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSA 248
             +E +YPY A D    + +   +VAKI G+  +P+ +E AL+ AVA   PV+V+IDAS  +
Sbjct:   199 SEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNEPALMNAVAAVGPVSVAIDASHQS 258

Query:   249 FQFYSSGVF-TGDCGTE-LDHGVTAVGYG---ATANGTKYWLVKNSWGTSWGEEGYIRMK 303
              QFY SG++    C +  LDH V  VGYG   A   G +YW+VKNSW   WG++GYI M 
Sbjct:   259 LQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMA 318

Query:   304 RDIDAKEGLCGIAMDSSYP 322
             +D   K   CG+A  +SYP
Sbjct:   319 KD---KNNHCGVATKASYP 334


>UNIPROTKB|A4IFS7
            symbol:CTSL1 "CTSL1 protein" species:9913 "Bos taurus"
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197 GO:GO:0002250
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 GO:GO:0097067
            OrthoDB:EOG48PMKF MEROPS:C01.032 CTD:1514 EMBL:DAAA02023987
            EMBL:BC134741 IPI:IPI00708619 RefSeq:NP_001077155.1
            UniGene:Bt.23199 SMR:A4IFS7 Ensembl:ENSBTAT00000000962
            GeneID:515200 KEGG:bta:515200 InParanoid:A4IFS7 OMA:NDEQALM
            NextBio:20871707 Uniprot:A4IFS7
        Length = 333

 Score = 640 (230.4 bits), Expect = 1.1e-62, P = 1.1e-62
 Identities = 135/321 (42%), Positives = 198/321 (61%)

Query:    15 SLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEFAD 71
             SL  + + W + + K Y   EE  ++  ++K N++ IE  N   + G   + +++N F D
Sbjct:    24 SLDTQWKLWKAAHRKPYDLNEEGWRK-AVWKKNMKMIELHNQEYSQGKHSFSMAMNAFGD 82

Query:    72 QTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVI-DVPATMDWRKNGAVTPIKNQGPCGSC 130
              TN+EF+   NG++R     ++KG  F +E +   +P ++DWR+ G VTP+KNQG CGSC
Sbjct:    83 MTNEEFRHTMNGFQRQK---NKKGKEF-HETIFASIPPSVDWREKGYVTPVKNQGKCGSC 138

Query:   131 WAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITT 190
             WAFSA  A EG     TGKL+SLSEQ LV C     + GC GG +++AF++++   G+ +
Sbjct:   139 WAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQPEGNRGCHGGFIDNAFQYVLDVGGLDS 198

Query:   191 EANYPYQAVDGTC--NKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGS 247
             E +YPY  + GTC  N  N A++     G+  +P   E+AL+KAVAN  P++V++DA   
Sbjct:   199 EESYPYTGLVGTCLYNPNNSAANET---GFVDLP-KQEKALMKAVANLGPISVAVDAHNP 254

Query:   248 AFQFYSSGVF-TGDCGTE-LDHGVTAVGYG---ATANGTKYWLVKNSWGTSWGEEGYIRM 302
             +FQFY SG++   +C +E +DH V  VGYG   A ++  KYWLVKNSWG  WG  GYI+M
Sbjct:   255 SFQFYKSGIYYEPNCSSESVDHAVLVVGYGFEGADSDDNKYWLVKNSWGEHWGMNGYIKM 314

Query:   303 KRDIDAKEGLCGIAMDSSYPT 323
              +D   +   CGIA  +SYPT
Sbjct:   315 AKD---RNNHCGIATMASYPT 332


>UNIPROTKB|Q28944
            symbol:CTSL1 "Cathepsin L1" species:9823 "Sus scrofa"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 KO:K01365 OrthoDB:EOG48PMKF MEROPS:C01.032
            CTD:1514 EMBL:D37917 EMBL:AJ315771 PIR:A58195 RefSeq:NP_999057.1
            UniGene:Ssc.54036 ProteinModelPortal:Q28944 SMR:Q28944
            STRING:Q28944 Ensembl:ENSSSCT00000012233 GeneID:396926
            KEGG:ssc:396926 OMA:DASETGK ArrayExpress:Q28944 Uniprot:Q28944
        Length = 334

 Score = 640 (230.4 bits), Expect = 1.1e-62, P = 1.1e-62
 Identities = 135/324 (41%), Positives = 197/324 (60%)

Query:    10 KLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSI 66
             KL + +L     +W + +G++Y   EE  +R  +++ N++ IE  N   + G   + +++
Sbjct:    20 KLDQ-NLDADWYKWKATHGRLYGMNEEGWRR-AVWEKNMKMIELHNQEYSQGKHGFSMAM 77

Query:    67 NEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGP 126
             N F D TN+EF+   NG++       +KG  F    V++VP ++DWR+ G VT +KNQG 
Sbjct:    78 NAFGDMTNEEFRQVMNGFQNQK---HKKGKVFHESLVLEVPKSVDWREKGYVTAVKNQGQ 134

Query:   127 CGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHND 186
             CGSCWAFSA  A EG     TGKL+SLSEQ LV C     + GC GG M++AF+++  N 
Sbjct:   135 CGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNG 194

Query:   187 GITTEANYPYQAVD-GTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDA 244
             G+ TE +YPY   +  +C    E S  A   G+  +P   E+AL+KAVA   P++V+IDA
Sbjct:   195 GLDTEESYPYLGRETNSCTYKPECS-AANDTGFVDIPQR-EKALMKAVATVGPISVAIDA 252

Query:   245 SGSAFQFYSSGVFTG-DCGT-ELDHGVTAVGYG---ATANGTKYWLVKNSWGTSWGEEGY 299
               S+FQFY SG++   DC + +LDHGV  VGYG     +N +K+W+VKNSWG  WG  GY
Sbjct:   253 GHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGPEWGWNGY 312

Query:   300 IRMKRDIDAKEGLCGIAMDSSYPT 323
             ++M +D   +   CGI+  +SYPT
Sbjct:   313 VKMAKD---QNNHCGISTAASYPT 333


>UNIPROTKB|P07711
            symbol:CTSL1 "Cathepsin L1" species:9606 "Homo sapiens"
            [GO:0005576 "extracellular region" evidence=NAS] [GO:0005764
            "lysosome" evidence=IDA;NAS] [GO:0006508 "proteolysis"
            evidence=IDA] [GO:0002250 "adaptive immune response" evidence=IEP]
            [GO:0019882 "antigen processing and presentation" evidence=TAS]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0036021 "endolysosome lumen" evidence=TAS] [GO:0043202
            "lysosomal lumen" evidence=TAS] [GO:0045087 "innate immune
            response" evidence=TAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042393 "histone binding" evidence=IDA] [GO:0005634 "nucleus"
            evidence=TAS] [GO:0071888 "macrophage apoptotic process"
            evidence=NAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            Reactome:REACT_118779 EMBL:X12451 GO:GO:0005634 Reactome:REACT_6900
            GO:GO:0005576 GO:GO:0019886 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087
            GO:GO:0042393 GO:GO:0004197 GO:GO:0002250 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0036021 KO:K01365 OrthoDB:EOG48PMKF EMBL:M20496
            EMBL:CR457053 EMBL:BX537395 EMBL:AL160279 EMBL:BC012612 EMBL:X05256
            IPI:IPI00012887 PIR:S01002 RefSeq:NP_001244900.1
            RefSeq:NP_001244901.1 RefSeq:NP_001903.1 RefSeq:NP_666023.1
            UniGene:Hs.731507 UniGene:Hs.731952 PDB:1CJL PDB:1CS8 PDB:1ICF
            PDB:1MHW PDB:2NQD PDB:2VHS PDB:2XU1 PDB:2XU3 PDB:2XU4 PDB:2XU5
            PDB:2YJ2 PDB:2YJ8 PDB:2YJ9 PDB:2YJB PDB:2YJC PDB:3BC3 PDB:3H89
            PDB:3H8B PDB:3H8C PDB:3HHA PDB:3HWN PDB:3IV2 PDB:3K24 PDB:3KSE
            PDB:3OF8 PDB:3OF9 PDBsum:1CJL PDBsum:1CS8 PDBsum:1ICF PDBsum:1MHW
            PDBsum:2NQD PDBsum:2VHS PDBsum:2XU1 PDBsum:2XU3 PDBsum:2XU4
            PDBsum:2XU5 PDBsum:2YJ2 PDBsum:2YJ8 PDBsum:2YJ9 PDBsum:2YJB
            PDBsum:2YJC PDBsum:3BC3 PDBsum:3H89 PDBsum:3H8B PDBsum:3H8C
            PDBsum:3HHA PDBsum:3HWN PDBsum:3IV2 PDBsum:3K24 PDBsum:3KSE
            PDBsum:3OF8 PDBsum:3OF9 ProteinModelPortal:P07711 SMR:P07711
            IntAct:P07711 STRING:P07711 MEROPS:I29.001 PhosphoSite:P07711
            DMDM:115741 PaxDb:P07711 PeptideAtlas:P07711 PRIDE:P07711
            DNASU:1514 Ensembl:ENST00000340342 Ensembl:ENST00000343150
            GeneID:1514 KEGG:hsa:1514 UCSC:uc004aph.3 CTD:1514
            GeneCards:GC09P090341 H-InvDB:HIX0058839 H-InvDB:HIX0170314
            HGNC:HGNC:2537 HPA:CAB000459 MIM:116880 neXtProt:NX_P07711
            PharmGKB:PA162382890 InParanoid:P07711 OMA:REPLFAQ PhylomeDB:P07711
            BRENDA:3.4.22.15 BindingDB:P07711 ChEMBL:CHEMBL3837 ChiTaRS:CTSL1
            DrugBank:DB00040 EvolutionaryTrace:P07711 GenomeRNAi:1514
            NextBio:6271 PMAP-CutDB:P07711 ArrayExpress:P07711 Bgee:P07711
            CleanEx:HS_CTSL1 Genevestigator:P07711 GermOnline:ENSG00000135047
            GO:GO:0071888 Uniprot:P07711
        Length = 333

 Score = 636 (228.9 bits), Expect = 3.0e-62, P = 3.0e-62
 Identities = 136/320 (42%), Positives = 194/320 (60%)

Query:    15 SLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAA---GNKPYKLSINEFAD 71
             SL  +  +W + + ++Y   EE  +R  +++ N++ IE  N     G   + +++N F D
Sbjct:    24 SLEAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYREGKHSFTMAMNAFGD 82

Query:    72 QTNQEFKAFRNGY--RRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
              T++EF+   NG+  R+P     RKG  F+     + P ++DWR+ G VTP+KNQG CGS
Sbjct:    83 MTSEEFRQVMNGFQNRKP-----RKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGS 137

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CWAFSA  A EG     TG+LISLSEQ LV C     + GC GG M+ AF+++  N G+ 
Sbjct:   138 CWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLD 197

Query:   190 TEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSA 248
             +E +YPY+A + +C K N    VA   G+  +P   E+AL+KAVA   P++V+IDA   +
Sbjct:   198 SEESYPYEATEESC-KYNPKYSVANDTGFVDIP-KQEKALMKAVATVGPISVAIDAGHES 255

Query:   249 FQFYSSGV-FTGDCGTE-LDHGVTAVGYG--AT-ANGTKYWLVKNSWGTSWGEEGYIRMK 303
             F FY  G+ F  DC +E +DHGV  VGYG  +T ++  KYWLVKNSWG  WG  GY++M 
Sbjct:   256 FLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMA 315

Query:   304 RDIDAKEGLCGIAMDSSYPT 323
             +D   +   CGIA  +SYPT
Sbjct:   316 KD---RRNHCGIASAASYPT 332


>ZFIN|ZDB-GENE-980526-285
            symbol:ctsl1b "cathepsin L, 1 b" species:7955
            "Danio rerio" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005576
            "extracellular region" evidence=IDA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-980526-285 GO:GO:0005576 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:BX465840 IPI:IPI00498443 Ensembl:ENSDART00000145570
            Bgee:F1R7B3 Uniprot:F1R7B3
        Length = 352

 Score = 636 (228.9 bits), Expect = 3.0e-62, P = 3.0e-62
 Identities = 133/319 (41%), Positives = 187/319 (58%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEF 69
             +  L +    W S++GK Y    E  +R  I+++N+  IE  N   + GN  +K+ +N+F
Sbjct:    37 DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSYGNHTFKMGMNQF 95

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
              D TN+EF+   NGY      TS+ G  F   +    P  +DWR+ G VTP+K+Q  CGS
Sbjct:    96 GDMTNEEFRQAMNGYTHDPNQTSQ-GPLFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGS 154

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CW+FS+  A EG     TGKLIS+SEQ LV C     + GC GG M+ AF+++  N G+ 
Sbjct:   155 CWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGLMDQAFQYVKENKGLD 214

Query:   190 TEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSA 248
             +E +YPY A D    + +   +VAKI G+  +P+ +E AL+ AVA   PV+V+IDAS  +
Sbjct:   215 SEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNELALMNAVAAVGPVSVAIDASHQS 274

Query:   249 FQFYSSGVF-TGDCGTE-LDHGVTAVGYG---ATANGTKYWLVKNSWGTSWGEEGYIRMK 303
              QFY SG++    C +  LDH V  VGYG   A   G +YW+VKNSW   WG++GYI M 
Sbjct:   275 LQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMA 334

Query:   304 RDIDAKEGLCGIAMDSSYP 322
             +D   K   CG+A  +SYP
Sbjct:   335 KD---KNNHCGVATKASYP 350


>UNIPROTKB|F1S4J6
            symbol:Ssc.54235 "Cathepsin L1" species:9823 "Sus scrofa"
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197
            GO:GO:0002250 GeneTree:ENSGT00560000076577 GO:GO:0097067
            EMBL:CU571031 RefSeq:XP_003130681.1 Ensembl:ENSSSCT00000011983
            GeneID:100515919 KEGG:ssc:100515919 OMA:IAICATK Uniprot:F1S4J6
        Length = 332

 Score = 635 (228.6 bits), Expect = 3.8e-62, P = 3.8e-62
 Identities = 139/316 (43%), Positives = 186/316 (58%)

Query:    15 SLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEFAD 71
             SL     +W + + K+Y   EE  +R  I++ N++ IE  N     G   + +++N F D
Sbjct:    24 SLDADWYKWKATHRKLYGLNEEGRRR-AIWEKNMKMIERHNWEHRQGKHSFTMAMNAFGD 82

Query:    72 QTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCW 131
              TN+EF+   NG++       +KG  F        P ++DWR+ G VT +KNQG CGSCW
Sbjct:    83 MTNEEFRKTMNGFQNQK---HKKGKVFLDAGSALTPHSVDWREKGYVTAVKNQGHCGSCW 139

Query:   132 AFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTE 191
             AFSA  A EG     T KLISLSEQ LV C     + GC GG M++AF++I  N G+ +E
Sbjct:   140 AFSATGALEGQMFRKTSKLISLSEQNLVDCSWPEGNEGCNGGLMDNAFQYIKDNGGLDSE 199

Query:   192 ANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSAFQ 250
              +YPY   DG+C K    S  A   GY  +P   E+AL+KAVA   P++V IDAS  +FQ
Sbjct:   200 ESYPYFGKDGSC-KYKPQSSAANDTGYVDIP-KQEKALMKAVATVGPISVGIDASHESFQ 257

Query:   251 FYSSGV-FTGDCGTE-LDHGVTAVGYGATA--NGTKYWLVKNSWGTSWGEEGYIRMKRDI 306
             FYS+G+ F   C +E LDHGV  VGYG     +  KYWLVKNSWG +WG +GYI+M +D 
Sbjct:   258 FYSTGIYFEPQCSSEDLDHGVLVVGYGVEGAHSNNKYWLVKNSWGNTWGMDGYIKMTKD- 316

Query:   307 DAKEGLCGIAMDSSYP 322
               +   CGIA  +SYP
Sbjct:   317 --QNNHCGIATMASYP 330


>ZFIN|ZDB-GENE-041010-76
            symbol:ctsll "cathepsin L, like" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-041010-76
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            OrthoDB:EOG48PMKF MEROPS:C01.032 EMBL:BX119902 IPI:IPI00616622
            UniGene:Dr.79994 SMR:A2BEM8 Ensembl:ENSDART00000144226
            InParanoid:A2BEM8 OMA:PRYSAAN Uniprot:A2BEM8
        Length = 337

 Score = 635 (228.6 bits), Expect = 3.8e-62, P = 3.8e-62
 Identities = 137/332 (41%), Positives = 192/332 (57%)

Query:     1 IAASQVTSRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AA 57
             +  S V +    +  L +    W   + K Y   EE  +R  +++ N++ IE  N   + 
Sbjct:    10 LCISAVFAAPTLDQKLDDHWHLWKRWHEKSYHEKEEGWRRM-VWEKNLKKIELHNLEHSV 68

Query:    58 GNKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA 117
             G   ++L +N+F D TN+EF+   NGY R     S KG+ F   +    P  +DWR+ G 
Sbjct:    69 GKHTFRLGMNQFGDMTNEEFRQAMNGYNRDPNRKS-KGSLFIEPSFFTAPQQIDWRQKGY 127

Query:   118 VTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMED 177
             VTPIK+Q  CGSCWAFS+  A EG     TGKL+SLSEQ L+ C     ++GC+GG M+ 
Sbjct:   128 VTPIKDQKRCGSCWAFSSTGALEGQVFRKTGKLVSLSEQNLMDCSRPQGNNGCDGGLMDQ 187

Query:   178 AFKFIIHNDGITTEANYPYQAVDGT-CNKTNEASHVAKIKGYETVPANSEEALLKAVANQ 236
             AF+++  N+G+ +E +YPY A D   C+     S  A + G+  +P+  E AL+KAVA  
Sbjct:   188 AFQYVQDNNGLDSEESYPYLATDDQPCHYDPRYS-AANVTGFVDIPSGKEHALMKAVAAV 246

Query:   237 -PVAVSIDASGSAFQFYSSGVFTGD-CGTE-LDHGVTAVGYG---ATANGTKYWLVKNSW 290
              PVAV+IDA   +FQFY SG++    C TE LDHGV  VGYG       G +YW+VKNSW
Sbjct:   247 GPVAVAIDAGHESFQFYQSGIYYEKACSTEELDHGVLVVGYGYEGVDVAGRRYWIVKNSW 306

Query:   291 GTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
                WG++GYI M +D+   +  CGIA  +SYP
Sbjct:   307 TDRWGDKGYIYMAKDL---KNHCGIATSASYP 335


>UNIPROTKB|Q86GF7
            symbol:Cys "Crustapain" species:6703 "Pandalus borealis"
            [GO:0005576 "extracellular region" evidence=IC] [GO:0007586
            "digestion" evidence=NAS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IDA] [GO:0030163 "protein catabolic process"
            evidence=NAS] [GO:0030574 "collagen catabolic process"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005576
            GO:GO:0007586 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0030163 GO:GO:0030574 EMBL:AB091669
            ProteinModelPortal:Q86GF7 SMR:Q86GF7 MEROPS:C01.030 Uniprot:Q86GF7
        Length = 323

 Score = 632 (227.5 bits), Expect = 7.9e-62, P = 7.9e-62
 Identities = 141/311 (45%), Positives = 183/311 (58%)

Query:    21 EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAA---GNKPYKLSINEFADQTNQEF 77
             E + +K+GK Y N EE+  R  +F D ++FI+  N     G   Y L IN F+D T++E 
Sbjct:    21 ENFKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHEEV 80

Query:    78 KAFRNGYRRPDGLTSRKGTSFKYENVIDVP--ATMDWRKNGAVTPIKNQGPCGSCWAFSA 135
              A + G  R      R   S   ++    P  A +DWR  GAVTP+K+QG CGSCWAFSA
Sbjct:    81 LATKTGMTR-----RRHPLSVLPKSAPTTPMAADVDWRNKGAVTPVKDQGQCGSCWAFSA 135

Query:   136 VAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYP 195
             VAA EG   L TG L+SLSEQ LV C +S  + GC GG    A+++II N GI TE++YP
Sbjct:   136 VAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRGIDTESSYP 195

Query:   196 YQAVDGTCNKTNEASHV-AKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSAFQFYS 253
             Y+A+D  C    +A ++ A +  Y    +  E AL  AV N+ PV+V IDA  S+F  Y 
Sbjct:   196 YKAIDDNCRY--DAGNIGATVSSYVEPASGDESALQHAVQNEGPVSVCIDAGQSSFGSYG 253

Query:   254 SGVF-TGDCGT-ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEG 311
              GV+   +C +   +H VTAVGYG  ANG  YW+VKNSWG  WGE GYI+M R+ D    
Sbjct:   254 GGVYYEPNCDSWYANHAVTAVGYGTDANGGDYWIVKNSWGAWWGESGYIKMARNRDNN-- 311

Query:   312 LCGIAMDSSYP 322
              C IA  S YP
Sbjct:   312 -CAIATYSVYP 321


>UNIPROTKB|P83654
            symbol:P83654 "Ervatamin-C" species:52861 "Tabernaemontana
            divaricata" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0006508 "proteolysis" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005576 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197 PDB:1O0E PDB:2PNS
            PDBsum:1O0E PDBsum:2PNS MEROPS:C01.116 EvolutionaryTrace:P83654
            Uniprot:P83654
        Length = 208

 Score = 629 (226.5 bits), Expect = 1.6e-61, P = 1.6e-61
 Identities = 127/218 (58%), Positives = 150/218 (68%)

Query:   106 VPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSG 165
             +P  +DWRK GAVTP+KNQG CGSCWAFS V+  E I Q+ TG LISLSEQELV CD   
Sbjct:     1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKK- 59

Query:   166 VDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANS 225
              +HGC GG    A+++II+N GI T+ANYPY+AV G C     AS V  I GY  VP  +
Sbjct:    60 -NHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPCQA---ASKVVSIDGYNGVPFCN 115

Query:   226 EEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWL 285
             E AL +AVA QP  V+IDAS + FQ YSSG+F+G CGT+L+HGVT VGY A      YW+
Sbjct:   116 EXALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQAN-----YWI 170

Query:   286 VKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYPT 323
             V+NSWG  WGE+GYIRM R      GLCGIA    YPT
Sbjct:   171 VRNSWGRYWGEKGYIRMLRVGGC--GLCGIARLPYYPT 206


>ZFIN|ZDB-GENE-050626-55
            symbol:ctssb.2 "cathepsin S, b.2" species:7955
            "Danio rerio" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-050626-55
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            KO:K01368 EMBL:BC093339 IPI:IPI00507098 RefSeq:NP_001017661.1
            UniGene:Dr.132688 ProteinModelPortal:Q566T8 SMR:Q566T8
            GeneID:337572 KEGG:dre:337572 CTD:337572 InParanoid:Q566T8
            NextBio:20812306 ArrayExpress:Q566T8 Uniprot:Q566T8
        Length = 330

 Score = 627 (225.8 bits), Expect = 2.7e-61, P = 2.7e-61
 Identities = 134/313 (42%), Positives = 185/313 (59%)

Query:    15 SLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFI--ESLNAA-GNKPYKLSINEFAD 71
             +L +  E W  K+ K+Y   +E+  R  +++ N+E I   +L A+ G   Y L+IN  AD
Sbjct:    22 NLDQHWELWKKKHVKLYSCEDEEVGRRELWERNLELIAIHNLEASMGMHSYDLAINHMAD 81

Query:    72 QTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCW 131
              T +E        R P G   R    +   +   VP T+DWR  G VT +KNQG CGSCW
Sbjct:    82 MTTEEILQTLAVTRVPPGF-KRPTAEYVSSSFAVVPDTLDWRDKGYVTSVKNQGACGSCW 140

Query:   132 AFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTE 191
             AFS+V A EG    TTGKL+ LS Q LV C +   + GC GG M  AF+++I N GI +E
Sbjct:   141 AFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGYMSQAFQYVIDNGGIDSE 200

Query:   192 ANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN-QPVAVSIDASGSAFQ 250
             ++YPYQ   G+C + + +   A    Y+ V    E+AL +A+AN  PV+V+IDA+   F 
Sbjct:   201 SSYPYQGTQGSC-RYDPSQRAANCTSYKFVSQGDEQALKEALANIGPVSVAIDATRPQFI 259

Query:   251 FYSSGVFTG-DCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAK 309
             FY SGV+    C  +++HGV AVGYG T +G  YWLVKNSWG  +G+ GYIR+ R+   K
Sbjct:   260 FYRSGVYDDPSCTQKVNHGVLAVGYG-TLSGQDYWLVKNSWGAGFGDGGYIRIARN---K 315

Query:   310 EGLCGIAMDSSYP 322
               +CGIA ++ YP
Sbjct:   316 NNMCGIASEACYP 328


>MGI|MGI:107341
            symbol:Ctss "cathepsin S" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=ISO;IDA] [GO:0006508 "proteolysis"
            evidence=IDA] [GO:0008152 "metabolic process" evidence=ISO]
            [GO:0008233 "peptidase activity" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0009986 "cell
            surface" evidence=ISO] [GO:0016020 "membrane" evidence=IDA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0045453 "bone
            resorption" evidence=ISO] [GO:0051930 "regulation of sensory
            perception of pain" evidence=ISO] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:107341 GO:GO:0016020 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0008233 GO:GO:0031905 Reactome:REACT_102124
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 BRENDA:3.4.22.27
            ChiTaRS:CTSS EMBL:AF051732 EMBL:AF051727 EMBL:AF051728
            EMBL:AF051729 EMBL:AF051726 EMBL:AF051730 EMBL:AF051731
            EMBL:AF038546 EMBL:AJ002386 EMBL:AC092203 EMBL:Y18466 EMBL:AJ223208
            IPI:IPI00309520 UniGene:Mm.3619 PDB:1M0H PDBsum:1M0H
            ProteinModelPortal:O70370 SMR:O70370 STRING:O70370
            PhosphoSite:O70370 PaxDb:O70370 PRIDE:O70370
            Ensembl:ENSMUST00000116304 BindingDB:O70370 ChEMBL:CHEMBL4098
            NextBio:282932 Bgee:O70370 CleanEx:MM_CTSS Genevestigator:O70370
            GermOnline:ENSMUSG00000038642 Uniprot:O70370
        Length = 340

 Score = 624 (224.7 bits), Expect = 5.5e-61, P = 5.5e-61
 Identities = 142/326 (43%), Positives = 191/326 (58%)

Query:     6 VTSRKLQEASLSEKH-EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKP 61
             V   +LQ     + H + W   + K YK+  E+E R  I++ N++FI   N   + G   
Sbjct:    21 VAMEQLQRDPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHT 80

Query:    62 YKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFK-YENVIDVPATMDWRKNGAVTP 120
             Y++ +N+  D TN+E        R P    S K  +F+ Y N   +P T+DWR+ G VT 
Sbjct:    81 YQVGMNDMGDMTNEEILCRMGALRIPR--QSPKTVTFRSYSNRT-LPDTVDWREKGCVTE 137

Query:   121 IKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGV--DHGCEGGEMEDA 178
             +K QG CG+CWAFSAV A EG  +L TGKLISLS Q LV C       + GC GG M +A
Sbjct:   138 VKYQGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEA 197

Query:   179 FKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-P 237
             F++II N GI  +A+YPY+A D  C+  N  +  A    Y  +P   E+AL +AVA + P
Sbjct:   198 FQYIIDNGGIEADASYPYKATDEKCHY-NSKNRAATCSRYIQLPFGDEDALKEAVATKGP 256

Query:   238 VAVSIDASGSAFQFYSSGVFTG-DCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGE 296
             V+V IDAS S+F FY SGV+    C   ++HGV  VGYG T +G  YWLVKNSWG ++G+
Sbjct:   257 VSVGIDASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYG-TLDGKDYWLVKNSWGLNFGD 315

Query:   297 EGYIRMKRDIDAKEGLCGIAMDSSYP 322
             +GYIRM R+    +  CGIA   SYP
Sbjct:   316 QGYIRMARN---NKNHCGIASYCSYP 338


>UNIPROTKB|F1SS93
            symbol:CTSS "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0002250 "adaptive immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0016020 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0002250
            GeneTree:ENSGT00560000076577 GO:GO:0097067 EMBL:CU463875
            Ensembl:ENSSSCT00000007284 OMA:CEIESAV Uniprot:F1SS93
        Length = 342

 Score = 623 (224.4 bits), Expect = 7.1e-61, P = 7.1e-61
 Identities = 137/328 (41%), Positives = 192/328 (58%)

Query:     1 IAASQVTSRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVE--FIESL-NAA 57
             +  S   ++  ++ +L    + W   YGK YK   E+  R  I++ N++   + +L ++ 
Sbjct:    20 LLCSSAMAQLHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSM 79

Query:    58 GNKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA 117
             G   Y L +N   D T++E  +  +  R P      +  ++K      +P +MDWR+ G 
Sbjct:    80 GMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWP--RNVTYKSNPNQKLPDSMDWREKGC 137

Query:   118 VTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGV-DHGCEGGEME 176
             VT +K QG CGSCWAFSAV A E   ++ TG+L+SLS Q LV C T    + GC GG M 
Sbjct:   138 VTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMT 197

Query:   177 DAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ 236
             +AF++II N+GI +EA+YPY+AVDG C K +  +  A    Y  +P   E AL +AVAN+
Sbjct:   198 EAFQYIIDNNGIDSEASYPYKAVDGKC-KYDSKNRAATCSRYTELPFADEYALKEAVANK 256

Query:   237 -PVAVSIDASGSAFQFYSSGVFTG-DCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSW 294
              PV+V+IDA  S+F FY SGV+    C   ++HGV  VGYG   NG  YWLVKNSWG ++
Sbjct:   257 GPVSVAIDAKHSSFFFYRSGVYYDPSCTQNVNHGVLVVGYG-NLNGKDYWLVKNSWGLNF 315

Query:   295 GEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
             G+ GYIRM R+    E  CGIA   SYP
Sbjct:   316 GDGGYIRMARN---SENHCGIANYPSYP 340


>UNIPROTKB|P25326
            symbol:CTSS "Cathepsin S" species:9913 "Bos taurus"
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0016020 "membrane" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0002250 "adaptive
            immune response" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0016020 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0002250 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            GO:GO:0097067 EMBL:BC102245 EMBL:M95211 EMBL:X62001 IPI:IPI00702008
            PIR:S15844 RefSeq:NP_001028787.1 UniGene:Bt.7938
            ProteinModelPortal:P25326 SMR:P25326 STRING:P25326 PRIDE:P25326
            Ensembl:ENSBTAT00000022774 GeneID:327711 KEGG:bta:327711 CTD:1520
            InParanoid:P25326 KO:K01368 OMA:KAMDQKC OrthoDB:EOG4JM7Q2
            NextBio:20810175 Uniprot:P25326
        Length = 331

 Score = 622 (224.0 bits), Expect = 9.0e-61, P = 9.0e-61
 Identities = 135/317 (42%), Positives = 189/317 (59%)

Query:    12 QEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINE 68
             ++ +L    + W   YGK YK   E+  R  I++ N++ +   N   + G   Y+L +N 
Sbjct:    20 RDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVTLHNLEHSMGMHSYELGMNH 79

Query:    69 FADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCG 128
               D T++E  +  +  R P      +  ++K +    +P +MDWR+ G VT +K QG CG
Sbjct:    80 LGDMTSEEVISLMSSLRVPSQWP--RNVTYKSDPNQKLPDSMDWREKGCVTEVKYQGACG 137

Query:   129 SCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDH-GCEGGEMEDAFKFIIHNDG 187
             SCWAFSAV A E   +L TGKL+SLS Q LV C T+   + GC GG M +AF++II N+G
Sbjct:   138 SCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNNG 197

Query:   188 ITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASG 246
             I +EA+YPY+A+DG C + +  +  A    Y  +P  SEEAL +AVAN+ PV+V IDAS 
Sbjct:   198 IDSEASYPYKAMDGKC-QYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVGIDASH 256

Query:   247 SAFQFYSSGVFTG-DCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRD 305
             S+F  Y +GV+    C   ++HGV  VGYG   +G  YWLVKNSWG  +G++GYIRM R+
Sbjct:   257 SSFFLYKTGVYYDPSCTQNVNHGVLVVGYG-NLDGKDYWLVKNSWGLHFGDQGYIRMARN 315

Query:   306 IDAKEGLCGIAMDSSYP 322
                    CGIA   SYP
Sbjct:   316 ---SGNHCGIANYPSYP 329


>UNIPROTKB|P83443
            symbol:P83443 "Macrodontain-1" species:203992 "Pseudananas
            sagenarius" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0006508 "proteolysis" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197
            ProteinModelPortal:P83443 SMR:P83443 MEROPS:C01.028 Uniprot:P83443
        Length = 213

 Score = 620 (223.3 bits), Expect = 1.5e-60, P = 1.5e-60
 Identities = 110/218 (50%), Positives = 150/218 (68%)

Query:   106 VPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSG 165
             VP ++DWR  GAV  +KNQGPCG CWAF+A+A  EGI ++  G L+ LSEQE++ C    
Sbjct:     2 VPQSIDWRDYGAVNEVKNQGPCGGCWAFAAIATVEGIYKIRKGNLVYLSEQEVLDC---A 58

Query:   166 VDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANS 225
             V +GC+GG +  A+ FII N+G+TT+ NYPY+A  GTCN  N   + A I GY  V  N 
Sbjct:    59 VSYGCKGGWVNRAYDFIISNNGVTTDENYPYRAYQGTCN-ANYFPNSAYITGYSYVRRND 117

Query:   226 EEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWL 285
             E  ++ AV+NQP+A  IDASG  FQ+Y  GV++G CG  L+H +T +GYG  +    YW+
Sbjct:   118 ESHMMYAVSNQPIAALIDASGDNFQYYKGGVYSGPCGFSLNHAITIIGYGRDS----YWI 173

Query:   286 VKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYPT 323
             V+NSWG+SWG+ GY+R++RD+    G+CGIAM   +PT
Sbjct:   174 VRNSWGSSWGQGGYVRIRRDVSHSGGVCGIAMSPLFPT 211


>DICTYBASE|DDB_G0283867
            symbol:cprC "cysteine proteinase 3" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0283867 GenomeReviews:CM000153_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:AAFI02000057
            KO:K01365 EMBL:X03930 RefSeq:XP_638859.1 ProteinModelPortal:Q23894
            SMR:Q23894 MEROPS:C01.114 EnsemblProtists:DDB0220784 GeneID:8624257
            KEGG:ddi:DDB_G0283867 OMA:NNVEHIN Uniprot:Q23894
        Length = 337

 Score = 615 (221.5 bits), Expect = 5.0e-60, P = 5.0e-60
 Identities = 135/330 (40%), Positives = 196/330 (59%)

Query:     1 IAASQVTSRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNK 60
             I+A  V S K  + S  +    WM    K Y + +E   R+  FK N++++ + N+ G+K
Sbjct:    19 ISAGNVFSHKQYQDSFID----WMRSNNKAYTH-KEFMPRYEEFKKNMDYVHNWNSKGSK 73

Query:    61 PYKLSINEFADQTNQEFKAFRNGYRRPDGLTS--RKGTSFKYENV-IDVPATMDWRKNGA 117
                L +N+ AD +N+E++    G R    L    ++    +        P  +DWR+  A
Sbjct:    74 TV-LGLNQHADLSNEEYRLNYLGTRAHIKLNGYHKRNLGLRLNRPQFKQPLNVDWREKDA 132

Query:   118 VTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMED 177
             VTP+K+QG CGSC++FS   + EG+T + TGKL+SLSEQ ++ C +S  + GC GG M +
Sbjct:   133 VTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSFGNEGCNGGLMTN 192

Query:   178 AFKFIIHNDGITTEANYPYQA-VDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ 236
             AF++II N+G+ +E  YPY+  V+  C K  E S  AKI  Y+ + A  E  L  A+   
Sbjct:   193 AFEYIIKNNGLNSEEQYPYEMKVNDEC-KFQEGSVAAKITSYKEIEAGDENDLQNALLLN 251

Query:   237 PVAVSIDASGSAFQFYSSGVFTGD-CGTE-LDHGVTAVGYGATANGTKYWLVKNSWGTSW 294
             PV+V+IDAS ++FQ Y++GV+    C +E LDHGV AVG G T NG  Y++VKNSWG SW
Sbjct:   252 PVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHGVLAVGMG-TDNGEDYYIVKNSWGPSW 310

Query:   295 GEEGYIRMKRDIDAKEGLCGIAMDSSYPTA 324
             G  GYI M R+   K+  CGI+  +SYP A
Sbjct:   311 GLNGYIHMARN---KDNNCGISTMASYPIA 337


>DICTYBASE|DDB_G0272298
            symbol:DDB_G0272298 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0272298 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246
            SMART:SM00848 EMBL:AAFI02000008 KO:K01365 RefSeq:XP_645281.1
            ProteinModelPortal:Q559Q3 MEROPS:C01.A53 EnsemblProtists:DDB0203746
            GeneID:8618447 KEGG:ddi:DDB_G0272298 InParanoid:Q559Q3 OMA:PANINWR
            Uniprot:Q559Q3
        Length = 305

 Score = 615 (221.5 bits), Expect = 5.0e-60, P = 5.0e-60
 Identities = 131/306 (42%), Positives = 176/306 (57%)

Query:    24 MSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEF--KAFR 81
             M KY K YKN +E  KRF IF+DN  FI +      +  ++ +NE++D T +EF  K F 
Sbjct:     1 MVKYNKHYKNNKEYLKRFDIFQDNYNFILNHRNKNGENIEMDLNEYSDLTQKEFADKFFE 60

Query:    82 NGYRRP-DG-LTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAAT 139
                  P  G +   K T FK+     +P + DWR +GAV  +KNQG C SCW+FSA+ A 
Sbjct:    61 KLVPEPRSGPINDIKATPFKHNVNATIPKSFDWRDHGAVGKVKNQGSCASCWSFSALGAL 120

Query:   140 EGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAV 199
             EG   +  G+L+ LSEQ LV C T     GC+ G M DAFK+II + G+  E+ YPY   
Sbjct:   121 EGHYYIKYGELLDLSEQNLVDCATPFGPKGCKTGWMHDAFKYIISSGGVNLESQYPYTGK 180

Query:   200 DGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSAFQFYSSGVFT 258
             D  C K N++   AK+ G+  +P   E AL++A+A   PVAV ID S   FQ  S G++ 
Sbjct:   181 DEVC-KFNQSEKEAKVSGFVMIPKFDESALMEAIALYGPVAVPIDTSTKEFQHLSGGIYY 239

Query:   259 GD-CGT-ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIA 316
              D C      H V A+GYG   NG  Y+L+KNSWG SWG  G+ ++KR +   +G CGI 
Sbjct:   240 SDSCDPWNTIHAVLAIGYGTDENGVDYFLMKNSWGKSWGTNGFFKVKRGV---KGKCGIV 296

Query:   317 MDSSYP 322
               +SYP
Sbjct:   297 TAASYP 302


>WB|WBGene00000776
            symbol:cpl-1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0040010 "positive regulation
            of growth rate" evidence=IMP] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] [GO:0040011
            "locomotion" evidence=IMP] [GO:0070265 "necrotic cell death"
            evidence=IMP] [GO:0031983 "vesicle lumen" evidence=IDA] [GO:0042718
            "yolk granule" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0009792 GO:GO:0040010 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0040011
            GO:GO:0031983 GO:GO:0070265 GeneTree:ENSGT00660000095458 KO:K01365
            GO:GO:0042718 MEROPS:I29.009 EMBL:Z92812 GeneID:180111
            KEGG:cel:CELE_T03E6.7 CTD:180111 PIR:T24387 RefSeq:NP_001256718.1
            HSSP:P80067 ProteinModelPortal:O45734 SMR:O45734 DIP:DIP-26616N
            IntAct:O45734 MINT:MINT-211563 STRING:O45734 PaxDb:O45734
            EnsemblMetazoa:T03E6.7.1 EnsemblMetazoa:T03E6.7.2 UCSC:T03E6.7.1
            WormBase:T03E6.7a InParanoid:O45734 OMA:HIENHNR NextBio:908128
            Uniprot:O45734
        Length = 337

 Score = 612 (220.5 bits), Expect = 1.0e-59, P = 1.0e-59
 Identities = 139/330 (42%), Positives = 184/330 (55%)

Query:     2 AASQVTSRKL--QEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNA--- 56
             A   V S KL  Q  S  EK + +   + K Y   EE +     F  N+  IE+ N    
Sbjct:    12 AVVAVNSAKLSRQIESAIEKWDDYKEDFDKEYSESEE-QTYMEAFVKNMIHIENHNRDHR 70

Query:    57 AGNKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTS-FKYENVIDVPATMDWRKN 115
              G K +++ +N  AD    +++   NGYRR  G +  K +S F     + VP  +DWR  
Sbjct:    71 LGRKTFEMGLNHIADLPFSQYRKL-NGYRRLFGDSRIKNSSSFLAPFNVQVPDEVDWRDT 129

Query:   116 GAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEM 175
               VT +KNQG CGSCWAFSA  A EG      G+L+SLSEQ LV C T   +HGC GG M
Sbjct:   130 HLVTDVKNQGMCGSCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLM 189

Query:   176 EDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN 235
             + AF++I  N G+ TE +YPY+  D  C+  N+ +  A  KGY   P   EE L  AVA 
Sbjct:   190 DQAFEYIRDNHGVDTEESYPYKGRDMKCH-FNKKTVGADDKGYVDTPEGDEEQLKIAVAT 248

Query:   236 Q-PVAVSIDASGSAFQFYSSGVFTGD-CGTE-LDHGVTAVGYGATANGTKYWLVKNSWGT 292
             Q P++++IDA   +FQ Y  GV+  + C +E LDHGV  VGYG       YW+VKNSWG 
Sbjct:   249 QGPISIAIDAGHRSFQLYKKGVYYDEECSSEELDHGVLLVGYGTDPEHGDYWIVKNSWGA 308

Query:   293 SWGEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
              WGE+GYIR+ R+   +   CG+A  +SYP
Sbjct:   309 GWGEKGYIRIARN---RNNHCGVATKASYP 335


>UNIPROTKB|F1PAK0 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9615 "Canis lupus
            familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:AAEX03011051 Ensembl:ENSCAFT00000019176 OMA:YEPACTQ
            Uniprot:F1PAK0
        Length = 339

 Score = 609 (219.4 bits), Expect = 2.2e-59, P = 2.2e-59
 Identities = 132/317 (41%), Positives = 184/317 (58%)

Query:    12 QEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINE 68
             ++ +L      W   Y K YK   E+  R  I++ N++F+   N   + G   Y L +N 
Sbjct:    28 KDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNH 87

Query:    69 FADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCG 128
               D T +E  +     R P     ++  +++  +   +P ++DWR+ G VT +K QG CG
Sbjct:    88 LGDMTGEEVISLMGSLRVPSQW--QRNVTYRSNSNQKLPDSVDWREKGCVTEVKYQGSCG 145

Query:   129 SCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDH-GCEGGEMEDAFKFIIHNDG 187
             +CWAFSAV A E   +L TGKL+SLS Q LV C T    + GC GG M  AF++II N+G
Sbjct:   146 ACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNG 205

Query:   188 ITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASG 246
             I +EA+YPY+AV+G C + +     A    Y  +P  SE+AL +AVAN+ PV+V+IDAS 
Sbjct:   206 IDSEASYPYKAVNGKC-RYDSKKRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDASH 264

Query:   247 SAFQFYSSGVF-TGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRD 305
              +F  Y SGV+    C   ++HGV  VGYG   NG  YWLVKNSWG ++G++GYIRM R+
Sbjct:   265 YSFFLYRSGVYYEPSCTQNVNHGVLVVGYG-NLNGKDYWLVKNSWGLNFGDQGYIRMARN 323

Query:   306 IDAKEGLCGIAMDSSYP 322
                    CGIA   SYP
Sbjct:   324 ---SGNHCGIASYPSYP 337


>UNIPROTKB|F1PMM9 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9615 "Canis lupus
            familiaris" [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00639 GO:GO:0004197 GO:GO:0002250
            GeneTree:ENSGT00560000076577 GO:GO:0097067 EMBL:AAEX03000499
            Ensembl:ENSCAFT00000002029 OMA:EFKQVLN Uniprot:F1PMM9
        Length = 341

 Score = 609 (219.4 bits), Expect = 2.2e-59, P = 2.2e-59
 Identities = 130/320 (40%), Positives = 187/320 (58%)

Query:    12 QEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINE 68
             Q+ SL     QW   +GK+Y   EE  +R  +++ N+E IE  N   + G   + L++N 
Sbjct:    29 QDHSLDAHWSQWKEAHGKLYDKDEEGWRR-TVWERNMEMIEQHNQEYSQGEHSFTLAMNA 87

Query:    69 FADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCG 128
             F D TN+EFK   N ++       +KG  F      +VP+++DWR+ G VTP+K+QG C 
Sbjct:    88 FGDMTNEEFKQVLNDFKIQK---HKKGKVFPAPLFAEVPSSVDWREQGYVTPVKDQGQCL 144

Query:   129 SCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGI 188
              CWAFSA  A EG     TGKL+SLSEQ LV C  S  + GC GG ME AF+++  N G+
Sbjct:   145 GCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWSQGNRGCNGGLMEYAFQYVKDNGGL 204

Query:   189 TTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGS 247
              +E +YPY A +  C    E S  A +  +  +  N E+ L+  VA   PV+ ++D+S  
Sbjct:   205 DSEESYPYLARNEPCKYRPEKS-AANVTAFWPI-LNEEDGLMTTVATVGPVSAAVDSSPQ 262

Query:   248 AFQFYSSGVFTGD-CGTEL-DHGVTAVGYG---ATANGTKYWLVKNSWGTSWGEEGYIRM 302
             +FQFY  G++    C  +L +HGV  VGYG   A ++  KYW+VKNSWGT+WG +GY+ +
Sbjct:   263 SFQFYKKGIYYDPKCSNKLLNHGVLVVGYGFEGAESDNKKYWIVKNSWGTNWGMQGYMLL 322

Query:   303 KRDIDAKEGLCGIAMDSSYP 322
              +D   ++  CGIA  +SYP
Sbjct:   323 AKD---RDNHCGIATRASYP 339


>UNIPROTKB|Q8HY81 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            CTD:1520 KO:K01368 OrthoDB:EOG4JM7Q2 EMBL:AY156692
            RefSeq:NP_001002938.2 UniGene:Cfa.1661 ProteinModelPortal:Q8HY81
            SMR:Q8HY81 STRING:Q8HY81 MEROPS:C01.034 GeneID:403400
            KEGG:cfa:403400 InParanoid:Q8HY81 NextBio:20816922 Uniprot:Q8HY81
        Length = 331

 Score = 606 (218.4 bits), Expect = 4.5e-59, P = 4.5e-59
 Identities = 131/317 (41%), Positives = 184/317 (58%)

Query:    12 QEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINE 68
             ++ +L      W   Y K YK   E+  R  I++ N++F+   N   + G   Y L +N 
Sbjct:    20 KDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNH 79

Query:    69 FADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCG 128
               D T +E  +     R P     ++  +++  +   +P ++DWR+ G VT +K QG CG
Sbjct:    80 LGDMTGEEVISLMGSLRVPSQW--QRNVTYRSNSNQKLPDSVDWREKGCVTEVKYQGSCG 137

Query:   129 SCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDH-GCEGGEMEDAFKFIIHNDG 187
             +CWAFSAV A E   +L TGKL+SLS Q LV C T    + GC GG M  AF++II N+G
Sbjct:   138 ACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNG 197

Query:   188 ITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASG 246
             I +EA+YPY+A++G C + +     A    Y  +P  SE+AL +AVAN+ PV+V+IDAS 
Sbjct:   198 IDSEASYPYKAMNGKC-RYDSKKRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDASH 256

Query:   247 SAFQFYSSGVF-TGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRD 305
              +F  Y SGV+    C   ++HGV  VGYG   NG  YWLVKNSWG ++G++GYIRM R+
Sbjct:   257 YSFFLYRSGVYYEPSCTQNVNHGVLVVGYG-NLNGKDYWLVKNSWGLNFGDQGYIRMARN 315

Query:   306 IDAKEGLCGIAMDSSYP 322
                    CGIA   SYP
Sbjct:   316 ---SGNHCGIASYPSYP 329


>RGD|61810 [details] [associations]
            symbol:Ctsk "cathepsin K" species:10116 "Rattus norvegicus"
            [GO:0001957 "intramembranous ossification" evidence=IEP] [GO:0005615
            "extracellular space" evidence=IDA] [GO:0005737 "cytoplasm"
            evidence=IDA] [GO:0005764 "lysosome" evidence=IDA] [GO:0006508
            "proteolysis" evidence=TAS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=TAS] [GO:0045453 "bone resorption" evidence=IMP]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            RGD:61810 GO:GO:0005615 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0045453 GO:GO:0001957 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:AF010306 EMBL:BC078793
            IPI:IPI00206378 RefSeq:NP_113748.1 UniGene:Rn.5598
            ProteinModelPortal:O35186 SMR:O35186 STRING:O35186
            PhosphoSite:O35186 PRIDE:O35186 Ensembl:ENSRNOT00000028730
            GeneID:29175 KEGG:rno:29175 UCSC:RGD:61810 InParanoid:O35186
            OMA:YKEIPEG BindingDB:O35186 ChEMBL:CHEMBL3034 NextBio:608248
            Genevestigator:O35186 GermOnline:ENSRNOG00000021155 Uniprot:O35186
        Length = 329

 Score = 604 (217.7 bits), Expect = 7.3e-59, P = 7.3e-59
 Identities = 131/316 (41%), Positives = 188/316 (59%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIE--SLNAA-GNKPYKLSINEF 69
             E +L  + E W   +GK Y +  ++  R  I++ N++ I   +L A+ G   Y+L++N  
Sbjct:    19 EETLDTQWELWKKTHGKQYNSKVDEISRRLIWEKNLKKISVHNLEASLGAHTYELAMNHL 78

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
              D T++E      G R P   +    T +  E    VP ++D+RK G VTP+KNQG CGS
Sbjct:    79 GDMTSEEVVQKMTGLRVPPSRSFSNDTLYTPEWEGRVPDSIDYRKKGYVTPVKNQGQCGS 138

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CWAFS+  A EG  +  TGKL++LS Q LV C +   ++GC GG M  AF+++  N GI 
Sbjct:   139 CWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVSE--NYGCGGGYMTTAFQYVQQNGGID 196

Query:   190 TEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSA 248
             +E  YPY   D +C   N  +  AK +GY  +P  +E+AL +AVA   PV+VSIDAS ++
Sbjct:   197 SEDAYPYVGQDESC-MYNATAKAAKCRGYREIPVGNEKALKRAVARVGPVSVSIDASLTS 255

Query:   249 FQFYSSGVFTGD-CGTE-LDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDI 306
             FQFYS GV+  + C  + ++H V  VGYG T  G KYW++KNSWG SWG +GY+ + R+ 
Sbjct:   256 FQFYSRGVYYDENCDRDNVNHAVLVVGYG-TQKGNKYWIIKNSWGESWGNKGYVLLARN- 313

Query:   307 DAKEGLCGIAMDSSYP 322
               K   CGI   +S+P
Sbjct:   314 --KNNACGITNLASFP 327


>UNIPROTKB|P25774 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0016020 "membrane"
            evidence=IEA] [GO:0005576 "extracellular region" evidence=NAS]
            [GO:0005764 "lysosome" evidence=IDA;NAS] [GO:0097067 "cellular
            response to thyroid hormone stimulus" evidence=IEP] [GO:0002250
            "adaptive immune response" evidence=IEP] [GO:0019882 "antigen
            processing and presentation" evidence=TAS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=TAS] [GO:0006955
            "immune response" evidence=TAS] [GO:0002474 "antigen processing and
            presentation of peptide antigen via MHC class I" evidence=TAS]
            [GO:0002480 "antigen processing and presentation of exogenous
            peptide antigen via MHC class I, TAP-independent" evidence=TAS]
            [GO:0019886 "antigen processing and presentation of exogenous
            peptide antigen via MHC class II" evidence=TAS] [GO:0036021
            "endolysosome lumen" evidence=TAS] [GO:0042590 "antigen processing
            and presentation of exogenous peptide antigen via MHC class I"
            evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
            [GO:0045087 "innate immune response" evidence=TAS] [GO:0043231
            "intracellular membrane-bounded organelle" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 Reactome:REACT_118779
            Reactome:REACT_6900 GO:GO:0005576 GO:GO:0002480 GO:GO:0016020
            GO:GO:0019886 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087 EMBL:CH471121
            GO:GO:0004197 GO:GO:0002250 HOVERGEN:HBG011513 GO:GO:0097067
            GO:GO:0036021 EMBL:AL356292 CTD:1520 KO:K01368 OMA:KAMDQKC
            OrthoDB:EOG4JM7Q2 EMBL:S93414 EMBL:M86553 EMBL:M90696 EMBL:U07374
            EMBL:U07370 EMBL:U07371 EMBL:U07372 EMBL:U07373 EMBL:CR541676
            EMBL:AK301472 EMBL:AK314482 EMBL:BC002642 IPI:IPI00299150
            IPI:IPI00910216 PIR:A42482 RefSeq:NP_001186668.1 RefSeq:NP_004070.3
            UniGene:Hs.181301 PDB:1BXF PDB:1GLO PDB:1MS6 PDB:1NPZ PDB:1NQC
            PDB:2C0Y PDB:2F1G PDB:2FQ9 PDB:2FRA PDB:2FRQ PDB:2FT2 PDB:2FUD
            PDB:2FYE PDB:2G6D PDB:2G7Y PDB:2H7J PDB:2HH5 PDB:2HHN PDB:2HXZ
            PDB:2OP3 PDB:2R9M PDB:2R9N PDB:2R9O PDB:3IEJ PDB:3KWN PDB:3MPE
            PDB:3MPF PDB:3N3G PDB:3N4C PDB:3OVX PDBsum:1BXF PDBsum:1GLO
            PDBsum:1MS6 PDBsum:1NPZ PDBsum:1NQC PDBsum:2C0Y PDBsum:2F1G
            PDBsum:2FQ9 PDBsum:2FRA PDBsum:2FRQ PDBsum:2FT2 PDBsum:2FUD
            PDBsum:2FYE PDBsum:2G6D PDBsum:2G7Y PDBsum:2H7J PDBsum:2HH5
            PDBsum:2HHN PDBsum:2HXZ PDBsum:2OP3 PDBsum:2R9M PDBsum:2R9N
            PDBsum:2R9O PDBsum:3IEJ PDBsum:3KWN PDBsum:3MPE PDBsum:3MPF
            PDBsum:3N3G PDBsum:3N4C PDBsum:3OVX ProteinModelPortal:P25774
            SMR:P25774 IntAct:P25774 STRING:P25774 MEROPS:I29.004
            PhosphoSite:P25774 DMDM:88984046 PaxDb:P25774 PeptideAtlas:P25774
            PRIDE:P25774 DNASU:1520 Ensembl:ENST00000368985
            Ensembl:ENST00000448301 GeneID:1520 KEGG:hsa:1520 UCSC:uc001evn.3
            GeneCards:GC01M150702 HGNC:HGNC:2545 HPA:CAB000460 HPA:HPA002988
            MIM:116845 neXtProt:NX_P25774 PharmGKB:PA27041 InParanoid:P25774
            PhylomeDB:P25774 BRENDA:3.4.22.27 BindingDB:P25774
            ChEMBL:CHEMBL2954 ChiTaRS:CTSS EvolutionaryTrace:P25774
            GenomeRNAi:1520 NextBio:6291 PMAP-CutDB:P25774 ArrayExpress:P25774
            Bgee:P25774 CleanEx:HS_CTSS Genevestigator:P25774
            GermOnline:ENSG00000163131 Uniprot:P25774
        Length = 331

 Score = 601 (216.6 bits), Expect = 1.5e-58, P = 1.5e-58
 Identities = 132/328 (40%), Positives = 186/328 (56%)

Query:     1 IAASQVTSRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AA 57
             +  S   ++  ++ +L      W   YGK YK   E+  R  I++ N++F+   N   + 
Sbjct:     9 LVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSM 68

Query:    58 GNKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA 117
             G   Y L +N   D T++E  +  +  R P     ++  ++K      +P ++DWR+ G 
Sbjct:    69 GMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQW--QRNITYKSNPNRILPDSVDWREKGC 126

Query:   118 VTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDH-GCEGGEME 176
             VT +K QG CG+CWAFSAV A E   +L TGKL+SLS Q LV C T    + GC GG M 
Sbjct:   127 VTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMT 186

Query:   177 DAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ 236
              AF++II N GI ++A+YPY+A+D  C   ++    A    Y  +P   E+ L +AVAN+
Sbjct:   187 TAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKY-RAATCSKYTELPYGREDVLKEAVANK 245

Query:   237 -PVAVSIDASGSAFQFYSSGVF-TGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSW 294
              PV+V +DA   +F  Y SGV+    C   ++HGV  VGYG   NG +YWLVKNSWG ++
Sbjct:   246 GPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG-DLNGKEYWLVKNSWGHNF 304

Query:   295 GEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
             GEEGYIRM R+   K   CGIA   SYP
Sbjct:   305 GEEGYIRMARN---KGNHCGIASFPSYP 329


>ZFIN|ZDB-GENE-001205-4 [details] [associations]
            symbol:ctsk "cathepsin K" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-001205-4 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            CTD:1513 KO:K01371 OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55
            EMBL:BC092901 IPI:IPI00512751 RefSeq:NP_001017778.1
            UniGene:Dr.76224 ProteinModelPortal:Q568D6 SMR:Q568D6 GeneID:550475
            KEGG:dre:550475 InParanoid:Q568D6 NextBio:20879718
            ArrayExpress:Q568D6 Uniprot:Q568D6
        Length = 333

 Score = 598 (215.6 bits), Expect = 3.2e-58, P = 3.2e-58
 Identities = 133/321 (41%), Positives = 180/321 (56%)

Query:     8 SRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAA---GNKPYKL 64
             +  L   SL E  E W   + + Y    E+  R  I++ N+ FIE+ N     G   Y L
Sbjct:    18 AHSLDNLSLDEAWESWKITHKREYNGLNEESIRRTIWEKNMLFIEAHNKEYELGIHTYDL 77

Query:    65 SINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQ 124
              +N F D T +E      G + P        T    + V  +P ++D+RK G VT +KNQ
Sbjct:    78 GMNHFGDMTLEEVAEKVMGLQMPM-YRDPANTFVPDDRVGKLPKSIDYRKLGYVTSVKNQ 136

Query:   125 GPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIH 184
             G CGSCWAFS+V A EG    T G+L+ LS Q LV C T   + GC GG M +AF+++ +
Sbjct:   137 GSCGSCWAFSSVGALEGQLMKTKGQLVDLSPQNLVDCVTE--NDGCGGGYMTNAFRYVSN 194

Query:   185 NDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSID 243
             N GI +E +YPY   D  C   N +   A  +GY+ +P  +E AL  AVAN  PV+V ID
Sbjct:   195 NQGIDSEESYPYVGTDQQC-AYNTSGVAASCRGYKEIPQGNERALTAAVANVGPVSVGID 253

Query:   244 ASGSAFQFYSSGVFTG-DCGTE-LDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIR 301
             A  S F +Y SGV+   +C  E ++H V AVGYGAT  G KYW+VKNSWG  WG++GY+ 
Sbjct:   254 AMQSTFLYYKSGVYYDPNCNKEDVNHAVLAVGYGATPRGKKYWIVKNSWGEEWGKKGYVL 313

Query:   302 MKRDIDAKEGLCGIAMDSSYP 322
             M R+   +   CGIA  +S+P
Sbjct:   314 MARN---RNNACGIANLASFP 331


>MGI|MGI:107823 [details] [associations]
            symbol:Ctsk "cathepsin K" species:10090 "Mus musculus"
            [GO:0005615 "extracellular space" evidence=ISO] [GO:0005737
            "cytoplasm" evidence=ISO] [GO:0005764 "lysosome" evidence=ISO]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0045453 "bone resorption" evidence=ISO]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:107823 GO:GO:0005615 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0045453
            GO:GO:0001957 HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 OMA:LKVPPSH EMBL:X94444
            EMBL:AJ006033 EMBL:BC046320 IPI:IPI00316575 PIR:S74227
            RefSeq:NP_031828.2 UniGene:Mm.272085 ProteinModelPortal:P55097
            SMR:P55097 MINT:MINT-3089515 STRING:P55097 PhosphoSite:P55097
            PRIDE:P55097 Ensembl:ENSMUST00000015664 GeneID:13038 KEGG:mmu:13038
            InParanoid:P55097 BioCyc:MetaCyc:MONOMER-14811 ChEMBL:CHEMBL1075277
            NextBio:282924 Bgee:P55097 CleanEx:MM_CTSK Genevestigator:P55097
            GermOnline:ENSMUSG00000028111 Uniprot:P55097
        Length = 329

 Score = 590 (212.7 bits), Expect = 2.2e-57, P = 2.2e-57
 Identities = 128/316 (40%), Positives = 185/316 (58%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEF 69
             E  L  + E W   + K Y +  ++  R  I++ N++ I + N   + G   Y+L++N  
Sbjct:    19 EEMLDTQWELWKKTHQKQYNSKVDEISRRLIWEKNLKQISAHNLEASLGVHTYELAMNHL 78

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
              D T++E      G R P   +    T +  E    VP ++D+RK G VTP+KNQG CGS
Sbjct:    79 GDMTSEEVVQKMTGLRIPPSRSYSNDTLYTPEWEGRVPDSIDYRKKGYVTPVKNQGQCGS 138

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CWAFS+  A EG  +  TGKL++LS Q LV C T   ++GC GG M  AF+++  N GI 
Sbjct:   139 CWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVTE--NYGCGGGYMTTAFQYVQQNGGID 196

Query:   190 TEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSA 248
             +E  YPY   D +C   N  +  AK +GY  +P  +E+AL +AVA   P++VSIDAS ++
Sbjct:   197 SEDAYPYVGQDESC-MYNATAKAAKCRGYREIPVGNEKALKRAVARVGPISVSIDASLAS 255

Query:   249 FQFYSSGVFTGD-CGTE-LDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDI 306
             FQFYS GV+  + C  + ++H V  VGYG T  G+K+W++KNSWG SWG +GY  + R+ 
Sbjct:   256 FQFYSRGVYYDENCDRDNVNHAVLVVGYG-TQKGSKHWIIKNSWGESWGNKGYALLARN- 313

Query:   307 DAKEGLCGIAMDSSYP 322
               K   CGI   +S+P
Sbjct:   314 --KNNACGITNMASFP 327


>UNIPROTKB|P43235 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0001957
            "intramembranous ossification" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0045453 "bone resorption"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=TAS] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=TAS] [GO:0036021 "endolysosome lumen" evidence=TAS]
            [GO:0045087 "innate immune response" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            Reactome:REACT_118779 Reactome:REACT_6900 GO:GO:0005615
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087 GO:GO:0045453
            EMBL:CH471121 EMBL:AL355860 GO:GO:0004197 GO:GO:0001957
            HOVERGEN:HBG011513 GO:GO:0036021 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:U13665 EMBL:X82153
            EMBL:U20280 EMBL:S79895 EMBL:CR541675 EMBL:AL356292 EMBL:BC016058
            IPI:IPI00300599 PIR:JC2476 RefSeq:NP_000387.1 UniGene:Hs.632466
            PDB:1ATK PDB:1AU0 PDB:1AU2 PDB:1AU3 PDB:1AU4 PDB:1AYU PDB:1AYV
            PDB:1AYW PDB:1BGO PDB:1BY8 PDB:1MEM PDB:1NL6 PDB:1NLJ PDB:1Q6K
            PDB:1SNK PDB:1TU6 PDB:1U9V PDB:1U9W PDB:1U9X PDB:1VSN PDB:1YK7
            PDB:1YK8 PDB:1YT7 PDB:2ATO PDB:2AUX PDB:2AUZ PDB:2BDL PDB:2R6N
            PDB:3C9E PDB:3H7D PDB:3KW9 PDB:3KWB PDB:3KWZ PDB:3KX1 PDB:3O0U
            PDB:3O1G PDB:3OVZ PDB:4DMX PDB:4DMY PDB:7PCK PDBsum:1ATK
            PDBsum:1AU0 PDBsum:1AU2 PDBsum:1AU3 PDBsum:1AU4 PDBsum:1AYU
            PDBsum:1AYV PDBsum:1AYW PDBsum:1BGO PDBsum:1BY8 PDBsum:1MEM
            PDBsum:1NL6 PDBsum:1NLJ PDBsum:1Q6K PDBsum:1SNK PDBsum:1TU6
            PDBsum:1U9V PDBsum:1U9W PDBsum:1U9X PDBsum:1VSN PDBsum:1YK7
            PDBsum:1YK8 PDBsum:1YT7 PDBsum:2ATO PDBsum:2AUX PDBsum:2AUZ
            PDBsum:2BDL PDBsum:2R6N PDBsum:3C9E PDBsum:3H7D PDBsum:3KW9
            PDBsum:3KWB PDBsum:3KWZ PDBsum:3KX1 PDBsum:3O0U PDBsum:3O1G
            PDBsum:3OVZ PDBsum:4DMX PDBsum:4DMY PDBsum:7PCK
            ProteinModelPortal:P43235 SMR:P43235 DIP:DIP-39993N IntAct:P43235
            STRING:P43235 PhosphoSite:P43235 DMDM:1168793 PaxDb:P43235
            PRIDE:P43235 DNASU:1513 Ensembl:ENST00000271651 GeneID:1513
            KEGG:hsa:1513 UCSC:uc001evp.2 GeneCards:GC01M150768 HGNC:HGNC:2536
            MIM:265800 MIM:601105 neXtProt:NX_P43235 Orphanet:763
            PharmGKB:PA27034 InParanoid:P43235 OMA:LKVPPSH PhylomeDB:P43235
            BindingDB:P43235 ChEMBL:CHEMBL268 EvolutionaryTrace:P43235
            GenomeRNAi:1513 NextBio:6267 ArrayExpress:P43235 Bgee:P43235
            CleanEx:HS_CTSK CleanEx:HS_CTSO Genevestigator:P43235
            GermOnline:ENSG00000143387 Uniprot:P43235
        Length = 329

 Score = 589 (212.4 bits), Expect = 2.8e-57, P = 2.8e-57
 Identities = 132/324 (40%), Positives = 191/324 (58%)

Query:     6 VTSRKLQEASLSEKH-EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIE--SLNAA-GNKP 61
             V S  L    + + H E W   + K Y N  ++  R  I++ N+++I   +L A+ G   
Sbjct:    11 VVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHT 70

Query:    62 YKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPI 121
             Y+L++N   D T++E      G + P   +    T +  E     P ++D+RK G VTP+
Sbjct:    71 YELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTPV 130

Query:   122 KNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKF 181
             KNQG CGSCWAFS+V A EG  +  TGKL++LS Q LV C +   + GC GG M +AF++
Sbjct:   131 KNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDGCGGGYMTNAFQY 188

Query:   182 IIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAV 240
             +  N GI +E  YPY   + +C   N     AK +GY  +P  +E+AL +AVA   PV+V
Sbjct:   189 VQKNRGIDSEDAYPYVGQEESC-MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSV 247

Query:   241 SIDASGSAFQFYSSGVFTGD-CGTE-LDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEG 298
             +IDAS ++FQFYS GV+  + C ++ L+H V AVGYG    G K+W++KNSWG +WG +G
Sbjct:   248 AIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQ-KGNKHWIIKNSWGENWGNKG 306

Query:   299 YIRMKRDIDAKEGLCGIAMDSSYP 322
             YI M R+   K   CGIA  +S+P
Sbjct:   307 YILMARN---KNNACGIANLASFP 327


>UNIPROTKB|Q9GLE3 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9823 "Sus scrofa"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0005576 "extracellular
            region" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005576 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 MEROPS:I29.007
            CTD:1513 KO:K01371 OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55
            OMA:LKVPPSH EMBL:AF292030 RefSeq:NP_999467.1 UniGene:Ssc.1020
            ProteinModelPortal:Q9GLE3 SMR:Q9GLE3 STRING:Q9GLE3
            Ensembl:ENSSSCT00000007283 GeneID:397569 KEGG:ssc:397569
            ArrayExpress:Q9GLE3 Uniprot:Q9GLE3
        Length = 330

 Score = 587 (211.7 bits), Expect = 4.6e-57, P = 4.6e-57
 Identities = 131/323 (40%), Positives = 189/323 (58%)

Query:     6 VTSRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIE--SLNAA-GNKPY 62
             ++S    E  L  + E W   Y K Y +  ++  R  I++ N++ I   +L A+ G   Y
Sbjct:    13 MSSALYPEEILDTQWELWKKTYRKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTY 72

Query:    63 KLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIK 122
             +L++N   D T++E      G + P   +    T +  +     P ++D+RK G VTP+K
Sbjct:    73 ELAMNHLGDMTSEEVVQKMTGLKVPPSHSRSNDTLYIPDWEGRTPDSIDYRKKGYVTPVK 132

Query:   123 NQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFI 182
             NQG CGSCWAFS+V A EG  +  TGKL++LS Q LV C +   + GC GG M +AF+++
Sbjct:   133 NQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDGCGGGYMTNAFQYV 190

Query:   183 IHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVS 241
               N GI +E  YPY   D  C   N     AK +GY  +P  +E+AL +AVA   PV+V+
Sbjct:   191 QKNRGIDSEDAYPYVGQDENC-MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVA 249

Query:   242 IDASGSAFQFYSSGVFTGD-CGTE-LDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGY 299
             IDAS ++FQFYS GV+  + C ++ L+H V AVGYG    G K+W++KNSWG +WG +GY
Sbjct:   250 IDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQ-KGKKHWIIKNSWGENWGNKGY 308

Query:   300 IRMKRDIDAKEGLCGIAMDSSYP 322
             I M R+   K   CGIA  +S+P
Sbjct:   309 ILMARN---KNNACGIANLASFP 328


>RGD|708447 [details] [associations]
            symbol:Testin "testin gene" species:10116 "Rattus norvegicus"
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0030054 "cell junction" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 RGD:708447 GO:GO:0005576 GO:GO:0030054 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            MEROPS:C01.972 OMA:RYHAENS OrthoDB:EOG4XWG0N EMBL:U16858
            IPI:IPI00207173 PIR:I52525 PIR:PC1251 RefSeq:NP_775155.1
            UniGene:Rn.10029 ProteinModelPortal:P15242 SMR:P15242
            Ensembl:ENSRNOT00000024467 GeneID:286916 KEGG:rno:286916
            UCSC:RGD:708447 CTD:286916 InParanoid:P15242 NextBio:625036
            Genevestigator:P15242 GermOnline:ENSRNOG00000018028 Uniprot:P15242
        Length = 333

 Score = 587 (211.7 bits), Expect = 4.6e-57, P = 4.6e-57
 Identities = 129/319 (40%), Positives = 183/319 (57%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEF 69
             + SL  +  +W +K+GK Y   EE+ KR  +++ N + IE  N     G   + +++N F
Sbjct:    22 DPSLDVEWNEWRTKHGKTYNMNEERLKR-AVWEKNFKMIELHNWEYLEGRHDFTMAMNAF 80

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
              D TN EF     G++R      +K   F+    + VP  +DWR+ G VTP+KNQG C S
Sbjct:    81 GDLTNIEFVKMMTGFQRQK---IKKTHIFQDHQFLYVPKRVDWRQLGYVTPVKNQGHCAS 137

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
              WAFSA  + EG     T +LI LSEQ L+ C  S V HGC GG M+ AF+++  N G+ 
Sbjct:   138 SWAFSATGSLEGQMFRKTERLIPLSEQNLLDCMGSNVTHGCSGGFMQYAFQYVKDNGGLA 197

Query:   190 TEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSA 248
             TE +YPY+     C    E S  A ++ +  +P  SEEAL+KAVA   P++V++DAS  +
Sbjct:   198 TEESYPYRGQGRECRYHAENS-AANVRDFVQIPG-SEEALMKAVAKVGPISVAVDASHGS 255

Query:   249 FQFYSSGVF-TGDCG-TELDHGVTAVGYG---ATANGTKYWLVKNSWGTSWGEEGYIRMK 303
             FQFY SG++    C    L+H V  VGYG     ++G  +WLVKNSWG  WG +GY+++ 
Sbjct:   256 FQFYGSGIYYEPQCKRVHLNHAVLVVGYGFEGEESDGNSFWLVKNSWGEEWGMKGYMKLA 315

Query:   304 RDIDAKEGLCGIAMDSSYP 322
             +D       CGIA  S+YP
Sbjct:   316 KDWSNH---CGIATYSTYP 331


>UNIPROTKB|G1K2A7
            symbol:CTSK "Cathepsin K" species:9615 "Canis lupus
            familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 PANTHER:PTHR12411:SF55 OMA:LKVPPSH
            EMBL:AAEX03011051 Ensembl:ENSCAFT00000019202 Uniprot:G1K2A7
        Length = 333

 Score = 586 (211.3 bits), Expect = 5.9e-57, P = 5.9e-57
 Identities = 128/316 (40%), Positives = 187/316 (59%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIE--SLNAA-GNKPYKLSINEF 69
             E  L  + + W   Y K Y +  ++  R  I++ N++ I   +L A+ G   Y+L++N  
Sbjct:    23 EEILDTQWDLWKKTYRKQYNSKVDELSRRLIWEKNLKHISIHNLEASLGVHTYELAMNHL 82

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
              D T++E      G + P   +    T +  +     P ++D+RK G VTP+KNQG CGS
Sbjct:    83 GDMTSEEVVQKMTGLKVPPSHSRSNDTLYIPDWESRAPDSVDYRKKGYVTPVKNQGQCGS 142

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CWAFS+V A EG  +  TGKL++LS Q LV C +   + GC GG M +AF+++  N GI 
Sbjct:   143 CWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDGCGGGYMTNAFQYVQKNRGID 200

Query:   190 TEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSA 248
             +E  YPY   D +C   N     AK +GY  +P  +E+AL +AVA   P++V+IDAS ++
Sbjct:   201 SEDAYPYVGQDESC-MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTS 259

Query:   249 FQFYSSGVFTGD-CGTE-LDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDI 306
             FQFYS GV+  + C ++ L+H V AVGYG    G K+W++KNSWG +WG +GYI M R+ 
Sbjct:   260 FQFYSKGVYYDENCNSDNLNHAVLAVGYGIQ-KGNKHWIIKNSWGENWGNKGYILMARN- 317

Query:   307 DAKEGLCGIAMDSSYP 322
               K   CGIA  +S+P
Sbjct:   318 --KNNACGIANLASFP 331


>UNIPROTKB|Q3ZKN1
            symbol:CTSK "Cathepsin K" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:AY738221
            RefSeq:NP_001029168.1 UniGene:Cfa.588 HSSP:P43235
            ProteinModelPortal:Q3ZKN1 SMR:Q3ZKN1 STRING:Q3ZKN1 GeneID:608843
            KEGG:cfa:608843 InParanoid:Q3ZKN1 NextBio:20894470 Uniprot:Q3ZKN1
        Length = 330

 Score = 586 (211.3 bits), Expect = 5.9e-57, P = 5.9e-57
 Identities = 128/316 (40%), Positives = 187/316 (59%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIE--SLNAA-GNKPYKLSINEF 69
             E  L  + + W   Y K Y +  ++  R  I++ N++ I   +L A+ G   Y+L++N  
Sbjct:    20 EEILDTQWDLWKKTYRKQYNSKVDELSRRLIWEKNLKHISIHNLEASLGVHTYELAMNHL 79

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
              D T++E      G + P   +    T +  +     P ++D+RK G VTP+KNQG CGS
Sbjct:    80 GDMTSEEVVQKMTGLKVPPSHSRSNDTLYIPDWESRAPDSVDYRKKGYVTPVKNQGQCGS 139

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CWAFS+V A EG  +  TGKL++LS Q LV C +   + GC GG M +AF+++  N GI 
Sbjct:   140 CWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDGCGGGYMTNAFQYVQKNRGID 197

Query:   190 TEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSA 248
             +E  YPY   D +C   N     AK +GY  +P  +E+AL +AVA   P++V+IDAS ++
Sbjct:   198 SEDAYPYVGQDESC-MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTS 256

Query:   249 FQFYSSGVFTGD-CGTE-LDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDI 306
             FQFYS GV+  + C ++ L+H V AVGYG    G K+W++KNSWG +WG +GYI M R+ 
Sbjct:   257 FQFYSKGVYYDENCNSDNLNHAVLAVGYGIQ-KGNKHWIIKNSWGENWGNKGYILMARN- 314

Query:   307 DAKEGLCGIAMDSSYP 322
               K   CGIA  +S+P
Sbjct:   315 --KNNACGIANLASFP 328


>RGD|1562210
            symbol:MGC114246 "similar to cathepsin R" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1562210 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 EMBL:CH474032 MEROPS:C01.042 KO:K09601
            OMA:ASHESFK OrthoDB:EOG4ZCT6D EMBL:BC091563 IPI:IPI00555186
            RefSeq:NP_001017509.1 UniGene:Rn.198321 SMR:Q5BJA0
            Ensembl:ENSRNOT00000061470 GeneID:498688 KEGG:rno:498688
            UCSC:RGD:1562210 InParanoid:Q5BJA0 NextBio:700535
            Genevestigator:Q5BJA0 Uniprot:Q5BJA0
        Length = 334

 Score = 583 (210.3 bits), Expect = 1.2e-56, P = 1.2e-56
 Identities = 125/321 (38%), Positives = 184/321 (57%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIE---SLNAAGNKPYKLSINEF 69
             + SL  + ++W  KY K Y + EE+E R  ++++N++ I+     N  G   + + INEF
Sbjct:    22 DPSLDAEWQEWKKKYDKSY-SLEEEELRRAVWEENLKMIKLHNGENGLGKNGFTMEINEF 80

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDV-PATMDWRKNGAVTPIKNQGPCG 128
              D T +EF+     +  P   T R+G S        + P  +DWRK G VTP++ QG C 
Sbjct:    81 GDTTGEEFRKMMVEF--PVQ-THREGKSIMKRAAGSIFPKFVDWRKKGYVTPVRRQGNCN 137

Query:   129 SCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGI 188
             +CWAFS   A E  T   +GKLI LS Q LV C     ++GC GG+  +AF++++HN G+
Sbjct:   138 ACWAFSVTGAIEAQTIWQSGKLIPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVLHNGGL 197

Query:   189 TTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN-QPVAVSIDASGS 247
              +EA YPY+  DG C + N  +  A+I G+ ++P  SE+ L+ AVA   P++  IDAS  
Sbjct:   198 QSEATYPYEGKDGPC-RYNPKNSSAEITGFVSLP-ESEDILMVAVATIGPISAGIDASHE 255

Query:   248 AFQFYSSGVF-TGDCGTE-LDHGVTAVGYGATAN---GTKYWLVKNSWGTSWGEEGYIRM 302
             +F+FY  G++   +C +  + HGV  VGYG   N   G  YWL+KNSWG  WG  GY+++
Sbjct:   256 SFKFYKKGIYHEPNCSSNSVTHGVLVVGYGFKGNDTGGDHYWLIKNSWGKQWGIRGYMKI 315

Query:   303 KRDIDAKEGLCGIAMDSSYPT 323
              +D   K   C IA  + YPT
Sbjct:   316 TKD---KNNHCAIASYAHYPT 333


>UNIPROTKB|Q5E968
            symbol:CTSK "Cathepsin K" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:BT021052
            EMBL:BC109853 IPI:IPI00709374 RefSeq:NP_001029607.1
            UniGene:Bt.23218 ProteinModelPortal:Q5E968 SMR:Q5E968 STRING:Q5E968
            MEROPS:I29.007 PRIDE:Q5E968 Ensembl:ENSBTAT00000028016
            GeneID:513038 KEGG:bta:513038 CTD:1513 InParanoid:Q5E968 KO:K01371
            OrthoDB:EOG4SJ5FC NextBio:20870669 PANTHER:PTHR12411:SF55
            Uniprot:Q5E968
        Length = 329

 Score = 581 (209.6 bits), Expect = 2.0e-56, P = 2.0e-56
 Identities = 128/316 (40%), Positives = 185/316 (58%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIE--SLNAA-GNKPYKLSINEF 69
             E  L  + E W   Y K Y +  ++  R  I++ N++ I   +L A+ G   Y+L++N  
Sbjct:    19 EEILDTQWELWKKTYRKQYNSKGDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHL 78

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
              D T++E      G + P   +    T +  +     P ++D+RK G VTP+KNQG CGS
Sbjct:    79 GDMTSEEVVQKMTGLKVPASRSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQCGS 138

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CWAFS+V A EG  +  TGKL++LS Q LV C +   + GC GG M +AF+++  N GI 
Sbjct:   139 CWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDGCGGGYMTNAFQYVQKNRGID 196

Query:   190 TEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSA 248
             +E  YPY   D  C   N     AK +GY  +P  +E+AL +AVA   P++V+IDAS ++
Sbjct:   197 SEDAYPYVGQDENC-MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTS 255

Query:   249 FQFYSSGVFTGD-CGTE-LDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDI 306
             FQFY  GV+  + C ++ L+H V AVGYG    G K+W++KNSWG +WG +GYI M R+ 
Sbjct:   256 FQFYRKGVYYDENCNSDNLNHAVLAVGYGIQ-KGNKHWIIKNSWGENWGNKGYILMARN- 313

Query:   307 DAKEGLCGIAMDSSYP 322
               K   CGIA  +S+P
Sbjct:   314 --KNNACGIANLASFP 327


>UNIPROTKB|F1NZ37
            symbol:LOC420160 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 EMBL:AADN02062018
            IPI:IPI00587784 Ensembl:ENSGALT00000006765 OMA:CGVANQA
            Uniprot:F1NZ37
        Length = 340

 Score = 580 (209.2 bits), Expect = 2.5e-56, P = 2.5e-56
 Identities = 124/315 (39%), Positives = 175/315 (55%)

Query:    16 LSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEFADQ 72
             L E  E+W S Y K Y    E  +R  ++++N+  IE  N   + G   ++L +N + D 
Sbjct:    30 LEEAWERWKSLYAKEYPGEAELIRR-EVWENNLRRIEQHNWEESQGQHTFRLGMNHYGDL 88

Query:    73 TNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWA 132
              ++EF    NG+            +F+       PA +DWR  G VTP+KNQG CGSCWA
Sbjct:    89 MDEEFNQLLNGFAPVQH--EEPALTFQASAAQKTPAEVDWRMRGYVTPVKNQGHCGSCWA 146

Query:   133 FSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEA 192
             FSA  A EG+    TGKL  LSEQ L+ C     ++GC+GG M  AF+++  N G+ +E 
Sbjct:   147 FSATGALEGLVFNWTGKLAVLSEQNLIDCSWKLGNNGCQGGYMTRAFQYVHDNGGMNSEH 206

Query:   193 NYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSAFQF 251
              YPYQA D +  + N A   A       V   SE AL +AVA   PV+V++DAS   F F
Sbjct:   207 IYPYQATDTSSCRYNPADRAANCSTVWLVAQGSEAALEQAVATVGPVSVAVDASSFFFHF 266

Query:   252 YSSGVFTGD-CGTELDHGVTAVGYGATANGTK---YWLVKNSWGTSWGEEGYIRMKRDID 307
             Y SG+F    C  +++HG+ AVGYG +    K   YW++KNSW   WGE+GYIR+ + ++
Sbjct:   267 YKSGIFNSMFCSQKVNHGMLAVGYGISQEARKNVSYWILKNSWSEVWGEKGYIRLLKGVN 326

Query:   308 AKEGLCGIAMDSSYP 322
                  CG+A  +S+P
Sbjct:   327 NH---CGVANQASFP 338


>MGI|MGI:1922258
            symbol:4930486L24Rik "RIKEN cDNA 4930486L24 gene"
            species:10090 "Mus musculus" [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030054 "cell
            junction" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 MGI:MGI:1922258
            GO:GO:0005576 GO:GO:0030054 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 HSSP:P07711
            EMBL:AY146988 EMBL:AK145933 EMBL:BC061218 IPI:IPI00280732
            RefSeq:NP_835199.1 UniGene:Mm.19839 ProteinModelPortal:Q80UB0
            SMR:Q80UB0 MEROPS:C01.972 PRIDE:Q80UB0 Ensembl:ENSMUST00000091569
            GeneID:214639 KEGG:mmu:214639 UCSC:uc007qvs.1 InParanoid:Q80UB0
            OMA:RYHAENS OrthoDB:EOG4XWG0N NextBio:374408 Bgee:Q80UB0
            CleanEx:MM_4930486L24RIK Genevestigator:Q80UB0 Uniprot:Q80UB0
        Length = 333

 Score = 580 (209.2 bits), Expect = 2.5e-56, P = 2.5e-56
 Identities = 127/319 (39%), Positives = 182/319 (57%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEF 69
             + SL  +  +W +K+GK Y   EE+ +R  +++ N + IE  N     G   + +++N F
Sbjct:    22 DPSLDVQWNEWRTKHGKAYNVNEERLRR-AVWEKNFKMIELHNWEYLEGKHDFTMTMNAF 80

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
              D TN EF     G+RR      ++   F+    + VP  +DWR  G VTP+KNQG C S
Sbjct:    81 GDLTNTEFVKMMTGFRRQK---IKRMHVFQDHQFLYVPKYVDWRMLGYVTPVKNQGYCAS 137

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
              WAFSA  + EG     TG+L+ LSEQ L+ C  S V H C GG M++AF+++  N G+ 
Sbjct:   138 SWAFSATGSLEGQMFKKTGRLVPLSEQNLLDCMGSNVTHDCSGGFMQNAFQYVKDNGGLA 197

Query:   190 TEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSA 248
             TE +YPY      C    E S  A ++ +  +P   EEAL+KAVA   P++V++DAS  +
Sbjct:   198 TEESYPYIGPGRKCRYHAENS-AANVRDFVQIPGR-EEALMKAVAKVGPISVAVDASHDS 255

Query:   249 FQFYSSGVF-TGDCG-TELDHGVTAVGYG---ATANGTKYWLVKNSWGTSWGEEGYIRMK 303
             FQFY SG++    C    L+H V  VGYG     ++G  YWLVKNSWG  WG +GYI++ 
Sbjct:   256 FQFYDSGIYYEPQCKRVHLNHAVLVVGYGFEGEESDGNSYWLVKNSWGEEWGMKGYIKIA 315

Query:   304 RDIDAKEGLCGIAMDSSYP 322
             +D +     CGIA  ++YP
Sbjct:   316 KDWNNH---CGIATLATYP 331


>DICTYBASE|DDB_G0281605
            symbol:cfaD "peptidase C1A family protein"
            species:44689 "Dictyostelium discoideum" [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IDA] [GO:0006508 "proteolysis" evidence=IDA] [GO:0031410
            "cytoplasmic vesicle" evidence=IDA] [GO:0031288 "sorocarp
            morphogenesis" evidence=IMP] [GO:0008285 "negative regulation of
            cell proliferation" evidence=IGI;IDA] [GO:0005576 "extracellular
            region" evidence=IEA;IDA] [GO:0005515 "protein binding"
            evidence=IPI] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0281605
            GO:GO:0008285 GO:GO:0005615 GenomeReviews:CM000152_GR
            eggNOG:COG4870 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0031410 EMBL:AAFI02000042
            GO:GO:0031288 RefSeq:XP_640530.1 HSSP:P07711
            ProteinModelPortal:Q54TR1 STRING:Q54TR1 PRIDE:Q54TR1
            EnsemblProtists:DDB0229857 GeneID:8623140 KEGG:ddi:DDB_G0281605
            InParanoid:Q54TR1 OMA:PSAHEHE ProtClustDB:CLSZ2430523
            Uniprot:Q54TR1
        Length = 531

 Score = 578 (208.5 bits), Expect = 4.2e-56, P = 4.2e-56
 Identities = 126/318 (39%), Positives = 186/318 (58%)

Query:    12 QEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFAD 71
             +E   S   +++ ++Y K Y + +E ++RF  FK   + I + NA  +  YKL +N +AD
Sbjct:   217 KEEQASNLFKEYKAQYNKEYSSQDEHDERFINFKAARKIIATHNAKESS-YKLGMNHYAD 275

Query:    72 QTNQEFKAF-RNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSC 130
              +N+EF    +    RP  +T         E++  +P+T+DWR    VTP+K+QG CGSC
Sbjct:   276 LSNKEFNTLVKPKVARPS-VTGADSVHDD-ESLRSIPSTVDWRNQNCVTPVKDQGICGSC 333

Query:   131 WAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITT 190
             W F +  + EG   +T G+L+SLSEQ+LV C       GC GG    AF++++    + T
Sbjct:   334 WTFGSTGSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSLAT 393

Query:   191 EANYPYQAVDGTC-NKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSA 248
             E+NYPY   +G C ++T   S V+ I GY  V + SE AL  A+A   PVA++IDAS   
Sbjct:   394 ESNYPYLMQNGLCRDRTVTPSGVS-ITGYVNVTSGSESALQNAIATTGPVAIAIDASVDD 452

Query:   249 FQFYSSGVFTGD-CGT---ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKR 304
             F++Y SGV+    C     +LDH V A+GYG T  G  Y+LVKNSW T+WG +GY+ M R
Sbjct:   453 FRYYMSGVYNNPACKNGLDDLDHEVLAIGYG-TYQGQDYFLVKNSWSTNWGMDGYVYMAR 511

Query:   305 DIDAKEGLCGIAMDSSYP 322
             + D    LCG++  ++YP
Sbjct:   512 N-D--NNLCGVSSQATYP 526


>UNIPROTKB|F1NHB8
            symbol:F1NHB8 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 EMBL:AADN02044011
            IPI:IPI00586027 Ensembl:ENSGALT00000021873 OMA:SELDHAV
            Uniprot:F1NHB8
        Length = 329

 Score = 577 (208.2 bits), Expect = 5.3e-56, P = 5.3e-56
 Identities = 126/306 (41%), Positives = 180/306 (58%)

Query:    23 WMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRN 82
             +  ++GK Y + EE E R R F  N+ F+ S N A    Y L++N  AD+T QE  A R 
Sbjct:    29 YKERFGKRYSSEEEHEHRKRTFIHNMRFVHSKNRAALS-YSLALNHLADRTPQEMAALRG 87

Query:    83 GYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGI 142
               R  D  + +  +   Y +++ +P ++DWR  GAVTP+K+Q  CGSCW+F+   A EG 
Sbjct:    88 RRRSGDPKSGQPFSMQLYASLV-LPESLDWRLYGAVTPVKDQAVCGSCWSFATTGAMEGA 146

Query:   143 TQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANY-PYQAVDG 201
               L TG L  LS+Q L+ C     ++ C+GGE   A+++I  + GI +  +Y PY   +G
Sbjct:   147 LFLKTGVLTPLSQQVLIDCSWGFGNYACDGGEEWRAYEWIKKHGGIASTESYGPYLGQNG 206

Query:   202 TCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSAFQFYSSGVFTGD 260
              C+  N++  VA + GY TV + + EAL  A+    PVAV+IDAS  +F FY++GV+   
Sbjct:   207 YCHY-NQSELVAPLAGYVTVESGNAEALKAALFKHGPVAVNIDASHKSFTFYANGVYEEP 265

Query:   261 -CG---TELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIA 316
              CG   +ELDH V AVGYG   +G  YWL+KNSW T WG +GYI M      K+  CG+A
Sbjct:   266 HCGNETSELDHAVLAVGYGVL-HGKSYWLIKNSWSTYWGNDGYILMAM----KDNNCGVA 320

Query:   317 MDSSYP 322
               +S+P
Sbjct:   321 TAASFP 326


>DICTYBASE|DDB_G0278401
            symbol:cprH "cysteine proteinase 8" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0278401 EMBL:AAFI02000023
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 ProtClustDB:CLSZ2430780 RefSeq:XP_642342.1
            ProteinModelPortal:Q54Y60 MEROPS:C01.A62 EnsemblProtists:DDB0205428
            GeneID:8621547 KEGG:ddi:DDB_G0278401 InParanoid:Q54Y60 OMA:FANMENE
            Uniprot:Q54Y60
        Length = 337

 Score = 575 (207.5 bits), Expect = 8.6e-56, P = 8.6e-56
 Identities = 135/329 (41%), Positives = 185/329 (56%)

Query:     9 RKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINE 68
             ++L E+   +    WM    K Y +  E   R+ IFK N ++IE  N+ G++   L +N+
Sbjct:    19 QELSESQYRDAFTDWMISNQKSYSS-SEFITRYNIFKTNFDYIEEWNSKGSETV-LGLNK 76

Query:    69 FADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCG 128
              AD TN+E+++   G  +P   +S  GT  +        +T+DWRK GAVT +KNQ  C 
Sbjct:    77 MADITNEEYRSLYLG--KPFDASSLIGTKEEILFSNKFSSTVDWRKKGAVTHVKNQQSCS 134

Query:   129 SCWAFSAVAATEGITQLT---TGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHN 185
              CW+FSA  ATEG  +L    T +L+SLSEQ L+ C T   + GC GG +  AF++II N
Sbjct:   135 GCWSFSATGATEGAHKLANNGTNELVSLSEQNLIDCSTPFGNTGCNGGVITYAFEYIISN 194

Query:   186 DGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDAS 245
              GI TE +YP++  DGTC   +E S  A I  Y  V   SE +L  AV   PVA SIDAS
Sbjct:   195 GGIDTEKSYPFEGTDGTCRYKSENSG-ATISSYVNVTFGSESSLESAVNVNPVACSIDAS 253

Query:   246 GSAFQFYSSGV-FTGDCG-TELDHGVTAVGYG----------ATANGTKYWLVKNSWGTS 293
              S+F FY SG+ F   C  T LDHGV  VGYG          +  N + YW+ KNSWG +
Sbjct:   254 HSSFLFYKSGIYFEPACSRTNLDHGVLVVGYGTENSQSQDSSSEPNHSNYWIAKNSWGIN 313

Query:   294 WGEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
                 GYI M +D   ++ +CGI+  +S+P
Sbjct:   314 ----GYILMSKD---RDNMCGISTLASFP 335


>UNIPROTKB|Q24940
            symbol:Cat-1 "Cathepsin L-like proteinase" species:6192
            "Fasciola hepatica" [GO:0004175 "endopeptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005576 "extracellular region" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005576 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0004197 EMBL:L33771 PIR:S43991 PDB:2O6X
            PDBsum:2O6X ProteinModelPortal:Q24940 SMR:Q24940 MEROPS:C01.033
            EvolutionaryTrace:Q24940 Uniprot:Q24940
        Length = 326

 Score = 575 (207.5 bits), Expect = 8.6e-56, P = 8.6e-56
 Identities = 129/308 (41%), Positives = 177/308 (57%)

Query:    22 QWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAA---GNKPYKLSINEFADQTNQEFK 78
             QW   Y K Y   +++ +R  I++ NV+ I+  N     G   Y L +N+F D T +EFK
Sbjct:    23 QWKRMYNKEYNGADDQHRR-NIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFK 81

Query:    79 A-FRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAFSAVA 137
             A +     R   + S  G  ++  N   VP  +DWR++G VT +K+QG CGSCWAFS   
Sbjct:    82 AKYLTEMSRASDILSH-GVPYEANNRA-VPDKIDWRESGYVTEVKDQGNCGSCWAFSTTG 139

Query:   138 ATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQ 197
               EG         IS SEQ+LV C     ++GC GG ME+A++++    G+ TE++YPY 
Sbjct:   140 TMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYL-KQFGLETESSYPYT 198

Query:   198 AVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAV-ANQPVAVSIDASGSAFQFYSSGV 256
             AV+G C + N+   VAK+ GY TV + SE  L   V A +P AV++D   S F  Y SG+
Sbjct:   199 AVEGQC-RYNKQLGVAKVTGYYTVHSGSEVELKNLVGARRPAAVAVDVE-SDFMMYRSGI 256

Query:   257 FTGD-CGT-ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCG 314
             +    C    ++H V AVGYG T  GT YW+VKNSWGT WGE GYIRM R+   +  +CG
Sbjct:   257 YQSQTCSPLRVNHAVLAVGYG-TQGGTDYWIVKNSWGTYWGERGYIRMARN---RGNMCG 312

Query:   315 IAMDSSYP 322
             IA  +S P
Sbjct:   313 IASLASLP 320


>TAIR|locus:2078312
            symbol:AT3G45310 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005773 EMBL:CP002686
            GenomeReviews:BA000014_GR eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AL132953
            EMBL:AY091771 IPI:IPI00540369 PIR:T47471 RefSeq:NP_566880.1
            UniGene:At.25239 ProteinModelPortal:Q8RWQ9 SMR:Q8RWQ9
            MEROPS:C01.162 PaxDb:Q8RWQ9 PRIDE:Q8RWQ9 EnsemblPlants:AT3G45310.1
            GeneID:823669 KEGG:ath:AT3G45310 GeneFarm:5032 TAIR:At3g45310
            InParanoid:Q8RWQ9 KO:K01366 OMA:AFEVVHE PhylomeDB:Q8RWQ9
            ProtClustDB:CLSN2689015 Genevestigator:Q8RWQ9 Uniprot:Q8RWQ9
        Length = 358

 Score = 574 (207.1 bits), Expect = 1.1e-55, P = 1.1e-55
 Identities = 128/302 (42%), Positives = 176/302 (58%)

Query:    26 KYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGYR 85
             +YGK Y++ EE + RF +FK+N++ I S N  G   YKLS+N+FAD T QEF+ ++ G  
Sbjct:    65 RYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLS-YKLSLNQFADLTWQEFQRYKLGAA 123

Query:    86 RPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQL 145
             +    T  KG S K      VP T DWR++G V+P+K QG CGSCW FS   A E     
Sbjct:   124 QNCSATL-KG-SHKITEAT-VPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQ 180

Query:   146 TTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNK 205
               GK ISLSEQ+LV C  +  + GC GG    AF++I +N G+ TE  YPY   DG C K
Sbjct:   181 AFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGGC-K 239

Query:   206 TNEASHVAKIKGYETVPANSEEALLKAVAN-QPVAVSIDASGSAFQFYSSGVFTGD-CG- 262
              +  +   +++    +   +E+ L  AV   +PV+V+ +     F+FY  GVFT + CG 
Sbjct:   240 FSAKNIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVHE-FRFYKKGVFTSNTCGN 298

Query:   263 TELD--HGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSS 320
             T +D  H V AVGYG   +   YWL+KNSWG  WG+ GY +M+      + +CG+A  SS
Sbjct:   299 TPMDVNHAVLAVGYGVE-DDVPYWLIKNSWGGEWGDNGYFKMEMG----KNMCGVATCSS 353

Query:   321 YP 322
             YP
Sbjct:   354 YP 355


>MGI|MGI:1861723
            symbol:Ctsr "cathepsin R" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=ISA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=ISA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0030163 "protein
            catabolic process" evidence=ISA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1861723 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0030163
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF245399
            EMBL:AY014778 EMBL:AK014432 EMBL:AK005429 IPI:IPI00120321
            RefSeq:NP_064680.1 UniGene:Mm.315715 ProteinModelPortal:Q9JIA9
            SMR:Q9JIA9 MEROPS:C01.042 PRIDE:Q9JIA9 Ensembl:ENSMUST00000021889
            GeneID:56835 KEGG:mmu:56835 CTD:56835 InParanoid:Q9JIA9 KO:K09601
            OMA:ASHESFK OrthoDB:EOG4ZCT6D NextBio:313379 Bgee:Q9JIA9
            CleanEx:MM_CTSR Genevestigator:Q9JIA9 GermOnline:ENSMUSG00000055679
            Uniprot:Q9JIA9
        Length = 334

 Score = 573 (206.8 bits), Expect = 1.4e-55, P = 1.4e-55
 Identities = 124/321 (38%), Positives = 183/321 (57%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIE---SLNAAGNKPYKLSINEF 69
             ++SL  + + W  KY K Y   EEK KR  ++++ ++ I+     N+ G   + + +NEF
Sbjct:    22 DSSLDAEWQDWKIKYNKSYSLKEEKLKRV-VWEEKLKMIKLHNRENSLGKNGFTMKMNEF 80

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSF-KYENVIDVPATMDWRKNGAVTPIKNQGPCG 128
              DQT++EF   R         T R+G S  K E    +P  +DWRK G VTP++ QG C 
Sbjct:    81 GDQTDEEF---RKMMIEISVWTHREGKSIMKREAGSILPKFVDWRKKGYVTPVRRQGDCD 137

Query:   129 SCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGI 188
             +CWAF+   A E      TGKL  LS Q LV C     ++GC GG+  +AF++++HN G+
Sbjct:   138 ACWAFAVTGAIEAQAIWQTGKLTPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVLHNGGL 197

Query:   189 TTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN-QPVAVSIDASGS 247
              +EA YPY+  DG C + N  +  A+I G+ ++P  SE+ L+ AVA   P+   IDAS  
Sbjct:   198 ESEATYPYEGKDGPC-RYNPKNSKAEITGFVSLP-QSEDILMAAVATIGPITAGIDASHE 255

Query:   248 AFQFYSSGVF-TGDCGTE-LDHGVTAVGYG---ATANGTKYWLVKNSWGTSWGEEGYIRM 302
             +F+ Y  G++   +C ++ + HGV  VGYG      +G  YWL+KNSWG  WG  GY+++
Sbjct:   256 SFKNYKGGIYHEPNCSSDTVTHGVLVVGYGFKGIETDGNHYWLIKNSWGKRWGIRGYMKL 315

Query:   303 KRDIDAKEGLCGIAMDSSYPT 323
              +D   K   CGIA  + YPT
Sbjct:   316 AKD---KNNHCGIASYAHYPT 333


>UNIPROTKB|Q10991
            symbol:CTSL "Cathepsin L1" species:9940 "Ovis aries"
            [GO:0005515 "protein binding" evidence=IPI] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            MEROPS:C01.032 ProteinModelPortal:Q10991 SMR:Q10991 Uniprot:Q10991
        Length = 217

 Score = 572 (206.4 bits), Expect = 1.8e-55, P = 1.8e-55
 Identities = 112/221 (50%), Positives = 146/221 (66%)

Query:   106 VPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSG 165
             VP ++DW K G VTP+KNQG CGSCWAFSA  A EG     TGKL+SLSEQ LV      
Sbjct:     1 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQ 60

Query:   166 VDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANS 225
              + GC GG M++AF++I  N G+ +E +YPY+A D +CN   E S  AK  G+  +P   
Sbjct:    61 GNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSCNYKPEYS-AAKDTGFVDIPQR- 118

Query:   226 EEALLKAVANQ-PVAVSIDASGSAFQFYSSGVFTG-DCGT-ELDHGVTAVGYGATANGTK 282
             E+AL+KAVA   P++V+IDA  S+FQFY SG++   DC + +LDHGV  VGYG      K
Sbjct:   119 EKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTNNK 178

Query:   283 YWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYPT 323
             +W+VKNSWG  WG +GY++M +D   +   CGIA  +SYPT
Sbjct:   179 FWIVKNSWGPEWGNKGYVKMAKD---QNNHCGIATAASYPT 216


>RGD|631421
            symbol:Ctsq "cathepsin Q" species:10116 "Rattus norvegicus"
            [GO:0005764 "lysosome" evidence=NAS] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 RGD:631421 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 UniGene:Rn.34875 EMBL:AF187323 IPI:IPI00214897
            PIR:JC7183 RefSeq:NP_640355.1 UniGene:Rn.35820
            ProteinModelPortal:Q9QZE3 SMR:Q9QZE3 STRING:Q9QZE3 MEROPS:C01.039
            PRIDE:Q9QZE3 Ensembl:ENSRNOT00000024208 GeneID:246147
            KEGG:rno:246147 UCSC:RGD:631421 CTD:104002 InParanoid:Q9QZE3
            OMA:ESEDVLM OrthoDB:EOG4HHP48 NextBio:623425 Genevestigator:Q9QZE3
            GermOnline:ENSRNOG00000017946 Uniprot:Q9QZE3
        Length = 343

 Score = 569 (205.4 bits), Expect = 3.7e-55, P = 3.7e-55
 Identities = 125/327 (38%), Positives = 181/327 (55%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIE---SLNAAGNKPYKLSINEF 69
             + SL  + ++W  KY K+Y   EE  KR  ++++NV+ IE     N+ G   Y + IN+F
Sbjct:    22 DLSLDVQWQEWKIKYEKLYSPEEEVLKRV-VWEENVKKIELHNRENSLGKNTYTMEINDF 80

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRK------GTSFKYE-NVID-VPATMDWRKNGAVTPI 121
             AD T++EFK    G++ P   T ++      G+ F    N  D +P  +DWR  G VT +
Sbjct:    81 ADMTDEEFKDMIIGFQLPVHNTEKRLWKRALGSFFPNSWNWRDALPKFVDWRNEGYVTRV 140

Query:   122 KNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKF 181
             + QG C SCWAF    A EG     TGKLI LS Q L+ C     + GC  G   +AF++
Sbjct:   141 RKQGGCSSCWAFPVTGAIEGQMFKKTGKLIPLSVQNLIDCSKPQGNRGCLWGNTYNAFQY 200

Query:   182 IIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAV 240
             ++HN G+  EA YPY+  +G C + N  +  AKI G+  +P  SE+ L+ AVA + P+A 
Sbjct:   201 VLHNGGLEAEATYPYERKEGVC-RYNPKNSSAKITGFVVLP-ESEDVLMDAVATKGPIAT 258

Query:   241 SIDASGSAFQFYSSGVF-TGDCGTELDHGVTAVGYGATAN---GTKYWLVKNSWGTSWGE 296
              +    S+F+FY  GV+    C + ++H V  VGYG   N   G  YWL+KNSWG  WG 
Sbjct:   259 GVHVISSSFRFYQKGVYHEPKCSSYVNHAVLVVGYGFEGNETDGNNYWLIKNSWGKRWGL 318

Query:   297 EGYIRMKRDIDAKEGLCGIAMDSSYPT 323
              GY+++ +D   +   C IA  + YPT
Sbjct:   319 RGYMKIAKD---RNNHCAIASLAQYPT 342


>RGD|69241
            symbol:Ctsj "cathepsin J" species:10116 "Rattus norvegicus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0048471 "perinuclear region of cytoplasm"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:L14776
            RGD:69241 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246
            SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 HOVERGEN:HBG011513 MEROPS:C01.038 CTD:26898 KO:K09599
            EMBL:AF310623 EMBL:BC097263 IPI:IPI00205027 PIR:I58002
            RefSeq:NP_058817.1 UniGene:Rn.34875 ProteinModelPortal:Q63088
            SMR:Q63088 PRIDE:Q63088 GeneID:29174 KEGG:rno:29174 NextBio:608244
            Genevestigator:Q63088 Uniprot:Q63088
        Length = 334

 Score = 569 (205.4 bits), Expect = 3.7e-55, P = 3.7e-55
 Identities = 123/320 (38%), Positives = 182/320 (56%)

Query:    12 QEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIE---SLNAAGNKPYKLSINE 68
             ++ +L  + + W +KY K Y   EE+ KR  ++++N++ I+     N  G   + + +N 
Sbjct:    21 RDPNLDAEWQDWKTKYAKSYSPVEEELKR-AVWEENLKMIQLHNKENGLGKNGFTMEMNA 79

Query:    69 FADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCG 128
             FAD T +EF+   +    P  +T+    S + +  I +P   DWRK G VTP++NQG CG
Sbjct:    80 FADTTGEEFRKSLSDILIPAAVTN---PSAQKQVSIGLPNFKDWRKEGYVTPVRNQGKCG 136

Query:   129 SCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGI 188
             SCWAF+AV A EG     TG L  LS Q L+ C  S  ++GC  G    AF +++ N G+
Sbjct:   137 SCWAFAAVGAIEGQMFSKTGNLTPLSVQNLLDCSKSEGNNGCRWGTAHQAFNYVLKNKGL 196

Query:   189 TTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN-QPVAVSIDASGS 247
               EA YPY+  DG C   +E +  A I G+  +P N E  L  AVA+  PV+ +IDAS  
Sbjct:   197 EAEATYPYEGKDGPCRYHSENAS-ANITGFVNLPPN-ELYLWVAVASIGPVSAAIDASHD 254

Query:   248 AFQFYSSGVF-TGDCGTEL-DHGVTAVGYGATAN---GTKYWLVKNSWGTSWGEEGYIRM 302
             +F+FYS GV+   +C + + +H V  VGYG   N   G  YWL+KNSWG  WG  G++++
Sbjct:   255 SFRFYSGGVYHEPNCSSYVVNHAVLVVGYGFEGNETDGNNYWLIKNSWGEEWGINGFMKI 314

Query:   303 KRDIDAKEGLCGIAMDSSYP 322
              +D   +   CGIA  +S+P
Sbjct:   315 AKD---RNNHCGIASQASFP 331


>ZFIN|ZDB-GENE-050522-559
            symbol:ctssb.1 "cathepsin S, b.1" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-050522-559 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 MEROPS:C01.034
            EMBL:BC095694 IPI:IPI00607338 UniGene:Dr.75553
            ProteinModelPortal:Q502H6 SMR:Q502H6 InParanoid:Q502H6
            ArrayExpress:Q502H6 Uniprot:Q502H6
        Length = 330

 Score = 568 (205.0 bits), Expect = 4.8e-55, P = 4.8e-55
 Identities = 125/313 (39%), Positives = 179/313 (57%)

Query:    15 SLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFI--ESLNAA-GNKPYKLSINEFAD 71
             +L +  E W   YGK+Y    E+  R ++++ N++ I   +L A+ G   Y LS+N   D
Sbjct:    22 NLDQHWELWKKTYGKIYTTEVEEFGRRQLWERNLQLITVHNLEASMGMHSYDLSMNHMGD 81

Query:    72 QTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCW 131
              T +E          P G   R+  +    +   VP ++DWR+ G V+ +K QG CGSCW
Sbjct:    82 LTTEEILQTLALTHVPSGF-KRQIANIVGSSGDAVPDSLDWREKGYVSSVKMQGACGSCW 140

Query:   132 AFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTE 191
             AFS+V A EG  + TTGKL+ LS Q LV C +   + GC GG M DAF+++I N GI ++
Sbjct:   141 AFSSVGALEGQLKKTTGKLVDLSPQNLVDCSSKYGNKGCNGGFMSDAFQYVIDNGGIASD 200

Query:   192 ANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSAFQ 250
             + YPY+ V   C+ ++ +   A    Y  V    E AL +AVA+  P++V+IDA+   F 
Sbjct:   201 SAYPYRGVQQQCSYSS-SQRAANCTKYYFVRQGDENALKQAVASVGPISVAIDATRPQFV 259

Query:   251 FYSSGVFTGD-CGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAK 309
              Y SGV+    C   ++H V  VGYG T +G  +WLVKNSWGT +G+ GYIRM R+   K
Sbjct:   260 LYHSGVYNDPTCSKRVNHAVLVVGYG-TLSGQDHWLVKNSWGTRFGDGGYIRMARN---K 315

Query:   310 EGLCGIAMDSSYP 322
               +CGIA  + YP
Sbjct:   316 NNMCGIASYACYP 328


>UNIPROTKB|Q90686
            symbol:CTSK "Cathepsin K" species:9031 "Gallus gallus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            CTD:1513 KO:K01371 PANTHER:PTHR12411:SF55 EMBL:U37691
            IPI:IPI00575213 RefSeq:NP_990302.1 UniGene:Gga.51509
            ProteinModelPortal:Q90686 SMR:Q90686 MEROPS:C01.036 GeneID:395818
            KEGG:gga:395818 NextBio:20815886 Uniprot:Q90686
        Length = 334

 Score = 567 (204.7 bits), Expect = 6.1e-55, P = 6.1e-55
 Identities = 121/269 (44%), Positives = 169/269 (62%)

Query:    58 GNKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA 117
             G   ++L++N   D T++E      G R P       GT +  +     PA +DWR+ G 
Sbjct:    72 GKHSFQLAMNYLGDMTSEEVVRTMTGLRVPRSRPRPNGTLYVPDWSSRAPAAVDWRRKGY 131

Query:   118 VTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMED 177
             VTP+K+QG CGSCWAFS+V A EG  +  TGKL+SLS Q LV C ++  ++GC GG M +
Sbjct:   132 VTPVKDQGQCGSCWAFSSVGALEGQLKRRTGKLLSLSPQNLVYCVSN--NNGCGGGYMTN 189

Query:   178 AFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN-Q 236
             AF+++  N GI +E  YPY   D +C   +     AK +GY  +P ++E+AL +AVA   
Sbjct:   190 AFEYVRLNRGIDSEDAYPYIGQDESC-MYSPTGKAAKCRGYREIPEDNEKALKRAVARIG 248

Query:   237 PVAVSIDASGSAFQFYSSGVF--TGDCGTE-LDHGVTAVGYGATANGTKYWLVKNSWGTS 293
             PV+V IDAS  +FQFYS GV+  TG C  E ++H V AVGYGA   GTK+W++KNSWGT 
Sbjct:   249 PVSVGIDASLPSFQFYSRGVYYDTG-CNPENINHAVLAVGYGAQ-KGTKHWIIKNSWGTE 306

Query:   294 WGEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
             WG +GY+ + R++  K+  CGIA  +S+P
Sbjct:   307 WGNKGYVLLARNM--KQ-TCGIANLASFP 332


>ZFIN|ZDB-GENE-050417-107
            symbol:zgc:110239 "zgc:110239" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-050417-107
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 MEROPS:I29.003 OrthoDB:EOG412M56 EMBL:BC092817
            IPI:IPI00503987 RefSeq:NP_001017633.1 UniGene:Dr.39081
            ProteinModelPortal:Q568K7 GeneID:550326 KEGG:dre:550326
            HOGENOM:HOG000007373 HOVERGEN:HBG105018 InParanoid:Q568K7
            NextBio:20879584 ArrayExpress:Q568K7 Uniprot:Q568K7
        Length = 546

 Score = 560 (202.2 bits), Expect = 3.4e-54, P = 3.4e-54
 Identities = 129/323 (39%), Positives = 186/323 (57%)

Query:    11 LQEASLSEKHEQ---WMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSIN 67
             ++ + +S  H     +  K+ + Y N  E E+R   F  N+ ++ S+N AG   + LS+N
Sbjct:   231 VETSPVSHAHRMFGHYKEKFNRQYDNEMEHEEREHNFVHNIRYVHSMNRAGLS-FSLSVN 289

Query:    68 EFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYE-NVIDVPATMDWRKNGAVTPIKNQGP 126
               AD++ +E    R G +R   +  RK   F  E   I  P ++DWR  GAVTP+K+Q  
Sbjct:   290 HLADRSQKELSMMR-GCQRTHKV-HRKAQPFPSEIRSIATPNSVDWRLYGAVTPVKDQAV 347

Query:   127 CGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHND 186
             CGSCW+F+     EG   L TG+L SLS+Q LV C     ++GC+GGE   AF++I+ + 
Sbjct:   348 CGSCWSFATTGTLEGALFLKTGQLTSLSQQMLVDCTWGFGNNGCDGGEEWRAFEWIMKHG 407

Query:   187 GITTEANY-PYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN-QPVAVSIDA 244
             GI+T  +Y  Y  ++G C+  +++S VA++ GY  V +    AL  A+    PVAVSIDA
Sbjct:   408 GISTAESYGAYMGMNGLCHY-DKSSMVAQLTGYTNVTSGDILALKAAIFKFGPVAVSIDA 466

Query:   245 SGSAFQFYSSGVF-TGDC--G-TELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYI 300
             +  +F FYS+GV+   +C  G  +LDH V AVGYG   N   YWLVKNSW + WG +GYI
Sbjct:   467 AHRSFAFYSNGVYYEPECKNGINDLDHAVLAVGYGIM-NNESYWLVKNSWSSYWGNDGYI 525

Query:   301 RMKRDIDAKEGLCGIAMDSSYPT 323
              M      K+  CG+A D+ Y T
Sbjct:   526 LMSM----KDNNCGVATDAIYAT 544


>MGI|MGI:1927229
            symbol:Ctsm "cathepsin M" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008152 "metabolic process" evidence=ISS] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015645 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1927229 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF202528
            EMBL:AY014777 EMBL:AY057446 EMBL:AK005550 EMBL:AK005428
            IPI:IPI00131133 RefSeq:NP_071721.2 UniGene:Mm.279933
            ProteinModelPortal:Q9JL96 SMR:Q9JL96 STRING:Q9JL96 MEROPS:C01.023
            PRIDE:Q9JL96 DNASU:64139 Ensembl:ENSMUST00000099451 GeneID:64139
            KEGG:mmu:64139 UCSC:uc007qwj.1 CTD:64139 InParanoid:Q9JL96
            KO:K09600 OrthoDB:EOG4TTGKR NextBio:319931 Bgee:Q9JL96
            CleanEx:MM_CTSM Genevestigator:Q9JL96 GermOnline:ENSMUSG00000074484
            GermOnline:ENSMUSG00000074871 PANTHER:PTHR12411:SF58 Uniprot:Q9JL96
        Length = 333

 Score = 559 (201.8 bits), Expect = 4.3e-54, P = 4.3e-54
 Identities = 125/316 (39%), Positives = 182/316 (57%)

Query:    16 LSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIE---SLNAAGNKPYKLSINEFADQ 72
             L  + ++W  KYGK Y   EE +KR  +++DN++ I+     N  G   + + +N F D 
Sbjct:    25 LDVEWQKWKIKYGKAYSLEEEGQKR-AVWEDNMKKIKLHNGENGLGKHGFTMEMNAFGDM 83

Query:    73 TNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWA 132
             T +EF+        P   T +KG S +    +++P  ++W+K G VTP++ QG C SCWA
Sbjct:    84 TLEEFRKVMIEIPVP---TVKKGKSVQKRLSVNLPKFINWKKRGYVTPVQTQGRCNSCWA 140

Query:   133 FSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEA 192
             FS   A EG     TG+LI LS Q LV C     + GC  G    A  +++ N G+ +EA
Sbjct:   141 FSVTGAIEGQMFRKTGQLIPLSVQNLVDCSRPQGNWGCYLGNTYLALHYVMENGGLESEA 200

Query:   193 NYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN-QPVAVSIDASGSAFQF 251
              YPY+  DG+C  + E S  A I G+E VP N E+AL+ AVA+  P++V+IDA  ++F F
Sbjct:   201 TYPYEEKDGSCRYSPENS-TANITGFEFVPKN-EDALMNAVASIGPISVAIDARHASFLF 258

Query:   252 YSSGVF-TGDCGT-ELDHGVTAVGYGAT---ANGTKYWLVKNSWGTSWGEEGYIRMKRDI 306
             Y  G++   +C +  + H +  VGYG T   ++G KYWLVKNS GT WG +GY+++ RD 
Sbjct:   259 YKRGIYYEPNCSSCVVTHSMLLVGYGFTGRESDGRKYWLVKNSMGTQWGNKGYMKISRD- 317

Query:   307 DAKEGLCGIAMDSSYP 322
               K   CGIA  + YP
Sbjct:   318 --KGNHCGIATYALYP 331


>ZFIN|ZDB-GENE-040426-1583
            symbol:ctssa "cathepsin S, a" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-040426-1583
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            GeneTree:ENSGT00660000095458 EMBL:CR548627 IPI:IPI00491948
            UniGene:Dr.81560 SMR:Q1L8W8 Ensembl:ENSDART00000053638 OMA:RNTREER
            OrthoDB:EOG480HX9 Uniprot:Q1L8W8
        Length = 328

 Score = 558 (201.5 bits), Expect = 5.5e-54, P = 5.5e-54
 Identities = 121/314 (38%), Positives = 176/314 (56%)

Query:    16 LSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEFADQ 72
             L+ +   W S++ K Y+N  E+  R  ++K N++ I   N   A G   Y L +N+ +D 
Sbjct:    23 LTNQWTTWKSQHNKTYRNTREERLRRSVWKQNLQDILLHNEAAAVGLHSYTLGLNQLSDM 82

Query:    73 TNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWA 132
             T  E     NG    D        +F   ++  +P  ++W ++G V+P++NQGPCGSCWA
Sbjct:    83 TADEVNDM-NGLLEED--FPDVNATFSPPSLQTLPQRVNWTEHGMVSPVQNQGPCGSCWA 139

Query:   133 FSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEA 192
             FSAV + E   +  T  L+ LS Q L+ C  S  + GC+GG +  AF ++I N GI +  
Sbjct:   140 FSAVGSLEAQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRAFLYVIQNRGIDSST 199

Query:   193 NYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN-QPVAVSIDASGSAFQF 251
              YPY+  +G C + + +       G+  VP ++E AL  AVAN  PV+V I+A   +F  
Sbjct:   200 FYPYEHKEGVC-RYSVSGRAGYCTGFRIVPRHNEAALQSAVANIGPVSVGINAKLLSFHR 258

Query:   252 YSSGVFTGD-CGTEL-DHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAK 309
             Y SG++    C + L +H V  VGYG+  NG  YWLVKNSWGT+WGE GYIRM R+    
Sbjct:   259 YRSGIYNDPKCSSALINHAVLVVGYGSE-NGQDYWLVKNSWGTAWGENGYIRMARN---- 313

Query:   310 EGLCGIAMDSSYPT 323
             + +CGI+    YPT
Sbjct:   314 KNMCGISSFGIYPT 327


>DICTYBASE|DDB_G0291191
            symbol:DDB_G0291191 "cysteine protease" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0291191
            GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AAFI02000175 MEROPS:C01.022
            ProtClustDB:CLSZ2429603 RefSeq:XP_635374.1
            ProteinModelPortal:Q54F16 PRIDE:Q54F16 EnsemblProtists:DDB0252831
            GeneID:8628022 KEGG:ddi:DDB_G0291191 OMA:NETQIAS Uniprot:Q54F16
        Length = 352

 Score = 557 (201.1 bits), Expect = 7.0e-54, P = 7.0e-54
 Identities = 131/322 (40%), Positives = 173/322 (53%)

Query:    23 WMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGN---KPYKLSINEFADQTNQEFKA 79
             + +KY K+Y + EE   +F  FK N+  I++LN          K  +N+FAD + +EFK 
Sbjct:    30 FQNKYNKIY-SAEEYLVKFETFKSNLLNIDALNKQATTIGSDTKFGVNKFADLSKEEFKK 88

Query:    80 F---RNGYRRPDGLTSRKGTSFKYENVIDV-PATMDWRKNGA---------VTPIKNQGP 126
             +       R  D L      S   +++I   PA  DWR  G          VT +KNQG 
Sbjct:    89 YYLSSKEARLTDDLPMLPNLS---DDIISATPAAFDWRNTGGSTKFPQGTPVTAVKNQGQ 145

Query:   127 CGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDH--------GCEGGEMEDA 178
             CGSCW+FS     EG   L+TG L+ LSEQ LV CD + + +        GC+GG   +A
Sbjct:   146 CGSCWSFSTTGNVEGQHYLSTGTLVGLSEQNLVDCDHTCMTYENENVCNAGCDGGLQPNA 205

Query:   179 FKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPV 238
             + +II N GI TEA YPY AVDG C K N A   AKI  +  VP N  +       N P+
Sbjct:   206 YNYIIKNGGIQTEATYPYTAVDGEC-KFNSAQVGAKISSFTMVPQNETQIASYLFNNGPL 264

Query:   239 AVSIDASGSAFQFYSSGVFTGDCGTELDHGVTAVGYGA--TANG--TKYWLVKNSWGTSW 294
             A++ DA    +QFY  GVF   CG  LDHG+  VGYGA  T  G  T YW++KNSWG  W
Sbjct:   265 AIAADAE--EWQFYMGGVFDFPCGQTLDHGILIVGYGAQDTIVGKNTPYWIIKNSWGADW 322

Query:   295 GEEGYIRMKRDIDAKEGLCGIA 316
             GE GY++++R+ D     CG+A
Sbjct:   323 GEAGYLKVERNTDK----CGVA 340


>MGI|MGI:1860262
            symbol:Cts7 "cathepsin 7" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005576
            "extracellular region" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005768 "endosome" evidence=IEA]
            [GO:0005794 "Golgi apparatus" evidence=IEA] [GO:0006508
            "proteolysis" evidence=ISA] [GO:0007049 "cell cycle" evidence=IEA]
            [GO:0007067 "mitosis" evidence=IEA] [GO:0008152 "metabolic process"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=ISA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0051301 "cell
            division" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:1860262 GO:GO:0005634 GO:GO:0005794 GO:GO:0048471
            GO:GO:0005615 GO:GO:0051301 GO:GO:0007067 GO:GO:0005768
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GO:GO:0008233 EMBL:CH466546
            EMBL:AY014779 EMBL:CT030645 EMBL:BC064740 EMBL:AF250837
            IPI:IPI00131132 RefSeq:NP_062412.1 UniGene:Mm.3692 HSSP:O60911
            ProteinModelPortal:Q91ZF2 SMR:Q91ZF2 STRING:Q91ZF2 MEROPS:C01.016
            PRIDE:Q91ZF2 Ensembl:ENSMUST00000021892 GeneID:56092 KEGG:mmu:56092
            UCSC:uc007qwi.1 CTD:56092 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 InParanoid:Q91ZF2 OMA:ERRVIWE OrthoDB:EOG44QT2S
            NextBio:311908 Bgee:Q91ZF2 Genevestigator:Q91ZF2 Uniprot:Q91ZF2
        Length = 331

 Score = 555 (200.4 bits), Expect = 1.1e-53, P = 1.1e-53
 Identities = 124/317 (39%), Positives = 176/317 (55%)

Query:    15 SLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESL---NAAGNKPYKLSINEFAD 71
             +L  + E+W     + Y +PEE+++R  +++ NV++I+     N      + + +NEF D
Sbjct:    24 NLDAEWEEWKRSNDRTY-SPEEEKQRRAVWEGNVKWIKQHIMENGLWMNNFTIEMNEFGD 82

Query:    72 QTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCW 131
              T +E K        P     R G   +  N   +P T+DWRK G VTP++ QG CG+CW
Sbjct:    83 MTGEEMKMLTESSSYP----LRNGKHIQKRNP-KIPPTLDWRKEGYVTPVRRQGSCGACW 137

Query:   132 AFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTE 191
             AFS  A  EG     TGKLI LS Q L+ C  S    GC+GG   DAF+++ +N G+  E
Sbjct:   138 AFSVTACIEGQLFKKTGKLIPLSVQNLMDCSVSYGTKGCDGGRPYDAFQYVKNNGGLEAE 197

Query:   192 ANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKA-VANQPVAVSIDASGSAFQ 250
             A YPY+A    C    E S V K+  +  VP N EEALL+A V + P+AV+ID S ++F 
Sbjct:   198 ATYPYEAKAKHCRYRPERS-VVKVNRFFVVPRN-EEALLQALVTHGPIAVAIDGSHASFH 255

Query:   251 FYSSGVF-TGDCGTE-LDHGVTAVGYGATANGT---KYWLVKNSWGTSWGEEGYIRMKRD 305
              Y  G++    C  + LDHG+  VGYG   + +   KYWL+KNS G  WGE GY+++ R 
Sbjct:   256 SYRGGIYHEPKCRKDTLDHGLLLVGYGYEGHESENRKYWLLKNSHGERWGENGYMKLPR- 314

Query:   306 IDAKEGLCGIAMDSSYP 322
                +   CGIA  + YP
Sbjct:   315 --GQNNYCGIASYAMYP 329


>UNIPROTKB|G1M0X4
            symbol:CTSH "Uncharacterized protein" species:9646
            "Ailuropoda melanoleuca" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 EMBL:ACTA01057330 EMBL:ACTA01065330
            Ensembl:ENSAMET00000013529 Uniprot:G1M0X4
        Length = 337

 Score = 552 (199.4 bits), Expect = 2.4e-53, P = 2.4e-53
 Identities = 122/314 (38%), Positives = 178/314 (56%)

Query:    17 SEK-H-EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTN 74
             +EK H + WM ++ K Y + EE + R R F  N   I + NA GN  +K+ +N+F+D + 
Sbjct:    32 TEKVHFKSWMVQHQKKYSS-EEYQHRLRTFVGNWRKINAHNA-GNHTFKMGLNQFSDMSF 89

Query:    75 QEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA-VTPIKNQGPCGSCWAF 133
              E K  +  +  P   ++ KG   +       P  +DWRK G  V+P+KNQG CGSCW F
Sbjct:    90 AEIKR-KYLWSEPQNCSATKGNYLRGTG--PYPPFVDWRKKGKFVSPVKNQGGCGSCWTF 146

Query:   134 SAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEAN 193
             S   A E    + TGKL+SL+EQ+LV C     +HGC+GG    AF++I +N GI  E +
Sbjct:   147 STTGALESAIAIKTGKLLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYIRYNRGIMGEDS 206

Query:   194 YPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN-QPVAVSIDASGSAFQFY 252
             YPY+  DG C K   +  +A +K    +  N E+A+++AVA   PV+ + + +G  F  Y
Sbjct:   207 YPYKGQDGDC-KFQPSKAIAFVKDVANITINDEQAMVEAVALFNPVSFAFEVTGD-FMMY 264

Query:   253 SSGVFTG-DCGT---ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDA 308
               GV++   C     +++H V AVGYG   NG  YW+VKNSWG  WG  GY  ++R    
Sbjct:   265 RKGVYSSTSCHKTPDKVNHAVLAVGYGEQ-NGVPYWIVKNSWGPQWGMHGYFLIERG--- 320

Query:   309 KEGLCGIAMDSSYP 322
              + +CG+A  +SYP
Sbjct:   321 -KNMCGLAACASYP 333


>ZFIN|ZDB-GENE-030131-3539
            symbol:ctsh "cathepsin H" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-030131-3539
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 KO:K01366 HOVERGEN:HBG011513
            CTD:1512 OrthoDB:EOG4W9J43 MEROPS:I29.003 HSSP:P43235 EMBL:BC067615
            IPI:IPI00506892 RefSeq:NP_997853.1 UniGene:Dr.14176
            ProteinModelPortal:Q6NWF2 SMR:Q6NWF2 PRIDE:Q6NWF2 GeneID:324818
            KEGG:dre:324818 InParanoid:Q6NWF2 NextBio:20808976 Bgee:Q6NWF2
            Uniprot:Q6NWF2
        Length = 330

 Score = 552 (199.4 bits), Expect = 2.4e-53, P = 2.4e-53
 Identities = 120/312 (38%), Positives = 174/312 (55%)

Query:    18 EKH-EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQE 76
             E H + WMS+Y K Y+   E  +R +IF +N + I+  N  GN  + + +N+F+D T  E
Sbjct:    27 EYHFKSWMSQYNKKYEI-NEFYQRLQIFLENKKRIDQHNE-GNHKFSMGLNQFSDMTFAE 84

Query:    77 FKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA-VTPIKNQGPCGSCWAFSA 135
             FK        P   ++ +G       +   P  +DWR  G  +T +KNQGPCGSCW FS 
Sbjct:    85 FKKTYL-LTEPQNCSATRGNHVSSNGLY--PDAIDWRTKGHYITDVKNQGPCGSCWTFST 141

Query:   136 VAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYP 195
                 E +T + TGKL+ L+EQ+L+ C     +HGC GG    AF++I++N G+ TE +YP
Sbjct:   142 TGCLESVTAIATGKLLQLAEQQLIDCAGDFDNHGCNGGLPSHAFEYIMYNKGLMTEDDYP 201

Query:   196 YQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN-QPVAVSIDASGSAFQFYSS 254
             YQA  G C    + +  A +K    +    E  ++ AVA   PV+ + + + S F  Y  
Sbjct:   202 YQAKGGQCRFKPQLA-AAFVKEVVNITKYDEMGMVDAVARLNPVSFAYEVT-SDFMHYKD 259

Query:   255 GVFTG-DCGTELD---HGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKE 310
             G++T  +C    D   H V AVGY A  NGT YW+VKNSWGT+WG +GY  ++R     +
Sbjct:   260 GIYTSTECHNTTDMVNHAVLAVGY-AEENGTPYWIVKNSWGTNWGIKGYFYIERG----K 314

Query:   311 GLCGIAMDSSYP 322
              +CG+A  SSYP
Sbjct:   315 NMCGLAACSSYP 326


>UNIPROTKB|F6R7P5
            symbol:CTSH "Uncharacterized protein" species:9544 "Macaca
            mulatta" [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=ISS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0004252
            "serine-type endopeptidase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005829 "cytosol" evidence=ISS] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0010815 "bradykinin catabolic
            process" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0016505 "apoptotic protease
            activator activity" evidence=ISS] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=ISS] [GO:0030335
            "positive regulation of cell migration" evidence=ISS] [GO:0031638
            "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0032526 "response to retinoic
            acid" evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            GO:GO:0043066 GO:GO:0005615 GO:GO:0008284 GO:GO:0070371
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458 CTD:1512
            OMA:STSCHKT GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 RefSeq:XP_001108862.1
            UniGene:Mmu.3000 Ensembl:ENSMMUT00000014095 GeneID:711437
            KEGG:mcc:711437 NextBio:19969972 Uniprot:F6R7P5
        Length = 335

 Score = 551 (199.0 bits), Expect = 3.0e-53, P = 3.0e-53
 Identities = 120/320 (37%), Positives = 176/320 (55%)

Query:    10 KLQEASLSEKH-EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINE 68
             +L   SL + H + WMSK+ K Y   EE   R + F  N   I + N  GN  +K+++N+
Sbjct:    24 ELSVNSLEKFHFKSWMSKHHKTYST-EEYHHRMQTFASNWRKINAHNN-GNHTFKMALNQ 81

Query:    69 FADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA-VTPIKNQGPC 127
             F+D +  E K  +  +  P   ++ K    +       P +MDWRK G  V+P+KNQG C
Sbjct:    82 FSDMSFAEIK-HKYLWSEPQNCSATKSNYLRGTG--PYPPSMDWRKKGNFVSPVKNQGAC 138

Query:   128 GSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDG 187
             GSCW FS   A E    + TGK++SL+EQ+LV C     +HGC+GG    AF++I++N G
Sbjct:   139 GSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKG 198

Query:   188 ITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVA-NQPVAVSIDASG 246
             I  E  YPYQ  DG C K      +  +K    +    EEA+++AVA   PV+ + + + 
Sbjct:   199 IMGEDTYPYQGKDGDC-KFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQ 257

Query:   247 SAFQFYSSGVFTG-DCGT---ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRM 302
               F  Y +G+++   C     +++H V AVGYG   NG  YW+VKNSWG  WG  GY  +
Sbjct:   258 D-FMIYKTGIYSSTSCHKTPDKVNHAVLAVGYGEE-NGIPYWIVKNSWGPQWGMNGYFLI 315

Query:   303 KRDIDAKEGLCGIAMDSSYP 322
             +R     + +CG+A  +SYP
Sbjct:   316 ERG----KNMCGLAACASYP 331


>FB|FBgn0250848
            symbol:26-29-p "26-29kD-proteinase" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005811
            "lipid particle" evidence=IDA] [GO:0005875 "microtubule associated
            complex" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005875 EMBL:AE014296 GO:GO:0005811 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 MEROPS:I29.003 HSSP:O65039
            EMBL:AY122222 EMBL:AB011376 RefSeq:NP_620470.1 UniGene:Dm.3049
            SMR:Q9V3U6 MINT:MINT-890485 STRING:Q9V3U6
            EnsemblMetazoa:FBtr0075766 GeneID:39547 KEGG:dme:Dmel_CG8947
            UCSC:CG8947-RA CTD:39547 FlyBase:FBgn0250848 InParanoid:Q9V3U6
            OMA:IHSKNRA OrthoDB:EOG4BVQ8T GenomeRNAi:39547 NextBio:814210
            Uniprot:Q9V3U6
        Length = 549

 Score = 550 (198.7 bits), Expect = 3.8e-53, P = 3.8e-53
 Identities = 125/306 (40%), Positives = 173/306 (56%)

Query:    26 KYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGYR 85
             K+G  Y +  E E R  IF+ N+ +I S N A    Y L++N  AD+T +E KA R GY+
Sbjct:   251 KHGVAYHSDTEHEHRKNIFRQNLRYIHSKNRA-KLTYTLAVNHLADKTEEELKA-RRGYK 308

Query:    86 RPDGLTSRKGTSFKYE--NVID-VPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGI 142
                G+ +  G  F Y+     D +P   DWR  GAVTP+K+Q  CGSCW+F  +   EG 
Sbjct:   309 S-SGIYNT-GKPFPYDVPKYKDEIPDQYDWRLYGAVTPVKDQSVCGSCWSFGTIGHLEGA 366

Query:   143 TQLTTG-KLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANY-PYQAVD 200
               L  G  L+ LS+Q L+ C  +  ++GC+GGE    +++++ + G+ TE  Y PY   D
Sbjct:   367 FFLKNGGNLVRLSQQALIDCSWAYGNNGCDGGEDFRVYQWMLQSGGVPTEEEYGPYLGQD 426

Query:   201 GTCNKTNEASHVAKIKGYETVPANSEEAL-LKAVANQPVAVSIDASGSAFQFYSSGVF-T 258
             G C+  N  + VA IKG+  V +N   A  L  + + P++V+IDAS   F FYS GV+  
Sbjct:   427 GYCH-VNNVTLVAPIKGFVNVTSNDPNAFKLALLKHGPLSVAIDASPKTFSFYSHGVYYE 485

Query:   259 GDCGTE---LDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGI 315
               C  +   LDH V AVGYG+  NG  YWLVKNSW T WG +GYI M     AK+  CG+
Sbjct:   486 PTCKNDVDGLDHAVLAVGYGSI-NGEDYWLVKNSWSTYWGNDGYILMS----AKKNNCGV 540

Query:   316 AMDSSY 321
                 +Y
Sbjct:   541 MTMPTY 546


>UNIPROTKB|F1NEC8
            symbol:CTSL2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:AADN02067812 IPI:IPI00820956 Ensembl:ENSGALT00000037988
            ArrayExpress:F1NEC8 Uniprot:F1NEC8
        Length = 218

 Score = 550 (198.7 bits), Expect = 3.8e-53, P = 3.8e-53
 Identities = 109/219 (49%), Positives = 142/219 (64%)

Query:   107 PATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGV 166
             P ++DWR+ G VTP+K+QG CGSCWAFS   A EG     TGKL+SLSEQ LV C     
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEG 61

Query:   167 DHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSE 226
             + GC GG M+ AF+++  N GI +E +YPY A D    +     + A   G+  +P   E
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHE 121

Query:   227 EALLKAVANQ-PVAVSIDASGSAFQFYSSGVF-TGDCGTE-LDHGVTAVGYGATANGTKY 283
              AL+KAVA+  PV+V+IDA  S+FQFY SG++   DC +E LDHGV  VGYG   +G KY
Sbjct:   122 RALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFE-DGKKY 180

Query:   284 WLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
             W+VKNSWG  WG++GYI M +D   ++  CGIA  +SYP
Sbjct:   181 WIVKNSWGEKWGDKGYIYMAKD---RKNHCGIATAASYP 216


>RGD|621513
            symbol:Ctss "cathepsin S" species:10116 "Rattus norvegicus"
            [GO:0001656 "metanephros development" evidence=IEP] [GO:0002250
            "adaptive immune response" evidence=ISO] [GO:0005764 "lysosome"
            evidence=IEA;ISO] [GO:0006508 "proteolysis" evidence=IEA;ISO]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0009986 "cell
            surface" evidence=IDA] [GO:0016020 "membrane" evidence=ISO]
            [GO:0043231 "intracellular membrane-bounded organelle"
            evidence=ISO] [GO:0045453 "bone resorption" evidence=IMP]
            [GO:0051930 "regulation of sensory perception of pain"
            evidence=IMP] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            RGD:621513 GO:GO:0009986 GO:GO:0051930 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0045453
            GO:GO:0001656 HOVERGEN:HBG011513 CTD:1520 KO:K01368 MEROPS:I29.004
            BRENDA:3.4.22.27 EMBL:L03201 IPI:IPI00210228 PIR:A45087
            RefSeq:NP_059016.1 UniGene:Rn.11347 ProteinModelPortal:Q02765
            PhosphoSite:Q02765 PRIDE:Q02765 GeneID:50654 KEGG:rno:50654
            UCSC:RGD:621513 ChEMBL:CHEMBL1075217 NextBio:610462
            Genevestigator:Q02765 Uniprot:Q02765
        Length = 330

 Score = 550 (198.7 bits), Expect = 3.8e-53, P = 3.8e-53
 Identities = 132/321 (41%), Positives = 184/321 (57%)

Query:    12 QEASLSEKHEQWMSKYGKVYKNPEEKEKRFR--IFKDNVEFIESLN---AAGNKPYKLSI 66
             +  +L    + W  K  ++ +N ++ E+  R  I++ N++FI   N   + G   Y + +
Sbjct:    18 ERPTLDHHWDLW--KKTRMRRNTDQNEEDVRRLIWEKNLKFIMLHNLEHSMGMHSYSVGM 75

Query:    67 NEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGP 126
             N   D T +E   +    R P    +R GT  K  +   +P ++DWR+ G VT +K QG 
Sbjct:    76 NHMGDMTPEEVIGYMGSLRIPRPW-NRSGT-LKSSSNQTLPDSVDWREKGCVTNVKYQGS 133

Query:   127 CGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGV--DHGCEGGEMEDAFKFIIH 184
             CGSCWAFSA  A EG  +L TGKL+SLS Q LV C T     + GC GG M +AF++II 
Sbjct:   134 CGSCWAFSAEGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEAFQYII- 192

Query:   185 NDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSID 243
             +  I +EA+YPY+A+D  C   +  +  A    Y  +P   EEAL +AVA + PV+V ID
Sbjct:   193 DTSIDSEASYPYKAMDEKC-LYDPKNRAATCSRYIELPFGDEEALKEAVATKGPVSVGID 251

Query:   244 -ASGSAFQFYSSGVFTG-DCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIR 301
              AS S+F  Y SGV+    C   ++HGV  VGYG T +G  YWLVKNSWG  +G++GYIR
Sbjct:   252 DASHSSFFLYQSGVYDDPSCTENMNHGVLVVGYG-TLDGKDYWLVKNSWGLHFGDQGYIR 310

Query:   302 MKRDIDAKEGLCGIAMDSSYP 322
             M R+    +  CGIA   SYP
Sbjct:   311 MARN---NKNHCGIASYCSYP 328


>TAIR|locus:2175088
            symbol:ALP "aleurain-like protease" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0005829 "cytosol"
            evidence=RCA] [GO:0006096 "glycolysis" evidence=RCA] [GO:0006816
            "calcium ion transport" evidence=RCA] [GO:0006833 "water transport"
            evidence=RCA] [GO:0006972 "hyperosmotic response" evidence=RCA]
            [GO:0007030 "Golgi organization" evidence=RCA] [GO:0009266
            "response to temperature stimulus" evidence=RCA] [GO:0009651
            "response to salt stress" evidence=RCA] [GO:0009750 "response to
            fructose stimulus" evidence=RCA] [GO:0042744 "hydrogen peroxide
            catabolic process" evidence=RCA] [GO:0046686 "response to cadmium
            ion" evidence=RCA] [GO:0007568 "aging" evidence=IEP]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002688 GO:GO:0005773
            GO:GO:0007568 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AB011483 KO:K01366
            ProtClustDB:CLSN2689015 UniGene:At.25414 IPI:IPI00846287
            RefSeq:NP_001078774.1 ProteinModelPortal:A8MQZ1 SMR:A8MQZ1
            STRING:A8MQZ1 PRIDE:A8MQZ1 EnsemblPlants:AT5G60360.3 GeneID:836158
            KEGG:ath:AT5G60360 OMA:CGSTPMD Genevestigator:A8MQZ1 Uniprot:A8MQZ1
        Length = 361

 Score = 550 (198.7 bits), Expect = 3.8e-53, P = 3.8e-53
 Identities = 122/294 (41%), Positives = 170/294 (57%)

Query:    26 KYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGYR 85
             +YGK Y+N EE + RF IFK+N++ I S N  G   YKL +N+FAD T QEF+  + G  
Sbjct:    65 RYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLS-YKLGVNQFADLTWQEFQRTKLGAA 123

Query:    86 RPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQL 145
             +    T  KG S K      +P T DWR++G V+P+K+QG CGSCW FS   A E     
Sbjct:   124 QNCSATL-KG-SHKVTEAA-LPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQ 180

Query:   146 TTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNK 205
               GK ISLSEQ+LV C  +  ++GC GG    AF++I  N G+ TE  YPY   D TC  
Sbjct:   181 AFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKDETCKF 240

Query:   206 TNEASHVAKIKGYETVPANSEEALLKAVAN-QPVAVSIDASGSAFQFYSSGVFTGD-CGT 263
             + E   V  +     +   +E+ L  AV   +PV+++ +   S F+ Y SGV+T   CG+
Sbjct:   241 SAENVGVQVLNSVN-ITLGAEDELKHAVGLVRPVSIAFEVIHS-FRLYKSGVYTDSHCGS 298

Query:   264 ---ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCG 314
                +++H V AVGYG   +G  YWL+KNSWG  WG++GY +M+      + +CG
Sbjct:   299 TPMDVNHAVLAVGYGVE-DGVPYWLIKNSWGADWGDKGYFKMEMG----KNMCG 347


>UNIPROTKB|P09648
            symbol:CTSL1 "Cathepsin L1" species:9031 "Gallus gallus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            OrthoDB:EOG48PMKF MEROPS:C01.032 IPI:IPI00602255 PIR:S00081
            UniGene:Gga.523 ProteinModelPortal:P09648 SMR:P09648 Uniprot:P09648
        Length = 218

 Score = 549 (198.3 bits), Expect = 4.9e-53, P = 4.9e-53
 Identities = 109/219 (49%), Positives = 141/219 (64%)

Query:   107 PATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGV 166
             P ++DWR+ G VTP+K+QG CGSCWAFS   A EG    T GKL+SLSEQ LV C     
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRPEG 61

Query:   167 DHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSE 226
             + GC GG M+ AF+++  N GI +E +YPY A D    +     + A   G+  +P   E
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHE 121

Query:   227 EALLKAVANQ-PVAVSIDASGSAFQFYSSGVF-TGDCGTE-LDHGVTAVGYGATANGTKY 283
              AL+KAVA+  PV+V+IDA  S+FQFY SG++   DC +E LDHGV  VGYG    G KY
Sbjct:   122 RALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEG-GKKY 180

Query:   284 WLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
             W+VKNSWG  WG++GYI M +D   ++  CGIA  +SYP
Sbjct:   181 WIVKNSWGEKWGDKGYIYMAKD---RKNHCGIATAASYP 216


>UNIPROTKB|G1RBY1
            symbol:CTSH "Uncharacterized protein" species:61853
            "Nomascus leucogenys" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 EMBL:ADFV01087552 RefSeq:XP_003275518.1
            Ensembl:ENSNLET00000011249 GeneID:100584322 Uniprot:G1RBY1
        Length = 335

 Score = 549 (198.3 bits), Expect = 4.9e-53, P = 4.9e-53
 Identities = 120/320 (37%), Positives = 176/320 (55%)

Query:    10 KLQEASLSEKH-EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINE 68
             +L   SL + H + WMSK+ K Y   EE   R ++F  N   I + N  GN  +K+++N+
Sbjct:    24 ELSVNSLEKFHFKSWMSKHHKTYST-EEYHHRLQMFASNWRKINAHNN-GNHTFKMALNQ 81

Query:    69 FADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA-VTPIKNQGPC 127
             F+D +  E K  +  +  P   ++ K    +       P +MDWRK G  V+P+KNQG C
Sbjct:    82 FSDMSFAEIK-HKYLWSEPQNCSATKSNYLRGTG--PYPPSMDWRKKGNFVSPVKNQGAC 138

Query:   128 GSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDG 187
             GSCW FS   A E    + TGK++SL+EQ+LV C     +HGC+GG    AF++I++N G
Sbjct:   139 GSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKG 198

Query:   188 ITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVA-NQPVAVSIDASG 246
             I  E  YPYQ  DG C K      +  +K    +    EEA+++AVA   PV+ + + + 
Sbjct:   199 IMGEDTYPYQGKDGYC-KFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQ 257

Query:   247 SAFQFYSSGVFTG-DCGT---ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRM 302
               F  Y  G+++   C     +++H V AVGYG   NG  YW+VKNSWG  WG  GY  +
Sbjct:   258 D-FMMYRRGIYSSTSCHKTPDKVNHAVLAVGYGEK-NGIPYWIVKNSWGPQWGMNGYFLI 315

Query:   303 KRDIDAKEGLCGIAMDSSYP 322
             +R     + +CG+A  +SYP
Sbjct:   316 ERG----KNMCGLAACASYP 331


>MGI|MGI:1349426
            symbol:Ctsj "cathepsin J" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=ISO]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0048471 "perinuclear region
            of cytoplasm" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:1349426 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF136272
            EMBL:AF158182 EMBL:AY034579 EMBL:AK005526 EMBL:AK131661
            EMBL:BC103769 IPI:IPI00126770 RefSeq:NP_036137.1 UniGene:Mm.31948
            ProteinModelPortal:Q9R014 SMR:Q9R014 MEROPS:C01.038 PRIDE:Q9R014
            Ensembl:ENSMUST00000071526 GeneID:26898 KEGG:mmu:26898
            UCSC:uc007qwa.1 CTD:26898 InParanoid:Q9R014 KO:K09599
            NextBio:304745 Bgee:Q9R014 CleanEx:MM_CTSJ Genevestigator:Q9R014
            GermOnline:ENSMUSG00000055298 Uniprot:Q9R014
        Length = 334

 Score = 549 (198.3 bits), Expect = 4.9e-53, P = 4.9e-53
 Identities = 117/319 (36%), Positives = 179/319 (56%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIE---SLNAAGNKPYKLSINEF 69
             +  L  + + W +KY K Y +P+E+  R  ++++N+  I+     N+ G   + + +N+F
Sbjct:    22 DPKLDAEWKDWKTKYAKSY-SPKEEALRRAVWEENMRMIKLHNKENSLGKNNFTMKMNKF 80

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
              DQT++EF+   +    P  +T     +      I +P   DWR+ G VTP++NQG CGS
Sbjct:    81 GDQTSEEFRKSIDNIPIPAAMTDPHAQNHVS---IGLPDYKDWREEGYVTPVRNQGKCGS 137

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CWAF+A  A EG     TG L  LS Q L+ C  +  + GC+ G    AF++++ N G+ 
Sbjct:   138 CWAFAAAGAIEGQMFWKTGNLTPLSVQNLLDCSKTVGNKGCQSGTAHQAFEYVLKNKGLE 197

Query:   190 TEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN-QPVAVSIDASGSA 248
              EA YPY+  DG C   +E +  A I  Y  +P N E  L  AVA+  PV+ +IDAS  +
Sbjct:   198 AEATYPYEGKDGPCRYRSENAS-ANITDYVNLPPN-ELYLWVAVASIGPVSAAIDASHDS 255

Query:   249 FQFYSSGVF-TGDCGTE-LDHGVTAVGYGATAN---GTKYWLVKNSWGTSWGEEGYIRMK 303
             F+FY+ G++   +C +  ++H V  VGYG+  +   G  YWL+KNSWG  WG  GY+++ 
Sbjct:   256 FRFYNGGIYYEPNCSSYFVNHAVLVVGYGSEGDVKDGNNYWLIKNSWGEEWGMNGYMQIA 315

Query:   304 RDIDAKEGLCGIAMDSSYP 322
             +D       CGIA  +SYP
Sbjct:   316 KD---HNNHCGIASLASYP 331


>RGD|1309226
            symbol:Cts7 "cathepsin 7" species:10116 "Rattus norvegicus"
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005768 "endosome" evidence=IEA] [GO:0005794 "Golgi apparatus"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0007067
            "mitosis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IEA] [GO:0051301 "cell division" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 RGD:1309226 GO:GO:0005634
            GO:GO:0005794 GO:GO:0048471 GO:GO:0005615 GO:GO:0051301
            GO:GO:0007067 GO:GO:0005768 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 MEROPS:C01.016 CTD:56092
            GeneTree:ENSGT00560000076577 OrthoDB:EOG44QT2S EMBL:CH474032
            IPI:IPI00870531 RefSeq:NP_001099569.1 UniGene:Rn.218615
            Ensembl:ENSRNOT00000043686 GeneID:290970 KEGG:rno:290970
            UCSC:RGD:1309226 OMA:VESFNAN Uniprot:D3ZZ07
        Length = 331

 Score = 548 (198.0 bits), Expect = 6.3e-53, P = 6.3e-53
 Identities = 121/317 (38%), Positives = 179/317 (56%)

Query:    15 SLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIE---SLNAAGNKPYKLSINEFAD 71
             SL  + E+W     K Y +PEE+++R  ++++NV+ I+     N      + + +NEF D
Sbjct:    24 SLDAEWEEWKRNNAKTY-SPEEEKQRRAVWEENVKMIKWHTMQNGLWMNNFTIEMNEFGD 82

Query:    72 QTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCW 131
              T +E +   +       LT R G   +  NV  +P T+DWR  G V P+++QG CG+CW
Sbjct:    83 MTGEEMRMMTDS----SALTLRNGKHIQKRNV-KIPKTLDWRDTGCVAPVRSQGGCGACW 137

Query:   132 AFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTE 191
             AFS  A+ E      TGKLI LS Q L+ C  +  ++ C GG+   AF+++ +N G+  E
Sbjct:   138 AFSVAASIESQLFKKTGKLIPLSVQNLIDCTVTYGNNDCSGGKPYTAFQYVKNNGGLEAE 197

Query:   192 ANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSAFQ 250
             A YPY+A    C    E S V KI  +  VP N EEAL++A+    P+AV+ID S ++F+
Sbjct:   198 ATYPYEAKLRHCRYRPERS-VVKIARFFVVPRN-EEALMQALVTYGPIAVAIDGSHASFK 255

Query:   251 FYSSGVF-TGDCGTE-LDHGVTAVGYGATANGT---KYWLVKNSWGTSWGEEGYIRMKRD 305
              Y  G++    C  + LDHG+  VGYG   + +   KYWL+KNS G  WGE GY+++ RD
Sbjct:   256 RYRGGIYHEPKCRRDTLDHGLLLVGYGYEGHESENRKYWLLKNSHGEQWGERGYMKLPRD 315

Query:   306 IDAKEGLCGIAMDSSYP 322
                +   CGIA  + YP
Sbjct:   316 ---QNNYCGIASYAMYP 329


>UNIPROTKB|F7BRD4
            symbol:CTSH "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001656 "metanephros development"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0001656
            GeneTree:ENSGT00660000095458 EMBL:ACFV01158341 EMBL:ACFV01158342
            EMBL:ACFV01158343 Ensembl:ENSCJAT00000004396 Uniprot:F7BRD4
        Length = 336

 Score = 547 (197.6 bits), Expect = 8.0e-53, P = 8.0e-53
 Identities = 117/330 (35%), Positives = 181/330 (54%)

Query:     1 IAASQVTSRKLQEASLSEK-H-EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAG 58
             +  + V+ +K ++    EK H + WM+K+ K Y   EE  +R + F  N   I + N  G
Sbjct:    14 LLGTPVSKKKKKKMLALEKFHFKSWMAKHHKTYSREEEYHQRLQTFASNWRKINAHNN-G 72

Query:    59 NKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA- 117
             N  +K+++N+F+D +  E K  +  +  P   ++ K    +       P ++DWRK G  
Sbjct:    73 NHTFKMAVNQFSDMSFAEIKR-KYLWSEPQNCSATKSNYLRGTG--PYPPSVDWRKKGHF 129

Query:   118 VTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMED 177
             V+P+KNQG CGSCW FS   A E    + TGK++SL+EQ+LV C     +HGC+GG    
Sbjct:   130 VSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQ 189

Query:   178 AFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVA-NQ 236
             AF++I++N+GI  E  YPYQ  D  C K      +  +K    +    E+A+++AVA   
Sbjct:   190 AFEYILYNNGIMGEDTYPYQGKDSDC-KFQPGKAIGFVKDVANITIYDEDAMVEAVALYN 248

Query:   237 PVAVSIDASGSAFQFYSSGVFTG-DCGT---ELDHGVTAVGYGATANGTKYWLVKNSWGT 292
             PV+ + + +   F  Y  G+++   C     +++H V AVGYG   NG  YW+VKNSWG 
Sbjct:   249 PVSFAFEVTQD-FMMYKRGIYSSTSCHKTPDKVNHAVLAVGYGEE-NGIPYWIVKNSWGP 306

Query:   293 SWGEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
              WG  GY  ++R     + +CG+A  +SYP
Sbjct:   307 QWGMNGYFLIERG----KNMCGLAACASYP 332


>UNIPROTKB|F7B939
            symbol:CTSH "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458 CTD:1512
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 EMBL:ACFV01158341
            EMBL:ACFV01158342 EMBL:ACFV01158343 RefSeq:XP_002753411.1
            Ensembl:ENSCJAT00000004397 GeneID:100413104 Uniprot:F7B939
        Length = 336

 Score = 546 (197.3 bits), Expect = 1.0e-52, P = 1.0e-52
 Identities = 116/320 (36%), Positives = 176/320 (55%)

Query:    10 KLQEASLSEKH-EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINE 68
             +L   SL + H + WM+K+ K Y   EE  +R + F  N   I + N  GN  +K+++N+
Sbjct:    24 ELSVNSLEKFHFKSWMAKHHKTYSREEEYHQRLQTFASNWRKINAHNN-GNHTFKMAVNQ 82

Query:    69 FADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA-VTPIKNQGPC 127
             F+D +  E K  +  +  P   ++ K    +       P ++DWRK G  V+P+KNQG C
Sbjct:    83 FSDMSFAEIKR-KYLWSEPQNCSATKSNYLRGTG--PYPPSVDWRKKGHFVSPVKNQGAC 139

Query:   128 GSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDG 187
             GSCW FS   A E    + TGK++SL+EQ+LV C     +HGC+GG    AF++I++N+G
Sbjct:   140 GSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNNG 199

Query:   188 ITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVA-NQPVAVSIDASG 246
             I  E  YPYQ  D  C K      +  +K    +    E+A+++AVA   PV+ + + + 
Sbjct:   200 IMGEDTYPYQGKDSDC-KFQPGKAIGFVKDVANITIYDEDAMVEAVALYNPVSFAFEVTQ 258

Query:   247 SAFQFYSSGVFTG-DCGT---ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRM 302
               F  Y  G+++   C     +++H V AVGYG   NG  YW+VKNSWG  WG  GY  +
Sbjct:   259 D-FMMYKRGIYSSTSCHKTPDKVNHAVLAVGYGEE-NGIPYWIVKNSWGPQWGMNGYFLI 316

Query:   303 KRDIDAKEGLCGIAMDSSYP 322
             +R     + +CG+A  +SYP
Sbjct:   317 ERG----KNMCGLAACASYP 332


>UNIPROTKB|D3ZZR3
            symbol:D3ZZR3 "Uncharacterized protein" species:10116
            "Rattus norvegicus" [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0016020 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0002250 GeneTree:ENSGT00560000076577 GO:GO:0097067
            OrthoDB:EOG4JM7Q2 IPI:IPI00210228 PRIDE:D3ZZR3
            Ensembl:ENSRNOT00000028732 Uniprot:D3ZZR3
        Length = 331

 Score = 545 (196.9 bits), Expect = 1.3e-52, P = 1.3e-52
 Identities = 131/320 (40%), Positives = 176/320 (55%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEF 69
             E  L    + W   + K YK+  E++ R  I++ N++FI   N   + G   Y + +N  
Sbjct:    18 ERPLDHHWDLWKKTHEKEYKDQNEEDVRRLIWEKNLKFIMLHNLEHSMGMHSYSVGMNHM 77

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVI-DVPATMDW--RKNGAVTPIKNQGP 126
              D   +         R P     RK       +V  ++PA + W  R  G    +  QG 
Sbjct:    78 GDMVAETIIGEMGSERLP---RKRKALGLIPSSVNQNLPAGVKWKERTKGCWKNLVFQGS 134

Query:   127 CGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGV--DHGCEGGEMEDAFKFIIH 184
             CGSCWAFSAV A EG  +L TGKL+SLS Q LV C T     + GC GG M +AF++II 
Sbjct:   135 CGSCWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEAFQYIID 194

Query:   185 NDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSID 243
             N GI +EA+YPY+A+D  C+  +  +  A    Y  +P   EEAL +AVA + PV+V ID
Sbjct:   195 NGGIDSEASYPYKAMDEKCHY-DPKNRAATCSRYIELPFGDEEALKEAVATKGPVSVGID 253

Query:   244 ASGSAFQFYSSGVFTG-DCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRM 302
             AS S+F  Y SGV+    C   ++HGV  VGYG T +G  YWLVKNSWG  +G++GYIRM
Sbjct:   254 ASHSSFFLYQSGVYDDPSCTENVNHGVLVVGYG-TLDGKDYWLVKNSWGLHFGDQGYIRM 312

Query:   303 KRDIDAKEGLCGIAMDSSYP 322
              R+    +  CGIA   SYP
Sbjct:   313 ARN---NKNHCGIASYCSYP 329


>UNIPROTKB|P09668
            symbol:CTSH "Pro-cathepsin H" species:9606 "Homo sapiens"
            [GO:0001520 "outer dense fiber" evidence=IEA] [GO:0001669
            "acrosomal vesicle" evidence=IEA] [GO:0007283 "spermatogenesis"
            evidence=IEA] [GO:0030984 "kininogen binding" evidence=IEA]
            [GO:0032403 "protein complex binding" evidence=IEA] [GO:0043621
            "protein self-association" evidence=IEA] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0031648 "protein destabilization"
            evidence=IMP] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=ISS] [GO:0032526 "response to retinoic acid"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0030108 "HLA-A
            specific activating MHC class I receptor activity" evidence=IDA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEP] [GO:0010813 "neuropeptide catabolic process"
            evidence=IDA] [GO:0010815 "bradykinin catabolic process"
            evidence=IDA] [GO:0030335 "positive regulation of cell migration"
            evidence=IDA] [GO:0070371 "ERK1 and ERK2 cascade" evidence=IDA]
            [GO:0010628 "positive regulation of gene expression" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IDA;TAS] [GO:0031638 "zymogen
            activation" evidence=IDA] [GO:0016505 "apoptotic protease activator
            activity" evidence=IDA] [GO:0010952 "positive regulation of
            peptidase activity" evidence=IDA] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=ISS] [GO:0004252 "serine-type endopeptidase
            activity" evidence=ISS] [GO:0043066 "negative regulation of
            apoptotic process" evidence=ISS] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0033619 "membrane protein proteolysis"
            evidence=IDA] [GO:0004175 "endopeptidase activity" evidence=IDA]
            [GO:0004177 "aminopeptidase activity" evidence=IDA] [GO:0005764
            "lysosome" evidence=IDA] [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0002250 "adaptive immune response" evidence=IEP]
            [GO:0019882 "antigen processing and presentation" evidence=TAS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0070324 "thyroid hormone binding" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0008284
            "positive regulation of cell proliferation" evidence=ISS]
            [GO:0045766 "positive regulation of angiogenesis" evidence=ISS]
            [GO:0008233 "peptidase activity" evidence=IDA] [GO:0097208
            "alveolar lamellar body" evidence=IDA] [GO:0043129 "surfactant
            homeostasis" evidence=IDA] [GO:0004197 "cysteine-type endopeptidase
            activity" evidence=IDA;TAS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 Reactome:REACT_6900 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913 MEROPS:C01.040 CTD:1512
            OMA:STSCHKT OrthoDB:EOG4W9J43 GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 EMBL:X16832 EMBL:AF426247 EMBL:AK314698 EMBL:AC011944
            EMBL:BC002479 EMBL:X07549 IPI:IPI00297487 PIR:S12486
            RefSeq:NP_004381.2 UniGene:Hs.148641 PDB:1BZN PDBsum:1BZN
            ProteinModelPortal:P09668 SMR:P09668 IntAct:P09668 STRING:P09668
            PhosphoSite:P09668 DMDM:288558851 PaxDb:P09668 PRIDE:P09668
            DNASU:1512 Ensembl:ENST00000220166 GeneID:1512 KEGG:hsa:1512
            UCSC:uc021srk.1 GeneCards:GC15M079213 H-InvDB:HIX0012481
            HGNC:HGNC:2535 HPA:CAB000458 HPA:HPA003524 MIM:116820
            neXtProt:NX_P09668 PharmGKB:PA27033 InParanoid:P09668
            PhylomeDB:P09668 BRENDA:3.4.22.16 ChEMBL:CHEMBL2225 GenomeRNAi:1512
            NextBio:6261 ArrayExpress:P09668 Bgee:P09668 CleanEx:HS_CTSH
            Genevestigator:P09668 GermOnline:ENSG00000103811 GO:GO:0019882
            Uniprot:P09668
        Length = 335

 Score = 544 (196.6 bits), Expect = 1.7e-52, P = 1.7e-52
 Identities = 118/315 (37%), Positives = 174/315 (55%)

Query:    15 SLSEKH-EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQT 73
             SL + H + WMSK+ K Y   EE   R + F  N   I + N  GN  +K+++N+F+D +
Sbjct:    29 SLEKFHFKSWMSKHRKTYST-EEYHHRLQTFASNWRKINAHNN-GNHTFKMALNQFSDMS 86

Query:    74 NQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA-VTPIKNQGPCGSCWA 132
               E K  +  +  P   ++ K    +       P ++DWRK G  V+P+KNQG CGSCW 
Sbjct:    87 FAEIK-HKYLWSEPQNCSATKSNYLRGTG--PYPPSVDWRKKGNFVSPVKNQGACGSCWT 143

Query:   133 FSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEA 192
             FS   A E    + TGK++SL+EQ+LV C     +HGC+GG    AF++I++N GI  E 
Sbjct:   144 FSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGED 203

Query:   193 NYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVA-NQPVAVSIDASGSAFQF 251
              YPYQ  DG C K      +  +K    +    EEA+++AVA   PV+ + + +   F  
Sbjct:   204 TYPYQGKDGYC-KFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQD-FMM 261

Query:   252 YSSGVFTG-DCGT---ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDID 307
             Y +G+++   C     +++H V AVGYG   NG  YW+VKNSWG  WG  GY  ++R   
Sbjct:   262 YRTGIYSSTSCHKTPDKVNHAVLAVGYGEK-NGIPYWIVKNSWGPQWGMNGYFLIERG-- 318

Query:   308 AKEGLCGIAMDSSYP 322
               + +CG+A  +SYP
Sbjct:   319 --KNMCGLAACASYP 331


>UNIPROTKB|O46427
            symbol:CTSH "Pro-cathepsin H" species:9823 "Sus scrofa"
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0032526 "response to retinoic acid" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0043129
            "surfactant homeostasis" evidence=ISS] [GO:0010815 "bradykinin
            catabolic process" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0005829 "cytosol"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0030335 "positive regulation of cell
            migration" evidence=ISS] [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0016505 "apoptotic protease activator
            activity" evidence=ISS] [GO:0004252 "serine-type endopeptidase
            activity" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=ISS] [GO:0031638 "zymogen activation"
            evidence=ISS] [GO:0030108 "HLA-A specific activating MHC class I
            receptor activity" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0010628 "positive regulation of gene
            expression" evidence=ISS] [GO:0070324 "thyroid hormone binding"
            evidence=ISS] [GO:0006508 "proteolysis" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0060448
            "dichotomous subdivision of terminal units involved in lung
            branching" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] [GO:0005764 "lysosome" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0004177 "aminopeptidase
            activity" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=ISS] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 MEROPS:C01.040 CTD:1512 OMA:STSCHKT
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 EMBL:AF001169
            RefSeq:NP_999094.1 UniGene:Ssc.3593 PDB:1NB3 PDB:1NB5 PDB:8PCH
            PDBsum:1NB3 PDBsum:1NB5 PDBsum:8PCH ProteinModelPortal:O46427
            SMR:O46427 Ensembl:ENSSSCT00000001983 GeneID:396969 KEGG:ssc:396969
            EvolutionaryTrace:O46427 ArrayExpress:O46427 Uniprot:O46427
        Length = 335

 Score = 542 (195.9 bits), Expect = 2.7e-52, P = 2.7e-52
 Identities = 119/319 (37%), Positives = 177/319 (55%)

Query:    11 LQEASLSEKH-EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEF 69
             L  +S  + H + WM ++ K Y + EE   R ++F  N   I + NA GN  +KL +N+F
Sbjct:    25 LAVSSFEKLHFKSWMVQHQKKY-SLEEYHHRLQVFVSNWRKINAHNA-GNHTFKLGLNQF 82

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA-VTPIKNQGPCG 128
             +D +  E +  +  +  P   ++ KG   +       P +MDWRK G  V+P+KNQG CG
Sbjct:    83 SDMSFDEIR-HKYLWSEPQNCSATKGNYLRGTG--PYPPSMDWRKKGNFVSPVKNQGSCG 139

Query:   129 SCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGI 188
             SCW FS   A E    + TGK++SL+EQ+LV C  +  +HGC+GG    AF++I +N GI
Sbjct:   140 SCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGI 199

Query:   189 TTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVA-NQPVAVSIDASGS 247
               E  YPY+  D  C K      +A +K    +  N EEA+++AVA   PV+ + + +  
Sbjct:   200 MGEDTYPYKGQDDHC-KFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFAFEVTND 258

Query:   248 AFQFYSSGVFTG-DCGT---ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMK 303
              F  Y  G+++   C     +++H V AVGYG   NG  YW+VKNSWG  WG  GY  ++
Sbjct:   259 -FLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEE-NGIPYWIVKNSWGPQWGMNGYFLIE 316

Query:   304 RDIDAKEGLCGIAMDSSYP 322
             R     + +CG+A  +SYP
Sbjct:   317 RG----KNMCGLAACASYP 331


>UNIPROTKB|Q4QRC2
            symbol:Ctsql2 "Protein Ctsql2" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 HOVERGEN:HBG011513 EMBL:CH474032
            RGD:1303225 EMBL:BC097257 IPI:IPI00421946 RefSeq:NP_001002813.2
            UniGene:Rn.128678 SMR:Q4QRC2 MEROPS:C01.111
            Ensembl:ENSRNOT00000038758 GeneID:408201 KEGG:rno:408201 CTD:408201
            InParanoid:Q4QRC2 OMA:NDEGALM NextBio:696394 Genevestigator:Q4QRC2
            Uniprot:Q4QRC2
        Length = 343

 Score = 541 (195.5 bits), Expect = 3.5e-52, P = 3.5e-52
 Identities = 123/324 (37%), Positives = 175/324 (54%)

Query:    15 SLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIE---SLNAAGNKPYKLSINEFAD 71
             SL  + ++W  KY K+Y   EE  KR  ++++NV+ IE     N+ G   Y + IN FAD
Sbjct:    24 SLDVQWQEWKMKYEKLYSPEEELLKRV-VWEENVKKIELHNRENSLGKNTYIMEINNFAD 82

Query:    72 QTNQEFKAFRNGYRRPDGLTSRK------GTSFKYENVI-D-VPATMDWRKNGAVTPIKN 123
              T++EFK    G   P   T +       G+ F       D +P ++DWRK G VT ++ 
Sbjct:    83 LTDEEFKDMITGITLPINNTMKSLWKRALGSPFPNSWYWRDALPKSIDWRKEGYVTRVRE 142

Query:   124 QGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFII 183
             QG C SCWAF    A EG     TGKL  LS Q LV C     + GC GG   +AF++++
Sbjct:   143 QGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVL 202

Query:   184 HNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSI 242
              N G+ +EA YPY+  +G C K N  +  AKI  +  +P + E+ L+ A+A + PVA  I
Sbjct:   203 QNGGLESEATYPYKGKEGLC-KYNPKNAYAKITRFVALPED-EDVLMDALATKGPVAAGI 260

Query:   243 DASGSAFQFYSSGVF-TGDCGTELDHGVTAVGYGATAN---GTKYWLVKNSWGTSWGEEG 298
                 S+ +FY  G++    C   ++H V  VGYG   N   G  YWL+KNSWG  WG +G
Sbjct:   261 HVVYSSLRFYKKGIYHEPKCNNRVNHAVLVVGYGFEGNETDGNNYWLIKNSWGKQWGLKG 320

Query:   299 YIRMKRDIDAKEGLCGIAMDSSYP 322
             Y+++ +D   +   CGIA  + YP
Sbjct:   321 YMKIAKD---RNNHCGIATFAQYP 341


>MGI|MGI:107285
            symbol:Ctsh "cathepsin H" species:10090 "Mus musculus"
            [GO:0001520 "outer dense fiber" evidence=ISO] [GO:0001669
            "acrosomal vesicle" evidence=ISO] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=IGI] [GO:0002764 "immune response-regulating
            signaling pathway" evidence=ISO] [GO:0004175 "endopeptidase
            activity" evidence=ISO;IMP] [GO:0004177 "aminopeptidase activity"
            evidence=ISO] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISO;IDA] [GO:0004252 "serine-type endopeptidase activity"
            evidence=IMP] [GO:0005615 "extracellular space" evidence=ISO]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0005829 "cytosol"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=ISO;IMP]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0008284
            "positive regulation of cell proliferation" evidence=IMP]
            [GO:0010628 "positive regulation of gene expression" evidence=ISO]
            [GO:0010634 "positive regulation of epithelial cell migration"
            evidence=IMP] [GO:0010813 "neuropeptide catabolic process"
            evidence=ISO] [GO:0010815 "bradykinin catabolic process"
            evidence=ISO] [GO:0010952 "positive regulation of peptidase
            activity" evidence=IGI;ISO] [GO:0016505 "apoptotic protease
            activator activity" evidence=IGI;ISO] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0030108 "HLA-A specific activating MHC
            class I receptor activity" evidence=ISO] [GO:0030335 "positive
            regulation of cell migration" evidence=ISO] [GO:0030984 "kininogen
            binding" evidence=ISO] [GO:0031638 "zymogen activation"
            evidence=ISO;IMP] [GO:0031648 "protein destabilization"
            evidence=ISO;IMP] [GO:0032403 "protein complex binding"
            evidence=ISO] [GO:0032526 "response to retinoic acid" evidence=IDA]
            [GO:0033619 "membrane protein proteolysis" evidence=ISO;IMP]
            [GO:0035085 "cilium axoneme" evidence=ISO] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IMP] [GO:0043129
            "surfactant homeostasis" evidence=ISO] [GO:0043621 "protein
            self-association" evidence=ISO] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IMP] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=IMP]
            [GO:0070324 "thyroid hormone binding" evidence=ISO] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISO] [GO:0097208 "alveolar
            lamellar body" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:107285 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 EMBL:CH466560 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            HOVERGEN:HBG011513 GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT OrthoDB:EOG4W9J43
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 BRENDA:3.4.22.16
            EMBL:U06119 EMBL:AK149949 EMBL:AK150583 EMBL:AK157376 EMBL:AK160026
            EMBL:Y18464 IPI:IPI00118987 RefSeq:NP_031827.2 UniGene:Mm.2277
            ProteinModelPortal:P49935 SMR:P49935 STRING:P49935 MEROPS:I29.003
            PhosphoSite:P49935 PaxDb:P49935 PRIDE:P49935
            Ensembl:ENSMUST00000034915 GeneID:13036 KEGG:mmu:13036
            InParanoid:Q3UCD6 ChEMBL:CHEMBL1949491 NextBio:282920 Bgee:P49935
            CleanEx:MM_CTSH Genevestigator:P49935 GermOnline:ENSMUSG00000032359
            Uniprot:P49935
        Length = 333

 Score = 540 (195.1 bits), Expect = 4.4e-52, P = 4.4e-52
 Identities = 114/308 (37%), Positives = 177/308 (57%)

Query:    21 EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAF 80
             + WM ++ K Y + E    R ++F +N   I++ N   N  +K+++N+F+D +  E K  
Sbjct:    34 KSWMKQHQKTYSSVEYNH-RLQMFANNWRKIQAHNQR-NHTFKMALNQFSDMSFAEIK-H 90

Query:    81 RNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNG-AVTPIKNQGPCGSCWAFSAVAAT 139
             +  +  P   ++ K    +       P++MDWRK G  V+P+KNQG CGSCW FS   A 
Sbjct:    91 KFLWSEPQNCSATKSNYLRGTG--PYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGAL 148

Query:   140 EGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAV 199
             E    + +GK++SL+EQ+LV C  +  +HGC+GG    AF++I++N GI  E +YPY   
Sbjct:   149 ESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYILYNKGIMEEDSYPYIGK 208

Query:   200 DGTCNKTNEASHVAKIKGYETVPANSEEALLKAVA-NQPVAVSIDASGSAFQFYSSGVFT 258
             D +C + N    VA +K    +  N E A+++AVA   PV+ + + +   F  Y SGV++
Sbjct:   209 DSSC-RFNPQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTED-FLMYKSGVYS 266

Query:   259 G-DCGT---ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCG 314
                C     +++H V AVGYG   NG  YW+VKNSWG+ WGE GY  ++R     + +CG
Sbjct:   267 SKSCHKTPDKVNHAVLAVGYGEQ-NGLLYWIVKNSWGSQWGENGYFLIERG----KNMCG 321

Query:   315 IAMDSSYP 322
             +A  +SYP
Sbjct:   322 LAACASYP 329


>DICTYBASE|DDB_G0290957
            symbol:cprA "cysteine proteinase 1" species:44689
            "Dictyostelium discoideum" [GO:0006972 "hyperosmotic response"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0290957
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GenomeReviews:CM000154_GR GO:GO:0005764
            GO:GO:0006972 EMBL:AAFI02000174 KO:K01376 EMBL:X02407 PIR:A22827
            RefSeq:XP_635417.1 ProteinModelPortal:P04988 MEROPS:C01.022
            GlycoSuiteDB:P04988 SWISS-2DPAGE:P04988 EnsemblProtists:DDB0201647
            GeneID:8627918 KEGG:ddi:DDB_G0290957 OMA:KISNFTM
            ProtClustDB:CLSZ2429603 Uniprot:P04988
        Length = 343

 Score = 539 (194.8 bits), Expect = 5.6e-52, P = 5.6e-52
 Identities = 129/329 (39%), Positives = 174/329 (52%)

Query:     6 VTSRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN--AAGNKP-Y 62
             V+SR +     S+  E +  K+ K Y + EE  +RF IFK N+  IE LN  A  +K   
Sbjct:    16 VSSRGIPLEEQSQFLE-FQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADT 73

Query:    63 KLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVID-VPATMDWRKNGAVTPI 121
             K  +N+FAD ++ EFK +    +            +  +  I+ +P   DWR  GAVTP+
Sbjct:    74 KFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPV 133

Query:   122 KNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCD--------TSGVDHGCEGG 173
             KNQG CGSCW+FS     EG   ++  KL+SLSEQ LV CD            D GC GG
Sbjct:   134 KNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEQACDEGCNGG 193

Query:   174 EMEDAFKFIIHNDGITTEANYPYQAVDGT-CNKTNEASHVAKIKGYETVPANSEEALLKA 232
                +A+ +II N GI TE++YPY A  GT CN  N A+  AKI  +  +P N        
Sbjct:   194 LQPNAYNYIIKNGGIQTESSYPYTAETGTQCN-FNSANIGAKISNFTMIPKNETVMAGYI 252

Query:   233 VANQPVAVSIDASGSAFQFYSSGVFTGDCG-TELDHGVTAVGYGAT----ANGTKYWLVK 287
             V+  P+A++ DA    +QFY  GVF   C    LDHG+  VGY A          YW+VK
Sbjct:   253 VSTGPLAIAADAV--EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVK 310

Query:   288 NSWGTSWGEEGYIRMKRDIDAKEGLCGIA 316
             NSWG  WGE+GYI ++R     +  CG++
Sbjct:   311 NSWGADWGEQGYIYLRRG----KNTCGVS 335


>FB|FBgn0260462
            symbol:CG12163 species:7227 "Drosophila melanogaster"
            [GO:0035071 "salivary gland cell autophagic cell death"
            evidence=IEP] [GO:0048102 "autophagic cell death" evidence=IEP]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0004869 "cysteine-type
            endopeptidase inhibitor activity" evidence=IEA] [GO:0045169
            "fusome" evidence=IDA] [GO:0035220 "wing disc development"
            evidence=IGI] [GO:0022416 "chaeta development" evidence=IGI]
            InterPro:IPR000010 InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00043 SMART:SM00645
            InterPro:IPR000169 EMBL:AE014297 GO:GO:0004869 eggNOG:COG4870
            KO:K01373 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0022416 GO:GO:0035220 GO:GO:0035071
            GO:GO:0045169 GeneTree:ENSGT00660000095458 EMBL:AY121614
            EMBL:BT003231 RefSeq:NP_649521.1 RefSeq:NP_730901.1
            RefSeq:NP_730902.2 UniGene:Dm.7315 ProteinModelPortal:Q9VN93
            SMR:Q9VN93 DIP:DIP-17491N IntAct:Q9VN93 MINT:MINT-763966
            STRING:Q9VN93 MEROPS:C01.A27 PaxDb:Q9VN93
            EnsemblMetazoa:FBtr0078823 GeneID:40628 KEGG:dme:Dmel_CG12163
            UCSC:CG12163-RA FlyBase:FBgn0260462 InParanoid:Q9VN93 OMA:GPRWGEQ
            OrthoDB:EOG4CC2G9 PhylomeDB:Q9VN93 GenomeRNAi:40628 NextBio:819744
            Bgee:Q9VN93 GermOnline:CG12163 Uniprot:Q9VN93
        Length = 614

 Score = 539 (194.8 bits), Expect = 5.6e-52, P = 5.6e-52
 Identities = 122/304 (40%), Positives = 174/304 (57%)

Query:    26 KYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGYR 85
             ++G+ Y +  E++ R RIF+ N++ IE LNA      K  I EFAD T+ E+K  R G  
Sbjct:   314 RFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSEYKE-RTGLW 372

Query:    86 RPDGLTSRKGTSF---KYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGI 142
             + D   +  G++     Y    ++P   DWR+  AVT +KNQG CGSCWAFS     EG+
Sbjct:   373 QRDEAKATGGSAAVVPAYHG--ELPKEFDWRQKDAVTQVKNQGSCGSCWAFSVTGNIEGL 430

Query:   143 TQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGT 202
               + TG+L   SEQEL+ CDT+  D  C GG M++A+K I    G+  EA YPY+A    
Sbjct:   431 YAVKTGELKEFSEQELLDCDTT--DSACNGGLMDNAYKAIKDIGGLEYEAEYPYKAKKNQ 488

Query:   203 CNKTNEASHVAKIKGYETVPANSEEALLK-AVANQPVAVSIDASGSAFQFYSSGV---FT 258
             C+     SHV ++ G+  +P  +E A+ +  +AN P+++ I+A+  A QFY  GV   + 
Sbjct:   489 CHFNRTLSHV-QVAGFVDLPKGNETAMQEWLLANGPISIGINAN--AMQFYRGGVSHPWK 545

Query:   259 GDCGTE-LDHGVTAVGYGAT--ANGTK---YWLVKNSWGTSWGEEGYIRMKRDIDAKEGL 312
               C  + LDHGV  VGYG +   N  K   YW+VKNSWG  WGE+GY R+ R     +  
Sbjct:   546 ALCSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYRVYRG----DNT 601

Query:   313 CGIA 316
             CG++
Sbjct:   602 CGVS 605


>UNIPROTKB|Q3T0I2
            symbol:CTSH "Pro-cathepsin H" species:9913 "Bos taurus"
            [GO:0031638 "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISS] [GO:0010815 "bradykinin
            catabolic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0010813 "neuropeptide catabolic
            process" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0045766 "positive regulation of angiogenesis" evidence=ISS]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=ISS] [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0033619 "membrane protein proteolysis" evidence=ISS]
            [GO:0043066 "negative regulation of apoptotic process"
            evidence=ISS] [GO:0004252 "serine-type endopeptidase activity"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0016505 "apoptotic protease activator activity"
            evidence=ISS] [GO:0010952 "positive regulation of peptidase
            activity" evidence=ISS] [GO:0030108 "HLA-A specific activating MHC
            class I receptor activity" evidence=ISS] [GO:0002764 "immune
            response-regulating signaling pathway" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0070324 "thyroid
            hormone binding" evidence=ISS] [GO:0006508 "proteolysis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0097208
            "alveolar lamellar body" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004175
            "endopeptidase activity" evidence=ISS] [GO:0032526 "response to
            retinoic acid" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 EMBL:BC102386 IPI:IPI00693034
            RefSeq:NP_001029557.1 UniGene:Bt.52393 ProteinModelPortal:Q3T0I2
            SMR:Q3T0I2 STRING:Q3T0I2 MEROPS:C01.040 PRIDE:Q3T0I2
            Ensembl:ENSBTAT00000014593 GeneID:510524 KEGG:bta:510524 CTD:1512
            InParanoid:Q3T0I2 OMA:STSCHKT OrthoDB:EOG4W9J43 NextBio:20869490
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 Uniprot:Q3T0I2
        Length = 335

 Score = 539 (194.8 bits), Expect = 5.6e-52, P = 5.6e-52
 Identities = 120/329 (36%), Positives = 181/329 (55%)

Query:     1 IAASQVTSRKLQEASLSEKHEQ-WMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGN 59
             + A    + +L   SL + H Q WM ++ K Y + EE   R + F  N+  I + NA  N
Sbjct:    15 LGAPACGAAELAANSLEKFHFQSWMVQHQKKYSS-EEYYHRLQAFASNLREINAHNAR-N 72

Query:    60 KPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA-V 118
               +K+ +N+F+D +  E K  +  +  P   ++ K    +       P +MDWRK G  V
Sbjct:    73 HTFKMGLNQFSDMSFDELKR-KYLWSEPQNCSATKSNYLRGTG--PYPPSMDWRKKGNFV 129

Query:   119 TPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDA 178
             TP+KNQG CGSCW FS   A E    + TGKL  L+EQ+LV C  +  +HGC+GG    A
Sbjct:   130 TPVKNQGSCGSCWTFSTTGALESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQA 189

Query:   179 FKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVA-NQP 237
             F++I +N GI  E  YPY+  DG C K   +  +A +K    +  N EEA+++AVA + P
Sbjct:   190 FEYIRYNKGIMGEDTYPYRGQDGDC-KYQPSKAIAFVKDVANITLNDEEAMVEAVALHNP 248

Query:   238 VAVSIDASGSAFQFYSSGVFTG-DCGT---ELDHGVTAVGYGATANGTKYWLVKNSWGTS 293
             V+ + + +   F  Y  G+++   C     +++H V AVGYG    G  YW+VKNSWG +
Sbjct:   249 VSFAFEVTAD-FMMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEE-KGIPYWIVKNSWGPN 306

Query:   294 WGEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
             WG +GY  ++R     + +CG+A  +S+P
Sbjct:   307 WGMKGYFLIERG----KNMCGLAACASFP 331


>UNIPROTKB|G1SQF0
            symbol:CTSH "Uncharacterized protein" species:9986
            "Oryctolagus cuniculus" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 RefSeq:XP_002721635.1 UniGene:Ocu.7137
            Ensembl:ENSOCUT00000006138 GeneID:100101597 Uniprot:G1SQF0
        Length = 333

 Score = 539 (194.8 bits), Expect = 5.6e-52, P = 5.6e-52
 Identities = 116/315 (36%), Positives = 181/315 (57%)

Query:    15 SLSEKH-EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQT 73
             +L + H + WMS++ K Y + EE  +R + F  N   I + N  GN  +++ +N+F+D +
Sbjct:    27 NLEKFHFKSWMSQHHKKY-SAEEYPRRLQTFVRNWRKINAHNN-GNHTFQMGLNQFSDMS 84

Query:    74 NQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA-VTPIKNQGPCGSCWA 132
               E K  +  +  P   ++ K    +       P+++DWRK G  V+P+KNQG CGSCW 
Sbjct:    85 FAEIK-HKYLWTEPQNCSATKSNYLRGTG--PYPSSVDWRKKGNFVSPVKNQGACGSCWT 141

Query:   133 FSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEA 192
             FS   A E    +  GK++SL+EQ+LV C  +  +HGCEGG    AF++I++N GI  E 
Sbjct:   142 FSTTGALESAVAIAGGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQAFEYILYNKGIMGED 201

Query:   193 NYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVA-NQPVAVSIDASGSAFQF 251
             +YPY+A++G C K      +A +K    +  N EEA+++AVA   PV+ + + +    Q 
Sbjct:   202 SYPYRAMEGRC-KFQPQKAIAFVKDVANITLNDEEAMVEAVALYNPVSFAFEVTEDFMQ- 259

Query:   252 YSSGVFTG-DCGT---ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDID 307
             Y  G+++   C     +++H V AVGYG   NG  YW+VKNSWG+ WG  GY  ++R   
Sbjct:   260 YRKGIYSSTSCHKTPDKVNHAVLAVGYGEE-NGVPYWIVKNSWGSHWGMNGYFYIERG-- 316

Query:   308 AKEGLCGIAMDSSYP 322
               + +CG+A  +SYP
Sbjct:   317 --KNMCGLAACASYP 329


>RGD|2447
            symbol:Ctsh "cathepsin H" species:10116 "Rattus norvegicus"
          [GO:0001520 "outer dense fiber" evidence=IDA] [GO:0001656
          "metanephros development" evidence=IEP] [GO:0001669 "acrosomal
          vesicle" evidence=IDA] [GO:0001913 "T cell mediated cytotoxicity"
          evidence=ISO;ISS] [GO:0002250 "adaptive immune response"
          evidence=ISO] [GO:0002764 "immune response-regulating signaling
          pathway" evidence=ISO;ISS] [GO:0004175 "endopeptidase activity"
          evidence=ISO] [GO:0004177 "aminopeptidase activity" evidence=ISO;IDA]
          [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISO;ISS]
          [GO:0004252 "serine-type endopeptidase activity" evidence=ISO;ISS]
          [GO:0005615 "extracellular space" evidence=ISO;ISS;IDA] [GO:0005764
          "lysosome" evidence=ISO;ISS;IDA] [GO:0005829 "cytosol"
          evidence=ISO;ISS] [GO:0006508 "proteolysis" evidence=IEP;ISO]
          [GO:0007283 "spermatogenesis" evidence=IEP] [GO:0008233 "peptidase
          activity" evidence=ISO] [GO:0008234 "cysteine-type peptidase
          activity" evidence=ISO] [GO:0008284 "positive regulation of cell
          proliferation" evidence=ISO;ISS] [GO:0010628 "positive regulation of
          gene expression" evidence=ISO;ISS] [GO:0010634 "positive regulation
          of epithelial cell migration" evidence=ISO;ISS] [GO:0010813
          "neuropeptide catabolic process" evidence=ISO;ISS] [GO:0010815
          "bradykinin catabolic process" evidence=ISO;ISS] [GO:0010952
          "positive regulation of peptidase activity" evidence=ISO;ISS]
          [GO:0016505 "apoptotic protease activator activity" evidence=ISO;ISS]
          [GO:0030108 "HLA-A specific activating MHC class I receptor activity"
          evidence=ISO;ISS] [GO:0030335 "positive regulation of cell migration"
          evidence=ISO;ISS] [GO:0030984 "kininogen binding" evidence=IPI]
          [GO:0031638 "zymogen activation" evidence=ISO;ISS] [GO:0031648
          "protein destabilization" evidence=ISO;ISS] [GO:0032403 "protein
          complex binding" evidence=IPI] [GO:0032526 "response to retinoic
          acid" evidence=ISO;ISS] [GO:0033619 "membrane protein proteolysis"
          evidence=ISO;ISS] [GO:0035085 "cilium axoneme" evidence=IDA]
          [GO:0043066 "negative regulation of apoptotic process"
          evidence=ISO;ISS] [GO:0043129 "surfactant homeostasis"
          evidence=ISO;ISS] [GO:0043621 "protein self-association"
          evidence=IDA] [GO:0045766 "positive regulation of angiogenesis"
          evidence=ISO;ISS] [GO:0060448 "dichotomous subdivision of terminal
          units involved in lung branching" evidence=ISO;ISS] [GO:0070324
          "thyroid hormone binding" evidence=ISO;ISS] [GO:0070371 "ERK1 and
          ERK2 cascade" evidence=ISO;ISS] [GO:0097067 "cellular response to
          thyroid hormone stimulus" evidence=ISO;IEP] [GO:0097208 "alveolar
          lamellar body" evidence=ISO;ISS;IDA] InterPro:IPR000668
          InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
          InterPro:IPR000169 RGD:2447 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
          GO:GO:0008284 GO:GO:0070371 GO:GO:0001669 eggNOG:COG4870
          HOGENOM:HOG000230774 InterPro:IPR025661 InterPro:IPR025660
          InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
          PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0007283
          GO:GO:0045766 GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
          GO:GO:0043621 GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 KO:K01366
          GO:GO:0016505 GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
          HOVERGEN:HBG011513 GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
          GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT OrthoDB:EOG4W9J43
          GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
          GO:GO:0010813 GO:GO:0043129 MEROPS:I29.003 EMBL:Y00708 EMBL:BC085352
          EMBL:M38135 IPI:IPI00212809 PIR:S00211 RefSeq:NP_037071.1
          UniGene:Rn.1997 ProteinModelPortal:P00786 SMR:P00786 STRING:P00786
          PRIDE:P00786 Ensembl:ENSRNOT00000019285 GeneID:25425 KEGG:rno:25425
          UCSC:RGD:2447 InParanoid:P00786 BindingDB:P00786 NextBio:606599
          Genevestigator:P00786 GermOnline:ENSRNOG00000014064 GO:GO:0035086
          GO:GO:0001520 Uniprot:P00786
        Length = 333

 Score = 539 (194.8 bits), Expect = 5.6e-52, P = 5.6e-52
 Identities = 113/306 (36%), Positives = 176/306 (57%)

Query:    23 WMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRN 82
             WM ++ K Y +  E   R ++F +N   I++ N   N  +K+ +N+F+D +  E K  + 
Sbjct:    36 WMKQHQKTYSS-REYSHRLQVFANNWRKIQAHNQR-NHTFKMGLNQFSDMSFAEIK-HKY 92

Query:    83 GYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNG-AVTPIKNQGPCGSCWAFSAVAATEG 141
              +  P   ++ K    +       P++MDWRK G  V+P+KNQG CGSCW FS   A E 
Sbjct:    93 LWSEPQNCSATKSNYLRGTG--PYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALES 150

Query:   142 ITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDG 201
                + +GK+++L+EQ+LV C  +  +HGC+GG    AF++I++N GI  E +YPY   +G
Sbjct:   151 AVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIGKNG 210

Query:   202 TCNKTNEASHVAKIKGYETVPANSEEALLKAVA-NQPVAVSIDASGSAFQFYSSGVFTGD 260
              C K N    VA +K    +  N E A+++AVA   PV+ + + +   F  Y SGV++ +
Sbjct:   211 QC-KFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTED-FMMYKSGVYSSN 268

Query:   261 -CGT---ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIA 316
              C     +++H V AVGYG   NG  YW+VKNSWG++WG  GY  ++R     + +CG+A
Sbjct:   269 SCHKTPDKVNHAVLAVGYGEQ-NGLLYWIVKNSWGSNWGNNGYFLIERG----KNMCGLA 323

Query:   317 MDSSYP 322
               +SYP
Sbjct:   324 ACASYP 329


>UNIPROTKB|G3R9A7
            symbol:CTSH "Uncharacterized protein" species:9595 "Gorilla
            gorilla gorilla" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 OMA:STSCHKT GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 RefSeq:XP_004056662.1 Ensembl:ENSGGOT00000012331
            GeneID:101144312 Uniprot:G3R9A7
        Length = 335

 Score = 538 (194.4 bits), Expect = 7.2e-52, P = 7.2e-52
 Identities = 115/306 (37%), Positives = 169/306 (55%)

Query:    23 WMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRN 82
             WMSK+ K Y   EE   R + F  N   I + N  GN  +K+++N+F+D +  E K  + 
Sbjct:    38 WMSKHRKTYST-EEYHHRLQTFASNWRKINAHNN-GNHTFKMALNQFSDMSFAEIK-HKY 94

Query:    83 GYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA-VTPIKNQGPCGSCWAFSAVAATEG 141
              +  P   ++ K    +       P ++DWRK G  V+P+KNQG CGSCW FS   A E 
Sbjct:    95 LWSEPQNCSATKSNYLRGTG--PYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALES 152

Query:   142 ITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDG 201
                + TGK++SL+EQ+LV C     +HGC+GG    AF++I++N GI  E  YPYQ  DG
Sbjct:   153 AIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDG 212

Query:   202 TCNKTNEASHVAKIKGYETVPANSEEALLKAVA-NQPVAVSIDASGSAFQFYSSGVFTG- 259
              C K      +  +K    +    EEA+++AVA   PV+ + + +   F  Y +G+++  
Sbjct:   213 YC-KFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQD-FMMYRTGIYSST 270

Query:   260 DCGT---ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIA 316
              C     +++H V AVGYG   NG  YW+VKNSWG  WG  GY  ++R     + +CG+A
Sbjct:   271 SCHKTPDKVNHAVLAVGYGEK-NGIPYWIVKNSWGPKWGMNGYFLIERG----KNMCGLA 325

Query:   317 MDSSYP 322
               +SYP
Sbjct:   326 ACASYP 331


>UNIPROTKB|F1NT07
            symbol:LOC100857883 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 EMBL:AADN02044012
            EMBL:AADN02044013 EMBL:AADN02044014 IPI:IPI00577314
            Ensembl:ENSGALT00000000192 OMA:IYKHGPV Uniprot:F1NT07
        Length = 317

 Score = 536 (193.7 bits), Expect = 1.2e-51, P = 1.2e-51
 Identities = 125/307 (40%), Positives = 173/307 (56%)

Query:    26 KYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGYR 85
             + G+ Y +  E E R RIF  ++ F+ S N A    Y L++N  AD+T QE  A R   R
Sbjct:    18 RLGRPYGSAREMEHRQRIFAHHMRFVHSKNRAALS-YSLALNHLADRTPQEMAALRG--R 74

Query:    86 RPDGLTSRKGTSFKYENV--IDVPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGIT 143
             R  G     G  F  E+   I +P ++DWR  GAVTP+K+Q  CGSCW+F+   A EG  
Sbjct:    75 RRSG-DPNHGLPFPAEHYTGIILPESLDWRMYGAVTPVKDQAVCGSCWSFATTGAMEGAL 133

Query:   144 QLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGI-TTEA--NYPYQAVD 200
              L TG L  LS+Q L+ C     ++ C+GGE   A  +I  + GI +TE+  ++P    +
Sbjct:   134 FLKTGVLTPLSQQVLIDCSWGKGNYACDGGEEWRAKGWIKKHGGIASTESPPSFPLVLQN 193

Query:   201 GTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSAFQFYSSGVF-T 258
             G C+  N++  +AKI GY  V + +  A+  A+    PVAVSIDAS   F FYS+G++  
Sbjct:   194 GLCHY-NQSEMLAKITGYVNVTSGNITAVKTAIYKHGPVAVSIDASHKTFSFYSNGIYYE 252

Query:   259 GDCGT---ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGI 315
               C     +LDH V AVGYG    G  YWL+KNSW T WG +GYI M      K+  CG+
Sbjct:   253 PKCANKPGQLDHAVLAVGYGVL-QGETYWLIKNSWSTYWGNDGYILMAM----KDNNCGV 307

Query:   316 AMDSSYP 322
             A +++YP
Sbjct:   308 ATEATYP 314


>UNIPROTKB|H9KYW5
            symbol:CTSS "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0002250 "adaptive immune response" evidence=IEA]
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 OMA:YEPACTQ EMBL:AADN02010496
            Ensembl:ENSGALT00000001122 Uniprot:H9KYW5
        Length = 245

 Score = 536 (193.7 bits), Expect = 1.2e-51, P = 1.2e-51
 Identities = 120/252 (47%), Positives = 150/252 (59%)

Query:    73 TNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWA 132
             T+++  A   G R P G      TS  Y      P  MDWR+ G VT +KNQG CG+CWA
Sbjct:     1 TSEDVAALLTGLRVPSG---HNQTS-TYRRRGGAPDAMDWREKGCVTEVKNQGACGACWA 56

Query:   133 FSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEA 192
             FSAV A E   +L TGKL+SLS Q LV C     + GC GG M  AF++II N+GI +E 
Sbjct:    57 FSAVGALEAQVKLKTGKLVSLSAQNLVDCSMMYGNKGCGGGFMTRAFQYIIDNNGIDSEE 116

Query:   193 NYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSAFQF 251
             +YPY A +GTC + N ++  A    Y  +P   E AL  AVAN  PV+V+IDA+   F  
Sbjct:   117 SYPYMAQNGTC-QYNVSTRAATCSKYVELPYADEAALKDAVANVGPVSVAIDATQPTFFL 175

Query:   252 YSSGVFTGD-CGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKE 310
             Y SGV+    C  E++HGV  VGYG T N   +WLVKNSWG  +G+ GYIRM R+  A  
Sbjct:   176 YRSGVYDDPRCTQEVNHGVLVVGYG-TLNEKDFWLVKNSWGERFGDGGYIRMSRN-HANH 233

Query:   311 GLCGIAMDSSYP 322
               CGIA  +SYP
Sbjct:   234 --CGIASYASYP 243


>RGD|1588248
            symbol:Cts8 "cathepsin 8" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1588248 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 IPI:IPI00765053
            RefSeq:NP_001121688.1 UniGene:Rn.220599 Ensembl:ENSRNOT00000061486
            GeneID:680718 KEGG:rno:680718 UCSC:RGD:1588248 CTD:56094
            OMA:DSEWQEW OrthoDB:EOG4JT07C NextBio:719350 Uniprot:D3ZP54
        Length = 333

 Score = 536 (193.7 bits), Expect = 1.2e-51, P = 1.2e-51
 Identities = 121/322 (37%), Positives = 176/322 (54%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGN---KPYKLSINEF 69
             + SL  + ++W +KY K Y   EE +KR  ++++N++ ++  N   +   K + + +N F
Sbjct:    22 DPSLDSEWQEWKTKYEKNYSLEEEGQKR-AVWEENMKVVKQHNIEYDQEKKNFTMELNAF 80

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTS---FKYENVIDVPATMDWRKNGAVTPIKNQGP 126
             AD T +EF+           L  +K      F+Y     +P  +DWR+ G VT +KNQG 
Sbjct:    81 ADMTGEEFRKMMTNIP-VQNLRKKKSIHQPIFRY-----LPKFVDWRRRGYVTSVKNQGT 134

Query:   127 CGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHND 186
             C SCWAFS   A EG     TG+L+SLS Q LV C     +HGC  G    A K++  N 
Sbjct:   135 CNSCWAFSVAGAIEGQMFRKTGRLVSLSPQNLVDCSRPEGNHGCHMGSTLYALKYVWSNG 194

Query:   187 GITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN-QPVAVSIDAS 245
             G+  E+ YPY+  +G C      S  A++ G+ TV A SEEAL+ AVA   P++V IDAS
Sbjct:   195 GLEAESTYPYEGKEGPCRYLPRRS-AARVTGFSTV-ARSEEALMHAVATIGPISVGIDAS 252

Query:   246 GSAFQFYSSGVFTGD-CGTE-LDHGVTAVGYG---ATANGTKYWLVKNSWGTSWGEEGYI 300
               +F+FY  G++    C +  ++H V  VGYG     ++G KYWL+KNS G  WG  GY+
Sbjct:   253 HVSFRFYRRGIYYEPRCSSNRINHSVLVVGYGYEGRESDGRKYWLIKNSHGVGWGMNGYM 312

Query:   301 RMKRDIDAKEGLCGIAMDSSYP 322
             ++ R  +     CGIA    YP
Sbjct:   313 KLARGWNNH---CGIATYGFYP 331


>UNIPROTKB|F6X9C1
            symbol:CTSH "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00660000095458
            OMA:STSCHKT Ensembl:ENSCAFT00000036196 EMBL:AAEX03002388
            Uniprot:F6X9C1
        Length = 305

 Score = 534 (193.0 bits), Expect = 1.9e-51, P = 1.9e-51
 Identities = 116/308 (37%), Positives = 173/308 (56%)

Query:    21 EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAF 80
             + W  ++ K Y + EE  +R + F  N   I + NA GN  +K+ +N+F+D    E K  
Sbjct:     6 KSWAVQHQKKYSS-EEYLQRLQTFVGNWRKINAHNA-GNHTFKMGLNQFSDMNFAEIK-H 62

Query:    81 RNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA-VTPIKNQGPCGSCWAFSAVAAT 139
             +  +  P   ++ KG   +       P  +DWRK G  V+P+KNQG CGSCW FS   A 
Sbjct:    63 KYLWSEPQNCSATKGNYLRGTG--PYPPFVDWRKKGKFVSPVKNQGSCGSCWTFSTTGAL 120

Query:   140 EGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAV 199
             E    + +GKL+SL+EQ+LV C  +  +HGC+GG    AF++I +N GI  E +YPY+  
Sbjct:   121 ESAIAIKSGKLLSLAEQQLVDCAQNFNNHGCQGGAPLQAFEYIRYNKGIMGEDSYPYKGQ 180

Query:   200 DGTCNKTNEASHVAKIKGYETVPANSEEALLKAVA-NQPVAVSIDASGSAFQFYSSGVFT 258
             DG C K   +  +A +K    +  N E+A+++AVA   PV+ + + + S F  Y  G+++
Sbjct:   181 DGDC-KYQPSKAIAFVKDVANITINDEQAMVEAVALYNPVSFAFEVT-SDFMMYRKGIYS 238

Query:   259 G-DCGT---ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCG 314
                C     +++H V AVGYG   NG  YW+VKNSWG  WG  GY  M+R     + +CG
Sbjct:   239 STSCHKTPDKVNHAVLAVGYGEQ-NGIPYWIVKNSWGPQWGMNGYFLMERG----KNMCG 293

Query:   315 IAMDSSYP 322
             +A  +SYP
Sbjct:   294 LAACASYP 301


>UNIPROTKB|J9P7C5
            symbol:J9P7C5 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 EMBL:AAEX03010953
            Ensembl:ENSCAFT00000012925 Uniprot:J9P7C5
        Length = 321

 Score = 534 (193.0 bits), Expect = 1.9e-51, P = 1.9e-51
 Identities = 123/315 (39%), Positives = 179/315 (56%)

Query:    18 EKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAGNKPYKLSINEFADQTN 74
             ++  QW + + ++Y   EE  +R  +++ N++ IE  N   + G   + +++N F D TN
Sbjct:    22 DQRYQWKAMHRRLYGMNEEGWRR-AVWEKNMKMIELHNREYSQGKHGFTMAMNAFGDMTN 80

Query:    75 QEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAFS 134
             +EF+   NG++       +KG  F+     ++P ++DWR+ G VTP+KNQG CGSCWAFS
Sbjct:    81 EEFRQVINGFQNQK---HKKGKVFQEPLFAEIPKSVDWREKGYVTPVKNQGQCGSCWAFS 137

Query:   135 AVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANY 194
             A  A EG     TG L+ LSEQ L      G + GC GG M++AF+++  N  + +E +Y
Sbjct:   138 ATGAFEGQMFWKTGNLVPLSEQNLAQ----G-NEGCNGGLMDNAFQYVKDNRCLDSEESY 192

Query:   195 PYQAVD-GTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSAFQFY 252
             PY   D  TCN   E S  A   G+  +P   E+AL+KA+A    + V+IDA    FQFY
Sbjct:   193 PYLGRDTDTCNYKPECS-AAHDSGFVDLPQR-EKALMKAMATLGSITVAIDAGHQYFQFY 250

Query:   253 SSGV-FTGDCGT-ELDHGVTAVGYG--ATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDA 308
              S + F  DC + +LDHGV  VGYG   T +  K W+VKNSW   WG   Y++M +    
Sbjct:   251 KSSIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNK-WIVKNSWSPEWGWNSYVKMAK---G 306

Query:   309 KEGLCGIAMDSSYPT 323
             +   CGI   +SYPT
Sbjct:   307 QNNHCGITA-ASYPT 320


>DICTYBASE|DDB_G0278721
            symbol:cprD "cysteine proteinase 4" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0278721 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000024 EMBL:L36204 RefSeq:XP_641963.1
            ProteinModelPortal:P54639 SMR:P54639 MEROPS:C01.A57 PRIDE:P54639
            EnsemblProtists:DDB0214999 GeneID:8621695 KEGG:ddi:DDB_G0278721
            OMA:NAFADIT ProtClustDB:CLSZ2846820 Uniprot:P54639
        Length = 442

 Score = 533 (192.7 bits), Expect = 2.4e-51, P = 2.4e-51
 Identities = 121/283 (42%), Positives = 172/283 (60%)

Query:     8 SRKLQEASLSEKHE--QWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLS 65
             S K Q + L  ++    WM  + + Y + EE   R++IFK N++++   N+ G +   L 
Sbjct:    16 SAKQQFSELQYRNAFTNWMQAHQRTYSS-EEFNARYQIFKSNMDYVHQWNSKGGETV-LG 73

Query:    66 INEFADQTNQEFKAFRNGYRRP-DGLTSRKGTSFKYENVIDVPA-TMDWRKNGAVTPIKN 123
             +N FAD TNQE++    G   P DG ++  GT  + E +   PA T+DWR  GAVTPIKN
Sbjct:    74 LNVFADITNQEYRTTYLG--TPFDG-SALIGT--EEEKIFSTPAPTVDWRAQGAVTPIKN 128

Query:   124 QGPCGSCWAFSAVAATEGITQLTTGK---LISLSEQELVSCDTSGVDHGCEGGEMEDAFK 180
             QG CG CW+FS   +TEG   + +G    L+SLSEQ L+ C  S  ++GCEGG M  AF+
Sbjct:   129 QGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSEQNLIDCSKSYGNNGCEGGLMTLAFE 188

Query:   181 FIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAV 240
             +II+N GI TE++YPY A DG   K   ++  A+I  Y+ V + SE +L  A  N PV+V
Sbjct:   189 YIINNKGIDTESSYPYTAEDGKECKFKTSNIGAQIVSYQNVTSGSEASLQSASNNAPVSV 248

Query:   241 SIDASGSAFQFYSSGVFTGD-CG-TELDHGVTAVGYGATANGT 281
             +IDAS  +FQ Y SG++    C  T+LDHGV  VGYG+ ++ +
Sbjct:   249 AIDASNESFQLYESGIYYEPACSPTQLDHGVLVVGYGSGSSSS 291

 Score = 150 (57.9 bits), Expect = 8.0e-08, P = 8.0e-08
 Identities = 43/140 (30%), Positives = 68/140 (48%)

Query:   187 GITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLK-AVANQPVAVSIDAS 245
             G T+ ++   +A   +  K + +S   K     +  + S+      + + Q        +
Sbjct:   307 GKTSSSSSSGKASSSSSGKASSSSSSGKTSSAASSTSGSQSGSQSGSQSGQSTGSQSGQT 366

Query:   246 GSAFQFYSSGVFTGDCGTELDHGVTAVGYGAT-ANGTKYWLVKNSWGTSWGEEGYIRMKR 304
              ++ Q  +SG  +G  G+    G    G GA  A+   YW+VKNSWGTSWG +GYI M +
Sbjct:   367 SASGQASASGSGSGS-GSGSGSGS---GSGAVEASSGNYWIVKNSWGTSWGMDGYIFMSK 422

Query:   305 DIDAKEGLCGIAMDSSYPTA 324
             D   +   CGIA  +S+PTA
Sbjct:   423 D---RNNNCGIATMASFPTA 439


>DICTYBASE|DDB_G0279187
            symbol:cprG "cysteine proteinase 7" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0279187 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000030 ProtClustDB:CLSZ2846820 MEROPS:C01.081
            EMBL:U72746 RefSeq:XP_641720.2 ProteinModelPortal:Q94504 SMR:Q94504
            PRIDE:Q94504 EnsemblProtists:DDB0215005 GeneID:8621915
            KEGG:ddi:DDB_G0279187 OMA:INTETEK Uniprot:Q94504
        Length = 460

 Score = 533 (192.7 bits), Expect = 2.4e-51, P = 2.4e-51
 Identities = 119/286 (41%), Positives = 167/286 (58%)

Query:     1 IAASQVTSRK-LQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGN 59
             +  S  T+++ L E         WM  + + Y + EE   R+ IFK N++++   N  G+
Sbjct:    10 LLVSVATAKQQLSEVEYRNAFTNWMIAHQRHYSS-EEFNGRYNIFKANMDYVNEWNTKGS 68

Query:    60 KPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVT 119
             +   L +N FAD +N+E++A   G   P   +S + T  + + + D  A +DWR  GAVT
Sbjct:    69 ETV-LGLNVFADISNEEYRATYLG--TPFDASSLEMT--ESDKIFDASAQVDWRTQGAVT 123

Query:   120 PIKNQGPCGSCWAFSAVAATEGITQLTTGK--LISLSEQELVSCDTSGVDHGCEGGEMED 177
             PIKNQG CG CW+FS   ATEG   L  GK  L+SLSEQ L+ C  S  ++GCEGG M  
Sbjct:   124 PIKNQGQCGGCWSFSTTGATEGAQYLANGKKNLVSLSEQNLIDCSGSYGNNGCEGGLMTL 183

Query:   178 AFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQP 237
             AF++II+N GI TE++YPY A DG   K N  +  A++  Y  V + SE  L   V   P
Sbjct:   184 AFEYIINNKGIDTESSYPYTAEDGKKCKFNPKNVAAQLSSYVNVTSGSESDLAAKVTQGP 243

Query:   238 VAVSIDASGSAFQFYSSGVFTGD-CG-TELDHGVTAVGYGATANGT 281
              +V+IDAS  +FQ Y SG++    C  T+LDHGV AVG+G T +G+
Sbjct:   244 TSVAIDASNQSFQLYVSGIYNEPACSSTQLDHGVLAVGFG-TGSGS 288

 Score = 138 (53.6 bits), Expect = 1.9e-06, P = 1.9e-06
 Identities = 40/114 (35%), Positives = 50/114 (43%)

Query:   218 YETVPANSEEALLKAVANQPVAVSIDASGSAFQFYS-SGVFTGDCGTELDHGVTAVGYGA 276
             Y    + S+     A   Q  A S   SGS     S SG  +G          +    G+
Sbjct:   346 YSGSQSGSQSGNSGAAVKQTGAGSGSGSGSGSGSGSGSGSVSGSASGSASGSASGSSSGS 405

Query:   277 TANGT------KYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYPTA 324
              +NG        YW+VKNSWGTSWG +GYI M +        CGIA  +S PTA
Sbjct:   406 NSNGGVYPTAGDYWIVKNSWGTSWGMDGYILMTK---GNNNQCGIATMASRPTA 456


>UNIPROTKB|G3SSC1 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9785
            "Loxodonta africana" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 RefSeq:XP_003413898.1
            Ensembl:ENSLAFT00000003415 GeneID:100662496 Uniprot:G3SSC1
        Length = 335

 Score = 532 (192.3 bits), Expect = 3.1e-51, P = 3.1e-51
 Identities = 119/319 (37%), Positives = 177/319 (55%)

Query:    11 LQEASLSEKHEQ-WMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEF 69
             L  +S  + H Q WM+++ K Y + EE  +R + F  N   I + NA  N  +K+++N+F
Sbjct:    25 LSVSSYEKFHFQSWMAQHQKKYSS-EEYHQRQQTFVSNWRKINAHNAR-NHTFKMALNQF 82

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA-VTPIKNQGPCG 128
             +D T  E K  +  +  P   ++ KG   +       P  +DWRK G  V+P+KNQG CG
Sbjct:    83 SDMTFAEIKQ-KYLWSEPQNCSATKGNYLRGTG--PYPPFVDWRKKGHFVSPVKNQGACG 139

Query:   129 SCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGI 188
             SCW FS   A E    +  GKL+SL+EQ+LV C     +HGC+GG    AF++I++N GI
Sbjct:   140 SCWTFSTTGALESAIAIAGGKLLSLAEQQLVDCAKDFNNHGCQGGLPSQAFEYILYNKGI 199

Query:   189 TTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVA-NQPVAVSIDASGS 247
               E  YPY+  D  C K      +A +K    +  N EEA+++AVA   PV+ + + +  
Sbjct:   200 MGEDTYPYKGQDDVC-KFQPKKAIAFVKDVANITLNDEEAMVEAVALYNPVSFAFEVTDD 258

Query:   248 AFQFYSSGVFTG-DCGT---ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMK 303
              F  YS G+++   C     +++H V AVGYG    G  YW+VKNSWG  WG +GY  ++
Sbjct:   259 -FMKYSKGIYSSTSCHKTPDKVNHAVLAVGYGEE-KGIPYWIVKNSWGPYWGMDGYFLIE 316

Query:   304 RDIDAKEGLCGIAMDSSYP 322
             R     + +CG+A  +SYP
Sbjct:   317 RG----KNMCGLAACASYP 331


>UNIPROTKB|E9PSK9 [details] [associations]
            symbol:Ctsql2 "Protein Ctsql2" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            IPI:IPI00562656 Ensembl:ENSRNOT00000045847 RGD:1303225
            ArrayExpress:E9PSK9 Uniprot:E9PSK9
        Length = 342

 Score = 530 (191.6 bits), Expect = 5.1e-51, P = 5.1e-51
 Identities = 123/323 (38%), Positives = 171/323 (52%)

Query:    15 SLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIE---SLNAAGNKPYKLSINEFAD 71
             SL  + ++W  KY K+Y   EE  KR  ++++NV+ IE     N+ G   Y + IN FAD
Sbjct:    24 SLDVQWQEWKMKYEKLYSPEEELLKRV-VWEENVKKIELHNRENSLGKNTYIMEINNFAD 82

Query:    72 QTNQEFKAFRNGYRRPDGLTSRK------GTSFKYENVI-D-VPATMDWRKNGAVTPIKN 123
              T++EFK    G   P   T +       G+ F       D +P ++DWRK G VT ++ 
Sbjct:    83 LTDEEFKDMITGITLPINNTMKSLWKRALGSPFPNSWYWRDALPKSIDWRKEGYVTRVRE 142

Query:   124 QGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFII 183
             QG C SCWAF    A EG     TGKL  LS Q LV C     + GC GG   +AF++++
Sbjct:   143 QGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVL 202

Query:   184 HNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSI 242
              N G+ +EA YPY+  +G C K N  +  AKI  +  +P + E+ L+ A+A + PVA  I
Sbjct:   203 QNGGLESEATYPYKGKEGLC-KYNPKNAYAKITRFVALPED-EDVLMDALATKGPVAAGI 260

Query:   243 DASGSAFQFYSSGVFTGDCGTELDHGVTAVGYGATAN---GTKYWLVKNSWGTSWGEEGY 299
                 S F F S       C   ++H V  VGYG   N   G  YWL+KNSWG  WG +GY
Sbjct:   261 HVVYSYFHFVSGIYHEPKCNNRVNHAVLVVGYGFEGNETDGNNYWLIKNSWGKQWGLKGY 320

Query:   300 IRMKRDIDAKEGLCGIAMDSSYP 322
             +++ +D   +   CGIA  + YP
Sbjct:   321 MKIAKD---RNNHCGIATFAQYP 340


>ZFIN|ZDB-GENE-050208-336 [details] [associations]
            symbol:ctskl "cathepsin K, like" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-050208-336 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:BX465190
            GeneTree:ENSGT00660000095458 IPI:IPI00491185 RefSeq:XP_695425.1
            UniGene:Dr.110795 Ensembl:ENSDART00000062749 GeneID:567046
            KEGG:dre:567046 CTD:567046 NextBio:20888499 Bgee:F1QCP8
            Uniprot:F1QCP8
        Length = 349

 Score = 520 (188.1 bits), Expect = 5.8e-50, P = 5.8e-50
 Identities = 129/333 (38%), Positives = 184/333 (55%)

Query:     2 AASQVTSRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLN---AAG 58
             A  QV S   +EA    +   W  K+   Y    E   R  I++ N++ I   N   + G
Sbjct:    25 APVQVASESEEEAPT--EWNLWKKKHEISYDEESEDVHRKTIWETNMQKIWKNNNDFSFG 82

Query:    59 NKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKG--TSFKYE--NVIDVPAT-MDWR 113
                +K+++N++ D T+ E+K       +  G  +RKG  TS +    N   +  T +D+R
Sbjct:    83 LSMFKMAMNKYGDLTSVEYKRLLGS--KIKGTGNRKGKITSAQMLRLNAKRLGVTNIDYR 140

Query:   114 KNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGG 173
               G VT +K+QG CGSCW+FS   A EG     TG+L+SLSEQ+LV C  S   +GC G 
Sbjct:   141 AKGYVTEVKDQGYCGSCWSFSTTGAIEGQMYKHTGRLVSLSEQQLVDCSRSYGTYGCSGA 200

Query:   174 EMEDAFKFIIHNDGITTEANYPYQAVDGT-CNKTNEASHVAKIKGYETVPANSEEALLKA 232
              M +A+ ++I+N  + +   YPY +VD   C      + +A I  Y  VPA +E+AL  A
Sbjct:   201 WMANAYDYVINN-ALESSDTYPYTSVDTQPCFYEKNLA-MAGISDYRFVPAGNEQALADA 258

Query:   233 VANQ-PVAVSIDASGSAFQFYSSGVFT-GDCG-TELDHGVTAVGYGATANGTKYWLVKNS 289
             VA   PV+V+IDA   +F FYSSG++   +C    L+H V  VGYG+   GT YW++KNS
Sbjct:   259 VATVGPVSVAIDADNPSFLFYSSGIYKESNCNPNNLNHAVLVVGYGSE-EGTDYWIIKNS 317

Query:   290 WGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
             WGT WGE GY+RM R+    +  CGIA  + YP
Sbjct:   318 WGTGWGEGGYMRMIRN---GKNTCGIASYALYP 347


>UNIPROTKB|F7BJD8 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9796 "Equus
            caballus" [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=ISS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0004252
            "serine-type endopeptidase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005829 "cytosol" evidence=ISS] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0010815 "bradykinin catabolic
            process" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0016505 "apoptotic protease
            activator activity" evidence=ISS] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=ISS] [GO:0030335
            "positive regulation of cell migration" evidence=ISS] [GO:0031638
            "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0032526 "response to retinoic
            acid" evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            GO:GO:0043066 GO:GO:0005615 GO:GO:0008284 GO:GO:0070371
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458
            OMA:STSCHKT GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129
            Ensembl:ENSECAT00000013967 Uniprot:F7BJD8
        Length = 305

 Score = 519 (187.8 bits), Expect = 7.4e-50, P = 7.4e-50
 Identities = 113/309 (36%), Positives = 172/309 (55%)

Query:    21 EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAF 80
             + WM ++ K Y + EE   R + F  N   I + N  GN  +++ +N+F+     E K  
Sbjct:     6 KSWMVQHQKKYSS-EEYHHRLQTFVSNWRKINAHNT-GNHTFRMGLNQFSAMNFAELK-H 62

Query:    81 RNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA-VTPIKNQGPCGSCWAFSAVAAT 139
             +  +  P   ++ KG   +       P ++DWRK G  V+P+KNQG CGSCW FS   A 
Sbjct:    63 KYLWSEPQNCSATKGNYLRGAG--PYPPSVDWRKKGNFVSPVKNQGGCGSCWTFSTTGAL 120

Query:   140 EGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAV 199
             E    + +GKL+SL+EQ+LV C  +  +HGC+GG    AF++I +N GI  E  YPY+  
Sbjct:   121 ESAVAIASGKLLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKGQ 180

Query:   200 DGTCN-KTNEASHVAKIKGYETVPANSEEALLKAVA-NQPVAVSIDASGSAFQFYSSGVF 257
             DG C  + N+A  +A +K    +  N E+A+++AVA   PV+ + + +   F  Y  G++
Sbjct:   181 DGDCKFQPNKA--IAFVKDVANITLNDEKAMVEAVALYNPVSFAFEVTED-FMMYRKGIY 237

Query:   258 TG-DCGT---ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLC 313
             +   C     +++H V AVGYG   NG  YW+VKNSWG  WG  GY  ++R     + +C
Sbjct:   238 SSTSCHKTPDKVNHAVLAVGYGEE-NGIPYWIVKNSWGPHWGMNGYFLIERG----KNMC 292

Query:   314 GIAMDSSYP 322
             G+A  +SYP
Sbjct:   293 GLAACASYP 301


>TAIR|locus:2120222 [details] [associations]
            symbol:RD19 "RESPONSIVE TO DEHYDRATION 19" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009269 "response to desiccation" evidence=IEP] [GO:0006970
            "response to osmotic stress" evidence=IGI] [GO:0005634 "nucleus"
            evidence=IDA] [GO:0005773 "vacuole" evidence=IDA] [GO:0042742
            "defense response to bacterium" evidence=IMP] [GO:0006096
            "glycolysis" evidence=RCA] [GO:0006833 "water transport"
            evidence=RCA] [GO:0006972 "hyperosmotic response" evidence=RCA]
            [GO:0007030 "Golgi organization" evidence=RCA] [GO:0009266
            "response to temperature stimulus" evidence=RCA] [GO:0009651
            "response to salt stress" evidence=IEP;RCA] [GO:0046686 "response
            to cadmium ion" evidence=RCA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISS] [GO:0009414 "response to
            water deprivation" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005634 GO:GO:0005773 EMBL:CP002687
            GenomeReviews:CT486007_GR GO:GO:0009651 GO:GO:0042742
            eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            ProtClustDB:CLSN2688311 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AL035679 EMBL:AL161594 GO:GO:0004197
            MEROPS:C01.022 EMBL:D13042 EMBL:AY080598 EMBL:AY133844
            IPI:IPI00544363 PIR:JN0718 RefSeq:NP_568052.1 UniGene:At.2850
            UniGene:At.74924 ProteinModelPortal:P43296 SMR:P43296 STRING:P43296
            PaxDb:P43296 PRIDE:P43296 EnsemblPlants:AT4G39090.1 GeneID:830064
            KEGG:ath:AT4G39090 TAIR:At4g39090 InParanoid:P43296 OMA:EDFDWRD
            PhylomeDB:P43296 Genevestigator:P43296 GermOnline:AT4G39090
            Uniprot:P43296
        Length = 368

 Score = 515 (186.3 bits), Expect = 2.0e-49, P = 2.0e-49
 Identities = 124/339 (36%), Positives = 171/339 (50%)

Query:     5 QVTSRKLQEASLSEKH-EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYK 63
             QV      +   SE H   +  K+GKVY + EE + RF +FK N+               
Sbjct:    35 QVVGGAEPQVLTSEDHFSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATH 94

Query:    64 LSINEFADQTNQEFK----AFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVT 119
               + +F+D T  EF+      R+G++ P    + K      EN+   P   DWR +GAVT
Sbjct:    95 -GVTQFSDLTRSEFRKKHLGVRSGFKLPKD--ANKAPILPTENL---PEDFDWRDHGAVT 148

Query:   120 PIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCD-------TSGVDHGCEG 172
             P+KNQG CGSCW+FSA  A EG   L TGKL+SLSEQ+LV CD           D GC G
Sbjct:   149 PVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNG 208

Query:   173 GEMEDAFKFIIHNDGITTEANYPYQAVDG-TCNKTNEASHVAKIKGYETVPANSEEALLK 231
             G M  AF++ +   G+  E +YPY   DG TC K +++  VA +  +  +  + E+    
Sbjct:   209 GLMNSAFEYTLKTGGLMKEEDYPYTGKDGKTC-KLDKSKIVASVSNFSVISIDEEQIAAN 267

Query:   232 AVANQPVAVSIDASGSAFQFYSSGVFTGD-CGTELDHGVTAVGYGATANGTK------YW 284
              V N P+AV+I+A     Q Y  GV     C   L+HGV  VGYGA            YW
Sbjct:   268 LVKNGPLAVAINAG--YMQTYIGGVSCPYICTRRLNHGVLLVGYGAAGYAPARFKEKPYW 325

Query:   285 LVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYPT 323
             ++KNSWG +WGE G+ ++ +       +CG+  DS   T
Sbjct:   326 IIKNSWGETWGENGFYKICKG----RNICGV--DSMVST 358


>DICTYBASE|DDB_G0279185 [details] [associations]
            symbol:cprF "cysteine proteinase 6" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0279185 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000030 HSSP:P07711 ProtClustDB:CLSZ2846820 EMBL:U72745
            RefSeq:XP_641725.1 ProteinModelPortal:Q94503 SMR:Q94503
            MEROPS:C01.081 PRIDE:Q94503 EnsemblProtists:DDB0215002
            GeneID:8621921 KEGG:ddi:DDB_G0279185 Uniprot:Q94503
        Length = 434

 Score = 513 (185.6 bits), Expect = 3.2e-49, P = 3.2e-49
 Identities = 121/289 (41%), Positives = 171/289 (59%)

Query:     1 IAASQVTSRK-LQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGN 59
             +  S  T+++ L E         WM  + + Y + EE   RF IFK N+++I   N  G+
Sbjct:    10 LLVSVATAKQQLSELQYRNAFTNWMIAHQRHYSS-EEFNGRFNIFKANMDYINEWNTKGS 68

Query:    60 KPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVID-VPA-TMDWRKNGA 117
             +   L +N FAD TN+E++A   G   P   +S + T    E V   V A ++DWR  GA
Sbjct:    69 ETV-LGLNVFADITNEEYRATYLG--TPFDASSLEMTPS--EKVFGGVQANSVDWRAKGA 123

Query:   118 VTPIKNQGPCGSCWAFSAVAATEGITQLTTGK--LISLSEQELVSCDTSGVDHGCEGGEM 175
             VTPIKNQG CG CW+FSA  ATEG   +  G   L S+SEQ+L+ C  S  ++GCEGG M
Sbjct:   124 VTPIKNQGECGGCWSFSATGATEGAQYIANGDSDLTSVSEQQLIDCSGSYGNNGCEGGLM 183

Query:   176 EDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN 235
               AF++II+N GI TE++YP+ A    C K N ++  A++  Y  V + SE  L   V  
Sbjct:   184 TLAFEYIINNGGIDTESSYPFTANTEKC-KYNPSNIGAELSSYVNVTSGSESDLAAKVTQ 242

Query:   236 QPVAVSIDASGSAFQFYSSGVFTGD-CG-TELDHGVTAVGYGATANGTK 282
              P +V+IDAS  +FQFYSSG++    C  T+LDHGV AVG+G+ ++G++
Sbjct:   243 GPTSVAIDASQPSFQFYSSGIYNEPACSSTQLDHGVLAVGFGSGSSGSQ 291

 Score = 126 (49.4 bits), Expect = 4.0e-05, P = 4.0e-05
 Identities = 35/85 (41%), Positives = 42/85 (49%)

Query:   241 SIDASGSAFQFYSSGVFTGDC-GTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGY 299
             S+  SGSA     S  F+G   G   + G     Y    N   YW+VKNSWG  WG  GY
Sbjct:   356 SVSGSGSAS---GSSSFSGSSNGGNSNSG----DYPTDGN---YWIVKNSWGLDWGINGY 405

Query:   300 IRMKRDIDAKEGLCGIAMDSSYPTA 324
             I M +D   K+  CGIA  +S P A
Sbjct:   406 ILMSKD---KDNQCGIATMASIPQA 427


>UNIPROTKB|E9PTT3 [details] [associations]
            symbol:Ctsr "Protein Ctsr" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            IPI:IPI00627092 Ensembl:ENSRNOT00000024115 RGD:631422
            Uniprot:E9PTT3
        Length = 334

 Score = 513 (185.6 bits), Expect = 3.2e-49, P = 3.2e-49
 Identities = 115/321 (35%), Positives = 180/321 (56%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIE---SLNAAGNKPYKLSINEF 69
             + SL  +     ++Y K Y   EE  +R  ++++N++ I+     N+ G   + + +NEF
Sbjct:    22 DPSLDAEWHDXKTEYEKSYTMEEEGHRR-AVWEENMKMIKLHNRENSLGKNGFIMEMNEF 80

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDV-PATMDWRKNGAVTPIKNQGPCG 128
              D T +EF+        P   + RKG   +  +V +V P  +DWRK G VT ++NQ  C 
Sbjct:    81 GDLTAEEFRKMMVNI--PIR-SHRKGKIIRKRDVGNVLPKFVDWRKKGYVTRVQNQKFCN 137

Query:   129 SCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGI 188
             SCWAF+   A EG     TG+L  LS Q LV C  S  + GC+ G+   A++++++N G+
Sbjct:   138 SCWAFAVTGAIEGQMFNKTGQLTPLSVQNLVDCTKSQGNEGCQWGDPHIAYEYVLNNGGL 197

Query:   189 TTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN-QPVAVSIDASGS 247
               EA YPY+  +G C + N     A+I G+ ++P  SE+ L++AVA   P++V++DAS +
Sbjct:   198 EAEATYPYKGKEGVC-RYNPKHSKAEITGFVSLP-ESEDILMEAVATIGPISVAVDASFN 255

Query:   248 AFQFYSSGVFTG-DCGTE-LDHGVTAVGYGATAN---GTKYWLVKNSWGTSWGEEGYIRM 302
             +F FY  G++   +C    ++H V  VGYG   N   G  YWL+KNSWG  WG  GY+++
Sbjct:   256 SFGFYKKGLYDEPNCSNNTVNHSVLVVGYGFEGNETDGNSYWLIKNSWGRKWGLRGYMKI 315

Query:   303 KRDIDAKEGLCGIAMDSSYPT 323
              +D   +   C IA  + YPT
Sbjct:   316 PKD---QNNFCAIASYAHYPT 333


>UNIPROTKB|G3V9F8 [details] [associations]
            symbol:Ctsm "RCG24133" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015645 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 EMBL:CH474032
            PANTHER:PTHR12411:SF58 Ensembl:ENSRNOT00000045830 RGD:631420
            Uniprot:G3V9F8
        Length = 333

 Score = 507 (183.5 bits), Expect = 1.4e-48, P = 1.4e-48
 Identities = 115/326 (35%), Positives = 177/326 (54%)

Query:     6 VTSRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIE---SLNAAGNKPY 62
             ++S    +  L  + ++W  KY K Y   EE +KR  ++++N++ I+     N  G   +
Sbjct:    15 ISSSPAPDPVLDAEWQKWKIKYEKTYSLEEEGQKR-AVWEENMKKIKLHNGENGLGKHGF 73

Query:    63 KLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIK 122
              + +N F D T +EF+        P   T +K  S +    ++VP  ++WRK G VTP++
Sbjct:    74 TMEMNAFGDMTIEEFRKLMIEIPIP---TVKKENSVQKRQAVNVPNFINWRKRGYVTPVR 130

Query:   123 NQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFI 182
              QG C  CWAFS   A EG     TG+LI LS Q LV C     + GC  G    A +++
Sbjct:   131 RQGRCNVCWAFSVAGAIEGQMFQKTGQLIPLSVQNLVDCSRPQGNLGCYLGNTYLALQYV 190

Query:   183 IHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVS 241
               N G+ +EA YPY+  +G+C    + S  A I  +E VP N E+AL+ AVA   P++V+
Sbjct:   191 KENGGLESEATYPYEEKEGSCRYHPDNS-TASITDFEFVPKN-EDALMNAVATLGPISVA 248

Query:   242 IDASGSAFQFYSSGVF-TGDCGTEL-DHGVTAVGYGAT---ANGTKYWLVKNSWGTSWGE 296
             IDA   +F FY +G++   +C + +  H +  VGYG     ++G KYW++KNS G  WG 
Sbjct:   249 IDARHESFLFYRNGIYHEPNCSSSVVTHAMLLVGYGFVGEESDGRKYWILKNSMGNKWGN 308

Query:   297 EGYIRMKRDIDAKEGLCGIAMDSSYP 322
              GY+++ +D   +   CGIA  + YP
Sbjct:   309 RGYMKIAKD---QGNHCGIATYALYP 331


>FB|FBgn0034229 [details] [associations]
            symbol:CG4847 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0032504
            "multicellular organism reproduction" evidence=IEP] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0005615 "extracellular space"
            evidence=ISM;IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:AE013599 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 GO:GO:0032504 GeneTree:ENSGT00560000076599
            KO:K01371 EMBL:BT099507 RefSeq:NP_725686.1 UniGene:Dm.4677
            SMR:A1ZAU4 IntAct:A1ZAU4 MEROPS:C01.A28 EnsemblMetazoa:FBtr0086935
            GeneID:36973 KEGG:dme:Dmel_CG4847 UCSC:CG4847-RB
            FlyBase:FBgn0034229 InParanoid:A1ZAU4 OMA:GGFQEYA OrthoDB:EOG4J9KFC
            ChiTaRS:CG4847 GenomeRNAi:36973 NextBio:801302 Uniprot:A1ZAU4
        Length = 420

 Score = 506 (183.2 bits), Expect = 1.8e-48, P = 1.8e-48
 Identities = 119/312 (38%), Positives = 168/312 (53%)

Query:    23 WMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAA---GNKPYKLSINEFADQTNQEFKA 79
             ++S+ GK Y +  ++      F      +E+ NAA   G   +K ++N FAD T+ EF +
Sbjct:   115 FLSQSGKTYLSAADRALHEGAFASTKNLVEAGNAAFAQGVHTFKQAVNAFADLTHSEFLS 174

Query:    80 FRNGYRRPDGLTSRKGTSFKYENVI--DVPATMDWRKNGAVTPIKNQGPCGSCWAFSAVA 137
                G +R     +R   S K  N+    +P   DWR++G VTP+K QG CGSCWAF+   
Sbjct:   175 QLTGLKRSPEAKARAAASLKLVNLPAKPIPDAFDWREHGGVTPVKFQGTCGSCWAFATTG 234

Query:   138 ATEGITQLTTGKLISLSEQELVSC---DTSGVDHGCEGGEMEDAFKFIIH-NDGITTEAN 193
             A EG T   TG L +LSEQ LV C   +  G++ GC+GG  E AF FI     G++ E  
Sbjct:   235 AIEGHTFRKTGSLPNLSEQNLVDCGPVEDFGLN-GCDGGFQEAAFCFIDEVQKGVSQEGA 293

Query:   194 YPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSAFQFY 252
             YPY    GTC K + +   A ++G+  +P   EE L K VA   PVA S++      + Y
Sbjct:   294 YPYIDNKGTC-KYDGSKSGATLQGFAAIPPKDEEQLKKVVATLGPVACSVNGL-ETLKNY 351

Query:   253 SSGVFTGD-CGT-ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKE 310
             + G++  D C   E +H +  VGYG+   G  YW+VKNSW  +WGE+GY R+ R     +
Sbjct:   352 AGGIYNDDECNKGEPNHSILVVGYGSE-KGQDYWIVKNSWDDTWGEKGYFRLPRG----K 406

Query:   311 GLCGIAMDSSYP 322
               C IA + SYP
Sbjct:   407 NYCFIAEECSYP 418


>TAIR|locus:2050145 [details] [associations]
            symbol:AT2G21430 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002685 GenomeReviews:CT485783_GR
            EMBL:AC006841 EMBL:X74359 IPI:IPI00519637 PIR:B84601
            RefSeq:NP_565512.1 UniGene:At.14069 ProteinModelPortal:P43295
            SMR:P43295 MEROPS:C01.A04 PRIDE:P43295 EnsemblPlants:AT2G21430.1
            GeneID:816682 KEGG:ath:AT2G21430 TAIR:At2g21430 eggNOG:COG4870
            HOGENOM:HOG000230774 InParanoid:P43295 KO:K01373 OMA:GSIEEHY
            PhylomeDB:P43295 ProtClustDB:CLSN2688311 Genevestigator:P43295
            GermOnline:AT2G21430 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 Uniprot:P43295
        Length = 361

 Score = 501 (181.4 bits), Expect = 6.0e-48, P = 6.0e-48
 Identities = 117/331 (35%), Positives = 167/331 (50%)

Query:     5 QVTSRKLQEASLSEKH-EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPY- 62
             QV      +   SE H   +  K+GKVY + EE   RF +FK N+  + ++      P  
Sbjct:    32 QVVDETEPKVLSSEDHFTLFKKKFGKVYGSIEEHYYRFSVFKANL--LRAMRHQKMDPSA 89

Query:    63 KLSINEFADQTNQEFK----AFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAV 118
             +  + +F+D T  EF+      + G++ P    + +      +N+   P   DWR  GAV
Sbjct:    90 RHGVTQFSDLTRSEFRRKHLGVKGGFKLPKD--ANQAPILPTQNL---PEEFDWRDRGAV 144

Query:   119 TPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCD-------TSGVDHGCE 171
             TP+KNQG CGSCW+FS   A EG   L TGKL+SLSEQ+LV CD           D GC 
Sbjct:   145 TPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCN 204

Query:   172 GGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLK 231
             GG M  AF++ +   G+  E +YPY   DG   K + +  VA +  +  V  N ++    
Sbjct:   205 GGLMNSAFEYTLKTGGLMREKDYPYTGTDGGSCKLDRSKIVASVSNFSVVSINEDQIAAN 264

Query:   232 AVANQPVAVSIDASGSAFQFYSSGVFTGD-CGTELDHGVTAVGYG------ATANGTKYW 284
              + N P+AV+I+A+    Q Y  GV     C   L+HGV  VGYG      A      YW
Sbjct:   265 LIKNGPLAVAINAA--YMQTYIGGVSCPYICSRRLNHGVLLVGYGSAGFSQARLKEKPYW 322

Query:   285 LVKNSWGTSWGEEGYIRMKRDIDAKEGLCGI 315
             ++KNSWG SWGE G+ ++ +       +CG+
Sbjct:   323 IIKNSWGESWGENGFYKICKG----RNICGV 349


>DICTYBASE|DDB_G0272742 [details] [associations]
            symbol:DDB_G0272742 species:44689 "Dictyostelium
            discoideum" [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0272742 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639 EMBL:AAFI02000008
            eggNOG:NOG331187 RefSeq:XP_644986.1 ProteinModelPortal:Q7KWP5
            PRIDE:Q7KWP5 EnsemblProtists:DDB0168242 GeneID:8618663
            KEGG:ddi:DDB_G0272742 InParanoid:Q7KWP5 OMA:ATESAHF Uniprot:Q7KWP5
        Length = 345

 Score = 495 (179.3 bits), Expect = 2.6e-47, P = 2.6e-47
 Identities = 128/336 (38%), Positives = 186/336 (55%)

Query:    10 KLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEF 69
             KL E     +   WM+   + Y +  E   R+  FK N++FI   N+ G+K   L++NEF
Sbjct:    19 KLTEIQYRNEFTAWMTSNQRTYAS-SEFTNRYNTFKSNLDFINQWNSKGSKTV-LALNEF 76

Query:    70 ADQTNQEFKAFRNGYRRPDGLTSRKGTSF---KYENVIDVPAT-------MDWRKNGAVT 119
             AD +N+E+   R  Y R D   ++  +     K +  I   ++       +DWRK GAV 
Sbjct:    77 ADISNEEY---RKNYLRNDNNINKLSSLLINDKEDKEIKSSSSSGSGSSGIDWRKKGAVP 133

Query:   120 PIKNQ-GPCGSCWAFSAVAATEGITQLTTGK--LISLSEQELVSCDTSGVDHGCEGGEME 176
              +K+Q G CGS W  +AV ATE    L   K   ISLS Q L+ C  S ++  C  G + 
Sbjct:   134 SVKSQIGGCGS-WPITAVGATESAHFLANPKDPFISLSMQNLIDC--SNLNKQCYQGTVN 190

Query:   177 DAFKFIIHNDGITTEANYPYQAVD-GTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN 235
             +AF++II N GI +E +Y +   + G C K N ++ VAKI  YE V + SE +L  AV+ 
Sbjct:   191 EAFQYIIENGGIDSEESYKFSGGEPGKC-KYNSSNSVAKITSYEKVKSGSESSLESAVSL 249

Query:   236 QPVAVSIDASGSAFQFYSSGVF-TGDCG-TELDHGVTAVGYG--ATA------NGTKYWL 285
             +PVA  IDAS S+FQFYSSG++    C  T+L+H +  VG+   +T       + + YW+
Sbjct:   250 KPVAAYIDASLSSFQFYSSGIYYEPSCNSTDLNHSILIVGFSDFSTTPTDSLKHSSNYWI 309

Query:   286 VKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSY 321
             V+NS+G +WGE GYI M +D D     CGI+  +SY
Sbjct:   310 VQNSFGKNWGENGYIFMSKDRDDN---CGISKMASY 342


>WB|WBGene00007055
            symbol:tag-196 species:6239 "Caenorhabditis elegans"
            [GO:0004869 "cysteine-type endopeptidase inhibitor activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000010
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00031 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00043 SMART:SM00645 InterPro:IPR000169
            GO:GO:0004869 eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 EMBL:FO080488 PIR:T31871
            RefSeq:NP_505215.2 HSSP:Q9UBX1 ProteinModelPortal:O16454 SMR:O16454
            DIP:DIP-27400N IntAct:O16454 MINT:MINT-1044990 MEROPS:C01.A50
            PaxDb:O16454 EnsemblMetazoa:F41E6.6.1 EnsemblMetazoa:F41E6.6.2
            EnsemblMetazoa:F41E6.6.3 GeneID:179240 KEGG:cel:CELE_F41E6.6
            UCSC:F41E6.6.1 CTD:179240 WormBase:F41E6.6 InParanoid:O16454
            OMA:GGGLMTN NextBio:904514 Uniprot:O16454
        Length = 477

 Score = 493 (178.6 bits), Expect = 4.2e-47, P = 4.2e-47
 Identities = 117/304 (38%), Positives = 161/304 (52%)

Query:    23 WMSKYGKVYKNPEEKEKRFRIFKDNVEFIESL--NAAGNKPYKLSINEFADQTNQEFKAF 80
             ++ ++ K Y N  E  KRFR+FK N + I  L  N  G   Y     +F+D T  EFK  
Sbjct:   177 FVDRHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVY--GFTKFSDMTTMEFKKI 234

Query:    81 RNGYRRPDGLTSRKGTSFKYENVI----DVPATMDWRKNGAVTPIKNQGPCGSCWAFSAV 136
                Y+    +   +  +F+  +V     D+P + DWR+ GAVT +KNQG CGSCWAFS  
Sbjct:   235 MLPYQWEQPVYPMEQANFEKHDVTINEEDLPESFDWREKGAVTQVKNQGNCGSCWAFSTT 294

Query:   137 AATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPY 196
                EG   +   KL+SLSEQELV CD+  +D GC GG   +A+K II   G+  E  YPY
Sbjct:   295 GNVEGAWFIAKNKLVSLSEQELVDCDS--MDQGCNGGLPSNAYKEIIRMGGLEPEDAYPY 352

Query:   197 QAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGV 256
                  TC+   +   V  I G   +P +  E     V   P+++ ++A+    QFY  GV
Sbjct:   353 DGRGETCHLVRKDIAVY-INGSVELPHDEVEMQKWLVTKGPISIGLNAN--TLQFYRHGV 409

Query:   257 ---FTGDCGT-ELDHGVTAVGYGATANGTK-YWLVKNSWGTSWGEEGYIRMKRDIDAKEG 311
                F   C    L+HGV  VGYG   +G K YW+VKNSWG +WGE GY ++ R     + 
Sbjct:   410 VHPFKIFCEPFMLNHGVLIVGYGK--DGRKPYWIVKNSWGPNWGEAGYFKLYRG----KN 463

Query:   312 LCGI 315
             +CG+
Sbjct:   464 VCGV 467


>GENEDB_PFALCIPARUM|PF11_0165
            symbol:PF11_0165 "falcipain 2 precursor"
            species:5833 "Plasmodium falciparum" [GO:0020020 "food vacuole"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014186 HSSP:O65039 GO:GO:0020020
            RefSeq:XP_001347836.1 ProteinModelPortal:Q8I6U4 SMR:Q8I6U4
            IntAct:Q8I6U4 MINT:MINT-1559493 MEROPS:C01.046
            EnsemblProtists:PF11_0165:mRNA GeneID:810712 KEGG:pfa:PF11_0165
            EuPathDB:PlasmoDB:PF3D7_1115700 HOGENOM:HOG000065857 OMA:NESLHAN
            ProtClustDB:PTZ00021 BindingDB:Q8I6U4 ChEMBL:CHEMBL3470
            Uniprot:Q8I6U4
        Length = 484

 Score = 491 (177.9 bits), Expect = 6.9e-47, P = 6.9e-47
 Identities = 115/314 (36%), Positives = 163/314 (51%)

Query:    29 KVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGYRRPD 88
             K Y +P E ++RF++F  N   +   N   N  YK  +N FAD T  EFK      R   
Sbjct:   174 KQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKELNRFADLTYHEFKNKYLSLRSSK 233

Query:    89 GLTSRKGT--SFKYENVIDV--------PATMDWRKNGAVTPIKNQGPCGSCWAFSAVAA 138
              L + K       YE VI           A  DWR +  VTP+K+Q  CGSCWAFS++ +
Sbjct:   234 PLKNSKYLLDQMNYEEVIKKYKGNENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGS 293

Query:   139 TEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQA 198
              E    +   KLI+LSEQELV C  S  ++GC GG + +AF+ +I   GI T+ +YPY +
Sbjct:   294 VESQYAIRKNKLITLSEQELVDC--SFKNYGCNGGLINNAFEDMIELGGICTDDDYPYVS 351

Query:   199 -VDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGVF 257
                  CN  +  +    IK Y +VP N  +  L+ +   P+++S+  S   F FY  G+F
Sbjct:   352 DAPNLCN-IDRCTEKYGIKNYLSVPDNKLKEALRFLG--PISISVAVSDD-FAFYKEGIF 407

Query:   258 TGDCGTELDHGVTAVGYGA-------TANGTK--YWLVKNSWGTSWGEEGYIRMKRDIDA 308
              G+CG +L+H V  VG+G        T  G K  Y+++KNSWG  WGE G+I ++ D   
Sbjct:   408 DGECGDQLNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDESG 467

Query:   309 KEGLCGIAMDSSYP 322
                 CG+  D+  P
Sbjct:   468 LMRKCGLGTDAFIP 481


>UNIPROTKB|Q8I6U4
            symbol:PF11_0165 "Falcipain-2A" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            EMBL:AE014186 HSSP:O65039 GO:GO:0020020 RefSeq:XP_001347836.1
            ProteinModelPortal:Q8I6U4 SMR:Q8I6U4 IntAct:Q8I6U4
            MINT:MINT-1559493 MEROPS:C01.046 EnsemblProtists:PF11_0165:mRNA
            GeneID:810712 KEGG:pfa:PF11_0165 EuPathDB:PlasmoDB:PF3D7_1115700
            HOGENOM:HOG000065857 OMA:NESLHAN ProtClustDB:PTZ00021
            BindingDB:Q8I6U4 ChEMBL:CHEMBL3470 Uniprot:Q8I6U4
        Length = 484

 Score = 491 (177.9 bits), Expect = 6.9e-47, P = 6.9e-47
 Identities = 115/314 (36%), Positives = 163/314 (51%)

Query:    29 KVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGYRRPD 88
             K Y +P E ++RF++F  N   +   N   N  YK  +N FAD T  EFK      R   
Sbjct:   174 KQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKELNRFADLTYHEFKNKYLSLRSSK 233

Query:    89 GLTSRKGT--SFKYENVIDV--------PATMDWRKNGAVTPIKNQGPCGSCWAFSAVAA 138
              L + K       YE VI           A  DWR +  VTP+K+Q  CGSCWAFS++ +
Sbjct:   234 PLKNSKYLLDQMNYEEVIKKYKGNENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGS 293

Query:   139 TEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQA 198
              E    +   KLI+LSEQELV C  S  ++GC GG + +AF+ +I   GI T+ +YPY +
Sbjct:   294 VESQYAIRKNKLITLSEQELVDC--SFKNYGCNGGLINNAFEDMIELGGICTDDDYPYVS 351

Query:   199 -VDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGVF 257
                  CN  +  +    IK Y +VP N  +  L+ +   P+++S+  S   F FY  G+F
Sbjct:   352 DAPNLCN-IDRCTEKYGIKNYLSVPDNKLKEALRFLG--PISISVAVSDD-FAFYKEGIF 407

Query:   258 TGDCGTELDHGVTAVGYGA-------TANGTK--YWLVKNSWGTSWGEEGYIRMKRDIDA 308
              G+CG +L+H V  VG+G        T  G K  Y+++KNSWG  WGE G+I ++ D   
Sbjct:   408 DGECGDQLNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDESG 467

Query:   309 KEGLCGIAMDSSYP 322
                 CG+  D+  P
Sbjct:   468 LMRKCGLGTDAFIP 481


>TAIR|locus:2130180
            symbol:AT4G16190 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0042744 "hydrogen peroxide catabolic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005773
            EMBL:CP002687 HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 EMBL:Z97340 EMBL:AL161543 UniGene:At.25555
            EMBL:AY039556 EMBL:AY129473 EMBL:AY136316 EMBL:BT000733
            EMBL:AK226366 IPI:IPI00543588 PIR:D71428 RefSeq:NP_567489.1
            HSSP:P25779 ProteinModelPortal:Q9SUL1 SMR:Q9SUL1 STRING:Q9SUL1
            MEROPS:C01.A06 PRIDE:Q9SUL1 EnsemblPlants:AT4G16190.1 GeneID:827311
            KEGG:ath:AT4G16190 TAIR:At4g16190 InParanoid:Q9SUL1 OMA:NACGINK
            PhylomeDB:Q9SUL1 ProtClustDB:CLSN2917559 Genevestigator:Q9SUL1
            Uniprot:Q9SUL1
        Length = 373

 Score = 491 (177.9 bits), Expect = 6.9e-47, P = 6.9e-47
 Identities = 119/338 (35%), Positives = 174/338 (51%)

Query:     5 QVTSRKLQEASLSEKHE--QWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPY 62
             QV   +  E  L+ +H    + SKY K Y    E + RFR+FK N+      N   +   
Sbjct:    38 QVVPEENDEQLLNAEHHFTLFKSKYEKTYATQVEHDHRFRVFKANLRRARR-NQLLDPSA 96

Query:    63 KLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVI---DVPATMDWRKNGAVT 119
                + +F+D T +EF+    G +R  G   R  T  +   ++   D+P   DWR+ GAVT
Sbjct:    97 VHGVTQFSDLTPKEFRRKFLGLKRR-GF--RLPTDTQTAPILPTSDLPTEFDWREQGAVT 153

Query:   120 PIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCD-------TSGVDHGCEG 172
             P+KNQG CGSCW+FSA+ A EG   L T +L+SLSEQ+LV CD        +  D GC G
Sbjct:   154 PVKNQGMCGSCWSFSAIGALEGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSG 213

Query:   173 GEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKA 232
             G M +AF++ +   G+  E +YPY   D T  K +++  VA +  +  V ++ ++     
Sbjct:   214 GLMNNAFEYALKAGGLMKEEDYPYTGRDHTACKFDKSKIVASVSNFSVVSSDEDQIAANL 273

Query:   233 VANQPVAVSIDASGSAFQFYSSGVFTGD-CGTELDHGVTAVGYGATANGT------KYWL 285
             V + P+A++I+A     Q Y  GV     C    DHGV  VG+G++           YW+
Sbjct:   274 VQHGPLAIAINAMW--MQTYIGGVSCPYVCSKSQDHGVLLVGFGSSGYAPIRLKEKPYWI 331

Query:   286 VKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYPT 323
             +KNSWG  WGE GY ++ R       +CG  MD+   T
Sbjct:   332 IKNSWGAMWGEHGYYKICR---GPHNMCG--MDTMVST 364


>WB|WBGene00019986
            symbol:R09F10.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            EMBL:FO081137 HSSP:P53634 PIR:D89588 RefSeq:NP_509408.1
            ProteinModelPortal:Q23030 SMR:Q23030 STRING:Q23030 MEROPS:C01.A44
            PaxDb:Q23030 EnsemblMetazoa:R09F10.1 GeneID:181087
            KEGG:cel:CELE_R09F10.1 UCSC:R09F10.1 CTD:181087 WormBase:R09F10.1
            InParanoid:Q23030 OMA:EYPYSAL NextBio:912346 Uniprot:Q23030
        Length = 383

 Score = 490 (177.5 bits), Expect = 8.8e-47, P = 8.8e-47
 Identities = 122/313 (38%), Positives = 171/313 (54%)

Query:    19 KHEQWMS----KYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTN 74
             KHEQ  +    K+ + Y + EE E R++IF  NV   E+     N    L +NEF D T+
Sbjct:    77 KHEQMFNDFILKFDRKYTSVEEFEYRYQIFLRNVIEFEA-EEERNLGLDLDVNEFTDWTD 135

Query:    75 QEFKAF--RNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWA 132
             +E +     N Y + D  T +   S+    VI  PA++DWR+ G +TPIKNQG CGSCWA
Sbjct:   136 EELQKMVQENKYTKYDFDTPKFEGSYLETGVIR-PASIDWREQGKLTPIKNQGQCGSCWA 194

Query:   133 FSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEA 192
             F+ VA+ E    +  GKL+SLSEQE+V CD  G ++GC GG    A KF+  N G+ +E 
Sbjct:   195 FATVASVEAQNAIKKGKLVSLSEQEMVDCD--GRNNGCSGGYRPYAMKFVKEN-GLESEK 251

Query:   193 NYPYQAV--DGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSAF 249
              YPY A+  D    K N+      I  +  + +N+EE +   V  + PV   ++   + +
Sbjct:   252 EYPYSALKHDQCFLKENDTR--VFIDDFRML-SNNEEDIANWVGTKGPVTFGMNVVKAMY 308

Query:   250 QFYSSGVFTG---DCGTELD---HGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMK 303
               Y SG+F     DC TE     H +T +GYG       YW+VKNSWGTSWG  GY R+ 
Sbjct:   309 S-YRSGIFNPSVEDC-TEKSMGAHALTIIGYGGEGESA-YWIVKNSWGTSWGASGYFRLA 365

Query:   304 RDIDAKEGLCGIA 316
             R +++    CG+A
Sbjct:   366 RGVNS----CGLA 374


>GENEDB_PFALCIPARUM|PF11_0161
            symbol:PF11_0161 "falcipain-2 precursor,
            putative" species:5833 "Plasmodium falciparum" [GO:0020020 "food
            vacuole" evidence=TAS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014186 HSSP:O65039 OMA:NNVEHIN GO:GO:0020020
            MEROPS:C01.046 HOGENOM:HOG000065857 ProtClustDB:PTZ00021
            RefSeq:XP_001347832.1 ProteinModelPortal:Q8I6U5 SMR:Q8I6U5
            IntAct:Q8I6U5 MINT:MINT-1546851 EnsemblProtists:PF11_0161:mRNA
            GeneID:810708 KEGG:pfa:PF11_0161 EuPathDB:PlasmoDB:PF3D7_1115300
            Uniprot:Q8I6U5
        Length = 482

 Score = 486 (176.1 bits), Expect = 2.3e-46, P = 2.3e-46
 Identities = 115/314 (36%), Positives = 164/314 (52%)

Query:    29 KVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGYRRPD 88
             K Y +P E ++RF++F  N   ++  N      YK  +N FAD T  EFK+     R   
Sbjct:   172 KQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSLYKKELNRFADLTYHEFKSKYLTLRSSK 231

Query:    89 GLTSRKGT--SFKYENVIDV--------PATMDWRKNGAVTPIKNQGPCGSCWAFSAVAA 138
              L + K       Y+ VI           A  DWR +  VTP+K+Q  CGSCWAFS++ +
Sbjct:   232 PLKNSKYLLDQINYDAVIKKYKGNENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGS 291

Query:   139 TEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQA 198
              E    +   KLI+LSEQELV C  S  ++GC GG + +AF+ +I   GI T+ +YPY +
Sbjct:   292 VESQYAIRKNKLITLSEQELVDC--SFKNYGCNGGLINNAFEDMIELGGICTDDDYPYVS 349

Query:   199 -VDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGVF 257
                  CN  +  +    IK Y +VP N  +  L+ +   P+++SI  S   F FY  G+F
Sbjct:   350 DAPNLCN-IDRCTEKYGIKNYLSVPDNKLKEALRFLG--PISISIAVSDD-FPFYKEGIF 405

Query:   258 TGDCGTELDHGVTAVGYGA-------TANGTK--YWLVKNSWGTSWGEEGYIRMKRDIDA 308
              G+CG EL+H V  VG+G        T  G K  Y+++KNSWG  WGE G+I ++ D   
Sbjct:   406 DGECGDELNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDESG 465

Query:   309 KEGLCGIAMDSSYP 322
                 CG+  D+  P
Sbjct:   466 LMRKCGLGTDAFIP 479


>UNIPROTKB|Q8I6U5
            symbol:PF11_0161 "Falcipain-2B" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            EMBL:AE014186 HSSP:O65039 OMA:NNVEHIN GO:GO:0020020 MEROPS:C01.046
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347832.1
            ProteinModelPortal:Q8I6U5 SMR:Q8I6U5 IntAct:Q8I6U5
            MINT:MINT-1546851 EnsemblProtists:PF11_0161:mRNA GeneID:810708
            KEGG:pfa:PF11_0161 EuPathDB:PlasmoDB:PF3D7_1115300 Uniprot:Q8I6U5
        Length = 482

 Score = 486 (176.1 bits), Expect = 2.3e-46, P = 2.3e-46
 Identities = 115/314 (36%), Positives = 164/314 (52%)

Query:    29 KVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGYRRPD 88
             K Y +P E ++RF++F  N   ++  N      YK  +N FAD T  EFK+     R   
Sbjct:   172 KQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSLYKKELNRFADLTYHEFKSKYLTLRSSK 231

Query:    89 GLTSRKGT--SFKYENVIDV--------PATMDWRKNGAVTPIKNQGPCGSCWAFSAVAA 138
              L + K       Y+ VI           A  DWR +  VTP+K+Q  CGSCWAFS++ +
Sbjct:   232 PLKNSKYLLDQINYDAVIKKYKGNENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGS 291

Query:   139 TEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQA 198
              E    +   KLI+LSEQELV C  S  ++GC GG + +AF+ +I   GI T+ +YPY +
Sbjct:   292 VESQYAIRKNKLITLSEQELVDC--SFKNYGCNGGLINNAFEDMIELGGICTDDDYPYVS 349

Query:   199 -VDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGVF 257
                  CN  +  +    IK Y +VP N  +  L+ +   P+++SI  S   F FY  G+F
Sbjct:   350 DAPNLCN-IDRCTEKYGIKNYLSVPDNKLKEALRFLG--PISISIAVSDD-FPFYKEGIF 405

Query:   258 TGDCGTELDHGVTAVGYGA-------TANGTK--YWLVKNSWGTSWGEEGYIRMKRDIDA 308
              G+CG EL+H V  VG+G        T  G K  Y+++KNSWG  WGE G+I ++ D   
Sbjct:   406 DGECGDELNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDESG 465

Query:   309 KEGLCGIAMDSSYP 322
                 CG+  D+  P
Sbjct:   466 LMRKCGLGTDAFIP 479


>UNIPROTKB|F1P3U9
            symbol:CTSH "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005829 "cytosol" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=IEA] [GO:0010628 "positive regulation of gene expression"
            evidence=IEA] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=IEA] [GO:0010813 "neuropeptide catabolic
            process" evidence=IEA] [GO:0010815 "bradykinin catabolic process"
            evidence=IEA] [GO:0016505 "apoptotic protease activator activity"
            evidence=IEA] [GO:0030108 "HLA-A specific activating MHC class I
            receptor activity" evidence=IEA] [GO:0031638 "zymogen activation"
            evidence=IEA] [GO:0031648 "protein destabilization" evidence=IEA]
            [GO:0032526 "response to retinoic acid" evidence=IEA] [GO:0033619
            "membrane protein proteolysis" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0043129
            "surfactant homeostasis" evidence=IEA] [GO:0045766 "positive
            regulation of angiogenesis" evidence=IEA] [GO:0060448 "dichotomous
            subdivision of terminal units involved in lung branching"
            evidence=IEA] [GO:0070324 "thyroid hormone binding" evidence=IEA]
            [GO:0070371 "ERK1 and ERK2 cascade" evidence=IEA] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA]
            [GO:0097208 "alveolar lamellar body" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066
            GO:GO:0005615 GO:GO:0008284 GO:GO:0070371 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0032526 GO:GO:0010628
            GO:GO:0070324 GO:GO:0016505 GO:GO:0010634 GO:GO:0004197
            GO:GO:0042599 GO:GO:0031648 GO:GO:0097067 GO:GO:0031638
            GO:GO:0001913 GeneTree:ENSGT00660000095458 OMA:STSCHKT
            GO:GO:0030108 GO:GO:0010815 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 EMBL:AADN02038832 EMBL:AADN02038831 IPI:IPI00594147
            Ensembl:ENSGALT00000013440 Uniprot:F1P3U9
        Length = 261

 Score = 481 (174.4 bits), Expect = 7.9e-46, P = 7.9e-46
 Identities = 101/267 (37%), Positives = 152/267 (56%)

Query:    62 YKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA-VTP 120
             + +++N+F+D T  EFK     +  P   ++ +G   + +     P  +DWRK G  VTP
Sbjct:     1 FLVALNQFSDMTFAEFKKLYL-WSEPQNCSATRGNFLRSDG--PCPEAVDWRKKGNFVTP 57

Query:   121 IKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFK 180
             +KNQGPCGSCW FS     E    + TGKL+SL+EQ LV C  +  +HGC GG    AF+
Sbjct:    58 VKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQLLVDCAQAFNNHGCSGGLPSQAFE 117

Query:   181 FIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVA-NQPVA 239
             +I++N G+  E  YPY+A +GTC K      +A +K    +    E  +++AV  + PV+
Sbjct:   118 YILYNKGLMGEDAYPYRAQNGTC-KFQPDKAIAFVKDVINITQYDEAGMVEAVGKHNPVS 176

Query:   240 VSIDASGSAFQFYSSGVFTGD-CG---TELDHGVTAVGYGATANGTKYWLVKNSWGTSWG 295
              + + + S F  Y  GV++   C     +++H V AVGYG   +G  YW+VKNSWG  WG
Sbjct:   177 FAFEVT-SDFMHYRKGVYSNPRCEHTPDKVNHAVLAVGYGEE-DGRPYWIVKNSWGPLWG 234

Query:   296 EEGYIRMKRDIDAKEGLCGIAMDSSYP 322
              +GY  ++R     + +CG+A  +SYP
Sbjct:   235 MDGYFLIERG----KNMCGLAACASYP 257


>TAIR|locus:2082687
            symbol:AT3G54940 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002686 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HSSP:P53634
            OMA:GGGLMTN EMBL:AY070063 IPI:IPI00528988 RefSeq:NP_567010.5
            UniGene:At.28412 ProteinModelPortal:Q8VYS0 SMR:Q8VYS0 PRIDE:Q8VYS0
            EnsemblPlants:AT3G54940.2 GeneID:824659 KEGG:ath:AT3G54940
            TAIR:At3g54940 PhylomeDB:Q8VYS0 ProtClustDB:CLSN2718801
            ArrayExpress:Q8VYS0 Genevestigator:Q8VYS0 Uniprot:Q8VYS0
        Length = 367

 Score = 474 (171.9 bits), Expect = 4.4e-45, P = 4.4e-45
 Identities = 117/325 (36%), Positives = 167/325 (51%)

Query:    19 KHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKL-SINEFADQTNQEF 77
             K   +MS YGK Y   EE   R  IF  NV  +++       P  +  + +F+D T +EF
Sbjct:    50 KFRLFMSDYGKNYSTREEYIHRLGIFAKNV--LKAAEHQMMDPSAVHGVTQFSDLTEEEF 107

Query:    78 KAFRNGYRRPDGLTSRKGTSFKYENVIDV---PATMDWRKNGAVTPIKNQGPCGSCWAFS 134
             K    G     G  SR GT      +++V   P   DWR+ G VT +KNQG CGSCWAFS
Sbjct:   108 KRMYTGVADVGG--SRGGTVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFS 165

Query:   135 AVAATEGITQLTTGKLISLSEQELVSCDTS-------GVDHGCEGGEMEDAFKFIIHNDG 187
                A EG   ++TGKL+SLSEQ+LV CD +         D+GC GG M +A+++++   G
Sbjct:   166 TTGAAEGAHFVSTGKLLSLSEQQLVDCDQACDPKDKKACDNGCGGGLMTNAYEYLMEAGG 225

Query:   188 ITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGS 247
             +  E +YPY    G C K +      ++  + T+P +  +     V + P+AV ++A   
Sbjct:   226 LEEERSYPYTGKRGHC-KFDPEKVAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAV-- 282

Query:   248 AFQFYSSGVFTG-DCGTE-LDHGVTAVGYGATA------NGTKYWLVKNSWGTSWGEEGY 299
               Q Y  GV     C    ++HGV  VGYG+        +   YW++KNSWG  WGE GY
Sbjct:   283 FMQTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGY 342

Query:   300 IRMKRDIDAKEGLCGI-AMDSSYPT 323
              ++ R  D    +CGI +M S+  T
Sbjct:   343 YKLCRGHD----ICGINSMVSAVAT 363


>DICTYBASE|DDB_G0274385
            symbol:DDB_G0274385 "Cysteine proteinase 1,
            mitochondrial" species:44689 "Dictyostelium discoideum" [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0274385 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 EMBL:AAFI02000012 RefSeq:XP_644301.1
            ProteinModelPortal:Q86KD4 EnsemblProtists:DDB0167535 GeneID:8619729
            KEGG:ddi:DDB_G0274385 InParanoid:Q86KD4 OMA:SICVDAS Uniprot:Q86KD4
        Length = 358

 Score = 467 (169.5 bits), Expect = 2.4e-44, P = 2.4e-44
 Identities = 113/328 (34%), Positives = 167/328 (50%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQ 72
             ++S+ +    W  K+ K+YK+  E E RF  FK+N++    LN+      K   N F+D 
Sbjct:    37 DSSMRDTFNHWAKKHSKIYKDSIEMENRFSNFKENMKKNIELNSMHAGKAKFESNGFSDL 96

Query:    73 TNQEFKAFR--NGYR-RPDGL-TSRKGTSFKYENVI---------DVPA--TMDWRKNGA 117
             + +EF  F     ++ +P  L  S K     + ++I         D+    ++DWRK G 
Sbjct:    97 SEEEFSNFHLNKAFKGKPSHLRNSIKPQPTPHHSLINGYKEMENGDLNELYSIDWRKKGL 156

Query:   118 VTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMED 177
             VTP+K+QG CGSC+ FSAV   E        K I LSEQ+ V CD    D  C GG+   
Sbjct:   157 VTPVKDQGQCGSCYIFSAVEQIETAWIKAGNKPILLSEQQAVDCDP--YDGQCGGGDPYT 214

Query:   178 AFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ- 236
              +++     G++T A YPY A DGTC   N +  V  +  +       E  L+K + N  
Sbjct:   215 VYEYFSQVGGVSTNAQYPYTATDGTC--VNMSRAVPVVSYHYVTQGGDENTLIKTIVNDG 272

Query:   237 PVAVSIDASGSAFQFYSSGVFTGDCGTELDHGVTAVGYGAT----ANGTKYWLVKNSWGT 292
             PV++ +DAS   +Q YS G+ T  CG  +DH V  VG        +N  +Y++++NSWGT
Sbjct:   273 PVSICVDAS--TWQSYSGGIITTGCGKNIDHCVQVVGLEVDKTDPSNPVQYYIIRNSWGT 330

Query:   293 SWGEEGYIRMKRDIDAKEGLCGIAMDSS 320
              WG +GYI +    D    LCGI  +S+
Sbjct:   331 DWGIDGYIYVATGSD----LCGITYEST 354


>WB|WBGene00012747
            symbol:Y40H7A.10 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000230773 EMBL:AL033510
            HSSP:P80067 MEROPS:C01.A48 PIR:T26792 RefSeq:NP_502836.1
            ProteinModelPortal:Q9XWA4 SMR:Q9XWA4 STRING:Q9XWA4
            EnsemblMetazoa:Y40H7A.10 GeneID:189809 KEGG:cel:CELE_Y40H7A.10
            UCSC:Y40H7A.10 CTD:189809 WormBase:Y40H7A.10 eggNOG:NOG286423
            InParanoid:Q9XWA4 OMA:NGPMIVC NextBio:943702 Uniprot:Q9XWA4
        Length = 343

 Score = 464 (168.4 bits), Expect = 5.0e-44, P = 5.0e-44
 Identities = 119/305 (39%), Positives = 170/305 (55%)

Query:    21 EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNA--AGNKPYKLSINEFADQTNQEFK 78
             + ++ KY + Y N  E  KRF IF  N++ +E  N   AG   Y+L  N+F+D T +E+K
Sbjct:    52 QNFLVKYLREYPNEYEIVKRFTIFSRNLDLVERYNKEDAGKVTYEL--NDFSDLTEEEWK 109

Query:    79 AFRNGYRRPDGLTSRKGTSFKYENVID---VPATMDWRK-NGA--VTPIKNQGPCGSCWA 132
              +     +PD   S K  S K + +ID   +P ++DWR  NG   VT IK QGPCGSCWA
Sbjct:   110 KYLMT-PKPDH--SEK--SLKPKTLIDKKNLPNSVDWRNVNGTNHVTGIKYQGPCGSCWA 164

Query:   133 FSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEA 192
             F+  AA E    ++ G L SLS Q+L+ C    V   C GGE  +A K+   + GITT  
Sbjct:   165 FATAAAIESAVSISGGGLQSLSSQQLLDCTV--VSDKCGGGEPVEALKYA-QSHGITTAH 221

Query:   193 NYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVA-NQPVAVSIDASGSAFQF 251
             NYPY      C +T     VA+I  +  + A SE+ + + VA N P+ V  + + +  +F
Sbjct:   222 NYPYYFWTTKCRET--VPTVARISSW--MKAESEDEMAQIVALNGPMIVCANFATNKNRF 277

Query:   252 YSSGVFTG-DCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKE 310
             Y SG+    DCGTE  H +  +GYG       YW++KN++   WGE+GY+R+KRD++   
Sbjct:   278 YHSGIAEDPDCGTEPTHALIVIGYGPD-----YWILKNTYSKVWGEKGYMRVKRDVN--- 329

Query:   311 GLCGI 315
               CGI
Sbjct:   330 -WCGI 333


>ZFIN|ZDB-GENE-030131-9831
            symbol:ctsf "cathepsin F" species:7955 "Danio
            rerio" [GO:0004869 "cysteine-type endopeptidase inhibitor activity"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000010 InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00031 Pfam:PF00112 PRINTS:PR00705 SMART:SM00043
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-030131-9831
            GO:GO:0004869 eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 HOVERGEN:HBG011513 CTD:8722 OrthoDB:EOG4CC41T
            MEROPS:I25.006 EMBL:BC124243 IPI:IPI00503226 RefSeq:NP_001071036.1
            UniGene:Dr.81265 ProteinModelPortal:Q08CH0 SMR:Q08CH0 GeneID:565588
            KEGG:dre:565588 InParanoid:Q08CH0 NextBio:20885952
            ArrayExpress:Q08CH0 Uniprot:Q08CH0
        Length = 473

 Score = 463 (168.0 bits), Expect = 6.4e-44, P = 6.4e-44
 Identities = 117/326 (35%), Positives = 171/326 (52%)

Query:     1 IAASQVT-SRKLQEA-SLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAG 58
             +AA  +T S+ ++E+  L    + +M  Y + Y + EE EKR RIF+ N++  ++L +  
Sbjct:   154 VAAVPLTHSKPMKESVELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLE 213

Query:    59 NKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDV----PATMDWRK 114
                 +  I +F+D T  EF+     Y  P  + S+     + +  I      P T DWR 
Sbjct:   214 QGSAEYGITKFSDLTEDEFRMM---YLNP--MLSQWSLKKEMKPAIPASAPAPDTWDWRD 268

Query:   115 NGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGE 174
             +GAV+P+KNQG CGSCWAFS     EG     TG+L+SLSEQELV CD   +D  C GG 
Sbjct:   269 HGAVSPVKNQGMCGSCWAFSVTGNIEGQWFKKTGQLLSLSEQELVDCDK--LDQACGGGL 326

Query:   175 MEDAFKFIIHNDGITTEANYPYQAVDGTCN-KTNEASHVAKIKGYETVPANSEEALLKAV 233
               +A++ I +  G+ TE +Y Y     +C+  T + +  A I     +P + +E      
Sbjct:   327 PSNAYEAIENLGGLETETDYSYTGHKQSCDFSTGKVA--AYINSSVELPKDEKEIAAFLA 384

Query:   234 ANQPVAVSIDASGSAFQFYSSGV---FTGDCGT-ELDHGVTAVGYGATANGTKYWLVKNS 289
              N PV+ +++A   A QFY  GV       C    +DH V  VG+G   NG  +W +KNS
Sbjct:   385 ENGPVSAALNAF--AMQFYRKGVSHPLKIFCNPWMIDHAVLLVGFGQR-NGVPFWAIKNS 441

Query:   290 WGTSWGEEGYIRMKRDIDAKEGLCGI 315
             WG  +GE+GY  + R      GLCGI
Sbjct:   442 WGEDYGEQGYYYLYRG----SGLCGI 463


>MGI|MGI:1861434
            symbol:Ctsf "cathepsin F" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=ISS]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISS] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:1861434 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513 MEROPS:C01.018
            CTD:8722 OMA:LAPPEWD OrthoDB:EOG4CC41T EMBL:AF136280 EMBL:AF217224
            EMBL:AJ131851 EMBL:AK075862 EMBL:BC058758 IPI:IPI00126769
            RefSeq:NP_063914.1 UniGene:Mm.29561 ProteinModelPortal:Q9R013
            SMR:Q9R013 STRING:Q9R013 PhosphoSite:Q9R013 PaxDb:Q9R013
            PRIDE:Q9R013 Ensembl:ENSMUST00000119694 GeneID:56464 KEGG:mmu:56464
            UCSC:uc008gbc.1 GeneTree:ENSGT00660000095458 InParanoid:Q9R013
            NextBio:312722 Bgee:Q9R013 CleanEx:MM_CTSF Genevestigator:Q9R013
            GermOnline:ENSMUSG00000006458 Uniprot:Q9R013
        Length = 462

 Score = 459 (166.6 bits), Expect = 1.7e-43, P = 1.7e-43
 Identities = 116/308 (37%), Positives = 164/308 (53%)

Query:    21 EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKA- 79
             + +M+ Y + Y++ EE + R  +F  N+   + + A      +  I +F+D T +EF   
Sbjct:   166 KDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHTI 225

Query:    80 FRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAAT 139
             + N   + +  + RK +  K  N +  P   DWRK GAVT +KNQG CGSCWAFS     
Sbjct:   226 YLNPLLQKE--SGRKMSPAKSINDL-APPEWDWRKKGAVTEVKNQGMCGSCWAFSVTGNV 282

Query:   140 EGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAV 199
             EG   L  G L+SLSEQEL+ CD   VD  C GG   +A+  I +  G+ TE +Y YQ  
Sbjct:   283 EGQWFLNRGTLLSLSEQELLDCDK--VDKACLGGLPSNAYAAIKNLGGLETEDDYGYQGH 340

Query:   200 DGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ--PVAVSIDASGSAFQFYSSGV- 256
               TCN +   + +AK+   ++V  +  E  + A   Q  P++V+I+A G   QFY  G+ 
Sbjct:   341 VQTCNFS---AQMAKVYINDSVELSRNENKIAAWLAQKGPISVAINAFG--MQFYRHGIA 395

Query:   257 --FTGDCGTE-LDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLC 313
               F   C    +DH V  VGYG  +N   YW +KNSWG+ WGEEGY  + R      G C
Sbjct:   396 HPFRPLCSPWFIDHAVLLVGYGNRSN-IPYWAIKNSWGSDWGEEGYYYLYRG----SGAC 450

Query:   314 GI-AMDSS 320
             G+  M SS
Sbjct:   451 GVNTMASS 458


>FB|FBgn0033874
            symbol:CG6347 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE013599 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 HSSP:P53634 EMBL:AY069609
            RefSeq:NP_610906.1 UniGene:Dm.608 SMR:Q7K0S6 MEROPS:C01.A29
            EnsemblMetazoa:FBtr0087637 GeneID:36531 KEGG:dme:Dmel_CG6347
            UCSC:CG6347-RA FlyBase:FBgn0033874 InParanoid:Q7K0S6 OMA:FEYIRDH
            OrthoDB:EOG4FQZ74 GenomeRNAi:36531 NextBio:799046 Uniprot:Q7K0S6
        Length = 352

 Score = 455 (165.2 bits), Expect = 4.5e-43, P = 4.5e-43
 Identities = 115/327 (35%), Positives = 170/327 (51%)

Query:    18 EKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFI--ESLNAA-GNKPYKLSINEFADQTN 74
             +  + ++ + GKVY + EE+  R  IF   +  I   + NA  G   ++L +N  AD T 
Sbjct:    36 QNFDDFLRQTGKVYSD-EERVYRESIFAAKMSLITLSNKNADNGVSGFRLGVNTLADMTR 94

Query:    75 QEF------KAFRNGYRRPDG----LTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQ 124
             +E       K    G R  +G    +T+R   S       ++P   DWR+ G VTP   Q
Sbjct:    95 KEIATLLGSKISEFGERYTNGHINFVTARNPAS------ANLPEMFDWREKGGVTPPGFQ 148

Query:   125 GP-CGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFII 183
             G  CG+CW+F+   A EG     TG L SLS+Q LV C     + GC+GG  E  F++I 
Sbjct:   149 GVGCGACWSFATTGALEGHLFRRTGVLASLSQQNLVDCADDYGNMGCDGGFQEYGFEYI- 207

Query:   184 HNDGITTEANYPYQAVDGTCNKTNEASH-----VAKIKGYETVPANSEEALLKAVANQ-P 237
              + G+T    YPY   +  C +   A       + KI+ Y T+    EE + + +A   P
Sbjct:   208 RDHGVTLANKYPYTQTEMQCRQNETAGRPPRESLVKIRDYATITPGDEEKMKEVIATLGP 267

Query:   238 VAVSIDASGSAFQFYSSGVFTGD-CGT-ELDHGVTAVGYGATANGTKYWLVKNSWGTSWG 295
             +A S++A   +F+ YS G++  + C   EL+H VT VGYG T NG  YW++KNS+  +WG
Sbjct:   268 LACSMNADTISFEQYSGGIYEDEECNQGELNHSVTVVGYG-TENGRDYWIIKNSYSQNWG 326

Query:   296 EEGYIRMKRDIDAKEGLCGIAMDSSYP 322
             E G++R+ R+     G CGIA + SYP
Sbjct:   327 EGGFMRILRNAG---GFCGIASECSYP 350


>GENEDB_PFALCIPARUM|PF11_0162
            symbol:PF11_0162 "falcipain-3" species:5833
            "Plasmodium falciparum" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 HSSP:P43235 EMBL:AE014186 GO:GO:0020020
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347833.1
            ProteinModelPortal:Q8IIL0 SMR:Q8IIL0 MEROPS:C01.063
            EnsemblProtists:PF11_0162:mRNA GeneID:810709 KEGG:pfa:PF11_0162
            EuPathDB:PlasmoDB:PF3D7_1115400 OMA:ENDEDYW ChEMBL:CHEMBL1250373
            Uniprot:Q8IIL0
        Length = 492

 Score = 455 (165.2 bits), Expect = 4.5e-43, P = 4.5e-43
 Identities = 112/322 (34%), Positives = 168/322 (52%)

Query:    23 WMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKA-FR 81
             ++ +  K Y+  EE +KRF IF +N   IE  N   N  YK  +N+F D + +EF++ + 
Sbjct:   174 FLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFGDLSPEEFRSKYL 233

Query:    82 NGYRRPDGLTSRKGTSFK--YENVIDV--PA-------TMDWRKNGAVTPIKNQGPCGSC 130
             N        T     S++  YE+VI    PA         DWR +G VTP+K+Q  CGSC
Sbjct:   234 NLKTHGPFKTLSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGGVTPVKDQALCGSC 293

Query:   131 WAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITT 190
             WAFS+V + E    +    L   SEQELV C     ++GC GG + +AF  +I   G+ +
Sbjct:   294 WAFSSVGSVESQYAIRKKALFLFSEQELVDCSVK--NNGCYGGYITNAFDDMIDLGGLCS 351

Query:   191 EANYPYQA-VDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAF 249
             + +YPY + +  TCN     +    IK Y ++P +  +  L+ +   P+++SI AS   F
Sbjct:   352 QDDYPYVSNLPETCN-LKRCNERYTIKSYVSIPDDKFKEALRYLG--PISISIAASDD-F 407

Query:   250 QFYSSGVFTGDCGTELDHGVTAVGYGA-------TANGTK--YWLVKNSWGTSWGEEGYI 300
              FY  G + G+CG   +H V  VGYG        T    K  Y+++KNSWG+ WGE GYI
Sbjct:   408 AFYRGGFYDGECGAAPNHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWGEGGYI 467

Query:   301 RMKRDIDAKEGLCGIAMDSSYP 322
              ++ D +  +  C I  ++  P
Sbjct:   468 NLETDENGYKKTCSIGTEAYVP 489


>UNIPROTKB|Q8IIL0
            symbol:PF11_0162 "Falcipain-3" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 HSSP:P43235 EMBL:AE014186 GO:GO:0020020
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347833.1
            ProteinModelPortal:Q8IIL0 SMR:Q8IIL0 MEROPS:C01.063
            EnsemblProtists:PF11_0162:mRNA GeneID:810709 KEGG:pfa:PF11_0162
            EuPathDB:PlasmoDB:PF3D7_1115400 OMA:ENDEDYW ChEMBL:CHEMBL1250373
            Uniprot:Q8IIL0
        Length = 492

 Score = 455 (165.2 bits), Expect = 4.5e-43, P = 4.5e-43
 Identities = 112/322 (34%), Positives = 168/322 (52%)

Query:    23 WMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKA-FR 81
             ++ +  K Y+  EE +KRF IF +N   IE  N   N  YK  +N+F D + +EF++ + 
Sbjct:   174 FLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFGDLSPEEFRSKYL 233

Query:    82 NGYRRPDGLTSRKGTSFK--YENVIDV--PA-------TMDWRKNGAVTPIKNQGPCGSC 130
             N        T     S++  YE+VI    PA         DWR +G VTP+K+Q  CGSC
Sbjct:   234 NLKTHGPFKTLSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGGVTPVKDQALCGSC 293

Query:   131 WAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITT 190
             WAFS+V + E    +    L   SEQELV C     ++GC GG + +AF  +I   G+ +
Sbjct:   294 WAFSSVGSVESQYAIRKKALFLFSEQELVDCSVK--NNGCYGGYITNAFDDMIDLGGLCS 351

Query:   191 EANYPYQA-VDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAF 249
             + +YPY + +  TCN     +    IK Y ++P +  +  L+ +   P+++SI AS   F
Sbjct:   352 QDDYPYVSNLPETCN-LKRCNERYTIKSYVSIPDDKFKEALRYLG--PISISIAASDD-F 407

Query:   250 QFYSSGVFTGDCGTELDHGVTAVGYGA-------TANGTK--YWLVKNSWGTSWGEEGYI 300
              FY  G + G+CG   +H V  VGYG        T    K  Y+++KNSWG+ WGE GYI
Sbjct:   408 AFYRGGFYDGECGAAPNHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWGEGGYI 467

Query:   301 RMKRDIDAKEGLCGIAMDSSYP 322
              ++ D +  +  C I  ++  P
Sbjct:   468 NLETDENGYKKTCSIGTEAYVP 489


>FB|FBgn0032228
            symbol:CG5367 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE014134 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 HSSP:P80067
            RefSeq:NP_609387.1 UniGene:Dm.26782 ProteinModelPortal:Q9VKY4
            SMR:Q9VKY4 MEROPS:C01.A30 EnsemblMetazoa:FBtr0080055 GeneID:34401
            KEGG:dme:Dmel_CG5367 UCSC:CG5367-RA FlyBase:FBgn0032228
            InParanoid:Q9VKY4 OMA:QIVDCSV OrthoDB:EOG4THT8X PhylomeDB:Q9VKY4
            GenomeRNAi:34401 NextBio:788324 ArrayExpress:Q9VKY4 Bgee:Q9VKY4
            Uniprot:Q9VKY4
        Length = 338

 Score = 449 (163.1 bits), Expect = 1.9e-42, P = 1.9e-42
 Identities = 102/333 (30%), Positives = 171/333 (51%)

Query:     1 IAASQVTSRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNA---A 57
             I  S ++      A+   + E++ +   + Y    ++ + ++ F++N + IE  N     
Sbjct:    17 IVTSNLSEGNSSSANCKSEFEKFKNNNNRKYLRTYDEMRSYKAFEENFKVIEEHNQNYKE 76

Query:    58 GNKPYKLSINEFADQTNQEF-KAF-----RNGYRRPDGLTSRKGTSFKYENVIDVPATMD 111
             G   ++L  N FAD +   + K F      N     D +    G+      + +VP ++D
Sbjct:    77 GQTSFRLKPNIFADMSTDGYLKGFLRLLKSNIEDSADNMAEIVGSPL----MANVPESLD 132

Query:   112 WRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCE 171
             WR  G +TP  NQ  CGSC+AFS   +  G     TGK++SLS+Q++V C  S  + GC 
Sbjct:   133 WRSKGFITPPYNQLSCGSCYAFSIAESIMGQVFKRTGKILSLSKQQIVDCSVSHGNQGCV 192

Query:   172 GGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLK 231
             GG + +   ++    GI  + +YPY A  G C    + S V  +  +  +P   E+A+  
Sbjct:   193 GGSLRNTLSYLQSTGGIMRDQDYPYVARKGKCQFVPDLS-VVNVTSWAILPVRDEQAIQA 251

Query:   232 AVAN-QPVAVSIDASGSAFQFYSSGVFTGD-CGT-ELDHGVTAVGYGATANGTKYWLVKN 288
             AV +  PVA+SI+AS   FQ YS G++    C +  ++H +  +G+G       YW++KN
Sbjct:   252 AVTHIGPVAISINASPKTFQLYSDGIYDDPLCSSASVNHAMVVIGFGKD-----YWILKN 306

Query:   289 SWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSY 321
              WG +WGE GYIR+++ ++    +CGIA  ++Y
Sbjct:   307 WWGQNWGENGYIRIRKGVN----MCGIANYAAY 335


>UNIPROTKB|F1RU48
            symbol:CTSF "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            OMA:LAPPEWD GeneTree:ENSGT00660000095458 EMBL:CU928034
            EMBL:FP565364 Ensembl:ENSSSCT00000014140 Ensembl:ENSSSCT00000014154
            Uniprot:F1RU48
        Length = 460

 Score = 446 (162.1 bits), Expect = 4.0e-42, P = 4.0e-42
 Identities = 111/308 (36%), Positives = 158/308 (51%)

Query:    21 EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAF 80
             +++++ Y + Y   EE   R  +F +N+   + + A      +  + +F+D T +EF+  
Sbjct:   164 KEFVTTYNRTYDTKEEARWRMSVFANNMVRAQKIQALDTGTARYGVTKFSDLTEEEFRTI 223

Query:    81 RNGYRRPDGLTSRKGTSFKYENVIDV--PATMDWRKNGAVTPIKNQGPCGSCWAFSAVAA 138
                Y  P  L    G   +    +    P   DWRK GAVT +K+QG CGSCWAFS    
Sbjct:   224 ---YLNPL-LQEEPGRKMRLAKSVSSLPPPEWDWRKKGAVTKVKDQGMCGSCWAFSVTGN 279

Query:   139 TEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQA 198
              EG   L  G L+SLSEQEL+ CD   VD GC GG   +A+  I    G+ TE +Y Y+ 
Sbjct:   280 VEGQWFLKQGTLLSLSEQELLDCDK--VDKGCMGGLPSNAYSAIKTLGGLETEEDYSYRG 337

Query:   199 VDGTCNKTNEASHVAKIKGYETVPANSEEALLKA-VANQ-PVAVSIDASGSAFQFYSSGV 256
                TC+   E    AK+   ++V  +  E  L A +A + P++V+I+A G   QFY  G+
Sbjct:   338 HLQTCSFNAEK---AKVYINDSVELSQNEQKLAAWLAEKGPISVAINAFG--MQFYRHGI 392

Query:   257 ---FTGDCGTEL-DHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGL 312
                    C   L DH V  VGYG   + T +W +KNSWGT WGEEGY  + R      G 
Sbjct:   393 SHPLRPLCSPWLIDHAVLLVGYG-NRSATPFWAIKNSWGTDWGEEGYYYLYRG----SGA 447

Query:   313 CGIAMDSS 320
             CG+ + +S
Sbjct:   448 CGVNIMAS 455


>RGD|1308181
            symbol:Ctsf "cathepsin F" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1308181 eggNOG:COG4870 HOGENOM:HOG000230774
            KO:K01373 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 MEROPS:C01.018
            CTD:8722 OMA:LAPPEWD OrthoDB:EOG4CC41T GeneTree:ENSGT00660000095458
            EMBL:CH473953 EMBL:BC099780 EMBL:EU253481 IPI:IPI00201100
            RefSeq:NP_001029282.1 UniGene:Rn.25087 SMR:Q499S6
            Ensembl:ENSRNOT00000026718 GeneID:361704 KEGG:rno:361704
            UCSC:RGD:1308181 InParanoid:Q499S6 NextBio:677325
            Genevestigator:Q499S6 Uniprot:Q499S6
        Length = 462

 Score = 445 (161.7 bits), Expect = 5.2e-42, P = 5.2e-42
 Identities = 112/308 (36%), Positives = 160/308 (51%)

Query:    21 EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAF 80
             + +M+ Y + Y++ EE + R  +F  N+   + + A      +  I +F+D T +EF   
Sbjct:   166 KDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHTI 225

Query:    81 RNGYRRPDGLTSRKGTSFKYENVIDV-PATMDWRKNGAVTPIKNQGPCGSCWAFSAVAAT 139
                Y  P       G     +++ D+ P   DWRK GAVT +K+QG CGSCWAFS     
Sbjct:   226 ---YLNPLLQKESGGKMSLAKSINDLAPPEWDWRKKGAVTEVKDQGMCGSCWAFSVTGNV 282

Query:   140 EGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAV 199
             EG   L  G L+SLSEQEL+ CD   +D  C GG   +A+  I +  G+ TE +Y YQ  
Sbjct:   283 EGQWFLNRGTLLSLSEQELLDCDK--MDKACMGGLPSNAYTAIKNLGGLETEDDYGYQGH 340

Query:   200 DGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ--PVAVSIDASGSAFQFYSSGV- 256
                CN + +   +AK+   ++V  + +E  + A   Q  P++V+I+A G   QFY  G+ 
Sbjct:   341 VQACNFSTQ---MAKVYINDSVELSRDENKIAAWLAQKGPISVAINAFG--MQFYRHGIA 395

Query:   257 --FTGDCGTE-LDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLC 313
               F   C    +DH V  VGYG  +N   YW +KNSWG  WGEEGY  + R      G C
Sbjct:   396 HPFRPLCSPWFIDHAVLLVGYGNRSN-IPYWAIKNSWGRDWGEEGYYYLYRG----SGAC 450

Query:   314 GI-AMDSS 320
             G+  M SS
Sbjct:   451 GVNTMASS 458


>UNIPROTKB|E2RR02
            symbol:CTSF "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            OMA:LAPPEWD GeneTree:ENSGT00660000095458 EMBL:AAEX03011628
            Ensembl:ENSCAFT00000019742 Uniprot:E2RR02
        Length = 460

 Score = 444 (161.4 bits), Expect = 6.6e-42, P = 6.6e-42
 Identities = 111/309 (35%), Positives = 161/309 (52%)

Query:    21 EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAF 80
             +++++ Y + Y+  EE E R  +F +N+   + + A      +  I +F+D T +EF+  
Sbjct:   163 KEFVTTYNRTYETKEEAEWRMSVFSNNMVRAQKIQALDRGTAQYGITKFSDLTEEEFRTI 222

Query:    81 RNGYRRPDGLTSRKGTSFKY-ENVID--VPATMDWRKNGAVTPIKNQGPCGSCWAFSAVA 137
                Y  P  L   +G   +  +++ D   P   DWR  GAVT +K+QG CGSCWAFS   
Sbjct:   223 ---YLNPL-LRENRGKKMRLAKSISDHAPPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTG 278

Query:   138 ATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQ 197
               EG   L  G L+SLSEQEL+ CD   VD  C GG   +A+  I+   G+ TE +Y YQ
Sbjct:   279 NVEGQWFLKEGTLLSLSEQELLDCDK--VDKACLGGLPSNAYSAIMTLGGLETEDDYSYQ 336

Query:   198 AVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGSAFQFYSSGV 256
                  C+ + + + V      E   + +E+ L   +A + P++V+I+A G   QFY  G+
Sbjct:   337 GHLQACSFSAKKARVYINDSMEL--SQNEQKLAAWLAKKGPISVAINAFG--MQFYRHGI 392

Query:   257 ---FTGDCGTEL-DHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGL 312
                    C   L DH V  VGYG   +G  +W +KNSWGT WGEEGY  + R      G 
Sbjct:   393 SHPLRPLCSPWLIDHAVLLVGYG-NRSGIPFWAIKNSWGTDWGEEGYYYLHRG----SGA 447

Query:   313 CGI-AMDSS 320
             CG+  M SS
Sbjct:   448 CGVNTMASS 456


>DICTYBASE|DDB_G0282991
            symbol:DDB_G0282991 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0282991 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            EMBL:AAFI02000049 eggNOG:NOG331187 RefSeq:XP_639299.1
            ProteinModelPortal:Q54RQ2 EnsemblProtists:DDB0185304 GeneID:8623870
            KEGG:ddi:DDB_G0282991 InParanoid:Q54RQ2 OMA:PENGNEY Uniprot:Q54RQ2
        Length = 339

 Score = 441 (160.3 bits), Expect = 1.4e-41, P = 1.4e-41
 Identities = 109/311 (35%), Positives = 169/311 (54%)

Query:    22 QWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFR 81
             +W +KY K+Y N +E   RF  FK N E+++  N    +   L +N FAD +  E+    
Sbjct:    29 EWTNKYNKIYSN-KEFYMRFNNFKKNKEYVDQWNEKQLETI-LELNFFADLSRNEY--IN 84

Query:    82 NGYRRPDGLTSRKGTSFKYE-----NVIDVPATMDWRKNGAVTPIKNQGPC-GSCWAFSA 135
             N       +++ +  + KYE     N  +   ++DWR   AVTP+KNQG C G+ ++FSA
Sbjct:    85 NYLASFIDISNIEQKNTKYEGNLKNNFNNSIKSIDWRNFDAVTPVKNQGLCSGAGYSFSA 144

Query:   136 VAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYP 195
             +   E    +   +LI+LSEQ ++ C T   ++GC GG    AF +II   GI +E NYP
Sbjct:   145 IGVIESSHFIKNKELITLSEQNIIDCTTDMGNNGCMGGLALIAFDYIIKQKGIDSEFNYP 204

Query:   196 YQA--VD-----GTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSA 248
             Y+   ++     G C + N     A I  Y  +   +E  L +++   PV+V IDAS  +
Sbjct:   205 YEGYLIEPYEGRGRC-RYNSFYSKASISSYIEIERFNENELTQSLIKSPVSVMIDASQLS 263

Query:   249 FQFYSSGVFTG-DCG-TELDHGVTAVGYGATA-NGTKYWLVKNSWGTSWGEEGYIRMKRD 305
             F  Y SGV+    C  T L+HG+  +G+G T  NG +Y+++KNS+G+ WG +GYI + R+
Sbjct:   264 FMLYKSGVYKDPSCSSTILNHGILNIGFGVTPENGNEYYILKNSFGSKWGMKGYIYLSRN 323

Query:   306 IDAKEGLCGIA 316
              +     CGI+
Sbjct:   324 FNNH---CGIS 331


>UNIPROTKB|Q0VCU3
            symbol:CTSF "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            HOVERGEN:HBG011513 MEROPS:C01.018 CTD:8722 OMA:LAPPEWD
            OrthoDB:EOG4CC41T GeneTree:ENSGT00660000095458 EMBL:DAAA02063594
            EMBL:BC120003 IPI:IPI00717812 RefSeq:NP_001068884.1 UniGene:Bt.7264
            SMR:Q0VCU3 Ensembl:ENSBTAT00000014587 GeneID:509715 KEGG:bta:509715
            InParanoid:Q0VCU3 NextBio:20869091 Uniprot:Q0VCU3
        Length = 460

 Score = 436 (158.5 bits), Expect = 4.6e-41, P = 4.6e-41
 Identities = 110/308 (35%), Positives = 158/308 (51%)

Query:    21 EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAF 80
             + +++ Y + Y + EE   R  +F +N+   + + A      +  + +F+D T +EF+  
Sbjct:   164 KDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTARYGVTKFSDLTEEEFRTI 223

Query:    81 RNGYRRPDGLTSRKGTSFK-YENVIDVPATM-DWRKNGAVTPIKNQGPCGSCWAFSAVAA 138
                Y  P  L    G + +  + V DVP    DWR  GAVT +K+QG CGSCWAFS    
Sbjct:   224 ---YLNPL-LKDAPGRNMRPAQPVTDVPPPQWDWRNKGAVTNVKDQGMCGSCWAFSVTGN 279

Query:   139 TEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQA 198
              EG   L  G L+SLSEQEL+ CD +  D  C GG   +A+  I    G+ TE +Y Y+ 
Sbjct:   280 VEGQWFLKRGTLLSLSEQELLDCDKT--DKACLGGLPSNAYSAIRTLGGLETEDDYSYRG 337

Query:   199 VDGTCNKTNEASHVAKIKGYETVPANSEEALLKA--VANQPVAVSIDASGSAFQFYSSGV 256
                TC+ + E    AK+   ++V  +  E  L A    N PV+++I+A G   QFY  G+
Sbjct:   338 RLQTCSFSAEK---AKVYINDSVELSKNEQKLAAWLAKNGPVSIAINAFG--MQFYRHGI 392

Query:   257 ---FTGDCGTEL-DHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGL 312
                    C   L DH V  VGYG   +   +W +KNSWGT WGEEGY  + R      G 
Sbjct:   393 SHPLRPLCSPWLIDHAVLLVGYG-NRSAIPFWAIKNSWGTDWGEEGYYYLHRG----SGA 447

Query:   313 CGIAMDSS 320
             CG+ + +S
Sbjct:   448 CGVNIMAS 455


>RGD|1564827
            symbol:RGD1564827 "similar to cathepsin M" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 IPI:IPI00192321
            Ensembl:ENSRNOT00000023990 ArrayExpress:D3ZY04 Uniprot:D3ZY04
        Length = 338

 Score = 406 (148.0 bits), Expect = 1.1e-40, Sum P(2) = 1.1e-40
 Identities = 82/203 (40%), Positives = 111/203 (54%)

Query:   124 QGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFII 183
             QG C SCWAF  V A EG     TGKL  LS Q LV C     + GC GG   +AF++++
Sbjct:   139 QGRCNSCWAFPVVGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVL 198

Query:   184 HNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSID 243
              N G+ +EA YPY+  +G C     +S  AKI      P  +E+ L+ AVA +PVA  I 
Sbjct:   199 QNGGLESEATYPYEGKEGLCRYNPNSS--AKITXICAPPQKNEDVLMDAVATKPVAAGIH 256

Query:   244 ASGSAFQFYSSGVF-TGDCGTELDHGVTAVGYGATAN---GTKYWLVKNSWGTSWGEEGY 299
                S+ +FY  G++    C   ++H V  VGYG   N   G  YWL++NSWG  WG  GY
Sbjct:   257 VVHSSLRFYKKGIYHEPKCNNYVNHAVLVVGYGFEGNETDGNNYWLIQNSWGERWGLNGY 316

Query:   300 IRMKRDIDAKEGLCGIAMDSSYP 322
             +++ +D   +   CGIA  + YP
Sbjct:   317 MKIAKD---RNNHCGIATFAQYP 336

 Score = 43 (20.2 bits), Expect = 1.1e-40, Sum P(2) = 1.1e-40
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query:    13 EASLSEKHEQWMSKYGKVYKNPEEKEK 39
             + SL  + ++W  KY K+Y +P   +K
Sbjct:    22 DLSLDVQWQEWKMKYEKLY-SPVRIQK 47


>FB|FBgn0037396
            symbol:CG11459 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE014297 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 KO:K01365 HSSP:P07711 EMBL:AY060710
            RefSeq:NP_649608.1 UniGene:Dm.3894 SMR:Q9VNK6 MEROPS:C01.A31
            EnsemblMetazoa:FBtr0078623 GeneID:40741 KEGG:dme:Dmel_CG11459
            UCSC:CG11459-RA FlyBase:FBgn0037396 InParanoid:Q9VNK6 OMA:NYDEREL
            OrthoDB:EOG4MGQPX ChiTaRS:CG11459 GenomeRNAi:40741 NextBio:820359
            Uniprot:Q9VNK6
        Length = 336

 Score = 431 (156.8 bits), Expect = 1.6e-40, P = 1.6e-40
 Identities = 104/313 (33%), Positives = 161/313 (51%)

Query:    21 EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAA---GNKPYKLSINEFADQTNQEF 77
             +Q+ +KY K Y+N  +K  R  +++  V  +ES N     G   +K+ +N+F+D   +  
Sbjct:    31 DQYKAKYNKQYRN-RDKYHR-ALYEQRVLAVESHNQLYLQGKVAFKMGLNKFSDTDQRIL 88

Query:    78 KAFRNGYRRP-DGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGP-CGSCWAFSA 135
               +R+    P +  T+    +  Y+    +   +DWR+ G ++P+ +QG  C SCWAFS 
Sbjct:    89 FNYRSSIPAPLETSTNALTETVNYKRYDQITEGIDWRQYGYISPVGDQGTECLSCWAFST 148

Query:   136 VAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYP 195
                 E       G L+ LS + LV C     ++GC GG +  AF +   + GI T+ +YP
Sbjct:   149 SGVLEAHMAKKYGNLVPLSPKHLVDC-VPYPNNGCSGGWVSVAFNYT-RDHGIATKESYP 206

Query:   196 YQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN-QPVAVSIDASGSAFQFYSS 254
             Y+ V G C   ++ S    + GY T+    E  L + V N  PVAVSID     F  YS 
Sbjct:   207 YEPVSGECLWKSDRS-AGTLSGYVTLGNYDERELAEVVYNIGPVAVSIDHLHEEFDQYSG 265

Query:   255 GVFT-GDCGT---ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKE 310
             GV +   C +   +L H V  VG+G       YW++KNS+GT WGE GY+++ R+ +   
Sbjct:   266 GVLSIPACRSKRQDLTHSVLLVGFGTHRKWGDYWIIKNSYGTDWGESGYLKLARNAN--- 322

Query:   311 GLCGIAMDSSYPT 323
              +CG+A    YPT
Sbjct:   323 NMCGVASLPQYPT 335


>UNIPROTKB|Q9UBX1
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=TAS] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_6900 GO:GO:0019886 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0043202
            GO:GO:0004197 HOVERGEN:HBG011513 EMBL:AJ007331 EMBL:AF088886
            EMBL:AF132894 EMBL:AF136279 EMBL:AF071748 EMBL:AF071749
            EMBL:AK313657 EMBL:BC011682 EMBL:BC036451 EMBL:AL137742
            IPI:IPI00002816 RefSeq:NP_003784.2 UniGene:Hs.11590 PDB:1D5U
            PDB:1M6D PDBsum:1D5U PDBsum:1M6D ProteinModelPortal:Q9UBX1
            SMR:Q9UBX1 STRING:Q9UBX1 MEROPS:C01.018 PhosphoSite:Q9UBX1
            DMDM:12643325 PaxDb:Q9UBX1 PeptideAtlas:Q9UBX1 PRIDE:Q9UBX1
            DNASU:8722 Ensembl:ENST00000310325 GeneID:8722 KEGG:hsa:8722
            UCSC:uc001oip.3 CTD:8722 GeneCards:GC11M066332 HGNC:HGNC:2531
            HPA:CAB002141 MIM:603539 neXtProt:NX_Q9UBX1 PharmGKB:PA27031
            InParanoid:Q9UBX1 OMA:LAPPEWD OrthoDB:EOG4CC41T PhylomeDB:Q9UBX1
            BindingDB:Q9UBX1 ChEMBL:CHEMBL2517 ChiTaRS:CTSF
            EvolutionaryTrace:Q9UBX1 GenomeRNAi:8722 NextBio:32715
            ArrayExpress:Q9UBX1 Bgee:Q9UBX1 CleanEx:HS_CTSF
            Genevestigator:Q9UBX1 GermOnline:ENSG00000174080 Uniprot:Q9UBX1
        Length = 484

 Score = 431 (156.8 bits), Expect = 1.6e-40, P = 1.6e-40
 Identities = 111/304 (36%), Positives = 160/304 (52%)

Query:    27 YGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKA-FRNGYR 85
             Y + Y++ EE   R  +F +N+   + + A      +  + +F+D T +EF+  + N   
Sbjct:   194 YNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTLL 253

Query:    86 RPDGLTSRKGTSFKY-ENVIDV-PATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGIT 143
             R +      G   K  ++V D+ P   DWR  GAVT +K+QG CGSCWAFS     EG  
Sbjct:   254 RKE-----PGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQW 308

Query:   144 QLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTC 203
              L  G L+SLSEQEL+ CD   +D  C GG   +A+  I +  G+ TE +Y YQ    +C
Sbjct:   309 FLNQGTLLSLSEQELLDCDK--MDKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQSC 366

Query:   204 NKTNEASHVAKIKGYETVPANSEEALLKA-VANQ-PVAVSIDASGSAFQFYSSGV---FT 258
             N + E    AK+   ++V  +  E  L A +A + P++V+I+A G   QFY  G+     
Sbjct:   367 NFSAEK---AKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFG--MQFYRHGISRPLR 421

Query:   259 GDCGTEL-DHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGI-A 316
               C   L DH V  VGYG  ++   +W +KNSWGT WGE+GY  + R      G CG+  
Sbjct:   422 PLCSPWLIDHAVLLVGYGNRSD-VPFWAIKNSWGTDWGEKGYYYLHRG----SGACGVNT 476

Query:   317 MDSS 320
             M SS
Sbjct:   477 MASS 480


>DICTYBASE|DDB_G0281077
            symbol:DDB_G0281077 species:44689 "Dictyostelium
            discoideum" [GO:0030246 "carbohydrate binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR008965 InterPro:IPR013128 InterPro:IPR019028
            Pfam:PF00112 Pfam:PF09478 PRINTS:PR00705 SMART:SM00645
            SMART:SM01063 InterPro:IPR000169 dictyBase:DDB_G0281077
            GO:GO:0030246 EMBL:AAFI02000040 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 SUPFAM:SSF49384 ProtClustDB:CLSZ2430562
            RefSeq:XP_640803.1 ProteinModelPortal:Q54UH3
            EnsemblProtists:DDB0203998 GeneID:8622857 KEGG:ddi:DDB_G0281077
            InParanoid:Q54UH3 OMA:LINDFNF Uniprot:Q54UH3
        Length = 662

 Score = 355 (130.0 bits), Expect = 3.4e-38, Sum P(2) = 3.4e-38
 Identities = 75/185 (40%), Positives = 105/185 (56%)

Query:   107 PATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGV 166
             P ++DWR  G V+ +KNQG CGSC+AFS V A E        ++++LSEQ LV C T   
Sbjct:   472 PISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALEAHYYRKNNRMLNLSEQNLVDC-TRNY 530

Query:   167 DHG-CEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANS 225
              +G C GG M + F++I  N GI  ++ YPY+   G C + N     ++I  Y  +  + 
Sbjct:   531 GNGECSGGWMHNCFRYIKENGGINLQSTYPYEGRVGLC-RYNSGDAQSRISNYVMIKQHD 589

Query:   226 EEALLKAVANQ-PVAVSIDASGSAFQFYSSGVFTGD-CGT-ELDHGVTAVGYGATANGTK 282
             EE L  AVA+  PV+V+ DAS   F +YSSG++  D C      H V  VGYG   NG  
Sbjct:   590 EEDLANAVASVGPVSVAYDASTREFMYYSSGIYNSDSCDKYRTTHAVVVVGYGIE-NGVD 648

Query:   283 YWLVK 287
             +W++K
Sbjct:   649 FWIIK 653

 Score = 84 (34.6 bits), Expect = 3.4e-38, Sum P(2) = 3.4e-38
 Identities = 19/70 (27%), Positives = 35/70 (50%)

Query:     9 RKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAG-NKPYKLSIN 67
             RK +E        QW +++ + Y+  ++   ++  FKD+  FIE       N   +L + 
Sbjct:   152 RK-RELEYQNSFIQWSNQFNRTYR-ADQFLLKYEAFKDSSRFIEQYKRENQNSTMELGLT 209

Query:    68 EFADQTNQEF 77
             +F+D T+ EF
Sbjct:   210 QFSDMTHDEF 219


>WB|WBGene00011102
            symbol:R07E3.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            EMBL:Z49207 HSSP:P53634 PIR:T24030 RefSeq:NP_001041280.1
            ProteinModelPortal:Q21810 SMR:Q21810 STRING:Q21810 MEROPS:C01.A43
            PaxDb:Q21810 EnsemblMetazoa:R07E3.1a GeneID:181242
            KEGG:cel:CELE_R07E3.1 UCSC:R07E3.1a CTD:181242 WormBase:R07E3.1a
            HOGENOM:HOG000021028 InParanoid:Q21810 OMA:ACKNEVI NextBio:913066
            ArrayExpress:Q21810 Uniprot:Q21810
        Length = 402

 Score = 398 (145.2 bits), Expect = 4.9e-37, P = 4.9e-37
 Identities = 104/325 (32%), Positives = 165/325 (50%)

Query:    15 SLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAA---GNKPYKLSINEFAD 71
             ++++++  +  K+ K Y   +E  KR   + +  E I + N     G+  Y    N+ +D
Sbjct:    85 NIAKEYIAYTEKFDKSYATSQESLKRLNAYYNTDENIANWNIQNEHGSAEY--GHNDMSD 142

Query:    72 QTNQEFKAF---RNGYRR-----------PDGLTSRKGTSFKYENVIDVPATMDWRKNGA 117
              T++EF+     ++ Y+R           P+ LT++KG     E+    P   DWR    
Sbjct:   143 WTDEEFEKTLLPKSFYKRLHKEAEFIEPIPESLTAKKG-----ESSSPFPDFFDWRDKNV 197

Query:   118 VTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMED 177
             +TP+K QG CGSCWAF++ A  E    +  G+  +LSEQ L+ CD   VD+ C+GG+ + 
Sbjct:   198 ITPVKAQGQCGSCWAFASTATVEAAWAIAHGEKRNLSEQTLLDCDL--VDNACDGGDEDK 255

Query:   178 AFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN-Q 236
             AF++I H +G+    + PY A        N+  +  +IK    +  + E++++  + N  
Sbjct:   256 AFRYI-HRNGLANAVDLPYVAHRQNGCAVNDHWNTTRIKAAYFLH-HDEDSIINWLVNFG 313

Query:   237 PVAVSIDASGSAFQFYSSGVFTGD---CGTELD--HGVTAVGYGATANGTKYWLVKNSWG 291
             PV + + A     + Y  GVFT     C  E+   H +   GYG +  G KYW+VKNSWG
Sbjct:   314 PVNIGM-AVIQPMRAYKGGVFTPSEYACKNEVIGLHALLITGYGTSKTGEKYWIVKNSWG 372

Query:   292 TSWG-EEGYIRMKRDIDAKEGLCGI 315
              +WG E GYI   R I+A    CGI
Sbjct:   373 NTWGVEHGYIYFARGINA----CGI 393


>UNIPROTKB|P56202
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 GO:GO:0006955 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AF013611
            EMBL:AF015954 EMBL:AF055903 EMBL:AP001201 EMBL:BC048255
            IPI:IPI00328978 RefSeq:NP_001326.2 UniGene:Hs.416848
            ProteinModelPortal:P56202 SMR:P56202 STRING:P56202 MEROPS:C01.037
            PhosphoSite:P56202 DMDM:259016196 PaxDb:P56202 PRIDE:P56202
            Ensembl:ENST00000307886 GeneID:1521 KEGG:hsa:1521 UCSC:uc001ogc.1
            CTD:1521 GeneCards:GC11P065647 HGNC:HGNC:2546 HPA:CAB016345
            MIM:602364 neXtProt:NX_P56202 PharmGKB:PA27042 eggNOG:NOG288820
            HOVERGEN:HBG100117 InParanoid:P56202 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 PhylomeDB:P56202 GenomeRNAi:1521 NextBio:6295
            ArrayExpress:P56202 Bgee:P56202 CleanEx:HS_CTSW
            Genevestigator:P56202 GermOnline:ENSG00000172543 Uniprot:P56202
        Length = 376

 Score = 307 (113.1 bits), Expect = 6.4e-37, Sum P(2) = 6.4e-37
 Identities = 84/281 (29%), Positives = 133/281 (47%)

Query:    12 QEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFAD 71
             Q   L E  + +  ++ + Y +PEE   R  IF  N+   + L        +  +  F+D
Sbjct:    34 QPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSD 93

Query:    72 QTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVID-VPATMDWRK-NGAVTPIKNQGPCGS 129
              T +EF     GYRR  G     G   + E   + VP + DWRK   A++PIK+Q  C  
Sbjct:    94 LTEEEFGQLY-GYRRAAGGVPSMGREIRSEEPEESVPFSCDWRKVASAISPIKDQKNCNC 152

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CWA +A    E + +++    + +S QEL+ C   G D GC GG + DAF  +++N G+ 
Sbjct:   153 CWAMAAAGNIETLWRISFWDFVDVSVQELLDCGRCG-D-GCHGGFVWDAFITVLNNSGLA 210

Query:   190 TEANYPYQA-VDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGS 247
             +E +YP+Q  V        +   VA I+ +  +  N+E  + + +A   P+ V+I+    
Sbjct:   211 SEKDYPFQGKVRAHRCHPKKYQKVAWIQDFIMLQ-NNEHRIAQYLATYGPITVTINMK-- 267

Query:   248 AFQFYSSGVFTGD---CGTEL-DHGVTAVGYGATANGTKYW 284
               Q Y  GV       C  +L DH V  VG+G+  +    W
Sbjct:   268 PLQLYRKGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEGIW 308

 Score = 106 (42.4 bits), Expect = 6.4e-37, Sum P(2) = 6.4e-37
 Identities = 18/35 (51%), Positives = 23/35 (65%)

Query:   281 TKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGI 315
             T YW++KNSWG  WGE+GY R+ R  +     CGI
Sbjct:   324 TPYWILKNSWGAQWGEKGYFRLHRGSNT----CGI 354


>UNIPROTKB|F1MHV4
            symbol:CTSW "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 OMA:GRCGDGC EMBL:DAAA02063574
            IPI:IPI00716321 Ensembl:ENSBTAT00000027681 Uniprot:F1MHV4
        Length = 375

 Score = 305 (112.4 bits), Expect = 8.2e-37, Sum P(2) = 8.2e-37
 Identities = 81/275 (29%), Positives = 132/275 (48%)

Query:    12 QEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFAD 71
             Q   L E    +  +Y + Y NP E  +R  IF  N+   + L        +  + +F+D
Sbjct:    34 QPLELKEVFRLFQMQYNRSYPNPAEYARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSD 93

Query:    72 QTNQEFKAFRNGYRRPDGL-TSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSC 130
              T +EF          + L  SRK  S ++      P T DWRK G ++P+++Q  C  C
Sbjct:    94 LTEEEFVQLYGSQVAGEALGVSRKVGSEEWGE--SEPQTCDWRKVGTISPVRDQRNCNCC 151

Query:   131 WAFSAVAATEGITQLTTGKLISLSEQ-ELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             WA +A    E +  +     + +S Q EL+ CD  G  +GC GG + DAF  +++N G+ 
Sbjct:   152 WAMAAAGNIEALWAIKFRHFVEVSVQPELLDCDRCG--NGCRGGFVWDAFLTVLNNSGLA 209

Query:   190 TEANYPYQAVDGT--CNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASG 246
             +E +YP+     T  C    +   VA I+ +  + A  E+++ + +A + P+ V+I+ + 
Sbjct:   210 SEKDYPFNGSGKTHRC-LAKKYKKVAWIQDFIILQA-CEQSMARHLATEGPITVTINMT- 266

Query:   247 SAFQFYSSGVFTGD---CG-TELDHGVTAVGYGAT 277
                Q Y  GV       C  T++DH V  VG+G T
Sbjct:   267 -LLQQYQKGVIKATPTTCDPTQVDHSVLLVGFGKT 300

 Score = 107 (42.7 bits), Expect = 8.2e-37, Sum P(2) = 8.2e-37
 Identities = 21/48 (43%), Positives = 28/48 (58%)

Query:   271 AVGYGATANGTK---YWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGI 315
             A  +G+ A   +   YW++KNSWG  WGEEGY R+ R  +     CGI
Sbjct:   310 AASFGSHARPRRSMAYWILKNSWGPQWGEEGYFRLHRGSNT----CGI 353


>DICTYBASE|DDB_G0281079
            symbol:DDB_G0281079 species:44689 "Dictyostelium
            discoideum" [GO:0030246 "carbohydrate binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR008965 InterPro:IPR013128 InterPro:IPR019028
            Pfam:PF00112 Pfam:PF09478 PRINTS:PR00705 SMART:SM00645
            SMART:SM01063 InterPro:IPR000169 dictyBase:DDB_G0281079
            GO:GO:0030246 EMBL:AAFI02000040 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 SUPFAM:SSF49384 RefSeq:XP_640804.1
            ProteinModelPortal:Q54UH2 EnsemblProtists:DDB0204000 GeneID:8622858
            KEGG:ddi:DDB_G0281079 InParanoid:Q54UH2 OMA:ALESHYY
            ProtClustDB:CLSZ2430562 Uniprot:Q54UH2
        Length = 664

 Score = 338 (124.0 bits), Expect = 2.7e-36, Sum P(2) = 2.7e-36
 Identities = 72/186 (38%), Positives = 100/186 (53%)

Query:   107 PATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGV 166
             P ++DWR  G V+ +KNQG CGSC+AFS V A E        +++ LSEQ LV C  S  
Sbjct:   471 PISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALESHYYRKNNRMLDLSEQNLVDCTASNK 530

Query:   167 --DHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPAN 224
               + GC GG M + + +I  N GI  E+ YPY+   G C + N     ++I  +  +  +
Sbjct:   531 YRNGGCSGGWMHNCYSYIQENGGINQESTYPYEGKFGQC-RYNSGDAQSRISKFVMIKQH 589

Query:   225 SEEALLKAVANQ-PVAVSIDASGSAFQFYSSGVFTGD-CGT-ELDHGVTAVGYGATANGT 281
              EE L   VA+  PV+V+ DAS   F +YS G++  D C      H V  VGY    NG 
Sbjct:   590 DEEDLADTVASVGPVSVAYDASTREFMYYSRGIYYSDNCNKYRTTHAVVVVGYD-NENGV 648

Query:   282 KYWLVK 287
              YW++K
Sbjct:   649 DYWIIK 654

 Score = 84 (34.6 bits), Expect = 2.7e-36, Sum P(2) = 2.7e-36
 Identities = 19/70 (27%), Positives = 35/70 (50%)

Query:     9 RKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAG-NKPYKLSIN 67
             RK +E        QW +++ + Y+  ++   ++  FKD+  FIE       N   +L + 
Sbjct:   151 RK-RELEYQNSFIQWSNQFNRTYR-ADQFLLKYEAFKDSSRFIEQYKRENQNSTMELGLT 208

Query:    68 EFADQTNQEF 77
             +F+D T+ EF
Sbjct:   209 QFSDMTHDEF 218

 Score = 41 (19.5 bits), Expect = 8.6e-32, Sum P(2) = 8.6e-32
 Identities = 17/78 (21%), Positives = 33/78 (42%)

Query:     5 QVTSRKLQEASLSEKHEQWMSK-YGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYK 63
             ++T + + + +    HE    K    +  +   + K F I  +NV+  +SL++       
Sbjct:    46 EITVKVIIKNNHDHDHEHQPHKELDDIQDDKANRCKGFNINNNNVDGSDSLDSEIGSGGD 105

Query:    64 LSINEFADQTNQEFKAFR 81
             +S +   D  N E  A R
Sbjct:   106 ISNDSNGDNENNEENAKR 123


>RGD|1309354
            symbol:Ctsw "cathepsin W" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1309354 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 MEROPS:C01.037
            CTD:1521 eggNOG:NOG288820 HOVERGEN:HBG100117 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 EMBL:CH473953 EMBL:BC093401 IPI:IPI00371471
            RefSeq:NP_001019413.1 UniGene:Rn.34406 Ensembl:ENSRNOT00000037404
            GeneID:293676 KEGG:rno:293676 UCSC:RGD:1309354 InParanoid:Q561Q9
            NextBio:636716 Genevestigator:Q561Q9 Uniprot:Q561Q9
        Length = 371

 Score = 294 (108.6 bits), Expect = 5.7e-36, Sum P(2) = 5.7e-36
 Identities = 79/275 (28%), Positives = 135/275 (49%)

Query:    16 LSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQ 75
             L E  + +  ++ + Y NP E  +R  IF  N+   + L        +     F+D T +
Sbjct:    36 LKEVFKLFQIQFNRSYSNPAEYTRRLGIFAHNLAQAQRLQEEDLGTAEFGQTPFSDLTEE 95

Query:    76 EFKAFRNGYRRPDGLTS--RKGTSFKYENVIDVPATMDWRK-NGAVTPIKNQGPCGSCWA 132
             EF       R P+ + +  +K  S ++     VP T DWRK    ++ IKNQG C  CWA
Sbjct:    96 EFGQLYGHQRAPERILNMAKKVKSERWGE--SVPPTCDWRKVKNIISSIKNQGNCRCCWA 153

Query:   133 FSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEA 192
              +A    + + ++ T + + +S QEL+ CD  G  +GC GG + DA+  +++N G+ +E 
Sbjct:   154 IAAADNIQTLWRIKTQQFVDVSVQELLDCDRCG--NGCNGGFVWDAYITVLNNSGLASEE 211

Query:   193 NYPYQAVDGT--CNKTNEASHVAKIKGYETVPANSEEALLKAVA-NQPVAVSIDASGSAF 249
             +YP+Q       C   ++   VA I+ + T+ +++E+ +   +A + P+ V+I+      
Sbjct:   212 DYPFQGHQKPHRC-LADKYRKVAWIQDF-TMLSSNEQVIAGYLAIHGPITVTINMK--LL 267

Query:   250 QFYSSGVFTGD---CGTEL-DHGVTAVGYGATANG 280
             Q+Y  GV       C   L +H V  VG+G    G
Sbjct:   268 QYYQKGVIKATPSTCDPHLVNHSVLLVGFGKEKGG 302

 Score = 110 (43.8 bits), Expect = 5.7e-36, Sum P(2) = 5.7e-36
 Identities = 21/42 (50%), Positives = 25/42 (59%)

Query:   281 TKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
             T YW++KNSWG  WGE+GY R+ R        CGIA    YP
Sbjct:   319 TPYWILKNSWGAEWGEKGYFRLYRG----NNTCGIA---KYP 353


>UNIPROTKB|E2RPX3
            symbol:CTSW "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 CTD:1521 KO:K08569 OMA:GRCGDGC
            EMBL:AAEX03011632 RefSeq:XP_540846.2 Ensembl:ENSCAFT00000020910
            GeneID:483725 KEGG:cfa:483725 Uniprot:E2RPX3
        Length = 374

 Score = 294 (108.6 bits), Expect = 9.2e-36, Sum P(2) = 9.2e-36
 Identities = 83/273 (30%), Positives = 129/273 (47%)

Query:    12 QEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFAD 71
             Q   L +    +  +Y + Y NPEE  +R  IF  N+   + L        +  +  F+D
Sbjct:    34 QPLELKQVFALFQIQYNRSYSNPEEYARRLDIFAHNLAQAQQLEDEDLGTAEFGVTPFSD 93

Query:    72 QTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVID-VPATMDWRK-NGAVTPIKNQGPCGS 129
              T +EF  F  G++R  G     G   + E   + VP T DWRK  G ++PIK QG C  
Sbjct:    94 LTEEEFGQFY-GHQRMAGEAPSVGRKVESEEWGEPVPPTCDWRKLPGIISPIKQQGNCRC 152

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CWA +A    E +  +   + + +S QEL+ C   G D GC+GG   DAF  +++N G+ 
Sbjct:   153 CWAMAAAGNIEALWGIRYHQPVEVSVQELLDCGRCG-D-GCKGGFTWDAFITVLNNSGLA 210

Query:   190 TEANYPY--QAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASG 246
             +  +YP+        C    +   VA I+ +  +  N E+A+   +A + P+ V+I+   
Sbjct:   211 SAKDYPFLGNTKPHRC-LAKKYKKVAWIQDFIMLQGN-EQAIAWYLATKGPITVTINMK- 267

Query:   247 SAFQFYSSGVFTGD---CGTE-LDHGVTAVGYG 275
                Q Y  GV       C  + +DH V  VG+G
Sbjct:   268 -LLQHYQKGVIQATHTTCDPQRVDHSVLLVGFG 299

 Score = 108 (43.1 bits), Expect = 9.2e-36, Sum P(2) = 9.2e-36
 Identities = 20/40 (50%), Positives = 24/40 (60%)

Query:   283 YWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
             YW++KNSWG  WGEEGY R+ R        CGI   + YP
Sbjct:   324 YWILKNSWGAEWGEEGYFRLHRG----NNTCGI---TKYP 356


>WB|WBGene00013764
            symbol:Y113G7B.15 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 GeneTree:ENSGT00560000076599
            EMBL:AL110477 HOGENOM:HOG000019851 RefSeq:NP_507904.2
            ProteinModelPortal:Q9U2X1 SMR:Q9U2X1 DIP:DIP-25339N IntAct:Q9U2X1
            MINT:MINT-1058673 STRING:Q9U2X1 MEROPS:C01.A47
            EnsemblMetazoa:Y113G7B.15 GeneID:190976 KEGG:cel:CELE_Y113G7B.15
            UCSC:Y113G7B.15 CTD:190976 WormBase:Y113G7B.15 eggNOG:NOG302449
            OMA:AEEDIME Uniprot:Q9U2X1
        Length = 362

 Score = 313 (115.2 bits), Expect = 3.1e-35, Sum P(2) = 3.1e-35
 Identities = 81/241 (33%), Positives = 124/241 (51%)

Query:    85 RRPDGLTSRKGTSFKYENVIDVPATMDWRK---NGA--VTPIKNQGPCGSCWAFSAVAAT 139
             R P G  +      K ++  D+P   D R    +G+  V P+K+Q  CG CWAF+  A T
Sbjct:   111 RHPRGSRNHHNKRSKRQSG-DIPDYFDLRDIYVDGSPVVGPVKDQEQCGCCWAFATTAIT 169

Query:   140 EGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQA- 198
             E    L +    SLS+QE+  C  SG   GC GG+  +  K ++H  G +++ +YPY+  
Sbjct:   170 EAANTLYSKSFTSLSDQEICDCADSGDTPGCVGGDPRNGLK-MVHLRGQSSDGDYPYEEY 228

Query:   199 ---VDGTCNKTNEASHVAK---IKGYETVPANSEEALLKAV-ANQ-PVAVSIDASGSAFQ 250
                  G C   +E S V +   +  Y      +EE +++ +  N  P AV     G  F+
Sbjct:   229 RANTTGNC-VGDEKSTVIQPETLNVYRFDQDYAEEDIMENLYLNHIPTAVYFRV-GENFE 286

Query:   251 FYSSGVFTG-DCG--TELD-HGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDI 306
             +Y+SGV    DC   T  + H V  VGYG + +G  YWLV+NSW + WG  GY++++R +
Sbjct:   287 WYTSGVLQSEDCYQMTPAEWHSVAIVGYGTSDDGVPYWLVRNSWNSDWGLHGYVKIRRGV 346

Query:   307 D 307
             +
Sbjct:   347 N 347

 Score = 84 (34.6 bits), Expect = 3.1e-35, Sum P(2) = 3.1e-35
 Identities = 23/66 (34%), Positives = 32/66 (48%)

Query:    29 KVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYK---LSINEFADQTNQEFKAFRNGYR 85
             K Y+ P EK++R   F  N + I+ LNA   +  +      N+FAD+  QE  A RN   
Sbjct:    39 KHYRTPAEKDRRLAHFAKNHQKIQELNAKARREGRNVTFGWNKFADKNRQELSA-RNSKI 97

Query:    86 RPDGLT 91
              P   T
Sbjct:    98 HPKNHT 103


>MGI|MGI:1338045
            symbol:Ctsw "cathepsin W" species:10090 "Mus musculus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 MGI:MGI:1338045 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 MEROPS:C01.037
            CTD:1521 eggNOG:NOG288820 HOVERGEN:HBG100117 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 EMBL:AF014941 EMBL:AC122861 IPI:IPI00111727
            RefSeq:NP_034115.2 UniGene:Mm.113590 ProteinModelPortal:P56203
            SMR:P56203 PhosphoSite:P56203 PRIDE:P56203 DNASU:13041
            Ensembl:ENSMUST00000025844 GeneID:13041 KEGG:mmu:13041
            InParanoid:P56203 NextBio:282936 Bgee:P56203 CleanEx:MM_CTSW
            Genevestigator:P56203 GermOnline:ENSMUSG00000024910 Uniprot:P56203
        Length = 371

 Score = 292 (107.8 bits), Expect = 5.0e-35, Sum P(2) = 5.0e-35
 Identities = 78/276 (28%), Positives = 136/276 (49%)

Query:    16 LSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQ 75
             L E  + +  ++ + Y NP E  +R  IF  N+   + L        +     F+D T +
Sbjct:    36 LKEVFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLGTAEFGETPFSDLTEE 95

Query:    76 EFKAFRNGYRRPD---GLTSRKGTSFKYENVIDVPATMDWRK-NGAVTPIKNQGPCGSCW 131
             EF       R P+    +T +  ++   E+V   P T DWRK    ++ +KNQG C  CW
Sbjct:    96 EFGQLYGQERSPERTPNMTKKVESNTWGESV---PRTCDWRKAKNIISSVKNQGSCKCCW 152

Query:   132 AFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTE 191
             A +A    + + ++   + + +S QEL+ C+  G  +GC GG + DA+  +++N G+ +E
Sbjct:   153 AMAAADNIQALWRIKHQQFVDVSVQELLDCERCG--NGCNGGFVWDAYLTVLNNSGLASE 210

Query:   192 ANYPYQAVDGTCNK--TNEASHVAKIKGYETVPANSEEALLKAVA-NQPVAVSIDASGSA 248
              +YP+Q  D   ++    +   VA I+ + T+ +N+E+A+   +A + P+ V+I+     
Sbjct:   211 KDYPFQG-DRKPHRCLAKKYKKVAWIQDF-TMLSNNEQAIAHYLAVHGPITVTINMK--L 266

Query:   249 FQFYSSGVFTG---DCGT-ELDHGVTAVGYGATANG 280
              Q Y  GV       C   ++DH V  VG+G    G
Sbjct:   267 LQHYQKGVIKATPSSCDPRQVDHSVLLVGFGKEKEG 302

 Score = 103 (41.3 bits), Expect = 5.0e-35, Sum P(2) = 5.0e-35
 Identities = 25/75 (33%), Positives = 35/75 (46%)

Query:   264 ELDHGVTAVGYGATANGTK----------------YWLVKNSWGTSWGEEGYIRMKRDID 307
             ++DH V  VG+G    G +                YW++KNSWG  WGE+GY R+ R   
Sbjct:   286 QVDHSVLLVGFGKEKEGMQTGTVLSHSRKRRHSSPYWILKNSWGAHWGEKGYFRLYRG-- 343

Query:   308 AKEGLCGIAMDSSYP 322
                  CG+   + YP
Sbjct:   344 --NNTCGV---TKYP 353


>UNIPROTKB|Q5T8F0
            symbol:CTSL1 "Cathepsin L1 light chain" species:9606 "Homo
            sapiens" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            EMBL:AL160279 UniGene:Hs.731507 UniGene:Hs.731952 HGNC:HGNC:2537
            ChiTaRS:CTSL1 IPI:IPI00640540 SMR:Q5T8F0 Ensembl:ENST00000342020
            ChEMBL:CHEMBL1293261 Uniprot:Q5T8F0
        Length = 225

 Score = 373 (136.4 bits), Expect = 2.2e-34, P = 2.2e-34
 Identities = 80/202 (39%), Positives = 120/202 (59%)

Query:    15 SLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAA---GNKPYKLSINEFAD 71
             SL  +  +W + + ++Y   EE  +R  +++ N++ IE  N     G   + +++N F D
Sbjct:    24 SLEAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYREGKHSFTMAMNAFGD 82

Query:    72 QTNQEFKAFRNGY--RRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGS 129
              T++EF+   NG+  R+P     RKG  F+     + P ++DWR+ G VTP+KNQG CGS
Sbjct:    83 MTSEEFRQVMNGFQNRKP-----RKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGS 137

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CWAFSA  A EG     TG+LISLSEQ LV C     + GC GG M+ AF+++  N G+ 
Sbjct:   138 CWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLD 197

Query:   190 TEANYPYQA-VDGT-CNKTNEA 209
             +E +YPY+A V G  C+ ++ A
Sbjct:   198 SEESYPYEATVSGAPCHHSSSA 219


>UNIPROTKB|E9PI30
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            EMBL:AP001201 HGNC:HGNC:2546 IPI:IPI00984532
            ProteinModelPortal:E9PI30 SMR:E9PI30 Ensembl:ENST00000528419
            ArrayExpress:E9PI30 Bgee:E9PI30 Uniprot:E9PI30
        Length = 364

 Score = 307 (113.1 bits), Expect = 3.4e-34, Sum P(2) = 3.4e-34
 Identities = 84/281 (29%), Positives = 133/281 (47%)

Query:    12 QEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFAD 71
             Q   L E  + +  ++ + Y +PEE   R  IF  N+   + L        +  +  F+D
Sbjct:    34 QPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSD 93

Query:    72 QTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVID-VPATMDWRK-NGAVTPIKNQGPCGS 129
              T +EF     GYRR  G     G   + E   + VP + DWRK   A++PIK+Q  C  
Sbjct:    94 LTEEEFGQLY-GYRRAAGGVPSMGREIRSEEPEESVPFSCDWRKVASAISPIKDQKNCNC 152

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CWA +A    E + +++    + +S QEL+ C   G D GC GG + DAF  +++N G+ 
Sbjct:   153 CWAMAAAGNIETLWRISFWDFVDVSVQELLDCGRCG-D-GCHGGFVWDAFITVLNNSGLA 210

Query:   190 TEANYPYQA-VDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGS 247
             +E +YP+Q  V        +   VA I+ +  +  N+E  + + +A   P+ V+I+    
Sbjct:   211 SEKDYPFQGKVRAHRCHPKKYQKVAWIQDFIMLQ-NNEHRIAQYLATYGPITVTINMK-- 267

Query:   248 AFQFYSSGVFTGD---CGTEL-DHGVTAVGYGATANGTKYW 284
               Q Y  GV       C  +L DH V  VG+G+  +    W
Sbjct:   268 PLQLYRKGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEGIW 308

 Score = 80 (33.2 bits), Expect = 3.4e-34, Sum P(2) = 3.4e-34
 Identities = 11/17 (64%), Positives = 14/17 (82%)

Query:   281 TKYWLVKNSWGTSWGEE 297
             T YW++KNSWG  WGE+
Sbjct:   324 TPYWILKNSWGAQWGEK 340


>ZFIN|ZDB-GENE-080724-8
            symbol:ctso "cathepsin O" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            ZFIN:ZDB-GENE-080724-8 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 CTD:1519 KO:K01374 EMBL:CR931784
            IPI:IPI00513613 RefSeq:XP_695717.3 UniGene:Dr.88386
            Ensembl:ENSDART00000074786 GeneID:567333 KEGG:dre:567333
            NextBio:20888622 Uniprot:E7FA09
        Length = 334

 Score = 371 (135.7 bits), Expect = 3.6e-34, P = 3.6e-34
 Identities = 88/276 (31%), Positives = 138/276 (50%)

Query:    50 FIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDV--P 107
             F+ S     N+  +  +N+F+  + ++FK     Y       + K    K E  +    P
Sbjct:    66 FLNSALGKSNQSAQYGVNQFSYLSQKQFK---EQYLTARAEAAPKFDQSKSEIKVKANNP 122

Query:   108 ATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVD 167
                DWR +G V P+ NQG CG CWAFS V A E ++     KL  LS Q+++ C  S  +
Sbjct:   123 PRFDWRDHGVVGPVHNQGSCGGCWAFSIVEAIESVSAKGGEKLQQLSVQQVIDC--SYQN 180

Query:   168 HGCEGGEMEDAFKFIIHND-GITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVP-ANS 225
              GC GG   +A  ++  +   + +EA YP++  DG C    +A     ++ Y     +  
Sbjct:   181 QGCNGGSPVEALYWLTQSKLKLVSEAEYPFKGADGVCQFFPQAHAGVAVRNYSAYDFSGQ 240

Query:   226 EEALLKAVAN-QPVAVSIDASGSAFQFYSSGVFTGDCGT-ELDHGVTAVGYGATANGTKY 283
             EE ++ A+ +  P+ V +DA   ++Q Y  G+    C + + +H V   GY  T     Y
Sbjct:   241 EEVMMSALVDFGPLVVIVDAI--SWQDYLGGIIQHHCSSHKANHAVLITGYDTTGE-VPY 297

Query:   284 WLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDS 319
             W+V+NSWGTSWG++GY  +K   D    +CG+A DS
Sbjct:   298 WIVRNSWGTSWGDDGYAYIKIGND----VCGVA-DS 328


>UNIPROTKB|E1BPI9
            symbol:CTSO "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 CTD:1519 KO:K01374 OMA:SNVCGIA
            EMBL:DAAA02044933 IPI:IPI01004081 RefSeq:XP_002694471.2
            RefSeq:XP_874012.4 Ensembl:ENSBTAT00000014691 GeneID:616804
            KEGG:bta:616804 Uniprot:E1BPI9
        Length = 313

 Score = 366 (133.9 bits), Expect = 1.2e-33, P = 1.2e-33
 Identities = 91/285 (31%), Positives = 144/285 (50%)

Query:    41 FRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKY 100
             FR   +   ++ SL    N      IN+F+    +EFKA    Y R       +  + +Y
Sbjct:    36 FRESLNRQRYLNSLFPYENSTAVYGINQFSYLFPEEFKAI---YLRSSPSRFPRFPAEEY 92

Query:   101 ENV--IDVPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQEL 158
              ++  + +P   DWR    VT ++NQ  CG CWAFS V A E +  +    L  LS Q++
Sbjct:    93 TSISNLSLPLRFDWRDKHVVTQVRNQKTCGGCWAFSVVGAVESVCAIKGQPLEVLSVQQV 152

Query:   159 VSCDTSGVDHGCEGGEMEDAFKFIIHND-GITTEANYPYQAVDGTCNKTNEASHVAKIKG 217
             + C  S  ++GC GG    A  ++      +  ++ YP+QA +G C   +++   + IKG
Sbjct:   153 IDCSYS--NYGCNGGSPLSALYWLNKLQVKLVRDSEYPFQAQNGLCRYFSDSHSGSSIKG 210

Query:   218 YETVP-ANSEEALLKAV-ANQPVAVSIDASGSAFQFYSSGVFTGDCGT-ELDHGVTAVGY 274
             Y     +  E+ + +A+ A  P+ V +DA   ++Q Y  G+    C + E +H V   G+
Sbjct:   211 YSAYDFSGQEDKMAEALLALGPLIVVVDAM--SWQDYLGGIIQHHCSSGEANHAVLVTGF 268

Query:   275 GATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDS 319
               T +   YW+V+NSWGTSWG +GY+R+K        +CGIA DS
Sbjct:   269 DKTGS-IPYWIVRNSWGTSWGIDGYVRVKMG----GNVCGIA-DS 307


>UNIPROTKB|F1RU23
            symbol:CTSW "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 KO:K08569 EMBL:CU928325
            RefSeq:XP_003122571.1 UniGene:Ssc.28940 Ensembl:ENSSSCT00000014177
            GeneID:100525853 KEGG:ssc:100525853 OMA:CWAMAAV Uniprot:F1RU23
        Length = 367

 Score = 360 (131.8 bits), Expect = 5.2e-33, P = 5.2e-33
 Identities = 100/330 (30%), Positives = 157/330 (47%)

Query:    12 QEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFAD 71
             Q   L E    +  +Y + Y NP E  +R  IF  N+   + L        +  +  F+D
Sbjct:    34 QPMGLKEVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSD 93

Query:    72 QTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVID-VPATMDWRKN-GAVTPIKNQGPCGS 129
              T +EF    +G+    G     G     E   + VP + DWRK  G ++ IK+Q  C  
Sbjct:    94 LTEEEFGQL-HGHHWGAGKAPSMGIKVGSEESGETVPQSCDWRKKPGVISAIKHQKDCNC 152

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGIT 189
             CWA +AV   E    +   + + LS Q+++ CD  G  +GC GG + DAF  +++  G+ 
Sbjct:   153 CWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRCG--NGCNGGFVWDAFLTVLNTSGLA 210

Query:   190 TEANYPYQAVDGTCNKTNEASH--VAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASG 246
             +E +YPY+    T ++     H  VA I+ +  +    E+++ + +A + P+ V+I+A  
Sbjct:   211 SEQDYPYKGTVKT-HRCLAKQHRKVAWIQDFLMLQF-CEQSIARYLATEGPITVTINAG- 267

Query:   247 SAFQFYSSGVFTGD---CGTEL-DHGVTAVGYGATAN--GTK--------YWLVKNSWGT 292
                Q Y  GV       C   L +H V  VG+G + +  G +        YW++KNSWG 
Sbjct:   268 -LLQQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGRRPRPGHSIPYWILKNSWGP 326

Query:   293 SWGEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
              WGEEGY R+ R  +     CGI   + YP
Sbjct:   327 DWGEEGYFRLHRGSNT----CGI---TKYP 349


>UNIPROTKB|P43234
            symbol:CTSO "Cathepsin O" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 Reactome:REACT_6900
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0004197
            CleanEx:HS_CTSO EMBL:X77383 EMBL:BC049206 IPI:IPI00017257
            PIR:A55090 RefSeq:NP_001325.1 UniGene:Hs.75262
            ProteinModelPortal:P43234 SMR:P43234 IntAct:P43234 STRING:P43234
            MEROPS:C01.035 PhosphoSite:P43234 DMDM:1168795 PRIDE:P43234
            DNASU:1519 Ensembl:ENST00000433477 GeneID:1519 KEGG:hsa:1519
            UCSC:uc003ipg.3 CTD:1519 GeneCards:GC04M156845 HGNC:HGNC:2542
            HPA:HPA002041 MIM:600550 neXtProt:NX_P43234 PharmGKB:PA27040
            HOVERGEN:HBG105050 InParanoid:P43234 KO:K01374 OMA:SNVCGIA
            OrthoDB:EOG4V6ZH1 PhylomeDB:P43234 BindingDB:P43234
            ChEMBL:CHEMBL3035 GenomeRNAi:1519 NextBio:6287 Bgee:P43234
            Genevestigator:P43234 GermOnline:ENSG00000151792 Uniprot:P43234
        Length = 321

 Score = 355 (130.0 bits), Expect = 1.8e-32, P = 1.8e-32
 Identities = 88/295 (29%), Positives = 149/295 (50%)

Query:    34 PEEKEKRFRIFKDNVE---FIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGYRRPDGL 90
             P  +E+    F++++    ++ SL  + N      IN+F+    +EFKA    Y R    
Sbjct:    34 PRSREREAAAFRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAI---YLRSKPS 90

Query:    91 TSRKGTSFKYENV--IDVPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTG 148
                + ++  + ++  + +P   DWR    VT ++NQ  CG CWAFS V A E    +   
Sbjct:    91 KFPRYSAEVHMSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGK 150

Query:   149 KLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHND-GITTEANYPYQAVDGTCNKTN 207
              L  LS Q+++ C  +  ++GC GG   +A  ++      +  ++ YP++A +G C+  +
Sbjct:   151 PLEDLSVQQVIDCSYN--NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFS 208

Query:   208 EASHVAKIKGYETVP-ANSEEALLKAVAN-QPVAVSIDASGSAFQFYSSGVFTGDCGT-E 264
              +     IKGY     ++ E+ + KA+    P+ V +DA   ++Q Y  G+    C + E
Sbjct:   209 GSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAV--SWQDYLGGIIQHHCSSGE 266

Query:   265 LDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDS 319
              +H V   G+  T + T YW+V+NSWG+SWG +GY  +K        +CGIA DS
Sbjct:   267 ANHAVLITGFDKTGS-TPYWIVRNSWGSSWGVDGYAHVKMG----SNVCGIA-DS 315


>DICTYBASE|DDB_G0276111
            symbol:DDB_G0276111 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0276111 Pfam:PF00188
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            PROSITE:PS00139 EMBL:AAFI02000014 Gene3D:3.40.33.10
            InterPro:IPR001283 InterPro:IPR014044 PRINTS:PR00837 SMART:SM00198
            SUPFAM:SSF55797 ProtClustDB:CLSZ2429919 RefSeq:XP_643261.1
            ProteinModelPortal:Q75JH0 EnsemblProtists:DDB0169514 GeneID:8620304
            KEGG:ddi:DDB_G0276111 InParanoid:Q75JH0 OMA:GFVTSIK Uniprot:Q75JH0
        Length = 415

 Score = 348 (127.6 bits), Expect = 9.8e-32, P = 9.8e-32
 Identities = 81/210 (38%), Positives = 114/210 (54%)

Query:   110 MDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKL----ISLSEQELVSCDTSG 165
             +DW+  G VT IKNQG CG C++F+  AA E    L    L    I LSEQ  VSC    
Sbjct:   213 VDWKSLGFVTSIKNQGQCGGCYSFATCAALES-AYLIKNNLPNTDIDLSEQNFVSC---- 267

Query:   166 VDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANS 225
             V++GC GG  +     +  + GI  E +YPY+AV G+C    ++    K  GY  +  N 
Sbjct:   268 VNYGCGGGNGQSCLDKL-KSTGIMYETSYPYKAVTGSCPNVIQSPQPFKWTGYSNIQGN- 325

Query:   226 EEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWL 285
             +EA L A+ + P+  S+    S FQ Y SG+++    +  +H +T VGY +  N    +L
Sbjct:   326 KEAFLNALKSGPIYASLYVD-SGFQLYKSGIYSCSQSSTPNHAITIVGYSSADNS---YL 381

Query:   286 VKNSWGTSWGEEGYIRMKRDIDAKEGLCGI 315
             +KNSWGT +GE GYIR+K      EG C +
Sbjct:   382 IKNSWGTIYGESGYIRLK------EGSCNL 405


>UNIPROTKB|F1PGK4
            symbol:CTSO "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 OMA:SNVCGIA
            EMBL:AAEX03010073 Ensembl:ENSCAFT00000013638 Uniprot:F1PGK4
        Length = 316

 Score = 346 (126.9 bits), Expect = 1.6e-31, P = 1.6e-31
 Identities = 87/294 (29%), Positives = 145/294 (49%)

Query:    32 KNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAF--RNGYRRPDG 89
             ++ E     FR   +   ++ S+    N      IN+F+  + +EFKA   R+   R   
Sbjct:    30 RSREPPAAAFRESLNRHRYLNSVFPRENSSAVYGINQFSYLSPEEFKAIYLRSKPSRSPR 89

Query:    90 LTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGK 149
               +   TS +  NV  +P   DWR    VT ++NQ  CG CWAFS V A E    +    
Sbjct:    90 YPAEVRTSIR--NV-SLPLRFDWRDKRVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGKP 146

Query:   150 LISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHND-GITTEANYPYQAVDGTCNKTNE 208
             L  +S Q+++ C  +  ++GC GG   +A  ++      +  ++ YP++A +G C+  ++
Sbjct:   147 LADISVQQVIDCSYN--NYGCSGGSTLNALNWLNKTQVKLVRDSEYPFKAQNGLCHYFSD 204

Query:   209 ASHVAKIKGYETVP-ANSEEALLKAVAN-QPVAVSIDASGSAFQFYSSGVFTGDCGT-EL 265
             +     I+GY     ++ E+ + K +    P+ V +DA   ++Q Y  G+    C + E 
Sbjct:   205 SYSGFSIRGYSAYDFSDQEDEMAKVLLTFGPLVVVVDAV--SWQDYLGGIIQHHCSSGEA 262

Query:   266 DHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDS 319
             +H V   G+    + T YW+V+NSWG+SWG +GY  +K        +CGIA DS
Sbjct:   263 NHAVLITGFDKIGS-TPYWIVRNSWGSSWGVDGYAHVKMG----GNICGIA-DS 310


>GENEDB_PFALCIPARUM|PF14_0553
            symbol:PF14_0553 "cysteine proteinase
            falcipain-1" species:5833 "Plasmodium falciparum" [GO:0042540
            "hemoglobin catabolic process" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014187 KO:K01376 HSSP:P07688 GO:GO:0042540
            RefSeq:XP_001348727.1 ProteinModelPortal:Q8I6V0 PRIDE:Q8I6V0
            EnsemblProtists:PF14_0553:mRNA GeneID:812135 KEGG:pfa:PF14_0553
            EuPathDB:PlasmoDB:PF3D7_1458000 HOGENOM:HOG000065906
            ProtClustDB:CLSZ2457715 ChEMBL:CHEMBL1250371 Uniprot:Q8I6V0
        Length = 569

 Score = 247 (92.0 bits), Expect = 5.3e-30, Sum P(2) = 5.3e-30
 Identities = 77/256 (30%), Positives = 118/256 (46%)

Query:    24 MSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEF-KAFRN 82
             +  + K+ KN   K K+   F D  E  E L     K Y  ++    +   +++ K F N
Sbjct:   256 IKNHNKLNKNAMYK-KKVNQFSDYSE--EEL-----KEYFKTLLHVPNHMIEKYSKPFEN 307

Query:    83 GYRRPDGLTSRKGTSFKY-ENVI--DVPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAAT 139
              + + + L S   T+ K  E  I   VP  +D+R+ G V   K+QG CGSCWAF++V   
Sbjct:   308 -HLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNI 366

Query:   140 EGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAV 199
             E +       ++S SEQE+V C     + GC+GG    +F +++ N+ +     Y Y+A 
Sbjct:   367 ESVFAKKNKNILSFSEQEVVDCSKD--NFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAK 423

Query:   200 DGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGVFTG 259
             D              +     V  N     L  V   P++V++  +   F  YS GV+ G
Sbjct:   424 DDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVG--PLSVNVGVNND-FVAYSEGVYNG 480

Query:   260 DCGTELDHGVTAVGYG 275
              C  EL+H V  VGYG
Sbjct:   481 TCSEELNHSVLLVGYG 496

 Score = 116 (45.9 bits), Expect = 5.3e-30, Sum P(2) = 5.3e-30
 Identities = 17/40 (42%), Positives = 25/40 (62%)

Query:   283 YWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
             YW++KNSW   WGE G++R+ R+ +     CGI  +  YP
Sbjct:   528 YWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYP 567

 Score = 113 (44.8 bits), Expect = 3.5e-11, Sum P(2) = 3.5e-11
 Identities = 23/65 (35%), Positives = 40/65 (61%)

Query:    17 SEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAG-NKPYKLSINEFADQTNQ 75
             + K  ++M ++ KVYKN +E+ ++F IFK N   I++ N    N  YK  +N+F+D + +
Sbjct:   222 ASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEE 281

Query:    76 EFKAF 80
             E K +
Sbjct:   282 ELKEY 286


>UNIPROTKB|Q8I6V0
            symbol:PF14_0553 "Cysteine proteinase falcipain-1"
            species:36329 "Plasmodium falciparum 3D7" [GO:0042540 "hemoglobin
            catabolic process" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014187 KO:K01376 HSSP:P07688 GO:GO:0042540
            RefSeq:XP_001348727.1 ProteinModelPortal:Q8I6V0 PRIDE:Q8I6V0
            EnsemblProtists:PF14_0553:mRNA GeneID:812135 KEGG:pfa:PF14_0553
            EuPathDB:PlasmoDB:PF3D7_1458000 HOGENOM:HOG000065906
            ProtClustDB:CLSZ2457715 ChEMBL:CHEMBL1250371 Uniprot:Q8I6V0
        Length = 569

 Score = 247 (92.0 bits), Expect = 5.3e-30, Sum P(2) = 5.3e-30
 Identities = 77/256 (30%), Positives = 118/256 (46%)

Query:    24 MSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEF-KAFRN 82
             +  + K+ KN   K K+   F D  E  E L     K Y  ++    +   +++ K F N
Sbjct:   256 IKNHNKLNKNAMYK-KKVNQFSDYSE--EEL-----KEYFKTLLHVPNHMIEKYSKPFEN 307

Query:    83 GYRRPDGLTSRKGTSFKY-ENVI--DVPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAAT 139
              + + + L S   T+ K  E  I   VP  +D+R+ G V   K+QG CGSCWAF++V   
Sbjct:   308 -HLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNI 366

Query:   140 EGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAV 199
             E +       ++S SEQE+V C     + GC+GG    +F +++ N+ +     Y Y+A 
Sbjct:   367 ESVFAKKNKNILSFSEQEVVDCSKD--NFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAK 423

Query:   200 DGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGVFTG 259
             D              +     V  N     L  V   P++V++  +   F  YS GV+ G
Sbjct:   424 DDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVG--PLSVNVGVNND-FVAYSEGVYNG 480

Query:   260 DCGTELDHGVTAVGYG 275
              C  EL+H V  VGYG
Sbjct:   481 TCSEELNHSVLLVGYG 496

 Score = 116 (45.9 bits), Expect = 5.3e-30, Sum P(2) = 5.3e-30
 Identities = 17/40 (42%), Positives = 25/40 (62%)

Query:   283 YWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
             YW++KNSW   WGE G++R+ R+ +     CGI  +  YP
Sbjct:   528 YWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYP 567

 Score = 113 (44.8 bits), Expect = 3.5e-11, Sum P(2) = 3.5e-11
 Identities = 23/65 (35%), Positives = 40/65 (61%)

Query:    17 SEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAG-NKPYKLSINEFADQTNQ 75
             + K  ++M ++ KVYKN +E+ ++F IFK N   I++ N    N  YK  +N+F+D + +
Sbjct:   222 ASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEE 281

Query:    76 EFKAF 80
             E K +
Sbjct:   282 ELKEY 286


>MGI|MGI:2139628
            symbol:Ctso "cathepsin O" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:2139628 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 GeneTree:ENSGT00560000076599 MEROPS:C01.035 CTD:1519
            HOVERGEN:HBG105050 KO:K01374 OMA:SNVCGIA OrthoDB:EOG4V6ZH1
            EMBL:AK034490 EMBL:AK049470 EMBL:AK165930 EMBL:AK166103
            EMBL:BC044664 IPI:IPI00453524 RefSeq:NP_808330.1 UniGene:Mm.254642
            ProteinModelPortal:Q8BM88 SMR:Q8BM88 STRING:Q8BM88
            PhosphoSite:Q8BM88 PRIDE:Q8BM88 Ensembl:ENSMUST00000029649
            GeneID:229445 KEGG:mmu:229445 UCSC:uc008pon.1 InParanoid:Q8BM88
            NextBio:379433 Bgee:Q8BM88 CleanEx:MM_CTSO Genevestigator:Q8BM88
            GermOnline:ENSMUSG00000028015 Uniprot:Q8BM88
        Length = 312

 Score = 330 (121.2 bits), Expect = 7.9e-30, P = 7.9e-30
 Identities = 81/258 (31%), Positives = 127/258 (49%)

Query:    66 INEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQG 125
             +N+F+    +EFKA   G +                NV  +P   DWR    V P++NQ 
Sbjct:    60 VNQFSYLFPEEFKALYLGSKYAWAPRYPAEGQRPIPNV-SLPLRFDWRDKHVVNPVRNQE 118

Query:   126 PCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHN 185
              CG CWAFS V+A E    +    L  LS Q+++ C  +  + GC GG    A +++   
Sbjct:   119 MCGGCWAFSVVSAIESARAIQGKSLDYLSVQQVIDCSFN--NSGCLGGSPLCALRWLNET 176

Query:   186 D-GITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVP-ANSEEALLKAVAN-QPVAVSI 242
                +  ++ YP++AV+G C    ++     +K +        E+ + +A+ +  P+ V +
Sbjct:   177 QLKLVADSQYPFKAVNGQCRHFPQSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIV 236

Query:   243 DASGSAFQFYSSGVFTGDCGT-ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIR 301
             DA   ++Q Y  G+    C + E +H V   G+  T N T YW+V+NSWG+SWG EGY  
Sbjct:   237 DAM--SWQDYLGGIIQHHCSSGEANHAVLITGFDRTGN-TPYWMVRNSWGSSWGVEGYAH 293

Query:   302 MKRDIDAKEGLCGIAMDS 319
             +K        +CGIA DS
Sbjct:   294 VKMG----GNVCGIA-DS 306


>UNIPROTKB|O97578
            symbol:CTSC "Dipeptidyl peptidase 1" species:9615 "Canis
            lupus familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 EMBL:AF060171 RefSeq:NP_001182763.1
            UniGene:Cfa.28653 ProteinModelPortal:O97578 SMR:O97578
            MEROPS:C01.070 PRIDE:O97578 GeneID:403458 KEGG:cfa:403458
            InParanoid:O97578 NextBio:20816976 Uniprot:O97578
        Length = 435

 Score = 327 (120.2 bits), Expect = 1.6e-29, P = 1.6e-29
 Identities = 96/284 (33%), Positives = 142/284 (50%)

Query:    42 RIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYE 101
             R++K N EF++++N            E+   T ++    R G R+         T+  +E
Sbjct:   141 RLYKYNYEFVKAINTIQKSWTATRYIEYETLTLRDMMT-RVGGRKIPRPKPTPLTAEIHE 199

Query:   102 NVIDVPATMDWRK-NGA--VTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLIS--LSEQ 156
              +  +P + DWR   G   V+P++NQ  CGSC+AF++ A  E   ++ T    +  LS Q
Sbjct:   200 EISRLPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQ 259

Query:   157 ELVSCDTSGVDHGCEGG-EMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKI 215
             E+VSC  S    GCEGG     A K+   + G+  EA +PY   D  C K N+       
Sbjct:   260 EIVSC--SQYAQGCEGGFPYLIAGKYA-QDFGLVEEACFPYAGSDSPC-KPNDCFRYYSS 315

Query:   216 KGYET--VPANSEEALLKA--VANQPVAVSIDASGSAFQFYSSGVF--TG--DCGT--EL 265
             + Y          EAL+K   V + P+AV+ +     F  Y  G++  TG  D     EL
Sbjct:   316 EYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFH-YQKGIYYHTGLRDPFNPFEL 374

Query:   266 -DHGVTAVGYGA-TANGTKYWLVKNSWGTSWGEEGYIRMKRDID 307
              +H V  VGYG  +A+G  YW+VKNSWG+ WGE+GY R++R  D
Sbjct:   375 TNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTD 418


>UNIPROTKB|F1STR1
            symbol:CTSC "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0004252
            "serine-type endopeptidase activity" evidence=IEA] [GO:0001913 "T
            cell mediated cytotoxicity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 KO:K01275 InterPro:IPR014882
            Pfam:PF08773 OMA:YDDFLHY GO:GO:0001913 EMBL:CU855751
            RefSeq:XP_003129789.1 UniGene:Ssc.6155 Ensembl:ENSSSCT00000016280
            GeneID:100522387 KEGG:ssc:100522387 Uniprot:F1STR1
        Length = 463

 Score = 324 (119.1 bits), Expect = 3.4e-29, P = 3.4e-29
 Identities = 98/292 (33%), Positives = 143/292 (48%)

Query:    36 EKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGY-RRPDGLTSRK 94
             +K+   R++K N +F++++N         +  E+   T +E      GY +R        
Sbjct:   160 QKKYSNRLYKYNHDFVKAINGIQKSWTATAYMEYETLTLKEMTQRGGGYNQRLPRPKPAP 219

Query:    95 GTSFKYENVIDVPATMDWRK-NGA--VTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLI 151
              T+   E  + +PA+ DWR   G   VTP++NQ  CGSC++F+++   E   ++ T    
Sbjct:   220 ITAEIQEKSLHLPASWDWRNVRGTNFVTPVRNQASCGSCYSFASMGMMEARIRILTNNTQ 279

Query:   152 S--LSEQELVSCDTSGVDHGCEGG-EMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNE 208
             +  LS QE+VSC  S    GC GG     A K+   + G+  EA +PY   D  C    E
Sbjct:   280 TPILSPQEVVSC--SQYAQGCAGGFPYLIAGKYA-QDFGLVEEACFPYTGTDSPCT-VKE 335

Query:   209 ASHVAKIKGYETVPA---NSEEALLKA--VANQPVAVSIDASGSAFQFYSSGVF--TG-- 259
                      Y  V        EAL+K   V + P+AV+ +     F  Y  G++  TG  
Sbjct:   336 GCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD-FLHYRKGIYHHTGLR 394

Query:   260 DCGT--EL-DHGVTAVGYGAT-ANGTKYWLVKNSWGTSWGEEGYIRMKRDID 307
             D     EL +H V  VGYG   A+G  YW+VKNSWGTSWGE+GY R++R  D
Sbjct:   395 DPFNPFELTNHAVLLVGYGTDLASGMDYWIVKNSWGTSWGEDGYFRIRRGTD 446


>UNIPROTKB|J9P219
            symbol:J9P219 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 InterPro:IPR014882
            Pfam:PF08773 OMA:YDDFLHY EMBL:AAEX03012741
            Ensembl:ENSCAFT00000050015 Uniprot:J9P219
        Length = 406

 Score = 322 (118.4 bits), Expect = 5.6e-29, P = 5.6e-29
 Identities = 95/285 (33%), Positives = 141/285 (49%)

Query:    42 RIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYE 101
             R++K N EF++++N            E+   T ++      G + P        T+  +E
Sbjct:   110 RLYKYNYEFVKAINTIQKSWTATRYIEYETLTLRDMMTRGGGRKIPRKPKPTPLTAEIHE 169

Query:   102 NVIDVPATMDWRK-NGA--VTPIKNQGP-CGSCWAFSAVAATEGITQLTTGKLIS--LSE 155
              +  +P + DWR   G   V+P++NQ   CGSC+AF++ A  E   ++ T    +  LS 
Sbjct:   170 EISRLPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNNTQTPILSP 229

Query:   156 QELVSCDTSGVDHGCEGG-EMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAK 214
             QE+VSC  S    GCEGG     A K+   + G+  EA +PY   D  C K N+      
Sbjct:   230 QEIVSC--SQYAQGCEGGFPYLIAGKYA-QDFGLVEEACFPYAGSDSPC-KPNDCFRYYS 285

Query:   215 IKGYET--VPANSEEALLKA--VANQPVAVSIDASGSAFQFYSSGVF--TG--DCGT--E 264
              + Y          EAL+K   V + P+AV+ +     F  Y  G++  TG  D     E
Sbjct:   286 SEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFH-YQKGIYYHTGLRDPFNPFE 344

Query:   265 L-DHGVTAVGYGA-TANGTKYWLVKNSWGTSWGEEGYIRMKRDID 307
             L +H V  VGYG  +A+G  YW+VKNSWG+ WGE+GY R++R  D
Sbjct:   345 LTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTD 389


>UNIPROTKB|F1N455
            symbol:CTSC "Dipeptidyl peptidase 1 exclusion domain chain"
            species:9913 "Bos taurus" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 IPI:IPI00697314 UniGene:Bt.49573
            InterPro:IPR014882 Pfam:PF08773 OMA:YDDFLHY GO:GO:0001913
            EMBL:DAAA02062487 EMBL:DAAA02062488 Ensembl:ENSBTAT00000014735
            Uniprot:F1N455
        Length = 463

 Score = 321 (118.1 bits), Expect = 7.1e-29, P = 7.1e-29
 Identities = 97/286 (33%), Positives = 143/286 (50%)

Query:    42 RIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGY-RRPDGLTSRKGTSFKY 100
             R+++ N +F++++NA           E+   T +E      G+ RR         T+   
Sbjct:   166 RLYRYNHDFVKAINAIQKSWTAAPYMEYETLTLKEMIRRGGGHSRRIPRPKPAPITAEIQ 225

Query:   101 ENVIDVPATMDWRK-NGA--VTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLIS--LSE 155
             + ++ +P + DWR  +G   VTP++NQG CGSC++F+++   E   ++ T    +  LS 
Sbjct:   226 KKILHLPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSP 285

Query:   156 QELVSCDTSGVDHGCEGG-EMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAK 214
             QE+VSC  S    GCEGG     A K+   + G+  E  +PY   D  C +  E      
Sbjct:   286 QEVVSC--SQYAQGCEGGFPYLIAGKYA-QDFGLVEEDCFPYTGTDSPC-RLKEGCFRYY 341

Query:   215 IKGYETVPA---NSEEALLKA-VANQ-PVAVSIDASGSAFQFYSSGVF--TG--DCGT-- 263
                Y  V        EAL+K  + +Q P+AV+ +     F  Y  GV+  TG  D     
Sbjct:   342 SSEYHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDD-FLHYRKGVYHHTGLRDPFNPF 400

Query:   264 EL-DHGVTAVGYGA-TANGTKYWLVKNSWGTSWGEEGYIRMKRDID 307
             EL +H V  VGYG   A+G  YW+VKNSWGTSWGE GY R++R  D
Sbjct:   401 ELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTD 446


>UNIPROTKB|Q3ZCJ8
            symbol:CTSC "Dipeptidyl peptidase 1" species:9913 "Bos
            taurus" [GO:0031638 "zymogen activation" evidence=IDA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 EMBL:BC102115 IPI:IPI00697314 RefSeq:NP_001028789.1
            UniGene:Bt.49573 ProteinModelPortal:Q3ZCJ8 SMR:Q3ZCJ8 STRING:Q3ZCJ8
            PRIDE:Q3ZCJ8 GeneID:352958 KEGG:bta:352958 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 InParanoid:Q3ZCJ8 KO:K01275
            OrthoDB:EOG4H19VZ BindingDB:Q3ZCJ8 ChEMBL:CHEMBL1075050
            NextBio:20812686 GO:GO:0031638 InterPro:IPR014882 Pfam:PF08773
            Uniprot:Q3ZCJ8
        Length = 463

 Score = 321 (118.1 bits), Expect = 7.1e-29, P = 7.1e-29
 Identities = 97/286 (33%), Positives = 143/286 (50%)

Query:    42 RIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGY-RRPDGLTSRKGTSFKY 100
             R+++ N +F++++NA           E+   T +E      G+ RR         T+   
Sbjct:   166 RLYRYNHDFVKAINAIQKSWTAAPYMEYETLTLKEMIRRGGGHSRRIPRPKPAPITAEIQ 225

Query:   101 ENVIDVPATMDWRK-NGA--VTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLIS--LSE 155
             + ++ +P + DWR  +G   VTP++NQG CGSC++F+++   E   ++ T    +  LS 
Sbjct:   226 KKILHLPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSP 285

Query:   156 QELVSCDTSGVDHGCEGG-EMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAK 214
             QE+VSC  S    GCEGG     A K+   + G+  E  +PY   D  C +  E      
Sbjct:   286 QEVVSC--SQYAQGCEGGFPYLIAGKYA-QDFGLVEEDCFPYTGTDSPC-RLKEGCFRYY 341

Query:   215 IKGYETVPA---NSEEALLKA-VANQ-PVAVSIDASGSAFQFYSSGVF--TG--DCGT-- 263
                Y  V        EAL+K  + +Q P+AV+ +     F  Y  GV+  TG  D     
Sbjct:   342 SSEYHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDD-FLHYRKGVYHHTGLRDPFNPF 400

Query:   264 EL-DHGVTAVGYGA-TANGTKYWLVKNSWGTSWGEEGYIRMKRDID 307
             EL +H V  VGYG   A+G  YW+VKNSWGTSWGE GY R++R  D
Sbjct:   401 ELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTD 446


>UNIPROTKB|F1PSK8
            symbol:F1PSK8 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 InterPro:IPR014882
            Pfam:PF08773 EMBL:AAEX03012741 Ensembl:ENSCAFT00000007054
            Uniprot:F1PSK8
        Length = 405

 Score = 321 (118.1 bits), Expect = 7.1e-29, P = 7.1e-29
 Identities = 96/285 (33%), Positives = 142/285 (49%)

Query:    42 RIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYE 101
             R++K N EF++++N            E+   T ++    R G R+         T+  +E
Sbjct:   110 RLYKYNYEFVKAINTIQKSWTATRYIEYETLTLRDMMT-RGGGRKIPRPKPTPLTAEIHE 168

Query:   102 NVIDVPATMDWRK-NGA--VTPIKNQGP-CGSCWAFSAVAATEGITQLTTGKLIS--LSE 155
              +  +P + DWR   G   V+P++NQ   CGSC+AF++ A  E   ++ T    +  LS 
Sbjct:   169 EISRLPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNNTQTPILSP 228

Query:   156 QELVSCDTSGVDHGCEGG-EMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAK 214
             QE+VSC  S    GCEGG     A K+   + G+  EA +PY   D  C K N+      
Sbjct:   229 QEIVSC--SQYAQGCEGGFPYLIAGKYA-QDFGLVEEACFPYAGSDSPC-KPNDCFRYYS 284

Query:   215 IKGYET--VPANSEEALLKA--VANQPVAVSIDASGSAFQFYSSGVF--TG--DCGT--E 264
              + Y          EAL+K   V + P+AV+ +     F  Y  G++  TG  D     E
Sbjct:   285 SEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFH-YQKGIYYHTGLRDPFNPFE 343

Query:   265 L-DHGVTAVGYGA-TANGTKYWLVKNSWGTSWGEEGYIRMKRDID 307
             L +H V  VGYG  +A+G  YW+VKNSWG+ WGE+GY R++R  D
Sbjct:   344 LTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTD 388


>UNIPROTKB|F1NWG2
            symbol:CTSC "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 InterPro:IPR014882 Pfam:PF08773
            OMA:YDDFLHY GO:GO:0001913 EMBL:AADN02004805 IPI:IPI00577371
            Ensembl:ENSGALT00000027869 Uniprot:F1NWG2
        Length = 463

 Score = 320 (117.7 bits), Expect = 9.1e-29, P = 9.1e-29
 Identities = 96/287 (33%), Positives = 142/287 (49%)

Query:    42 RIFKDNVEFIESLNAAGNKPYKLS-INEFADQTNQEFKAFRNG-YRRPDGLTSRKGTSFK 99
             R F  N +F+ ++NA   K ++ +   E+ + + +E      G Y R         T   
Sbjct:   166 RRFVHNFDFVNAINAH-QKSWRATRYEEYENFSLEELTRRAGGLYSRTSRPKPAPLTPEL 224

Query:   100 YENVIDVPATMDWRK-NGA--VTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLIS--LS 154
              + V  +P + DWR  NG   V+P++NQ  CGSC+AF+++   E   ++ T        S
Sbjct:   225 LKKVSGLPESWDWRNVNGVNYVSPVRNQASCGSCYAFASMGMLEARIRILTNNTQKPVFS 284

Query:   155 EQELVSCDTSGVDHGCEGG-EMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVA 213
              Q++VSC  S    GC+GG     A K++  + G+  E  +PY A D  C       H  
Sbjct:   285 PQQVVSC--SQYSQGCDGGFPYLIAGKYV-QDFGVVEEDCFPYTAKDTPCLFKRSCYHYY 341

Query:   214 KIKGYETVPA---NSEEALLKA--VANQPVAVSIDASGSAFQFYSSGVF--TG--DCGT- 263
               + Y  V        EAL+K   V + P+AV+ +     F FY  G++  TG  D    
Sbjct:   342 TSE-YHYVGGFYGACNEALMKLELVLSGPMAVAFEVYND-FMFYKEGIYHHTGLKDEFNP 399

Query:   264 -EL-DHGVTAVGYGATA-NGTKYWLVKNSWGTSWGEEGYIRMKRDID 307
              EL +H V  VGYG    +G K+W+VKNSWGTSWGE+GY R++R  D
Sbjct:   400 FELTNHAVLLVGYGKDPESGEKFWIVKNSWGTSWGEDGYFRIRRGTD 446


>UNIPROTKB|P53634
            symbol:CTSC "Dipeptidyl peptidase 1" species:9606 "Homo
            sapiens" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0005794
            "Golgi apparatus" evidence=IEA] [GO:0007568 "aging" evidence=IEA]
            [GO:0010033 "response to organic substance" evidence=IEA]
            [GO:0031404 "chloride ion binding" evidence=IEA] [GO:0042802
            "identical protein binding" evidence=IEA] [GO:0043621 "protein
            self-association" evidence=IEA] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0006508 "proteolysis" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0006955
            "immune response" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005783 GO:GO:0005794 Reactome:REACT_6900
            GO:GO:0006955 GO:GO:0007568 GO:GO:0010033 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004252 GO:GO:0005764 GO:GO:0031404 CTD:1075
            HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ InterPro:IPR014882
            Pfam:PF08773 MEROPS:C01.070 EMBL:X87212 EMBL:U79415 EMBL:AF234263
            EMBL:AF234264 EMBL:AF254757 EMBL:AF525032 EMBL:AF525033
            EMBL:AK292117 EMBL:AK311923 EMBL:AK223038 EMBL:BX537913
            EMBL:AC011088 EMBL:CH471185 EMBL:BC054028 EMBL:BC100891
            EMBL:BC100892 EMBL:BC100893 EMBL:BC100894 EMBL:BC109386
            EMBL:BC110071 EMBL:BC113850 EMBL:BC113897 IPI:IPI00022810
            IPI:IPI00171323 IPI:IPI00872258 PIR:S23941 PIR:S66504
            RefSeq:NP_001107645.1 RefSeq:NP_001805.3 RefSeq:NP_680475.1
            UniGene:Hs.128065 PDB:1K3B PDB:2DJF PDB:2DJG PDB:3PDF PDBsum:1K3B
            PDBsum:2DJF PDBsum:2DJG PDBsum:3PDF ProteinModelPortal:P53634
            SMR:P53634 IntAct:P53634 MINT:MINT-4655964 STRING:P53634
            PhosphoSite:P53634 DMDM:1705632 PaxDb:P53634 PRIDE:P53634
            DNASU:1075 Ensembl:ENST00000227266 Ensembl:ENST00000524463
            Ensembl:ENST00000529974 GeneID:1075 KEGG:hsa:1075 UCSC:uc001pck.4
            UCSC:uc001pcm.4 GeneCards:GC11M088026 HGNC:HGNC:2528 HPA:CAB025364
            MIM:170650 MIM:245000 MIM:245010 MIM:602365 neXtProt:NX_P53634
            Orphanet:2342 Orphanet:678 PharmGKB:PA27028 HOGENOM:HOG000127503
            InParanoid:P53634 OMA:YDDFLHY PhylomeDB:P53634
            BioCyc:MetaCyc:HS03265-MONOMER SABIO-RK:P53634 BindingDB:P53634
            ChEMBL:CHEMBL2252 EvolutionaryTrace:P53634 GenomeRNAi:1075
            NextBio:4488 PMAP-CutDB:P53634 ArrayExpress:P53634 Bgee:P53634
            Genevestigator:P53634 GermOnline:ENSG00000109861 GO:GO:0001913
            Uniprot:P53634
        Length = 463

 Score = 317 (116.6 bits), Expect = 1.9e-28, P = 1.9e-28
 Identities = 95/296 (32%), Positives = 145/296 (48%)

Query:    32 KNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGYRRPDGLT 91
             KN +EK    R++K +  F++++NA        +  E+   T  +      G+ R     
Sbjct:   157 KNSQEKYSN-RLYKYDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRRSGGHSRKIPRP 215

Query:    92 SRKG-TSFKYENVIDVPATMDWRK-NGA--VTPIKNQGPCGSCWAFSAVAATEGITQLTT 147
                  T+   + ++ +P + DWR  +G   V+P++NQ  CGSC++F+++   E   ++ T
Sbjct:   216 KPAPLTAEIQQKILHLPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILT 275

Query:   148 GKLIS--LSEQELVSCDTSGVDHGCEGG-EMEDAFKFIIHNDGITTEANYPYQAVDGTCN 204
                 +  LS QE+VSC  S    GCEGG     A K+   + G+  EA +PY   D  C 
Sbjct:   276 NNSQTPILSPQEVVSC--SQYAQGCEGGFPYLIAGKYA-QDFGLVEEACFPYTGTDSPCK 332

Query:   205 KTNEASHVAKIKGYETVPA---NSEEALLKA--VANQPVAVSIDASGSAFQFYSSGVF-- 257
                +       + Y  V        EAL+K   V + P+AV+ +     F  Y  G++  
Sbjct:   333 MKEDCFRYYSSE-YHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD-FLHYKKGIYHH 390

Query:   258 TG--DCGT--EL-DHGVTAVGYGA-TANGTKYWLVKNSWGTSWGEEGYIRMKRDID 307
             TG  D     EL +H V  VGYG  +A+G  YW+VKNSWGT WGE GY R++R  D
Sbjct:   391 TGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTD 446


>WB|WBGene00022189
            symbol:Y71H2AR.2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0004181
            "metallocarboxypeptidase activity" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] [GO:0008340 "determination of adult lifespan"
            evidence=IMP] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0008340 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 HSSP:P07711
            eggNOG:NOG331187 HOGENOM:HOG000114005 EMBL:FO081570
            RefSeq:NP_497627.1 UniGene:Cel.28419 ProteinModelPortal:Q9BL26
            SMR:Q9BL26 EnsemblMetazoa:Y71H2AR.2 GeneID:190615
            KEGG:cel:CELE_Y71H2AR.2 UCSC:Y71H2AR.2 CTD:190615
            WormBase:Y71H2AR.2 InParanoid:Q9BL26 OMA:CAMATTI NextBio:946382
            Uniprot:Q9BL26
        Length = 345

 Score = 317 (116.6 bits), Expect = 1.9e-28, P = 1.9e-28
 Identities = 82/224 (36%), Positives = 121/224 (54%)

Query:    96 TSFKYENVIDVPAT----MDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGI-TQLTTGKL 150
             T F++E  I +  T    +DWR+ G V P+K+QG C +  AF+  ++ E +  + T G L
Sbjct:    68 TRFQWETPIHMDRTTEEFLDWREKGIVGPVKDQGKCNASHAFAITSSIESMYAKATNGTL 127

Query:   151 ISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEAS 210
             +S SEQ+L+ C+  G   GCE     +A  ++  + GI TEA+YPY  VD T  K    S
Sbjct:   128 LSFSEQQLIDCNDQGYK-GCEEQFAMNAIGYLATH-GIETEADYPY--VDKTNEKCTFDS 183

Query:   211 HVAKIKGYETVPANSEEALLKA-VANQ-PVAVSIDASGSAFQFYSSGVFTG---DC-GTE 264
               +KI   + V A   E L K  V N  P   ++ A  S +  Y  G++     +C  T 
Sbjct:   184 TKSKIHLKKGVVAEGNEVLGKVYVTNYGPAFFTMRAPPSLYD-YKIGIYNPSIEECTSTH 242

Query:   265 LDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDA 308
                 +  VGYG      KYW+VK S+GTSWGE+GY+++ RD++A
Sbjct:   243 EIRSMVIVGYGIEGE-QKYWIVKGSFGTSWGEQGYMKLARDVNA 285


>WB|WBGene00008231
            symbol:tag-329 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            eggNOG:NOG288820 EMBL:Z70750 HSSP:P53634 HOGENOM:HOG000019851
            PIR:T20110 RefSeq:NP_505458.1 ProteinModelPortal:Q18740 SMR:Q18740
            MEROPS:C01.A36 EnsemblMetazoa:C50F4.3 GeneID:183677
            KEGG:cel:CELE_C50F4.3 UCSC:C50F4.3 CTD:183677 WormBase:C50F4.3
            InParanoid:Q18740 OMA:WIFRNSW NextBio:921986 Uniprot:Q18740
        Length = 374

 Score = 314 (115.6 bits), Expect = 3.9e-28, P = 3.9e-28
 Identities = 95/320 (29%), Positives = 144/320 (45%)

Query:    16 LSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKP---YKLSINEFADQ 72
             L ++ E ++ KY + YK+  EK+ RF+ F      +  +N A  K     K  IN+F+D 
Sbjct:    43 LYKEFEDFIVKYKRNYKDEIEKKFRFQQFVATHNRVGKMNKAAKKAGHDTKYGINKFSDL 102

Query:    73 TNQEFKAFRNGYRRPDGLTSRKGTSFKYENVI----DVPATMDWR--KNGA---VTPIKN 123
             + +E     + +  P   T+    + K   V      +P T D R  K G    + PIK 
Sbjct:   103 SKKEIHGMYSKFGPPKNNTNVPKFNLKNLRVKRQMEGLPKTFDLRNKKVGGHYIIGPIKT 162

Query:   124 QGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFII 183
             Q  C  CW F+A A  E    +   K ++LSEQE+  C       GC GG+  D  ++I 
Sbjct:   163 QDSCACCWGFAATAVAEAALTVHLKKAMNLSEQEVCDCAPKH-GPGCNGGDPVDGLEYI- 220

Query:   184 HNDGITTEANYPYQAVDGT----CN--KTNEASHVAKIKGYETVPANSEEALLKAV--AN 235
                G+T    YP+     T    C   K +   +  ++  Y   P N+E  +   +   N
Sbjct:   221 KEMGLTGGKEYPFNVNRSTQLGRCESEKYDRELNPLELDYYAIDPFNAEYQMTHHLYLLN 280

Query:   236 QPVAVSIDASGSAFQFYSSGVFT-GDCGTELD---HGVTAVGYGATANGT----KYWLVK 287
              P++V+   +G++   Y SG+    DC  E     H    VGYG T N       YW+ +
Sbjct:   281 LPISVAF-RTGASLSSYLSGILELADCDDEKGGHWHSGAIVGYGTTKNSAGRTVDYWIFR 339

Query:   288 NSWGTSWGEEGYIRMKRDID 307
             NSW T WG++GY R+ R  D
Sbjct:   340 NSWWTDWGDDGYARIVRGED 359


>WB|WBGene00044760
            symbol:Y71H2AM.25 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0004181
            "metallocarboxypeptidase activity" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            GeneTree:ENSGT00560000076599 EMBL:FO081822 eggNOG:NOG331187
            HOGENOM:HOG000114005 RefSeq:NP_001040887.1
            ProteinModelPortal:Q2AAB9 SMR:Q2AAB9 EnsemblMetazoa:Y71H2AM.25
            GeneID:4363054 KEGG:cel:CELE_Y71H2AM.25 UCSC:Y71H2AM.25 CTD:4363054
            WormBase:Y71H2AM.25 InParanoid:Q2AAB9 NextBio:959635 Uniprot:Q2AAB9
        Length = 299

 Score = 313 (115.2 bits), Expect = 5.0e-28, P = 5.0e-28
 Identities = 81/225 (36%), Positives = 121/225 (53%)

Query:   110 MDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGI-TQLTTGKLISLSEQELVSCDTSGVDH 168
             +DWR  G V P+K+QG C +  AF+  ++ E +  + T G L+S SEQ+L+ CD    DH
Sbjct:    86 LDWRDKGIVGPVKDQGKCNASHAFAISSSIESMYAKATNGSLLSFSEQQLIDCD----DH 141

Query:   169 GCEGGEMEDAFK----FIIHNDGITTEANYPYQAVD-GTCNKTNEASHVAKIKGYETVPA 223
             G +G E + A      FI H  GI TEA+YPY   + G C   +  S + ++K  E V +
Sbjct:   142 GFKGCEEQPAINAVSYFIFH--GIETEADYPYAGKENGKCTFDSTKSKI-QLKDAEFVVS 198

Query:   224 NSEEALLKAVANQ-PVAVSIDASGSAFQFYSSGVFTG---DC-GTELDHGVTAVGYGATA 278
             N  +   + V N  P   ++ A  S +  Y  G++     +C  T     +  VGYG   
Sbjct:   199 NETQGK-ELVTNYGPAFFTMRAPPSLYD-YKIGIYNPSIEECTSTHEIRSMVIVGYGIEG 256

Query:   279 NGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYPT 323
                KYW+VK S+GTSWGE+GY+++ RD++A    C +A   + PT
Sbjct:   257 V-QKYWIVKGSFGTSWGEQGYMKLARDVNA----CAMADFITVPT 296


>UNIPROTKB|H0YD65
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 EMBL:AP002748
            HGNC:HGNC:2531 ChiTaRS:CTSF Ensembl:ENST00000524994 Uniprot:H0YD65
        Length = 283

 Score = 312 (114.9 bits), Expect = 6.4e-28, P = 6.4e-28
 Identities = 85/237 (35%), Positives = 126/237 (53%)

Query:    27 YGKVYKNPEEKEKRFR--IFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKA-FRNG 83
             Y + Y   E KE R+R  +F +N+   + + A      +  + +F+D T +EF+  + N 
Sbjct:    43 YNRTY---ESKEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNT 99

Query:    84 YRRPDGLTSRKGTSFKY-ENVIDV-PATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEG 141
               R +      G   K  ++V D+ P   DWR  GAVT +K+QG CGSCWAFS     EG
Sbjct:   100 LLRKE-----PGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEG 154

Query:   142 ITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDG 201
                L  G L+SLSEQEL+ CD   +D  C GG   +A+  I +  G+ TE +Y YQ    
Sbjct:   155 QWFLNQGTLLSLSEQELLDCDK--MDKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQ 212

Query:   202 TCNKTNEASHVAKIKGYETVPANSEEALLKA-VANQ-PVAVSIDASGSAFQFYSSGV 256
             +CN + E    AK+   ++V  +  E  L A +A + P++V+I+A G   QFY  G+
Sbjct:   213 SCNFSAEK---AKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFG--MQFYRHGI 264


>UNIPROTKB|J9NSE7
            symbol:CTSC "Dipeptidyl peptidase 1" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 InterPro:IPR014882 Pfam:PF08773
            EMBL:AAEX03017125 Ensembl:ENSCAFT00000014269 OMA:INGQICH
            Uniprot:J9NSE7
        Length = 458

 Score = 311 (114.5 bits), Expect = 8.8e-28, P = 8.8e-28
 Identities = 94/285 (32%), Positives = 141/285 (49%)

Query:    42 RIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYE 101
             R++K N EF++++N            E+   T ++      G + P    +   T+  +E
Sbjct:   164 RLYKYNYEFVKAINTIQKSWTATRYIEYETLTLRDMMRRAGGRKIPRPKPTPL-TAEIHE 222

Query:   102 NVIDVPATMDWRK-NGA--VTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLIS--LSEQ 156
              +  +P + DWR   G   V+P++NQ  CGSC+AF++    E   ++ T    +  LS Q
Sbjct:   223 EISRLPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTVMLEARIRILTNNTQTPILSPQ 282

Query:   157 ELVSCDTSGVDHGCEGG-EMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKI 215
             E+VSC  S    GCEGG     A K+   + G+  EA + Y   D  C K N+  H    
Sbjct:   283 EIVSC--SQYAQGCEGGFPYLIAGKYA-QDFGLVDEACFSYAGSDSPC-KPNDCFHYYSS 338

Query:   216 KGYETVPA---NSEEALLKA--VANQPVAVSIDASGSAFQFYSSGVF--TG--DCGT--E 264
             + Y  V        EAL+K   V + P+AV+ +     F  Y  G++  TG  D     E
Sbjct:   339 E-YHYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFH-YQKGIYYHTGLRDPINPFE 396

Query:   265 L-DHGVTAVGYGA-TANGTKYWLVKNSWGTSWGEEGYIRMKRDID 307
             L +H V  VGYG  +A+G  YW+VKNSWG+ WGE+GY ++ R  D
Sbjct:   397 LTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFQICRGTD 441


>UNIPROTKB|F1P0K2
            symbol:CTSO "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            OMA:SNVCGIA EMBL:AADN02016534 IPI:IPI00651180
            Ensembl:ENSGALT00000015270 Uniprot:F1P0K2
        Length = 320

 Score = 307 (113.1 bits), Expect = 2.2e-27, P = 2.2e-27
 Identities = 82/289 (28%), Positives = 139/289 (48%)

Query:    37 KEKRFRIFKDNVEFIESLNAAGNKPYKLSI--NEFADQTNQEFKAF--RN-GYRRPDGLT 91
             +E+     +++ + I  LN+  N         N+F+    +EFKA   R+  Y+ P  + 
Sbjct:    39 REEEAAALRESAKRIRLLNSPSNDNGSAFYGKNQFSHLFPEEFKAIYLRSIPYKLPRYIK 98

Query:    92 SRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLI 151
               KG   K      +P   DWR    +  ++NQ  CG CWAFS V   E    +    L 
Sbjct:    99 VPKGEE-K-----PLPKKFDWRDKKVIAEVRNQQTCGGCWAFSVVGGIESAYAIKGHNLE 152

Query:   152 SLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHND-GITTEANYPYQAVDGTCNKTNEAS 210
              LS Q+++ C  S  ++GC GG    A  ++      +  ++ Y ++A  G C+    + 
Sbjct:   153 ELSVQQVIDCSYS--NYGCSGGSTITALSWLNQTKVKLVRDSEYTFKAQTGLCHYFPHSD 210

Query:   211 HVAKIKGYETVP-ANSEEALLKAVANQ-PVAVSIDASGSAFQFYSSGVFTGDCGT-ELDH 267
                 I G+     +  EE +++ + +  P+AV++DA   ++Q Y  G+    C + + +H
Sbjct:   211 FGVSITGFAAYDFSGQEEEMMRVLVDWGPLAVTVDAV--SWQDYLGGIIQYHCSSGKANH 268

Query:   268 GVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIA 316
              V   G+  T     YW+V+NSWG +WG +GY+R+K  I +   +CGIA
Sbjct:   269 AVLITGFDTTGI-IPYWIVQNSWGRTWGIDGYVRVK--IGSN--VCGIA 312


>UNIPROTKB|Q5QP40
            symbol:CTSK "Cathepsin K" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015644 Pfam:PF00112
            InterPro:IPR000169 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 EMBL:AL355860 HOVERGEN:HBG011513
            PANTHER:PTHR12411:SF55 EMBL:AL356292 UniGene:Hs.632466
            HGNC:HGNC:2536 IPI:IPI00514633 SMR:Q5QP40 STRING:Q5QP40
            Ensembl:ENST00000443913 Uniprot:Q5QP40
        Length = 258

 Score = 301 (111.0 bits), Expect = 9.4e-27, P = 9.4e-27
 Identities = 71/190 (37%), Positives = 106/190 (55%)

Query:     6 VTSRKLQEASLSEKH-EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIE--SLNAA-GNKP 61
             V S  L    + + H E W   + K Y N  ++  R  I++ N+++I   +L A+ G   
Sbjct:    70 VVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHT 129

Query:    62 YKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPI 121
             Y+L++N   D T++E      G + P   +    T +  E     P ++D+RK G VTP+
Sbjct:   130 YELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTPV 189

Query:   122 KNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKF 181
             KNQG CGSCWAFS+V A EG  +  TGKL++LS Q LV C +   + GC GG M +AF++
Sbjct:   190 KNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDGCGGGYMTNAFQY 247

Query:   182 IIHNDGITTE 191
             +  N GI +E
Sbjct:   248 VQKNRGIDSE 257


>WB|WBGene00019314
            symbol:K02E7.10 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 SMART:SM00645 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 HSSP:P07711
            EMBL:FO080411 PIR:T32392 RefSeq:NP_493904.1 UniGene:Cel.14828
            ProteinModelPortal:O17255 SMR:O17255 EnsemblMetazoa:K02E7.10
            GeneID:186889 KEGG:cel:CELE_K02E7.10 UCSC:K02E7.10 CTD:186889
            WormBase:K02E7.10 eggNOG:NOG331187 HOGENOM:HOG000114005
            InParanoid:O17255 OMA:GNANEAR NextBio:933344 Uniprot:O17255
        Length = 299

 Score = 294 (108.6 bits), Expect = 5.2e-26, P = 5.2e-26
 Identities = 72/220 (32%), Positives = 116/220 (52%)

Query:   110 MDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGI-TQLTTGKLISLSEQELVSCDTSGVDH 168
             +DWR+ G V P+K+QG C + +AF+A+AA E +  +   GKL+S SEQ+++ C  +   +
Sbjct:    84 LDWREKGIVGPVKDQGKCNASYAFAAIAAIESMYAKANNGKLLSFSEQQIIDC--ANFTN 141

Query:   169 GCEGGEMEDAF--KFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSE 226
              C+   +E+    +F+  N G+ TEA+YPY   +       ++S +     Y  V  N E
Sbjct:   142 PCQEN-LENVLSNRFLKEN-GVGTEADYPYVGKENVGKCEYDSSKMKLRPTYIDVYPNEE 199

Query:   227 EALLKAVANQPVAVSIDASGSAFQFYSSGVFTG---DCGTELD-HGVTAVGYGATANGTK 282
              A    +           S  +F  Y +G++     +CG   +   +  VGYG      K
Sbjct:   200 WARAH-ITTFGTGYFRMRSPPSFFHYKTGIYNPTKEECGNANEARSLAIVGYGKDG-AEK 257

Query:   283 YWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYP 322
             YW+VK S+GTSWGE GY+++ R+++A    CG+A   S P
Sbjct:   258 YWIVKGSFGTSWGEHGYMKLARNVNA----CGMAESISIP 293


>MGI|MGI:109553
            symbol:Ctsc "cathepsin C" species:10090 "Mus musculus"
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IGI]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IMP]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0005783 "endoplasmic
            reticulum" evidence=ISO] [GO:0005794 "Golgi apparatus"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=ISO;IMP]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0010033
            "response to organic substance" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0031404 "chloride ion
            binding" evidence=ISO] [GO:0042802 "identical protein binding"
            evidence=ISO] [GO:0043621 "protein self-association" evidence=ISO]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 MGI:MGI:109553 GO:GO:0005783
            GO:GO:0005794 GO:GO:0007568 GO:GO:0010033 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004252 GO:GO:0005764 GO:GO:0031404 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 MEROPS:C01.070 OMA:YDDFLHY
            GO:GO:0001913 EMBL:U89269 EMBL:U74683 EMBL:BC067063 IPI:IPI00130015
            RefSeq:NP_034112.3 UniGene:Mm.322945 ProteinModelPortal:P97821
            SMR:P97821 STRING:P97821 PhosphoSite:P97821 PaxDb:P97821
            PRIDE:P97821 Ensembl:ENSMUST00000032779 GeneID:13032 KEGG:mmu:13032
            InParanoid:P97821 BindingDB:P97821 ChEMBL:CHEMBL3454 ChiTaRS:CTSC
            NextBio:282904 Bgee:P97821 CleanEx:MM_CTSC Genevestigator:P97821
            Uniprot:P97821
        Length = 462

 Score = 296 (109.3 bits), Expect = 5.7e-26, P = 5.7e-26
 Identities = 85/285 (29%), Positives = 138/285 (48%)

Query:    42 RIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGY-RRPDGLTSRKGTSFKY 100
             R++  N  F++++N         +  E+   + ++    R+G+ +R         T    
Sbjct:   166 RLYTHNHNFVKAINTVQKSWTATAYKEYEKMSLRDLIR-RSGHSQRIPRPKPAPMTDEIQ 224

Query:   101 ENVIDVPATMDWRK-NGA--VTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLIS--LSE 155
             + ++++P + DWR   G   V+P++NQ  CGSC++F+++   E   ++ T    +  LS 
Sbjct:   225 QQILNLPESWDWRNVQGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNNSQTPILSP 284

Query:   156 QELVSCDTSGVDHGCEGG-EMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAK 214
             QE+VSC  S    GC+GG     A K+   + G+  E+ +PY A D  C           
Sbjct:   285 QEVVSC--SPYAQGCDGGFPYLIAGKYA-QDFGVVEESCFPYTAKDSPCKPRENCLRYYS 341

Query:   215 IKGYET--VPANSEEALLKA--VANQPVAVSIDASGSAFQFYSSGVF--TGDCGT----E 264
                Y          EAL+K   V + P+AV+ +     F  Y SG++  TG        E
Sbjct:   342 SDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDD-FLHYHSGIYHHTGLSDPFNPFE 400

Query:   265 L-DHGVTAVGYGAT-ANGTKYWLVKNSWGTSWGEEGYIRMKRDID 307
             L +H V  VGYG     G +YW++KNSWG++WGE GY R++R  D
Sbjct:   401 LTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRGTD 445


>DICTYBASE|DDB_G0286015
            symbol:gmsA species:44689 "Dictyostelium discoideum"
            [GO:0005615 "extracellular space" evidence=IDA] [GO:0019953 "sexual
            reproduction" evidence=IEP] [GO:0009897 "external side of plasma
            membrane" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA;ISS] [GO:0000747 "conjugation with cellular
            fusion" evidence=IMP] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005576 "extracellular
            region" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0286015 Pfam:PF00188 GO:GO:0005615
            GenomeReviews:CM000153_GR GO:GO:0009897 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AAFI02000085 GO:GO:0000747
            Gene3D:3.40.33.10 InterPro:IPR001283 InterPro:IPR014044
            SMART:SM00198 SUPFAM:SSF55797 HSSP:P07688 RefSeq:XP_637893.1
            ProteinModelPortal:Q54ME1 MEROPS:C01.A52 EnsemblProtists:DDB0191145
            GeneID:8625403 KEGG:ddi:DDB_G0286015 InParanoid:Q54ME1 OMA:PGIAYEK
            ProtClustDB:CLSZ2429919 Uniprot:Q54ME1
        Length = 448

 Score = 294 (108.6 bits), Expect = 7.6e-26, P = 7.6e-26
 Identities = 83/221 (37%), Positives = 111/221 (50%)

Query:   109 TMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEG---ITQLTTGK-LISLSEQELVSCDTS 164
             T+DW      TPI++QG CGSCWAF++ AA E    I   T  K  + LS Q  V+C  S
Sbjct:   243 TVDW--TSYQTPIRDQGQCGSCWAFASSAALESRYLIKYGTAQKSTLQLSNQNAVNCIAS 300

Query:   165 GVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGT-CNKTNEASHVAKIKGYETVPA 223
             G    C GG   + F F     GI  E + PY+AV GT C  T+  +   K   Y     
Sbjct:   301 G----CNGGWSGNYFNFF-KTPGIAYEKDDPYKAVTGTSCITTSSVARF-KYTNYGYTE- 353

Query:   224 NSEEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCG-TELDHGVTAVGYGATANGTK 282
              ++ ALL  +   PV +++    SAFQ Y SG++      T ++H V  VGY    +  K
Sbjct:   354 KTKAALLAELKKGPVTIAVYVD-SAFQNYKSGIYNSATKYTGINHLVLLVGYDQATDAYK 412

Query:   283 YWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSYPT 323
                +KNSWG+ WGE GY+R+    D    L   A +S YPT
Sbjct:   413 ---IKNSWGSWWGESGYMRITASND---NLAIFAYNSYYPT 447


>DICTYBASE|DDB_G0288221
            symbol:DDB_G0288221 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0288221 Pfam:PF00188 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 PANTHER:PTHR12411 EMBL:AAFI02000109 Gene3D:3.40.33.10
            InterPro:IPR001283 InterPro:IPR014044 SMART:SM00198 SUPFAM:SSF55797
            MEROPS:C01.A52 ProtClustDB:CLSZ2429919 RefSeq:XP_636852.1
            ProteinModelPortal:Q54J84 EnsemblProtists:DDB0187839 GeneID:8626520
            KEGG:ddi:DDB_G0288221 InParanoid:Q54J84 Uniprot:Q54J84
        Length = 395

 Score = 291 (107.5 bits), Expect = 1.1e-25, P = 1.1e-25
 Identities = 77/242 (31%), Positives = 129/242 (53%)

Query:    86 RPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQL 145
             +P  +     T+ K  N      ++DW  +   TP+++QG C SCW F ++AA E    +
Sbjct:   170 KPTSINPSASTTPKMPNFSS--GSVDW--SDYQTPVRDQGECKSCWVFGSLAALESRYLI 225

Query:   146 TTG----KLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDG 201
               G      + LS Q  ++C TSG    CE G   + F +   + GI  E +YPY A+ G
Sbjct:   226 KNGVSEKSTLHLSAQNAMNCITSG----CESGWPANVFDYF-ESSGIAFEKDYPYDAI-G 279

Query:   202 TCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGVFTG-D 260
             + N T+ +S+  +  GY++V  N++++L++ + N P+ +++  S +AFQ Y+ G++   +
Sbjct:   280 SDNCTS-SSNKFEYSGYDSVE-NTKDSLIQELKNGPITIAL-YSDTAFQSYAGGIYDSVE 336

Query:   261 CGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSS 320
                +++H V  VGY      T  W +KNS GT WGE GY R+    D K G+  +  +S 
Sbjct:   337 EYKDVNHIVLLVGYDKP---TDSWKIKNSLGTKWGELGYARITASND-KLGI--LLYNSF 390

Query:   321 YP 322
             +P
Sbjct:   391 FP 392


>WB|WBGene00008861
            symbol:F15D4.4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 SMART:SM00848 PROSITE:PS00640
            GeneTree:ENSGT00560000076599 EMBL:Z80344 HSSP:P53634
            eggNOG:NOG310593 PIR:T20981 ProteinModelPortal:Q93512 SMR:Q93512
            MEROPS:C01.A45 EnsemblMetazoa:F15D4.4 KEGG:cel:CELE_F15D4.4
            UCSC:F15D4.4 CTD:184530 WormBase:F15D4.4 InParanoid:Q93512
            OMA:ITMEQNI NextBio:925068 Uniprot:Q93512
        Length = 608

 Score = 298 (110.0 bits), Expect = 1.2e-25, P = 1.2e-25
 Identities = 91/297 (30%), Positives = 138/297 (46%)

Query:    33 NPEEKE--KRFRIF---KDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGYRRP 87
             N   KE  KRF ++   K  V+    +   G   YK+S N+F+   + E           
Sbjct:   145 NSTAKEGLKRFNVYSKVKKEVDEHNIMYELGMSSYKMSTNQFSVALDGEVAPLTLNL--- 201

Query:    88 DGLTSRKGT---SFKYENVIDVPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQ 144
             D LT        +       D   T+DWR    + PI +Q  CG CWAFS ++  E    
Sbjct:   202 DALTPTATVIPATISSRKKRDTEPTVDWRP--FLKPILDQSTCGGCWAFSMISMIESFFA 259

Query:   145 LTTGKLISLSEQELVSCDTS-----GVDH-GCEGGEMEDAFKFIIHNDGITTEANY-PYQ 197
             +      SLS Q+L++CDT      G+ + GC+GG  + A  ++        +A+  P+ 
Sbjct:   260 IQGYNTSSLSVQQLLTCDTKVDSTYGLANVGCKGGYFQIAGSYL--EVSAARDASLIPFD 317

Query:   198 AVDGTCNKTNEASHVAKIKGYET--VPAN--------SEEALLKAVANQPVAVSIDASGS 247
               D +C+ +     V  I  ++   +  N         E+ +   V   P+AV + A+G 
Sbjct:   318 LEDTSCDSSFFPPVVPTILLFDDGYISGNFTAAQLITMEQNIEDKVRKGPIAVGM-AAGP 376

Query:   248 AFQFYSSGVFTGDCGTELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKR 304
                 YS GV+ GDCGT ++H V  VG+  T +   YW+++NSWG SWGE GY R+KR
Sbjct:   377 DIYKYSEGVYDGDCGTIINHAVVIVGF--TDD---YWIIRNSWGASWGEAGYFRVKR 428


>ZFIN|ZDB-GENE-030619-9
            symbol:ctsc "cathepsin C" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030619-9 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 MEROPS:C01.070 HSSP:P43235
            EMBL:BC064286 IPI:IPI00486570 RefSeq:NP_999887.1 UniGene:Dr.32463
            ProteinModelPortal:Q6P2V1 SMR:Q6P2V1 PRIDE:Q6P2V1 GeneID:368704
            KEGG:dre:368704 InParanoid:Q6P2V1 NextBio:20813127
            ArrayExpress:Q6P2V1 Bgee:Q6P2V1 Uniprot:Q6P2V1
        Length = 455

 Score = 292 (107.8 bits), Expect = 1.5e-25, P = 1.5e-25
 Identities = 80/221 (36%), Positives = 113/221 (51%)

Query:   106 VPATMDWRK-NGA--VTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLIS--LSEQELVS 160
             +P   DWR  NG   V+P++NQ  CGSC++F+ +   E   ++ T        S Q++VS
Sbjct:   224 LPQHWDWRNVNGVNFVSPVRNQAQCGSCYSFATMGMLEARVRIQTNNTQQPVFSPQQVVS 283

Query:   161 CDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCN---KTNE--ASHVAKI 215
             C  S    GC+GG      K+I  + GI  E  +PY   D  CN   K  +  AS    +
Sbjct:   284 C--SQYSQGCDGGFPYLIGKYI-QDFGIVEEDCFPYTGSDSPCNLPAKCTKYYASDYHYV 340

Query:   216 KGYETVPANSEEAL-LKAVANQPVAVSIDASGSAFQFYSSGVF--TG--DCGT--EL-DH 267
              G+      SE A+ L+ V N P+ V+++     F  Y  G++  TG  D     EL +H
Sbjct:   341 GGF--YGGCSESAMMLELVKNGPMGVALEVYPD-FMNYKEGIYHHTGLRDANNPFELTNH 397

Query:   268 GVTAVGYGAT-ANGTKYWLVKNSWGTSWGEEGYIRMKRDID 307
              V  VGYG     G KYW+VKNSWG+ WGE G+ R++R  D
Sbjct:   398 AVLLVGYGQCHKTGEKYWIVKNSWGSGWGENGFFRIRRGTD 438


>RGD|2445
            symbol:Ctsc "cathepsin C" species:10116 "Rattus norvegicus"
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA;ISO]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA;ISO]
            [GO:0005764 "lysosome" evidence=IDA;TAS] [GO:0005783 "endoplasmic
            reticulum" evidence=IDA] [GO:0005794 "Golgi apparatus" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IEP;ISO;TAS] [GO:0007568 "aging"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISO] [GO:0010033 "response to organic substance"
            evidence=IDA] [GO:0031404 "chloride ion binding" evidence=IDA]
            [GO:0042802 "identical protein binding" evidence=IDA] [GO:0043621
            "protein self-association" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:2445 GO:GO:0005783 GO:GO:0005794 GO:GO:0007568
            GO:GO:0010033 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0004252 GO:GO:0005764 GO:GO:0043621
            GO:GO:0042802 GO:GO:0031404 GO:GO:0004197
            GeneTree:ENSGT00560000076599 CTD:1075 HOGENOM:HOG000068022
            HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ InterPro:IPR014882
            Pfam:PF08773 MEROPS:C01.070 OMA:YDDFLHY GO:GO:0001913 EMBL:D90404
            IPI:IPI00193765 PIR:A41158 RefSeq:NP_058793.1 UniGene:Rn.203177
            PDB:1JQP PDBsum:1JQP ProteinModelPortal:P80067 SMR:P80067
            STRING:P80067 PhosphoSite:P80067 PRIDE:P80067
            Ensembl:ENSRNOT00000022342 GeneID:25423 KEGG:rno:25423
            InParanoid:P80067 SABIO-RK:P80067 EvolutionaryTrace:P80067
            NextBio:606591 ArrayExpress:P80067 Genevestigator:P80067
            GermOnline:ENSRNOG00000016496 Uniprot:P80067
        Length = 462

 Score = 292 (107.8 bits), Expect = 1.7e-25, P = 1.7e-25
 Identities = 92/299 (30%), Positives = 141/299 (47%)

Query:    35 EEKEKRFRIFKDNVEFIESLN-------AAGNKPY-KLSINEFADQTNQEFKAFRNGYRR 86
             +EK    R++  N  F++++N       A   + Y KLSI +   ++    +  R    +
Sbjct:   160 QEKYSE-RLYSHNHNFVKAINSVQKSWTATTYEEYEKLSIRDLIRRSGHSGRILRP---K 215

Query:    87 PDGLTSRKGTSFKYENVIDVPATMDWRK-NGA--VTPIKNQGPCGSCWAFSAVAATEGIT 143
             P  +T         + ++ +P + DWR   G   V+P++NQ  CGSC++F+++   E   
Sbjct:   216 PAPITDEI-----QQQILSLPESWDWRNVRGINFVSPVRNQESCGSCYSFASLGMLEARI 270

Query:   144 QLTTGKLIS--LSEQELVSCDTSGVDHGCEGG-EMEDAFKFIIHNDGITTEANYPYQAVD 200
             ++ T    +  LS QE+VSC  S    GC+GG     A K+   + G+  E  +PY A D
Sbjct:   271 RILTNNSQTPILSPQEVVSC--SPYAQGCDGGFPYLIAGKYA-QDFGVVEENCFPYTATD 327

Query:   201 GTCNKTNEASHVAKIKGYET--VPANSEEALLKA--VANQPVAVSIDASGSAFQFYSSGV 256
               C            + Y          EAL+K   V + P+AV+ +     F  Y SG+
Sbjct:   328 APCKPKENCLRYYSSEYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDD-FLHYHSGI 386

Query:   257 F--TGDCGT----EL-DHGVTAVGYGAT-ANGTKYWLVKNSWGTSWGEEGYIRMKRDID 307
             +  TG        EL +H V  VGYG     G  YW+VKNSWG+ WGE GY R++R  D
Sbjct:   387 YHHTGLSDPFNPFELTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRIRRGTD 445


>DICTYBASE|DDB_G0292462
            symbol:DDB_G0292462 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0292462 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            EMBL:AAFI02000190 RefSeq:XP_629634.1 MEROPS:C01.A56
            EnsemblProtists:DDB0184413 GeneID:8628698 KEGG:ddi:DDB_G0292462
            InParanoid:Q54D62 OMA:NTQVESH Uniprot:Q54D62
        Length = 323

 Score = 289 (106.8 bits), Expect = 1.8e-25, P = 1.8e-25
 Identities = 77/263 (29%), Positives = 129/263 (49%)

Query:    84 YRRPDGLTSRKGTSFKYENVIDVPATMDWRKN--GAVTPIKNQGPCGSCWA--FSAVAAT 139
             Y  P   +     S+    +  +PA+ D R N    ++P++ Q  CGSCWA   S + A 
Sbjct:    24 YGFPGSYSGCSSISYSQNELDTIPASFDVRTNWGDCMSPVREQQSCGSCWAQVTSGILAD 83

Query:   140 EGITQLTTGKLISLSEQELVSCD-------TSGVDHGCEGGEMEDAFKFIIHNDGITTEA 192
                 +      + LS Q L+ CD        SG ++GC+GG +  A   +I N+GI ++ 
Sbjct:    84 RMCIESDKNIKMLLSPQYLMDCDGSCVSDGVSGCNNGCKGGFVGLALTRLI-NEGIVSDE 142

Query:   193 NYPYQAV-DGTCNKT-NEASHVAKIKGYETVPANS----EEALLKAVANQPVAVSIDASG 246
                YQA  D +C  T ++ S ++    Y+     +    ++A  + + N PV  +     
Sbjct:   143 CLSYQASKDSSCPTTCDDGSPISNTTIYKATSCRAFPTVQDAQYEIMTNGPVIATFMLY- 201

Query:   247 SAFQFYSSGVFTGDCGTELD-HGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRD 305
             S F+ +   V+     T+++ H V  VG+G T++G  YW+  NSWGT WG++GY +++R 
Sbjct:   202 SDFKPHKWDVYIKSSNTQVESHAVRVVGWGTTSDGVDYWIAANSWGTGWGDKGYFKIRRG 261

Query:   306 IDA---KEGLCGIAMDS-SYPTA 324
              D    +EG   +  D+ S PT+
Sbjct:   262 SDEAAFEEGFITVTADTASVPTS 284


>WB|WBGene00013076
            symbol:Y51A2D.8 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            EMBL:AL021497 HSSP:P53634 HOGENOM:HOG000019851 PIR:T27079
            RefSeq:NP_507627.1 ProteinModelPortal:Q9XXQ7 SMR:Q9XXQ7
            MEROPS:C01.A49 EnsemblMetazoa:Y51A2D.8 GeneID:180208
            KEGG:cel:CELE_Y51A2D.8 UCSC:Y51A2D.8 CTD:180208 WormBase:Y51A2D.8
            eggNOG:NOG307864 InParanoid:Q9XXQ7 OMA:VAVYFKV NextBio:908434
            Uniprot:Q9XXQ7
        Length = 386

 Score = 287 (106.1 bits), Expect = 2.9e-25, P = 2.9e-25
 Identities = 83/322 (25%), Positives = 141/322 (43%)

Query:     8 SRKLQEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEF-IESLNAAGNKPYKLSI 66
             +RK ++ S   +++Q  + + K Y N ++   + +    + +F I   +      +   +
Sbjct:    51 NRKYKDES---ENQQRFNNFVKSYNNVDKLNAKSKAAGYDTQFGINKFSDLSTAEFHGRL 107

Query:    67 NEFADQTNQEFKAFRNGYRRPD------GLTSRKGTSFKYENVIDVPATMDWRKNGA--V 118
             +      N          ++PD        T  K  S +Y +  D+    + + NG   V
Sbjct:   108 SNVVPSNNTGLPMLNFDKKKPDFRAADMNKTRHKRRSTRYPDYFDL---RNEKINGRYIV 164

Query:   119 TPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDA 178
              PIK+QG C  CW F+  A  E +    +GK  SLS+QE+  C T G   GC+GG +   
Sbjct:   165 GPIKDQGQCACCWGFAVTALVETVYAAHSGKFKSLSDQEVCDCGTEGTP-GCKGGSLTLG 223

Query:   179 FKFIIHNDGITTEANYPY---QAVDGTCNKTNEASHVAKIKGYETV---PANSEEALLKA 232
              +++    G++ + +YPY   +A  G   +  E   +   + +      P  +EE +++ 
Sbjct:   224 VQYV-KKYGLSGDEDYPYDQNRANQGRRCRLRETDRIVPARAFNFAVINPRRAEEQIIQV 282

Query:   233 VANQPVAVSIDAS-GSAFQFYSSGVFT-GDCGTELD-HGVTAVGYGAT--ANGTK--YWL 285
             +    V V++    G  F+ Y  GV    DC      H    VGY     + G    YW+
Sbjct:   283 LTEWKVPVAVYFKVGDQFKEYKEGVIIEDDCRRATQWHAGAIVGYDTVEDSRGRSHDYWI 342

Query:   286 VKNSWGTSWGEEGYIRMKRDID 307
             +KNSWG  W E GY+R+ R  D
Sbjct:   343 IKNSWGGDWAESGYVRVVRGRD 364


>UNIPROTKB|E9PKT6
            symbol:CTSH "Cathepsin H" species:9606 "Homo sapiens"
            [GO:0001520 "outer dense fiber" evidence=IEA] [GO:0001656
            "metanephros development" evidence=IEA] [GO:0001669 "acrosomal
            vesicle" evidence=IEA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0007283 "spermatogenesis" evidence=IEA] [GO:0008284 "positive
            regulation of cell proliferation" evidence=IEA] [GO:0010634
            "positive regulation of epithelial cell migration" evidence=IEA]
            [GO:0016505 "apoptotic protease activator activity" evidence=IEA]
            [GO:0030984 "kininogen binding" evidence=IEA] [GO:0031638 "zymogen
            activation" evidence=IEA] [GO:0031648 "protein destabilization"
            evidence=IEA] [GO:0032403 "protein complex binding" evidence=IEA]
            [GO:0032526 "response to retinoic acid" evidence=IEA] [GO:0033619
            "membrane protein proteolysis" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0043621 "protein
            self-association" evidence=IEA] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IEA] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            InterPro:IPR000169 GO:GO:0043066 GO:GO:0008284 PANTHER:PTHR12411
            PROSITE:PS00139 GO:GO:0045766 GO:GO:0004252 GO:GO:0032526
            GO:GO:0016505 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 GO:GO:0060448 GO:GO:0033619
            EMBL:AC011944 HGNC:HGNC:2535 IPI:IPI00375426
            ProteinModelPortal:E9PKT6 SMR:E9PKT6 PRIDE:E9PKT6
            Ensembl:ENST00000528741 ArrayExpress:E9PKT6 Bgee:E9PKT6
            Uniprot:E9PKT6
        Length = 134

 Score = 282 (104.3 bits), Expect = 9.7e-25, P = 9.7e-25
 Identities = 54/135 (40%), Positives = 80/135 (59%)

Query:    64 LSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA-VTPIK 122
             +++N+F+D +  E K  +  +  P   ++ K    +       P ++DWRK G  V+P+K
Sbjct:     1 MALNQFSDMSFAEIK-HKYLWSEPQNCSATKSNYLRGTG--PYPPSVDWRKKGNFVSPVK 57

Query:   123 NQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFI 182
             NQG CGSCW FS   A E    + TGK++SL+EQ+LV C     +HGC+GG    AF++I
Sbjct:    58 NQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYI 117

Query:   183 IHNDGITTEANYPYQ 197
             ++N GI  E  YPYQ
Sbjct:   118 LYNKGIMGEDTYPYQ 132


>DICTYBASE|DDB_G0280187
            symbol:DDB_G0280187 "cathepsin Z-like protein"
            species:44689 "Dictyostelium discoideum" [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            dictyBase:DDB_G0280187 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            EMBL:AAFI02000035 KO:K08568 RefSeq:XP_641294.1
            ProteinModelPortal:Q54VR1 MEROPS:C01.A61 PRIDE:Q54VR1
            EnsemblProtists:DDB0233838 GeneID:8622427 KEGG:ddi:DDB_G0280187
            InParanoid:Q54VR1 OMA:VWKVGDY Uniprot:Q54VR1
        Length = 291

 Score = 183 (69.5 bits), Expect = 3.2e-24, Sum P(2) = 3.2e-24
 Identities = 38/99 (38%), Positives = 63/99 (63%)

Query:   224 NSEEALLKAV-ANQPVAVSIDASGSAFQFYSSGVFTGDCGT--ELDHGVTAVGYGATANG 280
             N   A+++ + A  P+A  ++ +  AF+ Y+SGVFT   G+  E++H ++ +G+G T NG
Sbjct:   190 NGSVAMMQEIFARGPIACGMEVT-DAFESYTSGVFTSSVGSTGEINHEISIIGWG-TENG 247

Query:   281 TKYWLVKNSWGTSWGEEGYIRMKRDID--AKEGLCGIAM 317
               YW+ +NSWGT +GE G+ R++R ID  + E  C  A+
Sbjct:   248 VDYWIGRNSWGTYFGELGFFRIQRGIDLLSIESACDWAV 286

 Score = 146 (56.5 bits), Expect = 3.2e-24, Sum P(2) = 3.2e-24
 Identities = 38/108 (35%), Positives = 59/108 (54%)

Query:   106 VPATMDWRK-NGA--VTPIKNQG-P--CGSCWAFSAVAATEG---ITQLTTGKLISLSEQ 156
             +P   DWR  +G+  +T  +NQ  P  CGSCWA    +A      I +  T   + L+ Q
Sbjct:    49 LPTQYDWRNISGSSYITITRNQHLPQYCGSCWAHGTTSALGDRIKIGRKGTFPEVVLAPQ 108

Query:   157 ELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCN 204
              L++C  +G D+ C+GG+  +A+ ++    GIT E   PY+A+D  CN
Sbjct:   109 VLLNC--AGPDNTCDGGDPTEAYAYMAAK-GITDETCAPYEAIDNECN 153


>UNIPROTKB|F1RWA9
            symbol:CTSO "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 EMBL:CU855637
            Ensembl:ENSSSCT00000009707 OMA:WAFSIVG Uniprot:F1RWA9
        Length = 194

 Score = 265 (98.3 bits), Expect = 6.1e-23, P = 6.1e-23
 Identities = 62/197 (31%), Positives = 103/197 (52%)

Query:   127 CGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHND 186
             CG CWAFS V+A E    +    L  LS Q+++ C  +  ++GC GG   +A  ++    
Sbjct:     2 CGGCWAFSVVSAVESAYAIKGQPLEVLSVQQVIDCSYN--NYGCNGGSTLNALYWLNKTQ 59

Query:   187 -GITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVP-ANSEEALLKAVANQ-PVAVSID 243
               + +++ YP++A +G C+  + +     IK Y     +  E+ + K +    P+ V +D
Sbjct:    60 VKVVSDSEYPFKAQNGLCHYFSCSHSGVSIKDYSAYDFSGQEDEMAKTLLTLGPLIVIVD 119

Query:   244 ASGSAFQFYSSGVFTGDCGT-ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRM 302
             A   ++Q Y  G+    C + E +H V   G+  T + T YW+V+NSWG++WG +GY  +
Sbjct:   120 AV--SWQDYLGGIIQHHCSSGEANHAVLVTGFDKTGS-TPYWIVRNSWGSAWGIDGYALV 176

Query:   303 KRDIDAKEGLCGIAMDS 319
             K        +CGIA DS
Sbjct:   177 KMG----GNICGIA-DS 188


>TAIR|locus:2133402
            symbol:AT4G01610 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0005829 "cytosol" evidence=IDA] [GO:0000902 "cell
            morphogenesis" evidence=RCA] [GO:0006635 "fatty acid
            beta-oxidation" evidence=RCA] [GO:0010162 "seed dormancy process"
            evidence=RCA] [GO:0016049 "cell growth" evidence=RCA] [GO:0048193
            "Golgi vesicle transport" evidence=RCA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0005773 EMBL:CP002687
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 eggNOG:NOG315657
            HOGENOM:HOG000241341 KO:K01363 PANTHER:PTHR12411:SF16 OMA:DAIPDHF
            HSSP:P07858 ProtClustDB:CLSN2687619 EMBL:AF370193 EMBL:AY065167
            EMBL:AY114015 EMBL:AY086034 EMBL:AF083797 EMBL:BT001190
            EMBL:AK175280 EMBL:AK175481 EMBL:AK175539 EMBL:AK176165
            EMBL:AK176244 EMBL:AK176281 EMBL:AK176330 EMBL:AK176416
            EMBL:AK176433 EMBL:AK176487 EMBL:AK221398 EMBL:AK230235
            IPI:IPI00530811 RefSeq:NP_567215.1 UniGene:At.24471
            ProteinModelPortal:Q94K85 SMR:Q94K85 STRING:Q94K85 MEROPS:C01.144
            PaxDb:Q94K85 PRIDE:Q94K85 EnsemblPlants:AT4G01610.1 GeneID:826792
            KEGG:ath:AT4G01610 TAIR:At4g01610 InParanoid:Q94K85
            PhylomeDB:Q94K85 Genevestigator:Q94K85 Uniprot:Q94K85
        Length = 359

 Score = 169 (64.5 bits), Expect = 9.5e-23, Sum P(2) = 9.5e-23
 Identities = 41/130 (31%), Positives = 63/130 (48%)

Query:   191 EANYPYQAVDGTC---NKT-NEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASG 246
             E  YP       C   NK  +E+ H + +  Y TV +N ++ + +   N PV VS     
Sbjct:   208 EPAYPTPKCSRKCVSDNKLWSESKHYS-VSTY-TVKSNPQDIMAEVYKNGPVEVSFTVYE 265

Query:   247 SAFQFYSSGVFTGDCGTELD-HGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRD 305
               F  Y SGV+    G+ +  H V  +G+G ++ G  YWL+ N W   WG++GY  ++R 
Sbjct:   266 D-FAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRRG 324

Query:   306 IDAKEGLCGI 315
              +     CGI
Sbjct:   325 TNE----CGI 330

 Score = 157 (60.3 bits), Expect = 9.5e-23, Sum P(2) = 9.5e-23
 Identities = 48/161 (29%), Positives = 79/161 (49%)

Query:    42 RIFKDNVEFIESLNAAGNKPYKLSINE-FADQTNQEFKAFRNGYRRPDGLTSRKGTSF-K 99
             +I +D  E ++ +N   N  +K +IN+ F++ T  EFK    G + P       G     
Sbjct:    41 KILQD--EIVKKVNENPNAGWKAAINDRFSNATVAEFKRLL-GVK-PTPKKHFLGVPIVS 96

Query:   100 YENVIDVPATMD----WRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSE 155
             ++  + +P   D    W +  ++  I +QG CGSCWAF AV +      +  G  ISLS 
Sbjct:    97 HDPSLKLPKAFDARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIQFGMNISLSV 156

Query:   156 QELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPY 196
              +L++C       GC+GG    A+++  ++ G+ TE   PY
Sbjct:   157 NDLLACCGFRCGDGCDGGYPIAAWQYFSYS-GVVTEECDPY 196


>TAIR|locus:505006093
            symbol:AT1G02305 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0005829 "cytosol" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684 GO:GO:0005773
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 HOGENOM:HOG000241341
            KO:K01363 PANTHER:PTHR12411:SF16 OMA:CCGFLCG UniGene:At.23486
            UniGene:At.42610 UniGene:At.43952 EMBL:AY039887 EMBL:AF428337
            EMBL:BT002227 IPI:IPI00524601 RefSeq:NP_563648.1 HSSP:P07858
            ProteinModelPortal:Q93VC9 SMR:Q93VC9 IntAct:Q93VC9 STRING:Q93VC9
            MEROPS:C01.049 PRIDE:Q93VC9 ProMEX:Q93VC9 EnsemblPlants:AT1G02305.1
            GeneID:839538 KEGG:ath:AT1G02305 TAIR:At1g02305 InParanoid:Q93VC9
            PhylomeDB:Q93VC9 ProtClustDB:CLSN2687619 Genevestigator:Q93VC9
            Uniprot:Q93VC9
        Length = 362

 Score = 161 (61.7 bits), Expect = 4.8e-22, Sum P(2) = 4.8e-22
 Identities = 38/128 (29%), Positives = 60/128 (46%)

Query:   191 EANYPYQAVDGTCNKTNEASHVAKIKGYET--VPANSEEALLKAVANQPVAVSIDASGSA 248
             E  YP       C   N+    +K  G     V ++ ++ + +   N PV V+       
Sbjct:   211 EPAYPTPKCARKCVSGNQLWRESKHYGVSAYKVRSHPDDIMAEVYKNGPVEVAFTVYED- 269

Query:   249 FQFYSSGVFTGDCGTELD-HGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDID 307
             F  Y SGV+    GT +  H V  +G+G + +G  YWL+ N W  SWG++GY +++R  +
Sbjct:   270 FAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTN 329

Query:   308 AKEGLCGI 315
                  CGI
Sbjct:   330 E----CGI 333

 Score = 160 (61.4 bits), Expect = 4.8e-22, Sum P(2) = 4.8e-22
 Identities = 47/154 (30%), Positives = 73/154 (47%)

Query:    49 EFIESLNAAGNKPYKLSINE-FADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENV-IDV 106
             E ++ +N   N  +K S N+ FA+ T  EFK    G + P   T   G      ++ + +
Sbjct:    49 EIVKEVNENPNAGWKASFNDRFANATVAEFKRLL-GVK-PTPKTEFLGVPIVSHDISLKL 106

Query:   107 PATMD----WRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCD 162
             P   D    W +  ++  I +QG CGSCWAF AV +      +     +SLS  +L++C 
Sbjct:   107 PKEFDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLLACC 166

Query:   163 TSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPY 196
                   GC GG    A+++  H+ G+ TE   PY
Sbjct:   167 GFLCGQGCNGGYPIAAWRYFKHH-GVVTEECDPY 199


>DICTYBASE|DDB_G0283401
            symbol:ctsZ "cathepsin Z precursor" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            dictyBase:DDB_G0283401 GO:GO:0005615 GenomeReviews:CM000153_GR
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 EMBL:AAFI02000055 KO:K08568 OMA:QCGTCTE
            eggNOG:NOG275763 RefSeq:XP_639036.1 ProteinModelPortal:Q54R55
            IntAct:Q54R55 MEROPS:C01.A60 PRIDE:Q54R55
            EnsemblProtists:DDB0233836 GeneID:8624061 KEGG:ddi:DDB_G0283401
            InParanoid:Q54R55 Uniprot:Q54R55
        Length = 296

 Score = 251 (93.4 bits), Expect = 1.9e-21, P = 1.9e-21
 Identities = 75/243 (30%), Positives = 123/243 (50%)

Query:   104 IDVPATMDWRKNGAV---TPIKNQG-P--CGSCWAFSAVAATEG---ITQLTTGKLISLS 154
             ++VP + DWR    V   T  +NQ  P  CG CWAF++ ++      I +      ++++
Sbjct:    56 LEVPQSWDWRNVSGVNYLTMNRNQHIPQYCGGCWAFASTSSISDRIKIQRKAAFPDVNVA 115

Query:   155 EQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAV--------------- 199
              Q L+ C+  G    C+GG+  DAF FI  N GI  E   PYQA                
Sbjct:   116 PQHLIDCNGGGT---CDGGDPGDAFAFINEN-GIVDETCKPYQAKNLPDECSPACKTCNP 171

Query:   200 DGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGVFTG 259
             DGTC      +++  +  Y +V   +++ + +  A  P+A SIDA+ S  + Y+SG+F  
Sbjct:   172 DGTCQAIPVHTNIT-VTEYGSV-RGAKDMMAEIYARGPIACSIDAT-SKLEAYTSGIFKE 228

Query:   260 DCGTEL-DHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMD 318
                  L +H ++ +G+G   + T YW+V+NSWG+ +GE G+  + +     E L GI +D
Sbjct:   229 FKLDPLPNHIISVIGWGVQ-DSTPYWIVRNSWGSYYGEGGFFNIVQG-SLFENL-GIELD 285

Query:   319 SSY 321
              ++
Sbjct:   286 CNW 288


>MGI|MGI:1891190
            symbol:Ctsz "cathepsin Z" species:10090 "Mus musculus"
            [GO:0005615 "extracellular space" evidence=ISO] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1891190 GO:GO:0005783 GO:GO:0005615 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0060441 GeneTree:ENSGT00560000076599 MEROPS:C01.013 CTD:1522
            HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN OMA:QCGTCTE
            ChiTaRS:CTSZ EMBL:AJ242663 EMBL:AF136277 EMBL:AF136278
            EMBL:BC008619 IPI:IPI00986833 RefSeq:NP_071720.1 UniGene:Mm.156919
            ProteinModelPortal:Q9WUU7 SMR:Q9WUU7 IntAct:Q9WUU7 STRING:Q9WUU7
            PaxDb:Q9WUU7 PRIDE:Q9WUU7 Ensembl:ENSMUST00000016400 GeneID:64138
            KEGG:mmu:64138 InParanoid:Q9WUU7 NextBio:319927 Bgee:Q9WUU7
            CleanEx:MM_CTSZ Genevestigator:Q9WUU7 GermOnline:ENSMUSG00000016256
            Uniprot:Q9WUU7
        Length = 306

 Score = 251 (93.4 bits), Expect = 1.9e-21, P = 1.9e-21
 Identities = 68/227 (29%), Positives = 109/227 (48%)

Query:    99 KYENVIDVPATMDWRK-NGA--VTPIKNQG-P--CGSCWAF---SAVAATEGITQLTTGK 149
             +Y +  D+P   DWR  NG    +  +NQ  P  CGSCWA    SA+A    I +     
Sbjct:    57 EYLSPADLPKNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWP 116

Query:   150 LISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEA 209
              I LS Q ++ C  +G    CEGG     +++  H  GI  E    YQA D  C+K N+ 
Sbjct:   117 SILLSVQNVIDCGNAG---SCEGGNDLPVWEYA-HKHGIPDETCNNYQAKDQDCDKFNQC 172

Query:   210 SHVAKIKGYETVP-------------ANSEEALLKAVANQPVAVSIDASGSAFQFYSSGV 256
                 + K   T+              +  E+ + +  AN P++  I A+      Y+ G+
Sbjct:   173 GTCTEFKECHTIQNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMAT-EMMSNYTGGI 231

Query:   257 FTGDCGTE-LDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRM 302
             +        ++H ++  G+G + +G +YW+V+NSWG  WGE+G++R+
Sbjct:   232 YAEHQDQAVINHIISVAGWGVSNDGIEYWIVRNSWGEPWGEKGWMRI 278


>WB|WBGene00000788
            symbol:cpz-1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] [GO:0009792 "embryo development ending in
            birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0010171 "body morphogenesis" evidence=IMP]
            [GO:0018996 "molting cycle, collagen and cuticulin-based cuticle"
            evidence=IMP] [GO:0031012 "extracellular matrix" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0009792 GO:GO:0040007 GO:GO:0002119
            GO:GO:0018996 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0010171 GO:GO:0031012
            GeneTree:ENSGT00560000076599 KO:K08568 OMA:QCGTCTE EMBL:FO081275
            EMBL:BK001409 PIR:T29872 RefSeq:NP_491023.2 HSSP:Q9UBR2
            ProteinModelPortal:G5EGP8 SMR:G5EGP8 IntAct:G5EGP8 MEROPS:C01.A38
            EnsemblMetazoa:F32B5.8 GeneID:171829 KEGG:cel:CELE_F32B5.8
            CTD:171829 WormBase:F32B5.8 NextBio:872879 Uniprot:G5EGP8
        Length = 306

 Score = 248 (92.4 bits), Expect = 3.9e-21, P = 3.9e-21
 Identities = 71/221 (32%), Positives = 105/221 (47%)

Query:   105 DVPATMDWRK-NGA--VTPIKNQG-P--CGSCWAFSAVAATE---GITQLTTGKLISLSE 155
             D+P T DWR  NG    +  +NQ  P  CGSCWAF A +A      I +        LS 
Sbjct:    64 DLPKTWDWRDANGINYASADRNQHIPQYCGSCWAFGATSALADRINIKRKNAWPQAYLSV 123

Query:   156 QELVSCDTSGVDHGCE-GGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEAS---- 210
             QE++ C  +G    C  GGE    +K+  H  GI  E    YQA DG C+  N       
Sbjct:   124 QEVIDCSGAGT---CVMGGEPGGVYKYA-HEHGIPHETCNNYQARDGKCDPYNRCGSCWP 179

Query:   211 -HVAKIKGY------ETVPANSEEALLKAVANQ-PVAVSIDASGSAFQFYSSGVFTGDCG 262
                  IK Y      E    +  E +   + ++ P+A  I A+  AF+ Y+ G++     
Sbjct:   180 GECFSIKNYTLYKVSEYGTVHGYEKMKAEIYHKGPIACGIAAT-KAFETYAGGIYKEVTD 238

Query:   263 TELDHGVTAVGYGAT-ANGTKYWLVKNSWGTSWGEEGYIRM 302
              ++DH ++  G+G    +G +YW+ +NSWG  WGE G+ ++
Sbjct:   239 EDIDHIISVHGWGVDHESGVEYWIGRNSWGEPWGEHGWFKI 279


>DICTYBASE|DDB_G0288563
            symbol:DDB_G0288563 species:44689 "Dictyostelium
            discoideum" [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0288563
            GO:GO:0005615 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197
            EMBL:AAFI02000117 PANTHER:PTHR12411:SF16 RefSeq:XP_636643.1
            MEROPS:C01.A58 PRIDE:Q54IS1 EnsemblProtists:DDB0187993
            GeneID:8626689 KEGG:ddi:DDB_G0288563 InParanoid:Q54IS1 OMA:AWEYMEL
            Uniprot:Q54IS1
        Length = 314

 Score = 246 (91.7 bits), Expect = 6.3e-21, P = 6.3e-21
 Identities = 69/230 (30%), Positives = 115/230 (50%)

Query:   106 VPATMDWRKN--GAVTPIKNQGPCGSCWAFSAVAA-TEGITQLTTGKLI--SLSEQELVS 160
             +P + D R      + PI NQ  CGSCWAFS+    ++ +   +  K    +LS Q LV+
Sbjct:    88 IPTSFDSRVQWPDCIHPILNQEQCGSCWAFSSSEVLSDRLCIASNNKTNPGALSPQTLVA 147

Query:   161 CDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGT---CNKT---NEASHVAK 214
             CD  G D GC GG  + A++++    G+ T++  PY A +GT   C ++   +E   + +
Sbjct:   148 CDVYGND-GCSGGIPQLAWEYM-ELKGLPTDSCVPYTAGNGTVYSCQRSCSDSEDYSLYR 205

Query:   215 IKGYETVPANSEEALLKAV-ANQPVAVSIDASGSAFQFYSSGVFTGDCGTEL--DHGVTA 271
              K +     +S + + + + A  P+  +++     F  YSSGV+    G+ L   H +  
Sbjct:   206 AKPFTLKTCSSVQCIQENILAYGPIVGTMEVYED-FMSYSSGVYVMTPGSSLLGGHAIKI 264

Query:   272 VGYGATANGT-KYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSS 320
             VG+G        YW+V NSWG  WG++G+  +  +       C I+ D+S
Sbjct:   265 VGWGFDQTSQLNYWIVANSWGADWGQQGFFFISMET------CSISSDAS 308


>WB|WBGene00000784
            symbol:cpr-4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39895 EMBL:L39926 EMBL:FO081381
            PIR:T37280 RefSeq:NP_504682.1 UniGene:Cel.5404
            ProteinModelPortal:P43508 SMR:P43508 DIP:DIP-25376N
            MINT:MINT-1069892 STRING:P43508 MEROPS:C01.A34 PaxDb:P43508
            EnsemblMetazoa:F44C4.3 GeneID:179053 KEGG:cel:CELE_F44C4.3
            UCSC:F44C4.3 CTD:179053 WormBase:F44C4.3 InParanoid:P43508
            OMA:CCGFLCG NextBio:903704 Uniprot:P43508
        Length = 335

 Score = 160 (61.4 bits), Expect = 2.0e-20, Sum P(2) = 2.0e-20
 Identities = 45/135 (33%), Positives = 65/135 (48%)

Query:   185 NDGITTEANYPYQAVDGTCNKTNEASHVA-KIKGYETVPANSEEALLKA--VANQPVAVS 241
             +DG  T A      V+   NK    ++ A K  G        + + ++A  +A+ PV  +
Sbjct:   201 DDGYDTPA-----CVNKCTNKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEAA 255

Query:   242 IDASGSAFQFYSSGVFTGDCGTELD-HGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYI 300
                    +Q Y +GV+    G EL  H +  +G+G T NGT YWLV NSW  +WGE GY 
Sbjct:   256 FTVYEDFYQ-YKTGVYVHTTGQELGGHAIRILGWG-TDNGTPYWLVANSWNVNWGENGYF 313

Query:   301 RMKRDIDAKEGLCGI 315
             R+ R  +     CGI
Sbjct:   314 RIIRGTNE----CGI 324

 Score = 144 (55.7 bits), Expect = 2.0e-20, Sum P(2) = 2.0e-20
 Identities = 34/98 (34%), Positives = 57/98 (58%)

Query:   106 VPATMD----WRKNGAVTPIKNQGPCGSCWAFSAV-AATEGITQLTTGKLISL-SEQELV 159
             +PAT D    W    ++  I++Q  CGSCWAF+A  AA++     + G + +L S ++++
Sbjct:    81 IPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVL 140

Query:   160 SCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQ 197
             SC  S   +GCEGG   +A+K+++ + G  T  +Y  Q
Sbjct:   141 SC-CSNCGYGCEGGYPINAWKYLVKS-GFCTGGSYEAQ 176


>UNIPROTKB|E2QV47
            symbol:CTSH "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0097208 "alveolar lamellar body"
            evidence=IEA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0070371 "ERK1 and ERK2 cascade"
            evidence=IEA] [GO:0070324 "thyroid hormone binding" evidence=IEA]
            [GO:0060448 "dichotomous subdivision of terminal units involved in
            lung branching" evidence=IEA] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IEA] [GO:0043129 "surfactant homeostasis"
            evidence=IEA] [GO:0043066 "negative regulation of apoptotic
            process" evidence=IEA] [GO:0033619 "membrane protein proteolysis"
            evidence=IEA] [GO:0032526 "response to retinoic acid" evidence=IEA]
            [GO:0031648 "protein destabilization" evidence=IEA] [GO:0031638
            "zymogen activation" evidence=IEA] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=IEA] [GO:0016505
            "apoptotic protease activator activity" evidence=IEA] [GO:0010815
            "bradykinin catabolic process" evidence=IEA] [GO:0010813
            "neuropeptide catabolic process" evidence=IEA] [GO:0010634
            "positive regulation of epithelial cell migration" evidence=IEA]
            [GO:0010628 "positive regulation of gene expression" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=IEA] [GO:0005829 "cytosol" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0004252 "serine-type endopeptidase activity"
            evidence=IEA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IEA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 GO:GO:0016505 GO:GO:0010634
            GO:GO:0004197 GO:GO:0042599 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 Ensembl:ENSCAFT00000036196 Uniprot:E2QV47
        Length = 136

 Score = 240 (89.5 bits), Expect = 2.7e-20, P = 2.7e-20
 Identities = 52/137 (37%), Positives = 78/137 (56%)

Query:   191 EANYPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVA-NQPVAVSIDASGSAF 249
             E +YPY+  DG C K   +  +A +K    +  N E+A+++AVA   PV+ + + + S F
Sbjct:     3 EDSYPYKGQDGDC-KYQPSKAIAFVKDVANITINDEQAMVEAVALYNPVSFAFEVT-SDF 60

Query:   250 QFYSSGVFTG-DCGT---ELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRD 305
               Y  G+++   C     +++H V AVGYG   NG  YW+VKNSWG  WG  GY  M+R 
Sbjct:    61 MMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEQ-NGIPYWIVKNSWGPQWGMNGYFLMERG 119

Query:   306 IDAKEGLCGIAMDSSYP 322
                 + +CG+A  +SYP
Sbjct:   120 ----KNMCGLAACASYP 132


>WB|WBGene00000785
            symbol:cpr-5 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39896 EMBL:L39927 EMBL:FO081739
            PIR:T37277 RefSeq:NP_503383.1 UniGene:Cel.19730
            ProteinModelPortal:P43509 SMR:P43509 DIP:DIP-25329N IntAct:P43509
            MINT:MINT-1051285 STRING:P43509 MEROPS:C01.A35 PaxDb:P43509
            EnsemblMetazoa:W07B8.5 GeneID:178612 KEGG:cel:CELE_W07B8.5
            UCSC:W07B8.5.1 CTD:178612 WormBase:W07B8.5 InParanoid:P43509
            OMA:DAIPDHF NextBio:901840 Uniprot:P43509
        Length = 344

 Score = 167 (63.8 bits), Expect = 7.0e-20, Sum P(2) = 7.0e-20
 Identities = 41/124 (33%), Positives = 63/124 (50%)

Query:   197 QAVDGTCNKTNEASHVAKIKGYET----VPANSEEALLKAVANQPVAVSIDASGSAFQFY 252
             + VD   +K N A+   + K + +    V    E+   + + N P+ V+       +Q Y
Sbjct:   212 KCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIEVAFTVYEDFYQ-Y 270

Query:   253 SSGVFTGDCGTELD-HGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEG 311
             ++GV+    G  L  H V  +G+G   NGT YWLV NSW  +WGE+GY R+ R ++    
Sbjct:   271 TTGVYVHTAGASLGGHAVKILGWGVD-NGTPYWLVANSWNVAWGEKGYFRIIRGLNE--- 326

Query:   312 LCGI 315
              CGI
Sbjct:   327 -CGI 329

 Score = 131 (51.2 bits), Expect = 7.0e-20, Sum P(2) = 7.0e-20
 Identities = 30/90 (33%), Positives = 49/90 (54%)

Query:   112 WRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLIS--LSEQELVSCDTS--GVD 167
             W    ++  I++Q  CGSCWAF+A  A    T + +   ++  LS ++L+SC T      
Sbjct:    92 WPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLLSCCTGMFSCG 151

Query:   168 HGCEGGEMEDAFKFIIHNDGITTEANYPYQ 197
             +GCEGG    A+K+ + + G+ T  +Y  Q
Sbjct:   152 NGCEGGYPIQAWKWWVKH-GLVTGGSYETQ 180


>UNIPROTKB|E1C4M3
            symbol:CTSZ "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0060441 "epithelial tube branching
            involved in lung morphogenesis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GeneTree:ENSGT00560000076599 CTD:1522 KO:K08568 OMA:QCGTCTE
            EMBL:AADN02019004 IPI:IPI00596430 RefSeq:XP_417483.3
            Ensembl:ENSGALT00000012067 GeneID:419311 KEGG:gga:419311
            Uniprot:E1C4M3
        Length = 305

 Score = 230 (86.0 bits), Expect = 3.1e-19, P = 3.1e-19
 Identities = 62/194 (31%), Positives = 96/194 (49%)

Query:   127 CGSCWAFSAVAA-TEGITQLTTGKLIS--LSEQELVSCDTSGVDHGCEGGEMEDAFKFII 183
             CGSCWA  + +A  + I     G   S  LS Q ++ C  +G    CEGG+    + +  
Sbjct:    90 CGSCWAHGSTSALADRINIKRKGAWPSAYLSVQNVIDCANAG---SCEGGDHTGVWMYA- 145

Query:   184 HNDGITTEANYPYQAVDGTCNKTNEAS--------HVAK------IKGYETVPANSEEAL 229
             H+ GI  E    YQA +  C K N+          HV K      +  Y  V +  E+ +
Sbjct:   146 HDHGIPDETCNNYQAKNQKCKKFNQCGTCVTFGECHVIKNYTLWKVADYGAV-SGREKMM 204

Query:   230 LKAVANQPVAVSIDASGSAFQFYSSGVFTG-DCGTELDHGVTAVGYGATANGTKYWLVKN 288
              +  AN P++  I A+      Y+ G++T  +    ++H V+  G+G   NGT+YW+V+N
Sbjct:   205 AEIYANGPISCGIMAT-EKLDAYTGGLYTEYNPSPTVNHIVSVAGWGVE-NGTEYWIVRN 262

Query:   289 SWGTSWGEEGYIRM 302
             SWG  WGE G++R+
Sbjct:   263 SWGEPWGERGWLRI 276

 Score = 137 (53.3 bits), Expect = 1.1e-06, P = 1.1e-06
 Identities = 43/133 (32%), Positives = 64/133 (48%)

Query:    85 RRPDGLTSRKGTSFKYENVIDVPATMDWRK-NGA--VTPIKNQG-P--CGSCWAFSAVAA 138
             RR  GL +      +Y ++ ++P + DWR  NG    +  +NQ  P  CGSCWA  + +A
Sbjct:    43 RRAPGLRTYP-RPHEYLDMAELPQSWDWRNVNGVNYASTTRNQHIPQYCGSCWAHGSTSA 101

Query:   139 -TEGITQLTTGKLIS--LSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYP 195
               + I     G   S  LS Q ++ C  +G    CEGG+    + +  H+ GI  E    
Sbjct:   102 LADRINIKRKGAWPSAYLSVQNVIDCANAG---SCEGGDHTGVWMYA-HDHGIPDETCNN 157

Query:   196 YQAVDGTCNKTNE 208
             YQA +  C K N+
Sbjct:   158 YQAKNQKCKKFNQ 170


>RGD|708479
            symbol:Ctsz "cathepsin Z" species:10116 "Rattus norvegicus"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=TAS]
            [GO:0005615 "extracellular space" evidence=IEA;ISO] [GO:0005783
            "endoplasmic reticulum" evidence=IEA;ISO] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0060441 "epithelial tube branching involved in
            lung morphogenesis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:708479 GO:GO:0005576 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004197 MEROPS:C01.013 CTD:1522 HOVERGEN:HBG004456 KO:K08568
            EMBL:AB023781 EMBL:BC091110 IPI:IPI00207663 RefSeq:NP_899159.1
            UniGene:Rn.1475 ProteinModelPortal:Q9R1T3 SMR:Q9R1T3 PRIDE:Q9R1T3
            GeneID:252929 KEGG:rno:252929 BindingDB:Q9R1T3 NextBio:624097
            Genevestigator:Q9R1T3 Uniprot:Q9R1T3
        Length = 306

 Score = 225 (84.3 bits), Expect = 1.1e-18, P = 1.1e-18
 Identities = 57/193 (29%), Positives = 93/193 (48%)

Query:   127 CGSCWAFSAVAA-TEGITQLTTGKLIS--LSEQELVSCDTSGVDHGCEGGEMEDAFKFII 183
             CGSCWA  + +A  + I     G   S  LS Q ++ C  +G    CEGG     +++  
Sbjct:    91 CGSCWAHGSTSALADRINIKRKGAWPSTLLSVQNVIDCGNAG---SCEGGNDLPVWEYA- 146

Query:   184 HNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVP-------------ANSEEALL 230
             H  GI  E    YQA D  C+K N+     + K   T+              +  E+ + 
Sbjct:   147 HKHGIPDETCNNYQAKDQECDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSLSGREKMMA 206

Query:   231 KAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTEL-DHGVTAVGYGATANGTKYWLVKNS 289
             +  AN P++  I A+      Y+ G++T      + +H ++  G+G + +G +YW+V+NS
Sbjct:   207 EIYANGPISCGIMAT-ERMSNYTGGIYTEYQNQAIINHIISVAGWGVSNDGIEYWIVRNS 265

Query:   290 WGTSWGEEGYIRM 302
             WG  WGE G++R+
Sbjct:   266 WGEPWGERGWMRI 278

 Score = 143 (55.4 bits), Expect = 2.3e-07, P = 2.3e-07
 Identities = 43/132 (32%), Positives = 61/132 (46%)

Query:    99 KYENVIDVPATMDWRK-NGA--VTPIKNQG-P--CGSCWAFSAVAA-TEGITQLTTGKLI 151
             +Y +  D+P   DWR  NG    +  +NQ  P  CGSCWA  + +A  + I     G   
Sbjct:    57 EYLSPADLPKNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSALADRINIKRKGAWP 116

Query:   152 S--LSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEA 209
             S  LS Q ++ C  +G    CEGG     +++  H  GI  E    YQA D  C+K N+ 
Sbjct:   117 STLLSVQNVIDCGNAG---SCEGGNDLPVWEYA-HKHGIPDETCNNYQAKDQECDKFNQC 172

Query:   210 SHVAKIKGYETV 221
                 + K   T+
Sbjct:   173 GTCTEFKECHTI 184


>WB|WBGene00000786
            symbol:cpr-6 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            PANTHER:PTHR12411:SF16 EMBL:L39894 EMBL:L39939 EMBL:FO080666
            PIR:T37274 RefSeq:NP_741818.1 UniGene:Cel.18138
            ProteinModelPortal:P43510 SMR:P43510 DIP:DIP-25139N
            MINT:MINT-1074025 STRING:P43510 MEROPS:C01.A51 PaxDb:P43510
            PRIDE:P43510 EnsemblMetazoa:C25B8.3a GeneID:180931
            KEGG:cel:CELE_C25B8.3 UCSC:C25B8.3a CTD:180931 WormBase:C25B8.3a
            InParanoid:P43510 OMA:KAKWGLM NextBio:911608 ArrayExpress:P43510
            Uniprot:P43510
        Length = 379

 Score = 158 (60.7 bits), Expect = 1.2e-18, Sum P(2) = 1.2e-18
 Identities = 37/97 (38%), Positives = 58/97 (59%)

Query:   104 IDVPATMD----WRKNGAVTPIKNQGPCGSCWAFSAVAA-TEGITQLTTGKL-ISLSEQE 157
             +D+P + D    W K  ++  I++Q  CGSCWAF AV A ++ I   + G+L ++LS  +
Sbjct:   103 LDIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADD 162

Query:   158 LVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANY 194
             L+SC  S    GC GG+   A+++ +  DGI T +NY
Sbjct:   163 LLSCCKS-CGFGCNGGDPLAAWRYWV-KDGIVTGSNY 197

 Score = 132 (51.5 bits), Expect = 1.2e-18, Sum P(2) = 1.2e-18
 Identities = 38/128 (29%), Positives = 58/128 (45%)

Query:   194 YPYQAVDGTC--NKTNEASHVAKIKGYETVPANSE-EALLKAVANQ-PVAVSIDASGSAF 249
             YP    +  C  + T++     K  G        + EA+ K +    P+ ++ +     F
Sbjct:   228 YPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYED-F 286

Query:   250 QFYSSGVFTGDCGTELD--HGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDID 307
               Y  GV+    G +L   H V  +G+G   +G  YW V NSW T WGE+G+ R+ R +D
Sbjct:   287 LNYDGGVYV-HTGGKLGGGHAVKLIGWGID-DGIPYWTVANSWNTDWGEDGFFRILRGVD 344

Query:   308 AKEGLCGI 315
                  CGI
Sbjct:   345 E----CGI 348


>UNIPROTKB|Q9UBR2
            symbol:CTSZ "Cathepsin Z" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0060441 "epithelial tube
            branching involved in lung morphogenesis" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            Reactome:REACT_11123 Reactome:REACT_17015 InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 EMBL:CH471077 GO:GO:0005615 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AL109840 GO:GO:0060441 MEROPS:C01.013 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN
            BRENDA:3.4.18.1 EMBL:AF073890 EMBL:AF032906 EMBL:AF136273
            EMBL:AF136276 EMBL:AF136274 EMBL:AF136275 EMBL:AK314931
            EMBL:BC042168 EMBL:AF009923 IPI:IPI00002745 RefSeq:NP_001327.2
            UniGene:Hs.252549 PDB:1DEU PDB:1EF7 PDBsum:1DEU PDBsum:1EF7
            ProteinModelPortal:Q9UBR2 SMR:Q9UBR2 STRING:Q9UBR2 DMDM:12643324
            PaxDb:Q9UBR2 PeptideAtlas:Q9UBR2 PRIDE:Q9UBR2 DNASU:1522
            Ensembl:ENST00000217131 GeneID:1522 KEGG:hsa:1522 UCSC:uc002yai.2
            GeneCards:GC20M057570 HGNC:HGNC:2547 HPA:CAB025114 MIM:603169
            neXtProt:NX_Q9UBR2 PharmGKB:PA27043 InParanoid:Q9UBR2 OMA:QCGTCTE
            PhylomeDB:Q9UBR2 BindingDB:Q9UBR2 ChEMBL:CHEMBL4160 ChiTaRS:CTSZ
            EvolutionaryTrace:Q9UBR2 GenomeRNAi:1522 NextBio:6299 Bgee:Q9UBR2
            CleanEx:HS_CTSZ Genevestigator:Q9UBR2 GermOnline:ENSG00000101160
            Uniprot:Q9UBR2
        Length = 303

 Score = 222 (83.2 bits), Expect = 3.0e-18, P = 3.0e-18
 Identities = 60/194 (30%), Positives = 97/194 (50%)

Query:   127 CGSCWAFSAVAA-TEGITQLTTGKLIS--LSEQELVSCDTSGVDHGCEGGEMEDAFKFII 183
             CGSCWA ++ +A  + I     G   S  LS Q ++ C  +G    CEGG     + +  
Sbjct:    89 CGSCWAHASTSAMADRINIKRKGAWPSTLLSVQNVIDCGNAG---SCEGGNDLSVWDYA- 144

Query:   184 HNDGITTEANYPYQAVD---------GTCNKTNEASHVA-----KIKGYETVPANSEEAL 229
             H  GI  E    YQA D         GTCN+  E   +      ++  Y ++ +  E+ +
Sbjct:   145 HQHGIPDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSL-SGREKMM 203

Query:   230 LKAVANQPVAVSIDASGSAFQFYSSGVFTG-DCGTELDHGVTAVGYGATANGTKYWLVKN 288
              +  AN P++  I A+      Y+ G++      T ++H V+  G+G + +GT+YW+V+N
Sbjct:   204 AEIYANGPISCGIMAT-ERLANYTGGIYAEYQDTTYINHVVSVAGWGIS-DGTEYWIVRN 261

Query:   289 SWGTSWGEEGYIRM 302
             SWG  WGE G++R+
Sbjct:   262 SWGEPWGERGWLRI 275

 Score = 139 (54.0 bits), Expect = 6.3e-07, P = 6.3e-07
 Identities = 50/155 (32%), Positives = 71/155 (45%)

Query:    80 FRNG---YR--RPDGLTSRKGTSF----KYENVIDVPATMDWRKNGAV---TPIKNQG-P 126
             FR G   YR  R DGL     +++    +Y +  D+P + DWR    V   +  +NQ  P
Sbjct:    27 FRRGQTCYRPLRGDGLAPLGRSTYPRPHEYLSPADLPKSWDWRNVDGVNYASITRNQHIP 86

Query:   127 --CGSCWAFSAVAA-TEGITQLTTGKLIS--LSEQELVSCDTSGVDHGCEGGEMEDAFKF 181
               CGSCWA ++ +A  + I     G   S  LS Q ++ C  +G    CEGG     + +
Sbjct:    87 QYCGSCWAHASTSAMADRINIKRKGAWPSTLLSVQNVIDCGNAG---SCEGGNDLSVWDY 143

Query:   182 IIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIK 216
               H  GI  E    YQA D  C+K N+     + K
Sbjct:   144 A-HQHGIPDETCNNYQAKDQECDKFNQCGTCNEFK 177


>MGI|MGI:88561
            symbol:Ctsb "cathepsin B" species:10090 "Mus musculus"
            [GO:0004175 "endopeptidase activity" evidence=ISO] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISO] [GO:0005576
            "extracellular region" evidence=ISO] [GO:0005615 "extracellular
            space" evidence=ISO] [GO:0005737 "cytoplasm" evidence=ISO]
            [GO:0005739 "mitochondrion" evidence=ISO;IDA] [GO:0005764
            "lysosome" evidence=ISO;IDA] [GO:0005901 "caveola" evidence=ISO]
            [GO:0006508 "proteolysis" evidence=ISO] [GO:0008233 "peptidase
            activity" evidence=ISO] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISO] [GO:0009897 "external side of plasma
            membrane" evidence=ISO] [GO:0009986 "cell surface" evidence=ISO]
            [GO:0016324 "apical plasma membrane" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0030984 "kininogen binding"
            evidence=ISO] [GO:0032403 "protein complex binding" evidence=ISO]
            [GO:0042277 "peptide binding" evidence=ISO] [GO:0042383
            "sarcolemma" evidence=ISO] [GO:0043621 "protein self-association"
            evidence=ISO] [GO:0048471 "perinuclear region of cytoplasm"
            evidence=ISO] [GO:0050790 "regulation of catalytic activity"
            evidence=IEA] [GO:0060548 "negative regulation of cell death"
            evidence=ISO] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 MGI:MGI:88561
            GO:GO:0005739 GO:GO:0042470 GO:GO:0048471 GO:GO:0005615
            GO:GO:0009612 GO:GO:0009611 GO:GO:0009897 GO:GO:0045471
            GO:GO:0016324 GO:GO:0009749 GO:GO:0006914 GO:GO:0043434
            eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0042383 GO:GO:0014070
            GO:GO:0042277 GO:GO:0060548 GO:GO:0005901 GO:GO:0014075
            GO:GO:0004197 GO:GO:0070670 GO:GO:0007519 MEROPS:C01.060 CTD:1508
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 KO:K01363 OMA:GYPSGAW OrthoDB:EOG4K6G4C
            BRENDA:3.4.22.1 GO:GO:0097067 PANTHER:PTHR12411:SF16 ChiTaRS:CTSB
            EMBL:M65270 EMBL:M65263 EMBL:M65264 EMBL:M65265 EMBL:M65266
            EMBL:M65267 EMBL:M65268 EMBL:M65269 EMBL:M14222 EMBL:X54966
            EMBL:S69034 EMBL:AK083393 EMBL:AK147192 EMBL:AK149884 EMBL:AK151790
            EMBL:AK167361 EMBL:BC006656 IPI:IPI00113517 PIR:A38458
            RefSeq:NP_031824.1 UniGene:Mm.236553 UniGene:Mm.489070
            ProteinModelPortal:P10605 SMR:P10605 IntAct:P10605 STRING:P10605
            PhosphoSite:P10605 SWISS-2DPAGE:P10605 PaxDb:P10605 PRIDE:P10605
            Ensembl:ENSMUST00000006235 GeneID:13030 KEGG:mmu:13030
            UCSC:uc007uhh.1 InParanoid:P10605 BioCyc:MetaCyc:MONOMER-14810
            BindingDB:P10605 ChEMBL:CHEMBL5187 NextBio:282900 Bgee:P10605
            CleanEx:MM_CTSB Genevestigator:P10605 GermOnline:ENSMUSG00000021939
            Uniprot:P10605
        Length = 339

 Score = 148 (57.2 bits), Expect = 3.5e-18, Sum P(2) = 3.5e-18
 Identities = 40/121 (33%), Positives = 58/121 (47%)

Query:   203 CNKTNEASHVAKIK-----GYETVP-ANS-EEALLKAVANQPVAVSIDASGSAFQFYSSG 255
             CNK+ EA +    K     GY +   +NS +E + +   N PV  +     S F  Y SG
Sbjct:   207 CNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVF-SDFLTYKSG 265

Query:   256 VFTGDCGTELD-HGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCG 314
             V+  + G  +  H +  +G+G   NG  YWL  NSW   WG+ G+ ++ R     E  CG
Sbjct:   266 VYKHEAGDMMGGHAIRILGWGVE-NGVPYWLAANSWNLDWGDNGFFKILRG----ENHCG 320

Query:   315 I 315
             I
Sbjct:   321 I 321

 Score = 137 (53.3 bits), Expect = 3.5e-18, Sum P(2) = 3.5e-18
 Identities = 32/84 (38%), Positives = 47/84 (55%)

Query:   104 IDVPATMDWRKNGAVTP----IKNQGPCGSCWAFSAVAATEGITQL-TTGKL-ISLSEQE 157
             ID+P T D R+  +  P    I++QG CGSCWAF AV A    T + T G++ + +S ++
Sbjct:    78 IDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAED 137

Query:   158 LVSCDTSGVDHGCEGGEMEDAFKF 181
             L++C       GC GG    A+ F
Sbjct:   138 LLTCCGIQCGDGCNGGYPSGAWSF 161


>UNIPROTKB|A5GFX7
            symbol:CTSZ "Cathepsin Z" species:9823 "Sus scrofa"
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GO:GO:0060441 GeneTree:ENSGT00560000076599 MEROPS:C01.013 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN
            OMA:QCGTCTE EMBL:CR956646 RefSeq:NP_001116576.1 UniGene:Ssc.16769
            ProteinModelPortal:A5GFX7 SMR:A5GFX7 STRING:A5GFX7
            Ensembl:ENSSSCT00000008249 GeneID:100141405 KEGG:ssc:100141405
            ArrayExpress:A5GFX7 Uniprot:A5GFX7
        Length = 304

 Score = 222 (83.2 bits), Expect = 4.6e-18, P = 4.6e-18
 Identities = 61/194 (31%), Positives = 95/194 (48%)

Query:   127 CGSCWAFSAVAA-TEGITQLTTGKLIS--LSEQELVSCDTSGVDHGCEGGEMEDAFKFII 183
             CGSCWA  + +A  + I     G   S  LS Q ++ C  +G    CEGG+    + +  
Sbjct:    90 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGNAG---SCEGGDDLPVWAYA- 145

Query:   184 HNDGITTEANYPYQAVDGTCNKTNEAS--------HVA------KIKGYETVPANSEEAL 229
             H  GI  E    YQA D  C+K N+          HV       K+  Y +V +  E+ +
Sbjct:   146 HRHGIPDETCNNYQAKDQVCDKFNQCGTCTEFKECHVIQNYTLWKVGDYGSV-SGREKMM 204

Query:   230 LKAVANQPVAVSIDASGSAFQFYSSGVFTG-DCGTELDHGVTAVGYGATANGTKYWLVKN 288
              +  AN P++  I A+      Y+ G++        ++H V+  G+G +  GT+YW+V+N
Sbjct:   205 AEIYANGPISCGIMAT-EKMSNYTGGIYAEYKDQAYINHIVSVAGWGVSG-GTEYWIVRN 262

Query:   289 SWGTSWGEEGYIRM 302
             SWG  WGE G++R+
Sbjct:   263 SWGEPWGERGWMRI 276

 Score = 140 (54.3 bits), Expect = 4.9e-07, P = 4.9e-07
 Identities = 42/127 (33%), Positives = 60/127 (47%)

Query:    99 KYENVIDVPATMDWRK-NGA--VTPIKNQG-P--CGSCWAFSAVAA-TEGITQLTTGKLI 151
             +Y +  D+P + DWR  NG    +  +NQ  P  CGSCWA  + +A  + I     G   
Sbjct:    56 EYLSPSDLPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWP 115

Query:   152 S--LSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEA 209
             S  LS Q ++ C  +G    CEGG+    + +  H  GI  E    YQA D  C+K N+ 
Sbjct:   116 STLLSVQHVIDCGNAG---SCEGGDDLPVWAYA-HRHGIPDETCNNYQAKDQVCDKFNQC 171

Query:   210 SHVAKIK 216
                 + K
Sbjct:   172 GTCTEFK 178


>ZFIN|ZDB-GENE-040426-2650
            symbol:ctsba "cathepsin B, a" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0031101 "fin regeneration"
            evidence=IEP] [GO:0008233 "peptidase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 ZFIN:ZDB-GENE-040426-2650 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790
            GO:GO:0004197 GO:GO:0031101 MEROPS:C01.060 HOVERGEN:HBG003480
            PANTHER:PTHR12411:SF16 HSSP:P07688 EMBL:BC044517 IPI:IPI00485996
            UniGene:Dr.3374 ProteinModelPortal:Q803E4 SMR:Q803E4 STRING:Q803E4
            PRIDE:Q803E4 InParanoid:Q803E4 ArrayExpress:Q803E4 Bgee:Q803E4
            Uniprot:Q803E4
        Length = 330

 Score = 153 (58.9 bits), Expect = 5.0e-18, Sum P(2) = 5.0e-18
 Identities = 38/109 (34%), Positives = 51/109 (46%)

Query:   208 EASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTELD- 266
             E  H  K   Y +VP+N    + +   N PV  +       F  Y SGV+    G+ L  
Sbjct:   220 EDKHFGKTS-Y-SVPSNQNGIMAELFKNGPVEAAFTVYED-FLLYKSGVYQHMSGSALGG 276

Query:   267 HGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGI 315
             H +  +G+G   NG  YWL  NSW T WG+ GY ++ R  D     CGI
Sbjct:   277 HAIKILGWGEE-NGVPYWLAANSWNTDWGDNGYFKILRGEDH----CGI 320

 Score = 129 (50.5 bits), Expect = 5.0e-18, Sum P(2) = 5.0e-18
 Identities = 34/103 (33%), Positives = 54/103 (52%)

Query:    99 KYENVIDVPATMDWRKNGAVTP----IKNQGPCGSCWAFSAVAA-TEGITQLTTGKL-IS 152
             +Y   + +P   D R+     P    I++QG CGSCWAF A  A ++ +   +  K+ + 
Sbjct:    72 QYTEGLKLPKNFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIQSNAKVSVE 131

Query:   153 LSEQELVSC-DTSGVDHGCEGGEMEDAFKFIIHNDGITTEANY 194
             +S Q+L++C D+ G+  GC GG    A+ F    DG+ T   Y
Sbjct:   132 ISSQDLLTCCDSCGM--GCNGGYPSAAWDFWT-TDGLVTGGLY 171


>UNIPROTKB|F1PIF2
            symbol:CTSZ "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0060441 "epithelial tube branching involved
            in lung morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0005783 GO:GO:0005615 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GO:GO:0060441 GeneTree:ENSGT00560000076599 OMA:QCGTCTE
            EMBL:AAEX03014054 Ensembl:ENSCAFT00000019357 Uniprot:F1PIF2
        Length = 261

 Score = 216 (81.1 bits), Expect = 9.5e-18, P = 9.5e-18
 Identities = 58/193 (30%), Positives = 91/193 (47%)

Query:   127 CGSCWAFSAVAA-TEGITQLTTGKLIS--LSEQELVSCDTSGVDHGCEGGEMEDAFKFII 183
             CGSCWA  + +A  + I     G   S  LS Q ++ C  +G    CEGG     + +  
Sbjct:    47 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVLDCANAG---SCEGGNDLPVWSYA- 102

Query:   184 HNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVP-------------ANSEEALL 230
             H  GI  E    YQA D  CNK N+     + K    +              +  E+ + 
Sbjct:   103 HEHGIPDETCNNYQAKDQECNKFNQCGTCTEFKECHAIQNYTLWRVGDYGSLSGREKMMA 162

Query:   231 KAVANQPVAVSIDASGSAFQFYSSGVFTG-DCGTELDHGVTAVGYGATANGTKYWLVKNS 289
             +  AN P++  I A+      Y+ G+         ++H ++ VG+G + +GT+YW+V+NS
Sbjct:   163 EIYANGPISCGIMATEKMVN-YTGGIHAEYQEQAYINHVISVVGWGVS-DGTEYWIVRNS 220

Query:   290 WGTSWGEEGYIRM 302
             WG  WGE G++R+
Sbjct:   221 WGEPWGERGWMRI 233

 Score = 141 (54.7 bits), Expect = 2.3e-07, P = 2.3e-07
 Identities = 43/127 (33%), Positives = 59/127 (46%)

Query:    99 KYENVIDVPATMDWRK-NGA--VTPIKNQG-P--CGSCWAFSAVAA-TEGITQLTTGKLI 151
             +Y +  D+P + DWR  NG    +  +NQ  P  CGSCWA  + +A  + I     G   
Sbjct:    13 EYLSPSDLPKSWDWRNVNGVNYASATRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWP 72

Query:   152 S--LSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEA 209
             S  LS Q ++ C  +G    CEGG     + +  H  GI  E    YQA D  CNK N+ 
Sbjct:    73 STLLSVQHVLDCANAG---SCEGGNDLPVWSYA-HEHGIPDETCNNYQAKDQECNKFNQC 128

Query:   210 SHVAKIK 216
                 + K
Sbjct:   129 GTCTEFK 135


>RGD|621509
            symbol:Ctsb "cathepsin B" species:10116 "Rattus norvegicus"
            [GO:0004175 "endopeptidase activity" evidence=IMP;IDA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA;ISO;IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005576
            "extracellular region" evidence=IDA] [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0005730 "nucleolus" evidence=IEA;ISO]
            [GO:0005737 "cytoplasm" evidence=IDA] [GO:0005739 "mitochondrion"
            evidence=IEA;ISO;IDA] [GO:0005764 "lysosome" evidence=IEA;ISO;IDA]
            [GO:0006508 "proteolysis" evidence=IEA;IEP;ISO;IMP;IDA;TAS]
            [GO:0006914 "autophagy" evidence=IEP] [GO:0006950 "response to
            stress" evidence=IEP] [GO:0007283 "spermatogenesis" evidence=IEP]
            [GO:0007519 "skeletal muscle tissue development" evidence=IEP]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0009611
            "response to wounding" evidence=IEP] [GO:0009612 "response to
            mechanical stimulus" evidence=IEP] [GO:0009749 "response to glucose
            stimulus" evidence=IEP] [GO:0009897 "external side of plasma
            membrane" evidence=IDA] [GO:0009986 "cell surface" evidence=IDA]
            [GO:0014070 "response to organic cyclic compound" evidence=IEP]
            [GO:0014075 "response to amine stimulus" evidence=IEP] [GO:0016324
            "apical plasma membrane" evidence=IDA] [GO:0030984 "kininogen
            binding" evidence=IPI] [GO:0032403 "protein complex binding"
            evidence=IPI] [GO:0034097 "response to cytokine stimulus"
            evidence=IEP] [GO:0042277 "peptide binding" evidence=IDA]
            [GO:0042383 "sarcolemma" evidence=IDA] [GO:0042470 "melanosome"
            evidence=IEA] [GO:0043231 "intracellular membrane-bounded
            organelle" evidence=ISO] [GO:0043434 "response to peptide hormone
            stimulus" evidence=IEP] [GO:0043621 "protein self-association"
            evidence=IDA] [GO:0045471 "response to ethanol" evidence=IEP]
            [GO:0048471 "perinuclear region of cytoplasm" evidence=ISO;IDA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0060548 "negative regulation of cell death" evidence=IMP]
            [GO:0070670 "response to interleukin-4" evidence=IEP] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA;ISO]
            [GO:0005901 "caveola" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:621509 GO:GO:0005739 GO:GO:0042470
            GO:GO:0005615 GO:GO:0009612 GO:GO:0009611 GO:GO:0009897
            GO:GO:0045471 GO:GO:0016324 GO:GO:0009749 GO:GO:0006914
            GO:GO:0043434 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0007283
            GO:GO:0005764 GO:GO:0042383 GO:GO:0043621 GO:GO:0014070
            GO:GO:0042277 GO:GO:0060548 GO:GO:0014075 GO:GO:0004197
            GO:GO:0070670 GO:GO:0007519 MEROPS:C01.060 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 GO:GO:0097067
            PANTHER:PTHR12411:SF16 EMBL:X82396 EMBL:M11305 IPI:IPI00212811
            PIR:S51041 UniGene:Rn.100909 PDB:1CPJ PDB:1CTE PDB:1MIR PDB:1THE
            PDBsum:1CPJ PDBsum:1CTE PDBsum:1MIR PDBsum:1THE
            ProteinModelPortal:P00787 SMR:P00787 STRING:P00787 PRIDE:P00787
            UCSC:RGD:621509 InParanoid:P00787 SABIO-RK:P00787 BindingDB:P00787
            ChEMBL:CHEMBL2602 EvolutionaryTrace:P00787 ArrayExpress:P00787
            Genevestigator:P00787 GermOnline:ENSRNOG00000010331 Uniprot:P00787
        Length = 339

 Score = 151 (58.2 bits), Expect = 1.0e-17, Sum P(2) = 1.0e-17
 Identities = 40/121 (33%), Positives = 59/121 (48%)

Query:   203 CNKTNEASHVAKIK-----GYETVP-ANSEEALLKAV-ANQPVAVSIDASGSAFQFYSSG 255
             CNK  EA +    K     GY +   ++SE+ ++  +  N PV  +     S F  Y SG
Sbjct:   207 CNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVF-SDFLTYKSG 265

Query:   256 VFTGDCGTELD-HGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCG 314
             V+  + G  +  H +  +G+G   NG  YWLV NSW   WG+ G+ ++ R     E  CG
Sbjct:   266 VYKHEAGDVMGGHAIRILGWGIE-NGVPYWLVANSWNVDWGDNGFFKILRG----ENHCG 320

Query:   315 I 315
             I
Sbjct:   321 I 321

 Score = 129 (50.5 bits), Expect = 1.0e-17, Sum P(2) = 1.0e-17
 Identities = 29/84 (34%), Positives = 46/84 (54%)

Query:   104 IDVPATMD----WRKNGAVTPIKNQGPCGSCWAFSAVAA-TEGITQLTTGKL-ISLSEQE 157
             I++P + D    W     +  I++QG CGSCWAF AV A ++ I   T G++ + +S ++
Sbjct:    78 INLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAED 137

Query:   158 LVSCDTSGVDHGCEGGEMEDAFKF 181
             L++C       GC GG    A+ F
Sbjct:   138 LLTCCGIQCGDGCNGGYPSGAWNF 161


>UNIPROTKB|Q6IN22
            symbol:Ctsb "Cathepsin B" species:10116 "Rattus norvegicus"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 RGD:621509 GO:GO:0005739
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 CTD:1508
            GeneTree:ENSGT00560000076599 HOVERGEN:HBG003480 KO:K01363
            OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16 UniGene:Rn.100909
            EMBL:CH474023 HSSP:P00785 EMBL:BC072490 IPI:IPI00562653
            RefSeq:NP_072119.2 SMR:Q6IN22 IntAct:Q6IN22 STRING:Q6IN22
            Ensembl:ENSRNOT00000014177 GeneID:64529 KEGG:rno:64529
            InParanoid:Q6IN22 NextBio:613362 Genevestigator:Q6IN22
            Uniprot:Q6IN22
        Length = 339

 Score = 151 (58.2 bits), Expect = 1.0e-17, Sum P(2) = 1.0e-17
 Identities = 40/121 (33%), Positives = 59/121 (48%)

Query:   203 CNKTNEASHVAKIK-----GYETVP-ANSEEALLKAV-ANQPVAVSIDASGSAFQFYSSG 255
             CNK  EA +    K     GY +   ++SE+ ++  +  N PV  +     S F  Y SG
Sbjct:   207 CNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVF-SDFLTYKSG 265

Query:   256 VFTGDCGTELD-HGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCG 314
             V+  + G  +  H +  +G+G   NG  YWLV NSW   WG+ G+ ++ R     E  CG
Sbjct:   266 VYKHEAGDVMGGHAIRILGWGIE-NGVPYWLVANSWNVDWGDNGFFKILRG----ENHCG 320

Query:   315 I 315
             I
Sbjct:   321 I 321

 Score = 129 (50.5 bits), Expect = 1.0e-17, Sum P(2) = 1.0e-17
 Identities = 29/84 (34%), Positives = 46/84 (54%)

Query:   104 IDVPATMD----WRKNGAVTPIKNQGPCGSCWAFSAVAA-TEGITQLTTGKL-ISLSEQE 157
             I++P + D    W     +  I++QG CGSCWAF AV A ++ I   T G++ + +S ++
Sbjct:    78 INLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAED 137

Query:   158 LVSCDTSGVDHGCEGGEMEDAFKF 181
             L++C       GC GG    A+ F
Sbjct:   138 LLTCCGIQCGDGCNGGYPSGAWNF 161


>WB|WBGene00021072
            symbol:W07B8.4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:FO081739 PIR:T31728 RefSeq:NP_503382.1
            HSSP:P53634 ProteinModelPortal:O16288 SMR:O16288 STRING:O16288
            MEROPS:C01.A39 PaxDb:O16288 EnsemblMetazoa:W07B8.4 GeneID:178611
            KEGG:cel:CELE_W07B8.4 UCSC:W07B8.4 CTD:178611 WormBase:W07B8.4
            InParanoid:O16288 OMA:ESQYGCK NextBio:901836 Uniprot:O16288
        Length = 335

 Score = 163 (62.4 bits), Expect = 1.1e-17, Sum P(2) = 1.1e-17
 Identities = 36/93 (38%), Positives = 51/93 (54%)

Query:   224 NSEEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTELD-HGVTAVGYGATANGTK 282
             ++++   + +A+ PV V        F  Y +G++T   G EL  H V  +G+G   NGT 
Sbjct:   234 SAKQIQTEILAHGPVEVGFIVYED-FYLYKTGIYTHVAGGELGGHAVKMLGWGVD-NGTP 291

Query:   283 YWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGI 315
             YWL  NSW T WGE+GY R+ R +D     CGI
Sbjct:   292 YWLAANSWNTVWGEKGYFRILRGVDE----CGI 320

 Score = 114 (45.2 bits), Expect = 1.1e-17, Sum P(2) = 1.1e-17
 Identities = 30/105 (28%), Positives = 54/105 (51%)

Query:   101 ENVIDVPATMD----WRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLIS--LS 154
             E    +P + D    W +  +V  I++Q  CGSCWA +A  A    T + +   ++  LS
Sbjct:    68 ETADSIPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVNTLLS 127

Query:   155 EQELVSCDTSGVD--HGCEGGEMEDAFKFIIHNDGITTEANYPYQ 197
              +++++C T   +   GCEGG    A+++ + N G+ T  ++  Q
Sbjct:   128 AEDILTCCTGKFNCGDGCEGGYPIQAWRYWVKN-GLVTGGSFESQ 171


>UNIPROTKB|P07858 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9606 "Homo sapiens"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0005739 "mitochondrion" evidence=IEA] [GO:0042470 "melanosome"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042981 "regulation of apoptotic process" evidence=TAS]
            [GO:0006508 "proteolysis" evidence=IDA] [GO:0005764 "lysosome"
            evidence=IDA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEP] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IDA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IDA] [GO:0005622 "intracellular" evidence=TAS]
            [GO:0036021 "endolysosome lumen" evidence=TAS] [GO:0045087 "innate
            immune response" evidence=TAS] [GO:0008233 "peptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0005730 "nucleolus" evidence=IDA] [GO:0043231 "intracellular
            membrane-bounded organelle" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_118779 GO:GO:0005739
            GO:GO:0042470 GO:GO:0048471 Reactome:REACT_6900 GO:GO:0005615
            GO:GO:0009612 GO:GO:0009611 GO:GO:0005730 GO:GO:0042981
            GO:GO:0009897 GO:GO:0045471 GO:GO:0016324 GO:GO:0009749
            GO:GO:0006914 GO:GO:0043434 eggNOG:COG4870 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087
            GO:GO:0050790 GO:GO:0042383 GO:GO:0014070 GO:GO:0042277
            GO:GO:0060548 GO:GO:0005901 GO:GO:0014075 GO:GO:0004197
            GO:GO:0070670 EMBL:CH471157 GO:GO:0007519 MEROPS:C01.060 CTD:1508
            HOGENOM:HOG000241341 HOVERGEN:HBG003480 KO:K01363 OMA:GYPSGAW
            OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 GO:GO:0097067
            PANTHER:PTHR12411:SF16 EMBL:M14221 EMBL:L16510 EMBL:AK092070
            EMBL:AK075393 EMBL:BC010240 EMBL:BC095408 EMBL:M13230
            IPI:IPI00295741 PIR:A26498 RefSeq:NP_001899.1 RefSeq:NP_680090.1
            RefSeq:NP_680091.1 RefSeq:NP_680092.1 RefSeq:NP_680093.1
            UniGene:Hs.520898 PDB:1CSB PDB:1GMY PDB:1HUC PDB:1PBH PDB:2IPP
            PDB:2PBH PDB:3AI8 PDB:3CBJ PDB:3CBK PDB:3K9M PDB:3PBH PDBsum:1CSB
            PDBsum:1GMY PDBsum:1HUC PDBsum:1PBH PDBsum:2IPP PDBsum:2PBH
            PDBsum:3AI8 PDBsum:3CBJ PDBsum:3CBK PDBsum:3K9M PDBsum:3PBH
            ProteinModelPortal:P07858 SMR:P07858 DIP:DIP-42785N IntAct:P07858
            MINT:MINT-1397666 STRING:P07858 PhosphoSite:P07858 DMDM:68067549
            SWISS-2DPAGE:P07858 UCD-2DPAGE:P07858 PaxDb:P07858
            PeptideAtlas:P07858 PRIDE:P07858 DNASU:1508 Ensembl:ENST00000345125
            Ensembl:ENST00000353047 Ensembl:ENST00000434271
            Ensembl:ENST00000453527 Ensembl:ENST00000530640
            Ensembl:ENST00000531089 Ensembl:ENST00000533455
            Ensembl:ENST00000534510 GeneID:1508 KEGG:hsa:1508 UCSC:uc003wum.3
            GeneCards:GC08M011700 H-InvDB:HIX0007320 HGNC:HGNC:2527
            HPA:CAB000457 HPA:HPA018156 MIM:116810 neXtProt:NX_P07858
            PharmGKB:PA27027 InParanoid:P07858 PhylomeDB:P07858
            BindingDB:P07858 ChEMBL:CHEMBL4072 ChiTaRS:CTSB
            EvolutionaryTrace:P07858 GenomeRNAi:1508 NextBio:6235
            PMAP-CutDB:P07858 ArrayExpress:P07858 Bgee:P07858 CleanEx:HS_CTSB
            Genevestigator:P07858 GermOnline:ENSG00000164733 GO:GO:0036021
            Uniprot:P07858
        Length = 339

 Score = 151 (58.2 bits), Expect = 1.6e-17, Sum P(2) = 1.6e-17
 Identities = 37/102 (36%), Positives = 53/102 (51%)

Query:   217 GYETVP-ANSEEALLKAV-ANQPVAVSIDASGSAFQFYSSGVFTGDCGTELD-HGVTAVG 273
             GY +   +NSE+ ++  +  N PV  +     S F  Y SGV+    G  +  H +  +G
Sbjct:   226 GYNSYSVSNSEKDIMAEIYKNGPVEGAFSVY-SDFLLYKSGVYQHVTGEMMGGHAIRILG 284

Query:   274 YGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGI 315
             +G   NGT YWLV NSW T WG+ G+ ++ R  D     CGI
Sbjct:   285 WGVE-NGTPYWLVANSWNTDWGDNGFFKILRGQDH----CGI 321

 Score = 127 (49.8 bits), Expect = 1.6e-17, Sum P(2) = 1.6e-17
 Identities = 29/84 (34%), Positives = 45/84 (53%)

Query:   104 IDVPATMD----WRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISL--SEQE 157
             + +PA+ D    W +   +  I++QG CGSCWAF AV A      + T   +S+  S ++
Sbjct:    78 LKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAED 137

Query:   158 LVSCDTSGVDHGCEGGEMEDAFKF 181
             L++C  S    GC GG   +A+ F
Sbjct:   138 LLTCCGSMCGDGCNGGYPAEAWNF 161


>UNIPROTKB|F1N9D7 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9031 "Gallus gallus"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005730 "nucleolus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005739 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0005764
            GO:GO:0004197 GeneTree:ENSGT00560000076599 OMA:GYPSGAW
            GO:GO:0097067 PANTHER:PTHR12411:SF16 IPI:IPI00573387
            EMBL:AADN02018292 Ensembl:ENSGALT00000026896
            Ensembl:ENSGALT00000036723 Uniprot:F1N9D7
        Length = 340

 Score = 145 (56.1 bits), Expect = 1.7e-17, Sum P(2) = 1.7e-17
 Identities = 33/96 (34%), Positives = 48/96 (50%)

Query:   221 VPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTELD-HGVTAVGYGATAN 279
             VP + +E + +   N PV  +       F  Y SGV+    G ++  H +  +G+G   N
Sbjct:   233 VPRSEKEIMAEIYKNGPVEGAFIVYED-FLMYKSGVYQHVSGEQVGGHAIRILGWGVE-N 290

Query:   280 GTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGI 315
             GT YWL  NSW T WG+ G+ ++ R  D     CGI
Sbjct:   291 GTPYWLAANSWNTDWGDNGFFKILRGEDH----CGI 322

 Score = 134 (52.2 bits), Expect = 1.7e-17, Sum P(2) = 1.7e-17
 Identities = 31/84 (36%), Positives = 48/84 (57%)

Query:   104 IDVPATMDWRKNG----AVTPIKNQGPCGSCWAFSAVAA-TEGITQLTTGKL-ISLSEQE 157
             +D+P T D RK       ++ I++QG CGSCWAF AV A ++ I   T  K+ + +S ++
Sbjct:    78 MDLPDTFDSRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAED 137

Query:   158 LVSCDTSGVDHGCEGGEMEDAFKF 181
             L+SC       GC GG    A+++
Sbjct:   138 LLSCCGFECGMGCNGGYPSGAWRY 161


>ZFIN|ZDB-GENE-070323-1 [details] [associations]
            symbol:ctsbb "cathepsin B, b" species:7955 "Danio
            rerio" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-070323-1 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197
            GeneTree:ENSGT00560000076599 PANTHER:PTHR12411:SF16 OMA:CCGFLCG
            EMBL:CU207296 EMBL:CABZ01037785 IPI:IPI00877452
            Ensembl:ENSDART00000097263 Bgee:F1QZT5 Uniprot:F1QZT5
        Length = 326

 Score = 154 (59.3 bits), Expect = 2.4e-17, Sum P(2) = 2.4e-17
 Identities = 34/96 (35%), Positives = 50/96 (52%)

Query:   221 VPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTELD-HGVTAVGYGATAN 279
             VP++ ++ + +   N PV  +       F  Y SGV+    G+ L  H V  +G+G   N
Sbjct:   226 VPSDQQQIMTELYTNGPVEAAFTVYED-FPLYKSGVYQHLTGSALGGHAVKILGWGEE-N 283

Query:   280 GTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGI 315
             GT +WLV NSW + WG+ GY ++ R  D     CGI
Sbjct:   284 GTPFWLVANSWNSDWGDNGYFKILRGHDE----CGI 315

 Score = 121 (47.7 bits), Expect = 2.4e-17, Sum P(2) = 2.4e-17
 Identities = 45/153 (29%), Positives = 70/153 (45%)

Query:    49 EFIESLNAAGNKPYKLSINEFADQTNQEFKAF----RNGYRRPDGLTSRKGTSFKYENVI 104
             E I  +NAA    +   +N F +   +  K+       G R P   T +  T+ K  +  
Sbjct:    24 EMISFINAA-RSTWTAGVN-FDNVPKKYLKSLCGTVLKGPRLPH--TVKHSTNVKLPDSF 79

Query:   105 DVPATMDWRKNGAVTPIKNQGPCGSCWAFSAVAA-TEGITQLTTGKLI-SLSEQELVSC- 161
             D+     W     +  I++QG CGSCWAF AV + ++ I   + GK    +S ++L+SC 
Sbjct:    80 DLRD--QWPNCKTLNQIRDQGSCGSCWAFGAVESISDRICIHSKGKQSPEISAEDLLSCC 137

Query:   162 DTSGVDHGCEGGEMEDAFKFIIHNDGITTEANY 194
             D  G   GC GG   +A+ +     G+ T   Y
Sbjct:   138 DQCGF--GCSGGFPAEAWDYW-RRSGLVTGGLY 167


>FB|FBgn0033873 [details] [associations]
            symbol:CG6337 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 EMBL:AE013599
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 HSSP:P80067 EMBL:AY084123
            RefSeq:NP_610905.1 UniGene:Dm.5230 SMR:Q7JYA0 IntAct:Q7JYA0
            EnsemblMetazoa:FBtr0087646 GeneID:36530 KEGG:dme:Dmel_CG6337
            UCSC:CG6337-RA FlyBase:FBgn0033873 eggNOG:NOG310593
            InParanoid:Q7JYA0 OMA:NRTTYRE OrthoDB:EOG4MCVFZ GenomeRNAi:36530
            NextBio:799041 Uniprot:Q7JYA0
        Length = 340

 Score = 223 (83.6 bits), Expect = 4.0e-17, P = 4.0e-17
 Identities = 78/302 (25%), Positives = 127/302 (42%)

Query:    21 EQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNK---PYKLSINEFADQTNQEF 77
             + +   + K Y +   +      F  N   +   NA  ++    Y+ ++N+F+D    +F
Sbjct:    29 QTYEDNFNKTYASTSARNFANYYFIYNRNQVAQHNAQADRNRTTYREAVNQFSDIRLIQF 88

Query:    78 KAFR-NGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGAVTPIKNQGP-CGSCWAFSA 135
              A             S    S       D+    D+   G    +++QG  C S WA++ 
Sbjct:    89 AALLPKAVNTVTSAASDPPASQAASASFDI--ITDF---GLTVAVEDQGVNCSSSWAYAT 143

Query:   136 VAATEGITQLTTGKLI--SLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIH-NDG-ITTE 191
               A E +  + T   +  SLS Q+L+ C  +G+  GC       A  ++    D  +  E
Sbjct:   144 AKAVEIMNAVQTANPLPSSLSAQQLLDC--AGMGTGCSTQTPLAALNYLTQLTDAYLYPE 201

Query:   192 ANYPYQ---AVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQ-PVAVSIDASGS 247
              +YP        G C   +  S   K+ GY TV  N + A+++ V+N  PV V  + +  
Sbjct:   202 VDYPNNNSLKTPGMCQPPSSVSVGVKLAGYSTVADNDDAAVMRYVSNGFPVIVEYNPATF 261

Query:   248 AFQFYSSGVFTGDC----GTELDHGVTAVGYGATANGT-KYWLVKNSWGTSWGEEGYIRM 302
              F  YSSGV+  +       +    +  VGY    +    YW   NS+G +WGEEGYIR+
Sbjct:   262 GFMQYSSGVYVQETRALTNPKSSQFLVVVGYDHDVDSNLDYWRCLNSFGDTWGEEGYIRI 321

Query:   303 KR 304
              R
Sbjct:   322 VR 323


>WB|WBGene00000789 [details] [associations]
            symbol:cpz-2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GeneTree:ENSGT00560000076599 KO:K08568 EMBL:Z81103
            HSSP:P80067 PIR:T23720 RefSeq:NP_506318.1 ProteinModelPortal:P92005
            SMR:P92005 STRING:P92005 MEROPS:C01.A41 PaxDb:P92005
            EnsemblMetazoa:M04G12.2 GeneID:179818 KEGG:cel:CELE_M04G12.2
            UCSC:M04G12.2 CTD:179818 WormBase:M04G12.2 eggNOG:NOG275763
            InParanoid:P92005 OMA:VEYWIAR NextBio:906990 Uniprot:P92005
        Length = 467

 Score = 227 (85.0 bits), Expect = 5.0e-17, P = 5.0e-17
 Identities = 59/204 (28%), Positives = 97/204 (47%)

Query:   119 TPIKNQG-P--CGSCWAFSAVAA-TEGITQLTTGK--LISLSEQELVSCDTSGVDHGCEG 172
             +P +NQ  P  CGSCW F    A  +       G+  +  LS QE++ C+  G    C+G
Sbjct:   237 SPTRNQHIPVYCGSCWVFGTTGALNDRFNVARKGRWPMTQLSPQEIIDCNGKG---NCQG 293

Query:   173 GEMEDAFKFIIHNDGITTEANYPYQAVDGTCNK--------TNEASHVAK-----IKGYE 219
             GE+ +  +      G+  E    Y+A +G CN          NE   +       +K Y 
Sbjct:   294 GEIGNVLEHA-KIQGLVEEGCNVYRATNGECNPYHRCGSCWPNECFSLTNYTRYYVKDYG 352

Query:   220 TVPANSEEALLKAVANQPVAVSIDASGSAFQF-YSSGVFTGDCGTELDHGVTAVGYGATA 278
              V    ++ + +     P+A +I A+   F++ Y  GV++     E +H ++  G+G   
Sbjct:   353 QVQGR-DKIMSEIKKGGPIACAIGAT-KKFEYEYVKGVYSEKSDLESNHIISLTGWGVDE 410

Query:   279 NGTKYWLVKNSWGTSWGEEGYIRM 302
             NG +YW+ +NSWG +WGE G+ R+
Sbjct:   411 NGVEYWIARNSWGEAWGELGWFRV 434

 Score = 137 (53.3 bits), Expect = 2.6e-06, P = 2.6e-06
 Identities = 39/120 (32%), Positives = 57/120 (47%)

Query:    94 KGTSFKYENVIDVPATMDWRKNGAV---TPIKNQG-P--CGSCWAFSAVAA-TEGITQLT 146
             + +SFK  N  D+P   DWR    V   +P +NQ  P  CGSCW F    A  +      
Sbjct:   212 ESSSFK-SN--DLPTGWDWRNVSGVNYCSPTRNQHIPVYCGSCWVFGTTGALNDRFNVAR 268

Query:   147 TGK--LISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCN 204
              G+  +  LS QE++ C+  G    C+GGE+ +  +      G+  E    Y+A +G CN
Sbjct:   269 KGRWPMTQLSPQEIIDCNGKG---NCQGGEIGNVLEHA-KIQGLVEEGCNVYRATNGECN 324


>UNIPROTKB|P43233 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9031 "Gallus gallus"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 MEROPS:C01.060 CTD:1508
            HOGENOM:HOG000241341 HOVERGEN:HBG003480 KO:K01363 OrthoDB:EOG4K6G4C
            PANTHER:PTHR12411:SF16 EMBL:U18083 IPI:IPI00573387 PIR:S58770
            RefSeq:NP_990702.1 UniGene:Gga.3854 ProteinModelPortal:P43233
            SMR:P43233 STRING:P43233 PRIDE:P43233 GeneID:396329 KEGG:gga:396329
            InParanoid:P43233 NextBio:20816377 Uniprot:P43233
        Length = 340

 Score = 139 (54.0 bits), Expect = 7.0e-17, Sum P(2) = 7.0e-17
 Identities = 33/96 (34%), Positives = 47/96 (48%)

Query:   221 VPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTELD-HGVTAVGYGATAN 279
             VP + +E + +   N PV  +       F  Y SGV+    G ++  H +  +G+G   N
Sbjct:   233 VPRSEKEIMAEIYKNGPVEGAFIVYED-FLMYKSGVYQHVSGEQVGGHAIRILGWGVE-N 290

Query:   280 GTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGI 315
             GT YWL  NSW T WG  G+ ++ R  D     CGI
Sbjct:   291 GTPYWLAANSWNTDWGITGFFKILRGEDH----CGI 322

 Score = 135 (52.6 bits), Expect = 7.0e-17, Sum P(2) = 7.0e-17
 Identities = 31/84 (36%), Positives = 48/84 (57%)

Query:   104 IDVPATMDWRKNG----AVTPIKNQGPCGSCWAFSAVAA-TEGITQLTTGKL-ISLSEQE 157
             +D+P T D RK       ++ I++QG CGSCWAF AV A ++ I   T  K+ + +S ++
Sbjct:    78 MDLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAED 137

Query:   158 LVSCDTSGVDHGCEGGEMEDAFKF 181
             L+SC       GC GG    A+++
Sbjct:   138 LLSCCGFECGMGCNGGYPSGAWRY 161


>UNIPROTKB|E2R6Q7 [details] [associations]
            symbol:CTSB "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730 "nucleolus"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790
            GO:GO:0005764 GO:GO:0004197 CTD:1508 GeneTree:ENSGT00560000076599
            KO:K01363 OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16
            EMBL:AAEX03014318 RefSeq:XP_543203.3 Ensembl:ENSCAFT00000012692
            GeneID:486077 KEGG:cfa:486077 NextBio:20859923 Uniprot:E2R6Q7
        Length = 339

 Score = 147 (56.8 bits), Expect = 1.0e-16, Sum P(2) = 1.0e-16
 Identities = 35/97 (36%), Positives = 49/97 (50%)

Query:   220 TVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTELD-HGVTAVGYGATA 278
             +V  N +E + +   N PV  +     S F  Y SGV+    G  +  H V  +G+G   
Sbjct:   231 SVSDNEKEIMAEIYKNGPVEAAFTVY-SDFLLYKSGVYQHVTGEMMGGHAVRILGWGVE- 288

Query:   279 NGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGI 315
             +GT YWLV NSW T WG+ G+ ++ R  D     CGI
Sbjct:   289 DGTPYWLVGNSWNTDWGDNGFFKILRGRDH----CGI 321

 Score = 124 (48.7 bits), Expect = 1.0e-16, Sum P(2) = 1.0e-16
 Identities = 29/87 (33%), Positives = 48/87 (55%)

Query:   101 ENVIDVPATMD----WRKNGAVTPIKNQGPCGSCWAFSAVAA-TEGITQLTTGKL-ISLS 154
             +N+I +P + D    W     +  I++QG CGSCWAF AV A ++ I   T G + + +S
Sbjct:    76 KNLI-LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEVS 134

Query:   155 EQELVSCDTSGVDHGCEGGEMEDAFKF 181
              +++++C       GC GG   +A+ F
Sbjct:   135 AEDMLTCCGDQCGDGCNGGFPAEAWNF 161


>UNIPROTKB|P07688 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9913 "Bos taurus"
            [GO:0042470 "melanosome" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730
            "nucleolus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0042470
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 EMBL:L06075 EMBL:M64620
            EMBL:U16336 EMBL:U16337 EMBL:U16338 EMBL:U16339 EMBL:U16341
            EMBL:U16342 EMBL:U16343 EMBL:BC102997 IPI:IPI00692061 PIR:S38328
            RefSeq:NP_776456.1 UniGene:Bt.393 PDB:1ITO PDB:1QDQ PDB:1SP4
            PDB:2DC6 PDB:2DC7 PDB:2DC8 PDB:2DC9 PDB:2DCA PDB:2DCB PDB:2DCC
            PDB:2DCD PDBsum:1ITO PDBsum:1QDQ PDBsum:1SP4 PDBsum:2DC6
            PDBsum:2DC7 PDBsum:2DC8 PDBsum:2DC9 PDBsum:2DCA PDBsum:2DCB
            PDBsum:2DCC PDBsum:2DCD ProteinModelPortal:P07688 SMR:P07688
            STRING:P07688 MEROPS:C01.060 PRIDE:P07688
            Ensembl:ENSBTAT00000036795 GeneID:281105 KEGG:bta:281105 CTD:1508
            eggNOG:NOG315657 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 InParanoid:P07688 KO:K01363 OMA:GYPSGAW
            OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 BindingDB:P07688
            ChEMBL:CHEMBL2323 EvolutionaryTrace:P07688 NextBio:20805177
            ArrayExpress:P07688 GO:GO:0097067 PANTHER:PTHR12411:SF16
            Uniprot:P07688
        Length = 335

 Score = 148 (57.2 bits), Expect = 2.5e-16, Sum P(2) = 2.5e-16
 Identities = 35/97 (36%), Positives = 49/97 (50%)

Query:   220 TVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTELD-HGVTAVGYGATA 278
             +V  N +E + +   N PV  +     S F  Y SGV+    G  +  H +  +G+G   
Sbjct:   231 SVANNEKEIMAEIYKNGPVEGAFSVY-SDFLLYKSGVYQHVSGEIMGGHAIRILGWGVE- 288

Query:   279 NGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGI 315
             NGT YWLV NSW T WG+ G+ ++ R  D     CGI
Sbjct:   289 NGTPYWLVGNSWNTDWGDNGFFKILRGQDH----CGI 321

 Score = 119 (46.9 bits), Expect = 2.5e-16, Sum P(2) = 2.5e-16
 Identities = 25/76 (32%), Positives = 41/76 (53%)

Query:   108 ATMDWRKNGAVTPIKNQGPCGSCWAFSAVAA-TEGITQLTTGKL-ISLSEQELVSCDTSG 165
             A   W     +  I++QG CGSCWAF AV A ++ I   + G++ + +S +++++C    
Sbjct:    86 AREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCGGE 145

Query:   166 VDHGCEGGEMEDAFKF 181
                GC GG    A+ F
Sbjct:   146 CGDGCNGGFPSGAWNF 161


>TAIR|locus:2204873 [details] [associations]
            symbol:AT1G02300 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002684 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 KO:K01363
            PANTHER:PTHR12411:SF16 OMA:ADDINAC IPI:IPI00534431
            RefSeq:NP_563647.1 UniGene:At.43952 ProteinModelPortal:F4HVZ1
            SMR:F4HVZ1 MEROPS:C01.A10 EnsemblPlants:AT1G02300.1 GeneID:839576
            KEGG:ath:AT1G02300 ArrayExpress:F4HVZ1 Uniprot:F4HVZ1
        Length = 379

 Score = 165 (63.1 bits), Expect = 3.5e-16, Sum P(2) = 3.5e-16
 Identities = 39/130 (30%), Positives = 62/130 (47%)

Query:   191 EANYPYQAVDGTCNKTN----EASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASG 246
             E  YP    +  C   N    E+ H   +  Y   P + ++ + +   N PV V+     
Sbjct:   228 EPTYPTPKCERKCVSRNQLWGESKHYG-VGAYRINP-DPQDIMAEVYKNGPVEVAFTVYE 285

Query:   247 SAFQFYSSGVFTGDCGTELD-HGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRD 305
               F  Y SGV+    GT++  H V  +G+G + +G  YWL+ N W  SWG++GY +++R 
Sbjct:   286 D-FAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRG 344

Query:   306 IDAKEGLCGI 315
              +     CGI
Sbjct:   345 TNE----CGI 350

 Score = 100 (40.3 bits), Expect = 3.5e-16, Sum P(2) = 3.5e-16
 Identities = 23/72 (31%), Positives = 36/72 (50%)

Query:   125 GPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIH 184
             G CGSCWAF AV +      +     +SLS  ++++C       GC GG    A+ +  +
Sbjct:   146 GHCGSCWAFGAVESLSDRFCIKYNLNVSLSANDVIACCGLLCGFGCNGGFPMGAWLYFKY 205

Query:   185 NDGITTEANYPY 196
             +  +T E + PY
Sbjct:   206 HGVVTQECD-PY 216


>DICTYBASE|DDB_G0286055 [details] [associations]
            symbol:DDB_G0286055 "peptidase C1A family protein"
            species:44689 "Dictyostelium discoideum" [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 dictyBase:DDB_G0286055 Pfam:PF00188 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411 EMBL:AAFI02000085
            Gene3D:3.40.33.10 InterPro:IPR001283 InterPro:IPR014044
            PRINTS:PR00837 SMART:SM00198 SUPFAM:SSF55797
            ProtClustDB:CLSZ2429919 RefSeq:XP_637918.1
            ProteinModelPortal:Q54MB6 EnsemblProtists:DDB0186794 GeneID:8625429
            KEGG:ddi:DDB_G0286055 InParanoid:Q54MB6 OMA:GENGFAR Uniprot:Q54MB6
        Length = 435

 Score = 220 (82.5 bits), Expect = 3.7e-16, P = 3.7e-16
 Identities = 73/283 (25%), Positives = 117/283 (41%)

Query:    57 AGNKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNG 116
             +G  PY  +  + AD T   ++ + N     +    R+     Y   +    + DWR NG
Sbjct:   161 SGIPPY--TARQHADLTTMSYEEWPNKIVNLNQRLVRRDDDHIYTASVPTDGSFDWRDNG 218

Query:   117 AVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLISLSEQELVSCDT------SGVDHG- 169
              V   K+   C S WAF+A    E  + + T      S Q+L+ C        S    G 
Sbjct:   219 VVGFPKDSSNCASGWAFTAAGIFESRSAMRTRHRYDYSAQQLIDCINVCIIIFSNFSIGN 278

Query:   170 ---CE--GGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIKGYETVPAN 224
                C    GE+  A  +     G+   + YPY          N++S   +    E     
Sbjct:   279 YTKCSRFSGELNKALMYA-QAYGLQATSTYPYVGASSIGCSYNQSSIAVEGGDVEYSQVG 337

Query:   225 SEEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTEL------DHGVTAVGYGATA 278
              +  + K     PV V I  +   F +Y+ G+F  +C   L      +H V  VGY    
Sbjct:   338 RDSIVEKCRKQGPVGVGIYVTNE-FLYYAGGIF--ECNNTLIDNANINHNVLLVGYNEKD 394

Query:   279 NGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMDSSY 321
             N   Y+++KN++G +WGE G+ R+  D++ K+  C IA + +Y
Sbjct:   395 N---YYIIKNNFGRTWGENGFARITADVN-KD--CLIAKNPAY 431


>UNIPROTKB|A1E295 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9823 "Sus scrofa"
            [GO:0005615 "extracellular space" evidence=IDA] [GO:0042470
            "melanosome" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730
            "nucleolus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0042470
            GO:GO:0005615 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 MEROPS:C01.060 CTD:1508
            GeneTree:ENSGT00560000076599 HOVERGEN:HBG003480 KO:K01363
            OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16 EMBL:EF095956
            RefSeq:NP_001090927.1 UniGene:Ssc.53773 ProteinModelPortal:A1E295
            SMR:A1E295 PRIDE:A1E295 Ensembl:ENSSSCT00000026923 GeneID:100037961
            KEGG:ssc:100037961 Uniprot:A1E295
        Length = 335

 Score = 144 (55.7 bits), Expect = 5.8e-16, Sum P(2) = 5.8e-16
 Identities = 33/97 (34%), Positives = 48/97 (49%)

Query:   220 TVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTELD-HGVTAVGYGATA 278
             ++  N +E + +   N PV  +        Q Y SGV+    G  +  H +  +G+G   
Sbjct:   231 SISRNEKEIMAEIYKNGPVEGAFTVYSDFLQ-YKSGVYQHVTGDLMGGHAIRILGWGVE- 288

Query:   279 NGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGI 315
             NGT YWLV NSW T WG+ G+ ++ R  D     CGI
Sbjct:   289 NGTPYWLVGNSWNTDWGDNGFFKILRGQDH----CGI 321

 Score = 120 (47.3 bits), Expect = 5.8e-16, Sum P(2) = 5.8e-16
 Identities = 25/76 (32%), Positives = 41/76 (53%)

Query:   108 ATMDWRKNGAVTPIKNQGPCGSCWAFSAVAA-TEGITQLTTGKL-ISLSEQELVSCDTSG 165
             A   W     +  I++QG CGSCWAF AV A ++ I   + G++ + +S +++++C    
Sbjct:    86 AREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTCCGDE 145

Query:   166 VDHGCEGGEMEDAFKF 181
                GC GG    A+ F
Sbjct:   146 CGDGCNGGFPSGAWNF 161


>UNIPROTKB|P05689 [details] [associations]
            symbol:CTSZ "Cathepsin Z" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:BC122603
            EMBL:X01809 IPI:IPI00708474 PIR:A29172 RefSeq:NP_001071303.1
            UniGene:Bt.4902 ProteinModelPortal:P05689 SMR:P05689 MEROPS:C01.013
            PRIDE:P05689 GeneID:404187 KEGG:bta:404187 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 InParanoid:P05689 KO:K08568
            OrthoDB:EOG42Z4QN BRENDA:3.4.18.1 NextBio:20817615 Uniprot:P05689
        Length = 304

 Score = 210 (79.0 bits), Expect = 9.2e-16, P = 9.2e-16
 Identities = 58/194 (29%), Positives = 96/194 (49%)

Query:   127 CGSCWAFSAVAA-TEGITQLTTGKLIS--LSEQELVSCDTSGVDHGCEGGEMEDAFKFII 183
             CGSCWA  + +A  + I     G   S  LS Q ++ C  +G    CEGG     +++  
Sbjct:    90 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGDAG---SCEGGNDLPVWEYA- 145

Query:   184 HNDGITTEANYPYQAVDGTCNKTNEAS--------HVAK------IKGYETVPANSEEAL 229
             H  GI  E    YQA D  C+K N+          HV K      +  Y ++ +  E+ +
Sbjct:   146 HRHGIPDETCNNYQAKDQECDKFNQCGTCTEFKECHVIKNYTLWKVGDYGSL-SGREKMM 204

Query:   230 LKAVANQPVAVSIDASGSAFQFYSSGVFTG-DCGTELDHGVTAVGYGATANGTKYWLVKN 288
              +   N P++  I A+      Y+ G+++  +    ++H V+  G+G + +G +YW+V+N
Sbjct:   205 AEIYTNGPISCGIMAT-EKMSNYTGGIYSEYNDQAFINHIVSVAGWGVS-DGMEYWIVRN 262

Query:   289 SWGTSWGEEGYIRM 302
             SWG  WGE G++R+
Sbjct:   263 SWGEPWGEHGWMRI 276

 Score = 144 (55.7 bits), Expect = 1.7e-07, P = 1.7e-07
 Identities = 51/162 (31%), Positives = 73/162 (45%)

Query:    70 ADQTNQEFKAFRNGYR--RPDGLTSRKGTSF----KYENVIDVPATMDWRK-NGA--VTP 120
             A +    F+  R  YR  R D LT     ++    +Y +  D+P + DWR  NG    + 
Sbjct:    21 AARAGLHFRPGRGCYRPLRGDRLTQLGRRTYPRPHEYLSPSDLPKSWDWRNVNGVNYASV 80

Query:   121 IKNQG-P--CGSCWAFSAVAA-TEGITQLTTGKLIS--LSEQELVSCDTSGVDHGCEGGE 174
              +NQ  P  CGSCWA  + +A  + I     G   S  LS Q ++ C  +G    CEGG 
Sbjct:    81 TRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGDAG---SCEGGN 137

Query:   175 MEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIK 216
                 +++  H  GI  E    YQA D  C+K N+     + K
Sbjct:   138 DLPVWEYA-HRHGIPDETCNNYQAKDQECDKFNQCGTCTEFK 178


>WB|WBGene00010204 [details] [associations]
            symbol:F57F5.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0006898
            "receptor-mediated endocytosis" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0009792 GO:GO:0006898
            GO:GO:0040007 GO:GO:0002119 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0040011
            EMBL:Z75953 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            PANTHER:PTHR12411:SF16 RefSeq:NP_506011.2 ProteinModelPortal:Q20950
            SMR:Q20950 DIP:DIP-24447N IntAct:Q20950 MINT:MINT-211137
            STRING:Q20950 MEROPS:C01.A42 EnsemblMetazoa:F57F5.1 GeneID:179645
            KEGG:cel:CELE_F57F5.1 UCSC:F57F5.1 CTD:179645 WormBase:F57F5.1
            OMA:ADDINAC Uniprot:Q20950
        Length = 351

 Score = 153 (58.9 bits), Expect = 1.3e-15, Sum P(2) = 1.3e-15
 Identities = 40/126 (31%), Positives = 59/126 (46%)

Query:   194 YPYQAVDGTCNKTNEASHVAKIKGYETVPANSEEA--LLKAVANQ-PVAVSIDASGSAFQ 250
             YP    + +C      ++   +   ++  A S++A  + K +    PV V+       F+
Sbjct:   221 YPTDKCERSCQAGYALTYQQDLHFGQSAYAVSKKAAEIQKEIMTHGPVEVAFTVYED-FE 279

Query:   251 FYSSGVFTGDCGTELD-HGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAK 309
              YS GV+    G  L  H V  +G+G   NGT YWL  NSW   WGE GY R+ R ++  
Sbjct:   280 HYSGGVYVHTAGASLGGHAVKMLGWGVD-NGTPYWLCANSWNEDWGENGYFRIIRGVNE- 337

Query:   310 EGLCGI 315
                CGI
Sbjct:   338 ---CGI 340

 Score = 107 (42.7 bits), Expect = 1.3e-15, Sum P(2) = 1.3e-15
 Identities = 27/95 (28%), Positives = 48/95 (50%)

Query:   106 VPATMD----WRKNGAVTPIKNQGPCGSCWAFSAVAA-TEGITQLTTGK-LISLSEQELV 159
             VP + D    W    +++ I++Q  CGSCWA SA    ++ I   +  K ++S+S  ++ 
Sbjct:    97 VPDSFDSRTAWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNAKTILSISADDIN 156

Query:   160 SCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANY 194
             +C      +GC GG   +A++  +   G  T  +Y
Sbjct:   157 ACCGMVCGNGCNGGYPIEAWRHYVKK-GYVTGGSY 190


>WB|WBGene00013072
            symbol:Y51A2D.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            EMBL:AL021497 RefSeq:NP_001256811.1 ProteinModelPortal:O62484
            SMR:O62484 MEROPS:C01.A37 EnsemblMetazoa:Y51A2D.1 GeneID:180204
            KEGG:cel:CELE_Y51A2D.1 UCSC:Y51A2D.1 CTD:180204 WormBase:Y51A2D.1a
            HOGENOM:HOG000019851 NextBio:908416 Uniprot:O62484
        Length = 314

 Score = 144 (55.7 bits), Expect = 1.8e-15, Sum P(2) = 1.8e-15
 Identities = 41/114 (35%), Positives = 59/114 (51%)

Query:   222 PANSEEALLKAVAN--QPVAVSIDASGSAFQFYSSGVF-TGDC---GTELDHGVTAVGYG 275
             P N+E  +++ +     PVAV   A+G+AF  Y SGV  T DC   GT + H    VGYG
Sbjct:   201 PENAESEIIEILNTWKTPVAVYF-AAGTAFLQYKSGVLVTEDCDLAGT-VWHAGAIVGYG 258

Query:   276 AT----ANGTKYWLVKNSWGTS-WGEEGYIRMKRDID---AKEGLCGIAMDSSY 321
                       ++W++KNSWG S WG  GY+++ R  +    + G  G  M+  Y
Sbjct:   259 EENDLRGRSQRFWIMKNSWGVSGWGTGGYVKLIRGKNWCGIERGAIGANMEEHY 312

 Score = 114 (45.2 bits), Expect = 1.8e-15, Sum P(2) = 1.8e-15
 Identities = 37/147 (25%), Positives = 62/147 (42%)

Query:    22 QWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLS---INEFADQTNQEF- 77
             ++  K+ + YK+  E + R + F  +   +  LN    K  + S   +N+F+D T  E  
Sbjct:    46 EFKKKFSRTYKSEAENQLRLQNFVKSRNNVVRLNKNAQKAGRNSNFAVNQFSDLTTSELH 105

Query:    78 -------------KAFRNGYRRPDGLTSRKGTSFKYENVIDVPATMDWRKNGA--VTPIK 122
                            F   +++  G T  K  + ++    D+ +    + NG   V PIK
Sbjct:   106 QRLSRFPPNLTENSVFHKNFKKLLGKTRTKRQNSEFARNFDLRSQ---KVNGRYIVGPIK 162

Query:   123 NQGPCGSCWAFSAVAATEGITQLTTGK 149
             NQG C  CW F+  A  E I  +  G+
Sbjct:   163 NQGQCACCWGFAVTAMLETIYAVNVGR 189


>UNIPROTKB|F1MW68
            symbol:CTSZ "Cathepsin Z" species:9913 "Bos taurus"
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0060441
            GeneTree:ENSGT00560000076599 IPI:IPI00708474 UniGene:Bt.4902
            OMA:QCGTCTE EMBL:DAAA02036315 PRIDE:F1MW68
            Ensembl:ENSBTAT00000025007 Uniprot:F1MW68
        Length = 304

 Score = 208 (78.3 bits), Expect = 1.9e-15, P = 1.9e-15
 Identities = 58/194 (29%), Positives = 96/194 (49%)

Query:   127 CGSCWAFSAVAA-TEGITQLTTGKLIS--LSEQELVSCDTSGVDHGCEGGEMEDAFKFII 183
             CGSCWA  + +A  + I     G   S  LS Q ++ C  +G    CEGG     +++  
Sbjct:    90 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVLDCGDAG---SCEGGNDLPVWEYA- 145

Query:   184 HNDGITTEANYPYQAVDGTCNKTNEAS--------HVAK------IKGYETVPANSEEAL 229
             H  GI  E    YQA D  C+K N+          HV K      +  Y ++ +  E+ +
Sbjct:   146 HRHGIPDETCNNYQAKDQECDKFNQCGTCTEFKECHVIKNYTLWKVGDYGSL-SGREKMM 204

Query:   230 LKAVANQPVAVSIDASGSAFQFYSSGVFTG-DCGTELDHGVTAVGYGATANGTKYWLVKN 288
              +   N P++  I A+      Y+ G+++  +    ++H V+  G+G + +G +YW+V+N
Sbjct:   205 AEIYTNGPISCGIMAT-EKMSNYTGGIYSEYNDQAFINHIVSVAGWGVS-DGMEYWIVRN 262

Query:   289 SWGTSWGEEGYIRM 302
             SWG  WGE G++R+
Sbjct:   263 SWGEPWGEHGWMRI 276

 Score = 142 (55.0 bits), Expect = 2.9e-07, P = 2.9e-07
 Identities = 51/162 (31%), Positives = 73/162 (45%)

Query:    70 ADQTNQEFKAFRNGYR--RPDGLTSRKGTSF----KYENVIDVPATMDWRK-NGA--VTP 120
             A +    F+  R  YR  R D LT     ++    +Y +  D+P + DWR  NG    + 
Sbjct:    21 AARAGLHFRPGRGCYRPLRGDRLTQLGRRTYPRPHEYLSPSDLPKSWDWRNVNGVNYASV 80

Query:   121 IKNQG-P--CGSCWAFSAVAA-TEGITQLTTGKLIS--LSEQELVSCDTSGVDHGCEGGE 174
              +NQ  P  CGSCWA  + +A  + I     G   S  LS Q ++ C  +G    CEGG 
Sbjct:    81 TRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVLDCGDAG---SCEGGN 137

Query:   175 MEDAFKFIIHNDGITTEANYPYQAVDGTCNKTNEASHVAKIK 216
                 +++  H  GI  E    YQA D  C+K N+     + K
Sbjct:   138 DLPVWEYA-HRHGIPDETCNNYQAKDQECDKFNQCGTCTEFK 178


>ZFIN|ZDB-GENE-041010-139
            symbol:ctsz "cathepsin Z" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0001525 "angiogenesis"
            evidence=IMP] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 ZFIN:ZDB-GENE-041010-139 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0001525
            CTD:1522 HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568
            OrthoDB:EOG42Z4QN UniGene:Dr.935 eggNOG:NOG275763 EMBL:BC083369
            IPI:IPI00483065 RefSeq:NP_001006043.1 ProteinModelPortal:Q5XJD4
            SMR:Q5XJD4 STRING:Q5XJD4 GeneID:450022 KEGG:dre:450022
            InParanoid:Q5XJD4 NextBio:20833005 ArrayExpress:Q5XJD4
            Uniprot:Q5XJD4
        Length = 301

 Score = 204 (76.9 bits), Expect = 6.0e-15, P = 6.0e-15
 Identities = 61/205 (29%), Positives = 96/205 (46%)

Query:   118 VTPIKNQG-P--CGSCWAF---SAVAATEGITQLTTGKLISLSEQELVSCDTSGVDHGCE 171
             V+  +NQ  P  CGSCWA    SA+A    I +        LS Q ++ C  +G    C 
Sbjct:    69 VSTTRNQHIPQYCGSCWAHGSTSALADRINIKRKAAWPSAYLSVQNVIDCGDAG---SCS 125

Query:   172 GGEMEDAFKFIIHNDGITTEANYPYQAVD---------GTCNKTNEASHVAKIKGYETVP 222
             GG+    +++  HN GI  E    YQA D         GTC      + V     ++   
Sbjct:   126 GGDHSGVWEYA-HNKGIPDETCNNYQAKDQDCKPFNQCGTCTTFGVCNIVKNFTLWKVGD 184

Query:   223 ANSEEAL--LKA--VANQPVAVSIDASGSAFQFYSSGVFTGDCGTE-LDHGVTAVGYGAT 277
               S   L  +KA   +  P++  I A+      Y+ G+++       ++H V+  G+G  
Sbjct:   185 YGSASGLDKMKAEIYSGGPISCGIMATDK-LDAYTGGLYSEYVQEPYINHIVSVAGWGVD 243

Query:   278 ANGTKYWLVKNSWGTSWGEEGYIRM 302
              NG ++W+V+NSWG  WGE+G++R+
Sbjct:   244 ENGVEFWVVRNSWGEPWGEKGWLRI 268


>WB|WBGene00009158
            symbol:F26E4.3 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005576
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005044
            GeneTree:ENSGT00560000076599 HSSP:P07711 EMBL:Z81070
            eggNOG:NOG310046 HOGENOM:HOG000241342 OMA:DNCNRCT PIR:T21421
            RefSeq:NP_492593.2 ProteinModelPortal:P90850 SMR:P90850
            PaxDb:P90850 EnsemblMetazoa:F26E4.3.1 EnsemblMetazoa:F26E4.3.2
            GeneID:172827 KEGG:cel:CELE_F26E4.3 UCSC:F26E4.3.1 CTD:172827
            WormBase:F26E4.3 InParanoid:P90850 NextBio:877161 Uniprot:P90850
        Length = 452

 Score = 134 (52.2 bits), Expect = 1.4e-14, Sum P(2) = 1.4e-14
 Identities = 43/136 (31%), Positives = 71/136 (52%)

Query:    75 QEFKAFRNGYRRPDGLTSRKGTSF---KYENVIDV-------PATMDWR-KNGA-VTPIK 122
             + + AF  G    DG+  R GT F     +N+ ++       P   D R K G  + P+ 
Sbjct:   144 RNYSAFW-GRSLSDGIKYRLGTLFPERSVQNMNEILIKPRELPEHFDARDKWGPLIHPVA 202

Query:   123 NQGPCGSCWAFSAVA-ATEGITQLTTGKLIS-LSEQELVSCDTSGVDHGCEGGEMEDAFK 180
             +QG CGS W+ S  A +++ +  ++ G++ S LS Q+L+SC+      GCEGG ++ A+ 
Sbjct:   203 DQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLLSCNQHR-QKGCEGGYLDRAWW 261

Query:   181 FIIHNDGITTEANYPY 196
             +I    G+  +  YPY
Sbjct:   262 YI-RKLGVVGDHCYPY 276

 Score = 123 (48.4 bits), Expect = 1.4e-14, Sum P(2) = 1.4e-14
 Identities = 33/114 (28%), Positives = 50/114 (43%)

Query:   203 CNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGVFT-GDC 261
             C   ++ S   K+     V +  E+   + + N PV  +       F  Y+ GV+   D 
Sbjct:   302 CPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFM-YAGGVYQHSDL 360

Query:   262 GTELD--------HGVTAVGYG---ATANGTKYWLVKNSWGTSWGEEGYIRMKR 304
               +          H V  +G+G   +T    KYWL  NSWGT WGE+GY ++ R
Sbjct:   361 AAQKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGYFKVLR 414


>UNIPROTKB|E1BTI7
            symbol:TINAG "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0005044 "scavenger receptor activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0006955 "immune
            response" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0030247 "polysaccharide binding"
            evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
            [GO:0007155 "cell adhesion" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GO:GO:0007155 GO:GO:0005604 GO:GO:0005044
            GeneTree:ENSGT00560000076599 CTD:27283 OMA:WGQLTSS
            EMBL:AADN02002720 EMBL:AADN02002721 IPI:IPI00581566
            RefSeq:XP_419905.3 UniGene:Gga.11215 Ensembl:ENSGALT00000026295
            GeneID:421888 KEGG:gga:421888 Uniprot:E1BTI7
        Length = 467

 Score = 134 (52.2 bits), Expect = 3.2e-14, Sum P(2) = 3.2e-14
 Identities = 49/155 (31%), Positives = 79/155 (50%)

Query:    49 EFIESLNAAGNKPYKL-SINEFADQTNQEFKAFRNGYRRPD-GLTSRK---GTSFKYENV 103
             + I  +N+ G+  +K  +  +F   T +E    R G   P   L + K   G+S   E  
Sbjct:   164 DLIHHINS-GDYGWKADNYTQFWGMTLEEGFRKRLGTLPPSHSLLNMKAIPGSSVPEEKF 222

Query:   104 IDV-PATMDWRKNGAVTPIKNQGPCGSCWAFS-AVAATEGITQLTTGKLI-SLSEQELVS 160
              +   AT  W  +    P+ +Q  CG+ WAFS A  A + IT  + G++  +LS Q L+S
Sbjct:   223 PEFFAATYAW-PDWIHDPL-DQRNCGASWAFSTASVAADRITIHSDGQITDNLSVQNLIS 280

Query:   161 CDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYP 195
             CDT G   GC GG ++ A++++  + G+ + A YP
Sbjct:   281 CDT-GNQRGCNGGSIDGAWRYLTTH-GVVSYACYP 313

 Score = 120 (47.3 bits), Expect = 3.2e-14, Sum P(2) = 3.2e-14
 Identities = 31/121 (25%), Positives = 55/121 (45%)

Query:   192 ANYPYQAVDGTC-NKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQ 250
             + Y     +G C N   +++ + +   +  V +   + + + +A  PV   +      F 
Sbjct:   332 SEYGKNHTNGPCPNALEDSNRLYRCGSHYRVSSKETDIMEEIMAKGPVQAIMKVYEDFF- 390

Query:   251 FYSSGVFTGD--CGTELD-HGVTAVGYGATA--NGTK--YWLVKNSWGTSWGEEGYIRMK 303
              Y  G++      G++   H V  +G+G+    NG K  +W+  NSWG  WGE GY R+ 
Sbjct:   391 LYKEGIYRHSYKAGSKWKTHSVKLLGWGSLPGKNGQKQKFWIAANSWGKYWGENGYFRIL 450

Query:   304 R 304
             R
Sbjct:   451 R 451


>WB|WBGene00000783
            symbol:cpr-3 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0009792 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39890 EMBL:L39925 EMBL:Z81119
            EMBL:Z82057 PIR:T37282 RefSeq:NP_506790.1 UniGene:Cel.23503
            ProteinModelPortal:P43507 SMR:P43507 MEROPS:C01.A33
            EnsemblMetazoa:T10H4.12 GeneID:180033 KEGG:cel:CELE_T10H4.12
            UCSC:T10H4.12 CTD:180033 WormBase:T10H4.12 eggNOG:NOG240190
            InParanoid:P43507 OMA:PVEASYK NextBio:907824 Uniprot:P43507
        Length = 370

 Score = 134 (52.2 bits), Expect = 7.5e-14, Sum P(2) = 7.5e-14
 Identities = 25/57 (43%), Positives = 36/57 (63%)

Query:   249 FQFYSSGVFTGDCGTELD-HGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKR 304
             F  Y SGV+    G  +  H V  +G+G   NG  YWL+ NSWGTS+GE+G+ +++R
Sbjct:   265 FYHYKSGVYHYTSGKLVGGHAVKIIGWGVE-NGVDYWLIANSWGTSFGEKGFFKIRR 320

 Score = 113 (44.8 bits), Expect = 7.5e-14, Sum P(2) = 7.5e-14
 Identities = 28/95 (29%), Positives = 48/95 (50%)

Query:   106 VPATMD----WRKNGAVTPIKNQGPCGSCWAFSAVAA-TEGITQLTTG-KLISLSEQELV 159
             +P T D    W     +  I+NQ  CGSCWAF A    ++ +   + G +   +S ++++
Sbjct:    92 LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 151

Query:   160 SCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANY 194
             SC  +   +GC+GG   +A +F   + G  T  +Y
Sbjct:   152 SCCGTTCGYGCKGGYSIEALRFWA-SSGAVTGGDY 185


>WB|WBGene00000782
            symbol:cpr-2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 eggNOG:NOG315657 GeneTree:ENSGT00560000076599
            HOGENOM:HOG000241341 PANTHER:PTHR12411:SF16 EMBL:Z81531
            RefSeq:NP_507186.3 ProteinModelPortal:O45466 SMR:O45466
            MEROPS:C01.A40 PaxDb:O45466 EnsemblMetazoa:F36D3.9 GeneID:185355
            KEGG:cel:CELE_F36D3.9 CTD:185355 WormBase:F36D3.9 OMA:FDARLRW
            Uniprot:O45466
        Length = 326

 Score = 142 (55.0 bits), Expect = 8.5e-14, Sum P(2) = 8.5e-14
 Identities = 29/68 (42%), Positives = 39/68 (57%)

Query:   249 FQFYSSGVFTGDCG-TELDHGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDID 307
             F+ Y SG++    G ++  H V  +G+G T  GT YWL  NSWG+ WGE G  R+ R +D
Sbjct:   254 FEKYKSGIYRHIAGRSKGGHAVKLIGWG-TERGTPYWLAVNSWGSQWGESGTFRILRGVD 312

Query:   308 AKEGLCGI 315
                  CGI
Sbjct:   313 E----CGI 316

 Score = 101 (40.6 bits), Expect = 8.5e-14, Sum P(2) = 8.5e-14
 Identities = 29/101 (28%), Positives = 45/101 (44%)

Query:   101 ENVIDV-PATMD----WRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTGKLIS--L 153
             E V+D  P   D    W +  ++  I+ Q  CGSCWAFS        T + +       +
Sbjct:    77 EFVLDATPLNFDARTRWPQCKSMKLIREQSNCGSCWAFSTAEVISDRTCIASNGTQQPII 136

Query:   154 SEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANY 194
             S  +L++C       GC+GG    AF++     G+ T  +Y
Sbjct:   137 SPTDLLTCCGMSCGEGCDGGFPYRAFQWWARR-GVVTGGDY 176


>FB|FBgn0030521
            symbol:CtsB1 "Cathepsin B1" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS] [GO:0035071 "salivary gland cell autophagic cell
            death" evidence=IEP] [GO:0048102 "autophagic cell death"
            evidence=IEP] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:AE014298 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0035071
            GO:GO:0004197 MEROPS:C01.060 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 KO:K01363 PANTHER:PTHR12411:SF16
            HSSP:P07688 EMBL:AY060640 RefSeq:NP_572920.1 UniGene:Dm.3926
            SMR:Q9VY87 IntAct:Q9VY87 MINT:MINT-932864 STRING:Q9VY87
            EnsemblMetazoa:FBtr0073838 GeneID:32341 KEGG:dme:Dmel_CG10992
            UCSC:CG10992-RA FlyBase:FBgn0030521 InParanoid:Q9VY87 OMA:TEGHIRR
            OrthoDB:EOG48W9HM ChiTaRS:CG10992 GenomeRNAi:32341 NextBio:778020
            Uniprot:Q9VY87
        Length = 340

 Score = 125 (49.1 bits), Expect = 1.2e-13, Sum P(2) = 1.2e-13
 Identities = 33/102 (32%), Positives = 46/102 (45%)

Query:   216 KGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTELD-HGVTAVGY 274
             K Y +V  N  E   + + N PV  +          Y  GV+  + G EL  H +  +G+
Sbjct:   234 KSY-SVRRNVREIQEEIMTNGPVEGAFTVYEDLI-LYKDGVYQHEHGKELGGHAIRILGW 291

Query:   275 GATANGT-KYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGI 315
             G        YWL+ NSW T WG+ G+ R+ R  D     CGI
Sbjct:   292 GVWGEEKIPYWLIGNSWNTDWGDHGFFRILRGQDH----CGI 329

 Score = 120 (47.3 bits), Expect = 1.2e-13, Sum P(2) = 1.2e-13
 Identities = 34/82 (41%), Positives = 44/82 (53%)

Query:   100 YENVID-VPATMDWRKNGAVTP----IKNQGPCGSCWAFSAVAA-TEGITQLTTGKL-IS 152
             Y N +D +P   D RK     P    I++QG CGSCWAF AV A ++ +   + GK+   
Sbjct:    80 YVNSVDELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFH 139

Query:   153 LSEQELVSC-DTSGVDHGCEGG 173
              S  +LVSC  T G   GC GG
Sbjct:   140 FSADDLVSCCHTCGF--GCNGG 159


>ZFIN|ZDB-GENE-060503-240
            symbol:tinagl1 "tubulointerstitial nephritis
            antigen-like 1" species:7955 "Danio rerio" [GO:0005044 "scavenger
            receptor activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0030414 "peptidase inhibitor activity" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0002040 "sprouting
            angiogenesis" evidence=IMP] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR008037 InterPro:IPR013128 Pfam:PF00112 Pfam:PF05375
            PROSITE:PS00524 PROSITE:PS50958 SMART:SM00201 SMART:SM00645
            ZFIN:ZDB-GENE-060503-240 GO:GO:0006955 GO:GO:0030247 GO:GO:0030414
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 GO:GO:0002040
            GO:GO:0005044 GeneTree:ENSGT00560000076599 GO:GO:0010466
            SUPFAM:SSF57283 HOVERGEN:HBG053961 MEROPS:C01.975 OMA:DNCNRCT
            EMBL:BX950864 IPI:IPI00609339 UniGene:Dr.103937
            Ensembl:ENSDART00000087096 Ensembl:ENSDART00000126228
            InParanoid:Q1LUC6 Uniprot:Q1LUC6
        Length = 471

 Score = 136 (52.9 bits), Expect = 1.3e-13, Sum P(2) = 1.3e-13
 Identities = 49/168 (29%), Positives = 79/168 (47%)

Query:    36 EKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKG 95
             E E+   + +D++  I+ +N         + ++F   T  E   FR G +RP        
Sbjct:   131 ECEQHACLIEDDM--IQEINRRDYGWRAANYSQFWGMTLDEGLRFRLGTKRPTRTIMNMN 188

Query:    96 TSFKYENVID-VPA---TMD-WRKNGAVTPIKNQGPCGSCWAFSAVA-ATEGITQLTTGK 149
                   N  D +P+    +D W   G +    +QG C + WAFS  A A++ I+  + G 
Sbjct:   189 EMQMNMNGNDHLPSYFNAVDKWP--GKIHEPLDQGNCNASWAFSTAAVASDRISIQSMGH 246

Query:   150 LI-SLSEQELVSCDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPY 196
             +   LS Q L+SCDT   D GC GG ++ A+ F+    G+ T+  YP+
Sbjct:   247 MTPQLSPQNLISCDTRHQD-GCAGGRIDGAWWFM-RRRGVVTQDCYPF 292

 Score = 112 (44.5 bits), Expect = 1.3e-13, Sum P(2) = 1.3e-13
 Identities = 29/97 (29%), Positives = 43/97 (44%)

Query:   224 NSEEALLKAVANQPVAVSIDASGSAFQFYSSGVFTG-DCG--------TELDHGVTAVGY 274
             N  E + + + N PV   ++     F  Y SG+F   D              H V   G+
Sbjct:   345 NENEIMKEIMDNGPVQAIMEVHEDFF-VYKSGIFRHTDVNYHKPSQYRKHATHSVRITGW 403

Query:   275 GA----TANGTKYWLVKNSWGTSWGEEGYIRMKRDID 307
             G     +    KYW+  NSWG +WGE+GY R+ R ++
Sbjct:   404 GEERDYSGRTRKYWIGANSWGKNWGEDGYFRIARGVN 440


>DICTYBASE|DDB_G0283921
            symbol:ctsB "cathepsin B precursor" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0283921 GO:GO:0005615
            GenomeReviews:CM000153_GR GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:AAFI02000058
            eggNOG:NOG315657 PANTHER:PTHR12411:SF16 OMA:CSLSCQS
            RefSeq:XP_638805.1 HSSP:P07688 MEROPS:C01.A59
            EnsemblProtists:DDB0233997 GeneID:8624329 KEGG:ddi:DDB_G0283921
            Uniprot:Q54QD9
        Length = 311

 Score = 194 (73.4 bits), Expect = 1.8e-13, P = 1.8e-13
 Identities = 69/235 (29%), Positives = 106/235 (45%)

Query:   108 ATMDWRKNGAVTPIKNQGPCGSCWAFSAV-AATEGITQLTTGKLISLSEQELVSCDTS-- 164
             A  +W     ++ I+NQ  CGSCWAF A  +AT+ +  +   + + LS  ++V+CD +  
Sbjct:    85 AQTNWPNCTTISQIQNQARCGSCWAFGATESATDRLC-IHNNENVQLSFMDMVTCDETDN 143

Query:   165 GVDHGC----------EGGEMEDAFKFIIHNDGITTE-----ANYPYQAVDGTCNKT--- 206
             G + G           +G   E+   + I       +      N P    +   N +   
Sbjct:   144 GCEGGDAFSAWNWLRKQGAVSEECLPYTIPTCPPAQQPCLNFVNTPSCTKECQSNSSLIY 203

Query:   207 NEASH-VAKIKGYETVPANSEEALLKA-VANQPVAVSIDASGSAFQFYSSGVFTGDCGTE 264
             ++  H +AKI  ++     S+EA+++  V N PV          F  Y SGV+    G +
Sbjct:   204 SQDKHKMAKIYSFD-----SDEAIMQEIVTNGPVEACFTVFED-FLAYKSGVYVHTTGKD 257

Query:   265 LD-HGVTAVGYGATANGTKYWLVKNSWGTSWGEEGYIRMKRDIDAKEGLCGIAMD 318
             L  H V  VG+G T NG  Y+   N W TSWG+ G   +KR      G CGI+ D
Sbjct:   258 LGGHCVKLVGFG-TLNGVDYYAANNQWTTSWGDNGTFLIKR------GDCGISDD 305


>TAIR|locus:2060420
            symbol:AT2G22160 "AT2G22160" species:3702 "Arabidopsis
            thaliana" [GO:0005575 "cellular_component" evidence=ND] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] EMBL:CP002685
            GenomeReviews:CT485783_GR InterPro:IPR013201 Pfam:PF08246
            SMART:SM00848 EMBL:AC007168 IPI:IPI00544896 PIR:F84609
            RefSeq:NP_179806.1 UniGene:At.66231 HSSP:P25774
            ProteinModelPortal:Q9SIE8 SMR:Q9SIE8 EnsemblPlants:AT2G22160.1
            GeneID:816750 KEGG:ath:AT2G22160 TAIR:At2g22160 eggNOG:NOG297278
            InParanoid:Q9SIE8 OMA:HRCITLA PhylomeDB:Q9SIE8 ArrayExpress:Q9SIE8
            Genevestigator:Q9SIE8 Uniprot:Q9SIE8
        Length = 105

 Score = 173 (66.0 bits), Expect = 9.2e-13, P = 9.2e-13
 Identities = 36/91 (39%), Positives = 50/91 (54%)

Query:    36 EKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGYRRPDGLTSRKG 95
             + E  F +FK N E+I   N    KPYKL +N+FA+ T+ EF      +   D       
Sbjct:    10 QTESSFDVFKKNAEYIVKTNKE-RKPYKLKLNKFANLTDVEFVNAHTCFDMSDHKKILDS 68

Query:    96 TSFKYENVIDVPATMDWRKNGAVTPIKNQGP 126
               F YEN+   P ++DWR+ GAVT +K+QGP
Sbjct:    69 KPFFYENMTQAPDSLDWREKGAVTNVKDQGP 99


>UNIPROTKB|H0YDT2
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            EMBL:AP001201 HGNC:HGNC:2546 Ensembl:ENST00000526034 Bgee:H0YDT2
            Uniprot:H0YDT2
        Length = 211

 Score = 172 (65.6 bits), Expect = 1.2e-12, P = 1.2e-12
 Identities = 43/147 (29%), Positives = 68/147 (46%)

Query:    12 QEASLSEKHEQWMSKYGKVYKNPEEKEKRFRIFKDNVEFIESLNAAGNKPYKLSINEFAD 71
             Q   L E  + +  ++ + Y +PEE   R  IF  N+   + L        +  +  F+D
Sbjct:    33 QPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSD 92

Query:    72 QTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVID-VPATMDWRK-NGAVTPIKNQGPCGS 129
              T +EF     GYRR  G     G   + E   + VP + DWRK   A++PIK+Q  C  
Sbjct:    93 LTEEEFGQLY-GYRRAAGGVPSMGREIRSEEPEESVPFSCDWRKVASAISPIKDQKNCNC 151

Query:   130 CWAFSAVAATEGITQLTTGKLISLSEQ 156
             CWA +A    E + +++    + +S Q
Sbjct:   152 CWAMAAAGNIETLWRISFWDFVDVSVQ 178


>RGD|1359482
            symbol:Tinag "tubulointerstitial nephritis antigen"
            species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0005604 "basement membrane"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=IEA] [GO:0006955
            "immune response" evidence=IEA] [GO:0007155 "cell adhesion"
            evidence=ISO] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR001212 InterPro:IPR013128
            Pfam:PF00112 Pfam:PF01033 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 RGD:1359482 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GO:GO:0007155 EMBL:CH473954 GO:GO:0005604
            GO:GO:0005044 MEROPS:C01.973 CTD:27283 eggNOG:NOG310046
            HOGENOM:HOG000241342 HOVERGEN:HBG053961 OMA:WGQLTSS
            OrthoDB:EOG47PX5P EMBL:BC081887 IPI:IPI00370427
            RefSeq:NP_001005549.1 UniGene:Rn.43851 STRING:Q66HF6
            Ensembl:ENSRNOT00000041567 GeneID:300846 KEGG:rno:300846
            UCSC:RGD:1359482 InParanoid:Q66HF6 NextBio:647630
            Genevestigator:Q66HF6 Uniprot:Q66HF6
        Length = 475

 Score = 122 (48.0 bits), Expect = 5.5e-12, Sum P(2) = 5.5e-12
 Identities = 29/103 (28%), Positives = 47/103 (45%)

Query:   221 VPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTELD---------HGVTA 271
             + +N  E + + + N PV   +      F +Y +G++     T  +         H V  
Sbjct:   356 ISSNETEIMREIIQNGPVQAIMQVHEDFF-YYKTGIYRHVVSTNEEPEKYRKLRTHAVKL 414

Query:   272 VGYGAT--ANGTK--YWLVKNSWGTSWGEEGYIRMKRDIDAKE 310
              G+G    A G K  +W+  NSWG SWGE GY R+ R ++  +
Sbjct:   415 TGWGTLRGAQGKKEKFWIAANSWGKSWGENGYFRILRGVNESD 457

 Score = 112 (44.5 bits), Expect = 5.5e-12, Sum P(2) = 5.5e-12
 Identities = 46/165 (27%), Positives = 70/165 (42%)

Query:    49 EFIESLNAAGNKPYKLSINEFADQTNQEFKAFRNGYRRPDG-LTSRKGTSFKYENVIDVP 107
             E I+ +N         + ++F   T +E   FR G   P   L S    +  Y    D+P
Sbjct:   159 ELIDHINKGDYGWTAQNYSQFWGMTLEEGFKFRLGTLPPSPMLLSMNEMTASYPRA-DLP 217

Query:   108 ----ATMDWRKNGAVTPIKNQGPCGSCWAFS-AVAATEGITQLTTGKLIS-LSEQELVSC 161
                 A+  W       P+ +Q  C + WAFS A  A + I   + G+  + LS Q L+SC
Sbjct:   218 EVFIASYKW-PGWTHGPL-DQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISC 275

Query:   162 DTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPYQAVDGTCNKT 206
                   HGC  G ++ A+ F+    G+ + A YP      T N +
Sbjct:   276 CAKN-RHGCNSGSIDRAWWFL-RKRGLVSHACYPLFKEQSTNNNS 318


>UNIPROTKB|I3L9E7
            symbol:LOC100153159 "Uncharacterized protein" species:9823
            "Sus scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GeneTree:ENSGT00560000076599 OMA:WGQLTSS
            Ensembl:ENSSSCT00000031207 Uniprot:I3L9E7
        Length = 358

 Score = 121 (47.7 bits), Expect = 9.2e-12, Sum P(2) = 9.2e-12
 Identities = 32/103 (31%), Positives = 47/103 (45%)

Query:   221 VPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGT--ELD-------HGVTA 271
             V +N  E + + + N PV   +      F  Y +G++     T  E D       H V  
Sbjct:   239 VSSNETEIMREIMQNGPVQAIMQVHEDFFH-YKTGIYRHVTSTNEESDKYRKLRTHAVKL 297

Query:   272 VGYGAT--ANGTK--YWLVKNSWGTSWGEEGYIRMKRDIDAKE 310
              G+G    A G K  +W+  NSWG SWGE GY R+ R ++  +
Sbjct:   298 TGWGTLKGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESD 340

 Score = 107 (42.7 bits), Expect = 9.2e-12, Sum P(2) = 9.2e-12
 Identities = 47/186 (25%), Positives = 76/186 (40%)

Query:    67 NEFADQTNQEFKAFRNGYRRPDGLT-SRKGTSFKYENVIDVP----ATMDWRKNGAVTPI 121
             ++F   T +E   +R G   P  L  S    +       D+P    A+  W       P+
Sbjct:    59 SQFWGMTLEEGFKYRLGTLPPSPLLLSMNEVTASLPETTDLPEFFVASYKW-PGWTHGPL 117

Query:   122 KNQGPCGSCWAFS-AVAATEGITQLTTGKLIS-LSEQELVSCDTSGVDHGCEGGEMEDAF 179
              +Q  C + WAFS A  A + I   + G+  + LS Q L+SC      HGC  G ++ A+
Sbjct:   118 -DQKNCAASWAFSTASVAADRIAIQSEGRYTANLSPQNLISCCAKN-RHGCNSGSIDRAW 175

Query:   180 KFIIHNDGITTEANYPY----QAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVAN 235
              ++    G+ + A YP      A +  C   + +    K    +  P N E++      +
Sbjct:   176 WYL-RKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATKPCPNNFEKSNRIYQCS 234

Query:   236 QPVAVS 241
              P  VS
Sbjct:   235 PPYRVS 240


>UNIPROTKB|Q3SZI1
            symbol:TINAG "Tubulointerstitial nephritis antigen"
            species:9913 "Bos taurus" [GO:0005604 "basement membrane"
            evidence=IEA] [GO:0007155 "cell adhesion" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044
            "scavenger receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 Pfam:PF01033
            PROSITE:PS00524 PROSITE:PS50958 SMART:SM00201 SMART:SM00645
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0007155
            GO:GO:0005604 GO:GO:0005044 GeneTree:ENSGT00560000076599
            EMBL:BC102843 IPI:IPI00689615 RefSeq:NP_001030279.1
            UniGene:Bt.29080 ProteinModelPortal:Q3SZI1 MEROPS:C01.973
            PRIDE:Q3SZI1 Ensembl:ENSBTAT00000016790 GeneID:512517
            KEGG:bta:512517 CTD:27283 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 InParanoid:Q3SZI1 OMA:WGQLTSS OrthoDB:EOG47PX5P
            NextBio:20870427 Uniprot:Q3SZI1
        Length = 476

 Score = 123 (48.4 bits), Expect = 1.1e-11, Sum P(2) = 1.1e-11
 Identities = 32/120 (26%), Positives = 52/120 (43%)

Query:   204 NKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGT 263
             N   +++ + +      V +N  E + + + N PV   +      F  Y +G++     T
Sbjct:   340 NSIEKSNRIYQCSPPYRVSSNETEIMREIMQNGPVQAIMQVHEDFFN-YKTGIYRHITST 398

Query:   264 ELD---------HGVTAVGYGAT--ANGTK--YWLVKNSWGTSWGEEGYIRMKRDIDAKE 310
               D         H V   G+G    A G K  +W+  NSWG SWGE GY R+ R ++  +
Sbjct:   399 NEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESD 458

 Score = 108 (43.1 bits), Expect = 1.1e-11, Sum P(2) = 1.1e-11
 Identities = 48/171 (28%), Positives = 76/171 (44%)

Query:    67 NEFADQTNQEFKAFRNGYRRPDGLT-SRKGTSFKYENVIDVP----ATMDWRKNGAVTPI 121
             ++F   T +E   +R G   P  L  S    +       D+P    A+  W       P+
Sbjct:   177 SQFWGMTLEEGFKYRLGTLPPSPLLLSMNEVTASLTKTTDLPEFFIASYKW-PGWTHGPL 235

Query:   122 KNQGPCGSCWAFS-AVAATEGITQLTTGKLIS-LSEQELVSCDTSGVDHGCEGGEMEDAF 179
              +Q  C + WAFS A  A + I   + G+  + LS Q L+SC  +   HGC  G ++ A+
Sbjct:   236 -DQKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNLISC-CAKKRHGCNSGSVDRAW 293

Query:   180 KFIIHNDGITTEANYP-YQAVDGTCNKTNEASHV-AKIKGYETVPA-NSEE 227
              ++    G+ + A YP ++  + T N    AS    + K + T P  NS E
Sbjct:   294 WYL-RKRGLVSHACYPLFKDQNATNNGCAMASRSDGRGKRHATTPCPNSIE 343


>UNIPROTKB|E1B9H1
            symbol:TINAGL1 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0043236 "laminin binding" evidence=IEA] [GO:0031012
            "extracellular matrix" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0005044 "scavenger receptor
            activity" evidence=IEA] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0005737 GO:GO:0006955
            GO:GO:0030247 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00639 GO:GO:0031012 GO:GO:0005044
            GeneTree:ENSGT00560000076599 OMA:DNCNRCT EMBL:DAAA02006255
            IPI:IPI00732137 Ensembl:ENSBTAT00000038022 Uniprot:E1B9H1
        Length = 469

 Score = 122 (48.0 bits), Expect = 2.8e-11, Sum P(2) = 2.8e-11
 Identities = 44/156 (28%), Positives = 72/156 (46%)

Query:    49 EFIESLNAAGNKPYKLSINE-FADQTNQEFKAFRNGYRRPDGLTSRKGTSFKYENVIDV- 106
             + IE++N  G+  ++   +  F   T  E   +R G  RP    +            +V 
Sbjct:   147 DMIEAINH-GDYGWRAGNHSAFWGMTLDEGIRYRLGTVRPSSFVANMNEIHTVLGPGEVL 205

Query:   107 PATMD----WRKNGAVTPIKNQGPCGSCWAFSAVA-ATEGITQLTTGKLIS-LSEQELVS 160
             P T +    W  N    P+ +QG C   WAFS  A A++ ++  + G +   LS Q L+S
Sbjct:   206 PRTFEASEKW-PNLIHDPL-DQGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQNLLS 263

Query:   161 CDTSGVDHGCEGGEMEDAFKFIIHNDGITTEANYPY 196
             CDT     GC GG ++ A+ F+    G+ ++  YP+
Sbjct:   264 CDTHN-QQGCRGGRLDGAWWFL-RRRGVVSDHCYPF 297

 Score = 105 (42.0 bits), Expect = 2.8e-11, Sum P(2) = 2.8e-11
 Identities = 33/121 (27%), Positives = 54/121 (44%)

Query:   197 QAVDGTCNKTNEASHVAKIKGYETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGV 256
             QA     N    A+ + ++     + +N +E + + + N PV   ++     F  Y SG+
Sbjct:   324 QATARCPNSYVHANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFF-LYQSGI 382

Query:   257 FT------GDCGTELDHGVTAV---GYGATA--NGT--KYWLVKNSWGTSWGEEGYIRMK 303
             ++      G       HG  +V   G+G     +G   KYW   NSWG +WGE G+ R+ 
Sbjct:   383 YSHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPAWGERGHFRIV 442

Query:   304 R 304
             R
Sbjct:   443 R 443


>WB|WBGene00021070
            symbol:W07B8.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:FO081739 HSSP:P07688 PIR:T31730
            RefSeq:NP_503384.1 ProteinModelPortal:O16289 SMR:O16289
            EnsemblMetazoa:W07B8.1 GeneID:178613 KEGG:cel:CELE_W07B8.1
            UCSC:W07B8.1 CTD:178613 WormBase:W07B8.1 eggNOG:NOG245289
            InParanoid:O16289 OMA:TTGIYVH NextBio:901844 Uniprot:O16289
        Length = 335

 Score = 122 (48.0 bits), Expect = 2.9e-11, Sum P(2) = 2.9e-11
 Identities = 26/87 (29%), Positives = 41/87 (47%)

Query:   219 ETVPANSEEALLKAVANQPVAVSIDASGSAFQFYSSGVFTGDCGTELDH-GVTAVGYGAT 277
             + +P +  E     + N P+  + +      Q Y++G++    G +  H  V  +G+G  
Sbjct:   232 DQLPNSQIEIQSDVMLNGPIQATFEVYDDFLQ-YTTGIYVHLTGNKQGHLSVRIIGWGVW 290

Query:   278 ANGTKYWLVKNSWGTSWGEEGYIRMKR 304
               G  YWL  NSWG  WGE G  R+ R
Sbjct:   291 -QGVPYWLCANSWGRQWGENGTFRVLR 316

 Score = 100 (40.3 bits), Expect = 2.9e-11, Sum P(2) = 2.9e-11
 Identities = 28/90 (31%), Positives = 43/90 (47%)

Query:   112 WRKNGAVTPIKNQGPCGSCWAFSAVAATEGITQLTTG--KLISLSEQELVSCDTS--GVD 167
             W +  ++  I +   C + WAF+A  +      + +G  K   LS +EL+SC T      
Sbjct:    86 WPECMSIPQINDISECKTSWAFAAAESMSDRLCINSGGFKNTILSAEELLSCCTGMFSCG 145

Query:   168 HGCEGGEMEDAFKFIIHNDGITTEANYPYQ 197
              GCEGG    A+++I    GI T  +Y  Q
Sbjct:   146 EGCEGGNPFKAWQYI-QKHGIPTGGSYESQ 174

WARNING:  HSPs involving 45 database sequences were not reported due to the
          limiting value of parameter B = 250.


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.312   0.129   0.385    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      324       324   0.00086  116 3  11 23  0.47    34
                                                     33  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  295
  No. of states in DFA:  622 (66 KB)
  Total size of DFA:  240 KB (2129 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  27.06u 0.11s 27.17t   Elapsed:  00:00:01
  Total cpu time:  27.11u 0.11s 27.22t   Elapsed:  00:00:01
  Start:  Thu May  9 18:26:39 2013   End:  Thu May  9 18:26:40 2013
WARNINGS ISSUED:  2
