BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>018958
MVLIFERSGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD
ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS
TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL
IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA
AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIV
GFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA

High Scoring Gene Products

Symbol, full name Information P value
AT3G49340 protein from Arabidopsis thaliana 9.0e-93
AT2G27420 protein from Arabidopsis thaliana 1.1e-89
AT2G34080 protein from Arabidopsis thaliana 5.9e-89
SAG12
senescence-associated gene 12
protein from Arabidopsis thaliana 9.9e-87
AT1G29090 protein from Arabidopsis thaliana 6.9e-86
AT1G29080 protein from Arabidopsis thaliana 3.4e-84
XCP1
xylem cysteine peptidase 1
protein from Arabidopsis thaliana 4.2e-79
RD21A
responsive to dehydration 21A
protein from Arabidopsis thaliana 4.8e-78
AT3G19390 protein from Arabidopsis thaliana 5.5e-77
RD21B
esponsive to dehydration 21B
protein from Arabidopsis thaliana 5.5e-77
AT4G23520 protein from Arabidopsis thaliana 3.9e-76
CEP1
cysteine endopeptidase 1
protein from Arabidopsis thaliana 7.3e-75
CEP3
cysteine endopeptidase 3
protein from Arabidopsis thaliana 3.1e-74
XCP2
AT1G20850
protein from Arabidopsis thaliana 4.0e-74
CP1
cysteine protease 1
protein from Arabidopsis thaliana 2.8e-73
CP2
cysteine protease 2
protein from Arabidopsis thaliana 2.8e-73
AT1G29110 protein from Arabidopsis thaliana 5.9e-73
XBCP3
xylem bark cysteine peptidase 3
protein from Arabidopsis thaliana 1.6e-72
AT3G19400 protein from Arabidopsis thaliana 1.1e-71
Cp1
Cysteine proteinase-1
protein from Drosophila melanogaster 1.0e-68
AT3G43960 protein from Arabidopsis thaliana 7.2e-68
AT1G06260 protein from Arabidopsis thaliana 3.2e-65
zgc:174855 gene_product from Danio rerio 4.1e-65
RGD1308751
similar to Cathepsin L precursor (Major excreted protein) (MEP)
gene from Rattus norvegicus 6.7e-65
zgc:174153 gene_product from Danio rerio 8.5e-65
wu:fb37b09 gene_product from Danio rerio 2.9e-64
Ctsll3
cathepsin L-like 3
gene from Rattus norvegicus 4.7e-64
ctsll
cathepsin L, like
gene_product from Danio rerio 7.6e-64
ctsl1b
cathepsin L, 1 b
gene_product from Danio rerio 1.6e-63
cprC
cysteine proteinase 3
gene from Dictyostelium discoideum 8.8e-63
CTSL2
Uncharacterized protein
protein from Gallus gallus 1.8e-62
cprD
cysteine proteinase 4
gene from Dictyostelium discoideum 9.9e-62
cprG
cysteine proteinase 7
gene from Dictyostelium discoideum 2.0e-61
ctsl1a
cathepsin L, 1 a
gene_product from Danio rerio 2.7e-61
cprF
cysteine proteinase 6
gene from Dictyostelium discoideum 4.2e-61
CTSS
Cathepsin S
protein from Canis lupus familiaris 7.1e-61
Ctsl
cathepsin L
protein from Mus musculus 9.0e-61
ctsl.1
cathepsin L.1
gene_product from Danio rerio 1.2e-60
CTSS
Cathepsin S
protein from Bos taurus 1.5e-60
CTSS
Cathepsin S
protein from Canis lupus familiaris 1.5e-60
Ctss
cathepsin S
protein from Mus musculus 1.5e-60
Cys
Crustapain
protein from Pandalus borealis 2.4e-60
cprE
cysteine proteinase 5
gene from Dictyostelium discoideum 3.8e-60
CTSS
Uncharacterized protein
protein from Sus scrofa 5.0e-60
CTSL1
Cathepsin L1
protein from Homo sapiens 1.0e-59
CTSL1
CTSL1 protein
protein from Bos taurus 2.2e-59
Ssc.54235
Uncharacterized protein
protein from Sus scrofa 2.2e-59
CTSS
Cathepsin S
protein from Homo sapiens 2.7e-59
cpl-1 gene from Caenorhabditis elegans 2.7e-59
Ctsl1
cathepsin L1
gene from Rattus norvegicus 3.5e-59
CTSL2
Cathepsin L2
protein from Homo sapiens 1.5e-58
cprB
cysteine proteinase 2
gene from Dictyostelium discoideum 2.3e-58
CTSL1
Cathepsin L1
protein from Bos taurus 1.1e-57
CTSL1
Cathepsin L1
protein from Sus scrofa 1.1e-57
ctssb.2
cathepsin S, b.2
gene_product from Danio rerio 1.1e-57
CTSL1
Cathepsin L1
protein from Canis lupus familiaris 1.4e-57
CTSK
Cathepsin K
protein from Homo sapiens 1.7e-57
Ctsk
cathepsin K
gene from Rattus norvegicus 5.9e-57
CTSK
Cathepsin K
protein from Sus scrofa 7.5e-57
ctsh
cathepsin H
gene_product from Danio rerio 7.5e-57
CTSL2
Cathepsin L2
protein from Bos taurus 9.6e-57
Ctsj
cathepsin J
protein from Mus musculus 9.6e-57
P83654
Ervatamin-C
protein from Tabernaemontana divaricata 1.2e-56
ctssb.1
cathepsin S, b.1
gene_product from Danio rerio 1.2e-56
LOC420160
Uncharacterized protein
protein from Gallus gallus 1.6e-56
CTSK
Cathepsin K
protein from Canis lupus familiaris 6.8e-56
CTSK
Cathepsin K
protein from Canis lupus familiaris 6.8e-56
cprH
cysteine proteinase 8
gene from Dictyostelium discoideum 8.6e-56
Testin
testin gene
gene from Rattus norvegicus 8.6e-56
ctsk
cathepsin K
gene_product from Danio rerio 1.8e-55
CTSL1
Cathepsin L1
protein from Canis lupus familiaris 2.3e-55
Ctsk
cathepsin K
protein from Mus musculus 3.7e-55
Ctss
cathepsin S
gene from Rattus norvegicus 4.8e-55
AT3G45310 protein from Arabidopsis thaliana 4.8e-55
ctssa
cathepsin S, a
gene_product from Danio rerio 4.8e-55
Ctsh
cathepsin H
gene from Rattus norvegicus 6.1e-55
Ctsj
cathepsin J
gene from Rattus norvegicus 6.1e-55
CTSH
Uncharacterized protein
protein from Oryctolagus cuniculus 7.8e-55
4930486L24Rik
RIKEN cDNA 4930486L24 gene
protein from Mus musculus 7.8e-55
CTSK
Cathepsin K
protein from Bos taurus 9.9e-55
P83443
Macrodontain-1
protein from Pseudananas sagenarius 2.6e-54
CTSH
Pro-cathepsin H
protein from Sus scrofa 4.3e-54
CTSH
Pro-cathepsin H
protein from Bos taurus 7.0e-54
ctskl
cathepsin K, like
gene_product from Danio rerio 3.0e-53
DDB_G0272298 gene from Dictyostelium discoideum 4.9e-53
CTSH
Uncharacterized protein
protein from Ailuropoda melanoleuca 4.9e-53
Ctsq
cathepsin Q
gene from Rattus norvegicus 4.9e-53
Ctsh
cathepsin H
protein from Mus musculus 6.3e-53
CTSH
Uncharacterized protein
protein from Canis lupus familiaris 8.0e-53
MGC114246
similar to cathepsin R
gene from Rattus norvegicus 8.0e-53
CTSH
Uncharacterized protein
protein from Callithrix jacchus 1.0e-52
CTSH
Uncharacterized protein
protein from Callithrix jacchus 1.0e-52
D3ZZR3
Uncharacterized protein
protein from Rattus norvegicus 1.0e-52
cfaD
peptidase C1A family protein
gene from Dictyostelium discoideum 1.7e-52
CTSH
Uncharacterized protein
protein from Macaca mulatta 2.1e-52
CTSH
Uncharacterized protein
protein from Equus caballus 2.7e-52
Cat-1
Cathepsin L-like proteinase
protein from Fasciola hepatica 4.4e-52
CTSH
Uncharacterized protein
protein from Nomascus leucogenys 5.6e-52

The BLAST search returned 2 gene products which did not match your query constraints. Please see the full BLAST report below for the details.

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  018958
        (348 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2082881 - symbol:AT3G49340 species:3702 "Arabi...   924  9.0e-93   1
TAIR|locus:2038588 - symbol:AT2G27420 species:3702 "Arabi...   895  1.1e-89   1
TAIR|locus:2055440 - symbol:AT2G34080 species:3702 "Arabi...   888  5.9e-89   1
TAIR|locus:2152445 - symbol:SAG12 "senescence-associated ...   867  9.9e-87   1
TAIR|locus:2029924 - symbol:AT1G29090 species:3702 "Arabi...   859  6.9e-86   1
TAIR|locus:2029934 - symbol:AT1G29080 species:3702 "Arabi...   843  3.4e-84   1
TAIR|locus:2122113 - symbol:XCP1 "xylem cysteine peptidas...   795  4.2e-79   1
TAIR|locus:2825832 - symbol:RD21A "responsive to dehydrat...   785  4.8e-78   1
TAIR|locus:2090614 - symbol:AT3G19390 species:3702 "Arabi...   775  5.5e-77   1
TAIR|locus:2167821 - symbol:RD21B "esponsive to dehydrati...   775  5.5e-77   1
TAIR|locus:2117979 - symbol:AT4G23520 species:3702 "Arabi...   767  3.9e-76   1
TAIR|locus:2157712 - symbol:CEP1 "cysteine endopeptidase ...   755  7.3e-75   1
TAIR|locus:505006391 - symbol:CEP3 "cysteine endopeptidas...   749  3.1e-74   1
TAIR|locus:2030427 - symbol:XCP2 "xylem cysteine peptidas...   748  4.0e-74   1
TAIR|locus:2128243 - symbol:AT4G11310 species:3702 "Arabi...   740  2.8e-73   1
TAIR|locus:2128253 - symbol:AT4G11320 species:3702 "Arabi...   740  2.8e-73   1
TAIR|locus:2030027 - symbol:AT1G29110 species:3702 "Arabi...   737  5.9e-73   1
TAIR|locus:2024362 - symbol:XBCP3 "xylem bark cysteine pe...   733  1.6e-72   1
TAIR|locus:2090629 - symbol:AT3G19400 species:3702 "Arabi...   725  1.1e-71   1
FB|FBgn0013770 - symbol:Cp1 "Cysteine proteinase-1" speci...   697  1.0e-68   1
TAIR|locus:2097104 - symbol:AT3G43960 species:3702 "Arabi...   689  7.2e-68   1
TAIR|locus:2038515 - symbol:AT1G06260 species:3702 "Arabi...   664  3.2e-65   1
ZFIN|ZDB-GENE-071004-74 - symbol:zgc:174855 "zgc:174855" ...   663  4.1e-65   1
RGD|1308751 - symbol:RGD1308751 "similar to Cathepsin L p...   661  6.7e-65   1
ZFIN|ZDB-GENE-080215-7 - symbol:zgc:174153 "zgc:174153" s...   660  8.5e-65   1
ZFIN|ZDB-GENE-030131-572 - symbol:wu:fb37b09 "wu:fb37b09"...   655  2.9e-64   1
RGD|1560071 - symbol:Ctsll3 "cathepsin L-like 3" species:...   653  4.7e-64   1
ZFIN|ZDB-GENE-041010-76 - symbol:ctsll "cathepsin L, like...   651  7.6e-64   1
ZFIN|ZDB-GENE-980526-285 - symbol:ctsl1b "cathepsin L, 1 ...   648  1.6e-63   1
DICTYBASE|DDB_G0283867 - symbol:cprC "cysteine proteinase...   641  8.8e-63   1
UNIPROTKB|F1NYJ1 - symbol:CTSL2 "Uncharacterized protein"...   638  1.8e-62   1
DICTYBASE|DDB_G0278721 - symbol:cprD "cysteine proteinase...   522  9.9e-62   2
DICTYBASE|DDB_G0279187 - symbol:cprG "cysteine proteinase...   534  2.0e-61   2
ZFIN|ZDB-GENE-030131-106 - symbol:ctsl1a "cathepsin L, 1 ...   627  2.7e-61   1
DICTYBASE|DDB_G0279185 - symbol:cprF "cysteine proteinase...   525  4.2e-61   2
UNIPROTKB|F1PAK0 - symbol:CTSS "Cathepsin S" species:9615...   623  7.1e-61   1
MGI|MGI:88564 - symbol:Ctsl "cathepsin L" species:10090 "...   622  9.0e-61   1
ZFIN|ZDB-GENE-040718-61 - symbol:ctsl.1 "cathepsin L.1" s...   621  1.2e-60   1
UNIPROTKB|P25326 - symbol:CTSS "Cathepsin S" species:9913...   620  1.5e-60   1
UNIPROTKB|Q8HY81 - symbol:CTSS "Cathepsin S" species:9615...   620  1.5e-60   1
MGI|MGI:107341 - symbol:Ctss "cathepsin S" species:10090 ...   620  1.5e-60   1
UNIPROTKB|Q86GF7 - symbol:Cys "Crustapain" species:6703 "...   618  2.4e-60   1
DICTYBASE|DDB_G0272815 - symbol:cprE "cysteine proteinase...   523  3.8e-60   2
UNIPROTKB|F1SS93 - symbol:CTSS "Uncharacterized protein" ...   615  5.0e-60   1
UNIPROTKB|P07711 - symbol:CTSL1 "Cathepsin L1" species:96...   612  1.0e-59   1
UNIPROTKB|A4IFS7 - symbol:CTSL1 "CTSL1 protein" species:9...   609  2.2e-59   1
UNIPROTKB|F1S4J6 - symbol:Ssc.54235 "Cathepsin L1" specie...   609  2.2e-59   1
UNIPROTKB|P25774 - symbol:CTSS "Cathepsin S" species:9606...   608  2.7e-59   1
WB|WBGene00000776 - symbol:cpl-1 species:6239 "Caenorhabd...   608  2.7e-59   1
RGD|2448 - symbol:Ctsl1 "cathepsin L1" species:10116 "Rat...   607  3.5e-59   1
UNIPROTKB|O60911 - symbol:CTSL2 "Cathepsin L2" species:96...   601  1.5e-58   1
DICTYBASE|DDB_G0279799 - symbol:cprB "cysteine proteinase...   487  2.3e-58   2
UNIPROTKB|P25975 - symbol:CTSL1 "Cathepsin L1" species:99...   593  1.1e-57   1
UNIPROTKB|Q28944 - symbol:CTSL1 "Cathepsin L1" species:98...   593  1.1e-57   1
ZFIN|ZDB-GENE-050626-55 - symbol:ctssb.2 "cathepsin S, b....   593  1.1e-57   1
UNIPROTKB|Q9GL24 - symbol:CTSL1 "Cathepsin L1" species:96...   592  1.4e-57   1
UNIPROTKB|P43235 - symbol:CTSK "Cathepsin K" species:9606...   591  1.7e-57   1
RGD|61810 - symbol:Ctsk "cathepsin K" species:10116 "Ratt...   586  5.9e-57   1
UNIPROTKB|Q9GLE3 - symbol:CTSK "Cathepsin K" species:9823...   585  7.5e-57   1
ZFIN|ZDB-GENE-030131-3539 - symbol:ctsh "cathepsin H" spe...   585  7.5e-57   1
UNIPROTKB|Q5E998 - symbol:CTSL2 "Cathepsin L2" species:99...   584  9.6e-57   1
MGI|MGI:1349426 - symbol:Ctsj "cathepsin J" species:10090...   584  9.6e-57   1
UNIPROTKB|P83654 - symbol:P83654 "Ervatamin-C" species:52...   583  1.2e-56   1
ZFIN|ZDB-GENE-050522-559 - symbol:ctssb.1 "cathepsin S, b...   583  1.2e-56   1
UNIPROTKB|F1NZ37 - symbol:LOC420160 "Uncharacterized prot...   582  1.6e-56   1
UNIPROTKB|G1K2A7 - symbol:CTSK "Cathepsin K" species:9615...   576  6.8e-56   1
UNIPROTKB|Q3ZKN1 - symbol:CTSK "Cathepsin K" species:9615...   576  6.8e-56   1
DICTYBASE|DDB_G0278401 - symbol:cprH "cysteine proteinase...   575  8.6e-56   1
RGD|708447 - symbol:Testin "testin gene" species:10116 "R...   575  8.6e-56   1
ZFIN|ZDB-GENE-001205-4 - symbol:ctsk "cathepsin K" specie...   572  1.8e-55   1
UNIPROTKB|F1PMM9 - symbol:CTSL1 "Cathepsin L1" species:96...   571  2.3e-55   1
MGI|MGI:107823 - symbol:Ctsk "cathepsin K" species:10090 ...   569  3.7e-55   1
RGD|621513 - symbol:Ctss "cathepsin S" species:10116 "Rat...   568  4.8e-55   1
TAIR|locus:2078312 - symbol:AT3G45310 species:3702 "Arabi...   568  4.8e-55   1
ZFIN|ZDB-GENE-040426-1583 - symbol:ctssa "cathepsin S, a"...   568  4.8e-55   1
RGD|2447 - symbol:Ctsh "cathepsin H" species:10116 "Rattu...   567  6.1e-55   1
RGD|69241 - symbol:Ctsj "cathepsin J" species:10116 "Ratt...   567  6.1e-55   1
UNIPROTKB|G1SQF0 - symbol:CTSH "Uncharacterized protein" ...   566  7.8e-55   1
MGI|MGI:1922258 - symbol:4930486L24Rik "RIKEN cDNA 493048...   566  7.8e-55   1
UNIPROTKB|Q4QRC2 - symbol:Ctsql2 "Protein Ctsql2" species...   566  7.8e-55   1
UNIPROTKB|Q5E968 - symbol:CTSK "Cathepsin K" species:9913...   565  9.9e-55   1
UNIPROTKB|P83443 - symbol:P83443 "Macrodontain-1" species...   561  2.6e-54   1
UNIPROTKB|O46427 - symbol:CTSH "Pro-cathepsin H" species:...   559  4.3e-54   1
UNIPROTKB|Q3T0I2 - symbol:CTSH "Pro-cathepsin H" species:...   557  7.0e-54   1
ZFIN|ZDB-GENE-050208-336 - symbol:ctskl "cathepsin K, lik...   551  3.0e-53   1
UNIPROTKB|E9PSK9 - symbol:Ctsql2 "Protein Ctsql2" species...   550  3.8e-53   1
DICTYBASE|DDB_G0272298 - symbol:DDB_G0272298 species:4468...   549  4.9e-53   1
UNIPROTKB|G1M0X4 - symbol:CTSH "Uncharacterized protein" ...   549  4.9e-53   1
RGD|631421 - symbol:Ctsq "cathepsin Q" species:10116 "Rat...   549  4.9e-53   1
MGI|MGI:107285 - symbol:Ctsh "cathepsin H" species:10090 ...   548  6.3e-53   1
UNIPROTKB|F6X9C1 - symbol:CTSH "Uncharacterized protein" ...   547  8.0e-53   1
RGD|1562210 - symbol:MGC114246 "similar to cathepsin R" s...   547  8.0e-53   1
UNIPROTKB|F7B939 - symbol:CTSH "Uncharacterized protein" ...   546  1.0e-52   1
UNIPROTKB|F7BRD4 - symbol:CTSH "Uncharacterized protein" ...   546  1.0e-52   1
UNIPROTKB|D3ZZR3 - symbol:D3ZZR3 "Uncharacterized protein...   546  1.0e-52   1
DICTYBASE|DDB_G0281605 - symbol:cfaD "peptidase C1A famil...   544  1.7e-52   1
UNIPROTKB|F6R7P5 - symbol:CTSH "Uncharacterized protein" ...   543  2.1e-52   1
UNIPROTKB|F7BJD8 - symbol:CTSH "Uncharacterized protein" ...   542  2.7e-52   1
UNIPROTKB|Q24940 - symbol:Cat-1 "Cathepsin L-like protein...   540  4.4e-52   1
UNIPROTKB|G1RBY1 - symbol:CTSH "Uncharacterized protein" ...   539  5.6e-52   1

WARNING:  Descriptions of 198 database sequences were not reported due to the
          limiting value of parameter V = 100.


>TAIR|locus:2082881 [details] [associations]
            symbol:AT3G49340 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002686
            GenomeReviews:BA000014_GR EMBL:AC012329 EMBL:AL132956
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 MEROPS:I29.003 HOGENOM:HOG000230773 HSSP:P07711
            KO:K01376 IPI:IPI00520642 PIR:T45839 RefSeq:NP_566920.1
            UniGene:At.53854 ProteinModelPortal:Q9SG15 SMR:Q9SG15
            EnsemblPlants:AT3G49340.1 GeneID:824096 KEGG:ath:AT3G49340
            TAIR:At3g49340 InParanoid:Q9SG15 OMA:PQNDEEA PhylomeDB:Q9SG15
            ProtClustDB:CLSN2688476 Genevestigator:Q9SG15 Uniprot:Q9SG15
        Length = 341

 Score = 924 (330.3 bits), Expect = 9.0e-93, P = 9.0e-93
 Identities = 181/344 (52%), Positives = 234/344 (68%)

Query:    15 TTPMFIIITLLVSCASQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
             T+ +F ++ +L+S  +  V+SR    E S VE HE+WM++  R Y D+ EK  R +IF  
Sbjct:     2 TSIVFFLLAILLSSRTSGVTSRGGLFEASAVEKHEQWMSRFNRVYSDDSEKTSRFEIFTN 61

Query:    74 NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXX-----FKY 128
             NL+++E  N   N+TY L  N+FSDLT++EF+A YTG  +P    R           F+Y
Sbjct:    62 NLKFVESINMNTNKTYTLDVNEFSDLTDEEFKARYTGLVVPEGMTRISTTDSHETVSFRY 121

Query:   129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
             +N+  T    S+DW  +GAVT +K+Q++CGCCWAF+AVAAVEG+TKI +G L+ LSEQQL
Sbjct:   122 ENVGETG--ESMDWIQEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIANGELVSLSEQQL 179

Query:   189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
             LDCST  NNGC GG   KAF YI +NQGI TED YPYQ    TC +    AAA IS YE 
Sbjct:   180 LDCSTE-NNGCGGGIMWKAFDYIKENQGITTEDNYPYQGAQQTCESNHL-AAATISGYET 237

Query:   249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
             VP  DE+ALLKAVS QPVS+AI     EF  Y  GIFNG CGTQL HAVTIVG+G +E+G
Sbjct:   238 VPQNDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTQLTHAVTIVGYGVSEEG 297

Query:   309 ANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
               YWL+KNSWG +WG+ GYM+I+RD    +G+CG+ + + YP+A
Sbjct:   298 IKYWLLKNSWGESWGENGYMRIMRDVDSPQGMCGLASLAYYPVA 341


>TAIR|locus:2038588 [details] [associations]
            symbol:AT2G27420 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002685
            GenomeReviews:CT485783_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC006232
            MEROPS:I29.003 OMA:EEFRATH HOGENOM:HOG000230773 HSSP:P53634
            ProtClustDB:CLSN2688476 EMBL:AY064033 EMBL:AY096388 IPI:IPI00539752
            PIR:F84672 RefSeq:NP_565649.1 UniGene:At.27094
            ProteinModelPortal:Q9ZQH7 SMR:Q9ZQH7 PRIDE:Q9ZQH7
            EnsemblPlants:AT2G27420.1 GeneID:817287 KEGG:ath:AT2G27420
            TAIR:At2g27420 InParanoid:Q9ZQH7 PhylomeDB:Q9ZQH7
            ArrayExpress:Q9ZQH7 Genevestigator:Q9ZQH7 Uniprot:Q9ZQH7
        Length = 348

 Score = 895 (320.1 bits), Expect = 1.1e-89, P = 1.1e-89
 Identities = 177/346 (51%), Positives = 226/346 (65%)

Query:    18 MFIIITLLVSCASQVVSSR-STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
             +  I+T+ +S  + + +SR S  E S +E HE+WMA+  R Y DE EK  R  IFK+NLE
Sbjct:     5 IIFILTIFLSYRTSLATSRGSLFEASAIEKHEQWMARFNRVYSDETEKRNRFNIFKKNLE 64

Query:    77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXX------FKYQN 130
             +++  N     TYK+  N+FSDLT++EFRA +TG  +P    R            F+Y N
Sbjct:    65 FVQNFNMNNKITYKVDINEFSDLTDEEFRATHTGLVVPEAITRISTLSSGKNTVPFRYGN 124

Query:   131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
             +S  D   S+DWR +GAVTP+K Q  CG CWAF+AVAAVEGITKI  G L+ LSEQQLLD
Sbjct:   125 VS--DNGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQLLD 182

Query:   191 CSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA----AAKISNY 246
             C  + N GC GG   KAF YII+NQGI TED YPYQ    TCS++   +    AA IS Y
Sbjct:   183 CDRDYNQGCRGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATISGY 242

Query:   247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
             E VP  +E+ALL+AVS QPVS+ I      F+ Y  G+FNG CGT L HAVTIVG+G +E
Sbjct:   243 ETVPMNNEEALLQAVSQQPVSVGIEGTGAAFRHYSGGVFNGECGTDLHHAVTIVGYGMSE 302

Query:   307 DGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
             +G  YW++KNSWG TWG+ GYM+I RD    +G+CG+   + YPLA
Sbjct:   303 EGTKYWVVKNSWGETWGENGYMRIKRDVDAPQGMCGLAILAFYPLA 348


>TAIR|locus:2055440 [details] [associations]
            symbol:AT2G34080 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002685 GenomeReviews:CT485783_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 MEROPS:I29.003 EMBL:AC002341
            HOGENOM:HOG000230773 HSSP:P53634 IPI:IPI00530325 PIR:B84752
            RefSeq:NP_565780.1 UniGene:At.28613 UniGene:At.37859
            ProteinModelPortal:O22961 SMR:O22961 EnsemblPlants:AT2G34080.1
            GeneID:817969 KEGG:ath:AT2G34080 TAIR:At2g34080 InParanoid:O22961
            OMA:SENDYSY PhylomeDB:O22961 ProtClustDB:CLSN2688064
            ArrayExpress:O22961 Genevestigator:O22961 Uniprot:O22961
        Length = 345

 Score = 888 (317.7 bits), Expect = 5.9e-89, P = 5.9e-89
 Identities = 167/337 (49%), Positives = 231/337 (68%)

Query:    20 IIITLLVSCASQVVSSRST--HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
             ++I L         +SR+    EQS+V+ HE+WMA+  R Y+DELEK MR  +FK+NL++
Sbjct:    10 VLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDVFKKNLKF 69

Query:    78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK-MPSPSHRXXXXXXFKYQNLSMTD- 135
             IE  NK+GN++YKLG N+F+D TN+EF A++TG K +   S           Q  +++D 
Sbjct:    70 IENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVSPSKVVAKTISSQTWNVSDM 129

Query:   136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
             V  S DWR +GAVTP+K Q +CGCCWAF+AVAAVEG+ KI  GNL+ LSEQQLLDC    
Sbjct:   130 VVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCDREY 189

Query:   196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
             + GC GG    AF Y++QN+GIA+E++Y YQ   G C +  +PAA +IS ++ VPS +E+
Sbjct:   190 DRGCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGCRSNARPAA-RISGFQTVPSNNER 248

Query:   256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
             ALL+AVS QPVS+++ A    F  Y  G+++G CGT  +HAVT VG+GT++DG  YWL K
Sbjct:   249 ALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGTKYWLAK 308

Query:   316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
             NSWG TWG+ GY++I RD    +G+CG+   + YP+A
Sbjct:   309 NSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345


>TAIR|locus:2152445 [details] [associations]
            symbol:SAG12 "senescence-associated gene 12" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
            [GO:0007568 "aging" evidence=IEP;TAS] [GO:0010150 "leaf senescence"
            evidence=IEP;TAS] [GO:0010282 "senescence-associated vacuole"
            evidence=IDA] [GO:0009817 "defense response to fungus, incompatible
            interaction" evidence=IEP] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:CP002688 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0010150 GO:GO:0009817 EMBL:AB016870
            HSSP:O65039 OMA:NDEQALM EMBL:AF370131 EMBL:AY040073 IPI:IPI00544181
            RefSeq:NP_568651.1 UniGene:At.75256 UniGene:At.7710
            ProteinModelPortal:Q9FJ47 SMR:Q9FJ47 IntAct:Q9FJ47 STRING:Q9FJ47
            MEROPS:C01.117 PRIDE:Q9FJ47 ProMEX:Q9FJ47 EnsemblPlants:AT5G45890.1
            GeneID:834629 KEGG:ath:AT5G45890 TAIR:At5g45890 InParanoid:Q9FJ47
            PhylomeDB:Q9FJ47 ProtClustDB:CLSN2917735 ArrayExpress:Q9FJ47
            Genevestigator:Q9FJ47 GO:GO:0010282 Uniprot:Q9FJ47
        Length = 346

 Score = 867 (310.3 bits), Expect = 9.9e-87, P = 9.9e-87
 Identities = 164/338 (48%), Positives = 221/338 (65%)

Query:    18 MFIIITLLVSCASQVVSSRSTHEQSVVEI-HEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
             +F+ + +  S    +  SR    + +++  H +WM +HGR Y D  E+  R  +FK N+E
Sbjct:     8 IFLFVAIFSSFCFSITLSRPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVE 67

Query:    77 YIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP--SHRXXXXXXFKYQNLSM 133
              IE  N     RT+KL  NQF+DLTNDEFR++YTG+K  S   S        F+YQN+S 
Sbjct:    68 RIEHLNSIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSALSSQSQTKMSPFRYQNVSS 127

Query:   134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
               +P S+DWR KGAVTPIKNQ  CGCCWAF+AVAA+EG T+I+ G LI LSEQQL+DC T
Sbjct:   128 GALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDT 187

Query:   194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSG 252
             N + GC GG  + AF +I    G+ TE  YPY+    TC++ +  P A  I+ YE+VP  
Sbjct:   188 N-DFGCEGGLMDTAFEHIKATGGLTTESNYPYKGEDATCNSKKTNPKATSITGYEDVPVN 246

Query:   253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
             DEQAL+KAV+ QPVS+ I     +FQ Y  G+F G C T LDHAVT +G+G + +G+ YW
Sbjct:   247 DEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGSKYW 306

Query:   313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
             +IKNSWG  WG++GYM+I +D    +GLCG+  ++SYP
Sbjct:   307 IIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYP 344


>TAIR|locus:2029924 [details] [associations]
            symbol:AT1G29090 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002684 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            HOGENOM:HOG000230773 HSSP:P53634 ProtClustDB:CLSN2688064
            EMBL:BT004146 IPI:IPI00545702 RefSeq:NP_564321.2 UniGene:At.40814
            ProteinModelPortal:Q84W75 SMR:Q84W75 MEROPS:C01.A15
            EnsemblPlants:AT1G29090.1 GeneID:839784 KEGG:ath:AT1G29090
            TAIR:At1g29090 InParanoid:Q84W75 OMA:SIRGHED PhylomeDB:Q84W75
            ArrayExpress:Q84W75 Genevestigator:Q84W75 Uniprot:Q84W75
        Length = 355

 Score = 859 (307.4 bits), Expect = 6.9e-86, P = 6.9e-86
 Identities = 172/347 (49%), Positives = 236/347 (68%)

Query:    15 TTPMFIIITLLVSCASQVVS---SRST-HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
             T+ +F++++L +   +  VS   SR T HE  V E H++WM +  R Y DELEK+MR  +
Sbjct:    11 TSILFMLVSLTILSMNLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDV 70

Query:    71 FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS--PSHRXXXXXXFKY 128
             FK+NL++IEK NK+G+RTYKLG N+F+D T +EF A +TG K  +  PS          +
Sbjct:    71 FKKNLKFIEKFNKKGDRTYKLGVNEFADWTREEFIATHTGLKGVNGIPSSEFVDEMIPSW 130

Query:   129 QNLSMTDVP--TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
              N +++DV    + DWR +GAVTP+K Q +CGCCWAF++VAAVEG+TKI   NL+ LSEQ
Sbjct:   131 -NWNVSDVAGRETKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQ 189

Query:   187 QLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNY 246
             QLLDC    +NGC GG    AF+YII+N+GIA+E  YPYQA  GTC    KP+A  I  +
Sbjct:   190 QLLDCDRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQAAEGTCRYNGKPSAW-IRGF 248

Query:   247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQLDHAVTIVGFGTT 305
             + VPS +E+ALL+AVS QPVS++I A    F  Y  G+++   CGT ++HAVT VG+GT+
Sbjct:   249 QTVPSNNERALLEAVSKQPVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTS 308

Query:   306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
              +G  YWL KNSWG TWG+ GY++I RD    +G+CG+   + YP+A
Sbjct:   309 PEGIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 355


>TAIR|locus:2029934 [details] [associations]
            symbol:AT1G29080 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002684 GenomeReviews:CT485782_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AC021043 MEROPS:I29.003 HOGENOM:HOG000230773
            HSSP:P53634 ProtClustDB:CLSN2688064 EMBL:DQ056468 IPI:IPI00521747
            PIR:C86413 RefSeq:NP_564320.1 UniGene:At.51814
            ProteinModelPortal:Q9LP39 SMR:Q9LP39 EnsemblPlants:AT1G29080.1
            GeneID:839783 KEGG:ath:AT1G29080 TAIR:At1g29080 InParanoid:Q9LP39
            OMA:KTWGENG PhylomeDB:Q9LP39 Genevestigator:Q9LP39 Uniprot:Q9LP39
        Length = 346

 Score = 843 (301.8 bits), Expect = 3.4e-84, P = 3.4e-84
 Identities = 160/314 (50%), Positives = 221/314 (70%)

Query:    42 SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTN 101
             S+V+ H++WM Q  R Y DE EK++RL++  ENL++IE  N  GN++YKLG N+F+D T 
Sbjct:    34 SIVDYHQQWMIQFSRVYDDEFEKQLRLQVLTENLKFIESFNNMGNQSYKLGVNEFTDWTK 93

Query:   102 DEFRALYTGYKMPSPSHRXXXXXXFKYQ-NLSMTDV-PTSLDWRDKGAVTPIKNQKECGC 159
             +EF A YTG +  + +         K   N +++DV  T+ DWR++GAVTP+K+Q ECG 
Sbjct:    94 EEFLATYTGLRGVNVTSPFEVVNETKPAWNWTVSDVLGTNKDWRNEGAVTPVKSQGECGG 153

Query:   160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
             CWAF+A+AAVEG+TKI  GNLI LSEQQLLDC+   NNGC GG+   AF YII+++GI++
Sbjct:   154 CWAFSAIAAVEGLTKIARGNLISLSEQQLLDCTREQNNGCKGGTFVNAFNYIIKHRGISS 213

Query:   220 EDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
             E+EYPYQ   G C +  +PA   I  +E VPS +E+ALL+AVS QPV++AI A    F  
Sbjct:   214 ENEYPYQVKEGPCRSNARPAIL-IRGFENVPSNNERALLEAVSRQPVAVAIDASEAGFVH 272

Query:   280 YKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----E 334
             Y  G++N   CGT ++HAVT+VG+GT+ +G  YWL KNSWG TWG+ GY++I RD    +
Sbjct:   273 YSGGVYNARNCGTSVNHAVTLVGYGTSPEGMKYWLAKNSWGKTWGENGYIRIRRDVEWPQ 332

Query:   335 GLCGIGTRSSYPLA 348
             G+CG+   +SYP+A
Sbjct:   333 GMCGVAQYASYPVA 346


>TAIR|locus:2122113 [details] [associations]
            symbol:XCP1 "xylem cysteine peptidase 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0000325 "plant-type vacuole" evidence=IDA] [GO:0005634
            "nucleus" evidence=IDA] [GO:0010623 "developmental programmed cell
            death" evidence=IMP] [GO:0010413 "glucuronoxylan metabolic process"
            evidence=RCA] [GO:0045492 "xylan biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005886
            GO:GO:0005634 EMBL:CP002687 GenomeReviews:CT486007_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0000325
            EMBL:AL022604 EMBL:AL161587 GO:GO:0010623 MEROPS:I29.003
            HOGENOM:HOG000230773 EMBL:AF191027 EMBL:AK117394 EMBL:BT005179
            IPI:IPI00532220 PIR:T06122 RefSeq:NP_567983.1 UniGene:At.2280
            UniGene:At.67622 ProteinModelPortal:O65493 SMR:O65493 STRING:O65493
            PaxDb:O65493 PRIDE:O65493 EnsemblPlants:AT4G35350.1 GeneID:829688
            KEGG:ath:AT4G35350 GeneFarm:5033 TAIR:At4g35350 InParanoid:O65493
            KO:K16290 OMA:FEVFREN PhylomeDB:O65493 ProtClustDB:CLSN2689772
            Genevestigator:O65493 Uniprot:O65493
        Length = 355

 Score = 795 (284.9 bits), Expect = 4.2e-79, P = 4.2e-79
 Identities = 147/314 (46%), Positives = 213/314 (67%)

Query:    38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
             T+   ++E+ E WM++H ++YK   EK  R ++F+ENL +I++ N E N +Y LG N+F+
Sbjct:    42 TNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFA 100

Query:    98 DLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
             DLT++EF+  Y G   P  S +      F+Y+++  TD+P S+DWR KGAV P+K+Q +C
Sbjct:   101 DLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQGQC 158

Query:   158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
             G CWAF+ VAAVEGI +I +GNL  LSEQ+L+DC T  N+GC GG  + AF YII   G+
Sbjct:   159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGL 218

Query:   218 ATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
               ED+YPY    G C   ++      IS YE+VP  D+++L+KA++ QPVS+AI A   +
Sbjct:   219 HKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRD 278

Query:   277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
             FQ YK G+FNG CGT LDH V  VG+G+++ G++Y ++KNSWG  WG+ G++++ R+   
Sbjct:   279 FQFYKGGVFNGKCGTDLDHGVAAVGYGSSK-GSDYVIVKNSWGPRWGEKGFIRMKRNTGK 337

Query:   334 -EGLCGIGTRSSYP 346
              EGLCGI   +SYP
Sbjct:   338 PEGLCGINKMASYP 351


>TAIR|locus:2825832 [details] [associations]
            symbol:RD21A "responsive to dehydration 21A" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;IMP]
            [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISS;IDA;IMP] [GO:0009507 "chloroplast" evidence=IDA]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0008233 "peptidase
            activity" evidence=IDA] [GO:0048046 "apoplast" evidence=IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005829 "cytosol"
            evidence=RCA] [GO:0009506 "plasmodesma" evidence=IDA] [GO:0050832
            "defense response to fungus" evidence=IMP] [GO:0006096 "glycolysis"
            evidence=RCA] [GO:0006833 "water transport" evidence=RCA]
            [GO:0006972 "hyperosmotic response" evidence=RCA] [GO:0007030
            "Golgi organization" evidence=RCA] [GO:0009266 "response to
            temperature stimulus" evidence=RCA] [GO:0009651 "response to salt
            stress" evidence=RCA] [GO:0015996 "chlorophyll catabolic process"
            evidence=RCA] [GO:0042744 "hydrogen peroxide catabolic process"
            evidence=RCA] [GO:0046686 "response to cadmium ion" evidence=RCA]
            [GO:0009414 "response to water deprivation" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0009506 GO:GO:0009507 GO:GO:0005773
            GO:GO:0050832 GO:GO:0048046 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC083835
            HOGENOM:HOG000230773 KO:K01376 InterPro:IPR000118 Pfam:PF00396
            SMART:SM00277 UniGene:At.43549 EMBL:D13043 EMBL:AY072130
            EMBL:AY133781 IPI:IPI00530094 PIR:JN0719 RefSeq:NP_564497.1
            UniGene:At.47599 UniGene:At.71705 ProteinModelPortal:P43297
            SMR:P43297 IntAct:P43297 STRING:P43297 MEROPS:C01.064 PaxDb:P43297
            PRIDE:P43297 ProMEX:P43297 EnsemblPlants:AT1G47128.1 GeneID:841122
            KEGG:ath:AT1G47128 TAIR:At1g47128 InParanoid:P43297 OMA:EAWLVKH
            PhylomeDB:P43297 ProtClustDB:CLSN2688498 Genevestigator:P43297
            GermOnline:AT1G47128 Uniprot:P43297
        Length = 462

 Score = 785 (281.4 bits), Expect = 4.8e-78, P = 4.8e-78
 Identities = 147/315 (46%), Positives = 211/315 (66%)

Query:    40 EQSVVEIHEKWMAQHGRSYKDE--LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
             E  V+ I+E W+ +HG++      +EK+ R +IFK+NL ++++ N E N +Y+LG  +F+
Sbjct:    43 EAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSYRLGLTRFA 101

Query:    98 DLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
             DLTNDE+R+ Y G KM     R       +Y+     ++P S+DWR KGAV  +K+Q  C
Sbjct:   102 DLTNDEYRSKYLGAKMEKKGERRTS---LRYEARVGDELPESIDWRKKGAVAEVKDQGGC 158

Query:   158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
             G CWAF+ + AVEGI +I +G+LI LSEQ+L+DC T+ N GC GG  + AF +II+N GI
Sbjct:   159 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGI 218

Query:   218 ATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
              T+ +YPY+ V GTC   +K A    I +YE+VP+  E++L KAV+ QP+SIAI A    
Sbjct:   219 DTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRA 278

Query:   277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
             FQ Y  GIF+G CGTQLDH V  VG+GT E+G +YW+++NSWG +WG++GY+++ R+   
Sbjct:   279 FQLYDSGIFDGSCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLRMARNIAS 337

Query:   334 -EGLCGIGTRSSYPL 347
               G CGI    SYP+
Sbjct:   338 SSGKCGIAIEPSYPI 352


>TAIR|locus:2090614 [details] [associations]
            symbol:AT3G19390 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA;ISS] [GO:0000041 "transition metal ion
            transport" evidence=RCA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:CP002686 GenomeReviews:BA000014_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AB025624
            MEROPS:I29.003 OMA:KAMDQKC HSSP:O65039 HOGENOM:HOG000230773
            InterPro:IPR000118 Pfam:PF00396 SMART:SM00277 EMBL:AY062725
            EMBL:AY093350 IPI:IPI00520189 RefSeq:NP_566633.1 UniGene:At.27473
            ProteinModelPortal:Q9LT78 SMR:Q9LT78 IntAct:Q9LT78 STRING:Q9LT78
            PaxDb:Q9LT78 PRIDE:Q9LT78 EnsemblPlants:AT3G19390.1 GeneID:821473
            KEGG:ath:AT3G19390 TAIR:At3g19390 InParanoid:Q9LT78
            PhylomeDB:Q9LT78 ProtClustDB:CLSN2917188 Genevestigator:Q9LT78
            Uniprot:Q9LT78
        Length = 452

 Score = 775 (277.9 bits), Expect = 5.5e-77, P = 5.5e-77
 Identities = 150/345 (43%), Positives = 220/345 (63%)

Query:    10 SFKINTTPMFIIITLLVSCA-SQVVSSRST-HEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
             S K  T  + I   LL+S +   V ++ +T +E     ++E+W+ ++ ++Y    EKE R
Sbjct:     4 SIKSITLALLIFSVLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEKERR 63

Query:    68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFK 127
              +IFK+NL+++E+ +   NRTY++G  +F+DLTNDEFRA+Y   KM             K
Sbjct:    64 FEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKM---ERTRVPVKGEK 120

Query:   128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
             Y       +P ++DWR KGAV P+K+Q  CG CWAF+A+ AVEGI +I++G LI LSEQ+
Sbjct:   121 YLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQE 180

Query:   188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVP-GTCSAAQKPA-AAKISN 245
             L+DC T+ N+GC GG  + AF +II+N GI TE++YPY A     C++ +K      I  
Sbjct:   181 LVDCDTSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNSDKKNTRVVTIDG 240

Query:   246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
             YE+VP  DE++L KA++ QP+S+AI A    FQ Y  G+F G CGT LDH V  VG+G+ 
Sbjct:   241 YEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSLDHGVVAVGYGS- 299

Query:   306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
             E G +YW+++NSWG+ WG++GY K+ R+     G CG+   +SYP
Sbjct:   300 EGGQDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYP 344


>TAIR|locus:2167821 [details] [associations]
            symbol:RD21B "esponsive to dehydration 21B" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS] [GO:0005773
            "vacuole" evidence=IDA] [GO:0009651 "response to salt stress"
            evidence=IEP] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005829 "cytosol" evidence=IDA] [GO:0052541 "plant-type cell
            wall cellulose metabolic process" evidence=RCA] [GO:0052546 "cell
            wall pectin metabolic process" evidence=RCA] [GO:0005783
            "endoplasmic reticulum" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005783 GO:GO:0005829 EMBL:CP002688
            GO:GO:0005773 GO:GO:0009651 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AB008267 HSSP:O65039
            HOGENOM:HOG000230773 KO:K01376 InterPro:IPR000118 Pfam:PF00396
            SMART:SM00277 ProtClustDB:CLSN2688498 EMBL:AY062608 EMBL:AY114661
            IPI:IPI00520971 RefSeq:NP_568620.1 UniGene:At.24130 SMR:Q9FMH8
            IntAct:Q9FMH8 STRING:Q9FMH8 MEROPS:C01.A12
            EnsemblPlants:AT5G43060.1 GeneID:834321 KEGG:ath:AT5G43060
            TAIR:At5g43060 InParanoid:Q9FMH8 OMA:ENSEASL Genevestigator:Q9FMH8
            Uniprot:Q9FMH8
        Length = 463

 Score = 775 (277.9 bits), Expect = 5.5e-77, P = 5.5e-77
 Identities = 148/325 (45%), Positives = 216/325 (66%)

Query:    33 VSSRSTHEQSVVE-IHEKWMAQHGRSYKDE----LEKEMRLKIFKENLEYIEKANKEGNR 87
             +++ ++   S VE I+E WM +HG+   ++     EK+ R +IFK+NL +I++ N + N 
Sbjct:    35 ITTETSRSDSEVERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTK-NL 93

Query:    88 TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGA 147
             +YKLG  +F+DLTN+E+R++Y G K   P+ R       +YQ      +P S+DWR +GA
Sbjct:    94 SYKLGLTRFADLTNEEYRSMYLGAK---PTKRVLKTSD-RYQARVGDALPDSVDWRKEGA 149

Query:   148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKA 207
             V  +K+Q  CG CWAF+ + AVEGI KI +G+LI LSEQ+L+DC T+ N GC GG  + A
Sbjct:   150 VADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYA 209

Query:   208 FAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPV 266
             F +II+N GI TE +YPY+A  G C   +K A    I +YE+VP   E +L KA++ QP+
Sbjct:   210 FEFIIKNGGIDTEADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPI 269

Query:   267 SIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAG 326
             S+AI A    FQ Y  G+F+G+CGT+LDH V  VG+GT E+G +YW+++NSWGN WG++G
Sbjct:   270 SVAIEAGGRAFQLYSSGVFDGLCGTELDHGVVAVGYGT-ENGKDYWIVRNSWGNRWGESG 328

Query:   327 YMKIVRD----EGLCGIGTRSSYPL 347
             Y+K+ R+     G CGI   +SYP+
Sbjct:   329 YIKMARNIEAPTGKCGIAMEASYPI 353


>TAIR|locus:2117979 [details] [associations]
            symbol:AT4G23520 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002687 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            KO:K01376 IPI:IPI00527171 RefSeq:NP_567686.2 UniGene:At.32421
            ProteinModelPortal:F4JNL3 SMR:F4JNL3 MEROPS:C01.A22 PRIDE:F4JNL3
            EnsemblPlants:AT4G23520.1 GeneID:828452 KEGG:ath:AT4G23520
            OMA:PANDEIS ArrayExpress:F4JNL3 Uniprot:F4JNL3
        Length = 356

 Score = 767 (275.1 bits), Expect = 3.9e-76, P = 3.9e-76
 Identities = 154/346 (44%), Positives = 225/346 (65%)

Query:    16 TPMFIIITLLVSCASQVVSSRST---HEQSVVEIH---EKWMAQHGRSYKDEL-EKEMRL 68
             T +F++I  ++S  S  +   +T   H +S  E+    + WM++HG++Y + L EKE R 
Sbjct:    10 TILFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFIFQMWMSKHGKTYTNALGEKERRF 69

Query:    69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKY 128
             + FK+NL +I++ N + N +Y+LG  +F+DLT  E+R L+ G   P P  R       +Y
Sbjct:    70 QNFKDNLRFIDQHNAK-NLSYQLGLTRFADLTVQEYRDLFPG--SPKPKQRNLKTSR-RY 125

Query:   129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
               L+   +P S+DWR +GAV+ IK+Q  C  CWAF+ VAAVEG+ KI +G LI LSEQ+L
Sbjct:   126 VPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQEL 185

Query:   189 LDCSTNGNNGCLG-GSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA--AAKISN 245
             +DC+   NNGC G G  + AF ++I N G+ +E +YPYQ   G+C+  Q  +     I +
Sbjct:   186 VDCNLV-NNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQSTSNKVITIDS 244

Query:   246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
             YE+VP+ DE +L KAV+ QPVS+ +   S EF  Y+  I+NG CGT LDHA+ IVG+G+ 
Sbjct:   245 YEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGYGS- 303

Query:   306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
             E+G +YW+++NSWG TWGDAGY+KI R+    +GLCGI   +SYP+
Sbjct:   304 ENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPI 349


>TAIR|locus:2157712 [details] [associations]
            symbol:CEP1 "cysteine endopeptidase 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005783 EMBL:CP002688
            GenomeReviews:BA000015_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 PROSITE:PS00014
            EMBL:AB024031 MEROPS:I29.003 EMBL:HM367092 EMBL:AY091087
            IPI:IPI00516991 RefSeq:NP_568722.1 UniGene:At.7918 HSSP:O65039
            ProteinModelPortal:Q9FGR9 SMR:Q9FGR9 PaxDb:Q9FGR9 PRIDE:Q9FGR9
            EnsemblPlants:AT5G50260.1 GeneID:835091 KEGG:ath:AT5G50260
            TAIR:At5g50260 HOGENOM:HOG000230773 InParanoid:Q9FGR9 KO:K16292
            OMA:WHSKKYH PhylomeDB:Q9FGR9 ProtClustDB:CLSN2689970
            Genevestigator:Q9FGR9 Uniprot:Q9FGR9
        Length = 361

 Score = 755 (270.8 bits), Expect = 7.3e-75, P = 7.3e-75
 Identities = 150/345 (43%), Positives = 212/345 (61%)

Query:    19 FIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
             FI++ L +    +       H      E S+ E++E+W + H  +   E EK  R  +FK
Sbjct:     4 FIVLALCMLMVLETTKGLDFHNKDVESENSLWELYERWRSHHTVARSLE-EKAKRFNVFK 62

Query:    73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXX-----XXXXFK 127
              N+++I + NK+ +++YKL  N+F D+T++EFR  Y G  +    HR           F 
Sbjct:    63 HNVKHIHETNKK-DKSYKLKLNKFGDMTSEEFRRTYAGSNIKH--HRMFQGEKKATKSFM 119

Query:   128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
             Y N++   +PTS+DWR  GAVTP+KNQ +CG CWAF+ V AVEGI +IR+  L  LSEQ+
Sbjct:   120 YANVNT--LPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQE 177

Query:   188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNY 246
             L+DC TN N GC GG  + AF +I +  G+ +E  YPY+A   TC   ++ A    I  +
Sbjct:   178 LVDCDTNQNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGH 237

Query:   247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
             E+VP   E  L+KAV+ QPVS+AI A  ++FQ Y EG+F G CGT+L+H V +VG+GTT 
Sbjct:   238 EDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTI 297

Query:   307 DGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
             DG  YW++KNSWG  WG+ GY+++ R     EGLCGI   +SYPL
Sbjct:   298 DGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342


>TAIR|locus:505006391 [details] [associations]
            symbol:CEP3 "cysteine endopeptidase 3" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005783 EMBL:CP002686
            GenomeReviews:BA000014_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 PROSITE:PS00014
            EMBL:AL049659 HSSP:O65039 HOGENOM:HOG000230773 KO:K16292
            EMBL:AK119026 IPI:IPI00525150 PIR:T06707 RefSeq:NP_566901.1
            UniGene:At.3162 ProteinModelPortal:Q9STL5 SMR:Q9STL5 MEROPS:C01.A02
            PRIDE:Q9STL5 EnsemblPlants:AT3G48350.1 GeneID:823993
            KEGG:ath:AT3G48350 TAIR:At3g48350 InParanoid:Q9STL5 OMA:DITHHEF
            PhylomeDB:Q9STL5 ProtClustDB:CLSN2917387 Genevestigator:Q9STL5
            Uniprot:Q9STL5
        Length = 364

 Score = 749 (268.7 bits), Expect = 3.1e-74, P = 3.1e-74
 Identities = 148/318 (46%), Positives = 203/318 (63%)

Query:    40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
             E++V +++E+W   H  S     E   R  +F+ N+ ++ + NK+ N+ YKL  N+F+D+
Sbjct:    31 EENVWKLYERWRGHHSVSRASH-EAIKRFNVFRHNVLHVHRTNKK-NKPYKLKINRFADI 88

Query:   100 TNDEFRALYTGYKMPSPSHRXXX-----XXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
             T+ EFR+ Y G  +    HR           F Y+N+  T VP+S+DWR+KGAVT +KNQ
Sbjct:    89 THHEFRSSYAGSNVKH--HRMLRGPKRGSGGFMYENV--TRVPSSVDWREKGAVTEVKNQ 144

Query:   155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
             ++CG CWAF+ VAAVEGI KIR+  L+ LSEQ+L+DC T  N GC GG  E AF +I  N
Sbjct:   145 QDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQGCAGGLMEPAFEFIKNN 204

Query:   215 QGIATEDEYPYQAVPGT-CSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
              GI TE+ YPY +     C A         I  +E VP  DE+ LLKAV+ QPVS+AI A
Sbjct:   205 GGIKTEETYPYDSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAVAHQPVSVAIDA 264

Query:   273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
              S++FQ Y EG+F G CGTQL+H V IVG+G T++G  YW+++NSWG  WG+ GY++I R
Sbjct:   265 GSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIER 324

Query:   333 ----DEGLCGIGTRSSYP 346
                 +EG CGI   +SYP
Sbjct:   325 GISENEGRCGIAMEASYP 342


>TAIR|locus:2030427 [details] [associations]
            symbol:XCP2 "xylem cysteine peptidase 2" species:3702
            "Arabidopsis thaliana" [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009507 "chloroplast" evidence=ISM] [GO:0008233 "peptidase
            activity" evidence=ISS] [GO:0005618 "cell wall" evidence=IDA]
            [GO:0010623 "developmental programmed cell death" evidence=IMP]
            [GO:0010075 "regulation of meristem growth" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0005886 GO:GO:0005618 GO:GO:0005773
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AC069251 EMBL:AC007369 GO:GO:0010623
            OMA:YKEIPEG HOGENOM:HOG000230773 KO:K16290 EMBL:AF191028
            EMBL:BT004822 IPI:IPI00526722 PIR:A86341 RefSeq:NP_564126.1
            UniGene:At.21316 ProteinModelPortal:Q9LM66 SMR:Q9LM66 IntAct:Q9LM66
            STRING:Q9LM66 MEROPS:C01.120 PaxDb:Q9LM66 PRIDE:Q9LM66
            ProMEX:Q9LM66 EnsemblPlants:AT1G20850.1 GeneID:838677
            KEGG:ath:AT1G20850 GeneFarm:5034 TAIR:At1g20850 InParanoid:Q9LM66
            PhylomeDB:Q9LM66 ProtClustDB:CLSN2917031 Genevestigator:Q9LM66
            GermOnline:AT1G20850 Uniprot:Q9LM66
        Length = 356

 Score = 748 (268.4 bits), Expect = 4.0e-74, P = 4.0e-74
 Identities = 139/315 (44%), Positives = 213/315 (67%)

Query:    38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
             +H++ ++E+ E W++   ++Y+   EK +R ++FK+NL++I++ NK+G ++Y LG N+F+
Sbjct:    43 SHDK-LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFA 100

Query:    98 DLTNDEFRALYTGYKMPSPSH-RXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
             DL+++EF+ +Y G K              F Y+++    VP S+DWR KGAV  +KNQ  
Sbjct:   101 DLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEA--VPKSVDWRKKGAVAEVKNQGS 158

Query:   157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
             CG CWAF+ VAAVEGI KI +GNL  LSEQ+L+DC T  NNGC GG  + AF YI++N G
Sbjct:   159 CGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGG 218

Query:   217 IATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
             +  E++YPY    GTC   +  +    I+ +++VP+ DE++LLKA++ QP+S+AI A   
Sbjct:   219 LRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGR 278

Query:   276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
             EFQ Y  G+F+G CG  LDH V  VG+G+++ G++Y ++KNSWG  WG+ GY+++ R+  
Sbjct:   279 EFQFYSGGVFDGRCGVDLDHGVAAVGYGSSK-GSDYIIVKNSWGPKWGEKGYIRLKRNTG 337

Query:   334 --EGLCGIGTRSSYP 346
               EGLCGI   +S+P
Sbjct:   338 KPEGLCGINKMASFP 352


>TAIR|locus:2128243 [details] [associations]
            symbol:AT4G11310 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005618 "cell wall"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 GO:GO:0005618 EMBL:CP002687
            GenomeReviews:CT486007_GR EMBL:AL096882 EMBL:AL161531
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            HOGENOM:HOG000230773 KO:K01376 EMBL:AY093066 EMBL:BT000099
            IPI:IPI00520496 PIR:T13022 RefSeq:NP_567376.1 UniGene:At.43189
            ProteinModelPortal:Q9SUT0 SMR:Q9SUT0 IntAct:Q9SUT0 STRING:Q9SUT0
            MEROPS:C01.A20 PaxDb:Q9SUT0 PRIDE:Q9SUT0 EnsemblPlants:AT4G11310.1
            GeneID:826733 KEGG:ath:AT4G11310 TAIR:At4g11310 InParanoid:Q9SUT0
            OMA:EVCHGAD PhylomeDB:Q9SUT0 ProtClustDB:CLSN2689395
            Genevestigator:Q9SUT0 GermOnline:AT4G11310 Uniprot:Q9SUT0
        Length = 364

 Score = 740 (265.6 bits), Expect = 2.8e-73, P = 2.8e-73
 Identities = 154/356 (43%), Positives = 221/356 (62%)

Query:     9 GSFKINTTPMFIIITLLVSCASQVVSSRSTHEQ-----SVVE-----IHEKWMAQHGRSY 58
             GS K +   + ++  ++ SCA+ +  S  +++      SV +     I E WM +HG+ Y
Sbjct:     2 GSAK-SAMLILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLIFESWMVKHGKVY 60

Query:    59 KDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSH 118
                 EKE RL IF++NL +I   N E N +Y+LG   F+DL+  E++ +  G     P +
Sbjct:    61 GSVAEKERRLTIFEDNLRFINNRNAE-NLSYRLGLTGFADLSLHEYKEVCHGADPRPPRN 119

Query:   119 RXXXXXXFKYQNLSMTDV-PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS 177
                     +Y+  S  DV P S+DWR++GAVT +K+Q  C  CWAF+ V AVEG+ KI +
Sbjct:   120 HVFMTSSDRYKT-SADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVT 178

Query:   178 GNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK 237
             G L+ LSEQ L++C+   NNGC GG  E A+ +I++N G+ T+++YPY+AV G C    K
Sbjct:   179 GELVTLSEQDLINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLK 237

Query:   238 P--AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDH 295
                    I  YE +P+ DE AL+KAV+ QPV+  I + S EFQ Y+ G+F+G CGT L+H
Sbjct:   238 ENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNH 297

Query:   296 AVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
              V +VG+GT E+G +YWL+KNS G TWG+AGYMK+ R+     GLCGI  R+SYPL
Sbjct:   298 GVVVVGYGT-ENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPL 352


>TAIR|locus:2128253 [details] [associations]
            symbol:AT4G11320 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002687 GenomeReviews:CT486007_GR EMBL:AL096882 EMBL:AL161531
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 OMA:ICHGADP
            HOGENOM:HOG000230773 KO:K01376 ProtClustDB:CLSN2689395
            EMBL:AY035055 EMBL:AY051062 IPI:IPI00520480 PIR:T13023
            RefSeq:NP_567377.1 UniGene:At.25206 ProteinModelPortal:Q9SUS9
            SMR:Q9SUS9 STRING:Q9SUS9 MEROPS:C01.A21 PaxDb:Q9SUS9 PRIDE:Q9SUS9
            EnsemblPlants:AT4G11320.1 GeneID:826734 KEGG:ath:AT4G11320
            TAIR:At4g11320 InParanoid:Q9SUS9 PhylomeDB:Q9SUS9
            Genevestigator:Q9SUS9 GermOnline:AT4G11320 Uniprot:Q9SUS9
        Length = 371

 Score = 740 (265.6 bits), Expect = 2.8e-73, P = 2.8e-73
 Identities = 149/353 (42%), Positives = 217/353 (61%)

Query:    18 MFIIITLLVSCAS----QVVSSRSTHE--------QSVVE-----IHEKWMAQHGRSYKD 60
             +F++  ++ SCA+     VVSS   H         Q + +     + E WM +HG+ Y  
Sbjct:    10 IFLLALVIASCATAMDMSVVSSNDNHHVTAGPGRRQGIFDAEATLMFESWMVKHGKVYDS 69

Query:    61 ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRX 120
               EKE RL IF++NL +I   N E N +Y+LG N+F+DL+  E+  +  G     P +  
Sbjct:    70 VAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYGEICHGADPRPPRNHV 128

Query:   121 XXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
                   +Y+      +P S+DWR++GAVT +K+Q  C  CWAF+ V AVEG+ KI +G L
Sbjct:   129 FMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVEGLNKIVTGEL 188

Query:   181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-- 238
             + LSEQ L++C+   NNGC GG  E A+ +I+ N G+ T+++YPY+A+ G C    K   
Sbjct:   189 VTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCEGRLKEDN 247

Query:   239 AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVT 298
                 I  YE +P+ DE AL+KAV+ QPV+  + + S EFQ Y+ G+F+G CGT L+H V 
Sbjct:   248 KNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFDGTCGTNLNHGVV 307

Query:   299 IVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
             +VG+GT E+G +YW++KNS G+TWG+AGYMK+ R+     GLCGI  R+SYPL
Sbjct:   308 VVGYGT-ENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGLCGIAMRASYPL 359


>TAIR|locus:2030027 [details] [associations]
            symbol:AT1G29110 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            EMBL:CP002684 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            IPI:IPI00544534 RefSeq:NP_564322.1 UniGene:At.51816
            ProteinModelPortal:F4HZW2 SMR:F4HZW2 EnsemblPlants:AT1G29110.1
            GeneID:839786 KEGG:ath:AT1G29110 OMA:SCRANAR Uniprot:F4HZW2
        Length = 334

 Score = 737 (264.5 bits), Expect = 5.9e-73, P = 5.9e-73
 Identities = 154/347 (44%), Positives = 221/347 (63%)

Query:    13 INTTPMFIIITLLVSCASQVVSSR---STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
             ++   +F+ +T+L S   ++  +R   + +EQS+V+ H++WM Q  R YKDE EKEMRLK
Sbjct:     2 VSVRSVFVALTIL-SMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLK 60

Query:    70 IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQ 129
             +FK+NL++IE  N  GN++Y LG N+F+D   +EF A +TG ++   S           +
Sbjct:    61 VFKKNLKFIENFNNMGNQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSR 120

Query:   130 NLSMTDVPT---SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
             N +M+D+     S DWRD+GAVTP+K Q   G C           +TKI   NL+ LSEQ
Sbjct:   121 NWNMSDIDMEDESKDWRDEGAVTPVKYQ---GAC----------RLTKISGKNLLTLSEQ 167

Query:   187 QLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISN 245
             QL+DC    N GC GG  E+AF YII+N G++ E EYPYQ    +C A A++    +I  
Sbjct:   168 QLIDCDIEKNGGCNGGEFEEAFKYIIKNGGVSLETEYPYQVKKESCRANARRAPHTQIRG 227

Query:   246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFGT 304
             ++ VPS +E+ALL+AV  QPVS+ I A +  F  YK G++ G+ CGT ++HAVTIVG+GT
Sbjct:   228 FQMVPSHNERALLEAVRRQPVSVLIDARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGT 287

Query:   305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
                G NYW++KNSWG +WG+ GYM+I RD    +G+CGI   ++YP+
Sbjct:   288 MS-GLNYWVLKNSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAYPV 333


>TAIR|locus:2024362 [details] [associations]
            symbol:XBCP3 "xylem bark cysteine peptidase 3"
            species:3702 "Arabidopsis thaliana" [GO:0005576 "extracellular
            region" evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0005783 "endoplasmic
            reticulum" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005783 EMBL:CP002684 GO:GO:0005773 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 MEROPS:I29.003
            HOGENOM:HOG000230773 InterPro:IPR000118 Pfam:PF00396 SMART:SM00277
            UniGene:At.10233 OMA:CEIESAV EMBL:BT026490 EMBL:AK226753
            IPI:IPI00536687 RefSeq:NP_563855.1 ProteinModelPortal:Q0WVJ5
            SMR:Q0WVJ5 PRIDE:Q0WVJ5 EnsemblPlants:AT1G09850.1 GeneID:837517
            KEGG:ath:AT1G09850 TAIR:At1g09850 InParanoid:Q0WVJ5
            PhylomeDB:Q0WVJ5 ProtClustDB:CLSN2687747 Genevestigator:Q0WVJ5
            Uniprot:Q0WVJ5
        Length = 437

 Score = 733 (263.1 bits), Expect = 1.6e-72, P = 1.6e-72
 Identities = 143/322 (44%), Positives = 207/322 (64%)

Query:    32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKL 91
             +VSS S+ +  + E+ + W  +HG++Y  E E++ R++IFK+N +++ + N   N TY L
Sbjct:    18 LVSSSSSSDD-ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSL 76

Query:    92 GTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLS-MTDVPTSLDWRDKGAVTP 150
               N F+DLT+ EF+A   G  + +PS         K Q+L     VP S+DWR KGAVT 
Sbjct:    77 SLNAFADLTHHEFKASRLGLSVSAPS----VIMASKGQSLGGSVKVPDSVDWRKKGAVTN 132

Query:   151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
             +K+Q  CG CW+F+A  A+EGI +I +G+LI LSEQ+L+DC  + N GC GG  + AF +
Sbjct:   133 VKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEF 192

Query:   211 IIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIA 269
             +I+N GI TE +YPYQ   GTC   + K     I +Y  V S DE+AL++AV+ QPVS+ 
Sbjct:   193 VIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVG 252

Query:   270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
             I      FQ Y  GIF+G C T LDHAV IVG+G+ ++G +YW++KNSWG +WG  G+M 
Sbjct:   253 ICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGS-QNGVDYWIVKNSWGKSWGMDGFMH 311

Query:   330 IVRD----EGLCGIGTRSSYPL 347
             + R+    +G+CGI   +SYP+
Sbjct:   312 MQRNTENSDGVCGINMLASYPI 333


>TAIR|locus:2090629 [details] [associations]
            symbol:AT3G19400 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0019344 "cysteine biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            EMBL:CP002686 GenomeReviews:BA000014_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AB025624
            MEROPS:I29.003 HOGENOM:HOG000230773 EMBL:AK118509 IPI:IPI00543468
            RefSeq:NP_566634.2 UniGene:At.38409 ProteinModelPortal:Q9LT77
            SMR:Q9LT77 PaxDb:Q9LT77 PRIDE:Q9LT77 EnsemblPlants:AT3G19400.1
            GeneID:821474 KEGG:ath:AT3G19400 TAIR:At3g19400 InParanoid:Q9LT77
            OMA:IGEHERR ProtClustDB:CLSN2679975 Genevestigator:Q9LT77
            Uniprot:Q9LT77
        Length = 362

 Score = 725 (260.3 bits), Expect = 1.1e-71, P = 1.1e-71
 Identities = 146/338 (43%), Positives = 214/338 (63%)

Query:    20 IIITLLVSCASQVVSSRSTHEQSVVEI---HEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
             +I+++L+  +S  V++ +  E++  E+   +E+W+ ++ ++Y    EKE R KIFK+NL+
Sbjct:    14 VILSVLLLSSSLGVATETEIERNETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLK 73

Query:    77 YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDV 136
             ++++ N   +RT+++G  +F+DLTN+EFRA+Y   KM            + Y+   +  +
Sbjct:    74 FVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYLRKKMERTKD-SVKTERYLYKEGDV--L 130

Query:   137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
             P  +DWR  GAV  +K+Q  CG CWAF+AV AVEGI +I +G LI LSEQ+L+DC     
Sbjct:   131 PDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFV 190

Query:   196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVP-GTCSAAQK--PAAAKISNYEEVPSG 252
             N GC GG    AF +I++N GI T+ +YPY A   G C+A +        I  YE+VP  
Sbjct:   191 NAGCDGGIMNYAFEFIMKNGGIETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRD 250

Query:   253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
             DE++L KAV+ QPVS+AI A S  FQ YK G+  G CG  LDH V +VG+G+T  G +YW
Sbjct:   251 DEKSLKKAVAHQPVSVAIEASSQAFQLYKSGVMTGTCGISLDHGVVVVGYGSTS-GEDYW 309

Query:   313 LIKNSWGNTWGDAGYMKIVR--DE--GLCGIGTRSSYP 346
             +I+NSWG  WGD+GY+K+ R  D+  G CGI    SYP
Sbjct:   310 IIRNSWGLNWGDSGYVKLQRNIDDPFGKCGIAMMPSYP 347


>FB|FBgn0013770 [details] [associations]
            symbol:Cp1 "Cysteine proteinase-1" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS;NAS] [GO:0030163 "protein catabolic process"
            evidence=NAS] [GO:0005764 "lysosome" evidence=NAS] [GO:0048102
            "autophagic cell death" evidence=IEP] [GO:0035071 "salivary gland
            cell autophagic cell death" evidence=IEP] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0045169 "fusome" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:AE013599 GO:GO:0007586 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0035071 GO:GO:0045169 GeneTree:ENSGT00660000095458 KO:K01365
            EMBL:U75652 EMBL:AF012089 EMBL:BT016071 EMBL:D31970
            RefSeq:NP_523735.2 RefSeq:NP_725347.1 RefSeq:NP_725348.1
            UniGene:Dm.7400 ProteinModelPortal:Q95029 SMR:Q95029 IntAct:Q95029
            MINT:MINT-814156 STRING:Q95029 MEROPS:C01.092 PaxDb:Q95029
            EnsemblMetazoa:FBtr0087593 GeneID:36546 KEGG:dme:Dmel_CG6692
            CTD:36546 FlyBase:FBgn0013770 InParanoid:Q95029 OMA:ICHGADP
            OrthoDB:EOG46M91C PhylomeDB:Q95029 GenomeRNAi:36546 NextBio:799136
            Bgee:Q95029 GermOnline:CG6692 Uniprot:Q95029
        Length = 371

 Score = 697 (250.4 bits), Expect = 1.0e-68, P = 1.0e-68
 Identities = 145/335 (43%), Positives = 200/335 (59%)

Query:    24 LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK 83
             LL   A   V+   +    V+E    +  +H ++Y+DE E+  RLKIF EN   I K N+
Sbjct:    36 LLPLLALLAVAQAVSFADVVMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQ 95

Query:    84 ---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFK---YQNLSMTDVP 137
                EG  ++KL  N+++DL + EFR L  G+              FK   + + +   +P
Sbjct:    96 RFAEGKVSFKLAVNKYADLLHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLP 155

Query:   138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
              S+DWR KGAVT +K+Q  CG CWAF++  A+EG    +SG L+ LSEQ L+DCST  GN
Sbjct:   156 KSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGN 215

Query:   197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
             NGC GG  + AF YI  N GI TE  YPY+A+  +C   +    A    + ++P GDE+ 
Sbjct:   216 NGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKK 275

Query:   257 LLKAVS-MQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWL 313
             + +AV+ + PVS+AI A    FQ Y EG++N   C  Q LDH V +VGFGT E G +YWL
Sbjct:   276 MAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWL 335

Query:   314 IKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
             +KNSWG TWGD G++K++R+ E  CGI + SSYPL
Sbjct:   336 VKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPL 370


>TAIR|locus:2097104 [details] [associations]
            symbol:AT3G43960 species:3702 "Arabidopsis thaliana"
            [GO:0005886 "plasma membrane" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0031225 "anchored to
            membrane" evidence=TAS] [GO:0048767 "root hair elongation"
            evidence=IMP] [GO:0016132 "brassinosteroid biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:CP002686
            GenomeReviews:BA000014_GR GO:GO:0031225 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0048767 MEROPS:I29.003 HOGENOM:HOG000230773
            EMBL:AL163975 EMBL:AK118634 IPI:IPI00526842 PIR:T48950
            RefSeq:NP_566867.1 UniGene:At.43352 ProteinModelPortal:Q9LXW3
            SMR:Q9LXW3 STRING:Q9LXW3 PaxDb:Q9LXW3 PRIDE:Q9LXW3
            EnsemblPlants:AT3G43960.1 GeneID:823513 KEGG:ath:AT3G43960
            TAIR:At3g43960 eggNOG:NOG286334 InParanoid:Q9LXW3 KO:K01376
            OMA:MAISFRT PhylomeDB:Q9LXW3 ProtClustDB:CLSN2917367
            Genevestigator:Q9LXW3 GermOnline:AT3G43960 Uniprot:Q9LXW3
        Length = 376

 Score = 689 (247.6 bits), Expect = 7.2e-68, P = 7.2e-68
 Identities = 144/350 (41%), Positives = 207/350 (59%)

Query:    10 SFKINTTPMFIIITLLVSCASQVVSSRST--HEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
             SF+  T  +  +  LL+S +  VV++  +  +E  V+ ++E+W+ ++G++Y    EKE R
Sbjct:     4 SFR--TLALLTLSVLLISISLGVVTATESQRNEGEVLTMYEQWLVENGKNYNGLGEKERR 61

Query:    68 LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFK 127
              KIFK+NL+ IE+ N + NR+Y+ G N+FSDLT DEF+A Y G KM   S         +
Sbjct:    62 FKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQASYLGGKMEKKS---LSDVAER 118

Query:   128 YQNLSMTDVPTSLDWRDKGAVTP-IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
             YQ      +P  +DWR++GAV P +K Q ECG CWAFAA  AVEGI +I +G L+ LSEQ
Sbjct:   119 YQYKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQ 178

Query:   187 QLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVP-GTCSAAQKPAA--AK 242
             +L+DC   N N GC GG    AF +I +N GI +++ Y Y       C A +        
Sbjct:   179 ELIDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVT 238

Query:   243 ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQL-DHAVTIVG 301
             I+ +E VP  DE +L KAV+ QP+S+ I+A       YK G++ G C     DH V IVG
Sbjct:   239 INGHEVVPVNDEMSLKKAVAYQPISVMISA--ANMSDYKSGVYKGACSNLWGDHNVLIVG 296

Query:   302 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
             +GT+ D  +YWLI+NSWG  WG+ GY+++ R+     G C +     YP+
Sbjct:   297 YGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPTGKCAVAVAPVYPI 346


>TAIR|locus:2038515 [details] [associations]
            symbol:AT1G06260 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0048046 "apoplast"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0048046 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC025290
            MEROPS:I29.003 HSSP:O65039 HOGENOM:HOG000230773 OMA:METAFEF
            IPI:IPI00525965 PIR:D86198 RefSeq:NP_563764.1 UniGene:At.24617
            ProteinModelPortal:Q9LNC1 SMR:Q9LNC1 PaxDb:Q9LNC1 PRIDE:Q9LNC1
            EnsemblPlants:AT1G06260.1 GeneID:837137 KEGG:ath:AT1G06260
            TAIR:At1g06260 InParanoid:Q9LNC1 PhylomeDB:Q9LNC1
            ProtClustDB:CLSN2916975 Genevestigator:Q9LNC1 Uniprot:Q9LNC1
        Length = 343

 Score = 664 (238.8 bits), Expect = 3.2e-65, P = 3.2e-65
 Identities = 141/340 (41%), Positives = 196/340 (57%)

Query:    15 TTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
             T  + I   L+ S    V SS     +++ +  EKW+  H + Y    E  +R  I++ N
Sbjct:    11 TLAVLICFVLIASKLCSVDSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSN 70

Query:    75 LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS-HRXXXXXXFKYQNLSM 133
             ++ I+  N   +  +KL  N+F+D+TN EF+A + G    S   H+          N   
Sbjct:    71 VQLIDYINSL-HLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHKKQRPVCDPAGN--- 126

Query:   134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
               VP ++DWR +GAVTPI+NQ +CG CWAF+AVAA+EGI KI++GNL+ LSEQQL+DC  
Sbjct:   127 --VPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDV 184

Query:   194 NG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPS 251
                N GC GG  E AF +I  N G+ATE +YPY  + GTC   + K     I  Y++V  
Sbjct:   185 GTYNKGCSGGLMETAFEFIKTNGGLATETDYPYTGIEGTCDQEKSKNKVVTIQGYQKVAQ 244

Query:   252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
              +E +L  A + QPVS+ I A    FQ Y  G+F   CGT L+H VT+VG+G   D   Y
Sbjct:   245 -NEASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTNYCGTNLNHGVTVVGYGVEGD-QKY 302

Query:   312 WLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
             W++KNSWG  WG+ GY+++ R    D G CGI   +SYPL
Sbjct:   303 WIVKNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASYPL 342


>ZFIN|ZDB-GENE-071004-74 [details] [associations]
            symbol:zgc:174855 "zgc:174855" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-071004-74
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 MEROPS:C01.032 EMBL:BX000534 EMBL:BC152282
            IPI:IPI00773140 RefSeq:NP_001096592.1 UniGene:Dr.104905 SMR:A7MCR6
            STRING:A7MCR6 Ensembl:ENSDART00000109968 GeneID:569326
            KEGG:dre:569326 NextBio:20889622 Uniprot:A7MCR6
        Length = 335

 Score = 663 (238.4 bits), Expect = 4.1e-65, P = 4.1e-65
 Identities = 145/343 (42%), Positives = 209/343 (60%)

Query:    18 MF-IIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENL 75
             MF ++ITL   C S V ++ S   Q  ++ H   W +QHG+SY +++E   R+ I++ENL
Sbjct:     2 MFALLITL---CISAVFTAPSIDIQ--LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENL 55

Query:    76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLS 132
               IE+ N E   GN T+K+G NQF D+TN+EFR    GYK   P+ R      F     S
Sbjct:    56 RKIEQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKQ-DPN-RTSKGALF--MEPS 111

Query:   133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
                 P  +DWR +G VTP+K+QK+CG CW+F++  A+EG    ++G LI +SEQ L+DCS
Sbjct:   112 FFAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCS 171

Query:   193 T-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVP 250
                GN GC GG  ++AF Y+ +N+G+ +E  YPY A     C    +   AKI+ + ++P
Sbjct:   172 RPQGNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIP 231

Query:   251 SGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKEGIF-NGVCGTQLDHAVTIVGFGTT-ED 307
              G+E AL+ AV+ + PVS+AI A     Q Y+ GI+    C ++LDHAV +VG+G    D
Sbjct:   232 RGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGAD 291

Query:   308 --GANYWLIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYPL 347
               G  YW++KNSW + WGD GY+ + +D+   CGI T +SYPL
Sbjct:   292 VAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334


>RGD|1308751 [details] [associations]
            symbol:RGD1308751 "similar to Cathepsin L precursor (Major
            excreted protein) (MEP)" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1308751 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:CH474032 IPI:IPI00365697 RefSeq:XP_001065885.2
            RefSeq:XP_225137.5 MEROPS:C01.069 Ensembl:ENSRNOT00000061391
            GeneID:290981 KEGG:rno:290981 UCSC:RGD:1308751 CTD:290981
            OMA:ESYAYEA OrthoDB:EOG42823G NextBio:631921 Uniprot:D3ZKC3
        Length = 330

 Score = 661 (237.7 bits), Expect = 6.7e-65, P = 6.7e-65
 Identities = 132/338 (39%), Positives = 199/338 (58%)

Query:    16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
             TP+F++ TL   C   ++S+  TH+ S   + E+W  +HG++Y    E + R  +++ N+
Sbjct:     2 TPIFLLATL---CLG-MISAAPTHDPSFDTVWEEWKTKHGKTYNTNEEGQKRA-VWENNM 56

Query:    76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLS 132
             + I   N++   G   + L  N F DLTN EFR L TG++   P         F      
Sbjct:    57 KMINLHNEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQSMGPKETTIFREPF------ 110

Query:   133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
             + D+P SLDWR+ G VTP+KNQ +CG CWAF+AV ++EG    ++G L+ LSEQ L+DCS
Sbjct:   111 LGDIPKSLDWREHGYVTPVKNQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDCS 170

Query:   193 TN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
              + GN GC GG  E AF Y+ +N+G+ T + Y Y+A  G C    K +AA ++ + +VP 
Sbjct:   171 WSYGNLGCNGGLMEFAFQYVKENRGLDTGESYAYEAQDGLCRYNPKYSAANVTGFVKVPL 230

Query:   252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF-NGVCG-TQLDHAVTIVGFGTTEDGA 309
              ++  +    S+ PVS+ I ++   F+ Y  G++    C  T++DHAV +VG+G   DG 
Sbjct:   231 SEDDLMSAVASVGPVSVGIDSHHQSFRFYSGGMYYEPDCSSTEMDHAVLVVGYGEESDGG 290

Query:   310 NYWLIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYP 346
              YWL+KNSWG  WG  GY+K+ +D+   CGI T + YP
Sbjct:   291 KYWLVKNSWGEDWGMDGYIKMAKDQNNNCGIATYAIYP 328


>ZFIN|ZDB-GENE-080215-7 [details] [associations]
            symbol:zgc:174153 "zgc:174153" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-080215-7
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 EMBL:BX000534 EMBL:BX322603
            IPI:IPI00483644 Ensembl:ENSDART00000113654 OMA:ITLCISA Bgee:F1R8Y0
            Uniprot:F1R8Y0
        Length = 336

 Score = 660 (237.4 bits), Expect = 8.5e-65, P = 8.5e-65
 Identities = 144/344 (41%), Positives = 210/344 (61%)

Query:    18 MF-IIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENL 75
             MF +IITL   C S V ++ S   Q  ++ H   W +QHG+SY +++E   R+ I++ENL
Sbjct:     2 MFALIITL---CISAVFTAPSIDIQ--LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENL 55

Query:    76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLS 132
               IE+ N E   GN T+K+G NQF D+TN+EFR    GYK   P+          +   S
Sbjct:    56 RKIEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKH-DPNQTSQGPL---FMEPS 111

Query:   133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
                 P  +DWR +G VTP+K+QK+CG CW+F++  A+EG    ++G LI +SEQ L+DCS
Sbjct:   112 FFAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCS 171

Query:   193 T-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVP 250
                GN GC GG  ++AF Y+ +N+G+ +E  YPY A     C    +   AKI+ + ++P
Sbjct:   172 RPQGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIP 231

Query:   251 SGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKEGIF-NGVCGT-QLDHAVTIVGFGTT-E 306
             SG+E AL+ AV+ + PVS+AI A     Q Y+ GI+    C + +LDHAV +VG+G    
Sbjct:   232 SGNEPALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGA 291

Query:   307 D--GANYWLIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYPL 347
             D  G  YW++KNSW + WGD GY+ + +D+   CG+ T++SYPL
Sbjct:   292 DVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335


>ZFIN|ZDB-GENE-030131-572 [details] [associations]
            symbol:wu:fb37b09 "wu:fb37b09" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030131-572 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:BX465840 IPI:IPI00866294 RefSeq:XP_001923796.1
            UniGene:Dr.25683 PRIDE:E9QBE2 Ensembl:ENSDART00000133962
            GeneID:321853 KEGG:dre:321853 NextBio:20807556 Uniprot:E9QBE2
        Length = 335

 Score = 655 (235.6 bits), Expect = 2.9e-64, P = 2.9e-64
 Identities = 143/343 (41%), Positives = 209/343 (60%)

Query:    18 MF-IIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENL 75
             MF +++TL +S    V ++ S   Q  ++ H   W +QHG+SY +++E   R+ I++ENL
Sbjct:     2 MFALLVTLYISA---VFAAPSIDIQ--LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENL 55

Query:    76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLS 132
               IE+ N E   GN T+K+G NQF D+TN+EFR    GYK   P+ R      F      
Sbjct:    56 RKIEQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKH-DPN-RTSQGPLFMEPKFF 113

Query:   133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
                 P  +DWR +G VTP+K+QK+CG CW+F++  A+EG    ++G LI +SEQ L+DCS
Sbjct:   114 AA--PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCS 171

Query:   193 T-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVP 250
               +GN GC GG  ++AF Y+ +N+G+ +E  YPY A     C    +   AKI+ + ++P
Sbjct:   172 RPHGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIP 231

Query:   251 SGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKEGIF-NGVCGTQLDHAVTIVGFGTT-ED 307
              G+E AL+ AV+ + PVS+AI A     Q Y+ GI+    C +QLDHAV +VG+G    D
Sbjct:   232 KGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSQLDHAVLVVGYGYQGAD 291

Query:   308 --GANYWLIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYPL 347
               G  YW++KNSW + WGD GY+ + +D+   CGI T +SYPL
Sbjct:   292 VAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334


>RGD|1560071 [details] [associations]
            symbol:Ctsll3 "cathepsin L-like 3" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1560071 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:CH474032 IPI:IPI00560469 RefSeq:XP_001065834.2
            RefSeq:XP_573976.3 UniGene:Rn.104851 MEROPS:C01.107
            Ensembl:ENSRNOT00000061398 GeneID:498691 KEGG:rno:498691
            UCSC:RGD:1560071 CTD:70202 OMA:NCGIASD OrthoDB:EOG4HDSTZ
            NextBio:700548 Uniprot:D3ZJV2
        Length = 330

 Score = 653 (234.9 bits), Expect = 4.7e-64, P = 4.7e-64
 Identities = 138/340 (40%), Positives = 200/340 (58%)

Query:    16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
             TP+F++ TL   C   ++S+  TH+ S   + E+W  +HG++Y    E + R  +++ N+
Sbjct:     2 TPIFLLATL---CLG-MISAAPTHDPSFDTVWEEWKTKHGKTYNTNEEGQKRA-VWENNM 56

Query:    76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLS 132
             + I   N++   G   + L  N F DLTN EFR L TG++      +      F    L 
Sbjct:    57 KMINLHNEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQ----GQKTKMMKVFPEPFLG 112

Query:   133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
               DVP ++DWR  G VTP+KNQ  CG CWAF+AV ++EG    ++G L+ LSEQ L+DCS
Sbjct:   113 --DVPKTVDWRKHGYVTPVKNQGPCGSCWAFSAVGSLEGQVFRKTGKLVPLSEQNLVDCS 170

Query:   193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
              ++GN GC GG  + AF Y+  N G+ T   YPY+A+ GTC    K +AAK+  +  +P 
Sbjct:   171 WSHGNKGCDGGLPDFAFQYVKDNGGLDTSVSYPYEALNGTCRYNPKYSAAKVVGFMSIPP 230

Query:   252 GDEQALLKAVS-MQPVSIAIAAYSTEFQSYKEGIF-NGVCG-TQLDHAVTIVGFGTTEDG 308
               E AL+KAV+ + P+S+ I      FQ YK G++    C  T L+HAV +VG+G   DG
Sbjct:   231 S-ENALMKAVATVGPISVGIDIKHKSFQFYKGGMYYEPDCSSTNLNHAVLVVGYGEESDG 289

Query:   309 ANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
               YWL+KNSWG  WG  GY+K+ +D    CGI + +SYP+
Sbjct:   290 RKYWLVKNSWGRDWGMDGYIKMAKDWNNNCGIASDASYPI 329


>ZFIN|ZDB-GENE-041010-76 [details] [associations]
            symbol:ctsll "cathepsin L, like" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-041010-76
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            OrthoDB:EOG48PMKF MEROPS:C01.032 EMBL:BX119902 IPI:IPI00616622
            UniGene:Dr.79994 SMR:A2BEM8 Ensembl:ENSDART00000144226
            InParanoid:A2BEM8 OMA:PRYSAAN Uniprot:A2BEM8
        Length = 337

 Score = 651 (234.2 bits), Expect = 7.6e-64, P = 7.6e-64
 Identities = 137/342 (40%), Positives = 201/342 (58%)

Query:    18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
             M +  +L+  C S V ++  T +Q + +    W   H +SY ++ E+  R  ++++NL+ 
Sbjct:     1 MLLFASLVTLCISAVFAA-PTLDQKLDDHWHLWKRWHEKSYHEK-EEGWRRMVWEKNLKK 58

Query:    78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMT 134
             IE  N E   G  T++LG NQF D+TN+EFR    GY    P+ +       +    S  
Sbjct:    59 IELHNLEHSVGKHTFRLGMNQFGDMTNEEFRQAMNGYNR-DPNRKSKGSLFIEP---SFF 114

Query:   135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST- 193
               P  +DWR KG VTPIK+QK CG CWAF++  A+EG    ++G L+ LSEQ L+DCS  
Sbjct:   115 TAPQQIDWRQKGYVTPIKDQKRCGSCWAFSSTGALEGQVFRKTGKLVSLSEQNLMDCSRP 174

Query:   194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVPSG 252
              GNNGC GG  ++AF Y+  N G+ +E+ YPY A     C    + +AA ++ + ++PSG
Sbjct:   175 QGNNGCDGGLMDQAFQYVQDNNGLDSEESYPYLATDDQPCHYDPRYSAANVTGFVDIPSG 234

Query:   253 DEQALLKAVS-MQPVSIAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFG---TTE 306
              E AL+KAV+ + PV++AI A    FQ Y+ GI+    C T+ LDH V +VG+G      
Sbjct:   235 KEHALMKAVAAVGPVAVAIDAGHESFQFYQSGIYYEKACSTEELDHGVLVVGYGYEGVDV 294

Query:   307 DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
              G  YW++KNSW + WGD GY+ + +D +  CGI T +SYPL
Sbjct:   295 AGRRYWIVKNSWTDRWGDKGYIYMAKDLKNHCGIATSASYPL 336


>ZFIN|ZDB-GENE-980526-285 [details] [associations]
            symbol:ctsl1b "cathepsin L, 1 b" species:7955
            "Danio rerio" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005576
            "extracellular region" evidence=IDA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-980526-285 GO:GO:0005576 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:BX465840 IPI:IPI00498443 Ensembl:ENSDART00000145570
            Bgee:F1R7B3 Uniprot:F1R7B3
        Length = 352

 Score = 648 (233.2 bits), Expect = 1.6e-63, P = 1.6e-63
 Identities = 138/353 (39%), Positives = 209/353 (59%)

Query:     7 RSGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEM 66
             +S  F   + P  ++  LLV+     V +  + +  + +    W +QHG+SY +++E   
Sbjct:     4 QSVRFACESPPGRMMFALLVTLYISAVFAAPSIDIQLDDHWNSWKSQHGKSYHEDVEVGR 63

Query:    67 RLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXX 123
             R+ I++ENL  IE+ N E   GN T+K+G NQF D+TN+EFR    GY    P+      
Sbjct:    64 RM-IWEENLRKIEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYTH-DPNQTSQGP 121

Query:   124 XXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
                 +   S    P  +DWR +G VTP+K+QK+CG CW+F++  A+EG    ++G LI +
Sbjct:   122 L---FMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISM 178

Query:   184 SEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAA 241
             SEQ L+DCS   GN GC GG  ++AF Y+ +N+G+ +E  YPY A     C    +   A
Sbjct:   179 SEQNLVDCSRPQGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVA 238

Query:   242 KISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKEGIF-NGVCGT-QLDHAVT 298
             KI+ + ++PSG+E AL+ AV+ + PVS+AI A     Q Y+ GI+    C + +LDHAV 
Sbjct:   239 KITGFVDIPSGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVL 298

Query:   299 IVGFGTT-ED--GANYWLIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYPL 347
             +VG+G    D  G  YW++KNSW + WGD GY+ + +D+   CG+ T++SYPL
Sbjct:   299 VVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 351


>DICTYBASE|DDB_G0283867 [details] [associations]
            symbol:cprC "cysteine proteinase 3" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0283867 GenomeReviews:CM000153_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:AAFI02000057
            KO:K01365 EMBL:X03930 RefSeq:XP_638859.1 ProteinModelPortal:Q23894
            SMR:Q23894 MEROPS:C01.114 EnsemblProtists:DDB0220784 GeneID:8624257
            KEGG:ddi:DDB_G0283867 OMA:NNVEHIN Uniprot:Q23894
        Length = 337

 Score = 641 (230.7 bits), Expect = 8.8e-63, P = 8.8e-63
 Identities = 132/343 (38%), Positives = 209/343 (60%)

Query:    12 KINTTPMFIIITLLVSCASQV-VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
             +++ T +F +I L +S  S   V S   ++ S ++    WM  + ++Y  + E   R + 
Sbjct:     2 RLSITLIFTLIVLSISFISAGNVFSHKQYQDSFID----WMRSNNKAYTHK-EFMPRYEE 56

Query:    71 FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQN 130
             FK+N++Y+   N +G++T  LG NQ +DL+N+E+R  Y G +     +           N
Sbjct:    57 FKKNMDYVHNWNSKGSKTV-LGLNQHADLSNEEYRLNYLGTRAHIKLNGYHKRNLGLRLN 115

Query:   131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
                   P ++DWR+K AVTP+K+Q +CG C++F+   +VEG+T I++G L+ LSEQ +LD
Sbjct:   116 RPQFKQPLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNILD 175

Query:   191 CSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA-VPGTCSAAQKPAAAKISNYEE 248
             CS++ GN GC GG    AF YII+N G+ +E++YPY+  V   C   +   AAKI++Y+E
Sbjct:   176 CSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDECKFQEGSVAAKITSYKE 235

Query:   249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFGTTE 306
             + +GDE  L  A+ + PVS+AI A    FQ Y  G++    C ++ LDH V  VG GT +
Sbjct:   236 IEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHGVLAVGMGT-D 294

Query:   307 DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
             +G +Y+++KNSWG +WG  GY+ + R+ +  CGI T +SYP+A
Sbjct:   295 NGEDYYIVKNSWGPSWGLNGYIHMARNKDNNCGISTMASYPIA 337


>UNIPROTKB|F1NYJ1 [details] [associations]
            symbol:CTSL2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 IPI:IPI00602255
            OMA:DITHHEF EMBL:AADN02067812 Ensembl:ENSGALT00000020588
            ArrayExpress:F1NYJ1 Uniprot:F1NYJ1
        Length = 339

 Score = 638 (229.6 bits), Expect = 1.8e-62, P = 1.8e-62
 Identities = 139/343 (40%), Positives = 199/343 (58%)

Query:    18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
             M + +T+L  C     ++        ++ H + W + H + Y  E E+  R  ++++NL+
Sbjct:     2 MNVCLTILSLCLGLAFAAPRVDPD--LDSHWQLWKSWHSKDYH-EREESWRRVVWEKNLK 58

Query:    77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSM 133
              IE  N +   G  +YKLG NQF D+T +EFR L  GYK    S R      F     S 
Sbjct:    59 MIELHNLDHSLGKHSYKLGMNQFGDMTAEEFRQLMNGYKHKK-SERKYRGSQFLEP--SF 115

Query:   134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
              + P S+DWR+KG VTP+K+Q +CG CWAF+   A+EG    ++G L+ LSEQ L+DCS 
Sbjct:   116 LEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSR 175

Query:   194 -NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVPS 251
               GN GC GG  ++AF Y+  N GI +E+ YPY A     C    +  AA  + + ++P 
Sbjct:   176 PEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQ 235

Query:   252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFG---TT 305
             G E+AL+KAV S+ PVS+AI A  + FQ Y+ GI+    C ++ LDH V +VG+G     
Sbjct:   236 GHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGED 295

Query:   306 EDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
              DG  YW++KNSWG  WGD GY+ + +D +  CGI T +SYPL
Sbjct:   296 VDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPL 338


>DICTYBASE|DDB_G0278721 [details] [associations]
            symbol:cprD "cysteine proteinase 4" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0278721 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000024 EMBL:L36204 RefSeq:XP_641963.1
            ProteinModelPortal:P54639 SMR:P54639 MEROPS:C01.A57 PRIDE:P54639
            EnsemblProtists:DDB0214999 GeneID:8621695 KEGG:ddi:DDB_G0278721
            OMA:NAFADIT ProtClustDB:CLSZ2846820 Uniprot:P54639
        Length = 442

 Score = 522 (188.8 bits), Expect = 9.9e-62, Sum P(2) = 9.9e-62
 Identities = 122/288 (42%), Positives = 161/288 (55%)

Query:    24 LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK 83
             LLVS AS   + +   E         WM  H R+Y  E E   R +IFK N++Y+ + N 
Sbjct:    10 LLVSYAS---AKQQFSELQYRNAFTNWMQAHQRTYSSE-EFNARYQIFKSNMDYVHQWNS 65

Query:    84 EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWR 143
             +G  T  LG N F+D+TN E+R  Y G    +P          + + +  T  PT +DWR
Sbjct:    66 KGGETV-LGLNVFADITNQEYRTTYLG----TPFDGSALIGT-EEEKIFSTPAPT-VDWR 118

Query:   144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSG---NLIQLSEQQLLDCSTN-GNNGC 199
              +GAVTPIKNQ +CG CW+F+   + EG   I SG   +L+ LSEQ L+DCS + GNNGC
Sbjct:   119 AQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSEQNLIDCSKSYGNNGC 178

Query:   200 LGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVPSGDEQALL 258
              GG    AF YII N+GI TE  YPY A  G  C        A+I +Y+ V SG E +L 
Sbjct:   179 EGGLMTLAFEYIINNKGIDTESSYPYTAEDGKECKFKTSNIGAQIVSYQNVTSGSEASLQ 238

Query:   259 KAVSMQPVSIAIAAYSTEFQSYKEGIF-NGVCG-TQLDHAVTIVGFGT 304
              A +  PVS+AI A +  FQ Y+ GI+    C  TQLDH V +VG+G+
Sbjct:   239 SASNNAPVSVAIDASNESFQLYESGIYYEPACSPTQLDHGVLVVGYGS 286

 Score = 127 (49.8 bits), Expect = 9.9e-62, Sum P(2) = 9.9e-62
 Identities = 23/50 (46%), Positives = 31/50 (62%)

Query:   301 GFGTTE-DGANYWLIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYPLA 348
             G G  E    NYW++KNSWG +WG  GY+ + +D    CGI T +S+P A
Sbjct:   390 GSGAVEASSGNYWIVKNSWGTSWGMDGYIFMSKDRNNNCGIATMASFPTA 439


>DICTYBASE|DDB_G0279187 [details] [associations]
            symbol:cprG "cysteine proteinase 7" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0279187 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000030 ProtClustDB:CLSZ2846820 MEROPS:C01.081
            EMBL:U72746 RefSeq:XP_641720.2 ProteinModelPortal:Q94504 SMR:Q94504
            PRIDE:Q94504 EnsemblProtists:DDB0215005 GeneID:8621915
            KEGG:ddi:DDB_G0279187 OMA:INTETEK Uniprot:Q94504
        Length = 460

 Score = 534 (193.0 bits), Expect = 2.0e-61, Sum P(2) = 2.0e-61
 Identities = 122/293 (41%), Positives = 159/293 (54%)

Query:    18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
             M ++  L V   S   + +   E         WM  H R Y  E E   R  IFK N++Y
Sbjct:     1 MKVLSALCVLLVSVATAKQQLSEVEYRNAFTNWMIAHQRHYSSE-EFNGRYNIFKANMDY 59

Query:    78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVP 137
             + + N +G+ T  LG N F+D++N+E+RA Y G    + S           ++  + D  
Sbjct:    60 VNEWNTKGSETV-LGLNVFADISNEEYRATYLGTPFDASSLEMT-------ESDKIFDAS 111

Query:   138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSG--NLIQLSEQQLLDCSTN- 194
               +DWR +GAVTPIKNQ +CG CW+F+   A EG   + +G  NL+ LSEQ L+DCS + 
Sbjct:   112 AQVDWRTQGAVTPIKNQGQCGGCWSFSTTGATEGAQYLANGKKNLVSLSEQNLIDCSGSY 171

Query:   195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVPSGD 253
             GNNGC GG    AF YII N+GI TE  YPY A  G  C    K  AA++S+Y  V SG 
Sbjct:   172 GNNGCEGGLMTLAFEYIINNKGIDTESSYPYTAEDGKKCKFNPKNVAAQLSSYVNVTSGS 231

Query:   254 EQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG-VCG-TQLDHAVTIVGFGT 304
             E  L   V+  P S+AI A +  FQ Y  GI+N   C  TQLDH V  VGFGT
Sbjct:   232 ESDLAAKVTQGPTSVAIDASNQSFQLYVSGIYNEPACSSTQLDHGVLAVGFGT 284

 Score = 112 (44.5 bits), Expect = 2.0e-61, Sum P(2) = 2.0e-61
 Identities = 19/47 (40%), Positives = 28/47 (59%)

Query:   303 GTTEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIGTRSSYPLA 348
             G      +YW++KNSWG +WG  GY+ + + +   CGI T +S P A
Sbjct:   410 GVYPTAGDYWIVKNSWGTSWGMDGYILMTKGNNNQCGIATMASRPTA 456


>ZFIN|ZDB-GENE-030131-106 [details] [associations]
            symbol:ctsl1a "cathepsin L, 1 a" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030131-106 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 HSSP:P43235
            KO:K01365 EMBL:BC066490 IPI:IPI00495935 RefSeq:NP_997749.1
            UniGene:Dr.104499 ProteinModelPortal:Q6NYR5 SMR:Q6NYR5
            MEROPS:C01.074 PRIDE:Q6NYR5 GeneID:321453 KEGG:dre:321453
            CTD:321453 InParanoid:Q6NYR5 NextBio:20807387 ArrayExpress:Q6NYR5
            Bgee:Q6NYR5 Uniprot:Q6NYR5
        Length = 337

 Score = 627 (225.8 bits), Expect = 2.7e-61, P = 2.7e-61
 Identities = 135/342 (39%), Positives = 194/342 (56%)

Query:    18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
             M + +     C S V ++  T +Q + +  ++W   H + Y    E+  R  I+++NL+ 
Sbjct:     1 MRVFLAAFTLCLSAVFAA-PTLDQQLNDHWDQWKKWHSKKYH-ATEEGWRRVIWEKNLKK 58

Query:    78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMT 134
             IE  N E   G  TY+LG N F D+T++EFR +  G+K      R      F   N    
Sbjct:    59 IEMHNLEHSMGIHTYRLGMNHFGDMTHEEFRQVMNGFK--HKKDRRFRGSLFMEPNF--I 114

Query:   135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST- 193
             +VP  LDWR+KG VTP+K+Q ECG CWAF+   A+EG    ++G L+ LSEQ L+DCS  
Sbjct:   115 EVPNKLDWREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRP 174

Query:   194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVPSG 252
              GN GC GG  ++AF Y+    G+ +E+ YPY       C    K +AA  + + ++PSG
Sbjct:   175 EGNEGCNGGLMDQAFQYVKDQNGLDSEESYPYLGTDDQPCHFDPKNSAANDTGFVDIPSG 234

Query:   253 DEQALLKAVS-MQPVSIAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFG---TTE 306
              E+AL+KA++ + PVS+AI A    FQ Y+ GI+    C ++ LDH V  VG+G      
Sbjct:   235 KERALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDV 294

Query:   307 DGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
             DG  YW++KNSW   WGD GY+ + +D    CGI T +SYPL
Sbjct:   295 DGKKYWIVKNSWSENWGDKGYIYMAKDRHNHCGIATAASYPL 336


>DICTYBASE|DDB_G0279185 [details] [associations]
            symbol:cprF "cysteine proteinase 6" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0279185 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000030 HSSP:P07711 ProtClustDB:CLSZ2846820 EMBL:U72745
            RefSeq:XP_641725.1 ProteinModelPortal:Q94503 SMR:Q94503
            MEROPS:C01.081 PRIDE:Q94503 EnsemblProtists:DDB0215002
            GeneID:8621921 KEGG:ddi:DDB_G0279185 Uniprot:Q94503
        Length = 434

 Score = 525 (189.9 bits), Expect = 4.2e-61, Sum P(2) = 4.2e-61
 Identities = 126/316 (39%), Positives = 161/316 (50%)

Query:    18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
             M ++  L V   S   + +   E         WM  H R Y  E E   R  IFK N++Y
Sbjct:     1 MKVLSALCVLLVSVATAKQQLSELQYRNAFTNWMIAHQRHYSSE-EFNGRFNIFKANMDY 59

Query:    78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVP 137
             I + N +G+ T  LG N F+D+TN+E+RA Y G    + S          +  +      
Sbjct:    60 INEWNTKGSETV-LGLNVFADITNEEYRATYLGTPFDASSLEMTPSEKV-FGGVQAN--- 114

Query:   138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN--LIQLSEQQLLDCSTN- 194
              S+DWR KGAVTPIKNQ ECG CW+F+A  A EG   I +G+  L  +SEQQL+DCS + 
Sbjct:   115 -SVDWRAKGAVTPIKNQGECGGCWSFSATGATEGAQYIANGDSDLTSVSEQQLIDCSGSY 173

Query:   195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
             GNNGC GG    AF YII N GI TE  YP+ A    C        A++S+Y  V SG E
Sbjct:   174 GNNGCEGGLMTLAFEYIINNGGIDTESSYPFTANTEKCKYNPSNIGAELSSYVNVTSGSE 233

Query:   255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG-VCG-TQLDHAVTIVGFGTTEDGANYW 312
               L   V+  P S+AI A    FQ Y  GI+N   C  TQLDH V  VGFG+   G+   
Sbjct:   234 SDLAAKVTQGPTSVAIDASQPSFQFYSSGIYNEPACSSTQLDHGVLAVGFGSGSSGSQSQ 293

Query:   313 LI---KNSWGNTWGDA 325
                    S  N W ++
Sbjct:   294 SAGSQSQSSNNNWSES 309

 Score = 118 (46.6 bits), Expect = 4.2e-61, Sum P(2) = 4.2e-61
 Identities = 22/43 (51%), Positives = 29/43 (67%)

Query:   307 DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
             DG NYW++KNSWG  WG  GY+ + +D +  CGI T +S P A
Sbjct:   386 DG-NYWIVKNSWGLDWGINGYILMSKDKDNQCGIATMASIPQA 427


>UNIPROTKB|F1PAK0 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9615 "Canis lupus
            familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:AAEX03011051 Ensembl:ENSCAFT00000019176 OMA:YEPACTQ
            Uniprot:F1PAK0
        Length = 339

 Score = 623 (224.4 bits), Expect = 7.1e-61, P = 7.1e-61
 Identities = 134/335 (40%), Positives = 192/335 (57%)

Query:    21 IITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
             ++ LL  C+  V      H+   ++ H   W   + + YK+E E+  R  I+++NL+++ 
Sbjct:    12 LVGLLPLCSYAVAQ---VHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVM 68

Query:    80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDV 136
               N E   G  +Y LG N   D+T +E  +L    ++PS   R        Y++ S   +
Sbjct:    69 LHNLEHSMGMHSYDLGMNHLGDMTGEEVISLMGSLRVPSQWQRNVT-----YRSNSNQKL 123

Query:   137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-- 194
             P S+DWR+KG VT +K Q  CG CWAF+AV A+E   K+++G L+ LS Q L+DCST   
Sbjct:   124 PDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKY 183

Query:   195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
             GN GC GG    AF YII N GI +E  YPY+AV G C    K  AA  S Y E+P G E
Sbjct:   184 GNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAVNGKCRYDSKKRAATCSKYTELPFGSE 243

Query:   255 QALLKAVSMQ-PVSIAIAAYSTEFQSYKEGIF-NGVCGTQLDHAVTIVGFGTTEDGANYW 312
              AL +AV+ + PVS+AI A    F  Y+ G++    C   ++H V +VG+G   +G +YW
Sbjct:   244 DALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNL-NGKDYW 302

Query:   313 LIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYP 346
             L+KNSWG  +GD GY+++ R+ G  CGI +  SYP
Sbjct:   303 LVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYP 337


>MGI|MGI:88564 [details] [associations]
            symbol:Ctsl "cathepsin L" species:10090 "Mus musculus"
            [GO:0004177 "aminopeptidase activity" evidence=ISO] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISO] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005730 "nucleolus"
            evidence=NAS] [GO:0005737 "cytoplasm" evidence=ISO] [GO:0005764
            "lysosome" evidence=ISO] [GO:0005773 "vacuole" evidence=ISO]
            [GO:0005902 "microvillus" evidence=ISO] [GO:0006508 "proteolysis"
            evidence=ISO;IDA] [GO:0007154 "cell communication" evidence=ISO]
            [GO:0008152 "metabolic process" evidence=TAS] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISO;TAS] [GO:0009897 "external side of
            plasma membrane" evidence=ISO] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0030141 "secretory granule" evidence=ISO]
            [GO:0030984 "kininogen binding" evidence=ISO] [GO:0032403 "protein
            complex binding" evidence=ISO] [GO:0042277 "peptide binding"
            evidence=ISO] [GO:0042393 "histone binding" evidence=ISO;NAS]
            [GO:0043005 "neuron projection" evidence=ISO] [GO:0043204
            "perikaryon" evidence=ISO] [GO:0045177 "apical part of cell"
            evidence=ISO] [GO:0048863 "stem cell differentiation" evidence=NAS]
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IMP] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:88564 GO:GO:0005730 GO:GO:0009897 GO:GO:0034698
            GO:GO:0043204 GO:GO:0009749 GO:GO:0030141 GO:GO:0048863
            GO:GO:0051384 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045177 GO:GO:0043005
            GO:GO:0007283 GO:GO:0004177 GO:GO:0005764 GO:GO:0042277
            GO:GO:0009267 GO:GO:0021675 GO:GO:0042393 GO:GO:0005902
            GO:GO:0010259 GO:GO:0004197 GO:GO:0048102 GO:GO:0046697
            HOVERGEN:HBG011513 KO:K01365 OMA:EEFRATH OrthoDB:EOG48PMKF
            MEROPS:C01.032 BRENDA:3.4.22.15 ChiTaRS:CTSL1 EMBL:X06086
            EMBL:J02583 EMBL:M20495 EMBL:AF121837 EMBL:AF121838 EMBL:AF121839
            EMBL:BC068163 EMBL:X04392 IPI:IPI00128154 PIR:S01177
            RefSeq:NP_034114.1 UniGene:Mm.930 PDB:1MVV PDBsum:1MVV
            ProteinModelPortal:P06797 SMR:P06797 STRING:P06797
            PhosphoSite:P06797 PaxDb:P06797 PRIDE:P06797
            Ensembl:ENSMUST00000021933 GeneID:13039 KEGG:mmu:13039 CTD:13039
            InParanoid:P06797 BioCyc:MetaCyc:MONOMER-14812 BindingDB:P06797
            ChEMBL:CHEMBL5291 NextBio:282928 Bgee:P06797 CleanEx:MM_CTSL
            Genevestigator:P06797 GermOnline:ENSMUSG00000021477 GO:GO:0060008
            Uniprot:P06797
        Length = 334

 Score = 622 (224.0 bits), Expect = 9.0e-61, P = 9.0e-61
 Identities = 132/341 (38%), Positives = 199/341 (58%)

Query:    18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
             M +++ L V C    +++    +    E H+ W + H R Y    E+E R  I+++N+  
Sbjct:     1 MNLLLLLAVLCLGTALATPKFDQTFSAEWHQ-WKSTHRRLYGTN-EEEWRRAIWEKNMRM 58

Query:    78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMT 134
             I+  N E   G   + +  N F D+TN+EFR +  GY+     H+        +Q   M 
Sbjct:    59 IQLHNGEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYR--HQKHKKGRL----FQEPLML 112

Query:   135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
              +P S+DWR+KG VTP+KNQ +CG CWAF+A   +EG   +++G LI LSEQ L+DCS  
Sbjct:   113 KIPKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHA 172

Query:   194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
              GN GC GG  + AF YI +N G+ +E+ YPY+A  G+C    + A A  + + ++P   
Sbjct:   173 QGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQ-Q 231

Query:   254 EQALLKAVS-MQPVSIAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFGT--TEDG 308
             E+AL+KAV+ + P+S+A+ A     Q Y  GI+    C ++ LDH V +VG+G   T+  
Sbjct:   232 EKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSN 291

Query:   309 AN-YWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
              N YWL+KNSWG+ WG  GY+KI +D +  CG+ T +SYP+
Sbjct:   292 KNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPV 332


>ZFIN|ZDB-GENE-040718-61 [details] [associations]
            symbol:ctsl.1 "cathepsin L.1" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-040718-61
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            GeneTree:ENSGT00660000095458 MEROPS:C01.092 EMBL:FP015965
            EMBL:BC075887 IPI:IPI00513499 RefSeq:NP_001002368.1
            UniGene:Dr.85174 SMR:Q6DHT0 Ensembl:ENSDART00000017756
            GeneID:436641 KEGG:dre:436641 CTD:436641 InParanoid:Q6DHT0
            OMA:GGQMENA OrthoDB:EOG41ZFB9 NextBio:20831086 Uniprot:Q6DHT0
        Length = 334

 Score = 621 (223.7 bits), Expect = 1.2e-60, P = 1.2e-60
 Identities = 131/332 (39%), Positives = 200/332 (60%)

Query:    24 LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK 83
             L+V+ A   V+S ++     +E H  W  + G+SY+   E+  R   +  N + +   N 
Sbjct:     4 LVVAAAFLAVASAASLSLEDMEFHA-WKLKFGKSYRSAEEESHRQLTWLTNRKLVLVHNM 62

Query:    84 ---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSL 140
                +G ++Y+LG   F+D++N+E+R L     + S ++         ++      VP ++
Sbjct:    63 MADQGLKSYRLGMTYFADMSNEEYRQLVFRGCLGSMNNTKARGGSTFFRLRKAAVVPDTV 122

Query:   141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGC 199
             DWRDKG VT IK+QK+CG CWAF+A  ++EG T  ++G L+ LSEQQL+DCS + GN GC
Sbjct:   123 DWRDKGYVTDIKDQKQCGSCWAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGSYGNYGC 182

Query:   200 LGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLK 259
              GG  ++AF YI  N+G+ TED YPY+A  G C        A  + Y ++ SGDE AL +
Sbjct:   183 DGGLMDQAFQYIEANKGLDTEDSYPYEAQDGECRFNPSTVGASCTGYVDIASGDESALQE 242

Query:   260 AVS-MQPVSIAIAAYSTEFQSYKEGIFNGV-CGT-QLDHAVTIVGFGTTEDGANYWLIKN 316
             AV+ + P+S+AI A  + FQ Y  G++N   C + +LDH V  VG+G++ +G +YW++KN
Sbjct:   243 AVATIGPISVAIDAGHSSFQLYSSGVYNEPDCSSSELDHGVLAVGYGSS-NGDDYWIVKN 301

Query:   317 SWGNTWGDAGYMKIVRDEG-LCGIGTRSSYPL 347
             SWG  WG  GY+ + R++   CGI T +SYPL
Sbjct:   302 SWGLDWGVQGYILMSRNKSNQCGIATAASYPL 333


>UNIPROTKB|P25326 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9913 "Bos taurus"
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0016020 "membrane" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0002250 "adaptive
            immune response" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0016020 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0002250 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            GO:GO:0097067 EMBL:BC102245 EMBL:M95211 EMBL:X62001 IPI:IPI00702008
            PIR:S15844 RefSeq:NP_001028787.1 UniGene:Bt.7938
            ProteinModelPortal:P25326 SMR:P25326 STRING:P25326 PRIDE:P25326
            Ensembl:ENSBTAT00000022774 GeneID:327711 KEGG:bta:327711 CTD:1520
            InParanoid:P25326 KO:K01368 OMA:KAMDQKC OrthoDB:EOG4JM7Q2
            NextBio:20810175 Uniprot:P25326
        Length = 331

 Score = 620 (223.3 bits), Expect = 1.5e-60, P = 1.5e-60
 Identities = 133/335 (39%), Positives = 195/335 (58%)

Query:    21 IITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
             ++  L+ C+S +      H    ++ H + W   +G+ YK++ E+  R  I+++NL+ + 
Sbjct:     4 LVWALLLCSSAMAH---VHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVT 60

Query:    80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDV 136
               N E   G  +Y+LG N   D+T++E  +L +  ++PS   R         Q L     
Sbjct:    61 LHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVPSQWPRNVTYKSDPNQKL----- 115

Query:   137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-- 194
             P S+DWR+KG VT +K Q  CG CWAF+AV A+E   K+++G L+ LS Q L+DCST   
Sbjct:   116 PDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKY 175

Query:   195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
             GN GC GG   +AF YII N GI +E  YPY+A+ G C    K  AA  S Y E+P G E
Sbjct:   176 GNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDVKNRAATCSRYIELPFGSE 235

Query:   255 QALLKAVSMQ-PVSIAIAAYSTEFQSYKEGIF-NGVCGTQLDHAVTIVGFGTTEDGANYW 312
             +AL +AV+ + PVS+ I A  + F  YK G++ +  C   ++H V +VG+G   DG +YW
Sbjct:   236 EALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNL-DGKDYW 294

Query:   313 LIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYP 346
             L+KNSWG  +GD GY+++ R+ G  CGI    SYP
Sbjct:   295 LVKNSWGLHFGDQGYIRMARNSGNHCGIANYPSYP 329


>UNIPROTKB|Q8HY81 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            CTD:1520 KO:K01368 OrthoDB:EOG4JM7Q2 EMBL:AY156692
            RefSeq:NP_001002938.2 UniGene:Cfa.1661 ProteinModelPortal:Q8HY81
            SMR:Q8HY81 STRING:Q8HY81 MEROPS:C01.034 GeneID:403400
            KEGG:cfa:403400 InParanoid:Q8HY81 NextBio:20816922 Uniprot:Q8HY81
        Length = 331

 Score = 620 (223.3 bits), Expect = 1.5e-60, P = 1.5e-60
 Identities = 133/335 (39%), Positives = 192/335 (57%)

Query:    21 IITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
             ++ LL  C+  V      H+   ++ H   W   + + YK+E E+  R  I+++NL+++ 
Sbjct:     4 LVGLLPLCSYAVAQ---VHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVM 60

Query:    80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDV 136
               N E   G  +Y LG N   D+T +E  +L    ++PS   R        Y++ S   +
Sbjct:    61 LHNLEHSMGMHSYDLGMNHLGDMTGEEVISLMGSLRVPSQWQRNVT-----YRSNSNQKL 115

Query:   137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-- 194
             P S+DWR+KG VT +K Q  CG CWAF+AV A+E   K+++G L+ LS Q L+DCST   
Sbjct:   116 PDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKY 175

Query:   195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
             GN GC GG    AF YII N GI +E  YPY+A+ G C    K  AA  S Y E+P G E
Sbjct:   176 GNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKCRYDSKKRAATCSKYTELPFGSE 235

Query:   255 QALLKAVSMQ-PVSIAIAAYSTEFQSYKEGIF-NGVCGTQLDHAVTIVGFGTTEDGANYW 312
              AL +AV+ + PVS+AI A    F  Y+ G++    C   ++H V +VG+G   +G +YW
Sbjct:   236 DALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNL-NGKDYW 294

Query:   313 LIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYP 346
             L+KNSWG  +GD GY+++ R+ G  CGI +  SYP
Sbjct:   295 LVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYP 329


>MGI|MGI:107341 [details] [associations]
            symbol:Ctss "cathepsin S" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=ISO;IDA] [GO:0006508 "proteolysis"
            evidence=IDA] [GO:0008152 "metabolic process" evidence=ISO]
            [GO:0008233 "peptidase activity" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0009986 "cell
            surface" evidence=ISO] [GO:0016020 "membrane" evidence=IDA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0045453 "bone
            resorption" evidence=ISO] [GO:0051930 "regulation of sensory
            perception of pain" evidence=ISO] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:107341 GO:GO:0016020 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0008233 GO:GO:0031905 Reactome:REACT_102124
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 BRENDA:3.4.22.27
            ChiTaRS:CTSS EMBL:AF051732 EMBL:AF051727 EMBL:AF051728
            EMBL:AF051729 EMBL:AF051726 EMBL:AF051730 EMBL:AF051731
            EMBL:AF038546 EMBL:AJ002386 EMBL:AC092203 EMBL:Y18466 EMBL:AJ223208
            IPI:IPI00309520 UniGene:Mm.3619 PDB:1M0H PDBsum:1M0H
            ProteinModelPortal:O70370 SMR:O70370 STRING:O70370
            PhosphoSite:O70370 PaxDb:O70370 PRIDE:O70370
            Ensembl:ENSMUST00000116304 BindingDB:O70370 ChEMBL:CHEMBL4098
            NextBio:282932 Bgee:O70370 CleanEx:MM_CTSS Genevestigator:O70370
            GermOnline:ENSMUSG00000038642 Uniprot:O70370
        Length = 340

 Score = 620 (223.3 bits), Expect = 1.5e-60, P = 1.5e-60
 Identities = 130/324 (40%), Positives = 188/324 (58%)

Query:    33 VSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRT 88
             V+         ++ H + W   H + YKD+ E+E+R  I+++NL++I   N E   G  T
Sbjct:    21 VAMEQLQRDPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHT 80

Query:    89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAV 148
             Y++G N   D+TN+E        ++P  S +        Y N ++   P ++DWR+KG V
Sbjct:    81 YQVGMNDMGDMTNEEILCRMGALRIPRQSPKTVTFRS--YSNRTL---PDTVDWREKGCV 135

Query:   149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSRE 205
             T +K Q  CG CWAF+AV A+EG  K+++G LI LS Q L+DCS     GN GC GG   
Sbjct:   136 TEVKYQGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMT 195

Query:   206 KAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ- 264
             +AF YII N GI  +  YPY+A    C    K  AA  S Y ++P GDE AL +AV+ + 
Sbjct:   196 EAFQYIIDNGGIEADASYPYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKG 255

Query:   265 PVSIAIAAYSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWG 323
             PVS+ I A  + F  YK G+++   C   ++H V +VG+GT  DG +YWL+KNSWG  +G
Sbjct:   256 PVSVGIDASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFG 314

Query:   324 DAGYMKIVRD-EGLCGIGTRSSYP 346
             D GY+++ R+ +  CGI +  SYP
Sbjct:   315 DQGYIRMARNNKNHCGIASYCSYP 338


>UNIPROTKB|Q86GF7 [details] [associations]
            symbol:Cys "Crustapain" species:6703 "Pandalus borealis"
            [GO:0005576 "extracellular region" evidence=IC] [GO:0007586
            "digestion" evidence=NAS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IDA] [GO:0030163 "protein catabolic process"
            evidence=NAS] [GO:0030574 "collagen catabolic process"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005576
            GO:GO:0007586 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0030163 GO:GO:0030574 EMBL:AB091669
            ProteinModelPortal:Q86GF7 SMR:Q86GF7 MEROPS:C01.030 Uniprot:Q86GF7
        Length = 323

 Score = 618 (222.6 bits), Expect = 2.4e-60, P = 2.4e-60
 Identities = 129/308 (41%), Positives = 189/308 (61%)

Query:    48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDLTNDEF 104
             E +  + G+ Y +  E+  R+ +F + L++I++ N+   +G  TY L  N FSDLT++E 
Sbjct:    21 ENFKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHEEV 80

Query:   105 RALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
              A  TG  M    H          ++   T +   +DWR+KGAVTP+K+Q +CG CWAF+
Sbjct:    81 LATKTG--MTRRRHPLSVLP----KSAPTTPMAADVDWRNKGAVTPVKDQGQCGSCWAFS 134

Query:   165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
             AVAA+EG   +++G+L+ LSEQ L+DCS++ GN GC GG   +A+ YII N+GI TE  Y
Sbjct:   135 AVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRGIDTESSY 194

Query:   224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
             PY+A+   C        A +S+Y E  SGDE AL  AV  + PVS+ I A  + F SY  
Sbjct:   195 PYKAIDDNCRYDAGNIGATVSSYVEPASGDESALQHAVQNEGPVSVCIDAGQSSFGSYGG 254

Query:   283 GIF-NGVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 339
             G++    C +   +HAVT VG+GT  +G +YW++KNSWG  WG++GY+K+ R+ +  C I
Sbjct:   255 GVYYEPNCDSWYANHAVTAVGYGTDANGGDYWIVKNSWGAWWGESGYIKMARNRDNNCAI 314

Query:   340 GTRSSYPL 347
              T S YP+
Sbjct:   315 ATYSVYPV 322


>DICTYBASE|DDB_G0272815 [details] [associations]
            symbol:cprE "cysteine proteinase 5" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0272815 GO:GO:0005615
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GenomeReviews:CM000151_GR GO:GO:0005764
            EMBL:AAFI02000008 MEROPS:I29.003 KO:K01376 EMBL:L36205
            RefSeq:XP_644977.1 ProteinModelPortal:P54640 SMR:P54640
            PRIDE:P54640 EnsemblProtists:DDB0185092 GeneID:8618654
            KEGG:ddi:DDB_G0272815 OMA:METAFEF ProtClustDB:CLSZ2430780
            Uniprot:P54640
        Length = 344

 Score = 523 (189.2 bits), Expect = 3.8e-60, Sum P(2) = 3.8e-60
 Identities = 117/305 (38%), Positives = 164/305 (53%)

Query:    18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
             M ++  L V   S   + +   E         WM  H +SY  E E   R  IFK N++Y
Sbjct:     1 MKVLSFLCVLLVSVATAKQQFSELQYRNAFTDWMITHQKSYTSE-EFGARYNIFKANMDY 59

Query:    78 IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVP 137
             +++ N +G+ T  LG N F+D+TN+E+R  Y G K  + S         + + +  T   
Sbjct:    60 VQQWNSKGSETV-LGLNNFADITNEEYRNTYLGTKFDASS-----LIGTQEEKVFTTSSA 113

Query:   138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
              S DWR +GAVTP+KNQ +CG CW+F+   + EG      G L+ LSEQ L+DCST  N+
Sbjct:   114 ASKDWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCSTE-NS 172

Query:   198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
             GC GG    AF YII N GI TE  YPY+A  G C    + + A +S+Y+ V +G E +L
Sbjct:   173 GCDGGLMTYAFEYIINNNGIDTESSYPYKAENGKCEYKSENSGATLSSYKTVTAGSESSL 232

Query:   258 LKAVSMQPVSIAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFGTTEDGANYWLIK 315
               AV++ PVS+AI A    FQ Y  GI+    C ++ LDH V  VG+G+    ++     
Sbjct:   233 ESAVNVNPVSVAIDASHQSFQLYTSGIYYEPECSSENLDHGVLAVGYGSGSGSSSGQSSG 292

Query:   316 NSWGN 320
              S GN
Sbjct:   293 QSSGN 297

 Score = 111 (44.1 bits), Expect = 3.8e-60, Sum P(2) = 3.8e-60
 Identities = 17/38 (44%), Positives = 28/38 (73%)

Query:   311 YWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
             YW++KNSWG +WG  GY+ + R+ +  CGI + +S+P+
Sbjct:   306 YWIVKNSWGTSWGIEGYILMSRNRDNNCGIASSASFPV 343


>UNIPROTKB|F1SS93 [details] [associations]
            symbol:CTSS "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0002250 "adaptive immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0016020 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0002250
            GeneTree:ENSGT00560000076577 GO:GO:0097067 EMBL:CU463875
            Ensembl:ENSSSCT00000007284 OMA:CEIESAV Uniprot:F1SS93
        Length = 342

 Score = 615 (221.5 bits), Expect = 5.0e-60, P = 5.0e-60
 Identities = 132/338 (39%), Positives = 197/338 (58%)

Query:    18 MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
             M  ++ +L+ C+S +      H    ++ H + W   +G+ YK++ E+  R  I+++NL+
Sbjct:    12 MKCLVWVLLLCSSAMAQ---LHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLK 68

Query:    77 YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSM 133
              +   N E   G  +Y LG N   D+T++E  +L +  ++PS   R        Y++   
Sbjct:    69 TVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVT-----YKSNPN 123

Query:   134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
               +P S+DWR+KG VT +K Q  CG CWAF+AV A+E   K+++G L+ LS Q L+DCST
Sbjct:   124 QKLPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCST 183

Query:   194 NG--NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
                 N GC GG   +AF YII N GI +E  YPY+AV G C    K  AA  S Y E+P 
Sbjct:   184 EKYRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPF 243

Query:   252 GDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGIF-NGVCGTQLDHAVTIVGFGTTEDGA 309
              DE AL +AV+ + PVS+AI A  + F  Y+ G++ +  C   ++H V +VG+G   +G 
Sbjct:   244 ADEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQNVNHGVLVVGYGNL-NGK 302

Query:   310 NYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
             +YWL+KNSWG  +GD GY+++ R+ E  CGI    SYP
Sbjct:   303 DYWLVKNSWGLNFGDGGYIRMARNSENHCGIANYPSYP 340


>UNIPROTKB|P07711 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9606 "Homo sapiens"
            [GO:0005576 "extracellular region" evidence=NAS] [GO:0005764
            "lysosome" evidence=IDA;NAS] [GO:0006508 "proteolysis"
            evidence=IDA] [GO:0002250 "adaptive immune response" evidence=IEP]
            [GO:0019882 "antigen processing and presentation" evidence=TAS]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0036021 "endolysosome lumen" evidence=TAS] [GO:0043202
            "lysosomal lumen" evidence=TAS] [GO:0045087 "innate immune
            response" evidence=TAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042393 "histone binding" evidence=IDA] [GO:0005634 "nucleus"
            evidence=TAS] [GO:0071888 "macrophage apoptotic process"
            evidence=NAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            Reactome:REACT_118779 EMBL:X12451 GO:GO:0005634 Reactome:REACT_6900
            GO:GO:0005576 GO:GO:0019886 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087
            GO:GO:0042393 GO:GO:0004197 GO:GO:0002250 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0036021 KO:K01365 OrthoDB:EOG48PMKF EMBL:M20496
            EMBL:CR457053 EMBL:BX537395 EMBL:AL160279 EMBL:BC012612 EMBL:X05256
            IPI:IPI00012887 PIR:S01002 RefSeq:NP_001244900.1
            RefSeq:NP_001244901.1 RefSeq:NP_001903.1 RefSeq:NP_666023.1
            UniGene:Hs.731507 UniGene:Hs.731952 PDB:1CJL PDB:1CS8 PDB:1ICF
            PDB:1MHW PDB:2NQD PDB:2VHS PDB:2XU1 PDB:2XU3 PDB:2XU4 PDB:2XU5
            PDB:2YJ2 PDB:2YJ8 PDB:2YJ9 PDB:2YJB PDB:2YJC PDB:3BC3 PDB:3H89
            PDB:3H8B PDB:3H8C PDB:3HHA PDB:3HWN PDB:3IV2 PDB:3K24 PDB:3KSE
            PDB:3OF8 PDB:3OF9 PDBsum:1CJL PDBsum:1CS8 PDBsum:1ICF PDBsum:1MHW
            PDBsum:2NQD PDBsum:2VHS PDBsum:2XU1 PDBsum:2XU3 PDBsum:2XU4
            PDBsum:2XU5 PDBsum:2YJ2 PDBsum:2YJ8 PDBsum:2YJ9 PDBsum:2YJB
            PDBsum:2YJC PDBsum:3BC3 PDBsum:3H89 PDBsum:3H8B PDBsum:3H8C
            PDBsum:3HHA PDBsum:3HWN PDBsum:3IV2 PDBsum:3K24 PDBsum:3KSE
            PDBsum:3OF8 PDBsum:3OF9 ProteinModelPortal:P07711 SMR:P07711
            IntAct:P07711 STRING:P07711 MEROPS:I29.001 PhosphoSite:P07711
            DMDM:115741 PaxDb:P07711 PeptideAtlas:P07711 PRIDE:P07711
            DNASU:1514 Ensembl:ENST00000340342 Ensembl:ENST00000343150
            GeneID:1514 KEGG:hsa:1514 UCSC:uc004aph.3 CTD:1514
            GeneCards:GC09P090341 H-InvDB:HIX0058839 H-InvDB:HIX0170314
            HGNC:HGNC:2537 HPA:CAB000459 MIM:116880 neXtProt:NX_P07711
            PharmGKB:PA162382890 InParanoid:P07711 OMA:REPLFAQ PhylomeDB:P07711
            BRENDA:3.4.22.15 BindingDB:P07711 ChEMBL:CHEMBL3837 ChiTaRS:CTSL1
            DrugBank:DB00040 EvolutionaryTrace:P07711 GenomeRNAi:1514
            NextBio:6271 PMAP-CutDB:P07711 ArrayExpress:P07711 Bgee:P07711
            CleanEx:HS_CTSL1 Genevestigator:P07711 GermOnline:ENSG00000135047
            GO:GO:0071888 Uniprot:P07711
        Length = 333

 Score = 612 (220.5 bits), Expect = 1.0e-59, P = 1.0e-59
 Identities = 130/334 (38%), Positives = 194/334 (58%)

Query:    24 LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN- 82
             +L +    + S+  T + S+     KW A H R Y    E+  R  ++++N++ IE  N 
Sbjct:     6 ILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKMIELHNQ 64

Query:    83 --KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSL 140
               +EG  ++ +  N F D+T++EFR +  G++   P           +Q     + P S+
Sbjct:    65 EYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKV------FQEPLFYEAPRSV 118

Query:   141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGC 199
             DWR+KG VTP+KNQ +CG CWAF+A  A+EG    ++G LI LSEQ L+DCS   GN GC
Sbjct:   119 DWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGC 178

Query:   200 LGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLK 259
              GG  + AF Y+  N G+ +E+ YPY+A   +C    K + A  + + ++P   E+AL+K
Sbjct:   179 NGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKALMK 237

Query:   260 AVS-MQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFG--TTE-DGANYWL 313
             AV+ + P+S+AI A    F  YKEGI F   C ++ +DH V +VG+G  +TE D   YWL
Sbjct:   238 AVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWL 297

Query:   314 IKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 346
             +KNSWG  WG  GY+K+ +D    CGI + +SYP
Sbjct:   298 VKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYP 331


>UNIPROTKB|A4IFS7 [details] [associations]
            symbol:CTSL1 "CTSL1 protein" species:9913 "Bos taurus"
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197 GO:GO:0002250
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 GO:GO:0097067
            OrthoDB:EOG48PMKF MEROPS:C01.032 CTD:1514 EMBL:DAAA02023987
            EMBL:BC134741 IPI:IPI00708619 RefSeq:NP_001077155.1
            UniGene:Bt.23199 SMR:A4IFS7 Ensembl:ENSBTAT00000000962
            GeneID:515200 KEGG:bta:515200 InParanoid:A4IFS7 OMA:NDEQALM
            NextBio:20871707 Uniprot:A4IFS7
        Length = 333

 Score = 609 (219.4 bits), Expect = 2.2e-59, P = 2.2e-59
 Identities = 127/334 (38%), Positives = 193/334 (57%)

Query:    24 LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK 83
             LL +    + S+    + S+    + W A H + Y D  E+  R  ++K+N++ IE  N+
Sbjct:     6 LLTALCLGIASAAPKFDHSLDTQWKLWKAAHRKPY-DLNEEGWRKAVWKKNMKMIELHNQ 64

Query:    84 E---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSL 140
             E   G  ++ +  N F D+TN+EFR    G++      R       ++       +P S+
Sbjct:    65 EYSQGKHSFSMAMNAFGDMTNEEFRHTMNGFQ------RQKNKKGKEFHETIFASIPPSV 118

Query:   141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGC 199
             DWR+KG VTP+KNQ +CG CWAF+A  A+EG    ++G L+ LSEQ L+DCS   GN GC
Sbjct:   119 DWREKGYVTPVKNQGKCGSCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQPEGNRGC 178

Query:   200 LGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLK 259
              GG  + AF Y++   G+ +E+ YPY  + GTC      +AA  + + ++P   E+AL+K
Sbjct:   179 HGGFIDNAFQYVLDVGGLDSEESYPYTGLVGTCLYNPNNSAANETGFVDLPK-QEKALMK 237

Query:   260 AVS-MQPVSIAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFG---TTEDGANYWL 313
             AV+ + P+S+A+ A++  FQ YK GI+    C ++ +DHAV +VG+G      D   YWL
Sbjct:   238 AVANLGPISVAVDAHNPSFQFYKSGIYYEPNCSSESVDHAVLVVGYGFEGADSDDNKYWL 297

Query:   314 IKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYP 346
             +KNSWG  WG  GY+K+ +D    CGI T +SYP
Sbjct:   298 VKNSWGEHWGMNGYIKMAKDRNNHCGIATMASYP 331


>UNIPROTKB|F1S4J6 [details] [associations]
            symbol:Ssc.54235 "Cathepsin L1" species:9823 "Sus scrofa"
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197
            GO:GO:0002250 GeneTree:ENSGT00560000076577 GO:GO:0097067
            EMBL:CU571031 RefSeq:XP_003130681.1 Ensembl:ENSSSCT00000011983
            GeneID:100515919 KEGG:ssc:100515919 OMA:IAICATK Uniprot:F1S4J6
        Length = 332

 Score = 609 (219.4 bits), Expect = 2.2e-59, P = 2.2e-59
 Identities = 132/334 (39%), Positives = 194/334 (58%)

Query:    24 LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN- 82
             LL +    + S+   H+ S+     KW A H + Y    E+  R  I+++N++ IE+ N 
Sbjct:     6 LLAAFCLGIASAAPRHDHSLDADWYKWKATHRKLYGLN-EEGRRRAIWEKNMKMIERHNW 64

Query:    83 --KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSL 140
               ++G  ++ +  N F D+TN+EFR    G++  +  H+      F     ++T  P S+
Sbjct:    65 EHRQGKHSFTMAMNAFGDMTNEEFRKTMNGFQ--NQKHKKGKV--FLDAGSALT--PHSV 118

Query:   141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGC 199
             DWR+KG VT +KNQ  CG CWAF+A  A+EG    ++  LI LSEQ L+DCS   GN GC
Sbjct:   119 DWREKGYVTAVKNQGHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSWPEGNEGC 178

Query:   200 LGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLK 259
              GG  + AF YI  N G+ +E+ YPY    G+C    + +AA  + Y ++P   E+AL+K
Sbjct:   179 NGGLMDNAFQYIKDNGGLDSEESYPYFGKDGSCKYKPQSSAANDTGYVDIPK-QEKALMK 237

Query:   260 AVS-MQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGAN--YWLI 314
             AV+ + P+S+ I A    FQ Y  GI F   C ++ LDH V +VG+G     +N  YWL+
Sbjct:   238 AVATVGPISVGIDASHESFQFYSTGIYFEPQCSSEDLDHGVLVVGYGVEGAHSNNKYWLV 297

Query:   315 KNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYPL 347
             KNSWGNTWG  GY+K+ +D+   CGI T +SYP+
Sbjct:   298 KNSWGNTWGMDGYIKMTKDQNNHCGIATMASYPV 331


>UNIPROTKB|P25774 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0016020 "membrane"
            evidence=IEA] [GO:0005576 "extracellular region" evidence=NAS]
            [GO:0005764 "lysosome" evidence=IDA;NAS] [GO:0097067 "cellular
            response to thyroid hormone stimulus" evidence=IEP] [GO:0002250
            "adaptive immune response" evidence=IEP] [GO:0019882 "antigen
            processing and presentation" evidence=TAS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=TAS] [GO:0006955
            "immune response" evidence=TAS] [GO:0002474 "antigen processing and
            presentation of peptide antigen via MHC class I" evidence=TAS]
            [GO:0002480 "antigen processing and presentation of exogenous
            peptide antigen via MHC class I, TAP-independent" evidence=TAS]
            [GO:0019886 "antigen processing and presentation of exogenous
            peptide antigen via MHC class II" evidence=TAS] [GO:0036021
            "endolysosome lumen" evidence=TAS] [GO:0042590 "antigen processing
            and presentation of exogenous peptide antigen via MHC class I"
            evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
            [GO:0045087 "innate immune response" evidence=TAS] [GO:0043231
            "intracellular membrane-bounded organelle" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 Reactome:REACT_118779
            Reactome:REACT_6900 GO:GO:0005576 GO:GO:0002480 GO:GO:0016020
            GO:GO:0019886 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087 EMBL:CH471121
            GO:GO:0004197 GO:GO:0002250 HOVERGEN:HBG011513 GO:GO:0097067
            GO:GO:0036021 EMBL:AL356292 CTD:1520 KO:K01368 OMA:KAMDQKC
            OrthoDB:EOG4JM7Q2 EMBL:S93414 EMBL:M86553 EMBL:M90696 EMBL:U07374
            EMBL:U07370 EMBL:U07371 EMBL:U07372 EMBL:U07373 EMBL:CR541676
            EMBL:AK301472 EMBL:AK314482 EMBL:BC002642 IPI:IPI00299150
            IPI:IPI00910216 PIR:A42482 RefSeq:NP_001186668.1 RefSeq:NP_004070.3
            UniGene:Hs.181301 PDB:1BXF PDB:1GLO PDB:1MS6 PDB:1NPZ PDB:1NQC
            PDB:2C0Y PDB:2F1G PDB:2FQ9 PDB:2FRA PDB:2FRQ PDB:2FT2 PDB:2FUD
            PDB:2FYE PDB:2G6D PDB:2G7Y PDB:2H7J PDB:2HH5 PDB:2HHN PDB:2HXZ
            PDB:2OP3 PDB:2R9M PDB:2R9N PDB:2R9O PDB:3IEJ PDB:3KWN PDB:3MPE
            PDB:3MPF PDB:3N3G PDB:3N4C PDB:3OVX PDBsum:1BXF PDBsum:1GLO
            PDBsum:1MS6 PDBsum:1NPZ PDBsum:1NQC PDBsum:2C0Y PDBsum:2F1G
            PDBsum:2FQ9 PDBsum:2FRA PDBsum:2FRQ PDBsum:2FT2 PDBsum:2FUD
            PDBsum:2FYE PDBsum:2G6D PDBsum:2G7Y PDBsum:2H7J PDBsum:2HH5
            PDBsum:2HHN PDBsum:2HXZ PDBsum:2OP3 PDBsum:2R9M PDBsum:2R9N
            PDBsum:2R9O PDBsum:3IEJ PDBsum:3KWN PDBsum:3MPE PDBsum:3MPF
            PDBsum:3N3G PDBsum:3N4C PDBsum:3OVX ProteinModelPortal:P25774
            SMR:P25774 IntAct:P25774 STRING:P25774 MEROPS:I29.004
            PhosphoSite:P25774 DMDM:88984046 PaxDb:P25774 PeptideAtlas:P25774
            PRIDE:P25774 DNASU:1520 Ensembl:ENST00000368985
            Ensembl:ENST00000448301 GeneID:1520 KEGG:hsa:1520 UCSC:uc001evn.3
            GeneCards:GC01M150702 HGNC:HGNC:2545 HPA:CAB000460 HPA:HPA002988
            MIM:116845 neXtProt:NX_P25774 PharmGKB:PA27041 InParanoid:P25774
            PhylomeDB:P25774 BRENDA:3.4.22.27 BindingDB:P25774
            ChEMBL:CHEMBL2954 ChiTaRS:CTSS EvolutionaryTrace:P25774
            GenomeRNAi:1520 NextBio:6291 PMAP-CutDB:P25774 ArrayExpress:P25774
            Bgee:P25774 CleanEx:HS_CTSS Genevestigator:P25774
            GermOnline:ENSG00000163131 Uniprot:P25774
        Length = 331

 Score = 608 (219.1 bits), Expect = 2.7e-59, P = 2.7e-59
 Identities = 126/335 (37%), Positives = 196/335 (58%)

Query:    21 IITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
             ++ +L+ C+S V      H+   ++ H   W   +G+ YK++ E+ +R  I+++NL+++ 
Sbjct:     4 LVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVM 60

Query:    80 KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDV 136
               N E   G  +Y LG N   D+T++E  +L +  ++PS   R        Y++     +
Sbjct:    61 LHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNPNRIL 115

Query:   137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-- 194
             P S+DWR+KG VT +K Q  CG CWAF+AV A+E   K+++G L+ LS Q L+DCST   
Sbjct:   116 PDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKY 175

Query:   195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
             GN GC GG    AF YII N+GI ++  YPY+A+   C    K  AA  S Y E+P G E
Sbjct:   176 GNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGRE 235

Query:   255 QALLKAVSMQ-PVSIAIAAYSTEFQSYKEGIF-NGVCGTQLDHAVTIVGFGTTEDGANYW 312
               L +AV+ + PVS+ + A    F  Y+ G++    C   ++H V +VG+G   +G  YW
Sbjct:   236 DVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDL-NGKEYW 294

Query:   313 LIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYP 346
             L+KNSWG+ +G+ GY+++ R++G  CGI +  SYP
Sbjct:   295 LVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329


>WB|WBGene00000776 [details] [associations]
            symbol:cpl-1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0040010 "positive regulation
            of growth rate" evidence=IMP] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] [GO:0040011
            "locomotion" evidence=IMP] [GO:0070265 "necrotic cell death"
            evidence=IMP] [GO:0031983 "vesicle lumen" evidence=IDA] [GO:0042718
            "yolk granule" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0009792 GO:GO:0040010 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0040011
            GO:GO:0031983 GO:GO:0070265 GeneTree:ENSGT00660000095458 KO:K01365
            GO:GO:0042718 MEROPS:I29.009 EMBL:Z92812 GeneID:180111
            KEGG:cel:CELE_T03E6.7 CTD:180111 PIR:T24387 RefSeq:NP_001256718.1
            HSSP:P80067 ProteinModelPortal:O45734 SMR:O45734 DIP:DIP-26616N
            IntAct:O45734 MINT:MINT-211563 STRING:O45734 PaxDb:O45734
            EnsemblMetazoa:T03E6.7.1 EnsemblMetazoa:T03E6.7.2 UCSC:T03E6.7.1
            WormBase:T03E6.7a InParanoid:O45734 OMA:HIENHNR NextBio:908128
            Uniprot:O45734
        Length = 337

 Score = 608 (219.1 bits), Expect = 2.7e-59, P = 2.7e-59
 Identities = 127/337 (37%), Positives = 192/337 (56%)

Query:    19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
             FI++ L+ +  +   +  S   +S +E  + +     + Y  E E++  ++ F +N+ +I
Sbjct:     4 FILLALVAAVVAVNSAKLSRQIESAIEKWDDYKEDFDKEYS-ESEEQTYMEAFVKNMIHI 62

Query:    79 EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTD 135
             E  N++   G +T+++G N  +DL   ++R L  GY+      R      F         
Sbjct:    63 ENHNRDHRLGRKTFEMGLNHIADLPFSQYRKL-NGYRRLFGDSRIKNSSSFLAP--FNVQ 119

Query:   136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN- 194
             VP  +DWRD   VT +KNQ  CG CWAF+A  A+EG    + G L+ LSEQ L+DCST  
Sbjct:   120 VPDEVDWRDTHLVTDVKNQGMCGSCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKY 179

Query:   195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
             GN+GC GG  ++AF YI  N G+ TE+ YPY+     C   +K   A    Y + P GDE
Sbjct:   180 GNHGCNGGLMDQAFEYIRDNHGVDTEESYPYKGRDMKCHFNKKTVGADDKGYVDTPEGDE 239

Query:   255 QALLKAVSMQ-PVSIAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFGTTEDGANY 311
             + L  AV+ Q P+SIAI A    FQ YK+G++ +  C ++ LDH V +VG+GT  +  +Y
Sbjct:   240 EQLKIAVATQGPISIAIDAGHRSFQLYKKGVYYDEECSSEELDHGVLLVGYGTDPEHGDY 299

Query:   312 WLIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYPL 347
             W++KNSWG  WG+ GY++I R+    CG+ T++SYPL
Sbjct:   300 WIVKNSWGAGWGEKGYIRIARNRNNHCGVATKASYPL 336


>RGD|2448 [details] [associations]
            symbol:Ctsl1 "cathepsin L1" species:10116 "Rattus norvegicus"
          [GO:0002250 "adaptive immune response" evidence=ISO] [GO:0004177
          "aminopeptidase activity" evidence=IDA] [GO:0004197 "cysteine-type
          endopeptidase activity" evidence=ISO;IDA] [GO:0005576 "extracellular
          region" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IDA]
          [GO:0005764 "lysosome" evidence=ISO;IDA] [GO:0005773 "vacuole"
          evidence=IDA] [GO:0005902 "microvillus" evidence=IDA] [GO:0006508
          "proteolysis" evidence=IEP;ISO] [GO:0007154 "cell communication"
          evidence=IDA] [GO:0007283 "spermatogenesis" evidence=IEP] [GO:0008234
          "cysteine-type peptidase activity" evidence=ISO] [GO:0008584 "male
          gonad development" evidence=IEP] [GO:0009267 "cellular response to
          starvation" evidence=IEP] [GO:0009749 "response to glucose stimulus"
          evidence=IEP] [GO:0009897 "external side of plasma membrane"
          evidence=IDA] [GO:0010259 "multicellular organismal aging"
          evidence=IEP] [GO:0014070 "response to organic cyclic compound"
          evidence=IEP] [GO:0021675 "nerve development" evidence=IEP]
          [GO:0030984 "kininogen binding" evidence=IPI] [GO:0032403 "protein
          complex binding" evidence=IPI] [GO:0034698 "response to gonadotropin
          stimulus" evidence=IEP] [GO:0042277 "peptide binding" evidence=IDA]
          [GO:0042393 "histone binding" evidence=ISO] [GO:0043005 "neuron
          projection" evidence=IDA] [GO:0043204 "perikaryon" evidence=IDA]
          [GO:0046697 "decidualization" evidence=IEP] [GO:0048102 "autophagic
          cell death" evidence=IEP] [GO:0051384 "response to glucocorticoid
          stimulus" evidence=IEP] [GO:0060008 "Sertoli cell differentiation"
          evidence=IEP] [GO:0097067 "cellular response to thyroid hormone
          stimulus" evidence=ISO] [GO:0030141 "secretory granule" evidence=IDA]
          [GO:0045177 "apical part of cell" evidence=IDA] [GO:0060441
          "epithelial tube branching involved in lung morphogenesis"
          evidence=ISO] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
          PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:Y00697 RGD:2448
          GO:GO:0005576 GO:GO:0009897 GO:GO:0034698 GO:GO:0043204 GO:GO:0009749
          GO:GO:0051384 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
          InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
          PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
          PROSITE:PS00139 PROSITE:PS00639 GO:GO:0043005 GO:GO:0007283
          GO:GO:0004177 GO:GO:0005764 GO:GO:0042277 GO:GO:0009267 GO:GO:0021675
          GO:GO:0005902 GO:GO:0010259 GO:GO:0004197 GO:GO:0048102 GO:GO:0046697
          GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 KO:K01365
          OrthoDB:EOG48PMKF MEROPS:C01.032 OMA:FDQNLDT CTD:1514
          BRENDA:3.4.22.15 GO:GO:0060008 EMBL:AF025476 EMBL:BC063175
          EMBL:S85184 IPI:IPI00326070 PIR:S07098 RefSeq:NP_037288.1
          UniGene:Rn.1294 ProteinModelPortal:P07154 SMR:P07154 IntAct:P07154
          STRING:P07154 PhosphoSite:P07154 PRIDE:P07154
          Ensembl:ENSRNOT00000025462 GeneID:25697 KEGG:rno:25697 UCSC:RGD:2448
          InParanoid:P07154 SABIO-RK:P07154 BindingDB:P07154 ChEMBL:CHEMBL2305
          NextBio:607715 Genevestigator:P07154 GermOnline:ENSRNOG00000018566
          Uniprot:P07154
        Length = 334

 Score = 607 (218.7 bits), Expect = 3.5e-59, P = 3.5e-59
 Identities = 127/338 (37%), Positives = 195/338 (57%)

Query:    21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
             ++ L V C    +++    +    + H+ W + H R Y    E+E R  ++++N+  I+ 
Sbjct:     4 LLLLAVLCLGTALATPKFDQTFNAQWHQ-WKSTHRRLYGTN-EEEWRRAVWEKNMRMIQL 61

Query:    81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVP 137
              N E   G   + +  N F D+TN+EFR +  GY+     H+        +Q   M  +P
Sbjct:    62 HNGEYSNGKHGFTMEMNAFGDMTNEEFRQIVNGYR--HQKHKKGRL----FQEPLMLQIP 115

Query:   138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
              ++DWR+KG VTP+KNQ +CG CWAF+A   +EG   +++G LI LSEQ L+DCS + GN
Sbjct:   116 KTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGN 175

Query:   197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
              GC GG  + AF YI +N G+ +E+ YPY+A  G+C    + A A  + + ++P   E+A
Sbjct:   176 QGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGFVDIPQ-QEKA 234

Query:   257 LLKAVS-MQPVSIAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFG---TTEDGAN 310
             L+KAV+ + P+S+A+ A     Q Y  GI+    C ++ LDH V +VG+G   T  +   
Sbjct:   235 LMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDK 294

Query:   311 YWLIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYPL 347
             YWL+KNSWG  WG  GY+KI +D    CG+ T +SYP+
Sbjct:   295 YWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPI 332


>UNIPROTKB|O60911 [details] [associations]
            symbol:CTSL2 "Cathepsin L2" species:9606 "Homo sapiens"
            [GO:0004177 "aminopeptidase activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA] [GO:0005902
            "microvillus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0007283 "spermatogenesis" evidence=IEA] [GO:0009267 "cellular
            response to starvation" evidence=IEA] [GO:0009749 "response to
            glucose stimulus" evidence=IEA] [GO:0009897 "external side of
            plasma membrane" evidence=IEA] [GO:0010259 "multicellular
            organismal aging" evidence=IEA] [GO:0021675 "nerve development"
            evidence=IEA] [GO:0030984 "kininogen binding" evidence=IEA]
            [GO:0032403 "protein complex binding" evidence=IEA] [GO:0034698
            "response to gonadotropin stimulus" evidence=IEA] [GO:0042277
            "peptide binding" evidence=IEA] [GO:0043005 "neuron projection"
            evidence=IEA] [GO:0043204 "perikaryon" evidence=IEA] [GO:0046697
            "decidualization" evidence=IEA] [GO:0048102 "autophagic cell death"
            evidence=IEA] [GO:0051384 "response to glucocorticoid stimulus"
            evidence=IEA] [GO:0060008 "Sertoli cell differentiation"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=TAS] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_118779 Reactome:REACT_6900
            GO:GO:0009897 GO:GO:0019886 GO:GO:0034698 GO:GO:0043204
            GO:GO:0009749 GO:GO:0030141 GO:GO:0051384 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0045177 GO:GO:0043005 GO:GO:0007283
            GO:GO:0004177 GO:GO:0042277 GO:GO:0009267 GO:GO:0021675
            GO:GO:0043202 GO:GO:0005902 GO:GO:0010259 GO:GO:0004197
            GO:GO:0048102 GO:GO:0046697 HOVERGEN:HBG011513 CTD:1515
            OrthoDB:EOG48PMKF OMA:FDQNLDT GO:GO:0060008 EMBL:Y14734
            EMBL:AB001928 EMBL:AF070448 EMBL:AB019534 EMBL:AY358641
            EMBL:AL445670 EMBL:BC023504 EMBL:BC110512 IPI:IPI00000013
            RefSeq:NP_001188504.1 RefSeq:NP_001324.2 UniGene:Hs.610096 PDB:1FH0
            PDB:3H6S PDB:3KFQ PDBsum:1FH0 PDBsum:3H6S PDBsum:3KFQ
            ProteinModelPortal:O60911 SMR:O60911 IntAct:O60911 STRING:O60911
            MEROPS:I29.010 PhosphoSite:O60911 PaxDb:O60911 PeptideAtlas:O60911
            PRIDE:O60911 Ensembl:ENST00000259470 Ensembl:ENST00000538255
            GeneID:1515 KEGG:hsa:1515 UCSC:uc004awt.3 GeneCards:GC09M099794
            HGNC:HGNC:2538 HPA:CAB017112 MIM:603308 neXtProt:NX_O60911
            PharmGKB:PA27036 InParanoid:O60911 KO:K01375 PhylomeDB:O60911
            BRENDA:3.4.22.43 SABIO-RK:O60911 BindingDB:O60911 ChEMBL:CHEMBL3272
            ChiTaRS:CTSL2 EvolutionaryTrace:O60911 GenomeRNAi:1515 NextBio:6277
            Bgee:O60911 CleanEx:HS_CTSL2 Genevestigator:O60911
            GermOnline:ENSG00000136943 Uniprot:O60911
        Length = 334

 Score = 601 (216.6 bits), Expect = 1.5e-58, P = 1.5e-58
 Identities = 131/340 (38%), Positives = 197/340 (57%)

Query:    18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
             M + + L   C   + S+    +Q++     +W A H R Y    E+  R  ++++N++ 
Sbjct:     1 MNLSLVLAAFCLG-IASAVPKFDQNLDTKWYQWKATHRRLYGAN-EEGWRRAVWEKNMKM 58

Query:    78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMT 134
             IE  N E   G   + +  N F D+TN+EFR +   ++    + +      F+ + L + 
Sbjct:    59 IELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFR----NQKFRKGKVFR-EPLFL- 112

Query:   135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST- 193
             D+P S+DWR KG VTP+KNQK+CG CWAF+A  A+EG    ++G L+ LSEQ L+DCS  
Sbjct:   113 DLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRP 172

Query:   194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
              GN GC GG   +AF Y+ +N G+ +E+ YPY AV   C    + + A  + +  V  G 
Sbjct:   173 QGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGK 232

Query:   254 EQALLKAVS-MQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFG---TTED 307
             E+AL+KAV+ + P+S+A+ A  + FQ YK GI F   C ++ LDH V +VG+G      +
Sbjct:   233 EKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSN 292

Query:   308 GANYWLIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYP 346
              + YWL+KNSWG  WG  GY+KI +D+   CGI T +SYP
Sbjct:   293 NSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYP 332


>DICTYBASE|DDB_G0279799 [details] [associations]
            symbol:cprB "cysteine proteinase 2" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0279799 GenomeReviews:CM000152_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            MEROPS:I29.003 KO:K01365 EMBL:AAFI02000033 EMBL:M16039 EMBL:X03344
            PIR:A25439 RefSeq:XP_641494.1 ProteinModelPortal:P04989 SMR:P04989
            EnsemblProtists:DDB0214998 GeneID:8622234 KEGG:ddi:DDB_G0279799
            OMA:YVNITAG Uniprot:P04989
        Length = 376

 Score = 487 (176.5 bits), Expect = 2.3e-58, Sum P(2) = 2.3e-58
 Identities = 109/293 (37%), Positives = 155/293 (52%)

Query:    18 MFIIITLLV--SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
             +F+I+ + V  S A+   + R   E        +W  +  R Y    E   R  IFK N+
Sbjct:     5 VFLILLIFVNFSFANVRPNGRRFSESQYRTAFTEWTLKFNRQYSSS-EFSNRYSIFKSNM 63

Query:    76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXX-FKYQNLSMT 134
             +Y++  N +G+    LG N F+D+TN+E+R  Y G ++ + S+           ++L   
Sbjct:    64 DYVDNWNSKGDSQTVLGLNNFADITNEEYRKTYLGTRVNAHSYNGYDGREVLNVEDLQTN 123

Query:   135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
               P S+DWR K AVTPIK+Q +CG CW+F+   + EG   +++  L+ LSEQ L+DCS  
Sbjct:   124 --PKSIDWRTKNAVTPIKDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGP 181

Query:   195 GNN-GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSG 252
               N GC GG    AF YII+N+GI TE  YPY A  G TC   +    A I  Y  + +G
Sbjct:   182 EENFGCDGGLMNNAFDYIIKNKGIDTESSYPYTAETGSTCLFNKSDIGATIKGYVNITAG 241

Query:   253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF-NGVCG-TQLDHAVTIVGFG 303
              E +L       PVS+AI A    FQ Y  GI+    C  T+LDH V +VG+G
Sbjct:   242 SEISLENGAQHGPVSVAIDASHNSFQLYTSGIYYEPKCSPTELDHGVLVVGYG 294

 Score = 130 (50.8 bits), Expect = 2.3e-58, Sum P(2) = 2.3e-58
 Identities = 22/40 (55%), Positives = 30/40 (75%)

Query:   310 NYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
             NYW++KNSWG +WG  GY+ + +D +  CGI + SSYPLA
Sbjct:   337 NYWIVKNSWGTSWGIKGYILMSKDRKNNCGIASVSSYPLA 376


>UNIPROTKB|P25975 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 EMBL:X91755 EMBL:BC102312 EMBL:AB017648
            IPI:IPI00687440 PIR:S15845 RefSeq:NP_776457.1 UniGene:Bt.3987
            ProteinModelPortal:P25975 SMR:P25975 STRING:P25975
            Ensembl:ENSBTAT00000022710 Ensembl:ENSBTAT00000036427 GeneID:281108
            KEGG:bta:281108 CTD:1515 InParanoid:P25975 KO:K01365 OMA:EEFRATH
            OrthoDB:EOG48PMKF BindingDB:P25975 ChEMBL:CHEMBL2113
            NextBio:20805179 ArrayExpress:P25975 Uniprot:P25975
        Length = 334

 Score = 593 (213.8 bits), Expect = 1.1e-57, P = 1.1e-57
 Identities = 131/336 (38%), Positives = 199/336 (59%)

Query:    24 LLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN 82
             L V C    V+S +      ++ H  +W A H R Y    E+E R  ++++N + I+  N
Sbjct:     7 LTVLCLG--VASAAPKLDPNLDAHWHQWKATHRRLYGMN-EEEWRRAVWEKNKKIIDLHN 63

Query:    83 KE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTS 139
             +E   G   +++  N F D+TN+EFR +  G++  +  H+        ++ L + DVP S
Sbjct:    64 QEYSEGKHGFRMAMNAFGDMTNEEFRQVMNGFQ--NQKHKKGKLF---HEPL-LVDVPKS 117

Query:   140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNG 198
             +DW  KG VTP+KNQ +CG CWAF+A  A+EG    ++G L+ LSEQ L+DCS   GN G
Sbjct:   118 VDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQG 177

Query:   199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVP-GTCSAAQKPAAAKISNYEEVPSGDEQAL 257
             C GG  + AF YI  N G+ +E+ YPY A    +C+   + +AA  + + ++P   E+AL
Sbjct:   178 CNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQR-EKAL 236

Query:   258 LKAVS-MQPVSIAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFG---TTEDGANY 311
             +KAV+ + P+S+AI A  T FQ YK GI+ +  C ++ LDH V +VG+G   T  +   +
Sbjct:   237 MKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNNNKF 296

Query:   312 WLIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYP 346
             W++KNSWG  WG  GY+K+ +D+   CGI T +SYP
Sbjct:   297 WIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYP 332


>UNIPROTKB|Q28944 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9823 "Sus scrofa"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 KO:K01365 OrthoDB:EOG48PMKF MEROPS:C01.032
            CTD:1514 EMBL:D37917 EMBL:AJ315771 PIR:A58195 RefSeq:NP_999057.1
            UniGene:Ssc.54036 ProteinModelPortal:Q28944 SMR:Q28944
            STRING:Q28944 Ensembl:ENSSSCT00000012233 GeneID:396926
            KEGG:ssc:396926 OMA:DASETGK ArrayExpress:Q28944 Uniprot:Q28944
        Length = 334

 Score = 593 (213.8 bits), Expect = 1.1e-57, P = 1.1e-57
 Identities = 127/311 (40%), Positives = 191/311 (61%)

Query:    49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFR 105
             KW A HGR Y    E+  R  ++++N++ IE  N+E   G   + +  N F D+TN+EFR
Sbjct:    31 KWKATHGRLYGMN-EEGWRRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAFGDMTNEEFR 89

Query:   106 ALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
              +  G++  +  H+        +++L + +VP S+DWR+KG VT +KNQ +CG CWAF+A
Sbjct:    90 QVMNGFQ--NQKHKKGKVF---HESLVL-EVPKSVDWREKGYVTAVKNQGQCGSCWAFSA 143

Query:   166 VAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYP 224
               A+EG    ++G L+ LSEQ L+DCS   GN GC GG  + AF Y+  N G+ TE+ YP
Sbjct:   144 TGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEESYP 203

Query:   225 YQAVPGTCSAAQKP--AAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYK 281
             Y     T S   KP  +AA  + + ++P   E+AL+KAV+ + P+S+AI A  + FQ YK
Sbjct:   204 YLGRE-TNSCTYKPECSAANDTGFVDIPQR-EKALMKAVATVGPISVAIDAGHSSFQFYK 261

Query:   282 EGIF-NGVCGTQ-LDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGL 336
              GI+ +  C ++ LDH V +VG+G   T  + + +W++KNSWG  WG  GY+K+ +D+  
Sbjct:   262 SGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGPEWGWNGYVKMAKDQNN 321

Query:   337 -CGIGTRSSYP 346
              CGI T +SYP
Sbjct:   322 HCGISTAASYP 332


>ZFIN|ZDB-GENE-050626-55 [details] [associations]
            symbol:ctssb.2 "cathepsin S, b.2" species:7955
            "Danio rerio" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-050626-55
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            KO:K01368 EMBL:BC093339 IPI:IPI00507098 RefSeq:NP_001017661.1
            UniGene:Dr.132688 ProteinModelPortal:Q566T8 SMR:Q566T8
            GeneID:337572 KEGG:dre:337572 CTD:337572 InParanoid:Q566T8
            NextBio:20812306 ArrayExpress:Q566T8 Uniprot:Q566T8
        Length = 330

 Score = 593 (213.8 bits), Expect = 1.1e-57, P = 1.1e-57
 Identities = 127/322 (39%), Positives = 187/322 (58%)

Query:    34 SSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTY 89
             S+   H    ++ H E W  +H + Y  E E+  R ++++ NLE I   N E   G  +Y
Sbjct:    13 SAALAHFNKNLDQHWELWKKKHVKLYSCEDEEVGRRELWERNLELIAIHNLEASMGMHSY 72

Query:    90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVT 149
              L  N  +D+T +E        ++P P  +       +Y + S   VP +LDWRDKG VT
Sbjct:    73 DLAINHMADMTTEEILQTLAVTRVP-PGFKRPTA---EYVSSSFAVVPDTLDWRDKGYVT 128

Query:   150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAF 208
              +KNQ  CG CWAF++V A+EG     +G L+ LS Q L+DCS+  GN GC GG   +AF
Sbjct:   129 SVKNQGACGSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGYMSQAF 188

Query:   209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVS 267
              Y+I N GI +E  YPYQ   G+C       AA  ++Y+ V  GDEQAL +A++ + PVS
Sbjct:   189 QYVIDNGGIDSESSYPYQGTQGSCRYDPSQRAANCTSYKFVSQGDEQALKEALANIGPVS 248

Query:   268 IAIAAYSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAG 326
             +AI A   +F  Y+ G+++   C  +++H V  VG+GT   G +YWL+KNSWG  +GD G
Sbjct:   249 VAIDATRPQFIFYRSGVYDDPSCTQKVNHGVLAVGYGTLS-GQDYWLVKNSWGAGFGDGG 307

Query:   327 YMKIVRDEG-LCGIGTRSSYPL 347
             Y++I R++  +CGI + + YP+
Sbjct:   308 YIRIARNKNNMCGIASEACYPI 329


>UNIPROTKB|Q9GL24 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 CTD:1515 KO:K01365
            OrthoDB:EOG48PMKF EMBL:AJ279008 RefSeq:NP_001239115.1
            UniGene:Cfa.3571 ProteinModelPortal:Q9GL24 SMR:Q9GL24
            MEROPS:C01.032 Ensembl:ENSCAFT00000001770
            Ensembl:ENSCAFT00000023837 GeneID:100684364 KEGG:cfa:100684364
            InParanoid:Q9GL24 OMA:FDQNLDT NextBio:20817211 Uniprot:Q9GL24
        Length = 333

 Score = 592 (213.5 bits), Expect = 1.4e-57, P = 1.4e-57
 Identities = 126/326 (38%), Positives = 193/326 (59%)

Query:    32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRT 88
             + S+    +QS+     +W A H R Y    E+  R  ++++N++ IE  N+E   G   
Sbjct:    14 IASAAPKFDQSLNAQWYQWKATHRRLYGMN-EEGWRRAVWEKNMKMIELHNREYSQGKHG 72

Query:    89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAV 148
             + +  N F D+TN+EFR +  G++  +  H+        +Q     ++P S+DWR+KG V
Sbjct:    73 FTMAMNAFGDMTNEEFRQVMNGFQ--NQKHKKGKM----FQEPLFAEIPKSVDWREKGYV 126

Query:   149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKA 207
             TP+KNQ +CG CWAF+A  A+EG    ++G L+ LSEQ L+DCS   GN GC GG  + A
Sbjct:   127 TPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNA 186

Query:   208 FAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQP 265
             F Y+  N G+ +E+ YPY      TC+   + +AA  + + ++P   E+AL+KAV+ + P
Sbjct:   187 FRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDLPQR-EKALMKAVATLGP 245

Query:   266 VSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFGT--TEDGANYWLIKNSWGNT 321
             +S+AI A    FQ YK GI F+  C ++ LDH V +VG+G   T+    +W++KNSWG  
Sbjct:   246 ISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKFWIVKNSWGPE 305

Query:   322 WGDAGYMKIVRDEGL-CGIGTRSSYP 346
             WG  GY+K+ +D+   CGI T +SYP
Sbjct:   306 WGWNGYVKMAKDQNNHCGIATAASYP 331


>UNIPROTKB|P43235 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0001957
            "intramembranous ossification" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0045453 "bone resorption"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=TAS] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=TAS] [GO:0036021 "endolysosome lumen" evidence=TAS]
            [GO:0045087 "innate immune response" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            Reactome:REACT_118779 Reactome:REACT_6900 GO:GO:0005615
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087 GO:GO:0045453
            EMBL:CH471121 EMBL:AL355860 GO:GO:0004197 GO:GO:0001957
            HOVERGEN:HBG011513 GO:GO:0036021 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:U13665 EMBL:X82153
            EMBL:U20280 EMBL:S79895 EMBL:CR541675 EMBL:AL356292 EMBL:BC016058
            IPI:IPI00300599 PIR:JC2476 RefSeq:NP_000387.1 UniGene:Hs.632466
            PDB:1ATK PDB:1AU0 PDB:1AU2 PDB:1AU3 PDB:1AU4 PDB:1AYU PDB:1AYV
            PDB:1AYW PDB:1BGO PDB:1BY8 PDB:1MEM PDB:1NL6 PDB:1NLJ PDB:1Q6K
            PDB:1SNK PDB:1TU6 PDB:1U9V PDB:1U9W PDB:1U9X PDB:1VSN PDB:1YK7
            PDB:1YK8 PDB:1YT7 PDB:2ATO PDB:2AUX PDB:2AUZ PDB:2BDL PDB:2R6N
            PDB:3C9E PDB:3H7D PDB:3KW9 PDB:3KWB PDB:3KWZ PDB:3KX1 PDB:3O0U
            PDB:3O1G PDB:3OVZ PDB:4DMX PDB:4DMY PDB:7PCK PDBsum:1ATK
            PDBsum:1AU0 PDBsum:1AU2 PDBsum:1AU3 PDBsum:1AU4 PDBsum:1AYU
            PDBsum:1AYV PDBsum:1AYW PDBsum:1BGO PDBsum:1BY8 PDBsum:1MEM
            PDBsum:1NL6 PDBsum:1NLJ PDBsum:1Q6K PDBsum:1SNK PDBsum:1TU6
            PDBsum:1U9V PDBsum:1U9W PDBsum:1U9X PDBsum:1VSN PDBsum:1YK7
            PDBsum:1YK8 PDBsum:1YT7 PDBsum:2ATO PDBsum:2AUX PDBsum:2AUZ
            PDBsum:2BDL PDBsum:2R6N PDBsum:3C9E PDBsum:3H7D PDBsum:3KW9
            PDBsum:3KWB PDBsum:3KWZ PDBsum:3KX1 PDBsum:3O0U PDBsum:3O1G
            PDBsum:3OVZ PDBsum:4DMX PDBsum:4DMY PDBsum:7PCK
            ProteinModelPortal:P43235 SMR:P43235 DIP:DIP-39993N IntAct:P43235
            STRING:P43235 PhosphoSite:P43235 DMDM:1168793 PaxDb:P43235
            PRIDE:P43235 DNASU:1513 Ensembl:ENST00000271651 GeneID:1513
            KEGG:hsa:1513 UCSC:uc001evp.2 GeneCards:GC01M150768 HGNC:HGNC:2536
            MIM:265800 MIM:601105 neXtProt:NX_P43235 Orphanet:763
            PharmGKB:PA27034 InParanoid:P43235 OMA:LKVPPSH PhylomeDB:P43235
            BindingDB:P43235 ChEMBL:CHEMBL268 EvolutionaryTrace:P43235
            GenomeRNAi:1513 NextBio:6267 ArrayExpress:P43235 Bgee:P43235
            CleanEx:HS_CTSK CleanEx:HS_CTSO Genevestigator:P43235
            GermOnline:ENSG00000143387 Uniprot:P43235
        Length = 329

 Score = 591 (213.1 bits), Expect = 1.7e-57, P = 1.7e-57
 Identities = 126/324 (38%), Positives = 194/324 (59%)

Query:    33 VSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRT 88
             V S + + + +++ H E W   H + Y +++++  R  I+++NL+YI   N E   G  T
Sbjct:    11 VVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHT 70

Query:    89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXF--KYQNLSMTDVPTSLDWRDKG 146
             Y+L  N   D+T++E     TG K+P  SH       +  +++  +    P S+D+R KG
Sbjct:    71 YELAMNHLGDMTSEEVVQKMTGLKVPL-SHSRSNDTLYIPEWEGRA----PDSVDYRKKG 125

Query:   147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREK 206
              VTP+KNQ +CG CWAF++V A+EG  K ++G L+ LS Q L+DC +  N+GC GG    
Sbjct:   126 YVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE-NDGCGGGYMTN 184

Query:   207 AFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQP 265
             AF Y+ +N+GI +ED YPY     +C       AAK   Y E+P G+E+AL +AV+ + P
Sbjct:   185 AFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGP 244

Query:   266 VSIAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWG 323
             VS+AI A  T FQ Y +G++ +  C +  L+HAV  VG+G  + G  +W+IKNSWG  WG
Sbjct:   245 VSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGI-QKGNKHWIIKNSWGENWG 303

Query:   324 DAGYMKIVRDEG-LCGIGTRSSYP 346
             + GY+ + R++   CGI   +S+P
Sbjct:   304 NKGYILMARNKNNACGIANLASFP 327


>RGD|61810 [details] [associations]
            symbol:Ctsk "cathepsin K" species:10116 "Rattus norvegicus"
           [GO:0001957 "intramembranous ossification" evidence=IEP] [GO:0005615
           "extracellular space" evidence=IDA] [GO:0005737 "cytoplasm"
           evidence=IDA] [GO:0005764 "lysosome" evidence=IDA] [GO:0006508
           "proteolysis" evidence=TAS] [GO:0008234 "cysteine-type peptidase
           activity" evidence=TAS] [GO:0045453 "bone resorption" evidence=IMP]
           InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
           Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
           RGD:61810 GO:GO:0005615 eggNOG:COG4870 HOGENOM:HOG000230774
           GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
           InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
           PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
           GO:GO:0045453 GO:GO:0001957 GeneTree:ENSGT00560000076577
           HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
           OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:AF010306 EMBL:BC078793
           IPI:IPI00206378 RefSeq:NP_113748.1 UniGene:Rn.5598
           ProteinModelPortal:O35186 SMR:O35186 STRING:O35186
           PhosphoSite:O35186 PRIDE:O35186 Ensembl:ENSRNOT00000028730
           GeneID:29175 KEGG:rno:29175 UCSC:RGD:61810 InParanoid:O35186
           OMA:YKEIPEG BindingDB:O35186 ChEMBL:CHEMBL3034 NextBio:608248
           Genevestigator:O35186 GermOnline:ENSRNOG00000021155 Uniprot:O35186
        Length = 329

 Score = 586 (211.3 bits), Expect = 5.9e-57, P = 5.9e-57
 Identities = 127/322 (39%), Positives = 188/322 (58%)

Query:    32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRT 88
             VVS   + E+++    E W   HG+ Y  ++++  R  I+++NL+ I   N E   G  T
Sbjct:    11 VVSFALSPEETLDTQWELWKKTHGKQYNSKVDEISRRLIWEKNLKKISVHNLEASLGAHT 70

Query:    89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAV 148
             Y+L  N   D+T++E     TG ++P PS        +  +      VP S+D+R KG V
Sbjct:    71 YELAMNHLGDMTSEEVVQKMTGLRVP-PSRSFSNDTLYTPEWEGR--VPDSIDYRKKGYV 127

Query:   149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             TP+KNQ +CG CWAF++  A+EG  K ++G L+ LS Q L+DC +  N GC GG    AF
Sbjct:   128 TPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVSE-NYGCGGGYMTTAF 186

Query:   209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVS 267
              Y+ QN GI +ED YPY     +C       AAK   Y E+P G+E+AL +AV+ + PVS
Sbjct:   187 QYVQQNGGIDSEDAYPYVGQDESCMYNATAKAAKCRGYREIPVGNEKALKRAVARVGPVS 246

Query:   268 IAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDA 325
             ++I A  T FQ Y  G++ +  C    ++HAV +VG+GT + G  YW+IKNSWG +WG+ 
Sbjct:   247 VSIDASLTSFQFYSRGVYYDENCDRDNVNHAVLVVGYGT-QKGNKYWIIKNSWGESWGNK 305

Query:   326 GYMKIVRDEG-LCGIGTRSSYP 346
             GY+ + R++   CGI   +S+P
Sbjct:   306 GYVLLARNKNNACGITNLASFP 327


>UNIPROTKB|Q9GLE3 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9823 "Sus scrofa"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0005576 "extracellular
            region" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005576 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 MEROPS:I29.007
            CTD:1513 KO:K01371 OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55
            OMA:LKVPPSH EMBL:AF292030 RefSeq:NP_999467.1 UniGene:Ssc.1020
            ProteinModelPortal:Q9GLE3 SMR:Q9GLE3 STRING:Q9GLE3
            Ensembl:ENSSSCT00000007283 GeneID:397569 KEGG:ssc:397569
            ArrayExpress:Q9GLE3 Uniprot:Q9GLE3
        Length = 330

 Score = 585 (211.0 bits), Expect = 7.5e-57, P = 7.5e-57
 Identities = 125/322 (38%), Positives = 189/322 (58%)

Query:    33 VSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRT 88
             V S + + + +++   E W   + + Y  ++++  R  I+++NL++I   N E   G  T
Sbjct:    12 VMSSALYPEEILDTQWELWKKTYRKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHT 71

Query:    89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAV 148
             Y+L  N   D+T++E     TG K+P PSH       +       T  P S+D+R KG V
Sbjct:    72 YELAMNHLGDMTSEEVVQKMTGLKVP-PSHSRSNDTLYIPDWEGRT--PDSIDYRKKGYV 128

Query:   149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             TP+KNQ +CG CWAF++V A+EG  K ++G L+ LS Q L+DC +  N+GC GG    AF
Sbjct:   129 TPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE-NDGCGGGYMTNAF 187

Query:   209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVS 267
              Y+ +N+GI +ED YPY      C       AAK   Y E+P G+E+AL +AV+ + PVS
Sbjct:   188 QYVQKNRGIDSEDAYPYVGQDENCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVS 247

Query:   268 IAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDA 325
             +AI A  T FQ Y +G++ +  C +  L+HAV  VG+G  + G  +W+IKNSWG  WG+ 
Sbjct:   248 VAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGI-QKGKKHWIIKNSWGENWGNK 306

Query:   326 GYMKIVRDEG-LCGIGTRSSYP 346
             GY+ + R++   CGI   +S+P
Sbjct:   307 GYILMARNKNNACGIANLASFP 328


>ZFIN|ZDB-GENE-030131-3539 [details] [associations]
            symbol:ctsh "cathepsin H" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-030131-3539
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 KO:K01366 HOVERGEN:HBG011513
            CTD:1512 OrthoDB:EOG4W9J43 MEROPS:I29.003 HSSP:P43235 EMBL:BC067615
            IPI:IPI00506892 RefSeq:NP_997853.1 UniGene:Dr.14176
            ProteinModelPortal:Q6NWF2 SMR:Q6NWF2 PRIDE:Q6NWF2 GeneID:324818
            KEGG:dre:324818 InParanoid:Q6NWF2 NextBio:20808976 Bgee:Q6NWF2
            Uniprot:Q6NWF2
        Length = 330

 Score = 585 (211.0 bits), Expect = 7.5e-57, P = 7.5e-57
 Identities = 130/337 (38%), Positives = 191/337 (56%)

Query:    20 IIITLLVSCASQVVSSRSTHEQSVVEIHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYI 78
             +I+ +L +   QV++     E+   E H K WM+Q+ + Y+   E   RL+IF EN + I
Sbjct:     4 LILAVLFAVLYQVLAVPLYTEED--EYHFKSWMSQYNKKYEIN-EFYQRLQIFLENKKRI 60

Query:    79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDV-P 137
             ++ N EGN  + +G NQFSD+T  EF+  Y    +  P +        +  ++S   + P
Sbjct:    61 DQHN-EGNHKFSMGLNQFSDMTFAEFKKTYL---LTEPQNCSAT----RGNHVSSNGLYP 112

Query:   138 TSLDWRDKGA-VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
              ++DWR KG  +T +KNQ  CG CW F+    +E +T I +G L+QL+EQQL+DC+ +  
Sbjct:   113 DAIDWRTKGHYITDVKNQGPCGSCWTFSTTGCLESVTAIATGKLLQLAEQQLIDCAGDFD 172

Query:   196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
             N+GC GG    AF YI+ N+G+ TED+YPYQA  G C    + AAA +     +   DE 
Sbjct:   173 NHGCNGGLPSHAFEYIMYNKGLMTEDDYPYQAKGGQCRFKPQLAAAFVKEVVNITKYDEM 232

Query:   256 ALLKAVS-MQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQLD---HAVTIVGFGTTEDGAN 310
              ++ AV+ + PVS A    S +F  YK+GI+    C    D   HAV  VG+   E+G  
Sbjct:   233 GMVDAVARLNPVSFAYEVTS-DFMHYKDGIYTSTECHNTTDMVNHAVLAVGYAE-ENGTP 290

Query:   311 YWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
             YW++KNSWG  WG  GY  I R + +CG+   SSYP+
Sbjct:   291 YWIVKNSWGTNWGIKGYFYIERGKNMCGLAACSSYPI 327


>UNIPROTKB|Q5E998 [details] [associations]
            symbol:CTSL2 "Cathepsin L2" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            HOVERGEN:HBG011513 UniGene:Bt.3987 MEROPS:C01.032 EMBL:BT021022
            IPI:IPI00711962 ProteinModelPortal:Q5E998 SMR:Q5E998 STRING:Q5E998
            InParanoid:Q5E998 Uniprot:Q5E998
        Length = 334

 Score = 584 (210.6 bits), Expect = 9.6e-57, P = 9.6e-57
 Identities = 130/336 (38%), Positives = 198/336 (58%)

Query:    24 LLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN 82
             L V C    V+S +      ++ H  +W A H R Y    E+E R  ++++N + I+  N
Sbjct:     7 LTVLCLG--VASAAPKLDPNLDAHWHQWKATHRRLYGMN-EEEWRRAVWEKNKKIIDLHN 63

Query:    83 KE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTS 139
             +E   G   +++  N F D+TN+EFR +  G++  +  H+        ++ L + DVP S
Sbjct:    64 QEYSEGKHGFRMAMNAFGDMTNEEFRQVMNGFQ--NQKHKKGKLF---HEPL-LVDVPKS 117

Query:   140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNG 198
             +DW  KG VTP+KNQ +CG CWAF+A  A+EG    ++G L+ LSEQ L+DCS   GN G
Sbjct:   118 VDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQG 177

Query:   199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVP-GTCSAAQKPAAAKISNYEEVPSGDEQAL 257
             C GG  + AF YI  N  + +E+ YPY A    +C+   + +AA  + + ++P   E+AL
Sbjct:   178 CNGGLMDNAFQYIKDNGCLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQR-EKAL 236

Query:   258 LKAVS-MQPVSIAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFG---TTEDGANY 311
             +KAV+ + P+S+AI A  T FQ YK GI+ +  C ++ LDH V +VG+G   T  +   +
Sbjct:   237 MKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNNNKF 296

Query:   312 WLIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYP 346
             W++KNSWG  WG  GY+K+ +D+   CGI T +SYP
Sbjct:   297 WIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYP 332


>MGI|MGI:1349426 [details] [associations]
            symbol:Ctsj "cathepsin J" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=ISO]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0048471 "perinuclear region
            of cytoplasm" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:1349426 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF136272
            EMBL:AF158182 EMBL:AY034579 EMBL:AK005526 EMBL:AK131661
            EMBL:BC103769 IPI:IPI00126770 RefSeq:NP_036137.1 UniGene:Mm.31948
            ProteinModelPortal:Q9R014 SMR:Q9R014 MEROPS:C01.038 PRIDE:Q9R014
            Ensembl:ENSMUST00000071526 GeneID:26898 KEGG:mmu:26898
            UCSC:uc007qwa.1 CTD:26898 InParanoid:Q9R014 KO:K09599
            NextBio:304745 Bgee:Q9R014 CleanEx:MM_CTSJ Genevestigator:Q9R014
            GermOnline:ENSMUSG00000055298 Uniprot:Q9R014
        Length = 334

 Score = 584 (210.6 bits), Expect = 9.6e-57, P = 9.6e-57
 Identities = 126/335 (37%), Positives = 187/335 (55%)

Query:    22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
             + LL+ C   V S    H+  +    + W  ++ +SY  + E+ +R  +++EN+  I+  
Sbjct:     5 VLLLILCFG-VASGAQAHDPKLDAEWKDWKTKYAKSYSPK-EEALRRAVWEENMRMIKLH 62

Query:    82 NKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPT 138
             NKE   G   + +  N+F D T++EFR       +P P+           QN     +P 
Sbjct:    63 NKENSLGKNNFTMKMNKFGDQTSEEFRKSIDN--IPIPAAMTDPHA----QNHVSIGLPD 116

Query:   139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNN 197
               DWR++G VTP++NQ +CG CWAFAA  A+EG    ++GNL  LS Q LLDCS T GN 
Sbjct:   117 YKDWREEGYVTPVRNQGKCGSCWAFAAAGAIEGQMFWKTGNLTPLSVQNLLDCSKTVGNK 176

Query:   198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
             GC  G+  +AF Y+++N+G+  E  YPY+   G C    + A+A I++Y  +P  +    
Sbjct:   177 GCQSGTAHQAFEYVLKNKGLEAEATYPYEGKDGPCRYRSENASANITDYVNLPPNELYLW 236

Query:   258 LKAVSMQPVSIAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFGT---TEDGANYW 312
             +   S+ PVS AI A    F+ Y  GI+    C +  ++HAV +VG+G+    +DG NYW
Sbjct:   237 VAVASIGPVSAAIDASHDSFRFYNGGIYYEPNCSSYFVNHAVLVVGYGSEGDVKDGNNYW 296

Query:   313 LIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYP 346
             LIKNSWG  WG  GYM+I +D    CGI + +SYP
Sbjct:   297 LIKNSWGEEWGMNGYMQIAKDHNNHCGIASLASYP 331


>UNIPROTKB|P83654 [details] [associations]
            symbol:P83654 "Ervatamin-C" species:52861 "Tabernaemontana
            divaricata" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0006508 "proteolysis" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005576 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197 PDB:1O0E PDB:2PNS
            PDBsum:1O0E PDBsum:2PNS MEROPS:C01.116 EvolutionaryTrace:P83654
            Uniprot:P83654
        Length = 208

 Score = 583 (210.3 bits), Expect = 1.2e-56, P = 1.2e-56
 Identities = 112/213 (52%), Positives = 145/213 (68%)

Query:   136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
             +P  +DWR KGAVTP+KNQ  CG CWAF+ V+ VE I +IR+GNLI LSEQ+L+DC    
Sbjct:     1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKK- 59

Query:   196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
             N+GCLGG+   A+ YII N GI T+  YPY+AV G C AA K     I  Y  VP  +E 
Sbjct:    60 NHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPCQAASK--VVSIDGYNGVPFCNEX 117

Query:   256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
             AL +AV++QP ++AI A S +FQ Y  GIF+G CGT+L+H VTIVG+      ANYW+++
Sbjct:   118 ALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQ-----ANYWIVR 172

Query:   316 NSWGNTWGDAGYMKIVRDEG--LCGIGTRSSYP 346
             NSWG  WG+ GY++++R  G  LCGI     YP
Sbjct:   173 NSWGRYWGEKGYIRMLRVGGCGLCGIARLPYYP 205


>ZFIN|ZDB-GENE-050522-559 [details] [associations]
            symbol:ctssb.1 "cathepsin S, b.1" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-050522-559 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 MEROPS:C01.034
            EMBL:BC095694 IPI:IPI00607338 UniGene:Dr.75553
            ProteinModelPortal:Q502H6 SMR:Q502H6 InParanoid:Q502H6
            ArrayExpress:Q502H6 Uniprot:Q502H6
        Length = 330

 Score = 583 (210.3 bits), Expect = 1.2e-56, P = 1.2e-56
 Identities = 126/322 (39%), Positives = 183/322 (56%)

Query:    34 SSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTY 89
             S+   H  + ++ H E W   +G+ Y  E+E+  R ++++ NL+ I   N E   G  +Y
Sbjct:    13 SAALAHFNTNLDQHWELWKKTYGKIYTTEVEEFGRRQLWERNLQLITVHNLEASMGMHSY 72

Query:    90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVT 149
              L  N   DLT +E         +PS   R            S   VP SLDWR+KG V+
Sbjct:    73 DLSMNHMGDLTTEEILQTLALTHVPSGFKRQIANIV----GSSGDAVPDSLDWREKGYVS 128

Query:   150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAF 208
              +K Q  CG CWAF++V A+EG  K  +G L+ LS Q L+DCS+  GN GC GG    AF
Sbjct:   129 SVKMQGACGSCWAFSSVGALEGQLKKTTGKLVDLSPQNLVDCSSKYGNKGCNGGFMSDAF 188

Query:   209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVS 267
              Y+I N GIA++  YPY+ V   CS +    AA  + Y  V  GDE AL +AV S+ P+S
Sbjct:   189 QYVIDNGGIASDSAYPYRGVQQQCSYSSSQRAANCTKYYFVRQGDENALKQAVASVGPIS 248

Query:   268 IAIAAYSTEFQSYKEGIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAG 326
             +AI A   +F  Y  G++N   C  +++HAV +VG+GT   G ++WL+KNSWG  +GD G
Sbjct:   249 VAIDATRPQFVLYHSGVYNDPTCSKRVNHAVLVVGYGTLS-GQDHWLVKNSWGTRFGDGG 307

Query:   327 YMKIVRDEG-LCGIGTRSSYPL 347
             Y+++ R++  +CGI + + YP+
Sbjct:   308 YIRMARNKNNMCGIASYACYPV 329


>UNIPROTKB|F1NZ37 [details] [associations]
            symbol:LOC420160 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 EMBL:AADN02062018
            IPI:IPI00587784 Ensembl:ENSGALT00000006765 OMA:CGVANQA
            Uniprot:F1NZ37
        Length = 340

 Score = 582 (209.9 bits), Expect = 1.6e-56, P = 1.6e-56
 Identities = 130/352 (36%), Positives = 196/352 (55%)

Query:     8 SGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
             S  F+  T P+ +++ LL  C + +       +  + E  E+W + + + Y  E E  +R
Sbjct:     3 SPQFRAMTVPLGLLLALL-GCTTAL-------DPVLEEAWERWKSLYAKEYPGEAEL-IR 53

Query:    68 LKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXX 124
              ++++ NL  IE+ N E   G  T++LG N + DL ++EF  L  G+   +P        
Sbjct:    54 REVWENNLRRIEQHNWEESQGQHTFRLGMNHYGDLMDEEFNQLLNGF---APVQHEEPAL 110

Query:   125 XFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS 184
              F+      T  P  +DWR +G VTP+KNQ  CG CWAF+A  A+EG+    +G L  LS
Sbjct:   111 TFQASAAQKT--PAEVDWRMRGYVTPVKNQGHCGSCWAFSATGALEGLVFNWTGKLAVLS 168

Query:   185 EQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA--AA 241
             EQ L+DCS   GNNGC GG   +AF Y+  N G+ +E  YPYQA   T S    PA  AA
Sbjct:   169 EQNLIDCSWKLGNNGCQGGYMTRAFQYVHDNGGMNSEHIYPYQATD-TSSCRYNPADRAA 227

Query:   242 KISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQLDHAVTI 299
               S    V  G E AL +AV+ + PVS+A+ A S  F  YK GIFN + C  +++H +  
Sbjct:   228 NCSTVWLVAQGSEAALEQAVATVGPVSVAVDASSFFFHFYKSGIFNSMFCSQKVNHGMLA 287

Query:   300 VGFGTTEDG---ANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
             VG+G +++     +YW++KNSW   WG+ GY+++++     CG+  ++S+PL
Sbjct:   288 VGYGISQEARKNVSYWILKNSWSEVWGEKGYIRLLKGVNNHCGVANQASFPL 339


>UNIPROTKB|G1K2A7 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9615 "Canis lupus
            familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 PANTHER:PTHR12411:SF55 OMA:LKVPPSH
            EMBL:AAEX03011051 Ensembl:ENSCAFT00000019202 Uniprot:G1K2A7
        Length = 333

 Score = 576 (207.8 bits), Expect = 6.8e-56, P = 6.8e-56
 Identities = 122/322 (37%), Positives = 191/322 (59%)

Query:    33 VSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRT 88
             ++S + + + +++   + W   + + Y  ++++  R  I+++NL++I   N E   G  T
Sbjct:    15 MASFALYPEEILDTQWDLWKKTYRKQYNSKVDELSRRLIWEKNLKHISIHNLEASLGVHT 74

Query:    89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAV 148
             Y+L  N   D+T++E     TG K+P PSH       +     S    P S+D+R KG V
Sbjct:    75 YELAMNHLGDMTSEEVVQKMTGLKVP-PSHSRSNDTLYIPDWESRA--PDSVDYRKKGYV 131

Query:   149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             TP+KNQ +CG CWAF++V A+EG  K ++G L+ LS Q L+DC +  N+GC GG    AF
Sbjct:   132 TPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE-NDGCGGGYMTNAF 190

Query:   209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVS 267
              Y+ +N+GI +ED YPY     +C       AAK   Y E+P G+E+AL +AV+ + P+S
Sbjct:   191 QYVQKNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPIS 250

Query:   268 IAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDA 325
             +AI A  T FQ Y +G++ +  C +  L+HAV  VG+G  + G  +W+IKNSWG  WG+ 
Sbjct:   251 VAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGI-QKGNKHWIIKNSWGENWGNK 309

Query:   326 GYMKIVRDEG-LCGIGTRSSYP 346
             GY+ + R++   CGI   +S+P
Sbjct:   310 GYILMARNKNNACGIANLASFP 331


>UNIPROTKB|Q3ZKN1 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:AY738221
            RefSeq:NP_001029168.1 UniGene:Cfa.588 HSSP:P43235
            ProteinModelPortal:Q3ZKN1 SMR:Q3ZKN1 STRING:Q3ZKN1 GeneID:608843
            KEGG:cfa:608843 InParanoid:Q3ZKN1 NextBio:20894470 Uniprot:Q3ZKN1
        Length = 330

 Score = 576 (207.8 bits), Expect = 6.8e-56, P = 6.8e-56
 Identities = 122/322 (37%), Positives = 191/322 (59%)

Query:    33 VSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRT 88
             ++S + + + +++   + W   + + Y  ++++  R  I+++NL++I   N E   G  T
Sbjct:    12 MASFALYPEEILDTQWDLWKKTYRKQYNSKVDELSRRLIWEKNLKHISIHNLEASLGVHT 71

Query:    89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAV 148
             Y+L  N   D+T++E     TG K+P PSH       +     S    P S+D+R KG V
Sbjct:    72 YELAMNHLGDMTSEEVVQKMTGLKVP-PSHSRSNDTLYIPDWESRA--PDSVDYRKKGYV 128

Query:   149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             TP+KNQ +CG CWAF++V A+EG  K ++G L+ LS Q L+DC +  N+GC GG    AF
Sbjct:   129 TPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE-NDGCGGGYMTNAF 187

Query:   209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVS 267
              Y+ +N+GI +ED YPY     +C       AAK   Y E+P G+E+AL +AV+ + P+S
Sbjct:   188 QYVQKNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPIS 247

Query:   268 IAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDA 325
             +AI A  T FQ Y +G++ +  C +  L+HAV  VG+G  + G  +W+IKNSWG  WG+ 
Sbjct:   248 VAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGI-QKGNKHWIIKNSWGENWGNK 306

Query:   326 GYMKIVRDEG-LCGIGTRSSYP 346
             GY+ + R++   CGI   +S+P
Sbjct:   307 GYILMARNKNNACGIANLASFP 328


>DICTYBASE|DDB_G0278401 [details] [associations]
            symbol:cprH "cysteine proteinase 8" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0278401 EMBL:AAFI02000023
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 ProtClustDB:CLSZ2430780 RefSeq:XP_642342.1
            ProteinModelPortal:Q54Y60 MEROPS:C01.A62 EnsemblProtists:DDB0205428
            GeneID:8621547 KEGG:ddi:DDB_G0278401 InParanoid:Q54Y60 OMA:FANMENE
            Uniprot:Q54Y60
        Length = 337

 Score = 575 (207.5 bits), Expect = 8.6e-56, P = 8.6e-56
 Identities = 135/344 (39%), Positives = 190/344 (55%)

Query:    24 LLVSCASQVVSSRSTHEQSVVEIHEK---WMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
             L V CA  +  + +  E S  +  +    WM  + +SY    E   R  IFK N +YIE+
Sbjct:     4 LSVLCALLITVATAKQELSESQYRDAFTDWMISNQKSYSSS-EFITRYNIFKTNFDYIEE 62

Query:    81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSL 140
              N +G+ T  LG N+ +D+TN+E+R+LY G    + S         K + L      +++
Sbjct:    63 WNSKGSETV-LGLNKMADITNEEYRSLYLGKPFDASS-----LIGTKEEILFSNKFSSTV 116

Query:   141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN---LIQLSEQQLLDCSTN-GN 196
             DWR KGAVT +KNQ+ C  CW+F+A  A EG  K+ +     L+ LSEQ L+DCST  GN
Sbjct:   117 DWRKKGAVTHVKNQQSCSGCWSFSATGATEGAHKLANNGTNELVSLSEQNLIDCSTPFGN 176

Query:   197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
              GC GG    AF YII N GI TE  YP++   GTC    + + A IS+Y  V  G E +
Sbjct:   177 TGCNGGVITYAFEYIISNGGIDTEKSYPFEGTDGTCRYKSENSGATISSYVNVTFGSESS 236

Query:   257 LLKAVSMQPVSIAIAAYSTEFQSYKEGI-FNGVCG-TQLDHAVTIVGFGT----TEDGA- 309
             L  AV++ PV+ +I A  + F  YK GI F   C  T LDH V +VG+GT    ++D + 
Sbjct:   237 LESAVNVNPVACSIDASHSSFLFYKSGIYFEPACSRTNLDHGVLVVGYGTENSQSQDSSS 296

Query:   310 -----NYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
                  NYW+ KNSWG      GY+ + +D + +CGI T +S+P+
Sbjct:   297 EPNHSNYWIAKNSWGIN----GYILMSKDRDNMCGISTLASFPI 336


>RGD|708447 [details] [associations]
            symbol:Testin "testin gene" species:10116 "Rattus norvegicus"
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0030054 "cell junction" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 RGD:708447 GO:GO:0005576 GO:GO:0030054 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            MEROPS:C01.972 OMA:RYHAENS OrthoDB:EOG4XWG0N EMBL:U16858
            IPI:IPI00207173 PIR:I52525 PIR:PC1251 RefSeq:NP_775155.1
            UniGene:Rn.10029 ProteinModelPortal:P15242 SMR:P15242
            Ensembl:ENSRNOT00000024467 GeneID:286916 KEGG:rno:286916
            UCSC:RGD:708447 CTD:286916 InParanoid:P15242 NextBio:625036
            Genevestigator:P15242 GermOnline:ENSRNOG00000018028 Uniprot:P15242
        Length = 333

 Score = 575 (207.5 bits), Expect = 8.6e-56, P = 8.6e-56
 Identities = 129/342 (37%), Positives = 194/342 (56%)

Query:    18 MFIIITLLVSCASQVVSSRSTHEQSV-VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
             M  ++ L + C  +V S+  T + S+ VE +E W  +HG++Y    E+ ++  ++++N +
Sbjct:     1 MIAVLFLAILCL-EVDSTAPTPDPSLDVEWNE-WRTKHGKTYNMN-EERLKRAVWEKNFK 57

Query:    77 YIEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSM 133
              IE  N    EG   + +  N F DLTN EF  + TG++      R        +Q+   
Sbjct:    58 MIELHNWEYLEGRHDFTMAMNAFGDLTNIEFVKMMTGFQ------RQKIKKTHIFQDHQF 111

Query:   134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC-S 192
               VP  +DWR  G VTP+KNQ  C   WAF+A  ++EG    ++  LI LSEQ LLDC  
Sbjct:   112 LYVPKRVDWRQLGYVTPVKNQGHCASSWAFSATGSLEGQMFRKTERLIPLSEQNLLDCMG 171

Query:   193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
             +N  +GC GG  + AF Y+  N G+ATE+ YPY+     C    + +AA + ++ ++P G
Sbjct:   172 SNVTHGCSGGFMQYAFQYVKDNGGLATEESYPYRGQGRECRYHAENSAANVRDFVQIP-G 230

Query:   253 DEQALLKAVS-MQPVSIAIAAYSTEFQSYKEGIF-NGVCG-TQLDHAVTIVGFG---TTE 306
              E+AL+KAV+ + P+S+A+ A    FQ Y  GI+    C    L+HAV +VG+G      
Sbjct:   231 SEEALMKAVAKVGPISVAVDASHGSFQFYGSGIYYEPQCKRVHLNHAVLVVGYGFEGEES 290

Query:   307 DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
             DG ++WL+KNSWG  WG  GYMK+ +D    CGI T S+YP+
Sbjct:   291 DGNSFWLVKNSWGEEWGMKGYMKLAKDWSNHCGIATYSTYPI 332


>ZFIN|ZDB-GENE-001205-4 [details] [associations]
            symbol:ctsk "cathepsin K" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-001205-4 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            CTD:1513 KO:K01371 OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55
            EMBL:BC092901 IPI:IPI00512751 RefSeq:NP_001017778.1
            UniGene:Dr.76224 ProteinModelPortal:Q568D6 SMR:Q568D6 GeneID:550475
            KEGG:dre:550475 InParanoid:Q568D6 NextBio:20879718
            ArrayExpress:Q568D6 Uniprot:Q568D6
        Length = 333

 Score = 572 (206.4 bits), Expect = 1.8e-55, P = 1.8e-55
 Identities = 123/320 (38%), Positives = 174/320 (54%)

Query:    35 SRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKL 91
             + S    S+ E  E W   H R Y    E+ +R  I+++N+ +IE  NKE   G  TY L
Sbjct:    18 AHSLDNLSLDEAWESWKITHKREYNGLNEESIRRTIWEKNMLFIEAHNKEYELGIHTYDL 77

Query:    92 GTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPI 151
             G N F D+T +E      G +MP   +R           +    +P S+D+R  G VT +
Sbjct:    78 GMNHFGDMTLEEVAEKVMGLQMPM--YRDPANTFVPDDRVGK--LPKSIDYRKLGYVTSV 133

Query:   152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
             KNQ  CG CWAF++V A+EG      G L+ LS Q L+DC T  N+GC GG    AF Y+
Sbjct:   134 KNQGSCGSCWAFSSVGALEGQLMKTKGQLVDLSPQNLVDCVTE-NDGCGGGYMTNAFRYV 192

Query:   212 IQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAI 270
               NQGI +E+ YPY      C+      AA    Y+E+P G+E+AL  AV+ + PVS+ I
Sbjct:   193 SNNQGIDSEESYPYVGTDQQCAYNTSGVAASCRGYKEIPQGNERALTAAVANVGPVSVGI 252

Query:   271 AAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
              A  + F  YK G++ +  C  + ++HAV  VG+G T  G  YW++KNSWG  WG  GY+
Sbjct:   253 DAMQSTFLYYKSGVYYDPNCNKEDVNHAVLAVGYGATPRGKKYWIVKNSWGEEWGKKGYV 312

Query:   329 KIVRDEG-LCGIGTRSSYPL 347
              + R+    CGI   +S+P+
Sbjct:   313 LMARNRNNACGIANLASFPV 332


>UNIPROTKB|F1PMM9 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9615 "Canis lupus
            familiaris" [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00639 GO:GO:0004197 GO:GO:0002250
            GeneTree:ENSGT00560000076577 GO:GO:0097067 EMBL:AAEX03000499
            Ensembl:ENSCAFT00000002029 OMA:EFKQVLN Uniprot:F1PMM9
        Length = 341

 Score = 571 (206.1 bits), Expect = 2.3e-55, P = 2.3e-55
 Identities = 116/326 (35%), Positives = 189/326 (57%)

Query:    33 VSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRT 88
             ++S +  +   ++ H  +W   HG+ Y D+ E+  R  +++ N+E IE+ N+E   G  +
Sbjct:    22 IASAAPQQDHSLDAHWSQWKEAHGKLY-DKDEEGWRRTVWERNMEMIEQHNQEYSQGEHS 80

Query:    89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAV 148
             + L  N F D+TN+EF+ +   +K+    H+        +      +VP+S+DWR++G V
Sbjct:    81 FTLAMNAFGDMTNEEFKQVLNDFKIQK--HKKGKV----FPAPLFAEVPSSVDWREQGYV 134

Query:   149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKA 207
             TP+K+Q +C  CWAF+A  A+EG    ++G L+ LSEQ L+DCS + GN GC GG  E A
Sbjct:   135 TPVKDQGQCLGCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWSQGNRGCNGGLMEYA 194

Query:   208 FAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
             F Y+  N G+ +E+ YPY A    C    + +AA ++ +  + + ++  +    ++ PVS
Sbjct:   195 FQYVKDNGGLDSEESYPYLARNEPCKYRPEKSAANVTAFWPILNEEDGLMTTVATVGPVS 254

Query:   268 IAIAAYSTEFQSYKEGIF-NGVCGTQL-DHAVTIVGFG---TTEDGANYWLIKNSWGNTW 322
              A+ +    FQ YK+GI+ +  C  +L +H V +VG+G      D   YW++KNSWG  W
Sbjct:   255 AAVDSSPQSFQFYKKGIYYDPKCSNKLLNHGVLVVGYGFEGAESDNKKYWIVKNSWGTNW 314

Query:   323 GDAGYMKIVRD-EGLCGIGTRSSYPL 347
             G  GYM + +D +  CGI TR+SYP+
Sbjct:   315 GMQGYMLLAKDRDNHCGIATRASYPV 340


>MGI|MGI:107823 [details] [associations]
            symbol:Ctsk "cathepsin K" species:10090 "Mus musculus"
            [GO:0005615 "extracellular space" evidence=ISO] [GO:0005737
            "cytoplasm" evidence=ISO] [GO:0005764 "lysosome" evidence=ISO]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0045453 "bone resorption" evidence=ISO]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:107823 GO:GO:0005615 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0045453
            GO:GO:0001957 HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 OMA:LKVPPSH EMBL:X94444
            EMBL:AJ006033 EMBL:BC046320 IPI:IPI00316575 PIR:S74227
            RefSeq:NP_031828.2 UniGene:Mm.272085 ProteinModelPortal:P55097
            SMR:P55097 MINT:MINT-3089515 STRING:P55097 PhosphoSite:P55097
            PRIDE:P55097 Ensembl:ENSMUST00000015664 GeneID:13038 KEGG:mmu:13038
            InParanoid:P55097 BioCyc:MetaCyc:MONOMER-14811 ChEMBL:CHEMBL1075277
            NextBio:282924 Bgee:P55097 CleanEx:MM_CTSK Genevestigator:P55097
            GermOnline:ENSMUSG00000028111 Uniprot:P55097
        Length = 329

 Score = 569 (205.4 bits), Expect = 3.7e-55, P = 3.7e-55
 Identities = 123/322 (38%), Positives = 185/322 (57%)

Query:    32 VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRT 88
             +VS   + E+ +    E W   H + Y  ++++  R  I+++NL+ I   N E   G  T
Sbjct:    11 MVSFALSPEEMLDTQWELWKKTHQKQYNSKVDEISRRLIWEKNLKQISAHNLEASLGVHT 70

Query:    89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAV 148
             Y+L  N   D+T++E     TG ++P PS        +  +      VP S+D+R KG V
Sbjct:    71 YELAMNHLGDMTSEEVVQKMTGLRIP-PSRSYSNDTLYTPEWEGR--VPDSIDYRKKGYV 127

Query:   149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             TP+KNQ +CG CWAF++  A+EG  K ++G L+ LS Q L+DC T  N GC GG    AF
Sbjct:   128 TPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVTE-NYGCGGGYMTTAF 186

Query:   209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVS 267
              Y+ QN GI +ED YPY     +C       AAK   Y E+P G+E+AL +AV+ + P+S
Sbjct:   187 QYVQQNGGIDSEDAYPYVGQDESCMYNATAKAAKCRGYREIPVGNEKALKRAVARVGPIS 246

Query:   268 IAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDA 325
             ++I A    FQ Y  G++ +  C    ++HAV +VG+GT + G+ +W+IKNSWG +WG+ 
Sbjct:   247 VSIDASLASFQFYSRGVYYDENCDRDNVNHAVLVVGYGT-QKGSKHWIIKNSWGESWGNK 305

Query:   326 GYMKIVRDEG-LCGIGTRSSYP 346
             GY  + R++   CGI   +S+P
Sbjct:   306 GYALLARNKNNACGITNMASFP 327


>RGD|621513 [details] [associations]
            symbol:Ctss "cathepsin S" species:10116 "Rattus norvegicus"
            [GO:0001656 "metanephros development" evidence=IEP] [GO:0002250
            "adaptive immune response" evidence=ISO] [GO:0005764 "lysosome"
            evidence=IEA;ISO] [GO:0006508 "proteolysis" evidence=IEA;ISO]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0009986 "cell
            surface" evidence=IDA] [GO:0016020 "membrane" evidence=ISO]
            [GO:0043231 "intracellular membrane-bounded organelle"
            evidence=ISO] [GO:0045453 "bone resorption" evidence=IMP]
            [GO:0051930 "regulation of sensory perception of pain"
            evidence=IMP] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            RGD:621513 GO:GO:0009986 GO:GO:0051930 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0045453
            GO:GO:0001656 HOVERGEN:HBG011513 CTD:1520 KO:K01368 MEROPS:I29.004
            BRENDA:3.4.22.27 EMBL:L03201 IPI:IPI00210228 PIR:A45087
            RefSeq:NP_059016.1 UniGene:Rn.11347 ProteinModelPortal:Q02765
            PhosphoSite:Q02765 PRIDE:Q02765 GeneID:50654 KEGG:rno:50654
            UCSC:RGD:621513 ChEMBL:CHEMBL1075217 NextBio:610462
            Genevestigator:Q02765 Uniprot:Q02765
        Length = 330

 Score = 568 (205.0 bits), Expect = 4.8e-55, P = 4.8e-55
 Identities = 127/322 (39%), Positives = 188/322 (58%)

Query:    37 STHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
             +T E+  ++ H + W     R   D+ E+++R  I+++NL++I   N E   G  +Y +G
Sbjct:    15 ATAERPTLDHHWDLWKKTRMRRNTDQNEEDVRRLIWEKNLKFIMLHNLEHSMGMHSYSVG 74

Query:    93 TNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
              N   D+T +E        ++P P +R         Q L     P S+DWR+KG VT +K
Sbjct:    75 MNHMGDMTPEEVIGYMGSLRIPRPWNRSGTLKSSSNQTL-----PDSVDWREKGCVTNVK 129

Query:   153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAFA 209
              Q  CG CWAF+A  A+EG  K+++G L+ LS Q L+DCST    GN GC GG   +AF 
Sbjct:   130 YQGSCGSCWAFSAEGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEAFQ 189

Query:   210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSI 268
             YII    I +E  YPY+A+   C    K  AA  S Y E+P GDE+AL +AV+ + PVS+
Sbjct:   190 YIIDTS-IDSEASYPYKAMDEKCLYDPKNRAATCSRYIELPFGDEEALKEAVATKGPVSV 248

Query:   269 AI--AAYSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDA 325
              I  A++S+ F  Y+ G+++   C   ++H V +VG+GT  DG +YWL+KNSWG  +GD 
Sbjct:   249 GIDDASHSSFFL-YQSGVYDDPSCTENMNHGVLVVGYGTL-DGKDYWLVKNSWGLHFGDQ 306

Query:   326 GYMKIVRD-EGLCGIGTRSSYP 346
             GY+++ R+ +  CGI +  SYP
Sbjct:   307 GYIRMARNNKNHCGIASYCSYP 328


>TAIR|locus:2078312 [details] [associations]
            symbol:AT3G45310 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005773 EMBL:CP002686
            GenomeReviews:BA000014_GR eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AL132953
            EMBL:AY091771 IPI:IPI00540369 PIR:T47471 RefSeq:NP_566880.1
            UniGene:At.25239 ProteinModelPortal:Q8RWQ9 SMR:Q8RWQ9
            MEROPS:C01.162 PaxDb:Q8RWQ9 PRIDE:Q8RWQ9 EnsemblPlants:AT3G45310.1
            GeneID:823669 KEGG:ath:AT3G45310 GeneFarm:5032 TAIR:At3g45310
            InParanoid:Q8RWQ9 KO:K01366 OMA:AFEVVHE PhylomeDB:Q8RWQ9
            ProtClustDB:CLSN2689015 Genevestigator:Q8RWQ9 Uniprot:Q8RWQ9
        Length = 358

 Score = 568 (205.0 bits), Expect = 4.8e-55, P = 4.8e-55
 Identities = 122/301 (40%), Positives = 169/301 (56%)

Query:    53 QHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK 112
             ++G+ Y+   E ++R  +FKENL+ I   NK+G  +YKL  NQF+DLT  EF+     YK
Sbjct:    65 RYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKG-LSYKLSLNQFADLTWQEFQR----YK 119

Query:   113 MPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGI 172
             + +  +            ++   VP + DWR+ G V+P+K Q  CG CW F+   A+E  
Sbjct:   120 LGAAQNCSATLKGS--HKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAA 177

Query:   173 TKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT 231
                  G  I LSEQQL+DC+   NN GC GG   +AF YI  N G+ TE+ YPY    G 
Sbjct:   178 YHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGG 237

Query:   232 CSAAQKPAAAKISNYEEVPSGDEQALLKAVSM-QPVSIAIAAYSTEFQSYKEGIF-NGVC 289
             C  + K    ++ +   +  G E  L  AV + +PVS+A      EF+ YK+G+F +  C
Sbjct:   238 CKFSAKNIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVH-EFRFYKKGVFTSNTC 296

Query:   290 G-TQLD--HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYP 346
             G T +D  HAV  VG+G  ED   YWLIKNSWG  WGD GY K+   + +CG+ T SSYP
Sbjct:   297 GNTPMDVNHAVLAVGYGV-EDDVPYWLIKNSWGGEWGDNGYFKMEMGKNMCGVATCSSYP 355

Query:   347 L 347
             +
Sbjct:   356 V 356


>ZFIN|ZDB-GENE-040426-1583 [details] [associations]
            symbol:ctssa "cathepsin S, a" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-040426-1583
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            GeneTree:ENSGT00660000095458 EMBL:CR548627 IPI:IPI00491948
            UniGene:Dr.81560 SMR:Q1L8W8 Ensembl:ENSDART00000053638 OMA:RNTREER
            OrthoDB:EOG480HX9 Uniprot:Q1L8W8
        Length = 328

 Score = 568 (205.0 bits), Expect = 4.8e-55, P = 4.8e-55
 Identities = 116/304 (38%), Positives = 177/304 (58%)

Query:    50 WMAQHGRSYKDELEKEMRLKIFKENLEYI---EKANKEGNRTYKLGTNQFSDLTNDEFRA 106
             W +QH ++Y++  E+ +R  ++K+NL+ I    +A   G  +Y LG NQ SD+T DE   
Sbjct:    30 WKSQHNKTYRNTREERLRRSVWKQNLQDILLHNEAAAVGLHSYTLGLNQLSDMTADEVND 89

Query:   107 LYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
             +    +   P           +   S+  +P  ++W + G V+P++NQ  CG CWAF+AV
Sbjct:    90 MNGLLEEDFPDVNAT------FSPPSLQTLPQRVNWTEHGMVSPVQNQGPCGSCWAFSAV 143

Query:   167 AAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
              ++E   K R+  L+ LS Q LLDCS + GN GC GG   +AF Y+IQN+GI +   YPY
Sbjct:   144 GSLEAQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRAFLYVIQNRGIDSSTFYPY 203

Query:   226 QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKEGI 284
             +   G C  +    A   + +  VP  +E AL  AV+ + PVS+ I A    F  Y+ GI
Sbjct:   204 EHKEGVCRYSVSGRAGYCTGFRIVPRHNEAALQSAVANIGPVSVGINAKLLSFHRYRSGI 263

Query:   285 FNGV-CGTQL-DHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTR 342
             +N   C + L +HAV +VG+G+ E+G +YWL+KNSWG  WG+ GY+++ R++ +CGI + 
Sbjct:   264 YNDPKCSSALINHAVLVVGYGS-ENGQDYWLVKNSWGTAWGENGYIRMARNKNMCGISSF 322

Query:   343 SSYP 346
               YP
Sbjct:   323 GIYP 326


>RGD|2447 [details] [associations]
            symbol:Ctsh "cathepsin H" species:10116 "Rattus norvegicus"
          [GO:0001520 "outer dense fiber" evidence=IDA] [GO:0001656
          "metanephros development" evidence=IEP] [GO:0001669 "acrosomal
          vesicle" evidence=IDA] [GO:0001913 "T cell mediated cytotoxicity"
          evidence=ISO;ISS] [GO:0002250 "adaptive immune response"
          evidence=ISO] [GO:0002764 "immune response-regulating signaling
          pathway" evidence=ISO;ISS] [GO:0004175 "endopeptidase activity"
          evidence=ISO] [GO:0004177 "aminopeptidase activity" evidence=ISO;IDA]
          [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISO;ISS]
          [GO:0004252 "serine-type endopeptidase activity" evidence=ISO;ISS]
          [GO:0005615 "extracellular space" evidence=ISO;ISS;IDA] [GO:0005764
          "lysosome" evidence=ISO;ISS;IDA] [GO:0005829 "cytosol"
          evidence=ISO;ISS] [GO:0006508 "proteolysis" evidence=IEP;ISO]
          [GO:0007283 "spermatogenesis" evidence=IEP] [GO:0008233 "peptidase
          activity" evidence=ISO] [GO:0008234 "cysteine-type peptidase
          activity" evidence=ISO] [GO:0008284 "positive regulation of cell
          proliferation" evidence=ISO;ISS] [GO:0010628 "positive regulation of
          gene expression" evidence=ISO;ISS] [GO:0010634 "positive regulation
          of epithelial cell migration" evidence=ISO;ISS] [GO:0010813
          "neuropeptide catabolic process" evidence=ISO;ISS] [GO:0010815
          "bradykinin catabolic process" evidence=ISO;ISS] [GO:0010952
          "positive regulation of peptidase activity" evidence=ISO;ISS]
          [GO:0016505 "apoptotic protease activator activity" evidence=ISO;ISS]
          [GO:0030108 "HLA-A specific activating MHC class I receptor activity"
          evidence=ISO;ISS] [GO:0030335 "positive regulation of cell migration"
          evidence=ISO;ISS] [GO:0030984 "kininogen binding" evidence=IPI]
          [GO:0031638 "zymogen activation" evidence=ISO;ISS] [GO:0031648
          "protein destabilization" evidence=ISO;ISS] [GO:0032403 "protein
          complex binding" evidence=IPI] [GO:0032526 "response to retinoic
          acid" evidence=ISO;ISS] [GO:0033619 "membrane protein proteolysis"
          evidence=ISO;ISS] [GO:0035085 "cilium axoneme" evidence=IDA]
          [GO:0043066 "negative regulation of apoptotic process"
          evidence=ISO;ISS] [GO:0043129 "surfactant homeostasis"
          evidence=ISO;ISS] [GO:0043621 "protein self-association"
          evidence=IDA] [GO:0045766 "positive regulation of angiogenesis"
          evidence=ISO;ISS] [GO:0060448 "dichotomous subdivision of terminal
          units involved in lung branching" evidence=ISO;ISS] [GO:0070324
          "thyroid hormone binding" evidence=ISO;ISS] [GO:0070371 "ERK1 and
          ERK2 cascade" evidence=ISO;ISS] [GO:0097067 "cellular response to
          thyroid hormone stimulus" evidence=ISO;IEP] [GO:0097208 "alveolar
          lamellar body" evidence=ISO;ISS;IDA] InterPro:IPR000668
          InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
          InterPro:IPR000169 RGD:2447 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
          GO:GO:0008284 GO:GO:0070371 GO:GO:0001669 eggNOG:COG4870
          HOGENOM:HOG000230774 InterPro:IPR025661 InterPro:IPR025660
          InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
          PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0007283
          GO:GO:0045766 GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
          GO:GO:0043621 GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 KO:K01366
          GO:GO:0016505 GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
          HOVERGEN:HBG011513 GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
          GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT OrthoDB:EOG4W9J43
          GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
          GO:GO:0010813 GO:GO:0043129 MEROPS:I29.003 EMBL:Y00708 EMBL:BC085352
          EMBL:M38135 IPI:IPI00212809 PIR:S00211 RefSeq:NP_037071.1
          UniGene:Rn.1997 ProteinModelPortal:P00786 SMR:P00786 STRING:P00786
          PRIDE:P00786 Ensembl:ENSRNOT00000019285 GeneID:25425 KEGG:rno:25425
          UCSC:RGD:2447 InParanoid:P00786 BindingDB:P00786 NextBio:606599
          Genevestigator:P00786 GermOnline:ENSRNOG00000014064 GO:GO:0035086
          GO:GO:0001520 Uniprot:P00786
        Length = 333

 Score = 567 (204.7 bits), Expect = 6.1e-55, P = 6.1e-55
 Identities = 128/343 (37%), Positives = 192/343 (55%)

Query:    18 MFIIITLLVSCASQ-VVSSRSTHEQSVVEIHE----KWMAQHGRSYKDELEKEMRLKIFK 72
             M+  + LL  CA   ++S+ +T E +V  I +     WM QH ++Y    E   RL++F 
Sbjct:     1 MWTALPLL--CAGAWLLSAGATAELTVNAIEKFHFTSWMKQHQKTYSSR-EYSHRLQVFA 57

Query:    73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLS 132
              N   I+ A+ + N T+K+G NQFSD++   F  +   Y    P +       +    L 
Sbjct:    58 NNWRKIQ-AHNQRNHTFKMGLNQFSDMS---FAEIKHKYLWSEPQNCSATKSNY----LR 109

Query:   133 MTD-VPTSLDWRDKG-AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
              T   P+S+DWR KG  V+P+KNQ  CG CW F+   A+E    I SG ++ L+EQQL+D
Sbjct:   110 GTGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMMTLAEQQLVD 169

Query:   191 CSTNGNN-GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
             C+ N NN GC GG   +AF YI+ N+GI  ED YPY    G C    + A A + N   +
Sbjct:   170 CAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIGKNGQCKFNPEKAVAFVKNVVNI 229

Query:   250 PSGDEQALLKAVSM-QPVSIAIAAYSTEFQSYKEGIFNG-VCGT---QLDHAVTIVGFGT 304
                DE A+++AV++  PVS A    + +F  YK G+++   C     +++HAV  VG+G 
Sbjct:   230 TLNDEAAMVEAVALYNPVSFAFEV-TEDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGE 288

Query:   305 TEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
              ++G  YW++KNSWG+ WG+ GY  I R + +CG+   +SYP+
Sbjct:   289 -QNGLLYWIVKNSWGSNWGNNGYFLIERGKNMCGLAACASYPI 330


>RGD|69241 [details] [associations]
            symbol:Ctsj "cathepsin J" species:10116 "Rattus norvegicus"
           [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
           evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
           evidence=IEA] [GO:0048471 "perinuclear region of cytoplasm"
           evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
           PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:L14776
           RGD:69241 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
           InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246
           SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
           GO:GO:0005764 HOVERGEN:HBG011513 MEROPS:C01.038 CTD:26898 KO:K09599
           EMBL:AF310623 EMBL:BC097263 IPI:IPI00205027 PIR:I58002
           RefSeq:NP_058817.1 UniGene:Rn.34875 ProteinModelPortal:Q63088
           SMR:Q63088 PRIDE:Q63088 GeneID:29174 KEGG:rno:29174 NextBio:608244
           Genevestigator:Q63088 Uniprot:Q63088
        Length = 334

 Score = 567 (204.7 bits), Expect = 6.1e-55, P = 6.1e-55
 Identities = 121/335 (36%), Positives = 190/335 (56%)

Query:    22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
             + L++ C   V S     + ++    + W  ++ +SY   +E+E++  +++ENL+ I+  
Sbjct:     5 VFLVILCFG-VASGAPARDPNLDAEWQDWKTKYAKSYSP-VEEELKRAVWEENLKMIQLH 62

Query:    82 NKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPT 138
             NKE   G   + +  N F+D T +EFR   +   +P+             + +S+  +P 
Sbjct:    63 NKENGLGKNGFTMEMNAFADTTGEEFRKSLSDILIPAAVTNPSAQ-----KQVSI-GLPN 116

Query:   139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNN 197
               DWR +G VTP++NQ +CG CWAFAAV A+EG    ++GNL  LS Q LLDCS + GNN
Sbjct:   117 FKDWRKEGYVTPVRNQGKCGSCWAFAAVGAIEGQMFSKTGNLTPLSVQNLLDCSKSEGNN 176

Query:   198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
             GC  G+  +AF Y+++N+G+  E  YPY+   G C    + A+A I+ +  +P  +    
Sbjct:   177 GCRWGTAHQAFNYVLKNKGLEAEATYPYEGKDGPCRYHSENASANITGFVNLPPNELYLW 236

Query:   258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQL-DHAVTIVGFG---TTEDGANYW 312
             +   S+ PVS AI A    F+ Y  G+++   C + + +HAV +VG+G      DG NYW
Sbjct:   237 VAVASIGPVSAAIDASHDSFRFYSGGVYHEPNCSSYVVNHAVLVVGYGFEGNETDGNNYW 296

Query:   313 LIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYP 346
             LIKNSWG  WG  G+MKI +D    CGI +++S+P
Sbjct:   297 LIKNSWGEEWGINGFMKIAKDRNNHCGIASQASFP 331


>UNIPROTKB|G1SQF0 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9986
            "Oryctolagus cuniculus" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 RefSeq:XP_002721635.1 UniGene:Ocu.7137
            Ensembl:ENSOCUT00000006138 GeneID:100101597 Uniprot:G1SQF0
        Length = 333

 Score = 566 (204.3 bits), Expect = 7.8e-55, P = 7.8e-55
 Identities = 121/312 (38%), Positives = 179/312 (57%)

Query:    45 EIHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDE 103
             + H K WM+QH + Y  E E   RL+ F  N   I  A+  GN T+++G NQFSD++   
Sbjct:    30 KFHFKSWMSQHHKKYSAE-EYPRRLQTFVRNWRKIN-AHNNGNHTFQMGLNQFSDMS--- 84

Query:   104 FRALYTGYKMPSPSHRXXXXXXFKYQNLSMTD-VPTSLDWRDKGA-VTPIKNQKECGCCW 161
             F  +   Y    P +       +    L  T   P+S+DWR KG  V+P+KNQ  CG CW
Sbjct:    85 FAEIKHKYLWTEPQNCSATKSNY----LRGTGPYPSSVDWRKKGNFVSPVKNQGACGSCW 140

Query:   162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIATE 220
              F+   A+E    I  G ++ L+EQQL+DC+ N NN GC GG   +AF YI+ N+GI  E
Sbjct:   141 TFSTTGALESAVAIAGGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQAFEYILYNKGIMGE 200

Query:   221 DEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSM-QPVSIAIAAYSTEFQS 279
             D YPY+A+ G C    + A A + +   +   DE+A+++AV++  PVS A    + +F  
Sbjct:   201 DSYPYRAMEGRCKFQPQKAIAFVKDVANITLNDEEAMVEAVALYNPVSFAFEV-TEDFMQ 259

Query:   280 YKEGIFNGV-CGT---QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG 335
             Y++GI++   C     +++HAV  VG+G  E+G  YW++KNSWG+ WG  GY  I R + 
Sbjct:   260 YRKGIYSSTSCHKTPDKVNHAVLAVGYGE-ENGVPYWIVKNSWGSHWGMNGYFYIERGKN 318

Query:   336 LCGIGTRSSYPL 347
             +CG+   +SYP+
Sbjct:   319 MCGLAACASYPI 330


>MGI|MGI:1922258 [details] [associations]
            symbol:4930486L24Rik "RIKEN cDNA 4930486L24 gene"
            species:10090 "Mus musculus" [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030054 "cell
            junction" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 MGI:MGI:1922258
            GO:GO:0005576 GO:GO:0030054 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 HSSP:P07711
            EMBL:AY146988 EMBL:AK145933 EMBL:BC061218 IPI:IPI00280732
            RefSeq:NP_835199.1 UniGene:Mm.19839 ProteinModelPortal:Q80UB0
            SMR:Q80UB0 MEROPS:C01.972 PRIDE:Q80UB0 Ensembl:ENSMUST00000091569
            GeneID:214639 KEGG:mmu:214639 UCSC:uc007qvs.1 InParanoid:Q80UB0
            OMA:RYHAENS OrthoDB:EOG4XWG0N NextBio:374408 Bgee:Q80UB0
            CleanEx:MM_4930486L24RIK Genevestigator:Q80UB0 Uniprot:Q80UB0
        Length = 333

 Score = 566 (204.3 bits), Expect = 7.8e-55, P = 7.8e-55
 Identities = 127/342 (37%), Positives = 192/342 (56%)

Query:    18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
             M  ++ L + C  ++ S+  T + S+     +W  +HG++Y    E+ +R  ++++N + 
Sbjct:     1 MIAVLFLAILCL-EIDSTAPTLDPSLDVQWNEWRTKHGKAYNVN-EERLRRAVWEKNFKM 58

Query:    78 IEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMT 134
             IE  N    EG   + +  N F DLTN EF  + TG++      R        +Q+    
Sbjct:    59 IELHNWEYLEGKHDFTMTMNAFGDLTNTEFVKMMTGFR------RQKIKRMHVFQDHQFL 112

Query:   135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC-ST 193
              VP  +DWR  G VTP+KNQ  C   WAF+A  ++EG    ++G L+ LSEQ LLDC  +
Sbjct:   113 YVPKYVDWRMLGYVTPVKNQGYCASSWAFSATGSLEGQMFKKTGRLVPLSEQNLLDCMGS 172

Query:   194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVPSG 252
             N  + C GG  + AF Y+  N G+ATE+ YPY   PG  C    + +AA + ++ ++P G
Sbjct:   173 NVTHDCSGGFMQNAFQYVKDNGGLATEESYPYIG-PGRKCRYHAENSAANVRDFVQIP-G 230

Query:   253 DEQALLKAVS-MQPVSIAIAAYSTEFQSYKEGIF-NGVCG-TQLDHAVTIVGFG---TTE 306
              E+AL+KAV+ + P+S+A+ A    FQ Y  GI+    C    L+HAV +VG+G      
Sbjct:   231 REEALMKAVAKVGPISVAVDASHDSFQFYDSGIYYEPQCKRVHLNHAVLVVGYGFEGEES 290

Query:   307 DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
             DG +YWL+KNSWG  WG  GY+KI +D    CGI T ++YP+
Sbjct:   291 DGNSYWLVKNSWGEEWGMKGYIKIAKDWNNHCGIATLATYPI 332


>UNIPROTKB|Q4QRC2 [details] [associations]
            symbol:Ctsql2 "Protein Ctsql2" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 HOVERGEN:HBG011513 EMBL:CH474032
            RGD:1303225 EMBL:BC097257 IPI:IPI00421946 RefSeq:NP_001002813.2
            UniGene:Rn.128678 SMR:Q4QRC2 MEROPS:C01.111
            Ensembl:ENSRNOT00000038758 GeneID:408201 KEGG:rno:408201 CTD:408201
            InParanoid:Q4QRC2 OMA:NDEGALM NextBio:696394 Genevestigator:Q4QRC2
            Uniprot:Q4QRC2
        Length = 343

 Score = 566 (204.3 bits), Expect = 7.8e-55, P = 7.8e-55
 Identities = 126/345 (36%), Positives = 190/345 (55%)

Query:    18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
             M   + L++ C   VVS  S    S+    ++W  ++ + Y  E E+ ++  +++EN++ 
Sbjct:     1 MTAALFLIILCLG-VVSGASAFNLSLDVQWQEWKMKYEKLYSPE-EELLKRVVWEENVKK 58

Query:    78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSH-----RXXXXXXFKYQ 129
             IE  N+E   G  TY +  N F+DLT++EF+ + TG  +P  +      +      F   
Sbjct:    59 IELHNRENSLGKNTYIMEINNFADLTDEEFKDMITGITLPINNTMKSLWKRALGSPFPNS 118

Query:   130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
                   +P S+DWR +G VT ++ Q +C  CWAF    A+EG    ++G L  LS Q L+
Sbjct:   119 WYWRDALPKSIDWRKEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLV 178

Query:   190 DCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
             DCS   GN GC GG+   AF Y++QN G+ +E  YPY+   G C    K A AKI+ +  
Sbjct:   179 DCSKPQGNKGCRGGTTYNAFQYVLQNGGLESEATYPYKGKEGLCKYNPKNAYAKITRFVA 238

Query:   249 VPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFG--- 303
             +P  DE  L+ A++ + PV+  I    +  + YK+GI++   C  +++HAV +VG+G   
Sbjct:   239 LPE-DEDVLMDALATKGPVAAGIHVVYSSLRFYKKGIYHEPKCNNRVNHAVLVVGYGFEG 297

Query:   304 TTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYPL 347
                DG NYWLIKNSWG  WG  GYMKI +D    CGI T + YP+
Sbjct:   298 NETDGNNYWLIKNSWGKQWGLKGYMKIAKDRNNHCGIATFAQYPI 342


>UNIPROTKB|Q5E968 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:BT021052
            EMBL:BC109853 IPI:IPI00709374 RefSeq:NP_001029607.1
            UniGene:Bt.23218 ProteinModelPortal:Q5E968 SMR:Q5E968 STRING:Q5E968
            MEROPS:I29.007 PRIDE:Q5E968 Ensembl:ENSBTAT00000028016
            GeneID:513038 KEGG:bta:513038 CTD:1513 InParanoid:Q5E968 KO:K01371
            OrthoDB:EOG4SJ5FC NextBio:20870669 PANTHER:PTHR12411:SF55
            Uniprot:Q5E968
        Length = 329

 Score = 565 (203.9 bits), Expect = 9.9e-55, P = 9.9e-55
 Identities = 121/323 (37%), Positives = 189/323 (58%)

Query:    33 VSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRT 88
             V S + + + +++   E W   + + Y  + ++  R  I+++NL++I   N E   G  T
Sbjct:    11 VVSFALYPEEILDTQWELWKKTYRKQYNSKGDEISRRLIWEKNLKHISIHNLEASLGVHT 70

Query:    89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFK-YQNLSMTDVPTSLDWRDKGA 147
             Y+L  N   D+T++E     TG K+P+   R         ++  +    P S+D+R KG 
Sbjct:    71 YELAMNHLGDMTSEEVVQKMTGLKVPASRSRSNDTLYIPDWEGRA----PDSVDYRKKGY 126

Query:   148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKA 207
             VTP+KNQ +CG CWAF++V A+EG  K ++G L+ LS Q L+DC +  N+GC GG    A
Sbjct:   127 VTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE-NDGCGGGYMTNA 185

Query:   208 FAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPV 266
             F Y+ +N+GI +ED YPY      C       AAK   Y E+P G+E+AL +AV+ + P+
Sbjct:   186 FQYVQKNRGIDSEDAYPYVGQDENCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPI 245

Query:   267 SIAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGD 324
             S+AI A  T FQ Y++G++ +  C +  L+HAV  VG+G  + G  +W+IKNSWG  WG+
Sbjct:   246 SVAIDASLTSFQFYRKGVYYDENCNSDNLNHAVLAVGYGI-QKGNKHWIIKNSWGENWGN 304

Query:   325 AGYMKIVRDEG-LCGIGTRSSYP 346
              GY+ + R++   CGI   +S+P
Sbjct:   305 KGYILMARNKNNACGIANLASFP 327


>UNIPROTKB|P83443 [details] [associations]
            symbol:P83443 "Macrodontain-1" species:203992 "Pseudananas
            sagenarius" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0006508 "proteolysis" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197
            ProteinModelPortal:P83443 SMR:P83443 MEROPS:C01.028 Uniprot:P83443
        Length = 213

 Score = 561 (202.5 bits), Expect = 2.6e-54, P = 2.6e-54
 Identities = 105/215 (48%), Positives = 144/215 (66%)

Query:   136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
             VP S+DWRD GAV  +KNQ  CG CWAFAA+A VEGI KIR GNL+ LSEQ++LDC+ + 
Sbjct:     2 VPQSIDWRDYGAVNEVKNQGPCGGCWAFAAIATVEGIYKIRKGNLVYLSEQEVLDCAVS- 60

Query:   196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
               GC GG   +A+ +II N G+ T++ YPY+A  GTC+A   P +A I+ Y  V   DE 
Sbjct:    61 -YGCKGGWVNRAYDFIISNNGVTTDENYPYRAYQGTCNANYFPNSAYITGYSYVRRNDES 119

Query:   256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
              ++ AVS QP++  I A    FQ YK G+++G CG  L+HA+TI+G+G   D  +YW+++
Sbjct:   120 HMMYAVSNQPIAALIDASGDNFQYYKGGVYSGPCGFSLNHAITIIGYG--RD--SYWIVR 175

Query:   316 NSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
             NSWG++WG  GY++I RD     G+CGI     +P
Sbjct:   176 NSWGSSWGQGGYVRIRRDVSHSGGVCGIAMSPLFP 210


>UNIPROTKB|O46427 [details] [associations]
            symbol:CTSH "Pro-cathepsin H" species:9823 "Sus scrofa"
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0032526 "response to retinoic acid" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0043129
            "surfactant homeostasis" evidence=ISS] [GO:0010815 "bradykinin
            catabolic process" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0005829 "cytosol"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0030335 "positive regulation of cell
            migration" evidence=ISS] [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0016505 "apoptotic protease activator
            activity" evidence=ISS] [GO:0004252 "serine-type endopeptidase
            activity" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=ISS] [GO:0031638 "zymogen activation"
            evidence=ISS] [GO:0030108 "HLA-A specific activating MHC class I
            receptor activity" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0010628 "positive regulation of gene
            expression" evidence=ISS] [GO:0070324 "thyroid hormone binding"
            evidence=ISS] [GO:0006508 "proteolysis" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0060448
            "dichotomous subdivision of terminal units involved in lung
            branching" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] [GO:0005764 "lysosome" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0004177 "aminopeptidase
            activity" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=ISS] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 MEROPS:C01.040 CTD:1512 OMA:STSCHKT
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 EMBL:AF001169
            RefSeq:NP_999094.1 UniGene:Ssc.3593 PDB:1NB3 PDB:1NB5 PDB:8PCH
            PDBsum:1NB3 PDBsum:1NB5 PDBsum:8PCH ProteinModelPortal:O46427
            SMR:O46427 Ensembl:ENSSSCT00000001983 GeneID:396969 KEGG:ssc:396969
            EvolutionaryTrace:O46427 ArrayExpress:O46427 Uniprot:O46427
        Length = 335

 Score = 559 (201.8 bits), Expect = 4.3e-54, P = 4.3e-54
 Identities = 124/315 (39%), Positives = 176/315 (55%)

Query:    42 SVVEIHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
             S  ++H K WM QH + Y  E E   RL++F  N   I  A+  GN T+KLG NQFSD++
Sbjct:    29 SFEKLHFKSWMVQHQKKYSLE-EYHHRLQVFVSNWRKIN-AHNAGNHTFKLGLNQFSDMS 86

Query:   101 NDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTD-VPTSLDWRDKGA-VTPIKNQKECG 158
              DE R  Y       P +       +    L  T   P S+DWR KG  V+P+KNQ  CG
Sbjct:    87 FDEIRHKYL---WSEPQNCSATKGNY----LRGTGPYPPSMDWRKKGNFVSPVKNQGSCG 139

Query:   159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI 217
              CW F+   A+E    I +G ++ L+EQQL+DC+ N NN GC GG   +AF YI  N+GI
Sbjct:   140 SCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGI 199

Query:   218 ATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSM-QPVSIAIAAYSTE 276
               ED YPY+     C      A A + +   +   DE+A+++AV++  PVS A    + +
Sbjct:   200 MGEDTYPYKGQDDHCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFAFEV-TND 258

Query:   277 FQSYKEGIFNGV-CGT---QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
             F  Y++GI++   C     +++HAV  VG+G  E+G  YW++KNSWG  WG  GY  I R
Sbjct:   259 FLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGE-ENGIPYWIVKNSWGPQWGMNGYFLIER 317

Query:   333 DEGLCGIGTRSSYPL 347
              + +CG+   +SYP+
Sbjct:   318 GKNMCGLAACASYPI 332


>UNIPROTKB|Q3T0I2 [details] [associations]
            symbol:CTSH "Pro-cathepsin H" species:9913 "Bos taurus"
            [GO:0031638 "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISS] [GO:0010815 "bradykinin
            catabolic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0010813 "neuropeptide catabolic
            process" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0045766 "positive regulation of angiogenesis" evidence=ISS]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=ISS] [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0033619 "membrane protein proteolysis" evidence=ISS]
            [GO:0043066 "negative regulation of apoptotic process"
            evidence=ISS] [GO:0004252 "serine-type endopeptidase activity"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0016505 "apoptotic protease activator activity"
            evidence=ISS] [GO:0010952 "positive regulation of peptidase
            activity" evidence=ISS] [GO:0030108 "HLA-A specific activating MHC
            class I receptor activity" evidence=ISS] [GO:0002764 "immune
            response-regulating signaling pathway" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0070324 "thyroid
            hormone binding" evidence=ISS] [GO:0006508 "proteolysis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0097208
            "alveolar lamellar body" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004175
            "endopeptidase activity" evidence=ISS] [GO:0032526 "response to
            retinoic acid" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 EMBL:BC102386 IPI:IPI00693034
            RefSeq:NP_001029557.1 UniGene:Bt.52393 ProteinModelPortal:Q3T0I2
            SMR:Q3T0I2 STRING:Q3T0I2 MEROPS:C01.040 PRIDE:Q3T0I2
            Ensembl:ENSBTAT00000014593 GeneID:510524 KEGG:bta:510524 CTD:1512
            InParanoid:Q3T0I2 OMA:STSCHKT OrthoDB:EOG4W9J43 NextBio:20869490
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 Uniprot:Q3T0I2
        Length = 335

 Score = 557 (201.1 bits), Expect = 7.0e-54, P = 7.0e-54
 Identities = 123/315 (39%), Positives = 173/315 (54%)

Query:    42 SVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
             S+ + H + WM QH + Y  E E   RL+ F  NL  I   N   N T+K+G NQFSD++
Sbjct:    29 SLEKFHFQSWMVQHQKKYSSE-EYYHRLQAFASNLREINAHNAR-NHTFKMGLNQFSDMS 86

Query:   101 NDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTD-VPTSLDWRDKGA-VTPIKNQKECG 158
              DE +  Y       P +       +    L  T   P S+DWR KG  VTP+KNQ  CG
Sbjct:    87 FDELKRKYL---WSEPQNCSATKSNY----LRGTGPYPPSMDWRKKGNFVTPVKNQGSCG 139

Query:   159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI 217
              CW F+   A+E    I +G L  L+EQQL+DC+ N NN GC GG   +AF YI  N+GI
Sbjct:   140 SCWTFSTTGALESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGI 199

Query:   218 ATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTE 276
               ED YPY+   G C      A A + +   +   DE+A+++AV++  PVS A    + +
Sbjct:   200 MGEDTYPYRGQDGDCKYQPSKAIAFVKDVANITLNDEEAMVEAVALHNPVSFAFEV-TAD 258

Query:   277 FQSYKEGIFNGV-CGT---QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
             F  Y++GI++   C     +++HAV  VG+G  E G  YW++KNSWG  WG  GY  I R
Sbjct:   259 FMMYRKGIYSSTSCHKTPDKVNHAVLAVGYGE-EKGIPYWIVKNSWGPNWGMKGYFLIER 317

Query:   333 DEGLCGIGTRSSYPL 347
              + +CG+   +S+P+
Sbjct:   318 GKNMCGLAACASFPI 332


>ZFIN|ZDB-GENE-050208-336 [details] [associations]
            symbol:ctskl "cathepsin K, like" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-050208-336 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:BX465190
            GeneTree:ENSGT00660000095458 IPI:IPI00491185 RefSeq:XP_695425.1
            UniGene:Dr.110795 Ensembl:ENSDART00000062749 GeneID:567046
            KEGG:dre:567046 CTD:567046 NextBio:20888499 Bgee:F1QCP8
            Uniprot:F1QCP8
        Length = 349

 Score = 551 (199.0 bits), Expect = 3.0e-53, P = 3.0e-53
 Identities = 127/337 (37%), Positives = 193/337 (57%)

Query:    21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
             +  LLV    QV S   + E++  E +  W  +H  SY +E E   R  I++ N++ I K
Sbjct:    18 VFALLVWAPVQVASE--SEEEAPTEWN-LWKKKHEISYDEESEDVHRKTIWETNMQKIWK 74

Query:    81 ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVP 137
              N +   G   +K+  N++ DLT+ E++ L  G K+    +R       +   L+   + 
Sbjct:    75 NNNDFSFGLSMFKMAMNKYGDLTSVEYKRLL-GSKIKGTGNRKGKITSAQMLRLNAKRLG 133

Query:   138 -TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
              T++D+R KG VT +K+Q  CG CW+F+   A+EG     +G L+ LSEQQL+DCS + G
Sbjct:   134 VTNIDYRAKGYVTEVKDQGYCGSCWSFSTTGAIEGQMYKHTGRLVSLSEQQLVDCSRSYG 193

Query:   196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVPSGDE 254
               GC G     A+ Y+I N  + + D YPY +V    C   +  A A IS+Y  VP+G+E
Sbjct:   194 TYGCSGAWMANAYDYVINN-ALESSDTYPYTSVDTQPCFYEKNLAMAGISDYRFVPAGNE 252

Query:   255 QALLKAVS-MQPVSIAIAAYSTEFQSYKEGIFN-GVCG-TQLDHAVTIVGFGTTEDGANY 311
             QAL  AV+ + PVS+AI A +  F  Y  GI+    C    L+HAV +VG+G+ E+G +Y
Sbjct:   253 QALADAVATVGPVSVAIDADNPSFLFYSSGIYKESNCNPNNLNHAVLVVGYGS-EEGTDY 311

Query:   312 WLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
             W+IKNSWG  WG+ GYM+++R+ +  CGI + + YP+
Sbjct:   312 WIIKNSWGTGWGEGGYMRMIRNGKNTCGIASYALYPI 348


>UNIPROTKB|E9PSK9 [details] [associations]
            symbol:Ctsql2 "Protein Ctsql2" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            IPI:IPI00562656 Ensembl:ENSRNOT00000045847 RGD:1303225
            ArrayExpress:E9PSK9 Uniprot:E9PSK9
        Length = 342

 Score = 550 (198.7 bits), Expect = 3.8e-53, P = 3.8e-53
 Identities = 125/345 (36%), Positives = 188/345 (54%)

Query:    18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
             M   + L++ C   VVS  S    S+    ++W  ++ + Y  E E+ ++  +++EN++ 
Sbjct:     1 MTAALFLIILCLG-VVSGASAFNLSLDVQWQEWKMKYEKLYSPE-EELLKRVVWEENVKK 58

Query:    78 IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSH-----RXXXXXXFKYQ 129
             IE  N+E   G  TY +  N F+DLT++EF+ + TG  +P  +      +      F   
Sbjct:    59 IELHNRENSLGKNTYIMEINNFADLTDEEFKDMITGITLPINNTMKSLWKRALGSPFPNS 118

Query:   130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
                   +P S+DWR +G VT ++ Q +C  CWAF    A+EG    ++G L  LS Q L+
Sbjct:   119 WYWRDALPKSIDWRKEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLV 178

Query:   190 DCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
             DCS   GN GC GG+   AF Y++QN G+ +E  YPY+   G C    K A AKI+ +  
Sbjct:   179 DCSKPQGNKGCRGGTTYNAFQYVLQNGGLESEATYPYKGKEGLCKYNPKNAYAKITRFVA 238

Query:   249 VPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFG--- 303
             +P  DE  L+ A++ + PV+  I    + F  +  GI++   C  +++HAV +VG+G   
Sbjct:   239 LPE-DEDVLMDALATKGPVAAGIHVVYSYFH-FVSGIYHEPKCNNRVNHAVLVVGYGFEG 296

Query:   304 TTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYPL 347
                DG NYWLIKNSWG  WG  GYMKI +D    CGI T + YP+
Sbjct:   297 NETDGNNYWLIKNSWGKQWGLKGYMKIAKDRNNHCGIATFAQYPI 341


>DICTYBASE|DDB_G0272298 [details] [associations]
            symbol:DDB_G0272298 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0272298 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246
            SMART:SM00848 EMBL:AAFI02000008 KO:K01365 RefSeq:XP_645281.1
            ProteinModelPortal:Q559Q3 MEROPS:C01.A53 EnsemblProtists:DDB0203746
            GeneID:8618447 KEGG:ddi:DDB_G0272298 InParanoid:Q559Q3 OMA:PANINWR
            Uniprot:Q559Q3
        Length = 305

 Score = 549 (198.3 bits), Expect = 4.9e-53, P = 4.9e-53
 Identities = 118/306 (38%), Positives = 172/306 (56%)

Query:    51 MAQHGRSYKDELEKEMRLKIFKENLEYI-EKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
             M ++ + YK+  E   R  IF++N  +I    NK G    ++  N++SDLT  EF   + 
Sbjct:     1 MVKYNKHYKNNKEYLKRFDIFQDNYNFILNHRNKNGENI-EMDLNEYSDLTQKEFADKFF 59

Query:   110 GYKMPSPSH---RXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
                +P P            FK+ N++ T +P S DWRD GAV  +KNQ  C  CW+F+A+
Sbjct:    60 EKLVPEPRSGPINDIKATPFKH-NVNAT-IPKSFDWRDHGAVGKVKNQGSCASCWSFSAL 117

Query:   167 AAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
              A+EG   I+ G L+ LSEQ L+DC+T  G  GC  G    AF YII + G+  E +YPY
Sbjct:   118 GALEGHYYIKYGELLDLSEQNLVDCATPFGPKGCKTGWMHDAFKYIISSGGVNLESQYPY 177

Query:   226 QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI 284
                   C   Q    AK+S +  +P  DE AL++A+++  PV++ I   + EFQ    GI
Sbjct:   178 TGKDEVCKFNQSEKEAKVSGFVMIPKFDESALMEAIALYGPVAVPIDTSTKEFQHLSGGI 237

Query:   285 F-NGVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGT 341
             + +  C      HAV  +G+GT E+G +Y+L+KNSWG +WG  G+ K+ R  +G CGI T
Sbjct:   238 YYSDSCDPWNTIHAVLAIGYGTDENGVDYFLMKNSWGKSWGTNGFFKVKRGVKGKCGIVT 297

Query:   342 RSSYPL 347
              +SYP+
Sbjct:   298 AASYPI 303


>UNIPROTKB|G1M0X4 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9646
            "Ailuropoda melanoleuca" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 EMBL:ACTA01057330 EMBL:ACTA01065330
            Ensembl:ENSAMET00000013529 Uniprot:G1M0X4
        Length = 337

 Score = 549 (198.3 bits), Expect = 4.9e-53, P = 4.9e-53
 Identities = 119/312 (38%), Positives = 175/312 (56%)

Query:    45 EIHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDE 103
             ++H K WM QH + Y  E E + RL+ F  N   I  A+  GN T+K+G NQFSD++   
Sbjct:    34 KVHFKSWMVQHQKKYSSE-EYQHRLRTFVGNWRKIN-AHNAGNHTFKMGLNQFSDMS--- 88

Query:   104 FRALYTGYKMPSPSHRXXXXXXFKYQNLSMTD-VPTSLDWRDKGA-VTPIKNQKECGCCW 161
             F  +   Y    P +       +    L  T   P  +DWR KG  V+P+KNQ  CG CW
Sbjct:    89 FAEIKRKYLWSEPQNCSATKGNY----LRGTGPYPPFVDWRKKGKFVSPVKNQGGCGSCW 144

Query:   162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIATE 220
              F+   A+E    I++G L+ L+EQQL+DC+ + NN GC GG   +AF YI  N+GI  E
Sbjct:   145 TFSTTGALESAIAIKTGKLLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYIRYNRGIMGE 204

Query:   221 DEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSM-QPVSIAIAAYSTEFQS 279
             D YPY+   G C      A A + +   +   DEQA+++AV++  PVS A    + +F  
Sbjct:   205 DSYPYKGQDGDCKFQPSKAIAFVKDVANITINDEQAMVEAVALFNPVSFAFEV-TGDFMM 263

Query:   280 YKEGIFNGV-CGT---QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG 335
             Y++G+++   C     +++HAV  VG+G  ++G  YW++KNSWG  WG  GY  I R + 
Sbjct:   264 YRKGVYSSTSCHKTPDKVNHAVLAVGYGE-QNGVPYWIVKNSWGPQWGMHGYFLIERGKN 322

Query:   336 LCGIGTRSSYPL 347
             +CG+   +SYP+
Sbjct:   323 MCGLAACASYPI 334


>RGD|631421 [details] [associations]
            symbol:Ctsq "cathepsin Q" species:10116 "Rattus norvegicus"
            [GO:0005764 "lysosome" evidence=NAS] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 RGD:631421 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 UniGene:Rn.34875 EMBL:AF187323 IPI:IPI00214897
            PIR:JC7183 RefSeq:NP_640355.1 UniGene:Rn.35820
            ProteinModelPortal:Q9QZE3 SMR:Q9QZE3 STRING:Q9QZE3 MEROPS:C01.039
            PRIDE:Q9QZE3 Ensembl:ENSRNOT00000024208 GeneID:246147
            KEGG:rno:246147 UCSC:RGD:631421 CTD:104002 InParanoid:Q9QZE3
            OMA:ESEDVLM OrthoDB:EOG4HHP48 NextBio:623425 Genevestigator:Q9QZE3
            GermOnline:ENSRNOG00000017946 Uniprot:Q9QZE3
        Length = 343

 Score = 549 (198.3 bits), Expect = 4.9e-53, P = 4.9e-53
 Identities = 118/340 (34%), Positives = 191/340 (56%)

Query:    22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
             + L++ C   VV   S  + S+    ++W  ++ + Y  E E+ ++  +++EN++ IE  
Sbjct:     5 VFLVILCLG-VVPGASALDLSLDVQWQEWKIKYEKLYSPE-EEVLKRVVWEENVKKIELH 62

Query:    82 NKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMP--SPSHRX--XXXXXFKYQNLSMT 134
             N+E   G  TY +  N F+D+T++EF+ +  G+++P  +   R        F   + +  
Sbjct:    63 NRENSLGKNTYTMEINDFADMTDEEFKDMIIGFQLPVHNTEKRLWKRALGSFFPNSWNWR 122

Query:   135 D-VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
             D +P  +DWR++G VT ++ Q  C  CWAF    A+EG    ++G LI LS Q L+DCS 
Sbjct:   123 DALPKFVDWRNEGYVTRVRKQGGCSSCWAFPVTGAIEGQMFKKTGKLIPLSVQNLIDCSK 182

Query:   194 -NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
               GN GCL G+   AF Y++ N G+  E  YPY+   G C    K ++AKI+ +  +P  
Sbjct:   183 PQGNRGCLWGNTYNAFQYVLHNGGLEAEATYPYERKEGVCRYNPKNSSAKITGFVVLPES 242

Query:   253 DEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFG---TTED 307
              E  L+ AV+ + P++  +   S+ F+ Y++G+++   C + ++HAV +VG+G      D
Sbjct:   243 -EDVLMDAVATKGPIATGVHVISSSFRFYQKGVYHEPKCSSYVNHAVLVVGYGFEGNETD 301

Query:   308 GANYWLIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYP 346
             G NYWLIKNSWG  WG  GYMKI +D    C I + + YP
Sbjct:   302 GNNYWLIKNSWGKRWGLRGYMKIAKDRNNHCAIASLAQYP 341


>MGI|MGI:107285 [details] [associations]
            symbol:Ctsh "cathepsin H" species:10090 "Mus musculus"
            [GO:0001520 "outer dense fiber" evidence=ISO] [GO:0001669
            "acrosomal vesicle" evidence=ISO] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=IGI] [GO:0002764 "immune response-regulating
            signaling pathway" evidence=ISO] [GO:0004175 "endopeptidase
            activity" evidence=ISO;IMP] [GO:0004177 "aminopeptidase activity"
            evidence=ISO] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISO;IDA] [GO:0004252 "serine-type endopeptidase activity"
            evidence=IMP] [GO:0005615 "extracellular space" evidence=ISO]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0005829 "cytosol"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=ISO;IMP]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0008284
            "positive regulation of cell proliferation" evidence=IMP]
            [GO:0010628 "positive regulation of gene expression" evidence=ISO]
            [GO:0010634 "positive regulation of epithelial cell migration"
            evidence=IMP] [GO:0010813 "neuropeptide catabolic process"
            evidence=ISO] [GO:0010815 "bradykinin catabolic process"
            evidence=ISO] [GO:0010952 "positive regulation of peptidase
            activity" evidence=IGI;ISO] [GO:0016505 "apoptotic protease
            activator activity" evidence=IGI;ISO] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0030108 "HLA-A specific activating MHC
            class I receptor activity" evidence=ISO] [GO:0030335 "positive
            regulation of cell migration" evidence=ISO] [GO:0030984 "kininogen
            binding" evidence=ISO] [GO:0031638 "zymogen activation"
            evidence=ISO;IMP] [GO:0031648 "protein destabilization"
            evidence=ISO;IMP] [GO:0032403 "protein complex binding"
            evidence=ISO] [GO:0032526 "response to retinoic acid" evidence=IDA]
            [GO:0033619 "membrane protein proteolysis" evidence=ISO;IMP]
            [GO:0035085 "cilium axoneme" evidence=ISO] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IMP] [GO:0043129
            "surfactant homeostasis" evidence=ISO] [GO:0043621 "protein
            self-association" evidence=ISO] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IMP] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=IMP]
            [GO:0070324 "thyroid hormone binding" evidence=ISO] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISO] [GO:0097208 "alveolar
            lamellar body" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:107285 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 EMBL:CH466560 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            HOVERGEN:HBG011513 GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT OrthoDB:EOG4W9J43
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 BRENDA:3.4.22.16
            EMBL:U06119 EMBL:AK149949 EMBL:AK150583 EMBL:AK157376 EMBL:AK160026
            EMBL:Y18464 IPI:IPI00118987 RefSeq:NP_031827.2 UniGene:Mm.2277
            ProteinModelPortal:P49935 SMR:P49935 STRING:P49935 MEROPS:I29.003
            PhosphoSite:P49935 PaxDb:P49935 PRIDE:P49935
            Ensembl:ENSMUST00000034915 GeneID:13036 KEGG:mmu:13036
            InParanoid:Q3UCD6 ChEMBL:CHEMBL1949491 NextBio:282920 Bgee:P49935
            CleanEx:MM_CTSH Genevestigator:P49935 GermOnline:ENSMUSG00000032359
            Uniprot:P49935
        Length = 333

 Score = 548 (198.0 bits), Expect = 6.3e-53, P = 6.3e-53
 Identities = 126/343 (36%), Positives = 193/343 (56%)

Query:    18 MFIIITLLVSCASQ-VVSSRSTHEQSV--VE-IHEK-WMAQHGRSYKDELEKEMRLKIFK 72
             M+  + LL  CA   ++S+ +T E +V  +E  H K WM QH ++Y   +E   RL++F 
Sbjct:     1 MWAALPLL--CAGAWLLSTGATAELTVNAIEKFHFKSWMKQHQKTYSS-VEYNHRLQMFA 57

Query:    73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLS 132
              N   I+ A+ + N T+K+  NQFSD++   F  +   +    P +       +    L 
Sbjct:    58 NNWRKIQ-AHNQRNHTFKMALNQFSDMS---FAEIKHKFLWSEPQNCSATKSNY----LR 109

Query:   133 MTD-VPTSLDWRDKG-AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
              T   P+S+DWR KG  V+P+KNQ  CG CW F+   A+E    I SG ++ L+EQQL+D
Sbjct:   110 GTGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVD 169

Query:   191 CSTNGNN-GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
             C+   NN GC GG   +AF YI+ N+GI  ED YPY     +C    + A A + N   +
Sbjct:   170 CAQAFNNHGCKGGLPSQAFEYILYNKGIMEEDSYPYIGKDSSCRFNPQKAVAFVKNVVNI 229

Query:   250 PSGDEQALLKAVSM-QPVSIAIAAYSTEFQSYKEGIFNGV-CGT---QLDHAVTIVGFGT 304
                DE A+++AV++  PVS A    + +F  YK G+++   C     +++HAV  VG+G 
Sbjct:   230 TLNDEAAMVEAVALYNPVSFAFEV-TEDFLMYKSGVYSSKSCHKTPDKVNHAVLAVGYGE 288

Query:   305 TEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
              ++G  YW++KNSWG+ WG+ GY  I R + +CG+   +SYP+
Sbjct:   289 -QNGLLYWIVKNSWGSQWGENGYFLIERGKNMCGLAACASYPI 330


>UNIPROTKB|F6X9C1 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00660000095458
            OMA:STSCHKT Ensembl:ENSCAFT00000036196 EMBL:AAEX03002388
            Uniprot:F6X9C1
        Length = 305

 Score = 547 (197.6 bits), Expect = 8.0e-53, P = 8.0e-53
 Identities = 121/312 (38%), Positives = 173/312 (55%)

Query:    45 EIHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDE 103
             ++H K W  QH + Y  E E   RL+ F  N   I  A+  GN T+K+G NQFSD+    
Sbjct:     2 KVHFKSWAVQHQKKYSSE-EYLQRLQTFVGNWRKIN-AHNAGNHTFKMGLNQFSDMN--- 56

Query:   104 FRALYTGYKMPSPSHRXXXXXXFKYQNLSMTD-VPTSLDWRDKGA-VTPIKNQKECGCCW 161
             F  +   Y    P +       +    L  T   P  +DWR KG  V+P+KNQ  CG CW
Sbjct:    57 FAEIKHKYLWSEPQNCSATKGNY----LRGTGPYPPFVDWRKKGKFVSPVKNQGSCGSCW 112

Query:   162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIATE 220
              F+   A+E    I+SG L+ L+EQQL+DC+ N NN GC GG+  +AF YI  N+GI  E
Sbjct:   113 TFSTTGALESAIAIKSGKLLSLAEQQLVDCAQNFNNHGCQGGAPLQAFEYIRYNKGIMGE 172

Query:   221 DEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSM-QPVSIAIAAYSTEFQS 279
             D YPY+   G C      A A + +   +   DEQA+++AV++  PVS A    S +F  
Sbjct:   173 DSYPYKGQDGDCKYQPSKAIAFVKDVANITINDEQAMVEAVALYNPVSFAFEVTS-DFMM 231

Query:   280 YKEGIFNGV-CGT---QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG 335
             Y++GI++   C     +++HAV  VG+G  ++G  YW++KNSWG  WG  GY  + R + 
Sbjct:   232 YRKGIYSSTSCHKTPDKVNHAVLAVGYGE-QNGIPYWIVKNSWGPQWGMNGYFLMERGKN 290

Query:   336 LCGIGTRSSYPL 347
             +CG+   +SYP+
Sbjct:   291 MCGLAACASYPI 302


>RGD|1562210 [details] [associations]
            symbol:MGC114246 "similar to cathepsin R" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1562210 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 EMBL:CH474032 MEROPS:C01.042 KO:K09601
            OMA:ASHESFK OrthoDB:EOG4ZCT6D EMBL:BC091563 IPI:IPI00555186
            RefSeq:NP_001017509.1 UniGene:Rn.198321 SMR:Q5BJA0
            Ensembl:ENSRNOT00000061470 GeneID:498688 KEGG:rno:498688
            UCSC:RGD:1562210 InParanoid:Q5BJA0 NextBio:700535
            Genevestigator:Q5BJA0 Uniprot:Q5BJA0
        Length = 334

 Score = 547 (197.6 bits), Expect = 8.0e-53, P = 8.0e-53
 Identities = 121/341 (35%), Positives = 185/341 (54%)

Query:    16 TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
             TP  + I +L  C   V S     + S+    ++W  ++ +SY  E E+E+R  +++ENL
Sbjct:     2 TPA-VFIAIL--CLG-VASGAPILDPSLDAEWQEWKKKYDKSYSLE-EEELRRAVWEENL 56

Query:    76 EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLS 132
             + I+  N E   G   + +  N+F D T +EFR +   +  P  +HR         +  +
Sbjct:    57 KMIKLHNGENGLGKNGFTMEINEFGDTTGEEFRKMMVEF--PVQTHREGKSIM---KRAA 111

Query:   133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +  P  +DWR KG VTP++ Q  C  CWAF+   A+E  T  +SG LI LS Q L+DCS
Sbjct:   112 GSIFPKFVDWRKKGYVTPVRRQGNCNACWAFSVTGAIEAQTIWQSGKLIPLSVQNLVDCS 171

Query:   193 T-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
                GNNGCLGG    AF Y++ N G+ +E  YPY+   G C    K ++A+I+ +  +P 
Sbjct:   172 KPQGNNGCLGGDTYNAFQYVLHNGGLQSEATYPYEGKDGPCRYNPKNSSAEITGFVSLPE 231

Query:   252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFG---TTE 306
              ++  ++   ++ P+S  I A    F+ YK+GI++   C +  + H V +VG+G      
Sbjct:   232 SEDILMVAVATIGPISAGIDASHESFKFYKKGIYHEPNCSSNSVTHGVLVVGYGFKGNDT 291

Query:   307 DGANYWLIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYP 346
              G +YWLIKNSWG  WG  GYMKI +D+   C I + + YP
Sbjct:   292 GGDHYWLIKNSWGKQWGIRGYMKITKDKNNHCAIASYAHYP 332


>UNIPROTKB|F7B939 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458 CTD:1512
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 EMBL:ACFV01158341
            EMBL:ACFV01158342 EMBL:ACFV01158343 RefSeq:XP_002753411.1
            Ensembl:ENSCJAT00000004397 GeneID:100413104 Uniprot:F7B939
        Length = 336

 Score = 546 (197.3 bits), Expect = 1.0e-52, P = 1.0e-52
 Identities = 120/315 (38%), Positives = 172/315 (54%)

Query:    42 SVVEIHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
             S+ + H K WMA+H ++Y  E E   RL+ F  N   I  A+  GN T+K+  NQFSD++
Sbjct:    29 SLEKFHFKSWMAKHHKTYSREEEYHQRLQTFASNWRKIN-AHNNGNHTFKMAVNQFSDMS 87

Query:   101 NDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTD-VPTSLDWRDKGA-VTPIKNQKECG 158
                F  +   Y    P +       +    L  T   P S+DWR KG  V+P+KNQ  CG
Sbjct:    88 ---FAEIKRKYLWSEPQNCSATKSNY----LRGTGPYPPSVDWRKKGHFVSPVKNQGACG 140

Query:   159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI 217
              CW F+   A+E    I +G ++ L+EQQL+DC+ + NN GC GG   +AF YI+ N GI
Sbjct:   141 SCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNNGI 200

Query:   218 ATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSM-QPVSIAIAAYSTE 276
               ED YPYQ     C      A   + +   +   DE A+++AV++  PVS A    + +
Sbjct:   201 MGEDTYPYQGKDSDCKFQPGKAIGFVKDVANITIYDEDAMVEAVALYNPVSFAFEV-TQD 259

Query:   277 FQSYKEGIFNGV-CGT---QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
             F  YK GI++   C     +++HAV  VG+G  E+G  YW++KNSWG  WG  GY  I R
Sbjct:   260 FMMYKRGIYSSTSCHKTPDKVNHAVLAVGYGE-ENGIPYWIVKNSWGPQWGMNGYFLIER 318

Query:   333 DEGLCGIGTRSSYPL 347
              + +CG+   +SYP+
Sbjct:   319 GKNMCGLAACASYPV 333


>UNIPROTKB|F7BRD4 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001656 "metanephros development"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0001656
            GeneTree:ENSGT00660000095458 EMBL:ACFV01158341 EMBL:ACFV01158342
            EMBL:ACFV01158343 Ensembl:ENSCJAT00000004396 Uniprot:F7BRD4
        Length = 336

 Score = 546 (197.3 bits), Expect = 1.0e-52, P = 1.0e-52
 Identities = 122/325 (37%), Positives = 175/325 (53%)

Query:    33 VSSRSTHEQSVVE-IHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
             VS +   +   +E  H K WMA+H ++Y  E E   RL+ F  N   I  A+  GN T+K
Sbjct:    19 VSKKKKKKMLALEKFHFKSWMAKHHKTYSREEEYHQRLQTFASNWRKIN-AHNNGNHTFK 77

Query:    91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTD-VPTSLDWRDKGA-V 148
             +  NQFSD++   F  +   Y    P +       +    L  T   P S+DWR KG  V
Sbjct:    78 MAVNQFSDMS---FAEIKRKYLWSEPQNCSATKSNY----LRGTGPYPPSVDWRKKGHFV 130

Query:   149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKA 207
             +P+KNQ  CG CW F+   A+E    I +G ++ L+EQQL+DC+ + NN GC GG   +A
Sbjct:   131 SPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQA 190

Query:   208 FAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSM-QPV 266
             F YI+ N GI  ED YPYQ     C      A   + +   +   DE A+++AV++  PV
Sbjct:   191 FEYILYNNGIMGEDTYPYQGKDSDCKFQPGKAIGFVKDVANITIYDEDAMVEAVALYNPV 250

Query:   267 SIAIAAYSTEFQSYKEGIFNGV-CGT---QLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
             S A    + +F  YK GI++   C     +++HAV  VG+G  E+G  YW++KNSWG  W
Sbjct:   251 SFAFEV-TQDFMMYKRGIYSSTSCHKTPDKVNHAVLAVGYGE-ENGIPYWIVKNSWGPQW 308

Query:   323 GDAGYMKIVRDEGLCGIGTRSSYPL 347
             G  GY  I R + +CG+   +SYP+
Sbjct:   309 GMNGYFLIERGKNMCGLAACASYPV 333


>UNIPROTKB|D3ZZR3 [details] [associations]
            symbol:D3ZZR3 "Uncharacterized protein" species:10116
            "Rattus norvegicus" [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0016020 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0002250 GeneTree:ENSGT00560000076577 GO:GO:0097067
            OrthoDB:EOG4JM7Q2 IPI:IPI00210228 PRIDE:D3ZZR3
            Ensembl:ENSRNOT00000028732 Uniprot:D3ZZR3
        Length = 331

 Score = 546 (197.3 bits), Expect = 1.0e-52, P = 1.0e-52
 Identities = 121/321 (37%), Positives = 180/321 (56%)

Query:    37 STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGT 93
             +T E+ +    + W   H + YKD+ E+++R  I+++NL++I   N E   G  +Y +G 
Sbjct:    15 ATAERPLDHHWDLWKKTHEKEYKDQNEEDVRRLIWEKNLKFIMLHNLEHSMGMHSYSVGM 74

Query:    94 NQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRD--KGAVTPI 151
             N   D+  +         ++P              QNL     P  + W++  KG    +
Sbjct:    75 NHMGDMVAETIIGEMGSERLPRKRKALGLIPSSVNQNL-----PAGVKWKERTKGCWKNL 129

Query:   152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAF 208
               Q  CG CWAF+AV A+EG  K+++G L+ LS Q L+DCST    GN GC GG   +AF
Sbjct:   130 VFQGSCGSCWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEAF 189

Query:   209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVS 267
              YII N GI +E  YPY+A+   C    K  AA  S Y E+P GDE+AL +AV+ + PVS
Sbjct:   190 QYIIDNGGIDSEASYPYKAMDEKCHYDPKNRAATCSRYIELPFGDEEALKEAVATKGPVS 249

Query:   268 IAIAAYSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAG 326
             + I A  + F  Y+ G+++   C   ++H V +VG+GT  DG +YWL+KNSWG  +GD G
Sbjct:   250 VGIDASHSSFFLYQSGVYDDPSCTENVNHGVLVVGYGTL-DGKDYWLVKNSWGLHFGDQG 308

Query:   327 YMKIVRD-EGLCGIGTRSSYP 346
             Y+++ R+ +  CGI +  SYP
Sbjct:   309 YIRMARNNKNHCGIASYCSYP 329


>DICTYBASE|DDB_G0281605 [details] [associations]
            symbol:cfaD "peptidase C1A family protein"
            species:44689 "Dictyostelium discoideum" [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IDA] [GO:0006508 "proteolysis" evidence=IDA] [GO:0031410
            "cytoplasmic vesicle" evidence=IDA] [GO:0031288 "sorocarp
            morphogenesis" evidence=IMP] [GO:0008285 "negative regulation of
            cell proliferation" evidence=IGI;IDA] [GO:0005576 "extracellular
            region" evidence=IEA;IDA] [GO:0005515 "protein binding"
            evidence=IPI] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0281605
            GO:GO:0008285 GO:GO:0005615 GenomeReviews:CM000152_GR
            eggNOG:COG4870 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0031410 EMBL:AAFI02000042
            GO:GO:0031288 RefSeq:XP_640530.1 HSSP:P07711
            ProteinModelPortal:Q54TR1 STRING:Q54TR1 PRIDE:Q54TR1
            EnsemblProtists:DDB0229857 GeneID:8623140 KEGG:ddi:DDB_G0281605
            InParanoid:Q54TR1 OMA:PSAHEHE ProtClustDB:CLSZ2430523
            Uniprot:Q54TR1
        Length = 531

 Score = 544 (196.6 bits), Expect = 1.7e-52, P = 1.7e-52
 Identities = 113/316 (35%), Positives = 177/316 (56%)

Query:    40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
             E+    + +++ AQ+ + Y  + E + R   FK   + I   N + + +YKLG N ++DL
Sbjct:   218 EEQASNLFKEYKAQYNKEYSSQDEHDERFINFKAARKIIATHNAKES-SYKLGMNHYADL 276

Query:   100 TNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
             +N EF  L    K+  PS          + + S+  +P+++DWR++  VTP+K+Q  CG 
Sbjct:   277 SNKEFNTLVKP-KVARPSVTGADSV---HDDESLRSIPSTVDWRNQNCVTPVKDQGICGS 332

Query:   160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIA 218
             CW F +  ++EG   + +G L+ LSEQQL+DC+   G+ GC GG    AF Y+++   +A
Sbjct:   333 CWTFGSTGSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSLA 392

Query:   219 TEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTE 276
             TE  YPY    G C      P+   I+ Y  V SG E AL  A++   PV+IAI A   +
Sbjct:   393 TESNYPYLMQNGLCRDRTVTPSGVSITGYVNVTSGSESALQNAIATTGPVAIAIDASVDD 452

Query:   277 FQSYKEGIFNG-VCGT---QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
             F+ Y  G++N   C      LDH V  +G+GT + G +Y+L+KNSW   WG  GY+ + R
Sbjct:   453 FRYYMSGVYNNPACKNGLDDLDHEVLAIGYGTYQ-GQDYFLVKNSWSTNWGMDGYVYMAR 511

Query:   333 -DEGLCGIGTRSSYPL 347
              D  LCG+ ++++YP+
Sbjct:   512 NDNNLCGVSSQATYPI 527


>UNIPROTKB|F6R7P5 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9544 "Macaca
            mulatta" [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=ISS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0004252
            "serine-type endopeptidase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005829 "cytosol" evidence=ISS] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0010815 "bradykinin catabolic
            process" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0016505 "apoptotic protease
            activator activity" evidence=ISS] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=ISS] [GO:0030335
            "positive regulation of cell migration" evidence=ISS] [GO:0031638
            "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0032526 "response to retinoic
            acid" evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            GO:GO:0043066 GO:GO:0005615 GO:GO:0008284 GO:GO:0070371
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458 CTD:1512
            OMA:STSCHKT GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 RefSeq:XP_001108862.1
            UniGene:Mmu.3000 Ensembl:ENSMMUT00000014095 GeneID:711437
            KEGG:mcc:711437 NextBio:19969972 Uniprot:F6R7P5
        Length = 335

 Score = 543 (196.2 bits), Expect = 2.1e-52, P = 2.1e-52
 Identities = 124/343 (36%), Positives = 186/343 (54%)

Query:    18 MFIIITLLVSCA----SQVVSSRSTHEQSVVEIHEK-WMAQHGRSYKDELEKEMRLKIFK 72
             M++ + LL + A    + V  +      S+ + H K WM++H ++Y  E E   R++ F 
Sbjct:     1 MWVTLPLLCAGAWLLGAPVCGAAELSVNSLEKFHFKSWMSKHHKTYSTE-EYHHRMQTFA 59

Query:    73 ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLS 132
              N   I  A+  GN T+K+  NQFSD++   F  +   Y    P +       +    L 
Sbjct:    60 SNWRKIN-AHNNGNHTFKMALNQFSDMS---FAEIKHKYLWSEPQNCSATKSNY----LR 111

Query:   133 MTD-VPTSLDWRDKGA-VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
              T   P S+DWR KG  V+P+KNQ  CG CW F+   A+E    I +G ++ L+EQQL+D
Sbjct:   112 GTGPYPPSMDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVD 171

Query:   191 CSTNGNN-GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
             C+ + NN GC GG   +AF YI+ N+GI  ED YPYQ   G C      A   + +   +
Sbjct:   172 CAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGDCKFRPGKAIGFVKDVANI 231

Query:   250 PSGDEQALLKAVSM-QPVSIAIAAYSTEFQSYKEGIFNGV-CGT---QLDHAVTIVGFGT 304
                DE+A+++AV++  PVS A    + +F  YK GI++   C     +++HAV  VG+G 
Sbjct:   232 TIYDEEAMVEAVALYNPVSFAFEV-TQDFMIYKTGIYSSTSCHKTPDKVNHAVLAVGYGE 290

Query:   305 TEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
              E+G  YW++KNSWG  WG  GY  I R + +CG+   +SYP+
Sbjct:   291 -ENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPI 332


>UNIPROTKB|F7BJD8 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9796 "Equus
            caballus" [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=ISS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0004252
            "serine-type endopeptidase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005829 "cytosol" evidence=ISS] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0010815 "bradykinin catabolic
            process" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0016505 "apoptotic protease
            activator activity" evidence=ISS] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=ISS] [GO:0030335
            "positive regulation of cell migration" evidence=ISS] [GO:0031638
            "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0032526 "response to retinoic
            acid" evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            GO:GO:0043066 GO:GO:0005615 GO:GO:0008284 GO:GO:0070371
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458
            OMA:STSCHKT GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129
            Ensembl:ENSECAT00000013967 Uniprot:F7BJD8
        Length = 305

 Score = 542 (195.9 bits), Expect = 2.7e-52, P = 2.7e-52
 Identities = 120/311 (38%), Positives = 169/311 (54%)

Query:    45 EIHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDE 103
             + H K WM QH + Y  E E   RL+ F  N   I  A+  GN T+++G NQFS +    
Sbjct:     2 KFHFKSWMVQHQKKYSSE-EYHHRLQTFVSNWRKIN-AHNTGNHTFRMGLNQFSAMN--- 56

Query:   104 FRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGA-VTPIKNQKECGCCWA 162
             F  L   Y    P +       +          P S+DWR KG  V+P+KNQ  CG CW 
Sbjct:    57 FAELKHKYLWSEPQNCSATKGNYLR---GAGPYPPSVDWRKKGNFVSPVKNQGGCGSCWT 113

Query:   163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIATED 221
             F+   A+E    I SG L+ L+EQQL+DC+ N NN GC GG   +AF YI  N+GI  ED
Sbjct:   114 FSTTGALESAVAIASGKLLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGED 173

Query:   222 EYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSM-QPVSIAIAAYSTEFQSY 280
              YPY+   G C      A A + +   +   DE+A+++AV++  PVS A    + +F  Y
Sbjct:   174 TYPYKGQDGDCKFQPNKAIAFVKDVANITLNDEKAMVEAVALYNPVSFAFEV-TEDFMMY 232

Query:   281 KEGIFNGV-CGT---QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGL 336
             ++GI++   C     +++HAV  VG+G  E+G  YW++KNSWG  WG  GY  I R + +
Sbjct:   233 RKGIYSSTSCHKTPDKVNHAVLAVGYGE-ENGIPYWIVKNSWGPHWGMNGYFLIERGKNM 291

Query:   337 CGIGTRSSYPL 347
             CG+   +SYP+
Sbjct:   292 CGLAACASYPI 302


>UNIPROTKB|Q24940 [details] [associations]
            symbol:Cat-1 "Cathepsin L-like proteinase" species:6192
            "Fasciola hepatica" [GO:0004175 "endopeptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005576 "extracellular region" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005576 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0004197 EMBL:L33771 PIR:S43991 PDB:2O6X
            PDBsum:2O6X ProteinModelPortal:Q24940 SMR:Q24940 MEROPS:C01.033
            EvolutionaryTrace:Q24940 Uniprot:Q24940
        Length = 326

 Score = 540 (195.1 bits), Expect = 4.4e-52, P = 4.4e-52
 Identities = 115/311 (36%), Positives = 177/311 (56%)

Query:    45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTN 101
             ++  +W   + + Y    + + R  I+++N+++I++ N     G  TY LG NQF+D+T 
Sbjct:    19 DLWHQWKRMYNKEYNGA-DDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTF 77

Query:   102 DEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
             +EF+A Y   +M   S        ++  N +   VP  +DWR+ G VT +K+Q  CG CW
Sbjct:    78 EEFKAKYLT-EMSRASDILSHGVPYEANNRA---VPDKIDWRESGYVTEVKDQGNCGSCW 133

Query:   162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATE 220
             AF+    +EG         I  SEQQL+DCS   GNNGC GG  E A+ Y+ Q  G+ TE
Sbjct:   134 AFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQF-GLETE 192

Query:   221 DEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQS 279
               YPY AV G C   ++   AK++ Y  V SG E  L   V + +P ++A+   S +F  
Sbjct:   193 SSYPYTAVEGQCRYNKQLGVAKVTGYYTVHSGSEVELKNLVGARRPAAVAVDVES-DFMM 251

Query:   280 YKEGIFNG-VCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-L 336
             Y+ GI+    C   +++HAV  VG+GT + G +YW++KNSWG  WG+ GY+++ R+ G +
Sbjct:   252 YRSGIYQSQTCSPLRVNHAVLAVGYGT-QGGTDYWIVKNSWGTYWGERGYIRMARNRGNM 310

Query:   337 CGIGTRSSYPL 347
             CGI + +S P+
Sbjct:   311 CGIASLASLPM 321


>UNIPROTKB|G1RBY1 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:61853
            "Nomascus leucogenys" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 EMBL:ADFV01087552 RefSeq:XP_003275518.1
            Ensembl:ENSNLET00000011249 GeneID:100584322 Uniprot:G1RBY1
        Length = 335

 Score = 539 (194.8 bits), Expect = 5.6e-52, P = 5.6e-52
 Identities = 118/315 (37%), Positives = 176/315 (55%)

Query:    42 SVVEIHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
             S+ + H K WM++H ++Y  E E   RL++F  N   I  A+  GN T+K+  NQFSD++
Sbjct:    29 SLEKFHFKSWMSKHHKTYSTE-EYHHRLQMFASNWRKIN-AHNNGNHTFKMALNQFSDMS 86

Query:   101 NDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTD-VPTSLDWRDKGA-VTPIKNQKECG 158
                F  +   Y    P +       +    L  T   P S+DWR KG  V+P+KNQ  CG
Sbjct:    87 ---FAEIKHKYLWSEPQNCSATKSNY----LRGTGPYPPSMDWRKKGNFVSPVKNQGACG 139

Query:   159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI 217
              CW F+   A+E    I +G ++ L+EQQL+DC+ + NN GC GG   +AF YI+ N+GI
Sbjct:   140 SCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGI 199

Query:   218 ATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSM-QPVSIAIAAYSTE 276
               ED YPYQ   G C      A   + +   +   DE+A+++AV++  PVS A    + +
Sbjct:   200 MGEDTYPYQGKDGYCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEV-TQD 258

Query:   277 FQSYKEGIFNGV-CGT---QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
             F  Y+ GI++   C     +++HAV  VG+G  ++G  YW++KNSWG  WG  GY  I R
Sbjct:   259 FMMYRRGIYSSTSCHKTPDKVNHAVLAVGYGE-KNGIPYWIVKNSWGPQWGMNGYFLIER 317

Query:   333 DEGLCGIGTRSSYPL 347
              + +CG+   +SYP+
Sbjct:   318 GKNMCGLAACASYPI 332


>UNIPROTKB|P09668 [details] [associations]
            symbol:CTSH "Pro-cathepsin H" species:9606 "Homo sapiens"
            [GO:0001520 "outer dense fiber" evidence=IEA] [GO:0001669
            "acrosomal vesicle" evidence=IEA] [GO:0007283 "spermatogenesis"
            evidence=IEA] [GO:0030984 "kininogen binding" evidence=IEA]
            [GO:0032403 "protein complex binding" evidence=IEA] [GO:0043621
            "protein self-association" evidence=IEA] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0031648 "protein destabilization"
            evidence=IMP] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=ISS] [GO:0032526 "response to retinoic acid"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0030108 "HLA-A
            specific activating MHC class I receptor activity" evidence=IDA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEP] [GO:0010813 "neuropeptide catabolic process"
            evidence=IDA] [GO:0010815 "bradykinin catabolic process"
            evidence=IDA] [GO:0030335 "positive regulation of cell migration"
            evidence=IDA] [GO:0070371 "ERK1 and ERK2 cascade" evidence=IDA]
            [GO:0010628 "positive regulation of gene expression" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IDA;TAS] [GO:0031638 "zymogen
            activation" evidence=IDA] [GO:0016505 "apoptotic protease activator
            activity" evidence=IDA] [GO:0010952 "positive regulation of
            peptidase activity" evidence=IDA] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=ISS] [GO:0004252 "serine-type endopeptidase
            activity" evidence=ISS] [GO:0043066 "negative regulation of
            apoptotic process" evidence=ISS] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0033619 "membrane protein proteolysis"
            evidence=IDA] [GO:0004175 "endopeptidase activity" evidence=IDA]
            [GO:0004177 "aminopeptidase activity" evidence=IDA] [GO:0005764
            "lysosome" evidence=IDA] [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0002250 "adaptive immune response" evidence=IEP]
            [GO:0019882 "antigen processing and presentation" evidence=TAS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0070324 "thyroid hormone binding" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0008284
            "positive regulation of cell proliferation" evidence=ISS]
            [GO:0045766 "positive regulation of angiogenesis" evidence=ISS]
            [GO:0008233 "peptidase activity" evidence=IDA] [GO:0097208
            "alveolar lamellar body" evidence=IDA] [GO:0043129 "surfactant
            homeostasis" evidence=IDA] [GO:0004197 "cysteine-type endopeptidase
            activity" evidence=IDA;TAS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 Reactome:REACT_6900 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913 MEROPS:C01.040 CTD:1512
            OMA:STSCHKT OrthoDB:EOG4W9J43 GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 EMBL:X16832 EMBL:AF426247 EMBL:AK314698 EMBL:AC011944
            EMBL:BC002479 EMBL:X07549 IPI:IPI00297487 PIR:S12486
            RefSeq:NP_004381.2 UniGene:Hs.148641 PDB:1BZN PDBsum:1BZN
            ProteinModelPortal:P09668 SMR:P09668 IntAct:P09668 STRING:P09668
            PhosphoSite:P09668 DMDM:288558851 PaxDb:P09668 PRIDE:P09668
            DNASU:1512 Ensembl:ENST00000220166 GeneID:1512 KEGG:hsa:1512
            UCSC:uc021srk.1 GeneCards:GC15M079213 H-InvDB:HIX0012481
            HGNC:HGNC:2535 HPA:CAB000458 HPA:HPA003524 MIM:116820
            neXtProt:NX_P09668 PharmGKB:PA27033 InParanoid:P09668
            PhylomeDB:P09668 BRENDA:3.4.22.16 ChEMBL:CHEMBL2225 GenomeRNAi:1512
            NextBio:6261 ArrayExpress:P09668 Bgee:P09668 CleanEx:HS_CTSH
            Genevestigator:P09668 GermOnline:ENSG00000103811 GO:GO:0019882
            Uniprot:P09668
        Length = 335

 Score = 535 (193.4 bits), Expect = 1.5e-51, P = 1.5e-51
 Identities = 118/315 (37%), Positives = 175/315 (55%)

Query:    42 SVVEIHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
             S+ + H K WM++H ++Y  E E   RL+ F  N   I  A+  GN T+K+  NQFSD++
Sbjct:    29 SLEKFHFKSWMSKHRKTYSTE-EYHHRLQTFASNWRKIN-AHNNGNHTFKMALNQFSDMS 86

Query:   101 NDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTD-VPTSLDWRDKGA-VTPIKNQKECG 158
                F  +   Y    P +       +    L  T   P S+DWR KG  V+P+KNQ  CG
Sbjct:    87 ---FAEIKHKYLWSEPQNCSATKSNY----LRGTGPYPPSVDWRKKGNFVSPVKNQGACG 139

Query:   159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI 217
              CW F+   A+E    I +G ++ L+EQQL+DC+ + NN GC GG   +AF YI+ N+GI
Sbjct:   140 SCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGI 199

Query:   218 ATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSM-QPVSIAIAAYSTE 276
               ED YPYQ   G C      A   + +   +   DE+A+++AV++  PVS A    + +
Sbjct:   200 MGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEV-TQD 258

Query:   277 FQSYKEGIFNGV-CGT---QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
             F  Y+ GI++   C     +++HAV  VG+G  ++G  YW++KNSWG  WG  GY  I R
Sbjct:   259 FMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGE-KNGIPYWIVKNSWGPQWGMNGYFLIER 317

Query:   333 DEGLCGIGTRSSYPL 347
              + +CG+   +SYP+
Sbjct:   318 GKNMCGLAACASYPI 332


>FB|FBgn0034229 [details] [associations]
            symbol:CG4847 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0032504
            "multicellular organism reproduction" evidence=IEP] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0005615 "extracellular space"
            evidence=ISM;IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:AE013599 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 GO:GO:0032504 GeneTree:ENSGT00560000076599
            KO:K01371 EMBL:BT099507 RefSeq:NP_725686.1 UniGene:Dm.4677
            SMR:A1ZAU4 IntAct:A1ZAU4 MEROPS:C01.A28 EnsemblMetazoa:FBtr0086935
            GeneID:36973 KEGG:dme:Dmel_CG4847 UCSC:CG4847-RB
            FlyBase:FBgn0034229 InParanoid:A1ZAU4 OMA:GGFQEYA OrthoDB:EOG4J9KFC
            ChiTaRS:CG4847 GenomeRNAi:36973 NextBio:801302 Uniprot:A1ZAU4
        Length = 420

 Score = 533 (192.7 bits), Expect = 2.4e-51, P = 2.4e-51
 Identities = 111/308 (36%), Positives = 169/308 (54%)

Query:    50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDLTNDEFRA 106
             +++Q G++Y    ++ +    F      +E  N    +G  T+K   N F+DLT+ EF +
Sbjct:   115 FLSQSGKTYLSAADRALHEGAFASTKNLVEAGNAAFAQGVHTFKQAVNAFADLTHSEFLS 174

Query:   107 LYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
               TG K  SP  +       K  NL    +P + DWR+ G VTP+K Q  CG CWAFA  
Sbjct:   175 QLTGLKR-SPEAKARAAASLKLVNLPAKPIPDAFDWREHGGVTPVKFQGTCGSCWAFATT 233

Query:   167 AAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAFAYIIQNQ-GIATEDE 222
              A+EG T  ++G+L  LSEQ L+DC      G NGC GG +E AF +I + Q G++ E  
Sbjct:   234 GAIEGHTFRKTGSLPNLSEQNLVDCGPVEDFGLNGCDGGFQEAAFCFIDEVQKGVSQEGA 293

Query:   223 YPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYK 281
             YPY    GTC      + A +  +  +P  DE+ L K V+ + PV+ ++    T  ++Y 
Sbjct:   294 YPYIDNKGTCKYDGSKSGATLQGFAAIPPKDEEQLKKVVATLGPVACSVNGLET-LKNYA 352

Query:   282 EGIFNG-VCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
              GI+N   C   + +H++ +VG+G+ E G +YW++KNSW +TWG+ GY ++ R +  C I
Sbjct:   353 GGIYNDDECNKGEPNHSILVVGYGS-EKGQDYWIVKNSWDDTWGEKGYFRLPRGKNYCFI 411

Query:   340 GTRSSYPL 347
                 SYP+
Sbjct:   412 AEECSYPV 419


>UNIPROTKB|G3SSC1 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9785
            "Loxodonta africana" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 RefSeq:XP_003413898.1
            Ensembl:ENSLAFT00000003415 GeneID:100662496 Uniprot:G3SSC1
        Length = 335

 Score = 532 (192.3 bits), Expect = 3.1e-51, P = 3.1e-51
 Identities = 119/315 (37%), Positives = 168/315 (53%)

Query:    42 SVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
             S  + H + WMAQH + Y  E E   R + F  N   I   N   N T+K+  NQFSD+T
Sbjct:    29 SYEKFHFQSWMAQHQKKYSSE-EYHQRQQTFVSNWRKINAHNAR-NHTFKMALNQFSDMT 86

Query:   101 NDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTD-VPTSLDWRDKGA-VTPIKNQKECG 158
                F  +   Y    P +       +    L  T   P  +DWR KG  V+P+KNQ  CG
Sbjct:    87 ---FAEIKQKYLWSEPQNCSATKGNY----LRGTGPYPPFVDWRKKGHFVSPVKNQGACG 139

Query:   159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI 217
              CW F+   A+E    I  G L+ L+EQQL+DC+ + NN GC GG   +AF YI+ N+GI
Sbjct:   140 SCWTFSTTGALESAIAIAGGKLLSLAEQQLVDCAKDFNNHGCQGGLPSQAFEYILYNKGI 199

Query:   218 ATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSM-QPVSIAIAAYSTE 276
               ED YPY+     C    K A A + +   +   DE+A+++AV++  PVS A    + +
Sbjct:   200 MGEDTYPYKGQDDVCKFQPKKAIAFVKDVANITLNDEEAMVEAVALYNPVSFAFEV-TDD 258

Query:   277 FQSYKEGIFNGV-CGT---QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
             F  Y +GI++   C     +++HAV  VG+G  E G  YW++KNSWG  WG  GY  I R
Sbjct:   259 FMKYSKGIYSSTSCHKTPDKVNHAVLAVGYGE-EKGIPYWIVKNSWGPYWGMDGYFLIER 317

Query:   333 DEGLCGIGTRSSYPL 347
              + +CG+   +SYP+
Sbjct:   318 GKNMCGLAACASYPI 332


>UNIPROTKB|H9KYW5 [details] [associations]
            symbol:CTSS "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0002250 "adaptive immune response" evidence=IEA]
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 OMA:YEPACTQ EMBL:AADN02010496
            Ensembl:ENSGALT00000001122 Uniprot:H9KYW5
        Length = 245

 Score = 530 (191.6 bits), Expect = 5.1e-51, P = 5.1e-51
 Identities = 108/251 (43%), Positives = 152/251 (60%)

Query:   100 TNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
             T+++  AL TG ++PS  H         Y+       P ++DWR+KG VT +KNQ  CG 
Sbjct:     1 TSEDVAALLTGLRVPS-GHNQTST----YRRRG--GAPDAMDWREKGCVTEVKNQGACGA 53

Query:   160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIA 218
             CWAF+AV A+E   K+++G L+ LS Q L+DCS   GN GC GG   +AF YII N GI 
Sbjct:    54 CWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSMMYGNKGCGGGFMTRAFQYIIDNNGID 113

Query:   219 TEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEF 277
             +E+ YPY A  GTC       AA  S Y E+P  DE AL  AV+ + PVS+AI A    F
Sbjct:   114 SEESYPYMAQNGTCQYNVSTRAATCSKYVELPYADEAALKDAVANVGPVSVAIDATQPTF 173

Query:   278 QSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGL 336
               Y+ G+++   C  +++H V +VG+GT  +  ++WL+KNSWG  +GD GY+++ R+   
Sbjct:   174 FLYRSGVYDDPRCTQEVNHGVLVVGYGTLNE-KDFWLVKNSWGERFGDGGYIRMSRNHAN 232

Query:   337 -CGIGTRSSYP 346
              CGI + +SYP
Sbjct:   233 HCGIASYASYP 243


>UNIPROTKB|G3R9A7 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9595 "Gorilla
            gorilla gorilla" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 OMA:STSCHKT GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 RefSeq:XP_004056662.1 Ensembl:ENSGGOT00000012331
            GeneID:101144312 Uniprot:G3R9A7
        Length = 335

 Score = 530 (191.6 bits), Expect = 5.1e-51, P = 5.1e-51
 Identities = 115/306 (37%), Positives = 170/306 (55%)

Query:    50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
             WM++H ++Y  E E   RL+ F  N   I  A+  GN T+K+  NQFSD++   F  +  
Sbjct:    38 WMSKHRKTYSTE-EYHHRLQTFASNWRKIN-AHNNGNHTFKMALNQFSDMS---FAEIKH 92

Query:   110 GYKMPSPSHRXXXXXXFKYQNLSMTD-VPTSLDWRDKGA-VTPIKNQKECGCCWAFAAVA 167
              Y    P +       +    L  T   P S+DWR KG  V+P+KNQ  CG CW F+   
Sbjct:    93 KYLWSEPQNCSATKSNY----LRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTG 148

Query:   168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIATEDEYPYQ 226
             A+E    I +G ++ L+EQQL+DC+ + NN GC GG   +AF YI+ N+GI  ED YPYQ
Sbjct:   149 ALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQ 208

Query:   227 AVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSM-QPVSIAIAAYSTEFQSYKEGIF 285
                G C      A   + +   +   DE+A+++AV++  PVS A    + +F  Y+ GI+
Sbjct:   209 GKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEV-TQDFMMYRTGIY 267

Query:   286 NGV-CGT---QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGT 341
             +   C     +++HAV  VG+G  ++G  YW++KNSWG  WG  GY  I R + +CG+  
Sbjct:   268 SSTSCHKTPDKVNHAVLAVGYGE-KNGIPYWIVKNSWGPKWGMNGYFLIERGKNMCGLAA 326

Query:   342 RSSYPL 347
              +SYP+
Sbjct:   327 CASYPI 332


>TAIR|locus:2175088 [details] [associations]
            symbol:ALP "aleurain-like protease" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0005829 "cytosol"
            evidence=RCA] [GO:0006096 "glycolysis" evidence=RCA] [GO:0006816
            "calcium ion transport" evidence=RCA] [GO:0006833 "water transport"
            evidence=RCA] [GO:0006972 "hyperosmotic response" evidence=RCA]
            [GO:0007030 "Golgi organization" evidence=RCA] [GO:0009266
            "response to temperature stimulus" evidence=RCA] [GO:0009651
            "response to salt stress" evidence=RCA] [GO:0009750 "response to
            fructose stimulus" evidence=RCA] [GO:0042744 "hydrogen peroxide
            catabolic process" evidence=RCA] [GO:0046686 "response to cadmium
            ion" evidence=RCA] [GO:0007568 "aging" evidence=IEP]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002688 GO:GO:0005773
            GO:GO:0007568 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AB011483 KO:K01366
            ProtClustDB:CLSN2689015 UniGene:At.25414 IPI:IPI00846287
            RefSeq:NP_001078774.1 ProteinModelPortal:A8MQZ1 SMR:A8MQZ1
            STRING:A8MQZ1 PRIDE:A8MQZ1 EnsemblPlants:AT5G60360.3 GeneID:836158
            KEGG:ath:AT5G60360 OMA:CGSTPMD Genevestigator:A8MQZ1 Uniprot:A8MQZ1
        Length = 361

 Score = 530 (191.6 bits), Expect = 5.1e-51, P = 5.1e-51
 Identities = 118/292 (40%), Positives = 162/292 (55%)

Query:    53 QHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK 112
             ++G+ Y++  E ++R  IFKENL+ I   NK+G  +YKLG NQF+DLT  EF+    G  
Sbjct:    65 RYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKG-LSYKLGVNQFADLTWQEFQRTKLG-- 121

Query:   113 MPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGI 172
               + +         K    ++   P + DWR+ G V+P+K+Q  CG CW F+   A+E  
Sbjct:   122 -AAQNCSATLKGSHKVTEAAL---PETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAA 177

Query:   173 TKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT 231
                  G  I LSEQQL+DC+   NN GC GG   +AF YI  N G+ TE  YPY     T
Sbjct:   178 YHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKDET 237

Query:   232 CSAAQKPAAAKISNYEEVPSGDEQALLKAVSM-QPVSIAIAAYSTEFQSYKEGIF-NGVC 289
             C  + +    ++ N   +  G E  L  AV + +PVSIA     + F+ YK G++ +  C
Sbjct:   238 CKFSAENVGVQVLNSVNITLGAEDELKHAVGLVRPVSIAFEVIHS-FRLYKSGVYTDSHC 296

Query:   290 G-TQLD--HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCG 338
             G T +D  HAV  VG+G  EDG  YWLIKNSWG  WGD GY K+   + +CG
Sbjct:   297 GSTPMDVNHAVLAVGYGV-EDGVPYWLIKNSWGADWGDKGYFKMEMGKNMCG 347


>UNIPROTKB|F1NEC8 [details] [associations]
            symbol:CTSL2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:AADN02067812 IPI:IPI00820956 Ensembl:ENSGALT00000037988
            ArrayExpress:F1NEC8 Uniprot:F1NEC8
        Length = 218

 Score = 525 (189.9 bits), Expect = 1.7e-50, P = 1.7e-50
 Identities = 103/217 (47%), Positives = 142/217 (65%)

Query:   137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST-NG 195
             P S+DWR+KG VTP+K+Q +CG CWAF+   A+EG    ++G L+ LSEQ L+DCS   G
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEG 61

Query:   196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVPSGDE 254
             N GC GG  ++AF Y+  N GI +E+ YPY A     C    +  AA  + + ++P G E
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHE 121

Query:   255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFGTTEDGANY 311
             +AL+KAV S+ PVS+AI A  + FQ Y+ GI+    C ++ LDH V +VG+G  EDG  Y
Sbjct:   122 RALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGF-EDGKKY 180

Query:   312 WLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
             W++KNSWG  WGD GY+ + +D +  CGI T +SYPL
Sbjct:   181 WIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPL 217


>UNIPROTKB|G3V9F8 [details] [associations]
            symbol:Ctsm "RCG24133" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015645 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 EMBL:CH474032
            PANTHER:PTHR12411:SF58 Ensembl:ENSRNOT00000045830 RGD:631420
            Uniprot:G3V9F8
        Length = 333

 Score = 521 (188.5 bits), Expect = 4.6e-50, P = 4.6e-50
 Identities = 118/336 (35%), Positives = 187/336 (55%)

Query:    22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
             + L + C  +++SS    +  +    +KW  ++ ++Y  E E + R  +++EN++ I+  
Sbjct:     5 VFLAILCL-RLISSSPAPDPVLDAEWQKWKIKYEKTYSLEEEGQKRA-VWEENMKKIKLH 62

Query:    82 NKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPT 138
             N E   G   + +  N F D+T +EFR L    ++P P+ +         Q     +VP 
Sbjct:    63 NGENGLGKHGFTMEMNAFGDMTIEEFRKLMI--EIPIPTVKKENSV----QKRQAVNVPN 116

Query:   139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNN 197
              ++WR +G VTP++ Q  C  CWAF+   A+EG    ++G LI LS Q L+DCS   GN 
Sbjct:   117 FINWRKRGYVTPVRRQGRCNVCWAFSVAGAIEGQMFQKTGQLIPLSVQNLVDCSRPQGNL 176

Query:   198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
             GC  G+   A  Y+ +N G+ +E  YPY+   G+C      + A I+++E VP  +E AL
Sbjct:   177 GCYLGNTYLALQYVKENGGLESEATYPYEEKEGSCRYHPDNSTASITDFEFVPK-NEDAL 235

Query:   258 LKAVS-MQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQL-DHAVTIVGFGTT---EDGANY 311
             + AV+ + P+S+AI A    F  Y+ GI++   C + +  HA+ +VG+G      DG  Y
Sbjct:   236 MNAVATLGPISVAIDARHESFLFYRNGIYHEPNCSSSVVTHAMLLVGYGFVGEESDGRKY 295

Query:   312 WLIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYP 346
             W++KNS GN WG+ GYMKI +D+G  CGI T + YP
Sbjct:   296 WILKNSMGNKWGNRGYMKIAKDQGNHCGIATYALYP 331


>MGI|MGI:1927229 [details] [associations]
            symbol:Ctsm "cathepsin M" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008152 "metabolic process" evidence=ISS] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015645 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1927229 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF202528
            EMBL:AY014777 EMBL:AY057446 EMBL:AK005550 EMBL:AK005428
            IPI:IPI00131133 RefSeq:NP_071721.2 UniGene:Mm.279933
            ProteinModelPortal:Q9JL96 SMR:Q9JL96 STRING:Q9JL96 MEROPS:C01.023
            PRIDE:Q9JL96 DNASU:64139 Ensembl:ENSMUST00000099451 GeneID:64139
            KEGG:mmu:64139 UCSC:uc007qwj.1 CTD:64139 InParanoid:Q9JL96
            KO:K09600 OrthoDB:EOG4TTGKR NextBio:319931 Bgee:Q9JL96
            CleanEx:MM_CTSM Genevestigator:Q9JL96 GermOnline:ENSMUSG00000074484
            GermOnline:ENSMUSG00000074871 PANTHER:PTHR12411:SF58 Uniprot:Q9JL96
        Length = 333

 Score = 520 (188.1 bits), Expect = 5.8e-50, P = 5.8e-50
 Identities = 121/336 (36%), Positives = 190/336 (56%)

Query:    22 ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
             I L + C    + S +      VE  +KW  ++G++Y  E E + R  ++++N++ I+  
Sbjct:     5 IFLAMLCLGMALPSPAPDPILDVE-WQKWKIKYGKAYSLEEEGQKRA-VWEDNMKKIKLH 62

Query:    82 NKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPT 138
             N E   G   + +  N F D+T +EFR +    ++P P+ +         + LS+ ++P 
Sbjct:    63 NGENGLGKHGFTMEMNAFGDMTLEEFRKVMI--EIPVPTVKKGKSVQ---KRLSV-NLPK 116

Query:   139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNN 197
              ++W+ +G VTP++ Q  C  CWAF+   A+EG    ++G LI LS Q L+DCS   GN 
Sbjct:   117 FINWKKRGYVTPVQTQGRCNSCWAFSVTGAIEGQMFRKTGQLIPLSVQNLVDCSRPQGNW 176

Query:   198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
             GC  G+   A  Y+++N G+ +E  YPY+   G+C  + + + A I+ +E VP  +E AL
Sbjct:   177 GCYLGNTYLALHYVMENGGLESEATYPYEEKDGSCRYSPENSTANITGFEFVPK-NEDAL 235

Query:   258 LKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVCGT-QLDHAVTIVGFGTT---EDGANY 311
             + AV S+ P+S+AI A    F  YK GI+    C +  + H++ +VG+G T    DG  Y
Sbjct:   236 MNAVASIGPISVAIDARHASFLFYKRGIYYEPNCSSCVVTHSMLLVGYGFTGRESDGRKY 295

Query:   312 WLIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYP 346
             WL+KNS G  WG+ GYMKI RD+G  CGI T + YP
Sbjct:   296 WLVKNSMGTQWGNKGYMKISRDKGNHCGIATYALYP 331


>MGI|MGI:1861723 [details] [associations]
            symbol:Ctsr "cathepsin R" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=ISA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=ISA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0030163 "protein
            catabolic process" evidence=ISA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1861723 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0030163
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF245399
            EMBL:AY014778 EMBL:AK014432 EMBL:AK005429 IPI:IPI00120321
            RefSeq:NP_064680.1 UniGene:Mm.315715 ProteinModelPortal:Q9JIA9
            SMR:Q9JIA9 MEROPS:C01.042 PRIDE:Q9JIA9 Ensembl:ENSMUST00000021889
            GeneID:56835 KEGG:mmu:56835 CTD:56835 InParanoid:Q9JIA9 KO:K09601
            OMA:ASHESFK OrthoDB:EOG4ZCT6D NextBio:313379 Bgee:Q9JIA9
            CleanEx:MM_CTSR Genevestigator:Q9JIA9 GermOnline:ENSMUSG00000055679
            Uniprot:Q9JIA9
        Length = 334

 Score = 519 (187.8 bits), Expect = 7.4e-50, P = 7.4e-50
 Identities = 110/309 (35%), Positives = 173/309 (55%)

Query:    48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
             + W  ++ +SY  + EK  R+ +++E L+ I+  N+E   G   + +  N+F D T++EF
Sbjct:    30 QDWKIKYNKSYSLKEEKLKRV-VWEEKLKMIKLHNRENSLGKNGFTMKMNEFGDQTDEEF 88

Query:   105 RALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
             R +    ++   +HR       K +  S+  +P  +DWR KG VTP++ Q +C  CWAFA
Sbjct:    89 RKMMI--EISVWTHREGKSI-MKREAGSI--LPKFVDWRKKGYVTPVRRQGDCDACWAFA 143

Query:   165 AVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
                A+E     ++G L  LS Q L+DCS   GNNGCLGG    AF Y++ N G+ +E  Y
Sbjct:   144 VTGAIEAQAIWQTGKLTPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVLHNGGLESEATY 203

Query:   224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEG 283
             PY+   G C    K + A+I+ +  +P  ++  +    ++ P++  I A    F++YK G
Sbjct:   204 PYEGKDGPCRYNPKNSKAEITGFVSLPQSEDILMAAVATIGPITAGIDASHESFKNYKGG 263

Query:   284 IFNGV-CGTQ-LDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGL-C 337
             I++   C +  + H V +VG+G      DG +YWLIKNSWG  WG  GYMK+ +D+   C
Sbjct:   264 IYHEPNCSSDTVTHGVLVVGYGFKGIETDGNHYWLIKNSWGKRWGIRGYMKLAKDKNNHC 323

Query:   338 GIGTRSSYP 346
             GI + + YP
Sbjct:   324 GIASYAHYP 332


>UNIPROTKB|P09648 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9031 "Gallus gallus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            OrthoDB:EOG48PMKF MEROPS:C01.032 IPI:IPI00602255 PIR:S00081
            UniGene:Gga.523 ProteinModelPortal:P09648 SMR:P09648 Uniprot:P09648
        Length = 218

 Score = 514 (186.0 bits), Expect = 2.5e-49, P = 2.5e-49
 Identities = 102/217 (47%), Positives = 139/217 (64%)

Query:   137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST-NG 195
             P S+DWR+KG VTP+K+Q +CG CWAF+   A+EG      G L+ LSEQ L+DCS   G
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRPEG 61

Query:   196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVPSGDE 254
             N GC GG  ++AF Y+  N GI +E+ YPY A     C    +  AA  + + ++P G E
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHE 121

Query:   255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFGTTEDGANY 311
             +AL+KAV S+ PVS+AI A  + FQ Y+ GI+    C ++ LDH V +VG+G  E G  Y
Sbjct:   122 RALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGF-EGGKKY 180

Query:   312 WLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
             W++KNSWG  WGD GY+ + +D +  CGI T +SYPL
Sbjct:   181 WIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPL 217


>FB|FBgn0250848 [details] [associations]
            symbol:26-29-p "26-29kD-proteinase" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005811
            "lipid particle" evidence=IDA] [GO:0005875 "microtubule associated
            complex" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005875 EMBL:AE014296 GO:GO:0005811 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 MEROPS:I29.003 HSSP:O65039
            EMBL:AY122222 EMBL:AB011376 RefSeq:NP_620470.1 UniGene:Dm.3049
            SMR:Q9V3U6 MINT:MINT-890485 STRING:Q9V3U6
            EnsemblMetazoa:FBtr0075766 GeneID:39547 KEGG:dme:Dmel_CG8947
            UCSC:CG8947-RA CTD:39547 FlyBase:FBgn0250848 InParanoid:Q9V3U6
            OMA:IHSKNRA OrthoDB:EOG4BVQ8T GenomeRNAi:39547 NextBio:814210
            Uniprot:Q9V3U6
        Length = 549

 Score = 512 (185.3 bits), Expect = 4.1e-49, P = 4.1e-49
 Identities = 114/315 (36%), Positives = 165/315 (52%)

Query:    40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
             ++ V +    +  +HG +Y  + E E R  IF++NL YI   N+    TY L  N  +D 
Sbjct:   238 DEHVDKAFHHFKRKHGVAYHSDTEHEHRKNIFRQNLRYIHSKNR-AKLTYTLAVNHLADK 296

Query:   100 TNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTD-VPTSLDWRDKGAVTPIKNQKECG 158
             T +E +A   GYK    S        F Y      D +P   DWR  GAVTP+K+Q  CG
Sbjct:   297 TEEELKAR-RGYKS---SGIYNTGKPFPYDVPKYKDEIPDQYDWRLYGAVTPVKDQSVCG 352

Query:   159 CCWAFAAVAAVEGITKIRSG-NLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQG 216
              CW+F  +  +EG   +++G NL++LS+Q L+DCS   GNNGC GG   + + +++Q+ G
Sbjct:   353 SCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDCSWAYGNNGCDGGEDFRVYQWMLQSGG 412

Query:   217 IATEDEY-PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL-LKAVSMQPVSIAIAAYS 274
             + TE+EY PY    G C        A I  +  V S D  A  L  +   P+S+AI A  
Sbjct:   413 VPTEEEYGPYLGQDGYCHVNNVTLVAPIKGFVNVTSNDPNAFKLALLKHGPLSVAIDASP 472

Query:   275 TEFQSYKEGIF-NGVCGTQ---LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
               F  Y  G++    C      LDHAV  VG+G+  +G +YWL+KNSW   WG+ GY+ +
Sbjct:   473 KTFSFYSHGVYYEPTCKNDVDGLDHAVLAVGYGSI-NGEDYWLVKNSWSTYWGNDGYILM 531

Query:   331 VRDEGLCGIGTRSSY 345
                +  CG+ T  +Y
Sbjct:   532 SAKKNNCGVMTMPTY 546


>UNIPROTKB|J9P7C5 [details] [associations]
            symbol:J9P7C5 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 EMBL:AAEX03010953
            Ensembl:ENSCAFT00000012925 Uniprot:J9P7C5
        Length = 321

 Score = 512 (185.3 bits), Expect = 4.1e-49, P = 4.1e-49
 Identities = 115/307 (37%), Positives = 176/307 (57%)

Query:    49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFR 105
             +W A H R Y    E+  R  ++++N++ IE  N+E   G   + +  N F D+TN+EFR
Sbjct:    26 QWKAMHRRLYGMN-EEGWRRAVWEKNMKMIELHNREYSQGKHGFTMAMNAFGDMTNEEFR 84

Query:   106 ALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
              +  G++  +  H+        +Q     ++P S+DWR+KG VTP+KNQ +CG CWAF+A
Sbjct:    85 QVINGFQ--NQKHKKGKV----FQEPLFAEIPKSVDWREKGYVTPVKNQGQCGSCWAFSA 138

Query:   166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
               A EG    ++GNL+ LSEQ L      GN GC GG  + AF Y+  N+ + +E+ YPY
Sbjct:   139 TGAFEGQMFWKTGNLVPLSEQNL----AQGNEGCNGGLMDNAFQYVKDNRCLDSEESYPY 194

Query:   226 QAVP-GTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKEG 283
                   TC+   + +AA  S + ++P   E+AL+KA++ +  +++AI A    FQ YK  
Sbjct:   195 LGRDTDTCNYKPECSAAHDSGFVDLPQR-EKALMKAMATLGSITVAIDAGHQYFQFYKSS 253

Query:   284 I-FNGVCGTQ-LDHAVTIVGFGTT-EDGANYWLIKNSWGNTWGDAGYMKIVRDEGL-CGI 339
             I F+  C ++ LDH V +VG+G    D  N W++KNSW   WG   Y+K+ + +   CGI
Sbjct:   254 IYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKWIVKNSWSPEWGWNSYVKMAKGQNNHCGI 313

Query:   340 GTRSSYP 346
              T +SYP
Sbjct:   314 -TAASYP 319


>WB|WBGene00007055 [details] [associations]
            symbol:tag-196 species:6239 "Caenorhabditis elegans"
            [GO:0004869 "cysteine-type endopeptidase inhibitor activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000010
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00031 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00043 SMART:SM00645 InterPro:IPR000169
            GO:GO:0004869 eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 EMBL:FO080488 PIR:T31871
            RefSeq:NP_505215.2 HSSP:Q9UBX1 ProteinModelPortal:O16454 SMR:O16454
            DIP:DIP-27400N IntAct:O16454 MINT:MINT-1044990 MEROPS:C01.A50
            PaxDb:O16454 EnsemblMetazoa:F41E6.6.1 EnsemblMetazoa:F41E6.6.2
            EnsemblMetazoa:F41E6.6.3 GeneID:179240 KEGG:cel:CELE_F41E6.6
            UCSC:F41E6.6.1 CTD:179240 WormBase:F41E6.6 InParanoid:O16454
            OMA:GGGLMTN NextBio:904514 Uniprot:O16454
        Length = 477

 Score = 512 (185.3 bits), Expect = 4.1e-49, P = 4.1e-49
 Identities = 110/294 (37%), Positives = 161/294 (54%)

Query:    53 QHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK 112
             +H + Y ++ E   R ++FK+N + I +  K    T   G  +FSD+T  EF+ +   Y+
Sbjct:   180 RHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFKKIMLPYQ 239

Query:   113 MPSPSHRXXXXXXFKYQ-NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEG 171
                P +        K+   ++  D+P S DWR+KGAVT +KNQ  CG CWAF+    VEG
Sbjct:   240 WEQPVYPMEQANFEKHDVTINEEDLPESFDWREKGAVTQVKNQGNCGSCWAFSTTGNVEG 299

Query:   172 ITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT 231
                I    L+ LSEQ+L+DC +  + GC GG    A+  II+  G+  ED YPY     T
Sbjct:   300 AWFIAKNKLVSLSEQELVDCDSM-DQGCNGGLPSNAYKEIIRMGGLEPEDAYPYDGRGET 358

Query:   232 CSAAQKPAAAKISNYEEVPSGDEQALLK-AVSMQPVSIAIAAYSTEFQSYKEGI---FNG 287
             C   +K  A  I+   E+P  DE  + K  V+  P+SI + A + +F  Y+ G+   F  
Sbjct:   359 CHLVRKDIAVYINGSVELPH-DEVEMQKWLVTKGPISIGLNANTLQF--YRHGVVHPFKI 415

Query:   288 VCGT-QLDHAVTIVGFGTTEDGAN-YWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
              C    L+H V IVG+G  +DG   YW++KNSWG  WG+AGY K+ R + +CG+
Sbjct:   416 FCEPFMLNHGVLIVGYG--KDGRKPYWIVKNSWGPNWGEAGYFKLYRGKNVCGV 467


>UNIPROTKB|Q90686 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9031 "Gallus gallus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            CTD:1513 KO:K01371 PANTHER:PTHR12411:SF55 EMBL:U37691
            IPI:IPI00575213 RefSeq:NP_990302.1 UniGene:Gga.51509
            ProteinModelPortal:Q90686 SMR:Q90686 MEROPS:C01.036 GeneID:395818
            KEGG:gga:395818 NextBio:20815886 Uniprot:Q90686
        Length = 334

 Score = 510 (184.6 bits), Expect = 6.7e-49, P = 6.7e-49
 Identities = 105/272 (38%), Positives = 157/272 (57%)

Query:    79 EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPT 138
             ++  + G  +++L  N   D+T++E     TG ++P    R        Y     +  P 
Sbjct:    66 QRGARLGKHSFQLAMNYLGDMTSEEVVRTMTGLRVPRSRPRPNGTL---YVPDWSSRAPA 122

Query:   139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
             ++DWR KG VTP+K+Q +CG CWAF++V A+EG  K R+G L+ LS Q L+ C +N NNG
Sbjct:   123 AVDWRRKGYVTPVKDQGQCGSCWAFSSVGALEGQLKRRTGKLLSLSPQNLVYCVSN-NNG 181

Query:   199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALL 258
             C GG    AF Y+  N+GI +ED YPY     +C  +    AAK   Y E+P  +E+AL 
Sbjct:   182 CGGGYMTNAFEYVRLNRGIDSEDAYPYIGQDESCMYSPTGKAAKCRGYREIPEDNEKALK 241

Query:   259 KAVS-MQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIK 315
             +AV+ + PVS+ I A    FQ Y  G++    C  + ++HAV  VG+G  + G  +W+IK
Sbjct:   242 RAVARIGPVSVGIDASLPSFQFYSRGVYYDTGCNPENINHAVLAVGYGA-QKGTKHWIIK 300

Query:   316 NSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
             NSWG  WG+ GY+ + R+ +  CGI   +S+P
Sbjct:   301 NSWGTEWGNKGYVLLARNMKQTCGIANLASFP 332


>GENEDB_PFALCIPARUM|PF11_0165 [details] [associations]
            symbol:PF11_0165 "falcipain 2 precursor"
            species:5833 "Plasmodium falciparum" [GO:0020020 "food vacuole"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014186 HSSP:O65039 GO:GO:0020020
            RefSeq:XP_001347836.1 ProteinModelPortal:Q8I6U4 SMR:Q8I6U4
            IntAct:Q8I6U4 MINT:MINT-1559493 MEROPS:C01.046
            EnsemblProtists:PF11_0165:mRNA GeneID:810712 KEGG:pfa:PF11_0165
            EuPathDB:PlasmoDB:PF3D7_1115700 HOGENOM:HOG000065857 OMA:NESLHAN
            ProtClustDB:PTZ00021 BindingDB:Q8I6U4 ChEMBL:CHEMBL3470
            Uniprot:Q8I6U4
        Length = 484

 Score = 509 (184.2 bits), Expect = 8.5e-49, P = 8.5e-49
 Identities = 121/316 (38%), Positives = 168/316 (53%)

Query:    54 HGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM 113
             + + Y    E + R ++F +N   +   N   N  YK   N+F+DLT  EF+  Y   + 
Sbjct:   172 NNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKELNRFADLTYHEFKNKYLSLRS 231

Query:   114 PSP---SHRXXXXXXF-----KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
               P   S        +     KY+     D   + DWR    VTP+K+QK CG CWAF++
Sbjct:   232 SKPLKNSKYLLDQMNYEEVIKKYKGNENFD-HAAYDWRLHSGVTPVKDQKNCGSCWAFSS 290

Query:   166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
             + +VE    IR   LI LSEQ+L+DCS   N GC GG    AF  +I+  GI T+D+YPY
Sbjct:   291 IGSVESQYAIRKNKLITLSEQELVDCSFK-NYGCNGGLINNAFEDMIELGGICTDDDYPY 349

Query:   226 QA-VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
              +  P  C+  +      I NY  VP    +  L+ +   P+SI++A  S +F  YKEGI
Sbjct:   350 VSDAPNLCNIDRCTEKYGIKNYLSVPDNKLKEALRFLG--PISISVAV-SDDFAFYKEGI 406

Query:   285 FNGVCGTQLDHAVTIVGFGT-------TEDGAN--YWLIKNSWGNTWGDAGYMKIVRDE- 334
             F+G CG QL+HAV +VGFG        T+ G    Y++IKNSWG  WG+ G++ I  DE 
Sbjct:   407 FDGECGDQLNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDES 466

Query:   335 GL---CGIGTRSSYPL 347
             GL   CG+GT +  PL
Sbjct:   467 GLMRKCGLGTDAFIPL 482


>UNIPROTKB|Q10991 [details] [associations]
            symbol:CTSL "Cathepsin L1" species:9940 "Ovis aries"
            [GO:0005515 "protein binding" evidence=IPI] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            MEROPS:C01.032 ProteinModelPortal:Q10991 SMR:Q10991 Uniprot:Q10991
        Length = 217

 Score = 509 (184.2 bits), Expect = 8.5e-49, P = 8.5e-49
 Identities = 98/216 (45%), Positives = 143/216 (66%)

Query:   136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST-N 194
             VP S+DW  KG VTP+KNQ +CG CWAF+A  A+EG    ++G L+ LSEQ L+D S   
Sbjct:     1 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQ 60

Query:   195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
             GN GC GG  + AF YI +N G+ +E+ YPY+A   +C+   + +AAK + + ++P   E
Sbjct:    61 GNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKDTGFVDIPQR-E 119

Query:   255 QALLKAVS-MQPVSIAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFGTTEDGANY 311
             +AL+KAV+ + P+S+AI A  + FQ YK GI+ +  C ++ LDH V +VG+G       +
Sbjct:   120 KALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTNNKF 179

Query:   312 WLIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYP 346
             W++KNSWG  WG+ GY+K+ +D+   CGI T +SYP
Sbjct:   180 WIVKNSWGPEWGNKGYVKMAKDQNNHCGIATAASYP 215


>UNIPROTKB|Q8I6U4 [details] [associations]
            symbol:PF11_0165 "Falcipain-2A" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            EMBL:AE014186 HSSP:O65039 GO:GO:0020020 RefSeq:XP_001347836.1
            ProteinModelPortal:Q8I6U4 SMR:Q8I6U4 IntAct:Q8I6U4
            MINT:MINT-1559493 MEROPS:C01.046 EnsemblProtists:PF11_0165:mRNA
            GeneID:810712 KEGG:pfa:PF11_0165 EuPathDB:PlasmoDB:PF3D7_1115700
            HOGENOM:HOG000065857 OMA:NESLHAN ProtClustDB:PTZ00021
            BindingDB:Q8I6U4 ChEMBL:CHEMBL3470 Uniprot:Q8I6U4
        Length = 484

 Score = 509 (184.2 bits), Expect = 8.5e-49, P = 8.5e-49
 Identities = 121/316 (38%), Positives = 168/316 (53%)

Query:    54 HGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM 113
             + + Y    E + R ++F +N   +   N   N  YK   N+F+DLT  EF+  Y   + 
Sbjct:   172 NNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKELNRFADLTYHEFKNKYLSLRS 231

Query:   114 PSP---SHRXXXXXXF-----KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
               P   S        +     KY+     D   + DWR    VTP+K+QK CG CWAF++
Sbjct:   232 SKPLKNSKYLLDQMNYEEVIKKYKGNENFD-HAAYDWRLHSGVTPVKDQKNCGSCWAFSS 290

Query:   166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
             + +VE    IR   LI LSEQ+L+DCS   N GC GG    AF  +I+  GI T+D+YPY
Sbjct:   291 IGSVESQYAIRKNKLITLSEQELVDCSFK-NYGCNGGLINNAFEDMIELGGICTDDDYPY 349

Query:   226 QA-VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
              +  P  C+  +      I NY  VP    +  L+ +   P+SI++A  S +F  YKEGI
Sbjct:   350 VSDAPNLCNIDRCTEKYGIKNYLSVPDNKLKEALRFLG--PISISVAV-SDDFAFYKEGI 406

Query:   285 FNGVCGTQLDHAVTIVGFGT-------TEDGAN--YWLIKNSWGNTWGDAGYMKIVRDE- 334
             F+G CG QL+HAV +VGFG        T+ G    Y++IKNSWG  WG+ G++ I  DE 
Sbjct:   407 FDGECGDQLNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDES 466

Query:   335 GL---CGIGTRSSYPL 347
             GL   CG+GT +  PL
Sbjct:   467 GLMRKCGLGTDAFIPL 482


>UNIPROTKB|E9PTT3 [details] [associations]
            symbol:Ctsr "Protein Ctsr" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            IPI:IPI00627092 Ensembl:ENSRNOT00000024115 RGD:631422
            Uniprot:E9PTT3
        Length = 334

 Score = 506 (183.2 bits), Expect = 1.8e-48, P = 1.8e-48
 Identities = 113/313 (36%), Positives = 176/313 (56%)

Query:    45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTN 101
             E H+    ++ +SY  E E+  R  +++EN++ I+  N+E   G   + +  N+F DLT 
Sbjct:    28 EWHDX-KTEYEKSYTME-EEGHRRAVWEENMKMIKLHNRENSLGKNGFIMEMNEFGDLTA 85

Query:   102 DEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
             +EFR +     +P  SHR       + +++    +P  +DWR KG VT ++NQK C  CW
Sbjct:    86 EEFRKMMVN--IPIRSHRKGKI--IRKRDVGNV-LPKFVDWRKKGYVTRVQNQKFCNSCW 140

Query:   162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATE 220
             AFA   A+EG    ++G L  LS Q L+DC+ + GN GC  G    A+ Y++ N G+  E
Sbjct:   141 AFAVTGAIEGQMFNKTGQLTPLSVQNLVDCTKSQGNEGCQWGDPHIAYEYVLNNGGLEAE 200

Query:   221 DEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQS 279
               YPY+   G C    K + A+I+ +  +P   E  L++AV+ + P+S+A+ A    F  
Sbjct:   201 ATYPYKGKEGVCRYNPKHSKAEITGFVSLPES-EDILMEAVATIGPISVAVDASFNSFGF 259

Query:   280 YKEGIFNGV-CGTQ-LDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRDE 334
             YK+G+++   C    ++H+V +VG+G      DG +YWLIKNSWG  WG  GYMKI +D+
Sbjct:   260 YKKGLYDEPNCSNNTVNHSVLVVGYGFEGNETDGNSYWLIKNSWGRKWGLRGYMKIPKDQ 319

Query:   335 G-LCGIGTRSSYP 346
                C I + + YP
Sbjct:   320 NNFCAIASYAHYP 332


>GENEDB_PFALCIPARUM|PF11_0161 [details] [associations]
            symbol:PF11_0161 "falcipain-2 precursor,
            putative" species:5833 "Plasmodium falciparum" [GO:0020020 "food
            vacuole" evidence=TAS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014186 HSSP:O65039 OMA:NNVEHIN GO:GO:0020020
            MEROPS:C01.046 HOGENOM:HOG000065857 ProtClustDB:PTZ00021
            RefSeq:XP_001347832.1 ProteinModelPortal:Q8I6U5 SMR:Q8I6U5
            IntAct:Q8I6U5 MINT:MINT-1546851 EnsemblProtists:PF11_0161:mRNA
            GeneID:810708 KEGG:pfa:PF11_0161 EuPathDB:PlasmoDB:PF3D7_1115300
            Uniprot:Q8I6U5
        Length = 482

 Score = 503 (182.1 bits), Expect = 3.7e-48, P = 3.7e-48
 Identities = 120/316 (37%), Positives = 169/316 (53%)

Query:    54 HGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM 113
             + + Y    E + R ++F +N   ++  N      YK   N+F+DLT  EF++ Y   + 
Sbjct:   170 NNKQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSLYKKELNRFADLTYHEFKSKYLTLRS 229

Query:   114 PSP---SHRXXXXXXF-----KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
               P   S        +     KY+     D   + DWR    VTP+K+QK CG CWAF++
Sbjct:   230 SKPLKNSKYLLDQINYDAVIKKYKGNENFD-HAAYDWRLHSGVTPVKDQKNCGSCWAFSS 288

Query:   166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
             + +VE    IR   LI LSEQ+L+DCS   N GC GG    AF  +I+  GI T+D+YPY
Sbjct:   289 IGSVESQYAIRKNKLITLSEQELVDCSFK-NYGCNGGLINNAFEDMIELGGICTDDDYPY 347

Query:   226 QA-VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
              +  P  C+  +      I NY  VP    +  L+ +   P+SI+IA  S +F  YKEGI
Sbjct:   348 VSDAPNLCNIDRCTEKYGIKNYLSVPDNKLKEALRFLG--PISISIAV-SDDFPFYKEGI 404

Query:   285 FNGVCGTQLDHAVTIVGFGT-------TEDGAN--YWLIKNSWGNTWGDAGYMKIVRDE- 334
             F+G CG +L+HAV +VGFG        T+ G    Y++IKNSWG  WG+ G++ I  DE 
Sbjct:   405 FDGECGDELNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDES 464

Query:   335 GL---CGIGTRSSYPL 347
             GL   CG+GT +  PL
Sbjct:   465 GLMRKCGLGTDAFIPL 480


>UNIPROTKB|Q8I6U5 [details] [associations]
            symbol:PF11_0161 "Falcipain-2B" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            EMBL:AE014186 HSSP:O65039 OMA:NNVEHIN GO:GO:0020020 MEROPS:C01.046
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347832.1
            ProteinModelPortal:Q8I6U5 SMR:Q8I6U5 IntAct:Q8I6U5
            MINT:MINT-1546851 EnsemblProtists:PF11_0161:mRNA GeneID:810708
            KEGG:pfa:PF11_0161 EuPathDB:PlasmoDB:PF3D7_1115300 Uniprot:Q8I6U5
        Length = 482

 Score = 503 (182.1 bits), Expect = 3.7e-48, P = 3.7e-48
 Identities = 120/316 (37%), Positives = 169/316 (53%)

Query:    54 HGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM 113
             + + Y    E + R ++F +N   ++  N      YK   N+F+DLT  EF++ Y   + 
Sbjct:   170 NNKQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSLYKKELNRFADLTYHEFKSKYLTLRS 229

Query:   114 PSP---SHRXXXXXXF-----KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
               P   S        +     KY+     D   + DWR    VTP+K+QK CG CWAF++
Sbjct:   230 SKPLKNSKYLLDQINYDAVIKKYKGNENFD-HAAYDWRLHSGVTPVKDQKNCGSCWAFSS 288

Query:   166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
             + +VE    IR   LI LSEQ+L+DCS   N GC GG    AF  +I+  GI T+D+YPY
Sbjct:   289 IGSVESQYAIRKNKLITLSEQELVDCSFK-NYGCNGGLINNAFEDMIELGGICTDDDYPY 347

Query:   226 QA-VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
              +  P  C+  +      I NY  VP    +  L+ +   P+SI+IA  S +F  YKEGI
Sbjct:   348 VSDAPNLCNIDRCTEKYGIKNYLSVPDNKLKEALRFLG--PISISIAV-SDDFPFYKEGI 404

Query:   285 FNGVCGTQLDHAVTIVGFGT-------TEDGAN--YWLIKNSWGNTWGDAGYMKIVRDE- 334
             F+G CG +L+HAV +VGFG        T+ G    Y++IKNSWG  WG+ G++ I  DE 
Sbjct:   405 FDGECGDELNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDES 464

Query:   335 GL---CGIGTRSSYPL 347
             GL   CG+GT +  PL
Sbjct:   465 GLMRKCGLGTDAFIPL 480


>GENEDB_PFALCIPARUM|PF11_0162 [details] [associations]
            symbol:PF11_0162 "falcipain-3" species:5833
            "Plasmodium falciparum" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 HSSP:P43235 EMBL:AE014186 GO:GO:0020020
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347833.1
            ProteinModelPortal:Q8IIL0 SMR:Q8IIL0 MEROPS:C01.063
            EnsemblProtists:PF11_0162:mRNA GeneID:810709 KEGG:pfa:PF11_0162
            EuPathDB:PlasmoDB:PF3D7_1115400 OMA:ENDEDYW ChEMBL:CHEMBL1250373
            Uniprot:Q8IIL0
        Length = 492

 Score = 502 (181.8 bits), Expect = 4.7e-48, P = 4.7e-48
 Identities = 116/321 (36%), Positives = 171/321 (53%)

Query:    50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
             ++ ++ + Y+   E + R  IF EN   IE  NK+ N  YK G N+F DL+ +EFR+ Y 
Sbjct:   174 FLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFGDLSPEEFRSKYL 233

Query:   110 GYKMPSPSHRXXXXXXFK--YQNLSMTDVPT-------SLDWRDKGAVTPIKNQKECGCC 160
               K   P         ++  Y+++     P        + DWR  G VTP+K+Q  CG C
Sbjct:   234 NLKTHGPFKTLSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGGVTPVKDQALCGSC 293

Query:   161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
             WAF++V +VE    IR   L   SEQ+L+DCS   NNGC GG    AF  +I   G+ ++
Sbjct:   294 WAFSSVGSVESQYAIRKKALFLFSEQELVDCSVK-NNGCYGGYITNAFDDMIDLGGLCSQ 352

Query:   221 DEYPYQA-VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
             D+YPY + +P TC+  +      I +Y  +P    +  L+ +   P+SI+IAA S +F  
Sbjct:   353 DDYPYVSNLPETCNLKRCNERYTIKSYVSIPDDKFKEALRYLG--PISISIAA-SDDFAF 409

Query:   280 YKEGIFNGVCGTQLDHAVTIVGFGT----TEDGAN-----YWLIKNSWGNTWGDAGYMKI 330
             Y+ G ++G CG   +HAV +VG+G      ED        Y++IKNSWG+ WG+ GY+ +
Sbjct:   410 YRGGFYDGECGAAPNHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWGEGGYINL 469

Query:   331 VRDEG----LCGIGTRSSYPL 347
               DE      C IGT +  PL
Sbjct:   470 ETDENGYKKTCSIGTEAYVPL 490


>UNIPROTKB|Q8IIL0 [details] [associations]
            symbol:PF11_0162 "Falcipain-3" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 HSSP:P43235 EMBL:AE014186 GO:GO:0020020
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347833.1
            ProteinModelPortal:Q8IIL0 SMR:Q8IIL0 MEROPS:C01.063
            EnsemblProtists:PF11_0162:mRNA GeneID:810709 KEGG:pfa:PF11_0162
            EuPathDB:PlasmoDB:PF3D7_1115400 OMA:ENDEDYW ChEMBL:CHEMBL1250373
            Uniprot:Q8IIL0
        Length = 492

 Score = 502 (181.8 bits), Expect = 4.7e-48, P = 4.7e-48
 Identities = 116/321 (36%), Positives = 171/321 (53%)

Query:    50 WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
             ++ ++ + Y+   E + R  IF EN   IE  NK+ N  YK G N+F DL+ +EFR+ Y 
Sbjct:   174 FLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFGDLSPEEFRSKYL 233

Query:   110 GYKMPSPSHRXXXXXXFK--YQNLSMTDVPT-------SLDWRDKGAVTPIKNQKECGCC 160
               K   P         ++  Y+++     P        + DWR  G VTP+K+Q  CG C
Sbjct:   234 NLKTHGPFKTLSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGGVTPVKDQALCGSC 293

Query:   161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
             WAF++V +VE    IR   L   SEQ+L+DCS   NNGC GG    AF  +I   G+ ++
Sbjct:   294 WAFSSVGSVESQYAIRKKALFLFSEQELVDCSVK-NNGCYGGYITNAFDDMIDLGGLCSQ 352

Query:   221 DEYPYQA-VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
             D+YPY + +P TC+  +      I +Y  +P    +  L+ +   P+SI+IAA S +F  
Sbjct:   353 DDYPYVSNLPETCNLKRCNERYTIKSYVSIPDDKFKEALRYLG--PISISIAA-SDDFAF 409

Query:   280 YKEGIFNGVCGTQLDHAVTIVGFGT----TEDGAN-----YWLIKNSWGNTWGDAGYMKI 330
             Y+ G ++G CG   +HAV +VG+G      ED        Y++IKNSWG+ WG+ GY+ +
Sbjct:   410 YRGGFYDGECGAAPNHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWGEGGYINL 469

Query:   331 VRDEG----LCGIGTRSSYPL 347
               DE      C IGT +  PL
Sbjct:   470 ETDENGYKKTCSIGTEAYVPL 490


>DICTYBASE|DDB_G0290957 [details] [associations]
            symbol:cprA "cysteine proteinase 1" species:44689
            "Dictyostelium discoideum" [GO:0006972 "hyperosmotic response"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0290957
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GenomeReviews:CM000154_GR GO:GO:0005764
            GO:GO:0006972 EMBL:AAFI02000174 KO:K01376 EMBL:X02407 PIR:A22827
            RefSeq:XP_635417.1 ProteinModelPortal:P04988 MEROPS:C01.022
            GlycoSuiteDB:P04988 SWISS-2DPAGE:P04988 EnsemblProtists:DDB0201647
            GeneID:8627918 KEGG:ddi:DDB_G0290957 OMA:KISNFTM
            ProtClustDB:CLSZ2429603 Uniprot:P04988
        Length = 343

 Score = 500 (181.1 bits), Expect = 7.7e-48, P = 7.7e-48
 Identities = 124/343 (36%), Positives = 180/343 (52%)

Query:    18 MFIIITLLVSCASQVVSSRST--HEQS-VVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
             M +I+  +++  +  VSSR     EQS  +E  +K+  ++  S+++ LE   R +IFK N
Sbjct:     1 MKVILLFVLAVFTVFVSSRGIPLEEQSQFLEFQDKFNKKY--SHEEYLE---RFEIFKSN 55

Query:    75 LEYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNL 131
             L  IE+ N          K G N+F+DL++DEF+  Y   K    +           + +
Sbjct:    56 LGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFI 115

Query:   132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
             +   +PT+ DWR +GAVTP+KNQ +CG CW+F+    VEG   I    L+ LSEQ L+DC
Sbjct:   116 N--SIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDC 173

Query:   192 STN-----G----NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAA 241
                     G    + GC GG +  A+ YII+N GI TE  YPY A  GT C+       A
Sbjct:   174 DHECMEYEGEQACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGA 233

Query:   242 KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCG-TQLDHAVTIV 300
             KISN+  +P  +       VS  P  +AIAA + E+Q Y  G+F+  C    LDH + IV
Sbjct:   234 KISNFTMIPKNETVMAGYIVSTGP--LAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIV 291

Query:   301 GFGTTED----GANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             G+            YW++KNSWG  WG+ GY+ + R +  CG+
Sbjct:   292 GYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGV 334


>UNIPROTKB|E2RR02 [details] [associations]
            symbol:CTSF "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            OMA:LAPPEWD GeneTree:ENSGT00660000095458 EMBL:AAEX03011628
            Ensembl:ENSCAFT00000019742 Uniprot:E2RR02
        Length = 460

 Score = 497 (180.0 bits), Expect = 1.6e-47, P = 1.6e-47
 Identities = 111/308 (36%), Positives = 168/308 (54%)

Query:    43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
             +  + ++++  + R+Y+ + E E R+ +F  N+   +K       T + G  +FSDLT +
Sbjct:   158 MASVFKEFVTTYNRTYETKEEAEWRMSVFSNNMVRAQKIQALDRGTAQYGITKFSDLTEE 217

Query:   103 EFRALYTGYKMPSPSHRXXXXXXFKY-QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
             EFR +Y      +P  R       +  +++S    P   DWR KGAVT +K+Q  CG CW
Sbjct:   218 EFRTIYL-----NPLLRENRGKKMRLAKSISDHAPPPEWDWRSKGAVTKVKDQGMCGSCW 272

Query:   162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
             AF+    VEG   ++ G L+ LSEQ+LLDC    +  CLGG    A++ I+   G+ TED
Sbjct:   273 AFSVTGNVEGQWFLKEGTLLSLSEQELLDCD-KVDKACLGGLPSNAYSAIMTLGGLETED 331

Query:   222 EYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSY 280
             +Y YQ     CS + K A   I++  E+ S +EQ L   ++ + P+S+AI A+  +F  Y
Sbjct:   332 DYSYQGHLQACSFSAKKARVYINDSMEL-SQNEQKLAAWLAKKGPISVAINAFGMQF--Y 388

Query:   281 KEGI---FNGVCGTQL-DHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGL 336
             + GI      +C   L DHAV +VG+G    G  +W IKNSWG  WG+ GY  + R  G 
Sbjct:   389 RHGISHPLRPLCSPWLIDHAVLLVGYGN-RSGIPFWAIKNSWGTDWGEEGYYYLHRGSGA 447

Query:   337 CGIGTRSS 344
             CG+ T +S
Sbjct:   448 CGVNTMAS 455


>FB|FBgn0260462 [details] [associations]
            symbol:CG12163 species:7227 "Drosophila melanogaster"
            [GO:0035071 "salivary gland cell autophagic cell death"
            evidence=IEP] [GO:0048102 "autophagic cell death" evidence=IEP]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0004869 "cysteine-type
            endopeptidase inhibitor activity" evidence=IEA] [GO:0045169
            "fusome" evidence=IDA] [GO:0035220 "wing disc development"
            evidence=IGI] [GO:0022416 "chaeta development" evidence=IGI]
            InterPro:IPR000010 InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00043 SMART:SM00645
            InterPro:IPR000169 EMBL:AE014297 GO:GO:0004869 eggNOG:COG4870
            KO:K01373 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0022416 GO:GO:0035220 GO:GO:0035071
            GO:GO:0045169 GeneTree:ENSGT00660000095458 EMBL:AY121614
            EMBL:BT003231 RefSeq:NP_649521.1 RefSeq:NP_730901.1
            RefSeq:NP_730902.2 UniGene:Dm.7315 ProteinModelPortal:Q9VN93
            SMR:Q9VN93 DIP:DIP-17491N IntAct:Q9VN93 MINT:MINT-763966
            STRING:Q9VN93 MEROPS:C01.A27 PaxDb:Q9VN93
            EnsemblMetazoa:FBtr0078823 GeneID:40628 KEGG:dme:Dmel_CG12163
            UCSC:CG12163-RA FlyBase:FBgn0260462 InParanoid:Q9VN93 OMA:GPRWGEQ
            OrthoDB:EOG4CC2G9 PhylomeDB:Q9VN93 GenomeRNAi:40628 NextBio:819744
            Bgee:Q9VN93 GermOnline:CG12163 Uniprot:Q9VN93
        Length = 614

 Score = 493 (178.6 bits), Expect = 4.2e-47, P = 4.2e-47
 Identities = 111/349 (31%), Positives = 183/349 (52%)

Query:     6 ERSGSFKINTTPMFIII-TLLVSCASQVVSSRSTHEQSVVE-IHEKWMAQHGRSYKDELE 63
             E   +FK    P+     T  V  A +    + +H    V+ +  K+  + GR Y    E
Sbjct:   265 EHEITFKCRNQPVVQARHTRSVEWAEKKTHKKHSHRFDKVDHLFYKFQVRFGRRYVSTAE 324

Query:    64 KEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXX 123
             ++MRL+IF++NL+ IE+ N     + K G  +F+D+T+ E++   TG      +      
Sbjct:   325 RQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSEYKER-TGLWQRDEAKATGGS 383

Query:   124 XXF--KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLI 181
                   Y      ++P   DWR K AVT +KNQ  CG CWAF+    +EG+  +++G L 
Sbjct:   384 AAVVPAYHG----ELPKEFDWRQKDAVTQVKNQGSCGSCWAFSVTGNIEGLYAVKTGELK 439

Query:   182 QLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAA 241
             + SEQ+LLDC T  ++ C GG  + A+  I    G+  E EYPY+A    C   +  +  
Sbjct:   440 EFSEQELLDCDTT-DSACNGGLMDNAYKAIKDIGGLEYEAEYPYKAKKNQCHFNRTLSHV 498

Query:   242 KISNYEEVPSGDEQALLK-AVSMQPVSIAIAAYSTEFQSYKEGI---FNGVCGTQ-LDHA 296
             +++ + ++P G+E A+ +  ++  P+SI I A + +F  Y+ G+   +  +C  + LDH 
Sbjct:   499 QVAGFVDLPKGNETAMQEWLLANGPISIGINANAMQF--YRGGVSHPWKALCSKKNLDHG 556

Query:   297 VTIVGFGTTEDGAN------YWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             V +VG+G + D  N      YW++KNSWG  WG+ GY ++ R +  CG+
Sbjct:   557 VLVVGYGVS-DYPNFHKTLPYWIVKNSWGPRWGEQGYYRVYRGDNTCGV 604


>FB|FBgn0032228 [details] [associations]
            symbol:CG5367 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE014134 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 HSSP:P80067
            RefSeq:NP_609387.1 UniGene:Dm.26782 ProteinModelPortal:Q9VKY4
            SMR:Q9VKY4 MEROPS:C01.A30 EnsemblMetazoa:FBtr0080055 GeneID:34401
            KEGG:dme:Dmel_CG5367 UCSC:CG5367-RA FlyBase:FBgn0032228
            InParanoid:Q9VKY4 OMA:QIVDCSV OrthoDB:EOG4THT8X PhylomeDB:Q9VKY4
            GenomeRNAi:34401 NextBio:788324 ArrayExpress:Q9VKY4 Bgee:Q9VKY4
            Uniprot:Q9VKY4
        Length = 338

 Score = 493 (178.6 bits), Expect = 4.2e-47, P = 4.2e-47
 Identities = 103/307 (33%), Positives = 170/307 (55%)

Query:    48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEF 104
             EK+   + R Y    ++    K F+EN + IE+ N   KEG  +++L  N F+D++ D +
Sbjct:    37 EKFKNNNNRKYLRTYDEMRSYKAFEENFKVIEEHNQNYKEGQTSFRLKPNIFADMSTDGY 96

Query:   105 RALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
                +      +               L M +VP SLDWR KG +TP  NQ  CG C+AF+
Sbjct:    97 LKGFLRLLKSNIEDSADNMAEIVGSPL-MANVPESLDWRSKGFITPPYNQLSCGSCYAFS 155

Query:   165 AVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
                ++ G    R+G ++ LS+QQ++DCS ++GN GC+GGS     +Y+    GI  + +Y
Sbjct:   156 IAESIMGQVFKRTGKILSLSKQQIVDCSVSHGNQGCVGGSLRNTLSYLQSTGGIMRDQDY 215

Query:   224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKE 282
             PY A  G C      +   ++++  +P  DEQA+  AV+ + PV+I+I A    FQ Y +
Sbjct:   216 PYVARKGKCQFVPDLSVVNVTSWAILPVRDEQAIQAAVTHIGPVAISINASPKTFQLYSD 275

Query:   283 GIFNG-VCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIG 340
             GI++  +C +  ++HA+ ++GFG      +YW++KN WG  WG+ GY++I +   +CGI 
Sbjct:   276 GIYDDPLCSSASVNHAMVVIGFGK-----DYWILKNWWGQNWGENGYIRIRKGVNMCGIA 330

Query:   341 TRSSYPL 347
               ++Y +
Sbjct:   331 NYAAYAI 337


>MGI|MGI:1861434 [details] [associations]
            symbol:Ctsf "cathepsin F" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=ISS]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISS] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:1861434 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513 MEROPS:C01.018
            CTD:8722 OMA:LAPPEWD OrthoDB:EOG4CC41T EMBL:AF136280 EMBL:AF217224
            EMBL:AJ131851 EMBL:AK075862 EMBL:BC058758 IPI:IPI00126769
            RefSeq:NP_063914.1 UniGene:Mm.29561 ProteinModelPortal:Q9R013
            SMR:Q9R013 STRING:Q9R013 PhosphoSite:Q9R013 PaxDb:Q9R013
            PRIDE:Q9R013 Ensembl:ENSMUST00000119694 GeneID:56464 KEGG:mmu:56464
            UCSC:uc008gbc.1 GeneTree:ENSGT00660000095458 InParanoid:Q9R013
            NextBio:312722 Bgee:Q9R013 CleanEx:MM_CTSF Genevestigator:Q9R013
            GermOnline:ENSMUSG00000006458 Uniprot:Q9R013
        Length = 462

 Score = 492 (178.3 bits), Expect = 5.4e-47, P = 5.4e-47
 Identities = 109/304 (35%), Positives = 158/304 (51%)

Query:    46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
             + + +M  + R+Y+   E + RL +F  N+   +K       T + G  +FSDLT +EF 
Sbjct:   164 LFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFH 223

Query:   106 ALYTGYKMPSPSHRXXXXXXFKYQNLSMTDV-PTSLDWRDKGAVTPIKNQKECGCCWAFA 164
              +Y    +   S R            S+ D+ P   DWR KGAVT +KNQ  CG CWAF+
Sbjct:   224 TIYLNPLLQKESGRKMSPAK------SINDLAPPEWDWRKKGAVTEVKNQGMCGSCWAFS 277

Query:   165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYP 224
                 VEG   +  G L+ LSEQ+LLDC    +  CLGG    A+A I    G+ TED+Y 
Sbjct:   278 VTGNVEGQWFLNRGTLLSLSEQELLDCD-KVDKACLGGLPSNAYAAIKNLGGLETEDDYG 336

Query:   225 YQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
             YQ    TC+ + + A   I++  E+   + +         P+S+AI A+  +F  Y+ GI
Sbjct:   337 YQGHVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKGPISVAINAFGMQF--YRHGI 394

Query:   285 ---FNGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIG 340
                F  +C    +DHAV +VG+G   +   YW IKNSWG+ WG+ GY  + R  G CG+ 
Sbjct:   395 AHPFRPLCSPWFIDHAVLLVGYGNRSN-IPYWAIKNSWGSDWGEEGYYYLYRGSGACGVN 453

Query:   341 TRSS 344
             T +S
Sbjct:   454 TMAS 457


>RGD|1588248 [details] [associations]
            symbol:Cts8 "cathepsin 8" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1588248 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 IPI:IPI00765053
            RefSeq:NP_001121688.1 UniGene:Rn.220599 Ensembl:ENSRNOT00000061486
            GeneID:680718 KEGG:rno:680718 UCSC:RGD:1588248 CTD:56094
            OMA:DSEWQEW OrthoDB:EOG4JT07C NextBio:719350 Uniprot:D3ZP54
        Length = 333

 Score = 492 (178.3 bits), Expect = 5.4e-47, P = 5.4e-47
 Identities = 111/338 (32%), Positives = 182/338 (53%)

Query:    21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
             ++ L + C   V  +    + S+    ++W  ++ ++Y  E E + R  +++EN++ +++
Sbjct:     4 VVLLAILCLG-VARATQPSDPSLDSEWQEWKTKYEKNYSLEEEGQKRA-VWEENMKVVKQ 61

Query:    81 ANKEGN---RTYKLGTNQFSDLTNDEFRALYTGYKMPS-PSHRXXXXXXFKYQNLSMTDV 136
              N E +   + + +  N F+D+T +EFR + T   + +    +      F+Y       +
Sbjct:    62 HNIEYDQEKKNFTMELNAFADMTGEEFRKMMTNIPVQNLRKKKSIHQPIFRY-------L 114

Query:   137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST-NG 195
             P  +DWR +G VT +KNQ  C  CWAF+   A+EG    ++G L+ LS Q L+DCS   G
Sbjct:   115 PKFVDWRRRGYVTSVKNQGTCNSCWAFSVAGAIEGQMFRKTGRLVSLSPQNLVDCSRPEG 174

Query:   196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
             N+GC  GS   A  Y+  N G+  E  YPY+   G C    + +AA+++ +  V +  E+
Sbjct:   175 NHGCHMGSTLYALKYVWSNGGLEAESTYPYEGKEGPCRYLPRRSAARVTGFSTV-ARSEE 233

Query:   256 ALLKAVS-MQPVSIAIAAYSTEFQSYKEGIF-NGVCGT-QLDHAVTIVGFG---TTEDGA 309
             AL+ AV+ + P+S+ I A    F+ Y+ GI+    C + +++H+V +VG+G      DG 
Sbjct:   234 ALMHAVATIGPISVGIDASHVSFRFYRRGIYYEPRCSSNRINHSVLVVGYGYEGRESDGR 293

Query:   310 NYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
              YWLIKNS G  WG  GYMK+ R     CGI T   YP
Sbjct:   294 KYWLIKNSHGVGWGMNGYMKLARGWNNHCGIATYGFYP 331


>UNIPROTKB|F1NHB8 [details] [associations]
            symbol:F1NHB8 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 EMBL:AADN02044011
            IPI:IPI00586027 Ensembl:ENSGALT00000021873 OMA:SELDHAV
            Uniprot:F1NHB8
        Length = 329

 Score = 491 (177.9 bits), Expect = 6.9e-47, P = 6.9e-47
 Identities = 110/300 (36%), Positives = 158/300 (52%)

Query:    55 GRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP 114
             G+ Y  E E E R + F  N+ ++   N+    +Y L  N  +D T  E  AL    +  
Sbjct:    34 GKRYSSEEEHEHRKRTFIHNMRFVHSKNRAA-LSYSLALNHLADRTPQEMAALRGRRRSG 92

Query:   115 SPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITK 174
              P         F  Q  +   +P SLDWR  GAVTP+K+Q  CG CW+FA   A+EG   
Sbjct:    93 DPK----SGQPFSMQLYASLVLPESLDWRLYGAVTPVKDQAVCGSCWSFATTGAMEGALF 148

Query:   175 IRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY-PYQAVPGTC 232
             +++G L  LS+Q L+DCS   GN  C GG   +A+ +I ++ GIA+ + Y PY    G C
Sbjct:   149 LKTGVLTPLSQQVLIDCSWGFGNYACDGGEEWRAYEWIKKHGGIASTESYGPYLGQNGYC 208

Query:   233 SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGIFNGV-CG 290
                Q    A ++ Y  V SG+ +AL  A+    PV++ I A    F  Y  G++    CG
Sbjct:   209 HYNQSELVAPLAGYVTVESGNAEALKAALFKHGPVAVNIDASHKSFTFYANGVYEEPHCG 268

Query:   291 ---TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
                ++LDHAV  VG+G    G +YWLIKNSW   WG+ GY+ +   +  CG+ T +S+P+
Sbjct:   269 NETSELDHAVLAVGYGVLH-GKSYWLIKNSWSTYWGNDGYILMAMKDNNCGVATAASFPI 327


>RGD|1309226 [details] [associations]
            symbol:Cts7 "cathepsin 7" species:10116 "Rattus norvegicus"
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005768 "endosome" evidence=IEA] [GO:0005794 "Golgi apparatus"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0007067
            "mitosis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IEA] [GO:0051301 "cell division" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 RGD:1309226 GO:GO:0005634
            GO:GO:0005794 GO:GO:0048471 GO:GO:0005615 GO:GO:0051301
            GO:GO:0007067 GO:GO:0005768 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 MEROPS:C01.016 CTD:56092
            GeneTree:ENSGT00560000076577 OrthoDB:EOG44QT2S EMBL:CH474032
            IPI:IPI00870531 RefSeq:NP_001099569.1 UniGene:Rn.218615
            Ensembl:ENSRNOT00000043686 GeneID:290970 KEGG:rno:290970
            UCSC:RGD:1309226 OMA:VESFNAN Uniprot:D3ZZ07
        Length = 331

 Score = 489 (177.2 bits), Expect = 1.1e-46, P = 1.1e-46
 Identities = 111/341 (32%), Positives = 182/341 (53%)

Query:    18 MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
             M + + L + C    +++    + S+    E+W   + ++Y  E EK+ R  +++EN++ 
Sbjct:     1 MTVAVFLAILCLRAALAAPRP-DYSLDAEWEEWKRNNAKTYSPEEEKQRRA-VWEENVKM 58

Query:    78 IEKANKEGN---RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMT 134
             I+    +       + +  N+F D+T +E R +       S +         + +N+   
Sbjct:    59 IKWHTMQNGLWMNNFTIEMNEFGDMTGEEMRMM-----TDSSALTLRNGKHIQKRNVK-- 111

Query:   135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
              +P +LDWRD G V P+++Q  CG CWAF+  A++E     ++G LI LS Q L+DC+ T
Sbjct:   112 -IPKTLDWRDTGCVAPVRSQGGCGACWAFSVAASIESQLFKKTGKLIPLSVQNLIDCTVT 170

Query:   194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
              GNN C GG    AF Y+  N G+  E  YPY+A    C    + +  KI+ +  VP  +
Sbjct:   171 YGNNDCSGGKPYTAFQYVKNNGGLEAEATYPYEAKLRHCRYRPERSVVKIARFFVVPR-N 229

Query:   254 EQALLKA-VSMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTT---ED 307
             E+AL++A V+  P+++AI      F+ Y+ GI++   C    LDH + +VG+G      +
Sbjct:   230 EEALMQALVTYGPIAVAIDGSHASFKRYRGGIYHEPKCRRDTLDHGLLLVGYGYEGHESE 289

Query:   308 GANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYPL 347
                YWL+KNS G  WG+ GYMK+ RD+   CGI + + YPL
Sbjct:   290 NRKYWLLKNSHGEQWGERGYMKLPRDQNNYCGIASYAMYPL 330


>TAIR|locus:2120222 [details] [associations]
            symbol:RD19 "RESPONSIVE TO DEHYDRATION 19" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009269 "response to desiccation" evidence=IEP] [GO:0006970
            "response to osmotic stress" evidence=IGI] [GO:0005634 "nucleus"
            evidence=IDA] [GO:0005773 "vacuole" evidence=IDA] [GO:0042742
            "defense response to bacterium" evidence=IMP] [GO:0006096
            "glycolysis" evidence=RCA] [GO:0006833 "water transport"
            evidence=RCA] [GO:0006972 "hyperosmotic response" evidence=RCA]
            [GO:0007030 "Golgi organization" evidence=RCA] [GO:0009266
            "response to temperature stimulus" evidence=RCA] [GO:0009651
            "response to salt stress" evidence=IEP;RCA] [GO:0046686 "response
            to cadmium ion" evidence=RCA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISS] [GO:0009414 "response to
            water deprivation" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005634 GO:GO:0005773 EMBL:CP002687
            GenomeReviews:CT486007_GR GO:GO:0009651 GO:GO:0042742
            eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            ProtClustDB:CLSN2688311 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AL035679 EMBL:AL161594 GO:GO:0004197
            MEROPS:C01.022 EMBL:D13042 EMBL:AY080598 EMBL:AY133844
            IPI:IPI00544363 PIR:JN0718 RefSeq:NP_568052.1 UniGene:At.2850
            UniGene:At.74924 ProteinModelPortal:P43296 SMR:P43296 STRING:P43296
            PaxDb:P43296 PRIDE:P43296 EnsemblPlants:AT4G39090.1 GeneID:830064
            KEGG:ath:AT4G39090 TAIR:At4g39090 InParanoid:P43296 OMA:EDFDWRD
            PhylomeDB:P43296 Genevestigator:P43296 GermOnline:AT4G39090
            Uniprot:P43296
        Length = 368

 Score = 487 (176.5 bits), Expect = 1.8e-46, P = 1.8e-46
 Identities = 109/312 (34%), Positives = 164/312 (52%)

Query:    55 GRSYKDELEKEMRLKIFKENLEYIEKANK-EGNRTYKLGTNQFSDLTNDEFRALY----T 109
             G+ Y    E + R  +FK NL    +  K + + T+  G  QFSDLT  EFR  +    +
Sbjct:    59 GKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATH--GVTQFSDLTRSEFRKKHLGVRS 116

Query:   110 GYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAV 169
             G+K+P  +++           L   ++P   DWRD GAVTP+KNQ  CG CW+F+A  A+
Sbjct:   117 GFKLPKDANKAPI--------LPTENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGAL 168

Query:   170 EGITKIRSGNLIQLSEQQLLDC--------STNGNNGCLGGSREKAFAYIIQNQGIATED 221
             EG   + +G L+ LSEQQL+DC        + + ++GC GG    AF Y ++  G+  E+
Sbjct:   169 EGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEE 228

Query:   222 EYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAI-AAYSTEFQS 279
             +YPY    G TC   +    A +SN+  +   +EQ     V   P+++AI A Y    Q+
Sbjct:   229 DYPYTGKDGKTCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAGY---MQT 285

Query:   280 YKEGIFNG-VCGTQLDHAVTIVGFGTTEDGAN------YWLIKNSWGNTWGDAGYMKIVR 332
             Y  G+    +C  +L+H V +VG+G             YW+IKNSWG TWG+ G+ KI +
Sbjct:   286 YIGGVSCPYICTRRLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGETWGENGFYKICK 345

Query:   333 DEGLCGIGTRSS 344
                +CG+ +  S
Sbjct:   346 GRNICGVDSMVS 357


>FB|FBgn0033874 [details] [associations]
            symbol:CG6347 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE013599 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 HSSP:P53634 EMBL:AY069609
            RefSeq:NP_610906.1 UniGene:Dm.608 SMR:Q7K0S6 MEROPS:C01.A29
            EnsemblMetazoa:FBtr0087637 GeneID:36531 KEGG:dme:Dmel_CG6347
            UCSC:CG6347-RA FlyBase:FBgn0033874 InParanoid:Q7K0S6 OMA:FEYIRDH
            OrthoDB:EOG4FQZ74 GenomeRNAi:36531 NextBio:799046 Uniprot:Q7K0S6
        Length = 352

 Score = 486 (176.1 bits), Expect = 2.3e-46, P = 2.3e-46
 Identities = 113/321 (35%), Positives = 178/321 (55%)

Query:    44 VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLT 100
             V+  + ++ Q G+ Y DE E+  R  IF   +  I  +NK    G   ++LG N  +D+T
Sbjct:    35 VQNFDDFLRQTGKVYSDE-ERVYRESIFAAKMSLITLSNKNADNGVSGFRLGVNTLADMT 93

Query:   101 NDEFRALYTGYKMPSPSHRXXXXXX-F-KYQNLSMTDVPTSLDWRDKGAVTPIKNQKE-C 157
               E   L  G K+     R       F   +N +  ++P   DWR+KG VTP   Q   C
Sbjct:    94 RKEIATLL-GSKISEFGERYTNGHINFVTARNPASANLPEMFDWREKGGVTPPGFQGVGC 152

Query:   158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQG 216
             G CW+FA   A+EG    R+G L  LS+Q L+DC+ + GN GC GG +E  F YI ++ G
Sbjct:   153 GACWSFATTGALEGHLFRRTGVLASLSQQNLVDCADDYGNMGCDGGFQEYGFEYI-RDHG 211

Query:   217 IATEDEYPYQAVPGTC----SAAQKP--AAAKISNYEEVPSGDEQALLKAVS-MQPVSIA 269
             +   ++YPY      C    +A + P  +  KI +Y  +  GDE+ + + ++ + P++ +
Sbjct:   212 VTLANKYPYTQTEMQCRQNETAGRPPRESLVKIRDYATITPGDEEKMKEVIATLGPLACS 271

Query:   270 IAAYSTEFQSYKEGIFNGV-CGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
             + A +  F+ Y  GI+    C   +L+H+VT+VG+GT E+G +YW+IKNS+   WG+ G+
Sbjct:   272 MNADTISFEQYSGGIYEDEECNQGELNHSVTVVGYGT-ENGRDYWIIKNSYSQNWGEGGF 330

Query:   328 MKIVRDEG-LCGIGTRSSYPL 347
             M+I+R+ G  CGI +  SYP+
Sbjct:   331 MRILRNAGGFCGIASECSYPI 351


>ZFIN|ZDB-GENE-050417-107 [details] [associations]
            symbol:zgc:110239 "zgc:110239" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-050417-107
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 MEROPS:I29.003 OrthoDB:EOG412M56 EMBL:BC092817
            IPI:IPI00503987 RefSeq:NP_001017633.1 UniGene:Dr.39081
            ProteinModelPortal:Q568K7 GeneID:550326 KEGG:dre:550326
            HOGENOM:HOG000007373 HOVERGEN:HBG105018 InParanoid:Q568K7
            NextBio:20879584 ArrayExpress:Q568K7 Uniprot:Q568K7
        Length = 546

 Score = 486 (176.1 bits), Expect = 2.3e-46, P = 2.3e-46
 Identities = 104/299 (34%), Positives = 162/299 (54%)

Query:    56 RSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS 115
             R Y +E+E E R   F  N+ Y+   N+ G  ++ L  N  +D +  E  ++  G +   
Sbjct:   252 RQYDNEMEHEEREHNFVHNIRYVHSMNRAG-LSFSLSVNHLADRSQKEL-SMMRGCQRTH 309

Query:   116 PSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKI 175
               HR       + ++++    P S+DWR  GAVTP+K+Q  CG CW+FA    +EG   +
Sbjct:   310 KVHRKAQPFPSEIRSIA---TPNSVDWRLYGAVTPVKDQAVCGSCWSFATTGTLEGALFL 366

Query:   176 RSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY-PYQAVPGTCS 233
             ++G L  LS+Q L+DC+   GNNGC GG   +AF +I+++ GI+T + Y  Y  + G C 
Sbjct:   367 KTGQLTSLSQQMLVDCTWGFGNNGCDGGEEWRAFEWIMKHGGISTAESYGAYMGMNGLCH 426

Query:   234 AAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF------N 286
               +    A+++ Y  V SGD  AL  A+    PV+++I A    F  Y  G++      N
Sbjct:   427 YDKSSMVAQLTGYTNVTSGDILALKAAIFKFGPVAVSIDAAHRSFAFYSNGVYYEPECKN 486

Query:   287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSY 345
             G+    LDHAV  VG+G   +  +YWL+KNSW + WG+ GY+ +   +  CG+ T + Y
Sbjct:   487 GI--NDLDHAVLAVGYGIMNN-ESYWLVKNSWSSYWGNDGYILMSMKDNNCGVATDAIY 542


>UNIPROTKB|F1NT07 [details] [associations]
            symbol:LOC100857883 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 EMBL:AADN02044012
            EMBL:AADN02044013 EMBL:AADN02044014 IPI:IPI00577314
            Ensembl:ENSGALT00000000192 OMA:IYKHGPV Uniprot:F1NT07
        Length = 317

 Score = 482 (174.7 bits), Expect = 6.2e-46, P = 6.2e-46
 Identities = 112/302 (37%), Positives = 158/302 (52%)

Query:    55 GRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP 114
             GR Y    E E R +IF  ++ ++   N+    +Y L  N  +D T  E  AL    +  
Sbjct:    20 GRPYGSAREMEHRQRIFAHHMRFVHSKNRAA-LSYSLALNHLADRTPQEMAALRGRRRSG 78

Query:   115 SPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITK 174
              P+H       F  ++ +   +P SLDWR  GAVTP+K+Q  CG CW+FA   A+EG   
Sbjct:    79 DPNH----GLPFPAEHYTGIILPESLDWRMYGAVTPVKDQAVCGSCWSFATTGAMEGALF 134

Query:   175 IRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIA-TED--EYPYQAVPG 230
             +++G L  LS+Q L+DCS   GN  C GG   +A  +I ++ GIA TE    +P     G
Sbjct:   135 LKTGVLTPLSQQVLIDCSWGKGNYACDGGEEWRAKGWIKKHGGIASTESPPSFPLVLQNG 194

Query:   231 TCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGIF-NGV 288
              C   Q    AKI+ Y  V SG+  A+  A+    PV+++I A    F  Y  GI+    
Sbjct:   195 LCHYNQSEMLAKITGYVNVTSGNITAVKTAIYKHGPVAVSIDASHKTFSFYSNGIYYEPK 254

Query:   289 CGT---QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSY 345
             C     QLDHAV  VG+G  + G  YWLIKNSW   WG+ GY+ +   +  CG+ T ++Y
Sbjct:   255 CANKPGQLDHAVLAVGYGVLQ-GETYWLIKNSWSTYWGNDGYILMAMKDNNCGVATEATY 313

Query:   346 PL 347
             P+
Sbjct:   314 PI 315


>UNIPROTKB|Q9UBX1 [details] [associations]
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=TAS] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_6900 GO:GO:0019886 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0043202
            GO:GO:0004197 HOVERGEN:HBG011513 EMBL:AJ007331 EMBL:AF088886
            EMBL:AF132894 EMBL:AF136279 EMBL:AF071748 EMBL:AF071749
            EMBL:AK313657 EMBL:BC011682 EMBL:BC036451 EMBL:AL137742
            IPI:IPI00002816 RefSeq:NP_003784.2 UniGene:Hs.11590 PDB:1D5U
            PDB:1M6D PDBsum:1D5U PDBsum:1M6D ProteinModelPortal:Q9UBX1
            SMR:Q9UBX1 STRING:Q9UBX1 MEROPS:C01.018 PhosphoSite:Q9UBX1
            DMDM:12643325 PaxDb:Q9UBX1 PeptideAtlas:Q9UBX1 PRIDE:Q9UBX1
            DNASU:8722 Ensembl:ENST00000310325 GeneID:8722 KEGG:hsa:8722
            UCSC:uc001oip.3 CTD:8722 GeneCards:GC11M066332 HGNC:HGNC:2531
            HPA:CAB002141 MIM:603539 neXtProt:NX_Q9UBX1 PharmGKB:PA27031
            InParanoid:Q9UBX1 OMA:LAPPEWD OrthoDB:EOG4CC41T PhylomeDB:Q9UBX1
            BindingDB:Q9UBX1 ChEMBL:CHEMBL2517 ChiTaRS:CTSF
            EvolutionaryTrace:Q9UBX1 GenomeRNAi:8722 NextBio:32715
            ArrayExpress:Q9UBX1 Bgee:Q9UBX1 CleanEx:HS_CTSF
            Genevestigator:Q9UBX1 GermOnline:ENSG00000174080 Uniprot:Q9UBX1
        Length = 484

 Score = 482 (174.7 bits), Expect = 6.2e-46, P = 6.2e-46
 Identities = 111/308 (36%), Positives = 165/308 (53%)

Query:    43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
             +  I + ++  + R+Y+ + E   RL +F  N+   +K       T + G  +FSDLT +
Sbjct:   183 MASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEE 242

Query:   103 EFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDV-PTSLDWRDKGAVTPIKNQKECGCCW 161
             EFR +Y    +     R       K Q  S+ D+ P   DWR KGAVT +K+Q  CG CW
Sbjct:   243 EFRTIYLNTLL-----RKEPGNKMK-QAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCW 296

Query:   162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
             AF+    VEG   +  G L+ LSEQ+LLDC    +  C+GG    A++ I    G+ TED
Sbjct:   297 AFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKM-DKACMGGLPSNAYSAIKNLGGLETED 355

Query:   222 EYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSY 280
             +Y YQ    +C+ + + A   I++  E+ S +EQ L   ++ + P+S+AI A+  +F  Y
Sbjct:   356 DYSYQGHMQSCNFSAEKAKVYINDSVEL-SQNEQKLAAWLAKRGPISVAINAFGMQF--Y 412

Query:   281 KEGI---FNGVCGTQL-DHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGL 336
             + GI      +C   L DHAV +VG+G   D   +W IKNSWG  WG+ GY  + R  G 
Sbjct:   413 RHGISRPLRPLCSPWLIDHAVLLVGYGNRSD-VPFWAIKNSWGTDWGEKGYYYLHRGSGA 471

Query:   337 CGIGTRSS 344
             CG+ T +S
Sbjct:   472 CGVNTMAS 479


>MGI|MGI:1860262 [details] [associations]
            symbol:Cts7 "cathepsin 7" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005576
            "extracellular region" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005768 "endosome" evidence=IEA]
            [GO:0005794 "Golgi apparatus" evidence=IEA] [GO:0006508
            "proteolysis" evidence=ISA] [GO:0007049 "cell cycle" evidence=IEA]
            [GO:0007067 "mitosis" evidence=IEA] [GO:0008152 "metabolic process"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=ISA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0051301 "cell
            division" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:1860262 GO:GO:0005634 GO:GO:0005794 GO:GO:0048471
            GO:GO:0005615 GO:GO:0051301 GO:GO:0007067 GO:GO:0005768
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GO:GO:0008233 EMBL:CH466546
            EMBL:AY014779 EMBL:CT030645 EMBL:BC064740 EMBL:AF250837
            IPI:IPI00131132 RefSeq:NP_062412.1 UniGene:Mm.3692 HSSP:O60911
            ProteinModelPortal:Q91ZF2 SMR:Q91ZF2 STRING:Q91ZF2 MEROPS:C01.016
            PRIDE:Q91ZF2 Ensembl:ENSMUST00000021892 GeneID:56092 KEGG:mmu:56092
            UCSC:uc007qwi.1 CTD:56092 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 InParanoid:Q91ZF2 OMA:ERRVIWE OrthoDB:EOG44QT2S
            NextBio:311908 Bgee:Q91ZF2 Genevestigator:Q91ZF2 Uniprot:Q91ZF2
        Length = 331

 Score = 482 (174.7 bits), Expect = 6.2e-46, P = 6.2e-46
 Identities = 108/310 (34%), Positives = 168/310 (54%)

Query:    48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN---RTYKLGTNQFSDLTNDEF 104
             E+W   + R+Y  E EK+ R  +++ N+++I++   E       + +  N+F D+T +E 
Sbjct:    30 EEWKRSNDRTYSPEEEKQRRA-VWEGNVKWIKQHIMENGLWMNNFTIEMNEFGDMTGEEM 88

Query:   105 RALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
             + L       S S+        + +N     +P +LDWR +G VTP++ Q  CG CWAF+
Sbjct:    89 KML-----TESSSYPLRNGKHIQKRN---PKIPPTLDWRKEGYVTPVRRQGSCGACWAFS 140

Query:   165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
               A +EG    ++G LI LS Q L+DCS + G  GC GG    AF Y+  N G+  E  Y
Sbjct:   141 VTACIEGQLFKKTGKLIPLSVQNLMDCSVSYGTKGCDGGRPYDAFQYVKNNGGLEAEATY 200

Query:   224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKA-VSMQPVSIAIAAYSTEFQSYKE 282
             PY+A    C    + +  K++ +  VP  +E+ALL+A V+  P+++AI      F SY+ 
Sbjct:   201 PYEAKAKHCRYRPERSVVKVNRFFVVPR-NEEALLQALVTHGPIAVAIDGSHASFHSYRG 259

Query:   283 GIFNGV-CGTQ-LDHAVTIVGFGTT---EDGANYWLIKNSWGNTWGDAGYMKIVRDEG-L 336
             GI++   C    LDH + +VG+G      +   YWL+KNS G  WG+ GYMK+ R +   
Sbjct:   260 GIYHEPKCRKDTLDHGLLLVGYGYEGHESENRKYWLLKNSHGERWGENGYMKLPRGQNNY 319

Query:   337 CGIGTRSSYP 346
             CGI + + YP
Sbjct:   320 CGIASYAMYP 329


>UNIPROTKB|Q0VCU3 [details] [associations]
            symbol:CTSF "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            HOVERGEN:HBG011513 MEROPS:C01.018 CTD:8722 OMA:LAPPEWD
            OrthoDB:EOG4CC41T GeneTree:ENSGT00660000095458 EMBL:DAAA02063594
            EMBL:BC120003 IPI:IPI00717812 RefSeq:NP_001068884.1 UniGene:Bt.7264
            SMR:Q0VCU3 Ensembl:ENSBTAT00000014587 GeneID:509715 KEGG:bta:509715
            InParanoid:Q0VCU3 NextBio:20869091 Uniprot:Q0VCU3
        Length = 460

 Score = 477 (173.0 bits), Expect = 2.1e-45, P = 2.1e-45
 Identities = 111/308 (36%), Positives = 161/308 (52%)

Query:    43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
             +  I + ++  + R+Y  + E   R+ +F  N+   +K       T + G  +FSDLT +
Sbjct:   159 MASIFKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTARYGVTKFSDLTEE 218

Query:   103 EFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPT-SLDWRDKGAVTPIKNQKECGCCW 161
             EFR +Y    +     R             +TDVP    DWR+KGAVT +K+Q  CG CW
Sbjct:   219 EFRTIYLNPLLKDAPGRNMRPAQ------PVTDVPPPQWDWRNKGAVTNVKDQGMCGSCW 272

Query:   162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
             AF+    VEG   ++ G L+ LSEQ+LLDC    +  CLGG    A++ I    G+ TED
Sbjct:   273 AFSVTGNVEGQWFLKRGTLLSLSEQELLDCDKT-DKACLGGLPSNAYSAIRTLGGLETED 331

Query:   222 EYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSY 280
             +Y Y+    TCS + + A   I++  E+ S +EQ L   ++   PVSIAI A+  +F  Y
Sbjct:   332 DYSYRGRLQTCSFSAEKAKVYINDSVEL-SKNEQKLAAWLAKNGPVSIAINAFGMQF--Y 388

Query:   281 KEGI---FNGVCGTQL-DHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGL 336
             + GI      +C   L DHAV +VG+G       +W IKNSWG  WG+ GY  + R  G 
Sbjct:   389 RHGISHPLRPLCSPWLIDHAVLLVGYGN-RSAIPFWAIKNSWGTDWGEEGYYYLHRGSGA 447

Query:   337 CGIGTRSS 344
             CG+   +S
Sbjct:   448 CGVNIMAS 455


>UNIPROTKB|F1P3U9 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005829 "cytosol" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=IEA] [GO:0010628 "positive regulation of gene expression"
            evidence=IEA] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=IEA] [GO:0010813 "neuropeptide catabolic
            process" evidence=IEA] [GO:0010815 "bradykinin catabolic process"
            evidence=IEA] [GO:0016505 "apoptotic protease activator activity"
            evidence=IEA] [GO:0030108 "HLA-A specific activating MHC class I
            receptor activity" evidence=IEA] [GO:0031638 "zymogen activation"
            evidence=IEA] [GO:0031648 "protein destabilization" evidence=IEA]
            [GO:0032526 "response to retinoic acid" evidence=IEA] [GO:0033619
            "membrane protein proteolysis" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0043129
            "surfactant homeostasis" evidence=IEA] [GO:0045766 "positive
            regulation of angiogenesis" evidence=IEA] [GO:0060448 "dichotomous
            subdivision of terminal units involved in lung branching"
            evidence=IEA] [GO:0070324 "thyroid hormone binding" evidence=IEA]
            [GO:0070371 "ERK1 and ERK2 cascade" evidence=IEA] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA]
            [GO:0097208 "alveolar lamellar body" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066
            GO:GO:0005615 GO:GO:0008284 GO:GO:0070371 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0032526 GO:GO:0010628
            GO:GO:0070324 GO:GO:0016505 GO:GO:0010634 GO:GO:0004197
            GO:GO:0042599 GO:GO:0031648 GO:GO:0097067 GO:GO:0031638
            GO:GO:0001913 GeneTree:ENSGT00660000095458 OMA:STSCHKT
            GO:GO:0030108 GO:GO:0010815 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 EMBL:AADN02038832 EMBL:AADN02038831 IPI:IPI00594147
            Ensembl:ENSGALT00000013440 Uniprot:F1P3U9
        Length = 261

 Score = 476 (172.6 bits), Expect = 2.7e-45, P = 2.7e-45
 Identities = 103/266 (38%), Positives = 146/266 (54%)

Query:    89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGA- 147
             + +  NQFSD+T  EF+ LY       P +       F   +      P ++DWR KG  
Sbjct:     1 FLVALNQFSDMTFAEFKKLYL---WSEPQNCSATRGNFLRSD---GPCPEAVDWRKKGNF 54

Query:   148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREK 206
             VTP+KNQ  CG CW F+    +E    I +G L+ L+EQ L+DC+   NN GC GG   +
Sbjct:    55 VTPVKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQLLVDCAQAFNNHGCSGGLPSQ 114

Query:   207 AFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-P 265
             AF YI+ N+G+  ED YPY+A  GTC      A A + +   +   DE  +++AV    P
Sbjct:   115 AFEYILYNKGLMGEDAYPYRAQNGTCKFQPDKAIAFVKDVINITQYDEAGMVEAVGKHNP 174

Query:   266 VSIAIAAYSTEFQSYKEGIF-NGVCG---TQLDHAVTIVGFGTTEDGANYWLIKNSWGNT 321
             VS A    S +F  Y++G++ N  C     +++HAV  VG+G  EDG  YW++KNSWG  
Sbjct:   175 VSFAFEVTS-DFMHYRKGVYSNPRCEHTPDKVNHAVLAVGYGE-EDGRPYWIVKNSWGPL 232

Query:   322 WGDAGYMKIVRDEGLCGIGTRSSYPL 347
             WG  GY  I R + +CG+   +SYP+
Sbjct:   233 WGMDGYFLIERGKNMCGLAACASYPV 258


>TAIR|locus:2130180 [details] [associations]
            symbol:AT4G16190 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0042744 "hydrogen peroxide catabolic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005773
            EMBL:CP002687 HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 EMBL:Z97340 EMBL:AL161543 UniGene:At.25555
            EMBL:AY039556 EMBL:AY129473 EMBL:AY136316 EMBL:BT000733
            EMBL:AK226366 IPI:IPI00543588 PIR:D71428 RefSeq:NP_567489.1
            HSSP:P25779 ProteinModelPortal:Q9SUL1 SMR:Q9SUL1 STRING:Q9SUL1
            MEROPS:C01.A06 PRIDE:Q9SUL1 EnsemblPlants:AT4G16190.1 GeneID:827311
            KEGG:ath:AT4G16190 TAIR:At4g16190 InParanoid:Q9SUL1 OMA:NACGINK
            PhylomeDB:Q9SUL1 ProtClustDB:CLSN2917559 Genevestigator:Q9SUL1
            Uniprot:Q9SUL1
        Length = 373

 Score = 474 (171.9 bits), Expect = 4.4e-45, P = 4.4e-45
 Identities = 117/332 (35%), Positives = 171/332 (51%)

Query:    31 QVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
             QVV   +  +    E H   + +++ ++Y  ++E + R ++FK NL    + N+  + + 
Sbjct:    38 QVVPEENDEQLLNAEHHFTLFKSKYEKTYATQVEHDHRFRVFKANLRRARR-NQLLDPSA 96

Query:    90 KLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVT 149
               G  QFSDLT  EFR  + G K      R           L  +D+PT  DWR++GAVT
Sbjct:    97 VHGVTQFSDLTPKEFRRKFLGLKRRG--FRLPTDTQTA-PILPTSDLPTEFDWREQGAVT 153

Query:   150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-------TNG-NNGCLG 201
             P+KNQ  CG CW+F+A+ A+EG   + +  L+ LSEQQL+DC         N  ++GC G
Sbjct:   154 PVKNQGMCGSCWSFSAIGALEGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSG 213

Query:   202 GSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVPSGDEQALLKA 260
             G    AF Y ++  G+  E++YPY     T C   +    A +SN+  V S ++Q     
Sbjct:   214 GLMNNAFEYALKAGGLMKEEDYPYTGRDHTACKFDKSKIVASVSNFSVVSSDEDQIAANL 273

Query:   261 VSMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQLDHAVTIVGFGTTEDGA------NYWL 313
             V   P++IAI A     Q+Y  G+    VC    DH V +VGFG++           YW+
Sbjct:   274 VQHGPLAIAINAMW--MQTYIGGVSCPYVCSKSQDHGVLLVGFGSSGYAPIRLKEKPYWI 331

Query:   314 IKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSS 344
             IKNSWG  WG+ GY KI R    +CG+ T  S
Sbjct:   332 IKNSWGAMWGEHGYYKICRGPHNMCGMDTMVS 363


>RGD|1308181 [details] [associations]
            symbol:Ctsf "cathepsin F" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1308181 eggNOG:COG4870 HOGENOM:HOG000230774
            KO:K01373 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 MEROPS:C01.018
            CTD:8722 OMA:LAPPEWD OrthoDB:EOG4CC41T GeneTree:ENSGT00660000095458
            EMBL:CH473953 EMBL:BC099780 EMBL:EU253481 IPI:IPI00201100
            RefSeq:NP_001029282.1 UniGene:Rn.25087 SMR:Q499S6
            Ensembl:ENSRNOT00000026718 GeneID:361704 KEGG:rno:361704
            UCSC:RGD:1308181 InParanoid:Q499S6 NextBio:677325
            Genevestigator:Q499S6 Uniprot:Q499S6
        Length = 462

 Score = 472 (171.2 bits), Expect = 7.1e-45, P = 7.1e-45
 Identities = 107/305 (35%), Positives = 159/305 (52%)

Query:    46 IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
             + + +M  + R+Y+   E + RL +F  N+   +K       T + G  +FSDLT +EF 
Sbjct:   164 LFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFH 223

Query:   106 ALYTGYKMPSPSHRXXXXXXFKYQNLSMTDV-PTSLDWRDKGAVTPIKNQKECGCCWAFA 164
              +Y    +   S              S+ D+ P   DWR KGAVT +K+Q  CG CWAF+
Sbjct:   224 TIYLNPLLQKESGGKMSLAK------SINDLAPPEWDWRKKGAVTEVKDQGMCGSCWAFS 277

Query:   165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYP 224
                 VEG   +  G L+ LSEQ+LLDC    +  C+GG    A+  I    G+ TED+Y 
Sbjct:   278 VTGNVEGQWFLNRGTLLSLSEQELLDCDKM-DKACMGGLPSNAYTAIKNLGGLETEDDYG 336

Query:   225 YQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEG 283
             YQ     C+ + + A   I++  E+ S DE  +   ++ + P+S+AI A+  +F  Y+ G
Sbjct:   337 YQGHVQACNFSTQMAKVYINDSVEL-SRDENKIAAWLAQKGPISVAINAFGMQF--YRHG 393

Query:   284 I---FNGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             I   F  +C    +DHAV +VG+G   +   YW IKNSWG  WG+ GY  + R  G CG+
Sbjct:   394 IAHPFRPLCSPWFIDHAVLLVGYGNRSN-IPYWAIKNSWGRDWGEEGYYYLYRGSGACGV 452

Query:   340 GTRSS 344
              T +S
Sbjct:   453 NTMAS 457


>WB|WBGene00019986 [details] [associations]
            symbol:R09F10.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            EMBL:FO081137 HSSP:P53634 PIR:D89588 RefSeq:NP_509408.1
            ProteinModelPortal:Q23030 SMR:Q23030 STRING:Q23030 MEROPS:C01.A44
            PaxDb:Q23030 EnsemblMetazoa:R09F10.1 GeneID:181087
            KEGG:cel:CELE_R09F10.1 UCSC:R09F10.1 CTD:181087 WormBase:R09F10.1
            InParanoid:Q23030 OMA:EYPYSAL NextBio:912346 Uniprot:Q23030
        Length = 383

 Score = 472 (171.2 bits), Expect = 7.1e-45, P = 7.1e-45
 Identities = 114/343 (33%), Positives = 178/343 (51%)

Query:    20 IIITLLVSCASQVVSSRSTHEQSVVEIHEK----WMAQHGRSYKDELEKEMRLKIFKENL 75
             +++T+L+   S  V  R  H+   ++ HE+    ++ +  R Y    E E R +IF  N+
Sbjct:    53 VLLTMLI-LLSFFVFQRLNHKMENLK-HEQMFNDFILKFDRKYTSVEEFEYRYQIFLRNV 110

Query:    76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL-----YTGYKMPSPSHRXXXXXXFKYQN 130
                E A +E N    L  N+F+D T++E + +     YT Y   +P           Y  
Sbjct:   111 IEFE-AEEERNLGLDLDVNEFTDWTDEELQKMVQENKYTKYDFDTPKFEGS------YLE 163

Query:   131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
               +   P S+DWR++G +TPIKNQ +CG CWAFA VA+VE    I+ G L+ LSEQ+++D
Sbjct:   164 TGVIR-PASIDWREQGKLTPIKNQGQCGSCWAFATVASVEAQNAIKKGKLVSLSEQEMVD 222

Query:   191 CSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVP-GTCSAAQKPAAAKISNYEEV 249
             C    NNGC GG R  A  ++ +N G+ +E EYPY A+    C   +      I ++  +
Sbjct:   223 CDGR-NNGCSGGYRPYAMKFVKEN-GLESEKEYPYSALKHDQCFLKENDTRVFIDDFRML 280

Query:   250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN-GV--CGTQL--DHAVTIVGFGT 304
              + +E       +  PV+  +      + SY+ GIFN  V  C  +    HA+TI+G+G 
Sbjct:   281 SNNEEDIANWVGTKGPVTFGMNVVKAMY-SYRSGIFNPSVEDCTEKSMGAHALTIIGYGG 339

Query:   305 TEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
               + A YW++KNSWG +WG +GY ++ R    CG+      P+
Sbjct:   340 EGESA-YWIVKNSWGTSWGASGYFRLARGVNSCGLANTVVAPI 381


>UNIPROTKB|F1RU48 [details] [associations]
            symbol:CTSF "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            OMA:LAPPEWD GeneTree:ENSGT00660000095458 EMBL:CU928034
            EMBL:FP565364 Ensembl:ENSSSCT00000014140 Ensembl:ENSSSCT00000014154
            Uniprot:F1RU48
        Length = 460

 Score = 468 (169.8 bits), Expect = 1.9e-44, P = 1.9e-44
 Identities = 102/306 (33%), Positives = 157/306 (51%)

Query:    43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
             +  I ++++  + R+Y  + E   R+ +F  N+   +K       T + G  +FSDLT +
Sbjct:   159 MASIFKEFVTTYNRTYDTKEEARWRMSVFANNMVRAQKIQALDTGTARYGVTKFSDLTEE 218

Query:   103 EFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
             EFR +Y    +     R         +++S    P   DWR KGAVT +K+Q  CG CWA
Sbjct:   219 EFRTIYLNPLLQEEPGRKMRLA----KSVSSLP-PPEWDWRKKGAVTKVKDQGMCGSCWA 273

Query:   163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
             F+    VEG   ++ G L+ LSEQ+LLDC    + GC+GG    A++ I    G+ TE++
Sbjct:   274 FSVTGNVEGQWFLKQGTLLSLSEQELLDCD-KVDKGCMGGLPSNAYSAIKTLGGLETEED 332

Query:   223 YPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
             Y Y+    TCS   + A   I++  E+   +++         P+S+AI A+  +F  Y+ 
Sbjct:   333 YSYRGHLQTCSFNAEKAKVYINDSVELSQNEQKLAAWLAEKGPISVAINAFGMQF--YRH 390

Query:   283 GI---FNGVCGTQL-DHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCG 338
             GI      +C   L DHAV +VG+G       +W IKNSWG  WG+ GY  + R  G CG
Sbjct:   391 GISHPLRPLCSPWLIDHAVLLVGYGN-RSATPFWAIKNSWGTDWGEEGYYYLYRGSGACG 449

Query:   339 IGTRSS 344
             +   +S
Sbjct:   450 VNIMAS 455


>ZFIN|ZDB-GENE-030131-9831 [details] [associations]
            symbol:ctsf "cathepsin F" species:7955 "Danio
            rerio" [GO:0004869 "cysteine-type endopeptidase inhibitor activity"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000010 InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00031 Pfam:PF00112 PRINTS:PR00705 SMART:SM00043
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-030131-9831
            GO:GO:0004869 eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 HOVERGEN:HBG011513 CTD:8722 OrthoDB:EOG4CC41T
            MEROPS:I25.006 EMBL:BC124243 IPI:IPI00503226 RefSeq:NP_001071036.1
            UniGene:Dr.81265 ProteinModelPortal:Q08CH0 SMR:Q08CH0 GeneID:565588
            KEGG:dre:565588 InParanoid:Q08CH0 NextBio:20885952
            ArrayExpress:Q08CH0 Uniprot:Q08CH0
        Length = 473

 Score = 468 (169.8 bits), Expect = 1.9e-44, P = 1.9e-44
 Identities = 105/303 (34%), Positives = 166/303 (54%)

Query:    43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE--KANKEGNRTYKLGTNQFSDLT 100
             ++ + + +M  + R+Y  + E E RL+IF++N++  +  ++ ++G+  Y  G  +FSDLT
Sbjct:   171 LLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEY--GITKFSDLT 228

Query:   101 NDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
              DEFR +Y     P  S +       K    +    P + DWRD GAV+P+KNQ  CG C
Sbjct:   229 EDEFRMMYLN---PMLS-QWSLKKEMKPAIPASAPAPDTWDWRDHGAVSPVKNQGMCGSC 284

Query:   161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
             WAF+    +EG    ++G L+ LSEQ+L+DC    +  C GG    A+  I    G+ TE
Sbjct:   285 WAFSVTGNIEGQWFKKTGQLLSLSEQELVDCDKL-DQACGGGLPSNAYEAIENLGGLETE 343

Query:   221 DEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
              +Y Y     +C  +    AA I++  E+P  +++         PVS A+ A++ +F  Y
Sbjct:   344 TDYSYTGHKQSCDFSTGKVAAYINSSVELPKDEKEIAAFLAENGPVSAALNAFAMQF--Y 401

Query:   281 KEGIFNGV---CGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGL 336
             ++G+ + +   C    +DHAV +VGFG   +G  +W IKNSWG  +G+ GY  + R  GL
Sbjct:   402 RKGVSHPLKIFCNPWMIDHAVLLVGFGQ-RNGVPFWAIKNSWGEDYGEQGYYYLYRGSGL 460

Query:   337 CGI 339
             CGI
Sbjct:   461 CGI 463


>TAIR|locus:2050145 [details] [associations]
            symbol:AT2G21430 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002685 GenomeReviews:CT485783_GR
            EMBL:AC006841 EMBL:X74359 IPI:IPI00519637 PIR:B84601
            RefSeq:NP_565512.1 UniGene:At.14069 ProteinModelPortal:P43295
            SMR:P43295 MEROPS:C01.A04 PRIDE:P43295 EnsemblPlants:AT2G21430.1
            GeneID:816682 KEGG:ath:AT2G21430 TAIR:At2g21430 eggNOG:COG4870
            HOGENOM:HOG000230774 InParanoid:P43295 KO:K01373 OMA:GSIEEHY
            PhylomeDB:P43295 ProtClustDB:CLSN2688311 Genevestigator:P43295
            GermOnline:AT2G21430 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 Uniprot:P43295
        Length = 361

 Score = 466 (169.1 bits), Expect = 3.1e-44, P = 3.1e-44
 Identities = 114/349 (32%), Positives = 179/349 (51%)

Query:    18 MFIIITLLVSCASQVVSSRSTHEQSVVEI-----H-EKWMAQHGRSYKDELEKEMRLKIF 71
             +F+ +++ V C  + V  R   +++  ++     H   +  + G+ Y    E   R  +F
Sbjct:    14 IFVFVSVSV-CGDEDVLIRQVVDETEPKVLSSEDHFTLFKKKFGKVYGSIEEHYYRFSVF 72

Query:    72 KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG----YKMPSPSHRXXXXXXFK 127
             K NL    +  K  + + + G  QFSDLT  EFR  + G    +K+P  +++        
Sbjct:    73 KANLLRAMRHQKM-DPSARHGVTQFSDLTRSEFRRKHLGVKGGFKLPKDANQAPILPT-- 129

Query:   128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
              QNL     P   DWRD+GAVTP+KNQ  CG CW+F+   A+EG   + +G L+ LSEQQ
Sbjct:   130 -QNL-----PEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQQ 183

Query:   188 LLDCS------TNGN--NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKP 238
             L+DC         G+  +GC GG    AF Y ++  G+  E +YPY    G +C   +  
Sbjct:   184 LVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMREKDYPYTGTDGGSCKLDRSK 243

Query:   239 AAAKISNYEEVPSGDEQALLKAVSMQPVSIAI-AAYSTEFQSYKEGIFNG-VCGTQLDHA 296
               A +SN+  V   ++Q     +   P+++AI AAY    Q+Y  G+    +C  +L+H 
Sbjct:   244 IVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAY---MQTYIGGVSCPYICSRRLNHG 300

Query:   297 VTIVGFGTTE-DGAN-----YWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             V +VG+G+     A      YW+IKNSWG +WG+ G+ KI +   +CG+
Sbjct:   301 VLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICGV 349


>MGI|MGI:1338045 [details] [associations]
            symbol:Ctsw "cathepsin W" species:10090 "Mus musculus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 MGI:MGI:1338045 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 MEROPS:C01.037
            CTD:1521 eggNOG:NOG288820 HOVERGEN:HBG100117 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 EMBL:AF014941 EMBL:AC122861 IPI:IPI00111727
            RefSeq:NP_034115.2 UniGene:Mm.113590 ProteinModelPortal:P56203
            SMR:P56203 PhosphoSite:P56203 PRIDE:P56203 DNASU:13041
            Ensembl:ENSMUST00000025844 GeneID:13041 KEGG:mmu:13041
            InParanoid:P56203 NextBio:282936 Bgee:P56203 CleanEx:MM_CTSW
            Genevestigator:P56203 GermOnline:ENSMUSG00000024910 Uniprot:P56203
        Length = 371

 Score = 356 (130.4 bits), Expect = 9.6e-44, Sum P(2) = 9.6e-44
 Identities = 87/272 (31%), Positives = 137/272 (50%)

Query:    45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
             E+ + +  +  RSY +  E   RL IF  NL   ++  +E   T + G   FSDLT +EF
Sbjct:    38 EVFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLGTAEFGETPFSDLTEEEF 97

Query:   105 RALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRD-KGAVTPIKNQKECGCCWAF 163
               LY   + P    R          N     VP + DWR  K  ++ +KNQ  C CCWA 
Sbjct:    98 GQLYGQERSPE---RTPNMTKKVESNTWGESVPRTCDWRKAKNIISSVKNQGSCKCCWAM 154

Query:   164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
             AA   ++ + +I+    + +S Q+LLDC   GN GC GG    A+  ++ N G+A+E +Y
Sbjct:   155 AAADNIQALWRIKHQQFVDVSVQELLDCERCGN-GCNGGFVWDAYLTVLNNSGLASEKDY 213

Query:   224 PYQA--VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSY 280
             P+Q    P  C A +    A I ++  + S +EQA+   +++  P+++ I       Q Y
Sbjct:   214 PFQGDRKPHRCLAKKYKKVAWIQDFTML-SNNEQAIAHYLAVHGPITVTINMKL--LQHY 270

Query:   281 KEGIFNGV---CGT-QLDHAVTIVGFGTTEDG 308
             ++G+       C   Q+DH+V +VGFG  ++G
Sbjct:   271 QKGVIKATPSSCDPRQVDHSVLLVGFGKEKEG 302

 Score = 122 (48.0 bits), Expect = 9.6e-44, Sum P(2) = 9.6e-44
 Identities = 25/71 (35%), Positives = 37/71 (52%)

Query:   292 QLDHAVTIVGFGTTEDGAN----------------YWLIKNSWGNTWGDAGYMKIVRDEG 335
             Q+DH+V +VGFG  ++G                  YW++KNSWG  WG+ GY ++ R   
Sbjct:   286 QVDHSVLLVGFGKEKEGMQTGTVLSHSRKRRHSSPYWILKNSWGAHWGEKGYFRLYRGNN 345

Query:   336 LCGIGTRSSYP 346
              CG+   + YP
Sbjct:   346 TCGV---TKYP 353


>DICTYBASE|DDB_G0272742 [details] [associations]
            symbol:DDB_G0272742 species:44689 "Dictyostelium
            discoideum" [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0272742 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639 EMBL:AAFI02000008
            eggNOG:NOG331187 RefSeq:XP_644986.1 ProteinModelPortal:Q7KWP5
            PRIDE:Q7KWP5 EnsemblProtists:DDB0168242 GeneID:8618663
            KEGG:ddi:DDB_G0272742 InParanoid:Q7KWP5 OMA:ATESAHF Uniprot:Q7KWP5
        Length = 345

 Score = 460 (167.0 bits), Expect = 1.3e-43, P = 1.3e-43
 Identities = 122/346 (35%), Positives = 183/346 (52%)

Query:    20 IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
             +I+ L ++C+     S+ T  Q   E    WM  + R+Y    E   R   FK NL++I 
Sbjct:     7 LILILFINCSF----SKLTEIQYRNEF-TAWMTSNQRTYASS-EFTNRYNTFKSNLDFIN 60

Query:    80 KANKEGNRTYKLGTNQFSDLTNDEFRALYTGY-----KMPSPSHRXXXXXXFKYQNLSMT 134
             + N +G++T  L  N+F+D++N+E+R  Y        K+ S           K  + S +
Sbjct:    61 QWNSKGSKTV-LALNEFADISNEEYRKNYLRNDNNINKLSSLLINDKEDKEIKSSSSSGS 119

Query:   135 DVPTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGN--LIQLSEQQLLDC 191
                + +DWR KGAV  +K+Q   CG  W   AV A E    + +     I LS Q L+DC
Sbjct:   120 G-SSGIDWRKKGAVPSVKSQIGGCGS-WPITAVGATESAHFLANPKDPFISLSMQNLIDC 177

Query:   192 STNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA-VPGTCSAAQKPAAAKISNYEEVP 250
             S N N  C  G+  +AF YII+N GI +E+ Y +    PG C      + AKI++YE+V 
Sbjct:   178 S-NLNKQCYQGTVNEAFQYIIENGGIDSEESYKFSGGEPGKCKYNSSNSVAKITSYEKVK 236

Query:   251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF-NGVCG-TQLDHAVTIVGFG---TT 305
             SG E +L  AVS++PV+  I A  + FQ Y  GI+    C  T L+H++ IVGF    TT
Sbjct:   237 SGSESSLESAVSLKPVAAYIDASLSSFQFYSSGIYYEPSCNSTDLNHSILIVGFSDFSTT 296

Query:   306 -----EDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSY 345
                  +  +NYW+++NS+G  WG+ GY+ + +D +  CGI   +SY
Sbjct:   297 PTDSLKHSSNYWIVQNSFGKNWGENGYIFMSKDRDDNCGISKMASY 342


>WB|WBGene00012747 [details] [associations]
            symbol:Y40H7A.10 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000230773 EMBL:AL033510
            HSSP:P80067 MEROPS:C01.A48 PIR:T26792 RefSeq:NP_502836.1
            ProteinModelPortal:Q9XWA4 SMR:Q9XWA4 STRING:Q9XWA4
            EnsemblMetazoa:Y40H7A.10 GeneID:189809 KEGG:cel:CELE_Y40H7A.10
            UCSC:Y40H7A.10 CTD:189809 WormBase:Y40H7A.10 eggNOG:NOG286423
            InParanoid:Q9XWA4 OMA:NGPMIVC NextBio:943702 Uniprot:Q9XWA4
        Length = 343

 Score = 458 (166.3 bits), Expect = 2.2e-43, P = 2.2e-43
 Identities = 123/352 (34%), Positives = 177/352 (50%)

Query:     1 MVLIFERSGSFKINTTPMFIIITLLVSCASQVVSSRS--THEQSVVEIHEKWMAQHGRSY 58
             + + F   G  K N  P + I  L      Q++      T +       + ++ ++ R Y
Sbjct:     8 LAIFFVHFGCAKPNLLPSYQISDL-----DQILQRHHIPTPDVKYTNAFQNFLVKYLREY 62

Query:    59 KDELEKEMRLKIFKENLEYIEKANKE--GNRTYKLGTNQFSDLTNDEFRALYTGYKM-PS 115
              +E E   R  IF  NL+ +E+ NKE  G  TY+L  N FSDLT +E++     Y M P 
Sbjct:    63 PNEYEIVKRFTIFSRNLDLVERYNKEDAGKVTYEL--NDFSDLTEEEWKK----YLMTPK 116

Query:   116 PSHRXXXXXXFKYQNL-SMTDVPTSLDWRD-KGA--VTPIKNQKECGCCWAFAAVAAVEG 171
             P H        K + L    ++P S+DWR+  G   VT IK Q  CG CWAFA  AA+E 
Sbjct:   117 PDH---SEKSLKPKTLIDKKNLPNSVDWRNVNGTNHVTGIKYQGPCGSCWAFATAAAIES 173

Query:   172 ITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT 231
                I  G L  LS QQLLDC+   +  C GG   +A  Y  Q+ GI T   YPY      
Sbjct:   174 AVSISGGGLQSLSSQQLLDCTVVSDK-CGGGEPVEALKYA-QSHGITTAHNYPYYFWTTK 231

Query:   232 CSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST-EFQSYKEGIFNGV-C 289
             C     P  A+IS++ +  S DE A + A++  P+ I  A ++T + + Y  GI     C
Sbjct:   232 CRETV-PTVARISSWMKAESEDEMAQIVALN-GPM-IVCANFATNKNRFYHSGIAEDPDC 288

Query:   290 GTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGT 341
             GT+  HA+ ++G+G      +YW++KN++   WG+ GYM++ RD   CGI T
Sbjct:   289 GTEPTHALIVIGYGP-----DYWILKNTYSKVWGEKGYMRVKRDVNWCGINT 335


>DICTYBASE|DDB_G0274385 [details] [associations]
            symbol:DDB_G0274385 "Cysteine proteinase 1,
            mitochondrial" species:44689 "Dictyostelium discoideum" [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0274385 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 EMBL:AAFI02000012 RefSeq:XP_644301.1
            ProteinModelPortal:Q86KD4 EnsemblProtists:DDB0167535 GeneID:8619729
            KEGG:ddi:DDB_G0274385 InParanoid:Q86KD4 OMA:SICVDAS Uniprot:Q86KD4
        Length = 358

 Score = 455 (165.2 bits), Expect = 4.5e-43, P = 4.5e-43
 Identities = 114/324 (35%), Positives = 165/324 (50%)

Query:    40 EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
             + S+ +    W  +H + YKD +E E R   FKEN++   + N       K  +N FSDL
Sbjct:    37 DSSMRDTFNHWAKKHSKIYKDSIEMENRFSNFKENMKKNIELNSMHAGKAKFESNGFSDL 96

Query:   100 TNDEFRALYTG--YK----------MPSPSHRXXXXXXFK-YQNLSMTDVPTSLDWRDKG 146
             + +EF   +    +K           P P+        +K  +N  + ++  S+DWR KG
Sbjct:    97 SEEEFSNFHLNKAFKGKPSHLRNSIKPQPTPHHSLINGYKEMENGDLNEL-YSIDWRKKG 155

Query:   147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL-IQLSEQQLLDCSTNGNNGCLGGSRE 205
              VTP+K+Q +CG C+ F+AV  +E    I++GN  I LSEQQ +DC       C GG   
Sbjct:   156 LVTPVKDQGQCGSCYIFSAVEQIE-TAWIKAGNKPILLSEQQAVDCDPYDGQ-CGGGDPY 213

Query:   206 KAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKA-VSMQ 264
               + Y  Q  G++T  +YPY A  GTC    + A   +S +     GDE  L+K  V+  
Sbjct:   214 TVYEYFSQVGGVSTNAQYPYTATDGTCVNMSR-AVPVVSYHYVTQGGDENTLIKTIVNDG 272

Query:   265 PVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE-DGAN---YWLIKNSWGN 320
             PVSI + A ST +QSY  GI    CG  +DH V +VG    + D +N   Y++I+NSWG 
Sbjct:   273 PVSICVDA-ST-WQSYSGGIITTGCGKNIDHCVQVVGLEVDKTDPSNPVQYYIIRNSWGT 330

Query:   321 TWGDAGYMKIVRDEGLCGIGTRSS 344
              WG  GY+ +     LCGI   S+
Sbjct:   331 DWGIDGYIYVATGSDLCGITYEST 354


>GENEDB_PFALCIPARUM|PF14_0553 [details] [associations]
            symbol:PF14_0553 "cysteine proteinase
            falcipain-1" species:5833 "Plasmodium falciparum" [GO:0042540
            "hemoglobin catabolic process" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014187 KO:K01376 HSSP:P07688 GO:GO:0042540
            RefSeq:XP_001348727.1 ProteinModelPortal:Q8I6V0 PRIDE:Q8I6V0
            EnsemblProtists:PF14_0553:mRNA GeneID:812135 KEGG:pfa:PF14_0553
            EuPathDB:PlasmoDB:PF3D7_1458000 HOGENOM:HOG000065906
            ProtClustDB:CLSZ2457715 ChEMBL:CHEMBL1250371 Uniprot:Q8I6V0
        Length = 569

 Score = 292 (107.8 bits), Expect = 5.2e-43, Sum P(3) = 5.2e-43
 Identities = 61/172 (35%), Positives = 100/172 (58%)

Query:   136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
             VP  LD+R+KG V   K+Q  CG CWAFA+V  +E +   ++ N++  SEQ+++DCS + 
Sbjct:   333 VPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD- 391

Query:   196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVPSGDE 254
             N GC GG    +F Y++QN+ +   DEY Y+A     C   +      +S+   V   + 
Sbjct:   392 NFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVK--EN 448

Query:   255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
             Q +L    + P+S+ +   + +F +Y EG++NG C  +L+H+V +VG+G  E
Sbjct:   449 QLILALNEVGPLSVNVGV-NNDFVAYSEGVYNGTCSEELNHSVLLVGYGQVE 499

 Score = 111 (44.1 bits), Expect = 5.2e-43, Sum P(3) = 5.2e-43
 Identities = 19/41 (46%), Positives = 25/41 (60%)

Query:   311 YWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
             YW+IKNSW   WG+ G+M++ R    D   CGIG    YP+
Sbjct:   528 YWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPI 568

 Score = 99 (39.9 bits), Expect = 5.2e-43, Sum P(3) = 5.2e-43
 Identities = 26/72 (36%), Positives = 41/72 (56%)

Query:    49 KWMAQHGRSYKDELEKEMR-LKIFKENLEYIEKANK-EGNRTYKLGTNQFSDLTNDEFRA 106
             K+M +H + YK+ ++++MR  +IFK N   I+  NK   N  YK   NQFSD + +E + 
Sbjct:   227 KFMKEHNKVYKN-IDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKE 285

Query:   107 LYTGYKMPSPSH 118
              Y    +  P+H
Sbjct:   286 -YFKTLLHVPNH 296

 Score = 43 (20.2 bits), Expect = 3.5e-37, Sum P(3) = 3.5e-37
 Identities = 12/46 (26%), Positives = 25/46 (54%)

Query:    59 KDELEK-EMRLKIFKENLEYI--EKANKEGNRTYKLGTNQFSDLTN 101
             K+E+E   + L+ +K+  + I  E +N+E    Y L +  +++  N
Sbjct:    76 KEEIELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNN 121


>UNIPROTKB|Q8I6V0 [details] [associations]
            symbol:PF14_0553 "Cysteine proteinase falcipain-1"
            species:36329 "Plasmodium falciparum 3D7" [GO:0042540 "hemoglobin
            catabolic process" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014187 KO:K01376 HSSP:P07688 GO:GO:0042540
            RefSeq:XP_001348727.1 ProteinModelPortal:Q8I6V0 PRIDE:Q8I6V0
            EnsemblProtists:PF14_0553:mRNA GeneID:812135 KEGG:pfa:PF14_0553
            EuPathDB:PlasmoDB:PF3D7_1458000 HOGENOM:HOG000065906
            ProtClustDB:CLSZ2457715 ChEMBL:CHEMBL1250371 Uniprot:Q8I6V0
        Length = 569

 Score = 292 (107.8 bits), Expect = 5.2e-43, Sum P(3) = 5.2e-43
 Identities = 61/172 (35%), Positives = 100/172 (58%)

Query:   136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
             VP  LD+R+KG V   K+Q  CG CWAFA+V  +E +   ++ N++  SEQ+++DCS + 
Sbjct:   333 VPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD- 391

Query:   196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVPSGDE 254
             N GC GG    +F Y++QN+ +   DEY Y+A     C   +      +S+   V   + 
Sbjct:   392 NFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVK--EN 448

Query:   255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
             Q +L    + P+S+ +   + +F +Y EG++NG C  +L+H+V +VG+G  E
Sbjct:   449 QLILALNEVGPLSVNVGV-NNDFVAYSEGVYNGTCSEELNHSVLLVGYGQVE 499

 Score = 111 (44.1 bits), Expect = 5.2e-43, Sum P(3) = 5.2e-43
 Identities = 19/41 (46%), Positives = 25/41 (60%)

Query:   311 YWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
             YW+IKNSW   WG+ G+M++ R    D   CGIG    YP+
Sbjct:   528 YWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPI 568

 Score = 99 (39.9 bits), Expect = 5.2e-43, Sum P(3) = 5.2e-43
 Identities = 26/72 (36%), Positives = 41/72 (56%)

Query:    49 KWMAQHGRSYKDELEKEMR-LKIFKENLEYIEKANK-EGNRTYKLGTNQFSDLTNDEFRA 106
             K+M +H + YK+ ++++MR  +IFK N   I+  NK   N  YK   NQFSD + +E + 
Sbjct:   227 KFMKEHNKVYKN-IDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKE 285

Query:   107 LYTGYKMPSPSH 118
              Y    +  P+H
Sbjct:   286 -YFKTLLHVPNH 296

 Score = 43 (20.2 bits), Expect = 3.5e-37, Sum P(3) = 3.5e-37
 Identities = 12/46 (26%), Positives = 25/46 (54%)

Query:    59 KDELEK-EMRLKIFKENLEYI--EKANKEGNRTYKLGTNQFSDLTN 101
             K+E+E   + L+ +K+  + I  E +N+E    Y L +  +++  N
Sbjct:    76 KEEIELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNN 121


>DICTYBASE|DDB_G0291191 [details] [associations]
            symbol:DDB_G0291191 "cysteine protease" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0291191
            GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AAFI02000175 MEROPS:C01.022
            ProtClustDB:CLSZ2429603 RefSeq:XP_635374.1
            ProteinModelPortal:Q54F16 PRIDE:Q54F16 EnsemblProtists:DDB0252831
            GeneID:8628022 KEGG:ddi:DDB_G0291191 OMA:NETQIAS Uniprot:Q54F16
        Length = 352

 Score = 453 (164.5 bits), Expect = 7.3e-43, P = 7.3e-43
 Identities = 118/342 (34%), Positives = 168/342 (49%)

Query:    19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
             FI+  +L+  A       S  E   +    K+   +  S ++ L K    K    N++ +
Sbjct:     3 FILFFVLMLTALAAGRRLSVEESQFIAFQNKYNKIY--SAEEYLVKFETFKSNLLNIDAL 60

Query:    79 EK-ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK-------MPSPSHRXXXXXXFKYQN 130
              K A   G+ T K G N+F+DL+ +EF+  Y   K       +P   +            
Sbjct:    61 NKQATTIGSDT-KFGVNKFADLSKEEFKKYYLSSKEARLTDDLPMLPNLSDDIISATPAA 119

Query:   131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
                 +   S  +     VT +KNQ +CG CW+F+    VEG   + +G L+ LSEQ L+D
Sbjct:   120 FDWRNTGGSTKFPQGTPVTAVKNQGQCGSCWSFSTTGNVEGQHYLSTGTLVGLSEQNLVD 179

Query:   191 CS------TNGN--N-GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAA 241
             C        N N  N GC GG +  A+ YII+N GI TE  YPY AV G C        A
Sbjct:   180 CDHTCMTYENENVCNAGCDGGLQPNAYNYIIKNGGIQTEATYPYTAVDGECKFNSAQVGA 239

Query:   242 KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVG 301
             KIS++  VP  + Q      +  P  +AIAA + E+Q Y  G+F+  CG  LDH + IVG
Sbjct:   240 KISSFTMVPQNETQIASYLFNNGP--LAIAADAEEWQFYMGGVFDFPCGQTLDHGILIVG 297

Query:   302 FGTTED--GAN--YWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             +G  +   G N  YW+IKNSWG  WG+AGY+K+ R+   CG+
Sbjct:   298 YGAQDTIVGKNTPYWIIKNSWGADWGEAGYLKVERNTDKCGV 339


>TAIR|locus:2082687 [details] [associations]
            symbol:AT3G54940 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002686 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HSSP:P53634
            OMA:GGGLMTN EMBL:AY070063 IPI:IPI00528988 RefSeq:NP_567010.5
            UniGene:At.28412 ProteinModelPortal:Q8VYS0 SMR:Q8VYS0 PRIDE:Q8VYS0
            EnsemblPlants:AT3G54940.2 GeneID:824659 KEGG:ath:AT3G54940
            TAIR:At3g54940 PhylomeDB:Q8VYS0 ProtClustDB:CLSN2718801
            ArrayExpress:Q8VYS0 Genevestigator:Q8VYS0 Uniprot:Q8VYS0
        Length = 367

 Score = 452 (164.2 bits), Expect = 9.3e-43, P = 9.3e-43
 Identities = 106/324 (32%), Positives = 163/324 (50%)

Query:    38 THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN-LEYIEKANKEGNRTYKLGTNQF 96
             TH +S   +   +M+ +G++Y    E   RL IF +N L+  E    + +  +  G  QF
Sbjct:    45 THTESKFRL---FMSDYGKNYSTREEYIHRLGIFAKNVLKAAEHQMMDPSAVH--GVTQF 99

Query:    97 SDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
             SDLT +EF+ +YTG      S         +   + +  +P   DWR+KG VT +KNQ  
Sbjct:   100 SDLTEEEFKRMYTGVADVGGSRGGTVGA--EAPMVEVDGLPEDFDWREKGGVTEVKNQGA 157

Query:   157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN--------NGCLGGSREKAF 208
             CG CWAF+   A EG   + +G L+ LSEQQL+DC    +        NGC GG    A+
Sbjct:   158 CGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQACDPKDKKACDNGCGGGLMTNAY 217

Query:   209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSI 268
              Y+++  G+  E  YPY    G C    +  A ++ N+  +P  + Q     V   P+++
Sbjct:   218 EYLMEAGGLEEERSYPYTGKRGHCKFDPEKVAVRVLNFTTIPLDENQIAANLVRHGPLAV 277

Query:   269 AIAAYSTEFQSYKEGIFNG-VCGTQ-LDHAVTIVGFGTTEDG----AN--YWLIKNSWGN 320
              + A     Q+Y  G+    +C  + ++H V +VG+G+        +N  YW+IKNSWG 
Sbjct:   278 GLNAVF--MQTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSILRLSNKPYWIIKNSWGK 335

Query:   321 TWGDAGYMKIVRDEGLCGIGTRSS 344
              WG+ GY K+ R   +CGI +  S
Sbjct:   336 KWGENGYYKLCRGHDICGINSMVS 359


>UNIPROTKB|F1MHV4 [details] [associations]
            symbol:CTSW "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 OMA:GRCGDGC EMBL:DAAA02063574
            IPI:IPI00716321 Ensembl:ENSBTAT00000027681 Uniprot:F1MHV4
        Length = 375

 Score = 361 (132.1 bits), Expect = 1.8e-42, Sum P(2) = 1.8e-42
 Identities = 87/270 (32%), Positives = 142/270 (52%)

Query:    45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
             E+   +  Q+ RSY +  E   RL IF +NL   ++  +E   T + G  QFSDLT +EF
Sbjct:    40 EVFRLFQMQYNRSYPNPAEYARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDLTEEEF 99

Query:   105 RALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
               LY G ++   +         +    S    P + DWR  G ++P+++Q+ C CCWA A
Sbjct:   100 VQLY-GSQVAGEALGVSRKVGSEEWGESE---PQTCDWRKVGTISPVRDQRNCNCCWAMA 155

Query:   165 AVAAVEGITKIRSGNLIQLSEQ-QLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
             A   +E +  I+  + +++S Q +LLDC   GN GC GG    AF  ++ N G+A+E +Y
Sbjct:   156 AAGNIEALWAIKFRHFVEVSVQPELLDCDRCGN-GCRGGFVWDAFLTVLNNSGLASEKDY 214

Query:   224 PYQAVPGT--CSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSY 280
             P+     T  C A +    A I ++  +    EQ++ + ++ + P+++ I    T  Q Y
Sbjct:   215 PFNGSGKTHRCLAKKYKKVAWIQDFI-ILQACEQSMARHLATEGPITVTINM--TLLQQY 271

Query:   281 KEGIFNGV---CG-TQLDHAVTIVGFGTTE 306
             ++G+       C  TQ+DH+V +VGFG T+
Sbjct:   272 QKGVIKATPTTCDPTQVDHSVLLVGFGKTK 301

 Score = 105 (42.0 bits), Expect = 1.8e-42, Sum P(2) = 1.8e-42
 Identities = 15/29 (51%), Positives = 20/29 (68%)

Query:   311 YWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             YW++KNSWG  WG+ GY ++ R    CGI
Sbjct:   325 YWILKNSWGPQWGEEGYFRLHRGSNTCGI 353


>RGD|1309354 [details] [associations]
            symbol:Ctsw "cathepsin W" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1309354 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 MEROPS:C01.037
            CTD:1521 eggNOG:NOG288820 HOVERGEN:HBG100117 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 EMBL:CH473953 EMBL:BC093401 IPI:IPI00371471
            RefSeq:NP_001019413.1 UniGene:Rn.34406 Ensembl:ENSRNOT00000037404
            GeneID:293676 KEGG:rno:293676 UCSC:RGD:1309354 InParanoid:Q561Q9
            NextBio:636716 Genevestigator:Q561Q9 Uniprot:Q561Q9
        Length = 371

 Score = 351 (128.6 bits), Expect = 2.8e-42, Sum P(2) = 2.8e-42
 Identities = 91/306 (29%), Positives = 152/306 (49%)

Query:    15 TTPMFIIITLLVS----CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
             T  +F  + LL++      S +          + E+ + +  Q  RSY +  E   RL I
Sbjct:     4 TAHLFYFLALLLAGQGLSDSLLTKDAGPRPLELKEVFKLFQIQFNRSYSNPAEYTRRLGI 63

Query:    71 FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQN 130
             F  NL   ++  +E   T + G   FSDLT +EF  LY G++  +P          K + 
Sbjct:    64 FAHNLAQAQRLQEEDLGTAEFGQTPFSDLTEEEFGQLY-GHQR-APERILNMAKKVKSER 121

Query:   131 LSMTDVPTSLDWRD-KGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
                + VP + DWR  K  ++ IKNQ  C CCWA AA   ++ + +I++   + +S Q+LL
Sbjct:   122 WGES-VPPTCDWRKVKNIISSIKNQGNCRCCWAIAAADNIQTLWRIKTQQFVDVSVQELL 180

Query:   190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA--VPGTCSAAQKPAAAKISNYE 247
             DC   GN GC GG    A+  ++ N G+A+E++YP+Q    P  C A +    A I ++ 
Sbjct:   181 DCDRCGN-GCNGGFVWDAYITVLNNSGLASEEDYPFQGHQKPHRCLADKYRKVAWIQDFT 239

Query:   248 EVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGIFNGV---CGTQL-DHAVTIVGF 302
              + S +EQ +   +++  P+++ I     ++  Y++G+       C   L +H+V +VGF
Sbjct:   240 ML-SSNEQVIAGYLAIHGPITVTINMKLLQY--YQKGVIKATPSTCDPHLVNHSVLLVGF 296

Query:   303 GTTEDG 308
             G  + G
Sbjct:   297 GKEKGG 302

 Score = 113 (44.8 bits), Expect = 2.8e-42, Sum P(2) = 2.8e-42
 Identities = 24/71 (33%), Positives = 36/71 (50%)

Query:   293 LDHAVTIVGFGTTEDGAN----------------YWLIKNSWGNTWGDAGYMKIVRDEGL 336
             ++H+V +VGFG  + G                  YW++KNSWG  WG+ GY ++ R    
Sbjct:   287 VNHSVLLVGFGKEKGGMQTGTLLSHSRKPRRSTPYWILKNSWGAEWGEKGYFRLYRGNNT 346

Query:   337 CGIGTRSSYPL 347
             CGI   + YP+
Sbjct:   347 CGI---AKYPI 354


>FB|FBgn0037396 [details] [associations]
            symbol:CG11459 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE014297 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 KO:K01365 HSSP:P07711 EMBL:AY060710
            RefSeq:NP_649608.1 UniGene:Dm.3894 SMR:Q9VNK6 MEROPS:C01.A31
            EnsemblMetazoa:FBtr0078623 GeneID:40741 KEGG:dme:Dmel_CG11459
            UCSC:CG11459-RA FlyBase:FBgn0037396 InParanoid:Q9VNK6 OMA:NYDEREL
            OrthoDB:EOG4MGQPX ChiTaRS:CG11459 GenomeRNAi:40741 NextBio:820359
            Uniprot:Q9VNK6
        Length = 336

 Score = 446 (162.1 bits), Expect = 4.0e-42, P = 4.0e-42
 Identities = 99/311 (31%), Positives = 169/311 (54%)

Query:    48 EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDLTNDEF 104
             +++ A++ + Y++  +K  R  ++++ +  +E  N+   +G   +K+G N+FSD   D+ 
Sbjct:    31 DQYKAKYNKQYRNR-DKYHRA-LYEQRVLAVESHNQLYLQGKVAFKMGLNKFSD--TDQ- 85

Query:   105 RALYTGYK--MPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ-KECGCCW 161
             R L+  Y+  +P+P                   +   +DWR  G ++P+ +Q  EC  CW
Sbjct:    86 RILFN-YRSSIPAPLETSTNALTETVNYKRYDQITEGIDWRQYGYISPVGDQGTECLSCW 144

Query:   162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
             AF+    +E     + GNL+ LS + L+DC    NNGC GG    AF Y  ++ GIAT++
Sbjct:   145 AFSTSGVLEAHMAKKYGNLVPLSPKHLVDCVPYPNNGCSGGWVSVAFNYT-RDHGIATKE 203

Query:   222 EYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSY 280
              YPY+ V G C      +A  +S Y  + + DE+ L + V ++ PV+++I     EF  Y
Sbjct:   204 SYPYEPVSGECLWKSDRSAGTLSGYVTLGNYDERELAEVVYNIGPVAVSIDHLHEEFDQY 263

Query:   281 KEGIFN-GVCGTQ---LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EG 335
               G+ +   C ++   L H+V +VGFGT     +YW+IKNS+G  WG++GY+K+ R+   
Sbjct:   264 SGGVLSIPACRSKRQDLTHSVLLVGFGTHRKWGDYWIIKNSYGTDWGESGYLKLARNANN 323

Query:   336 LCGIGTRSSYP 346
             +CG+ +   YP
Sbjct:   324 MCGVASLPQYP 334


>DICTYBASE|DDB_G0281077 [details] [associations]
            symbol:DDB_G0281077 species:44689 "Dictyostelium
            discoideum" [GO:0030246 "carbohydrate binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR008965 InterPro:IPR013128 InterPro:IPR019028
            Pfam:PF00112 Pfam:PF09478 PRINTS:PR00705 SMART:SM00645
            SMART:SM01063 InterPro:IPR000169 dictyBase:DDB_G0281077
            GO:GO:0030246 EMBL:AAFI02000040 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 SUPFAM:SSF49384 ProtClustDB:CLSZ2430562
            RefSeq:XP_640803.1 ProteinModelPortal:Q54UH3
            EnsemblProtists:DDB0203998 GeneID:8622857 KEGG:ddi:DDB_G0281077
            InParanoid:Q54UH3 OMA:LINDFNF Uniprot:Q54UH3
        Length = 662

 Score = 361 (132.1 bits), Expect = 5.1e-42, Sum P(2) = 5.1e-42
 Identities = 76/183 (41%), Positives = 106/183 (57%)

Query:   137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
             P S+DWR  G V+ +KNQ  CG C+AF+ V A+E     ++  ++ LSEQ L+DC+ N G
Sbjct:   472 PISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALEAHYYRKNNRMLNLSEQNLVDCTRNYG 531

Query:   196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
             N  C GG     F YI +N GI  +  YPY+   G C      A ++ISNY  +   DE+
Sbjct:   532 NGECSGGWMHNCFRYIKENGGINLQSTYPYEGRVGLCRYNSGDAQSRISNYVMIKQHDEE 591

Query:   256 ALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNG-VCGT-QLDHAVTIVGFGTTEDGANYW 312
              L  AV S+ PVS+A  A + EF  Y  GI+N   C   +  HAV +VG+G  E+G ++W
Sbjct:   592 DLANAVASVGPVSVAYDASTREFMYYSSGIYNSDSCDKYRTTHAVVVVGYGI-ENGVDFW 650

Query:   313 LIK 315
             +IK
Sbjct:   651 IIK 653

 Score = 114 (45.2 bits), Expect = 5.1e-42, Sum P(2) = 5.1e-42
 Identities = 23/62 (37%), Positives = 40/62 (64%)

Query:    49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEG-NRTYKLGTNQFSDLTNDEFRAL 107
             +W  Q  R+Y+ + +  ++ + FK++  +IE+  +E  N T +LG  QFSD+T+DEF  +
Sbjct:   164 QWSNQFNRTYRAD-QFLLKYEAFKDSSRFIEQYKRENQNSTMELGLTQFSDMTHDEFLNI 222

Query:   108 YT 109
             YT
Sbjct:   223 YT 224


>UNIPROTKB|F1RU23 [details] [associations]
            symbol:CTSW "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 KO:K08569 EMBL:CU928325
            RefSeq:XP_003122571.1 UniGene:Ssc.28940 Ensembl:ENSSSCT00000014177
            GeneID:100525853 KEGG:ssc:100525853 OMA:CWAMAAV Uniprot:F1RU23
        Length = 367

 Score = 438 (159.2 bits), Expect = 2.8e-41, P = 2.8e-41
 Identities = 111/321 (34%), Positives = 164/321 (51%)

Query:    45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
             E+   +  Q+ RSY +  E   RL IF +NL   ++  +E   T + G   FSDLT +EF
Sbjct:    40 EVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDLTEEEF 99

Query:   105 RALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRDK-GAVTPIKNQKECGCCWAF 163
               L+ G+   +             +  S   VP S DWR K G ++ IK+QK+C CCWA 
Sbjct:   100 GQLH-GHHWGAGKAPSMGIKVGSEE--SGETVPQSCDWRKKPGVISAIKHQKDCNCCWAM 156

Query:   164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
             AAV  VE    I+    +QLS QQ+LDC   GN GC GG    AF  ++   G+A+E +Y
Sbjct:   157 AAVDNVEAQWAIKYHQAVQLSVQQVLDCDRCGN-GCNGGFVWDAFLTVLNTSGLASEQDY 215

Query:   224 PYQAVPGT--CSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSY 280
             PY+    T  C A Q    A I ++  +    EQ++ + ++ + P+++ I A     Q Y
Sbjct:   216 PYKGTVKTHRCLAKQHRKVAWIQDFLMLQFC-EQSIARYLATEGPITVTINAGL--LQQY 272

Query:   281 KEGIFNGV---CGTQL-DHAVTIVGFGTTE--DGAN--------YWLIKNSWGNTWGDAG 326
             K G+       C   L +H+V +VGFG ++  +G          YW++KNSWG  WG+ G
Sbjct:   273 KRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGRRPRPGHSIPYWILKNSWGPDWGEEG 332

Query:   327 YMKIVRDEGLCGIGTRSSYPL 347
             Y ++ R    CGI   + YP+
Sbjct:   333 YFRLHRGSNTCGI---TKYPV 350


>UNIPROTKB|P56202 [details] [associations]
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 GO:GO:0006955 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AF013611
            EMBL:AF015954 EMBL:AF055903 EMBL:AP001201 EMBL:BC048255
            IPI:IPI00328978 RefSeq:NP_001326.2 UniGene:Hs.416848
            ProteinModelPortal:P56202 SMR:P56202 STRING:P56202 MEROPS:C01.037
            PhosphoSite:P56202 DMDM:259016196 PaxDb:P56202 PRIDE:P56202
            Ensembl:ENST00000307886 GeneID:1521 KEGG:hsa:1521 UCSC:uc001ogc.1
            CTD:1521 GeneCards:GC11P065647 HGNC:HGNC:2546 HPA:CAB016345
            MIM:602364 neXtProt:NX_P56202 PharmGKB:PA27042 eggNOG:NOG288820
            HOVERGEN:HBG100117 InParanoid:P56202 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 PhylomeDB:P56202 GenomeRNAi:1521 NextBio:6295
            ArrayExpress:P56202 Bgee:P56202 CleanEx:HS_CTSW
            Genevestigator:P56202 GermOnline:ENSG00000172543 Uniprot:P56202
        Length = 376

 Score = 346 (126.9 bits), Expect = 4.1e-41, Sum P(2) = 4.1e-41
 Identities = 86/275 (31%), Positives = 131/275 (47%)

Query:    45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
             E  + +  Q  RSY    E   RL IF  NL   ++  +E   T + G   FSDLT +EF
Sbjct:    40 EAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEF 99

Query:   105 RALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRD-KGAVTPIKNQKECGCCWAF 163
               LY GY+  +             +      VP S DWR    A++PIK+QK C CCWA 
Sbjct:   100 GQLY-GYRRAAGGVPSMGREIRSEE--PEESVPFSCDWRKVASAISPIKDQKNCNCCWAM 156

Query:   164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
             AA   +E + +I   + + +S Q+LLDC   G+ GC GG    AF  ++ N G+A+E +Y
Sbjct:   157 AAAGNIETLWRISFWDFVDVSVQELLDCGRCGD-GCHGGFVWDAFITVLNNSGLASEKDY 215

Query:   224 PYQAVPGT--CSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
             P+Q       C   +    A I ++  + + + +      +  P+++ I       Q Y+
Sbjct:   216 PFQGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINM--KPLQLYR 273

Query:   282 EGIFNGV---CGTQL-DHAVTIVGFGTTEDGANYW 312
             +G+       C  QL DH+V +VGFG+ +     W
Sbjct:   274 KGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEGIW 308

 Score = 107 (42.7 bits), Expect = 4.1e-41, Sum P(2) = 4.1e-41
 Identities = 17/37 (45%), Positives = 24/37 (64%)

Query:   311 YWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
             YW++KNSWG  WG+ GY ++ R    CGI   + +PL
Sbjct:   326 YWILKNSWGAQWGEKGYFRLHRGSNTCGI---TKFPL 359


>DICTYBASE|DDB_G0282991 [details] [associations]
            symbol:DDB_G0282991 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0282991 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            EMBL:AAFI02000049 eggNOG:NOG331187 RefSeq:XP_639299.1
            ProteinModelPortal:Q54RQ2 EnsemblProtists:DDB0185304 GeneID:8623870
            KEGG:ddi:DDB_G0282991 InParanoid:Q54RQ2 OMA:PENGNEY Uniprot:Q54RQ2
        Length = 339

 Score = 435 (158.2 bits), Expect = 5.9e-41, P = 5.9e-41
 Identities = 109/309 (35%), Positives = 172/309 (55%)

Query:    49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
             +W  ++ + Y ++ E  MR   FK+N EY+++ N++   T  L  N F+DL+ +E+   Y
Sbjct:    29 EWTNKYNKIYSNK-EFYMRFNNFKKNKEYVDQWNEKQLETI-LELNFFADLSRNEYINNY 86

Query:   109 TGYKMPSPSHRXXXXXXFKYQ-NL--SMTDVPTSLDWRDKGAVTPIKNQKEC-GCCWAFA 164
                 +   +         KY+ NL  +  +   S+DWR+  AVTP+KNQ  C G  ++F+
Sbjct:    87 LASFIDISNIEQKNT---KYEGNLKNNFNNSIKSIDWRNFDAVTPVKNQGLCSGAGYSFS 143

Query:   165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
             A+  +E    I++  LI LSEQ ++DC+T+ GNNGC+GG    AF YII+ +GI +E  Y
Sbjct:   144 AIGVIESSHFIKNKELITLSEQNIIDCTTDMGNNGCMGGLALIAFDYIIKQKGIDSEFNY 203

Query:   224 PYQAV---P----GTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
             PY+     P    G C      + A IS+Y E+   +E  L +++   PVS+ I A    
Sbjct:   204 PYEGYLIEPYEGRGRCRYNSFYSKASISSYIEIERFNENELTQSLIKSPVSVMIDASQLS 263

Query:   277 FQSYKEGIFNGV-CG-TQLDHAVTIVGFGTT-EDGANYWLIKNSWGNTWGDAGYMKIVRD 333
             F  YK G++    C  T L+H +  +GFG T E+G  Y+++KNS+G+ WG  GY+ + R+
Sbjct:   264 FMLYKSGVYKDPSCSSTILNHGILNIGFGVTPENGNEYYILKNSFGSKWGMKGYIYLSRN 323

Query:   334 -EGLCGIGT 341
                 CGI +
Sbjct:   324 FNNHCGISS 332


>UNIPROTKB|E2RPX3 [details] [associations]
            symbol:CTSW "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 CTD:1521 KO:K08569 OMA:GRCGDGC
            EMBL:AAEX03011632 RefSeq:XP_540846.2 Ensembl:ENSCAFT00000020910
            GeneID:483725 KEGG:cfa:483725 Uniprot:E2RPX3
        Length = 374

 Score = 333 (122.3 bits), Expect = 7.4e-40, Sum P(2) = 7.4e-40
 Identities = 86/274 (31%), Positives = 136/274 (49%)

Query:    45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
             ++   +  Q+ RSY +  E   RL IF  NL   ++   E   T + G   FSDLT +EF
Sbjct:    40 QVFALFQIQYNRSYSNPEEYARRLDIFAHNLAQAQQLEDEDLGTAEFGVTPFSDLTEEEF 99

Query:   105 RALYTGYKMPSPSHRXXXXXXFKYQNLSMTD-VPTSLDWRD-KGAVTPIKNQKECGCCWA 162
                Y   +M   +         K ++    + VP + DWR   G ++PIK Q  C CCWA
Sbjct:   100 GQFYGHQRMAGEAPSVGR----KVESEEWGEPVPPTCDWRKLPGIISPIKQQGNCRCCWA 155

Query:   163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
              AA   +E +  IR    +++S Q+LLDC   G+ GC GG    AF  ++ N G+A+  +
Sbjct:   156 MAAAGNIEALWGIRYHQPVEVSVQELLDCGRCGD-GCKGGFTWDAFITVLNNSGLASAKD 214

Query:   223 YPY--QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQS 279
             YP+     P  C A +    A I ++  +  G+EQA+   ++ + P+++ I       Q 
Sbjct:   215 YPFLGNTKPHRCLAKKYKKVAWIQDFIML-QGNEQAIAWYLATKGPITVTINMKL--LQH 271

Query:   280 YKEGIFNGV---CGTQ-LDHAVTIVGFGTTEDGA 309
             Y++G+       C  Q +DH+V +VGFG ++  A
Sbjct:   272 YQKGVIQATHTTCDPQRVDHSVLLVGFGKSKSVA 305

 Score = 108 (43.1 bits), Expect = 7.4e-40, Sum P(2) = 7.4e-40
 Identities = 17/37 (45%), Positives = 24/37 (64%)

Query:   311 YWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
             YW++KNSWG  WG+ GY ++ R    CGI   + YP+
Sbjct:   324 YWILKNSWGAEWGEEGYFRLHRGNNTCGI---TKYPV 357


>UNIPROTKB|E1BPI9 [details] [associations]
            symbol:CTSO "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 CTD:1519 KO:K01374 OMA:SNVCGIA
            EMBL:DAAA02044933 IPI:IPI01004081 RefSeq:XP_002694471.2
            RefSeq:XP_874012.4 Ensembl:ENSBTAT00000014691 GeneID:616804
            KEGG:bta:616804 Uniprot:E1BPI9
        Length = 313

 Score = 420 (152.9 bits), Expect = 2.3e-39, P = 2.3e-39
 Identities = 100/277 (36%), Positives = 148/277 (53%)

Query:    71 FKENLE---YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFK 127
             F+E+L    Y+       N T   G NQFS L  +EF+A+Y      SPS R       +
Sbjct:    36 FRESLNRQRYLNSLFPYENSTAVYGINQFSYLFPEEFKAIYL---RSSPS-RFPRFPAEE 91

Query:   128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
             Y ++S   +P   DWRDK  VT ++NQK CG CWAF+ V AVE +  I+   L  LS QQ
Sbjct:    92 YTSISNLSLPLRFDWRDKHVVTQVRNQKTCGGCWAFSVVGAVESVCAIKGQPLEVLSVQQ 151

Query:   188 LLDCSTNGNNGCLGGSREKAFAYIIQNQ-GIATEDEYPYQAVPGTCSA-AQKPAAAKISN 245
             ++DCS + N GC GGS   A  ++ + Q  +  + EYP+QA  G C   +   + + I  
Sbjct:   152 VIDCSYS-NYGCNGGSPLSALYWLNKLQVKLVRDSEYPFQAQNGLCRYFSDSHSGSSIKG 210

Query:   246 YEEVP-SGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGVCGT-QLDHAVTIVGF 302
             Y     SG E  + +A+ ++ P+ + + A S  +Q Y  GI    C + + +HAV + GF
Sbjct:   211 YSAYDFSGQEDKMAEALLALGPLIVVVDAMS--WQDYLGGIIQHHCSSGEANHAVLVTGF 268

Query:   303 GTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
               T     YW+++NSWG +WG  GY+++     +CGI
Sbjct:   269 DKT-GSIPYWIVRNSWGTSWGIDGYVRVKMGGNVCGI 304


>WB|WBGene00011102 [details] [associations]
            symbol:R07E3.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            EMBL:Z49207 HSSP:P53634 PIR:T24030 RefSeq:NP_001041280.1
            ProteinModelPortal:Q21810 SMR:Q21810 STRING:Q21810 MEROPS:C01.A43
            PaxDb:Q21810 EnsemblMetazoa:R07E3.1a GeneID:181242
            KEGG:cel:CELE_R07E3.1 UCSC:R07E3.1a CTD:181242 WormBase:R07E3.1a
            HOGENOM:HOG000021028 InParanoid:Q21810 OMA:ACKNEVI NextBio:913066
            ArrayExpress:Q21810 Uniprot:Q21810
        Length = 402

 Score = 420 (152.9 bits), Expect = 2.3e-39, P = 2.3e-39
 Identities = 113/344 (32%), Positives = 167/344 (48%)

Query:    18 MFII--ITLLVSCASQVVSSRS-THEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIF 71
             MFII  + L     ++ VS R+ T+E+ +  I ++++A   +  +SY    E   RL  +
Sbjct:    55 MFIIFVVPLFTKLQAEKVSRRAHTNERGIQNIAKEYIAYTEKFDKSYATSQESLKRLNAY 114

Query:    72 ---KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKY 128
                 EN+      N+ G+  Y  G N  SD T++EF             H+         
Sbjct:   115 YNTDENIANWNIQNEHGSAEY--GHNDMSDWTDEEFEKTLLPKSFYKRLHKEAEFIEPIP 172

Query:   129 QNL------SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
             ++L      S +  P   DWRDK  +TP+K Q +CG CWAFA+ A VE    I  G    
Sbjct:   173 ESLTAKKGESSSPFPDFFDWRDKNVITPVKAQGQCGSCWAFASTATVEAAWAIAHGEKRN 232

Query:   183 LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAV-PGTCSAAQKPAAA 241
             LSEQ LLDC    +N C GG  +KAF YI +N G+A   + PY A     C+        
Sbjct:   233 LSEQTLLDCDLV-DNACDGGDEDKAFRYIHRN-GLANAVDLPYVAHRQNGCAVNDHWNTT 290

Query:   242 KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG---VCGTQLD--HA 296
             +I     +   ++  +   V+  PV+I +A      ++YK G+F      C  ++   HA
Sbjct:   291 RIKAAYFLHHDEDSIINWLVNFGPVNIGMAVIQP-MRAYKGGVFTPSEYACKNEVIGLHA 349

Query:   297 VTIVGFGTTEDGANYWLIKNSWGNTWG-DAGYMKIVRDEGLCGI 339
             + I G+GT++ G  YW++KNSWGNTWG + GY+   R    CGI
Sbjct:   350 LLITGYGTSKTGEKYWIVKNSWGNTWGVEHGYIYFARGINACGI 393


>MGI|MGI:2139628 [details] [associations]
            symbol:Ctso "cathepsin O" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:2139628 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 GeneTree:ENSGT00560000076599 MEROPS:C01.035 CTD:1519
            HOVERGEN:HBG105050 KO:K01374 OMA:SNVCGIA OrthoDB:EOG4V6ZH1
            EMBL:AK034490 EMBL:AK049470 EMBL:AK165930 EMBL:AK166103
            EMBL:BC044664 IPI:IPI00453524 RefSeq:NP_808330.1 UniGene:Mm.254642
            ProteinModelPortal:Q8BM88 SMR:Q8BM88 STRING:Q8BM88
            PhosphoSite:Q8BM88 PRIDE:Q8BM88 Ensembl:ENSMUST00000029649
            GeneID:229445 KEGG:mmu:229445 UCSC:uc008pon.1 InParanoid:Q8BM88
            NextBio:379433 Bgee:Q8BM88 CleanEx:MM_CTSO Genevestigator:Q8BM88
            GermOnline:ENSMUSG00000028015 Uniprot:Q8BM88
        Length = 312

 Score = 418 (152.2 bits), Expect = 3.7e-39, P = 3.7e-39
 Identities = 103/298 (34%), Positives = 155/298 (52%)

Query:    51 MAQHGRSYKDELEKEMRLKIFKENLE---YIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
             + +HG +       +      +E+L    Y+     E N T   G NQFS L  +EF+AL
Sbjct:    16 LGRHGVAGTWSWSHQREAAALRESLHRHRYLNSFPHE-NSTAFYGVNQFSYLFPEEFKAL 74

Query:   108 YTGYKMP-SPSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
             Y G K   +P  R          N+S+   P   DWRDK  V P++NQ+ CG CWAF+ V
Sbjct:    75 YLGSKYAWAP--RYPAEGQRPIPNVSL---PLRFDWRDKHVVNPVRNQEMCGGCWAFSVV 129

Query:   167 AAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ-GIATEDEYPY 225
             +A+E    I+  +L  LS QQ++DCS N N+GCLGGS   A  ++ + Q  +  + +YP+
Sbjct:   130 SAIESARAIQGKSLDYLSVQQVIDCSFN-NSGCLGGSPLCALRWLNETQLKLVADSQYPF 188

Query:   226 QAVPGTCSA-AQKPAAAKISNYEEVP-SGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
             +AV G C    Q  A   + ++      G E  + +A+ S  P+ + + A S  +Q Y  
Sbjct:   189 KAVNGQCRHFPQSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMS--WQDYLG 246

Query:   283 GIFNGVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             GI    C + + +HAV I GF  T +   YW+++NSWG++WG  GY  +     +CGI
Sbjct:   247 GIIQHHCSSGEANHAVLITGFDRTGN-TPYWMVRNSWGSSWGVEGYAHVKMGGNVCGI 303


>ZFIN|ZDB-GENE-080724-8 [details] [associations]
            symbol:ctso "cathepsin O" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            ZFIN:ZDB-GENE-080724-8 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 CTD:1519 KO:K01374 EMBL:CR931784
            IPI:IPI00513613 RefSeq:XP_695717.3 UniGene:Dr.88386
            Ensembl:ENSDART00000074786 GeneID:567333 KEGG:dre:567333
            NextBio:20888622 Uniprot:E7FA09
        Length = 334

 Score = 418 (152.2 bits), Expect = 3.7e-39, P = 3.7e-39
 Identities = 102/331 (30%), Positives = 169/331 (51%)

Query:    19 FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEM--RLKIFKENLE 76
             FI++ +     + ++S     + S+ E  E+   QH  +++ ++  E+  R   ++ +L+
Sbjct:     7 FIVLIIYQELLTGIISVEVIRK-SLTE-GER--LQHSDTFQQDVNNELYQRWINYQSSLQ 62

Query:    77 ---YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSM 133
                ++  A  + N++ + G NQFS L+  +F+  Y   +  +           K +    
Sbjct:    63 RQAFLNSALGKSNQSAQYGVNQFSYLSQKQFKEQYLTARAEAAPKFDQSKSEIKVK---- 118

Query:   134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
              + P   DWRD G V P+ NQ  CG CWAF+ V A+E ++      L QLS QQ++DCS 
Sbjct:   119 ANNPPRFDWRDHGVVGPVHNQGSCGGCWAFSIVEAIESVSAKGGEKLQQLSVQQVIDCSY 178

Query:   194 NGNNGCLGGSREKAFAYIIQNQ-GIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVP- 250
               N GC GGS  +A  ++ Q++  + +E EYP++   G C    Q  A   + NY     
Sbjct:   179 Q-NQGCNGGSPVEALYWLTQSKLKLVSEAEYPFKGADGVCQFFPQAHAGVAVRNYSAYDF 237

Query:   251 SGDEQALLKA-VSMQPVSIAIAAYSTEFQSYKEGIFNGVCGT-QLDHAVTIVGFGTTEDG 308
             SG E+ ++ A V   P+ + + A S  +Q Y  GI    C + + +HAV I G+ TT + 
Sbjct:   238 SGQEEVMMSALVDFGPLVVIVDAIS--WQDYLGGIIQHHCSSHKANHAVLITGYDTTGE- 294

Query:   309 ANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
               YW+++NSWG +WGD GY  I     +CG+
Sbjct:   295 VPYWIVRNSWGTSWGDDGYAYIKIGNDVCGV 325


>DICTYBASE|DDB_G0281079 [details] [associations]
            symbol:DDB_G0281079 species:44689 "Dictyostelium
            discoideum" [GO:0030246 "carbohydrate binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR008965 InterPro:IPR013128 InterPro:IPR019028
            Pfam:PF00112 Pfam:PF09478 PRINTS:PR00705 SMART:SM00645
            SMART:SM01063 InterPro:IPR000169 dictyBase:DDB_G0281079
            GO:GO:0030246 EMBL:AAFI02000040 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 SUPFAM:SSF49384 RefSeq:XP_640804.1
            ProteinModelPortal:Q54UH2 EnsemblProtists:DDB0204000 GeneID:8622858
            KEGG:ddi:DDB_G0281079 InParanoid:Q54UH2 OMA:ALESHYY
            ProtClustDB:CLSZ2430562 Uniprot:Q54UH2
        Length = 664

 Score = 333 (122.3 bits), Expect = 8.6e-39, Sum P(2) = 8.6e-39
 Identities = 71/185 (38%), Positives = 104/185 (56%)

Query:   137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
             P S+DWR  G V+ +KNQ  CG C+AF+ V A+E     ++  ++ LSEQ L+DC+ +  
Sbjct:   471 PISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALESHYYRKNNRMLDLSEQNLVDCTASNK 530

Query:   196 --NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
               N GC GG     ++YI +N GI  E  YPY+   G C      A ++IS +  +   D
Sbjct:   531 YRNGGCSGGWMHNCYSYIQENGGINQESTYPYEGKFGQCRYNSGDAQSRISKFVMIKQHD 590

Query:   254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVCGT-QLDHAVTIVGFGTTEDGAN 310
             E+ L   V S+ PVS+A  A + EF  Y  GI+ +  C   +  HAV +VG+   E+G +
Sbjct:   591 EEDLADTVASVGPVSVAYDASTREFMYYSRGIYYSDNCNKYRTTHAVVVVGYDN-ENGVD 649

Query:   311 YWLIK 315
             YW+IK
Sbjct:   650 YWIIK 654

 Score = 113 (44.8 bits), Expect = 8.6e-39, Sum P(2) = 8.6e-39
 Identities = 23/62 (37%), Positives = 40/62 (64%)

Query:    49 KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEG-NRTYKLGTNQFSDLTNDEFRAL 107
             +W  Q  R+Y+ + +  ++ + FK++  +IE+  +E  N T +LG  QFSD+T+DEF  +
Sbjct:   163 QWSNQFNRTYRAD-QFLLKYEAFKDSSRFIEQYKRENQNSTMELGLTQFSDMTHDEFLNV 221

Query:   108 YT 109
             YT
Sbjct:   222 YT 223


>RGD|1564827 [details] [associations]
            symbol:RGD1564827 "similar to cathepsin M" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 IPI:IPI00192321
            Ensembl:ENSRNOT00000023990 ArrayExpress:D3ZY04 Uniprot:D3ZY04
        Length = 338

 Score = 412 (150.1 bits), Expect = 1.6e-38, P = 1.6e-38
 Identities = 84/200 (42%), Positives = 115/200 (57%)

Query:   154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYII 212
             Q  C  CWAF  V A+EG    ++G L  LS Q L+DCS   GN GC GG+   AF Y++
Sbjct:   139 QGRCNSCWAFPVVGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVL 198

Query:   213 QNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
             QN G+ +E  YPY+   G C      ++AKI+     P  +E  L+ AV+ +PV+  I  
Sbjct:   199 QNGGLESEATYPYEGKEGLCRYNPN-SSAKITXICAPPQKNEDVLMDAVATKPVAAGIHV 257

Query:   273 YSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYM 328
               +  + YK+GI++   C   ++HAV +VG+G      DG NYWLI+NSWG  WG  GYM
Sbjct:   258 VHSSLRFYKKGIYHEPKCNNYVNHAVLVVGYGFEGNETDGNNYWLIQNSWGERWGLNGYM 317

Query:   329 KIVRDEGL-CGIGTRSSYPL 347
             KI +D    CGI T + YP+
Sbjct:   318 KIAKDRNNHCGIATFAQYPI 337


>WB|WBGene00013076 [details] [associations]
            symbol:Y51A2D.8 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            EMBL:AL021497 HSSP:P53634 HOGENOM:HOG000019851 PIR:T27079
            RefSeq:NP_507627.1 ProteinModelPortal:Q9XXQ7 SMR:Q9XXQ7
            MEROPS:C01.A49 EnsemblMetazoa:Y51A2D.8 GeneID:180208
            KEGG:cel:CELE_Y51A2D.8 UCSC:Y51A2D.8 CTD:180208 WormBase:Y51A2D.8
            eggNOG:NOG307864 InParanoid:Q9XXQ7 OMA:VAVYFKV NextBio:908434
            Uniprot:Q9XXQ7
        Length = 386

 Score = 332 (121.9 bits), Expect = 1.7e-38, Sum P(2) = 1.7e-38
 Identities = 89/279 (31%), Positives = 136/279 (48%)

Query:    86 NRTYKLGTNQFSDLTNDEFRALYTGYKM-----PSPSHRXXXXXXFKYQNLSMTDVPTSL 140
             N+   L T +F    ++   +  TG  M       P  R       +++  S T  P   
Sbjct:    93 NKFSDLSTAEFHGRLSNVVPSNNTGLPMLNFDKKKPDFRAADMNKTRHKRRS-TRYPDYF 151

Query:   141 DWRDK---GA--VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
             D R++   G   V PIK+Q +C CCW FA  A VE +    SG    LS+Q++ DC T G
Sbjct:   152 DLRNEKINGRYIVGPIKDQGQCACCWGFAVTALVETVYAAHSGKFKSLSDQEVCDCGTEG 211

Query:   196 NNGCLGGSREKAFAYIIQNQGIATEDEYPY---QAVPGT-CSAAQ--KPAAAKISNYEEV 249
               GC GGS      Y+ +  G++ +++YPY   +A  G  C   +  +   A+  N+  +
Sbjct:   212 TPGCKGGSLTLGVQYV-KKYGLSGDEDYPYDQNRANQGRRCRLRETDRIVPARAFNFAVI 270

Query:   250 -PSGDEQALLKAVSMQPVSIAIA-AYSTEFQSYKEG-IFNGVC--GTQLDHAVTIVGFGT 304
              P   E+ +++ ++   V +A+      +F+ YKEG I    C   TQ  HA  IVG+ T
Sbjct:   271 NPRRAEEQIIQVLTEWKVPVAVYFKVGDQFKEYKEGVIIEDDCRRATQW-HAGAIVGYDT 329

Query:   305 TEDGA----NYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
              ED      +YW+IKNSWG  W ++GY+++VR    C I
Sbjct:   330 VEDSRGRSHDYWIIKNSWGGDWAESGYVRVVRGRDWCSI 368

 Score = 96 (38.9 bits), Expect = 1.7e-38, Sum P(2) = 1.7e-38
 Identities = 21/69 (30%), Positives = 36/69 (52%)

Query:    39 HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT-Y--KLGTNQ 95
             H + + +  E +  ++ R YKDE E + R   F ++   ++K N +     Y  + G N+
Sbjct:    35 HPEKLYKAFEDFKKKYNRKYKDESENQQRFNNFVKSYNNVDKLNAKSKAAGYDTQFGINK 94

Query:    96 FSDLTNDEF 104
             FSDL+  EF
Sbjct:    95 FSDLSTAEF 103


>UNIPROTKB|P43234 [details] [associations]
            symbol:CTSO "Cathepsin O" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 Reactome:REACT_6900
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0004197
            CleanEx:HS_CTSO EMBL:X77383 EMBL:BC049206 IPI:IPI00017257
            PIR:A55090 RefSeq:NP_001325.1 UniGene:Hs.75262
            ProteinModelPortal:P43234 SMR:P43234 IntAct:P43234 STRING:P43234
            MEROPS:C01.035 PhosphoSite:P43234 DMDM:1168795 PRIDE:P43234
            DNASU:1519 Ensembl:ENST00000433477 GeneID:1519 KEGG:hsa:1519
            UCSC:uc003ipg.3 CTD:1519 GeneCards:GC04M156845 HGNC:HGNC:2542
            HPA:HPA002041 MIM:600550 neXtProt:NX_P43234 PharmGKB:PA27040
            HOVERGEN:HBG105050 InParanoid:P43234 KO:K01374 OMA:SNVCGIA
            OrthoDB:EOG4V6ZH1 PhylomeDB:P43234 BindingDB:P43234
            ChEMBL:CHEMBL3035 GenomeRNAi:1519 NextBio:6287 Bgee:P43234
            Genevestigator:P43234 GermOnline:ENSG00000151792 Uniprot:P43234
        Length = 321

 Score = 408 (148.7 bits), Expect = 4.3e-38, P = 4.3e-38
 Identities = 101/284 (35%), Positives = 143/284 (50%)

Query:    64 KEMRLKIFKENLE---YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRX 120
             +E     F+E+L    Y+       N T   G NQFS L  +EF+A+Y   K PS   R 
Sbjct:    37 REREAAAFRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSK-PSKFPRY 95

Query:   121 XXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
                      N+S+   P   DWRDK  VT ++NQ+ CG CWAF+ V AVE    I+   L
Sbjct:    96 SAEVHMSIPNVSL---PLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPL 152

Query:   181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ-GIATEDEYPYQAVPGTCSA-AQKP 238
               LS QQ++DCS N N GC GGS   A  ++ + Q  +  + EYP++A  G C   +   
Sbjct:   153 EDLSVQQVIDCSYN-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSGSH 211

Query:   239 AAAKISNYEEVPSGD-EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGVCGT-QLDH 295
             +   I  Y      D E  + KA+ +  P+ + + A S  +Q Y  GI    C + + +H
Sbjct:   212 SGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVS--WQDYLGGIIQHHCSSGEANH 269

Query:   296 AVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             AV I GF  T     YW+++NSWG++WG  GY  +     +CGI
Sbjct:   270 AVLITGFDKT-GSTPYWIVRNSWGSSWGVDGYAHVKMGSNVCGI 312


>UNIPROTKB|F1PGK4 [details] [associations]
            symbol:CTSO "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 OMA:SNVCGIA
            EMBL:AAEX03010073 Ensembl:ENSCAFT00000013638 Uniprot:F1PGK4
        Length = 316

 Score = 403 (146.9 bits), Expect = 1.5e-37, P = 1.5e-37
 Identities = 97/277 (35%), Positives = 142/277 (51%)

Query:    71 FKENLE---YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFK 127
             F+E+L    Y+       N +   G NQFS L+ +EF+A+Y   K PS S R        
Sbjct:    39 FRESLNRHRYLNSVFPRENSSAVYGINQFSYLSPEEFKAIYLRSK-PSRSPRYPAEVRTS 97

Query:   128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
              +N+S+   P   DWRDK  VT ++NQ+ CG CWAF+ V AVE    I+   L  +S QQ
Sbjct:    98 IRNVSL---PLRFDWRDKRVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGKPLADISVQQ 154

Query:   188 LLDCSTNGNNGCLGGSREKAFAYIIQNQ-GIATEDEYPYQAVPGTCSA-AQKPAAAKISN 245
             ++DCS N N GC GGS   A  ++ + Q  +  + EYP++A  G C   +   +   I  
Sbjct:   155 VIDCSYN-NYGCSGGSTLNALNWLNKTQVKLVRDSEYPFKAQNGLCHYFSDSYSGFSIRG 213

Query:   246 YEEVPSGD-EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGVCGT-QLDHAVTIVGF 302
             Y      D E  + K + +  P+ + + A S  +Q Y  GI    C + + +HAV I GF
Sbjct:   214 YSAYDFSDQEDEMAKVLLTFGPLVVVVDAVS--WQDYLGGIIQHHCSSGEANHAVLITGF 271

Query:   303 GTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
                     YW+++NSWG++WG  GY  +     +CGI
Sbjct:   272 DKI-GSTPYWIVRNSWGSSWGVDGYAHVKMGGNICGI 307


>UNIPROTKB|E9PI30 [details] [associations]
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            EMBL:AP001201 HGNC:HGNC:2546 IPI:IPI00984532
            ProteinModelPortal:E9PI30 SMR:E9PI30 Ensembl:ENST00000528419
            ArrayExpress:E9PI30 Bgee:E9PI30 Uniprot:E9PI30
        Length = 364

 Score = 346 (126.9 bits), Expect = 3.1e-37, Sum P(2) = 3.1e-37
 Identities = 86/275 (31%), Positives = 131/275 (47%)

Query:    45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
             E  + +  Q  RSY    E   RL IF  NL   ++  +E   T + G   FSDLT +EF
Sbjct:    40 EAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEF 99

Query:   105 RALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRD-KGAVTPIKNQKECGCCWAF 163
               LY GY+  +             +      VP S DWR    A++PIK+QK C CCWA 
Sbjct:   100 GQLY-GYRRAAGGVPSMGREIRSEE--PEESVPFSCDWRKVASAISPIKDQKNCNCCWAM 156

Query:   164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
             AA   +E + +I   + + +S Q+LLDC   G+ GC GG    AF  ++ N G+A+E +Y
Sbjct:   157 AAAGNIETLWRISFWDFVDVSVQELLDCGRCGD-GCHGGFVWDAFITVLNNSGLASEKDY 215

Query:   224 PYQAVPGT--CSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
             P+Q       C   +    A I ++  + + + +      +  P+++ I       Q Y+
Sbjct:   216 PFQGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINM--KPLQLYR 273

Query:   282 EGIFNGV---CGTQL-DHAVTIVGFGTTEDGANYW 312
             +G+       C  QL DH+V +VGFG+ +     W
Sbjct:   274 KGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEGIW 308

 Score = 70 (29.7 bits), Expect = 3.1e-37, Sum P(2) = 3.1e-37
 Identities = 9/14 (64%), Positives = 12/14 (85%)

Query:   311 YWLIKNSWGNTWGD 324
             YW++KNSWG  WG+
Sbjct:   326 YWILKNSWGAQWGE 339


>UNIPROTKB|F1P0K2 [details] [associations]
            symbol:CTSO "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            OMA:SNVCGIA EMBL:AADN02016534 IPI:IPI00651180
            Ensembl:ENSGALT00000015270 Uniprot:F1P0K2
        Length = 320

 Score = 384 (140.2 bits), Expect = 1.5e-35, P = 1.5e-35
 Identities = 94/273 (34%), Positives = 141/273 (51%)

Query:    81 ANKEGNRTYKLGTNQFSDLTNDEFRALYTG---YKMPSPSHRXXXXXXFKYQNLSMTDVP 137
             +N  G+  Y  G NQFS L  +EF+A+Y     YK+P            K        +P
Sbjct:    60 SNDNGSAFY--GKNQFSHLFPEEFKAIYLRSIPYKLPR---------YIKVPKGEEKPLP 108

Query:   138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
                DWRDK  +  ++NQ+ CG CWAF+ V  +E    I+  NL +LS QQ++DCS + N 
Sbjct:   109 KKFDWRDKKVIAEVRNQQTCGGCWAFSVVGGIESAYAIKGHNLEELSVQQVIDCSYS-NY 167

Query:   198 GCLGGSREKAFAYIIQNQ-GIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVP-SGDE 254
             GC GGS   A +++ Q +  +  + EY ++A  G C           I+ +     SG E
Sbjct:   168 GCSGGSTITALSWLNQTKVKLVRDSEYTFKAQTGLCHYFPHSDFGVSITGFAAYDFSGQE 227

Query:   255 QALLKA-VSMQPVSIAIAAYSTEFQSYKEGIFNGVCGT-QLDHAVTIVGFGTTEDGANYW 312
             + +++  V   P+++ + A S  +Q Y  GI    C + + +HAV I GF TT     YW
Sbjct:   228 EEMMRVLVDWGPLAVTVDAVS--WQDYLGGIIQYHCSSGKANHAVLITGFDTTGI-IPYW 284

Query:   313 LIKNSWGNTWGDAGYMKIVRDEGLCGIG-TRSS 344
             +++NSWG TWG  GY+++     +CGI  T SS
Sbjct:   285 IVQNSWGRTWGIDGYVRVKIGSNVCGIADTVSS 317


>UNIPROTKB|Q5T8F0 [details] [associations]
            symbol:CTSL1 "Cathepsin L1 light chain" species:9606 "Homo
            sapiens" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            EMBL:AL160279 UniGene:Hs.731507 UniGene:Hs.731952 HGNC:HGNC:2537
            ChiTaRS:CTSL1 IPI:IPI00640540 SMR:Q5T8F0 Ensembl:ENST00000342020
            ChEMBL:CHEMBL1293261 Uniprot:Q5T8F0
        Length = 225

 Score = 375 (137.1 bits), Expect = 1.3e-34, P = 1.3e-34
 Identities = 78/208 (37%), Positives = 118/208 (56%)

Query:    24 LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN- 82
             +L +    + S+  T + S+     KW A H R Y    E+  R  ++++N++ IE  N 
Sbjct:     6 ILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKMIELHNQ 64

Query:    83 --KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSL 140
               +EG  ++ +  N F D+T++EFR +  G++   P           +Q     + P S+
Sbjct:    65 EYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKV------FQEPLFYEAPRSV 118

Query:   141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGC 199
             DWR+KG VTP+KNQ +CG CWAF+A  A+EG    ++G LI LSEQ L+DCS   GN GC
Sbjct:   119 DWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGC 178

Query:   200 LGGSREKAFAYIIQNQGIATEDEYPYQA 227
              GG  + AF Y+  N G+ +E+ YPY+A
Sbjct:   179 NGGLMDYAFQYVQDNGGLDSEESYPYEA 206


>WB|WBGene00008231 [details] [associations]
            symbol:tag-329 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            eggNOG:NOG288820 EMBL:Z70750 HSSP:P53634 HOGENOM:HOG000019851
            PIR:T20110 RefSeq:NP_505458.1 ProteinModelPortal:Q18740 SMR:Q18740
            MEROPS:C01.A36 EnsemblMetazoa:C50F4.3 GeneID:183677
            KEGG:cel:CELE_C50F4.3 UCSC:C50F4.3 CTD:183677 WormBase:C50F4.3
            InParanoid:Q18740 OMA:WIFRNSW NextBio:921986 Uniprot:Q18740
        Length = 374

 Score = 368 (134.6 bits), Expect = 7.4e-34, P = 7.4e-34
 Identities = 101/333 (30%), Positives = 149/333 (44%)

Query:    36 RSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT---YKLG 92
             R+  E+   E  E ++ ++ R+YKDE+EK+ R + F      + K NK   +     K G
Sbjct:    37 RNNPEKLYKEF-EDFIVKYKRNYKDEIEKKFRFQQFVATHNRVGKMNKAAKKAGHDTKYG 95

Query:    93 TNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNL----SMTDVPTSLDWRDK--G 146
              N+FSDL+  E   +Y+ +    P         F  +NL     M  +P + D R+K  G
Sbjct:    96 INKFSDLSKKEIHGMYSKF---GPPKNNTNVPKFNLKNLRVKRQMEGLPKTFDLRNKKVG 152

Query:   147 A---VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
                 + PIK Q  C CCW FAA A  E    +     + LSEQ++ DC+     GC GG 
Sbjct:   153 GHYIIGPIKTQDSCACCWGFAATAVAEAALTVHLKKAMNLSEQEVCDCAPKHGPGCNGGD 212

Query:   204 REKAFAYIIQNQGIATEDEYPYQAVPGT----CSAAQKPAAA---KISNYEEVPSGDEQA 256
                   YI +  G+    EYP+     T    C + +        ++  Y   P   E  
Sbjct:   213 PVDGLEYI-KEMGLTGGKEYPFNVNRSTQLGRCESEKYDRELNPLELDYYAIDPFNAEYQ 271

Query:   257 LLKAVSMQ--PVSIAIAAYSTEFQSYKEGIFN-GVCGTQLD---HAVTIVGFGTTEDGA- 309
             +   + +   P+S+A         SY  GI     C  +     H+  IVG+GTT++ A 
Sbjct:   272 MTHHLYLLNLPISVAFRT-GASLSSYLSGILELADCDDEKGGHWHSGAIVGYGTTKNSAG 330

Query:   310 ---NYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
                +YW+ +NSW   WGD GY +IVR E  C I
Sbjct:   331 RTVDYWIFRNSWWTDWGDDGYARIVRGEDWCSI 363


>UNIPROTKB|H0YD65 [details] [associations]
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 EMBL:AP002748
            HGNC:HGNC:2531 ChiTaRS:CTSF Ensembl:ENST00000524994 Uniprot:H0YD65
        Length = 283

 Score = 352 (129.0 bits), Expect = 3.7e-32, P = 3.7e-32
 Identities = 90/263 (34%), Positives = 138/263 (52%)

Query:    43 VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
             +  I + ++  + R+Y+ + E   RL +F  N+   +K       T + G  +FSDLT +
Sbjct:    32 MASIFKNFVITYNRTYESK-EARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEE 90

Query:   103 EFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDV-PTSLDWRDKGAVTPIKNQKECGCCW 161
             EFR +Y    +     R       K Q  S+ D+ P   DWR KGAVT +K+Q  CG CW
Sbjct:    91 EFRTIYLNTLL-----RKEPGNKMK-QAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCW 144

Query:   162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
             AF+    VEG   +  G L+ LSEQ+LLDC    +  C+GG    A++ I    G+ TED
Sbjct:   145 AFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKM-DKACMGGLPSNAYSAIKNLGGLETED 203

Query:   222 EYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSY 280
             +Y YQ    +C+ + + A   I++  E+ S +EQ L   ++ + P+S+AI A+  +F  Y
Sbjct:   204 DYSYQGHMQSCNFSAEKAKVYINDSVEL-SQNEQKLAAWLAKRGPISVAINAFGMQF--Y 260

Query:   281 KEGI---FNGVCGTQL-DHAVTI 299
             + GI      +C   L DHAV +
Sbjct:   261 RHGISRPLRPLCSPWLIDHAVLL 283


>DICTYBASE|DDB_G0286015 [details] [associations]
            symbol:gmsA species:44689 "Dictyostelium discoideum"
            [GO:0005615 "extracellular space" evidence=IDA] [GO:0019953 "sexual
            reproduction" evidence=IEP] [GO:0009897 "external side of plasma
            membrane" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA;ISS] [GO:0000747 "conjugation with cellular
            fusion" evidence=IMP] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005576 "extracellular
            region" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0286015 Pfam:PF00188 GO:GO:0005615
            GenomeReviews:CM000153_GR GO:GO:0009897 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AAFI02000085 GO:GO:0000747
            Gene3D:3.40.33.10 InterPro:IPR001283 InterPro:IPR014044
            SMART:SM00198 SUPFAM:SSF55797 HSSP:P07688 RefSeq:XP_637893.1
            ProteinModelPortal:Q54ME1 MEROPS:C01.A52 EnsemblProtists:DDB0191145
            GeneID:8625403 KEGG:ddi:DDB_G0286015 InParanoid:Q54ME1 OMA:PGIAYEK
            ProtClustDB:CLSZ2429919 Uniprot:Q54ME1
        Length = 448

 Score = 319 (117.4 bits), Expect = 5.5e-32, Sum P(2) = 5.5e-32
 Identities = 84/223 (37%), Positives = 118/223 (52%)

Query:   134 TDVPTS---LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSG----NLIQLSEQ 186
             T  PTS   +DW      TPI++Q +CG CWAFA+ AA+E    I+ G    + +QLS Q
Sbjct:   235 TPAPTSTLTVDWTSYQ--TPIRDQGQCGSCWAFASSAALESRYLIKYGTAQKSTLQLSNQ 292

Query:   187 QLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISN 245
               ++C  +G NG   G+    F    +  GIA E + PY+AV GT C      A  K +N
Sbjct:   293 NAVNCIASGCNGGWSGNYFNFF----KTPGIAYEKDDPYKAVTGTSCITTSSVARFKYTN 348

Query:   246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCG-TQLDHAVTIVGFGT 304
             Y       + ALL  +   PV+IA+   S  FQ+YK GI+N     T ++H V +VG+  
Sbjct:   349 YGYTEK-TKAALLAELKKGPVTIAVYVDSA-FQNYKSGIYNSATKYTGINHLVLLVGYDQ 406

Query:   305 TEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIGTRSSYP 346
               D    + IKNSWG+ WG++GYM+I   ++ L      S YP
Sbjct:   407 ATDA---YKIKNSWGSWWGESGYMRITASNDNLAIFAYNSYYP 446

 Score = 47 (21.6 bits), Expect = 5.5e-32, Sum P(2) = 5.5e-32
 Identities = 10/34 (29%), Positives = 17/34 (50%)

Query:    18 MFIIITLLVSCASQVVSSRS-THEQSVVEIHEKW 50
             + +++  L+S    V    S T +Q +V  H KW
Sbjct:     3 LILVLLCLISTLFVVKGGLSPTEQQIIVSYHNKW 36


>UNIPROTKB|Q5QP40 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015644 Pfam:PF00112
            InterPro:IPR000169 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 EMBL:AL355860 HOVERGEN:HBG011513
            PANTHER:PTHR12411:SF55 EMBL:AL356292 UniGene:Hs.632466
            HGNC:HGNC:2536 IPI:IPI00514633 SMR:Q5QP40 STRING:Q5QP40
            Ensembl:ENST00000443913 Uniprot:Q5QP40
        Length = 258

 Score = 335 (123.0 bits), Expect = 2.3e-30, P = 2.3e-30
 Identities = 73/195 (37%), Positives = 116/195 (59%)

Query:    33 VSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRT 88
             V S + + + +++ H E W   H + Y +++++  R  I+++NL+YI   N E   G  T
Sbjct:    70 VVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHT 129

Query:    89 YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXF--KYQNLSMTDVPTSLDWRDKG 146
             Y+L  N   D+T++E     TG K+P  SH       +  +++  +    P S+D+R KG
Sbjct:   130 YELAMNHLGDMTSEEVVQKMTGLKVPL-SHSRSNDTLYIPEWEGRA----PDSVDYRKKG 184

Query:   147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREK 206
              VTP+KNQ +CG CWAF++V A+EG  K ++G L+ LS Q L+DC +  N+GC GG    
Sbjct:   185 YVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE-NDGCGGGYMTN 243

Query:   207 AFAYIIQNQGIATED 221
             AF Y+ +N+GI +ED
Sbjct:   244 AFQYVQKNRGIDSED 258


>WB|WBGene00013764 [details] [associations]
            symbol:Y113G7B.15 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 GeneTree:ENSGT00560000076599
            EMBL:AL110477 HOGENOM:HOG000019851 RefSeq:NP_507904.2
            ProteinModelPortal:Q9U2X1 SMR:Q9U2X1 DIP:DIP-25339N IntAct:Q9U2X1
            MINT:MINT-1058673 STRING:Q9U2X1 MEROPS:C01.A47
            EnsemblMetazoa:Y113G7B.15 GeneID:190976 KEGG:cel:CELE_Y113G7B.15
            UCSC:Y113G7B.15 CTD:190976 WormBase:Y113G7B.15 eggNOG:NOG302449
            OMA:AEEDIME Uniprot:Q9U2X1
        Length = 362

 Score = 335 (123.0 bits), Expect = 2.3e-30, P = 2.3e-30
 Identities = 97/356 (27%), Positives = 163/356 (45%)

Query:    18 MFIIITL-LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
             +FII  L L+S     V+  S    S          +H R+  ++  +        + ++
Sbjct:     3 LFIISPLFLLSLCQPTVTQHSQEVLSHFNNFTMHHKKHYRTPAEKDRRLAHFAKNHQKIQ 62

Query:    77 YIE-KANKEGNRTYKLGTNQFSDLTNDEFRAL--------YTGYKMPSPSH-RXXXXXXF 126
              +  KA +EG R    G N+F+D    E  A         +T   +  P H R       
Sbjct:    63 ELNAKARREG-RNVTFGWNKFADKNRQELSARNSKIHPKNHTDLPIYKPRHPRGSRNHHN 121

Query:   127 KYQNLSMTDVPTSLDWRD---KGA--VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLI 181
             K       D+P   D RD    G+  V P+K+Q++CGCCWAFA  A  E    + S +  
Sbjct:   122 KRSKRQSGDIPDYFDLRDIYVDGSPVVGPVKDQEQCGCCWAFATTAITEAANTLYSKSFT 181

Query:   182 QLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIATEDEYPYQA----VPGTCSAAQ 236
              LS+Q++ DC+ +G+  GC+GG        ++  +G +++ +YPY+       G C   +
Sbjct:   182 SLSDQEICDCADSGDTPGCVGGDPRNGLK-MVHLRGQSSDGDYPYEEYRANTTGNCVGDE 240

Query:   237 KPAAAK---ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE-FQSYKEGIFNGVCGTQ 292
             K    +   ++ Y       E+ +++ + +  +  A+     E F+ Y  G+       Q
Sbjct:   241 KSTVIQPETLNVYRFDQDYAEEDIMENLYLNHIPTAVYFRVGENFEWYTSGVLQSEDCYQ 300

Query:   293 LD----HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSS 344
             +     H+V IVG+GT++DG  YWL++NSW + WG  GY+KI R    C I + ++
Sbjct:   301 MTPAEWHSVAIVGYGTSDDGVPYWLVRNSWNSDWGLHGYVKIRRGVNWCLIESHAA 356


>DICTYBASE|DDB_G0276111 [details] [associations]
            symbol:DDB_G0276111 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0276111 Pfam:PF00188
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            PROSITE:PS00139 EMBL:AAFI02000014 Gene3D:3.40.33.10
            InterPro:IPR001283 InterPro:IPR014044 PRINTS:PR00837 SMART:SM00198
            SUPFAM:SSF55797 ProtClustDB:CLSZ2429919 RefSeq:XP_643261.1
            ProteinModelPortal:Q75JH0 EnsemblProtists:DDB0169514 GeneID:8620304
            KEGG:ddi:DDB_G0276111 InParanoid:Q75JH0 OMA:GFVTSIK Uniprot:Q75JH0
        Length = 415

 Score = 333 (122.3 bits), Expect = 3.8e-30, P = 3.8e-30
 Identities = 80/213 (37%), Positives = 123/213 (57%)

Query:   136 VPTS----LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL----IQLSEQQ 187
             +PTS    +DW+  G VT IKNQ +CG C++FA  AA+E    I++ NL    I LSEQ 
Sbjct:   205 LPTSSTGDVDWKSLGFVTSIKNQGQCGGCYSFATCAALESAYLIKN-NLPNTDIDLSEQN 263

Query:   188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC-SAAQKPAAAKISNY 246
              + C    N GC GG+ +     + ++ GI  E  YPY+AV G+C +  Q P   K + Y
Sbjct:   264 FVSCV---NYGCGGGNGQSCLDKL-KSTGIMYETSYPYKAVTGSCPNVIQSPQPFKWTGY 319

Query:   247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
               +  G+++A L A+   P+  ++   S  FQ YK GI++    +  +HA+TIVG+ + +
Sbjct:   320 SNI-QGNKEAFLNALKSGPIYASLYVDSG-FQLYKSGIYSCSQSSTPNHAITIVGYSSAD 377

Query:   307 DGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
                N +LIKNSWG  +G++GY+++   EG C +
Sbjct:   378 ---NSYLIKNSWGTIYGESGYIRL--KEGSCNL 405


>WB|WBGene00044760 [details] [associations]
            symbol:Y71H2AM.25 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0004181
            "metallocarboxypeptidase activity" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            GeneTree:ENSGT00560000076599 EMBL:FO081822 eggNOG:NOG331187
            HOGENOM:HOG000114005 RefSeq:NP_001040887.1
            ProteinModelPortal:Q2AAB9 SMR:Q2AAB9 EnsemblMetazoa:Y71H2AM.25
            GeneID:4363054 KEGG:cel:CELE_Y71H2AM.25 UCSC:Y71H2AM.25 CTD:4363054
            WormBase:Y71H2AM.25 InParanoid:Q2AAB9 NextBio:959635 Uniprot:Q2AAB9
        Length = 299

 Score = 323 (118.8 bits), Expect = 4.4e-29, P = 4.4e-29
 Identities = 93/298 (31%), Positives = 141/298 (47%)

Query:    57 SYKDELEKEMRLKIFKE-NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS 115
             S  DE +K+   K+F     ++ EK  K  N   KL +     LT   F  +  GY +  
Sbjct:     9 SKADEAKKQ---KVFVALGEDFFEKPAKSRN---KLNSIILFTLTALTFYII--GYLVQQ 60

Query:   116 -PSHRXXXXXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGI-T 173
               S        +K    ++      LDWRDKG V P+K+Q +C    AFA  +++E +  
Sbjct:    61 RTSESLPTTFQWKTPKYTIQTTEEFLDWRDKGIVGPVKDQGKCNASHAFAISSSIESMYA 120

Query:   174 KIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVP-GTC 232
             K  +G+L+  SEQQL+DC  +G  GC       A +Y I + GI TE +YPY     G C
Sbjct:   121 KATNGSLLSFSEQQLIDCDDHGFKGCEEQPAINAVSYFIFH-GIETEADYPYAGKENGKC 179

Query:   233 SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV---C 289
             +     +  ++ + E V S + Q      +  P    + A  + +  YK GI+N     C
Sbjct:   180 TFDSTKSKIQLKDAEFVVSNETQGKELVTNYGPAFFTMRAPPSLYD-YKIGIYNPSIEEC 238

Query:   290 -GTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYP 346
               T    ++ IVG+G  E    YW++K S+G +WG+ GYMK+ RD   C +    + P
Sbjct:   239 TSTHEIRSMVIVGYGI-EGVQKYWIVKGSFGTSWGEQGYMKLARDVNACAMADFITVP 295


>WB|WBGene00019314 [details] [associations]
            symbol:K02E7.10 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 SMART:SM00645 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 HSSP:P07711
            EMBL:FO080411 PIR:T32392 RefSeq:NP_493904.1 UniGene:Cel.14828
            ProteinModelPortal:O17255 SMR:O17255 EnsemblMetazoa:K02E7.10
            GeneID:186889 KEGG:cel:CELE_K02E7.10 UCSC:K02E7.10 CTD:186889
            WormBase:K02E7.10 eggNOG:NOG331187 HOGENOM:HOG000114005
            InParanoid:O17255 OMA:GNANEAR NextBio:933344 Uniprot:O17255
        Length = 299

 Score = 319 (117.4 bits), Expect = 1.2e-28, P = 1.2e-28
 Identities = 76/229 (33%), Positives = 117/229 (51%)

Query:   127 KYQN-LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGI-TKIRSGNLIQLS 184
             +YQ  LS       LDWR+KG V P+K+Q +C   +AFAA+AA+E +  K  +G L+  S
Sbjct:    70 QYQTKLSHHMTQDFLDWREKGIVGPVKDQGKCNASYAFAAIAAIESMYAKANNGKLLSFS 129

Query:   185 EQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKIS 244
             EQQ++DC+ N  N C            ++  G+ TE +YPY             +  K+ 
Sbjct:   130 EQQIIDCA-NFTNPCQENLENVLSNRFLKENGVGTEADYPYVGKENVGKCEYDSSKMKLR 188

Query:   245 -NYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV---CGTQLD-HAVTI 299
               Y +V   +E A     +       + +  + F  YK GI+N     CG   +  ++ I
Sbjct:   189 PTYIDVYPNEEWARAHITTFGTGYFRMRSPPSFFH-YKTGIYNPTKEECGNANEARSLAI 247

Query:   300 VGFGTTEDGAN-YWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
             VG+G  +DGA  YW++K S+G +WG+ GYMK+ R+   CG+    S P+
Sbjct:   248 VGYG--KDGAEKYWIVKGSFGTSWGEHGYMKLARNVNACGMAESISIPI 294


>UNIPROTKB|F1STR1 [details] [associations]
            symbol:CTSC "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0004252
            "serine-type endopeptidase activity" evidence=IEA] [GO:0001913 "T
            cell mediated cytotoxicity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 KO:K01275 InterPro:IPR014882
            Pfam:PF08773 OMA:YDDFLHY GO:GO:0001913 EMBL:CU855751
            RefSeq:XP_003129789.1 UniGene:Ssc.6155 Ensembl:ENSSSCT00000016280
            GeneID:100522387 KEGG:ssc:100522387 Uniprot:F1STR1
        Length = 463

 Score = 308 (113.5 bits), Expect = 2.3e-27, P = 2.3e-27
 Identities = 86/295 (29%), Positives = 141/295 (47%)

Query:    63 EKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXX 122
             +K+   +++K N ++++  N            ++  LT  E      GY    P  +   
Sbjct:   160 QKKYSNRLYKYNHDFVKAINGIQKSWTATAYMEYETLTLKEMTQRGGGYNQRLPRPKPAP 219

Query:   123 XXXFKYQNLSMTDVPTSLDWRD-KGA--VTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
                 + Q  S+  +P S DWR+ +G   VTP++NQ  CG C++FA++  +E   +I + N
Sbjct:   220 ITA-EIQEKSL-HLPASWDWRNVRGTNFVTPVRNQASCGSCYSFASMGMMEARIRILTNN 277

Query:   180 LIQ--LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK 237
                  LS Q+++ CS     GC GG          Q+ G+  E  +PY      C+  + 
Sbjct:   278 TQTPILSPQEVVSCSQYAQ-GCAGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCTVKEG 336

Query:   238 PAAAKISNYEEVPS---GDEQALLKA--VSMQPVSIAIAAYSTEFQSYKEGIFN--GVCG 290
                   S Y  V     G  +AL+K   V   P+++A   Y  +F  Y++GI++  G+  
Sbjct:   337 CFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYD-DFLHYRKGIYHHTGLRD 395

Query:   291 T----QL-DHAVTIVGFGTT-EDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
                  +L +HAV +VG+GT    G +YW++KNSWG +WG+ GY +I R    C I
Sbjct:   396 PFNPFELTNHAVLLVGYGTDLASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAI 450


>ZFIN|ZDB-GENE-030619-9 [details] [associations]
            symbol:ctsc "cathepsin C" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030619-9 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 MEROPS:C01.070 HSSP:P43235
            EMBL:BC064286 IPI:IPI00486570 RefSeq:NP_999887.1 UniGene:Dr.32463
            ProteinModelPortal:Q6P2V1 SMR:Q6P2V1 PRIDE:Q6P2V1 GeneID:368704
            KEGG:dre:368704 InParanoid:Q6P2V1 NextBio:20813127
            ArrayExpress:Q6P2V1 Bgee:Q6P2V1 Uniprot:Q6P2V1
        Length = 455

 Score = 303 (111.7 bits), Expect = 7.6e-27, P = 7.6e-27
 Identities = 77/222 (34%), Positives = 117/222 (52%)

Query:   136 VPTSLDWRD-KGA--VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ--LSEQQLLD 190
             +P   DWR+  G   V+P++NQ +CG C++FA +  +E   +I++ N  Q   S QQ++ 
Sbjct:   224 LPQHWDWRNVNGVNFVSPVRNQAQCGSCYSFATMGMLEARVRIQTNNTQQPVFSPQQVVS 283

Query:   191 CSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVP 250
             CS   + GC GG       YI Q+ GI  ED +PY      C+   K      S+Y  V 
Sbjct:   284 CSQY-SQGCDGGFPYLIGKYI-QDFGIVEEDCFPYTGSDSPCNLPAKCTKYYASDYHYVG 341

Query:   251 S--G--DEQAL-LKAVSMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQ-----LDHAVT 298
                G   E A+ L+ V   P+ +A+  Y  +F +YKEGI++  G+          +HAV 
Sbjct:   342 GFYGGCSESAMMLELVKNGPMGVALEVYP-DFMNYKEGIYHHTGLRDANNPFELTNHAVL 400

Query:   299 IVGFGTT-EDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             +VG+G   + G  YW++KNSWG+ WG+ G+ +I R    C I
Sbjct:   401 LVGYGQCHKTGEKYWIVKNSWGSGWGENGFFRIRRGTDECAI 442


>UNIPROTKB|F1N455 [details] [associations]
            symbol:CTSC "Dipeptidyl peptidase 1 exclusion domain chain"
            species:9913 "Bos taurus" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 IPI:IPI00697314 UniGene:Bt.49573
            InterPro:IPR014882 Pfam:PF08773 OMA:YDDFLHY GO:GO:0001913
            EMBL:DAAA02062487 EMBL:DAAA02062488 Ensembl:ENSBTAT00000014735
            Uniprot:F1N455
        Length = 463

 Score = 299 (110.3 bits), Expect = 2.6e-26, P = 2.6e-26
 Identities = 86/298 (28%), Positives = 141/298 (47%)

Query:    62 LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGY--KMPSPSHR 119
             LE+    ++++ N ++++  N            ++  LT  E      G+  ++P P   
Sbjct:   159 LEETYSNRLYRYNHDFVKAINAIQKSWTAAPYMEYETLTLKEMIRRGGGHSRRIPRPKPA 218

Query:   120 XXXXXXFKYQNLSMTDVPTSLDWRD-KGA--VTPIKNQKECGCCWAFAAVAAVEGITKIR 176
                    K     +  +PTS DWR+  G   VTP++NQ  CG C++FA++  +E   +I 
Sbjct:   219 PITAEIQK----KILHLPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRIL 274

Query:   177 SGNLIQ--LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA 234
             + N     LS Q+++ CS     GC GG          Q+ G+  ED +PY      C  
Sbjct:   275 TNNTQTPILSPQEVVSCSQYAQ-GCEGGFPYLIAGKYAQDFGLVEEDCFPYTGTDSPCRL 333

Query:   235 AQKPAAAKISNYEEVPS---GDEQALLKA--VSMQPVSIAIAAYSTEFQSYKEGIFN--G 287
              +       S Y  V     G  +AL+K   V   P+++A   Y  +F  Y++G+++  G
Sbjct:   334 KEGCFRYYSSEYHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYD-DFLHYRKGVYHHTG 392

Query:   288 VCGT----QL-DHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             +       +L +HAV +VG+GT    G +YW++KNSWG +WG+ GY +I R    C I
Sbjct:   393 LRDPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAI 450


>UNIPROTKB|Q3ZCJ8 [details] [associations]
            symbol:CTSC "Dipeptidyl peptidase 1" species:9913 "Bos
            taurus" [GO:0031638 "zymogen activation" evidence=IDA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 EMBL:BC102115 IPI:IPI00697314 RefSeq:NP_001028789.1
            UniGene:Bt.49573 ProteinModelPortal:Q3ZCJ8 SMR:Q3ZCJ8 STRING:Q3ZCJ8
            PRIDE:Q3ZCJ8 GeneID:352958 KEGG:bta:352958 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 InParanoid:Q3ZCJ8 KO:K01275
            OrthoDB:EOG4H19VZ BindingDB:Q3ZCJ8 ChEMBL:CHEMBL1075050
            NextBio:20812686 GO:GO:0031638 InterPro:IPR014882 Pfam:PF08773
            Uniprot:Q3ZCJ8
        Length = 463

 Score = 299 (110.3 bits), Expect = 2.6e-26, P = 2.6e-26
 Identities = 86/298 (28%), Positives = 141/298 (47%)

Query:    62 LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGY--KMPSPSHR 119
             LE+    ++++ N ++++  N            ++  LT  E      G+  ++P P   
Sbjct:   159 LEETYSNRLYRYNHDFVKAINAIQKSWTAAPYMEYETLTLKEMIRRGGGHSRRIPRPKPA 218

Query:   120 XXXXXXFKYQNLSMTDVPTSLDWRD-KGA--VTPIKNQKECGCCWAFAAVAAVEGITKIR 176
                    K     +  +PTS DWR+  G   VTP++NQ  CG C++FA++  +E   +I 
Sbjct:   219 PITAEIQK----KILHLPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRIL 274

Query:   177 SGNLIQ--LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA 234
             + N     LS Q+++ CS     GC GG          Q+ G+  ED +PY      C  
Sbjct:   275 TNNTQTPILSPQEVVSCSQYAQ-GCEGGFPYLIAGKYAQDFGLVEEDCFPYTGTDSPCRL 333

Query:   235 AQKPAAAKISNYEEVPS---GDEQALLKA--VSMQPVSIAIAAYSTEFQSYKEGIFN--G 287
              +       S Y  V     G  +AL+K   V   P+++A   Y  +F  Y++G+++  G
Sbjct:   334 KEGCFRYYSSEYHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYD-DFLHYRKGVYHHTG 392

Query:   288 VCGT----QL-DHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             +       +L +HAV +VG+GT    G +YW++KNSWG +WG+ GY +I R    C I
Sbjct:   393 LRDPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAI 450


>UNIPROTKB|F1RWA9 [details] [associations]
            symbol:CTSO "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 EMBL:CU855637
            Ensembl:ENSSSCT00000009707 OMA:WAFSIVG Uniprot:F1RWA9
        Length = 194

 Score = 296 (109.3 bits), Expect = 3.2e-26, P = 3.2e-26
 Identities = 67/189 (35%), Positives = 101/189 (53%)

Query:   156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
             +CG CWAF+ V+AVE    I+   L  LS QQ++DCS N N GC GGS   A  ++ + Q
Sbjct:     1 QCGGCWAFSVVSAVESAYAIKGQPLEVLSVQQVIDCSYN-NYGCNGGSTLNALYWLNKTQ 59

Query:   216 -GIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVP-SGDEQALLKAV-SMQPVSIAIA 271
               + ++ EYP++A  G C       +   I +Y     SG E  + K + ++ P+ + + 
Sbjct:    60 VKVVSDSEYPFKAQNGLCHYFSCSHSGVSIKDYSAYDFSGQEDEMAKTLLTLGPLIVIVD 119

Query:   272 AYSTEFQSYKEGIFNGVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
             A S  +Q Y  GI    C + + +HAV + GF  T     YW+++NSWG+ WG  GY  +
Sbjct:   120 AVS--WQDYLGGIIQHHCSSGEANHAVLVTGFDKT-GSTPYWIVRNSWGSAWGIDGYALV 176

Query:   331 VRDEGLCGI 339
                  +CGI
Sbjct:   177 KMGGNICGI 185


>WB|WBGene00008861 [details] [associations]
            symbol:F15D4.4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 SMART:SM00848 PROSITE:PS00640
            GeneTree:ENSGT00560000076599 EMBL:Z80344 HSSP:P53634
            eggNOG:NOG310593 PIR:T20981 ProteinModelPortal:Q93512 SMR:Q93512
            MEROPS:C01.A45 EnsemblMetazoa:F15D4.4 KEGG:cel:CELE_F15D4.4
            UCSC:F15D4.4 CTD:184530 WormBase:F15D4.4 InParanoid:Q93512
            OMA:ITMEQNI NextBio:925068 Uniprot:Q93512
        Length = 608

 Score = 303 (111.7 bits), Expect = 3.4e-26, P = 3.4e-26
 Identities = 88/289 (30%), Positives = 137/289 (47%)

Query:    67 RLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXX 123
             R  ++ +  + +++ N   + G  +YK+ TNQFS   + E   L       +P+      
Sbjct:   154 RFNVYSKVKKEVDEHNIMYELGMSSYKMSTNQFSVALDGEVAPLTLNLDALTPTATVIPA 213

Query:   124 XXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
                   +    D   ++DWR    + PI +Q  CG CWAF+ ++ +E    I+  N   L
Sbjct:   214 TI---SSRKKRDTEPTVDWRP--FLKPILDQSTCGGCWAFSMISMIESFFAIQGYNTSSL 268

Query:   184 SEQQLLDC-----STNG--NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC-SAA 235
             S QQLL C     ST G  N GC GG  + A +Y+ +          P+     +C S+ 
Sbjct:   269 SVQQLLTCDTKVDSTYGLANVGCKGGYFQIAGSYL-EVSAARDASLIPFDLEDTSCDSSF 327

Query:   236 QKPAAAKISNYEE-VPSGDEQALLKAVSMQPVS-------IAIA-AYSTEFQSYKEGIFN 286
               P    I  +++   SG+  A       Q +        IA+  A   +   Y EG+++
Sbjct:   328 FPPVVPTILLFDDGYISGNFTAAQLITMEQNIEDKVRKGPIAVGMAAGPDIYKYSEGVYD 387

Query:   287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG 335
             G CGT ++HAV IVGF  T+D   YW+I+NSWG +WG+AGY ++ R  G
Sbjct:   388 GDCGTIINHAVVIVGF--TDD---YWIIRNSWGASWGEAGYFRVKRTPG 431


>UNIPROTKB|O97578 [details] [associations]
            symbol:CTSC "Dipeptidyl peptidase 1" species:9615 "Canis
            lupus familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 EMBL:AF060171 RefSeq:NP_001182763.1
            UniGene:Cfa.28653 ProteinModelPortal:O97578 SMR:O97578
            MEROPS:C01.070 PRIDE:O97578 GeneID:403458 KEGG:cfa:403458
            InParanoid:O97578 NextBio:20816976 Uniprot:O97578
        Length = 435

 Score = 295 (108.9 bits), Expect = 4.3e-26, P = 4.3e-26
 Identities = 88/298 (29%), Positives = 142/298 (47%)

Query:    60 DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR 119
             + L++    +++K N E+++  N            ++  LT  +      G K+P P   
Sbjct:   132 ERLQENNSNRLYKYNYEFVKAINTIQKSWTATRYIEYETLTLRDMMTRVGGRKIPRPKPT 191

Query:   120 XXXXXXFKYQNLSMTDVPTSLDWRD-KGA--VTPIKNQKECGCCWAFAAVAAVEGITKIR 176
                     ++ +S   +PTS DWR+ +G   V+P++NQ  CG C+AFA+ A +E   +I 
Sbjct:   192 PLTAEI--HEEISR--LPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTAMLEARIRIL 247

Query:   177 SGNLIQ--LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA 234
             + N     LS Q+++ CS     GC GG          Q+ G+  E  +PY      C  
Sbjct:   248 TNNTQTPILSPQEIVSCSQYAQ-GCEGGFPYLIAGKYAQDFGLVEEACFPYAGSDSPCKP 306

Query:   235 AQKPAAAKISNYEEVPS--GD-EQALLKA--VSMQPVSIAIAAYSTEFQSYKEGIF--NG 287
                      S Y  V    G   +AL+K   V   P+++A   Y  +F  Y++GI+   G
Sbjct:   307 -NDCFRYYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYD-DFFHYQKGIYYHTG 364

Query:   288 VCGT----QL-DHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             +       +L +HAV +VG+GT +  G +YW++KNSWG+ WG+ GY +I R    C I
Sbjct:   365 LRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAI 422


>TAIR|locus:505006093 [details] [associations]
            symbol:AT1G02305 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0005829 "cytosol" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684 GO:GO:0005773
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 HOGENOM:HOG000241341
            KO:K01363 PANTHER:PTHR12411:SF16 OMA:CCGFLCG UniGene:At.23486
            UniGene:At.42610 UniGene:At.43952 EMBL:AY039887 EMBL:AF428337
            EMBL:BT002227 IPI:IPI00524601 RefSeq:NP_563648.1 HSSP:P07858
            ProteinModelPortal:Q93VC9 SMR:Q93VC9 IntAct:Q93VC9 STRING:Q93VC9
            MEROPS:C01.049 PRIDE:Q93VC9 ProMEX:Q93VC9 EnsemblPlants:AT1G02305.1
            GeneID:839538 KEGG:ath:AT1G02305 TAIR:At1g02305 InParanoid:Q93VC9
            PhylomeDB:Q93VC9 ProtClustDB:CLSN2687619 Genevestigator:Q93VC9
            Uniprot:Q93VC9
        Length = 362

 Score = 207 (77.9 bits), Expect = 6.4e-26, Sum P(2) = 6.4e-26
 Identities = 38/98 (38%), Positives = 57/98 (58%)

Query:   243 ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLD-HAVTIVG 301
             +S Y+ V S  +  + +     PV +A   Y  +F  YK G++  + GT +  HAV ++G
Sbjct:   238 VSAYK-VRSHPDDIMAEVYKNGPVEVAFTVYE-DFAHYKSGVYKHITGTNIGGHAVKLIG 295

Query:   302 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             +GT++DG +YWL+ N W  +WGD GY KI R    CGI
Sbjct:   296 WGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGI 333

 Score = 142 (55.0 bits), Expect = 6.4e-26, Sum P(2) = 6.4e-26
 Identities = 51/186 (27%), Positives = 82/186 (44%)

Query:    60 DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTN-QFSDLTNDEFRALYTGYKMPSPSH 118
             + L K+       +N E +++ N+  N  +K   N +F++ T  EF+ L  G K P+P  
Sbjct:    34 ENLSKQKLTSWILQN-EIVKEVNENPNAGWKASFNDRFANATVAEFKRLL-GVK-PTPKT 90

Query:   119 RXXXXXXFKYQNLSMTDVPTSLD----WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITK 174
                      + ++S+  +P   D    W    ++  I +Q  CG CWAF AV ++     
Sbjct:    91 EFLGVPIVSH-DISLK-LPKEFDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFC 148

Query:   175 IRSGNLIQLSEQQLLDC-STNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS 233
             I+    + LS   LL C       GC GG    A+ Y  ++ G+ TE+  PY    G   
Sbjct:   149 IKYNMNVSLSVNDLLACCGFLCGQGCNGGYPIAAWRYF-KHHGVVTEECDPYFDNTGCSH 207

Query:   234 AAQKPA 239
                +PA
Sbjct:   208 PGCEPA 213


>UNIPROTKB|F1NWG2 [details] [associations]
            symbol:CTSC "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 InterPro:IPR014882 Pfam:PF08773
            OMA:YDDFLHY GO:GO:0001913 EMBL:AADN02004805 IPI:IPI00577371
            Ensembl:ENSGALT00000027869 Uniprot:F1NWG2
        Length = 463

 Score = 295 (108.9 bits), Expect = 7.7e-26, P = 7.7e-26
 Identities = 77/225 (34%), Positives = 119/225 (52%)

Query:   133 MTDVPTSLDWRD-KGA--VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ--LSEQQ 187
             ++ +P S DWR+  G   V+P++NQ  CG C+AFA++  +E   +I + N  +   S QQ
Sbjct:   228 VSGLPESWDWRNVNGVNYVSPVRNQASCGSCYAFASMGMLEARIRILTNNTQKPVFSPQQ 287

Query:   188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYE 247
             ++ CS   + GC GG         +Q+ G+  ED +PY A    C   +       S Y 
Sbjct:   288 VVSCSQY-SQGCDGGFPYLIAGKYVQDFGVVEEDCFPYTAKDTPCLFKRSCYHYYTSEYH 346

Query:   248 EVPS--GD-EQALLKA--VSMQPVSIAIAAYSTEFQSYKEGIFN--GVCGT----QL-DH 295
              V    G   +AL+K   V   P+++A   Y+ +F  YKEGI++  G+       +L +H
Sbjct:   347 YVGGFYGACNEALMKLELVLSGPMAVAFEVYN-DFMFYKEGIYHHTGLKDEFNPFELTNH 405

Query:   296 AVTIVGFGTT-EDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             AV +VG+G   E G  +W++KNSWG +WG+ GY +I R    C I
Sbjct:   406 AVLLVGYGKDPESGEKFWIVKNSWGTSWGEDGYFRIRRGTDECAI 450


>WB|WBGene00022189 [details] [associations]
            symbol:Y71H2AR.2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0004181
            "metallocarboxypeptidase activity" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] [GO:0008340 "determination of adult lifespan"
            evidence=IMP] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0008340 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 HSSP:P07711
            eggNOG:NOG331187 HOGENOM:HOG000114005 EMBL:FO081570
            RefSeq:NP_497627.1 UniGene:Cel.28419 ProteinModelPortal:Q9BL26
            SMR:Q9BL26 EnsemblMetazoa:Y71H2AR.2 GeneID:190615
            KEGG:cel:CELE_Y71H2AR.2 UCSC:Y71H2AR.2 CTD:190615
            WormBase:Y71H2AR.2 InParanoid:Q9BL26 OMA:CAMATTI NextBio:946382
            Uniprot:Q9BL26
        Length = 345

 Score = 292 (107.8 bits), Expect = 8.4e-26, P = 8.4e-26
 Identities = 72/210 (34%), Positives = 107/210 (50%)

Query:   140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGI-TKIRSGNLIQLSEQQLLDCSTNGNNG 198
             LDWR+KG V P+K+Q +C    AFA  +++E +  K  +G L+  SEQQL+DC+  G  G
Sbjct:    86 LDWREKGIVGPVKDQGKCNASHAFAITSSIESMYAKATNGTLLSFSEQQLIDCNDQGYKG 145

Query:   199 CLGGSREKAFAYIIQNQGIATEDEYPY-QAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
             C       A  Y+  + GI TE +YPY       C+       +KI   + V +   + L
Sbjct:   146 CEEQFAMNAIGYLATH-GIETEADYPYVDKTNEKCTFDS--TKSKIHLKKGVVAEGNEVL 202

Query:   258 LKA--VSMQPVSIAIAAYSTEFQSYKEGIFNGV---C-GTQLDHAVTIVGFGTTEDGANY 311
              K    +  P    + A  + +  YK GI+N     C  T    ++ IVG+G  E    Y
Sbjct:   203 GKVYVTNYGPAFFTMRAPPSLYD-YKIGIYNPSIEECTSTHEIRSMVIVGYGI-EGEQKY 260

Query:   312 WLIKNSWGNTWGDAGYMKIVRDEGLCGIGT 341
             W++K S+G +WG+ GYMK+ RD   C + T
Sbjct:   261 WIVKGSFGTSWGEQGYMKLARDVNACAMAT 290


>DICTYBASE|DDB_G0288221 [details] [associations]
            symbol:DDB_G0288221 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0288221 Pfam:PF00188 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 PANTHER:PTHR12411 EMBL:AAFI02000109 Gene3D:3.40.33.10
            InterPro:IPR001283 InterPro:IPR014044 SMART:SM00198 SUPFAM:SSF55797
            MEROPS:C01.A52 ProtClustDB:CLSZ2429919 RefSeq:XP_636852.1
            ProteinModelPortal:Q54J84 EnsemblProtists:DDB0187839 GeneID:8626520
            KEGG:ddi:DDB_G0288221 InParanoid:Q54J84 Uniprot:Q54J84
        Length = 395

 Score = 284 (105.0 bits), Expect = 5.9e-25, P = 5.9e-25
 Identities = 72/213 (33%), Positives = 114/213 (53%)

Query:   139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSG----NLIQLSEQQLLDCSTN 194
             S+DW D    TP+++Q EC  CW F ++AA+E    I++G    + + LS Q  ++C T+
Sbjct:   191 SVDWSDYQ--TPVRDQGECKSCWVFGSLAALESRYLIKNGVSEKSTLHLSAQNAMNCITS 248

Query:   195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
             G   C  G     F Y  ++ GIA E +YPY A+ G+ +        + S Y+ V +  +
Sbjct:   249 G---CESGWPANVFDYF-ESSGIAFEKDYPYDAI-GSDNCTSSSNKFEYSGYDSVEN-TK 302

Query:   255 QALLKAVSMQPVSIAIAAYS-TEFQSYKEGIFNGVCGTQ-LDHAVTIVGFGTTEDGANYW 312
              +L++ +   P++IA+  YS T FQSY  GI++ V   + ++H V +VG+    D    W
Sbjct:   303 DSLIQELKNGPITIAL--YSDTAFQSYAGGIYDSVEEYKDVNHIVLLVGYDKPTDS---W 357

Query:   313 LIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSY 345
              IKNS G  WG+ GY +I       GI   +S+
Sbjct:   358 KIKNSLGTKWGELGYARITASNDKLGILLYNSF 390


>UNIPROTKB|F1PSK8 [details] [associations]
            symbol:F1PSK8 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 InterPro:IPR014882
            Pfam:PF08773 EMBL:AAEX03012741 Ensembl:ENSCAFT00000007054
            Uniprot:F1PSK8
        Length = 405

 Score = 284 (105.0 bits), Expect = 5.9e-25, P = 5.9e-25
 Identities = 88/299 (29%), Positives = 142/299 (47%)

Query:    60 DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR 119
             + L++    +++K N E+++  N            ++  LT  +      G K+P P   
Sbjct:   101 ERLQENNSNRLYKYNYEFVKAINTIQKSWTATRYIEYETLTLRDMMTRGGGRKIPRPKPT 160

Query:   120 XXXXXXFKYQNLSMTDVPTSLDWRD-KGA--VTPIKNQK-ECGCCWAFAAVAAVEGITKI 175
                     ++ +S   +PTS DWR+ +G   V+P++NQ   CG C+AFA+ A +E   +I
Sbjct:   161 PLTAEI--HEEISR--LPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRI 216

Query:   176 RSGNLIQ--LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS 233
              + N     LS Q+++ CS     GC GG          Q+ G+  E  +PY      C 
Sbjct:   217 LTNNTQTPILSPQEIVSCSQYAQ-GCEGGFPYLIAGKYAQDFGLVEEACFPYAGSDSPCK 275

Query:   234 AAQKPAAAKISNYEEVPS--GD-EQALLKA--VSMQPVSIAIAAYSTEFQSYKEGIF--N 286
                       S Y  V    G   +AL+K   V   P+++A   Y  +F  Y++GI+   
Sbjct:   276 P-NDCFRYYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYD-DFFHYQKGIYYHT 333

Query:   287 GVCGT----QL-DHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             G+       +L +HAV +VG+GT +  G +YW++KNSWG+ WG+ GY +I R    C I
Sbjct:   334 GLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAI 392


>UNIPROTKB|P53634 [details] [associations]
            symbol:CTSC "Dipeptidyl peptidase 1" species:9606 "Homo
            sapiens" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0005794
            "Golgi apparatus" evidence=IEA] [GO:0007568 "aging" evidence=IEA]
            [GO:0010033 "response to organic substance" evidence=IEA]
            [GO:0031404 "chloride ion binding" evidence=IEA] [GO:0042802
            "identical protein binding" evidence=IEA] [GO:0043621 "protein
            self-association" evidence=IEA] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0006508 "proteolysis" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0006955
            "immune response" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005783 GO:GO:0005794 Reactome:REACT_6900
            GO:GO:0006955 GO:GO:0007568 GO:GO:0010033 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004252 GO:GO:0005764 GO:GO:0031404 CTD:1075
            HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ InterPro:IPR014882
            Pfam:PF08773 MEROPS:C01.070 EMBL:X87212 EMBL:U79415 EMBL:AF234263
            EMBL:AF234264 EMBL:AF254757 EMBL:AF525032 EMBL:AF525033
            EMBL:AK292117 EMBL:AK311923 EMBL:AK223038 EMBL:BX537913
            EMBL:AC011088 EMBL:CH471185 EMBL:BC054028 EMBL:BC100891
            EMBL:BC100892 EMBL:BC100893 EMBL:BC100894 EMBL:BC109386
            EMBL:BC110071 EMBL:BC113850 EMBL:BC113897 IPI:IPI00022810
            IPI:IPI00171323 IPI:IPI00872258 PIR:S23941 PIR:S66504
            RefSeq:NP_001107645.1 RefSeq:NP_001805.3 RefSeq:NP_680475.1
            UniGene:Hs.128065 PDB:1K3B PDB:2DJF PDB:2DJG PDB:3PDF PDBsum:1K3B
            PDBsum:2DJF PDBsum:2DJG PDBsum:3PDF ProteinModelPortal:P53634
            SMR:P53634 IntAct:P53634 MINT:MINT-4655964 STRING:P53634
            PhosphoSite:P53634 DMDM:1705632 PaxDb:P53634 PRIDE:P53634
            DNASU:1075 Ensembl:ENST00000227266 Ensembl:ENST00000524463
            Ensembl:ENST00000529974 GeneID:1075 KEGG:hsa:1075 UCSC:uc001pck.4
            UCSC:uc001pcm.4 GeneCards:GC11M088026 HGNC:HGNC:2528 HPA:CAB025364
            MIM:170650 MIM:245000 MIM:245010 MIM:602365 neXtProt:NX_P53634
            Orphanet:2342 Orphanet:678 PharmGKB:PA27028 HOGENOM:HOG000127503
            InParanoid:P53634 OMA:YDDFLHY PhylomeDB:P53634
            BioCyc:MetaCyc:HS03265-MONOMER SABIO-RK:P53634 BindingDB:P53634
            ChEMBL:CHEMBL2252 EvolutionaryTrace:P53634 GenomeRNAi:1075
            NextBio:4488 PMAP-CutDB:P53634 ArrayExpress:P53634 Bgee:P53634
            Genevestigator:P53634 GermOnline:ENSG00000109861 GO:GO:0001913
            Uniprot:P53634
        Length = 463

 Score = 287 (106.1 bits), Expect = 6.2e-25, P = 6.2e-25
 Identities = 75/222 (33%), Positives = 114/222 (51%)

Query:   136 VPTSLDWRD-KGA--VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ--LSEQQLLD 190
             +PTS DWR+  G   V+P++NQ  CG C++FA++  +E   +I + N     LS Q+++ 
Sbjct:   231 LPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVS 290

Query:   191 CSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVP 250
             CS     GC GG          Q+ G+  E  +PY      C   +       S Y  V 
Sbjct:   291 CSQYAQ-GCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCKMKEDCFRYYSSEYHYVG 349

Query:   251 S---GDEQALLKA--VSMQPVSIAIAAYSTEFQSYKEGIFN--GVCGT----QL-DHAVT 298
                 G  +AL+K   V   P+++A   Y  +F  YK+GI++  G+       +L +HAV 
Sbjct:   350 GFYGGCNEALMKLELVHHGPMAVAFEVYD-DFLHYKKGIYHHTGLRDPFNPFELTNHAVL 408

Query:   299 IVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             +VG+GT +  G +YW++KNSWG  WG+ GY +I R    C I
Sbjct:   409 LVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAI 450


>UNIPROTKB|J9NSE7 [details] [associations]
            symbol:CTSC "Dipeptidyl peptidase 1" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 InterPro:IPR014882 Pfam:PF08773
            EMBL:AAEX03017125 Ensembl:ENSCAFT00000014269 OMA:INGQICH
            Uniprot:J9NSE7
        Length = 458

 Score = 283 (104.7 bits), Expect = 1.7e-24, P = 1.7e-24
 Identities = 86/298 (28%), Positives = 140/298 (46%)

Query:    60 DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR 119
             + L++    +++K N E+++  N            ++  LT  +      G K+P P   
Sbjct:   155 ERLQENNSNRLYKYNYEFVKAINTIQKSWTATRYIEYETLTLRDMMRRAGGRKIPRPKPT 214

Query:   120 XXXXXXFKYQNLSMTDVPTSLDWRD-KGA--VTPIKNQKECGCCWAFAAVAAVEGITKIR 176
                     ++ +S   +PTS DWR+ +G   V+P++NQ  CG C+AFA+   +E   +I 
Sbjct:   215 PLTAEI--HEEISR--LPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTVMLEARIRIL 270

Query:   177 SGNLIQ--LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA 234
             + N     LS Q+++ CS     GC GG          Q+ G+  E  + Y      C  
Sbjct:   271 TNNTQTPILSPQEIVSCSQYAQ-GCEGGFPYLIAGKYAQDFGLVDEACFSYAGSDSPCKP 329

Query:   235 AQKPAAAKISNYEEVPS--GD-EQALLKA--VSMQPVSIAIAAYSTEFQSYKEGIF--NG 287
                      S Y  V    G   +AL+K   V   P+++A   Y  +F  Y++GI+   G
Sbjct:   330 -NDCFHYYSSEYHYVGGFYGACNEALMKLELVRHGPMAVAFEVYD-DFFHYQKGIYYHTG 387

Query:   288 VCGT----QL-DHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             +       +L +HAV +VG+GT +  G +YW++KNSWG+ WG+ GY +I R    C I
Sbjct:   388 LRDPINPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFQICRGTDECAI 445


>MGI|MGI:109553 [details] [associations]
            symbol:Ctsc "cathepsin C" species:10090 "Mus musculus"
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IGI]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IMP]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0005783 "endoplasmic
            reticulum" evidence=ISO] [GO:0005794 "Golgi apparatus"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=ISO;IMP]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0010033
            "response to organic substance" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0031404 "chloride ion
            binding" evidence=ISO] [GO:0042802 "identical protein binding"
            evidence=ISO] [GO:0043621 "protein self-association" evidence=ISO]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 MGI:MGI:109553 GO:GO:0005783
            GO:GO:0005794 GO:GO:0007568 GO:GO:0010033 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004252 GO:GO:0005764 GO:GO:0031404 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 MEROPS:C01.070 OMA:YDDFLHY
            GO:GO:0001913 EMBL:U89269 EMBL:U74683 EMBL:BC067063 IPI:IPI00130015
            RefSeq:NP_034112.3 UniGene:Mm.322945 ProteinModelPortal:P97821
            SMR:P97821 STRING:P97821 PhosphoSite:P97821 PaxDb:P97821
            PRIDE:P97821 Ensembl:ENSMUST00000032779 GeneID:13032 KEGG:mmu:13032
            InParanoid:P97821 BindingDB:P97821 ChEMBL:CHEMBL3454 ChiTaRS:CTSC
            NextBio:282904 Bgee:P97821 CleanEx:MM_CTSC Genevestigator:P97821
            Uniprot:P97821
        Length = 462

 Score = 280 (103.6 bits), Expect = 3.9e-24, P = 3.9e-24
 Identities = 73/223 (32%), Positives = 115/223 (51%)

Query:   135 DVPTSLDWRD-KGA--VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ--LSEQQLL 189
             ++P S DWR+ +G   V+P++NQ+ CG C++FA++  +E   +I + N     LS Q+++
Sbjct:   229 NLPESWDWRNVQGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVV 288

Query:   190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
              CS     GC GG          Q+ G+  E  +PY A    C   +       S+Y  V
Sbjct:   289 SCSPYAQ-GCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDSPCKPRENCLRYYSSDYYYV 347

Query:   250 PS---GDEQALLKA--VSMQPVSIAIAAYSTEFQSYKEGIFN--GVCGT----QL-DHAV 297
                  G  +AL+K   V   P+++A   +  +F  Y  GI++  G+       +L +HAV
Sbjct:   348 GGFYGGCNEALMKLELVKHGPMAVAFEVHD-DFLHYHSGIYHHTGLSDPFNPFELTNHAV 406

Query:   298 TIVGFGTTE-DGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
              +VG+G     G  YW+IKNSWG+ WG++GY +I R    C I
Sbjct:   407 LLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRGTDECAI 449


>RGD|2445 [details] [associations]
            symbol:Ctsc "cathepsin C" species:10116 "Rattus norvegicus"
          [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA;ISO]
          [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
          [GO:0004252 "serine-type endopeptidase activity" evidence=IEA;ISO]
          [GO:0005764 "lysosome" evidence=IDA;TAS] [GO:0005783 "endoplasmic
          reticulum" evidence=IDA] [GO:0005794 "Golgi apparatus" evidence=IDA]
          [GO:0006508 "proteolysis" evidence=IEP;ISO;TAS] [GO:0007568 "aging"
          evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
          evidence=ISO] [GO:0010033 "response to organic substance"
          evidence=IDA] [GO:0031404 "chloride ion binding" evidence=IDA]
          [GO:0042802 "identical protein binding" evidence=IDA] [GO:0043621
          "protein self-association" evidence=IDA] InterPro:IPR000668
          InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
          InterPro:IPR000169 RGD:2445 GO:GO:0005783 GO:GO:0005794 GO:GO:0007568
          GO:GO:0010033 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
          InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139
          PROSITE:PS00639 GO:GO:0004252 GO:GO:0005764 GO:GO:0043621
          GO:GO:0042802 GO:GO:0031404 GO:GO:0004197
          GeneTree:ENSGT00560000076599 CTD:1075 HOGENOM:HOG000068022
          HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ InterPro:IPR014882
          Pfam:PF08773 MEROPS:C01.070 OMA:YDDFLHY GO:GO:0001913 EMBL:D90404
          IPI:IPI00193765 PIR:A41158 RefSeq:NP_058793.1 UniGene:Rn.203177
          PDB:1JQP PDBsum:1JQP ProteinModelPortal:P80067 SMR:P80067
          STRING:P80067 PhosphoSite:P80067 PRIDE:P80067
          Ensembl:ENSRNOT00000022342 GeneID:25423 KEGG:rno:25423
          InParanoid:P80067 SABIO-RK:P80067 EvolutionaryTrace:P80067
          NextBio:606591 ArrayExpress:P80067 Genevestigator:P80067
          GermOnline:ENSRNOG00000016496 Uniprot:P80067
        Length = 462

 Score = 280 (103.6 bits), Expect = 3.9e-24, P = 3.9e-24
 Identities = 72/222 (32%), Positives = 115/222 (51%)

Query:   136 VPTSLDWRD-KGA--VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ--LSEQQLLD 190
             +P S DWR+ +G   V+P++NQ+ CG C++FA++  +E   +I + N     LS Q+++ 
Sbjct:   230 LPESWDWRNVRGINFVSPVRNQESCGSCYSFASLGMLEARIRILTNNSQTPILSPQEVVS 289

Query:   191 CSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVP 250
             CS     GC GG          Q+ G+  E+ +PY A    C   +       S Y  V 
Sbjct:   290 CSPYAQ-GCDGGFPYLIAGKYAQDFGVVEENCFPYTATDAPCKPKENCLRYYSSEYYYVG 348

Query:   251 S---GDEQALLKA--VSMQPVSIAIAAYSTEFQSYKEGIFN--GVCGT----QL-DHAVT 298
                 G  +AL+K   V   P+++A   +  +F  Y  GI++  G+       +L +HAV 
Sbjct:   349 GFYGGCNEALMKLELVKHGPMAVAFEVHD-DFLHYHSGIYHHTGLSDPFNPFELTNHAVL 407

Query:   299 IVGFGTTE-DGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             +VG+G     G +YW++KNSWG+ WG++GY +I R    C I
Sbjct:   408 LVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRIRRGTDECAI 449


>TAIR|locus:2133402 [details] [associations]
            symbol:AT4G01610 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0005829 "cytosol" evidence=IDA] [GO:0000902 "cell
            morphogenesis" evidence=RCA] [GO:0006635 "fatty acid
            beta-oxidation" evidence=RCA] [GO:0010162 "seed dormancy process"
            evidence=RCA] [GO:0016049 "cell growth" evidence=RCA] [GO:0048193
            "Golgi vesicle transport" evidence=RCA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0005773 EMBL:CP002687
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 eggNOG:NOG315657
            HOGENOM:HOG000241341 KO:K01363 PANTHER:PTHR12411:SF16 OMA:DAIPDHF
            HSSP:P07858 ProtClustDB:CLSN2687619 EMBL:AF370193 EMBL:AY065167
            EMBL:AY114015 EMBL:AY086034 EMBL:AF083797 EMBL:BT001190
            EMBL:AK175280 EMBL:AK175481 EMBL:AK175539 EMBL:AK176165
            EMBL:AK176244 EMBL:AK176281 EMBL:AK176330 EMBL:AK176416
            EMBL:AK176433 EMBL:AK176487 EMBL:AK221398 EMBL:AK230235
            IPI:IPI00530811 RefSeq:NP_567215.1 UniGene:At.24471
            ProteinModelPortal:Q94K85 SMR:Q94K85 STRING:Q94K85 MEROPS:C01.144
            PaxDb:Q94K85 PRIDE:Q94K85 EnsemblPlants:AT4G01610.1 GeneID:826792
            KEGG:ath:AT4G01610 TAIR:At4g01610 InParanoid:Q94K85
            PhylomeDB:Q94K85 Genevestigator:Q94K85 Uniprot:Q94K85
        Length = 359

 Score = 187 (70.9 bits), Expect = 3.9e-24, Sum P(2) = 3.9e-24
 Identities = 34/98 (34%), Positives = 53/98 (54%)

Query:   243 ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLD-HAVTIVG 301
             +S Y  V S  +  + +     PV ++   Y  +F  YK G++  + G+ +  HAV ++G
Sbjct:   235 VSTYT-VKSNPQDIMAEVYKNGPVEVSFTVYE-DFAHYKSGVYKHITGSNIGGHAVKLIG 292

Query:   302 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             +GT+ +G +YWL+ N W   WGD GY  I R    CGI
Sbjct:   293 WGTSSEGEDYWLMANQWNRGWGDDGYFMIRRGTNECGI 330

 Score = 150 (57.9 bits), Expect = 3.9e-24, Sum P(2) = 3.9e-24
 Identities = 54/183 (29%), Positives = 84/183 (45%)

Query:    63 EKEMRLKIFKENLEYIEKANKEGNRTYKLGTN-QFSDLTNDEFRALYTGYKMPSPSHRXX 121
             ++++  KI ++  E ++K N+  N  +K   N +FS+ T  EF+ L  G K P+P     
Sbjct:    35 KQKLDSKILQD--EIVKKVNENPNAGWKAAINDRFSNATVAEFKRLL-GVK-PTPKKHFL 90

Query:   122 XXXXFKYQNLSMTDVPTSLD----WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS 177
                   +   S+  +P + D    W    ++  I +Q  CG CWAF AV ++     I+ 
Sbjct:    91 GVPIVSHDP-SLK-LPKAFDARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIQF 148

Query:   178 GNLIQLSEQQLLDC-STNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ 236
             G  I LS   LL C      +GC GG    A+ Y   + G+ TE+  PY    G      
Sbjct:   149 GMNISLSVNDLLACCGFRCGDGCDGGYPIAAWQYFSYS-GVVTEECDPYFDNTGCSHPGC 207

Query:   237 KPA 239
             +PA
Sbjct:   208 EPA 210


>UNIPROTKB|J9P219 [details] [associations]
            symbol:J9P219 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 InterPro:IPR014882
            Pfam:PF08773 OMA:YDDFLHY EMBL:AAEX03012741
            Ensembl:ENSCAFT00000050015 Uniprot:J9P219
        Length = 406

 Score = 275 (101.9 bits), Expect = 5.3e-24, P = 5.3e-24
 Identities = 86/290 (29%), Positives = 137/290 (47%)

Query:    69 KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKY 128
             +++K N E+++  N            ++  LT  +      G K+P             +
Sbjct:   110 RLYKYNYEFVKAINTIQKSWTATRYIEYETLTLRDMMTRGGGRKIPRKPKPTPLTAEI-H 168

Query:   129 QNLSMTDVPTSLDWRD-KGA--VTPIKNQK-ECGCCWAFAAVAAVEGITKIRSGNLIQ-- 182
             + +S   +PTS DWR+ +G   V+P++NQ   CG C+AFA+ A +E   +I + N     
Sbjct:   169 EEISR--LPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNNTQTPI 226

Query:   183 LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK 242
             LS Q+++ CS     GC GG          Q+ G+  E  +PY      C          
Sbjct:   227 LSPQEIVSCSQYAQ-GCEGGFPYLIAGKYAQDFGLVEEACFPYAGSDSPCKP-NDCFRYY 284

Query:   243 ISNYEEVPS--GD-EQALLKA--VSMQPVSIAIAAYSTEFQSYKEGIF--NGVCGT---- 291
              S Y  V    G   +AL+K   V   P+++A   Y  +F  Y++GI+   G+       
Sbjct:   285 SSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYD-DFFHYQKGIYYHTGLRDPFNPF 343

Query:   292 QL-DHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             +L +HAV +VG+GT +  G +YW++KNSWG+ WG+ GY +I R    C I
Sbjct:   344 ELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAI 393


>DICTYBASE|DDB_G0292462 [details] [associations]
            symbol:DDB_G0292462 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0292462 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            EMBL:AAFI02000190 RefSeq:XP_629634.1 MEROPS:C01.A56
            EnsemblProtists:DDB0184413 GeneID:8628698 KEGG:ddi:DDB_G0292462
            InParanoid:Q54D62 OMA:NTQVESH Uniprot:Q54D62
        Length = 323

 Score = 272 (100.8 bits), Expect = 1.1e-23, P = 1.1e-23
 Identities = 75/217 (34%), Positives = 109/217 (50%)

Query:   135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ--LSEQQLLDCS 192
             DV T+  W D   ++P++ Q+ CG CWA      +     I S   I+  LS Q L+DC 
Sbjct:    51 DVRTN--WGD--CMSPVREQQSCGSCWAQVTSGILADRMCIESDKNIKMLLSPQYLMDCD 106

Query:   193 ----TNG----NNGCLGGSREKAFAYIIQNQGIATEDEYPYQA-----VPGTC---SAAQ 236
                 ++G    NNGC GG    A   +I N+GI +++   YQA      P TC   S   
Sbjct:   107 GSCVSDGVSGCNNGCKGGFVGLALTRLI-NEGIVSDECLSYQASKDSSCPTTCDDGSPIS 165

Query:   237 KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLD-H 295
                  K ++    P+  + A  + ++  PV      YS +F+ +K  ++     TQ++ H
Sbjct:   166 NTTIYKATSCRAFPTVQD-AQYEIMTNGPVIATFMLYS-DFKPHKWDVYIKSSNTQVESH 223

Query:   296 AVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
             AV +VG+GTT DG +YW+  NSWG  WGD GY KI R
Sbjct:   224 AVRVVGWGTTSDGVDYWIAANSWGTGWGDKGYFKIRR 260


>DICTYBASE|DDB_G0288563 [details] [associations]
            symbol:DDB_G0288563 species:44689 "Dictyostelium
            discoideum" [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0288563
            GO:GO:0005615 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197
            EMBL:AAFI02000117 PANTHER:PTHR12411:SF16 RefSeq:XP_636643.1
            MEROPS:C01.A58 PRIDE:Q54IS1 EnsemblProtists:DDB0187993
            GeneID:8626689 KEGG:ddi:DDB_G0288563 InParanoid:Q54IS1 OMA:AWEYMEL
            Uniprot:Q54IS1
        Length = 314

 Score = 268 (99.4 bits), Expect = 2.9e-23, P = 2.9e-23
 Identities = 72/227 (31%), Positives = 110/227 (48%)

Query:   136 VPTSLD----WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ---LSEQQL 188
             +PTS D    W D   + PI NQ++CG CWAF++   +     I S N      LS Q L
Sbjct:    88 IPTSFDSRVQWPD--CIHPILNQEQCGSCWAFSSSEVLSDRLCIASNNKTNPGALSPQTL 145

Query:   189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK--ISNY 246
             + C   GN+GC GG  + A+ Y+ + +G+ T+   PY A  GT  + Q+  +     S Y
Sbjct:   146 VACDVYGNDGCSGGIPQLAWEYM-ELKGLPTDSCVPYTAGNGTVYSCQRSCSDSEDYSLY 204

Query:   247 EEVP-----SGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQL--DHAVT 298
                P         Q + + + +  P+   +  Y  +F SY  G++    G+ L   HA+ 
Sbjct:   205 RAKPFTLKTCSSVQCIQENILAYGPIVGTMEVYE-DFMSYSSGVYVMTPGSSLLGGHAIK 263

Query:   299 IVGFGTTEDGA-NYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSS 344
             IVG+G  +    NYW++ NSWG  WG  G+  I  +   C I + +S
Sbjct:   264 IVGWGFDQTSQLNYWIVANSWGADWGQQGFFFISMET--CSISSDAS 308


>UNIPROTKB|E9PKT6 [details] [associations]
            symbol:CTSH "Cathepsin H" species:9606 "Homo sapiens"
            [GO:0001520 "outer dense fiber" evidence=IEA] [GO:0001656
            "metanephros development" evidence=IEA] [GO:0001669 "acrosomal
            vesicle" evidence=IEA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0007283 "spermatogenesis" evidence=IEA] [GO:0008284 "positive
            regulation of cell proliferation" evidence=IEA] [GO:0010634
            "positive regulation of epithelial cell migration" evidence=IEA]
            [GO:0016505 "apoptotic protease activator activity" evidence=IEA]
            [GO:0030984 "kininogen binding" evidence=IEA] [GO:0031638 "zymogen
            activation" evidence=IEA] [GO:0031648 "protein destabilization"
            evidence=IEA] [GO:0032403 "protein complex binding" evidence=IEA]
            [GO:0032526 "response to retinoic acid" evidence=IEA] [GO:0033619
            "membrane protein proteolysis" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0043621 "protein
            self-association" evidence=IEA] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IEA] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            InterPro:IPR000169 GO:GO:0043066 GO:GO:0008284 PANTHER:PTHR12411
            PROSITE:PS00139 GO:GO:0045766 GO:GO:0004252 GO:GO:0032526
            GO:GO:0016505 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 GO:GO:0060448 GO:GO:0033619
            EMBL:AC011944 HGNC:HGNC:2535 IPI:IPI00375426
            ProteinModelPortal:E9PKT6 SMR:E9PKT6 PRIDE:E9PKT6
            Ensembl:ENST00000528741 ArrayExpress:E9PKT6 Bgee:E9PKT6
            Uniprot:E9PKT6
        Length = 134

 Score = 262 (97.3 bits), Expect = 1.3e-22, P = 1.3e-22
 Identities = 57/139 (41%), Positives = 78/139 (56%)

Query:    91 LGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTD-VPTSLDWRDKGA-V 148
             +  NQFSD++   F  +   Y    P +       +    L  T   P S+DWR KG  V
Sbjct:     1 MALNQFSDMS---FAEIKHKYLWSEPQNCSATKSNY----LRGTGPYPPSVDWRKKGNFV 53

Query:   149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKA 207
             +P+KNQ  CG CW F+   A+E    I +G ++ L+EQQL+DC+ + NN GC GG   +A
Sbjct:    54 SPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQA 113

Query:   208 FAYIIQNQGIATEDEYPYQ 226
             F YI+ N+GI  ED YPYQ
Sbjct:   114 FEYILYNKGIMGEDTYPYQ 132


>UNIPROTKB|E2R6Q7 [details] [associations]
            symbol:CTSB "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730 "nucleolus"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790
            GO:GO:0005764 GO:GO:0004197 CTD:1508 GeneTree:ENSGT00560000076599
            KO:K01363 OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16
            EMBL:AAEX03014318 RefSeq:XP_543203.3 Ensembl:ENSCAFT00000012692
            GeneID:486077 KEGG:cfa:486077 NextBio:20859923 Uniprot:E2R6Q7
        Length = 339

 Score = 190 (71.9 bits), Expect = 1.1e-21, Sum P(2) = 1.1e-21
 Identities = 41/112 (36%), Positives = 59/112 (52%)

Query:   229 PGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV 288
             PG   + ++      S+Y  V   +++ + +     PV  A   YS +F  YK G++  V
Sbjct:   213 PGYSPSYKEDKHYGCSSYS-VSDNEKEIMAEIYKNGPVEAAFTVYS-DFLLYKSGVYQHV 270

Query:   289 CGTQLD-HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
              G  +  HAV I+G+G  EDG  YWL+ NSW   WGD G+ KI+R    CGI
Sbjct:   271 TGEMMGGHAVRILGWGV-EDGTPYWLVGNSWNTDWGDNGFFKILRGRDHCGI 321

 Score = 121 (47.7 bits), Expect = 1.1e-21, Sum P(2) = 1.1e-21
 Identities = 45/155 (29%), Positives = 75/155 (48%)

Query:    76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTD 135
             E ++  NK  N T+K G N F ++     R L  G  +  P  +      F  +NL +  
Sbjct:    29 ELVDYVNKR-NTTWKAGHN-FHNVDPSYLRRL-CGTFLGGP--KLPQRVQFA-KNLIL-- 80

Query:   136 VPTSLD----WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS-GNL-IQLSEQQLL 189
              P S D    W +   +  I++Q  CG CWAF AV A+     IR+ G++ +++S + +L
Sbjct:    81 -PESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEVSAEDML 139

Query:   190 DCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEY 223
              C  +   +GC GG   +A+ +  + QG+ +   Y
Sbjct:   140 TCCGDQCGDGCNGGFPAEAWNFWTK-QGLVSGGLY 173


>ZFIN|ZDB-GENE-070323-1 [details] [associations]
            symbol:ctsbb "capthepsin B, b" species:7955 "Danio
            rerio" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-070323-1 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197
            GeneTree:ENSGT00560000076599 PANTHER:PTHR12411:SF16 OMA:CCGFLCG
            EMBL:CU207296 EMBL:CABZ01037785 IPI:IPI00877452
            Ensembl:ENSDART00000097263 Bgee:F1QZT5 Uniprot:F1QZT5
        Length = 326

 Score = 194 (73.4 bits), Expect = 2.6e-21, Sum P(2) = 2.6e-21
 Identities = 39/97 (40%), Positives = 55/97 (56%)

Query:   244 SNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLD-HAVTIVGF 302
             S    VPS  +Q + +  +  PV  A   Y  +F  YK G++  + G+ L  HAV I+G+
Sbjct:   221 SKVYNVPSDQQQIMTELYTNGPVEAAFTVYE-DFPLYKSGVYQHLTGSALGGHAVKILGW 279

Query:   303 GTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             G  E+G  +WL+ NSW + WGD GY KI+R    CGI
Sbjct:   280 GE-ENGTPFWLVANSWNSDWGDNGYFKILRGHDECGI 315

 Score = 111 (44.1 bits), Expect = 2.6e-21, Sum P(2) = 2.6e-21
 Identities = 31/94 (32%), Positives = 45/94 (47%)

Query:   136 VPTSLDWRDKG----AVTPIKNQKECGCCWAFAAVAAVEGITKIRS-GNLI-QLSEQQLL 189
             +P S D RD+      +  I++Q  CG CWAF AV ++     I S G    ++S + LL
Sbjct:    75 LPDSFDLRDQWPNCKTLNQIRDQGSCGSCWAFGAVESISDRICIHSKGKQSPEISAEDLL 134

Query:   190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
              C      GC GG   +A+ Y  +  G+ T   Y
Sbjct:   135 SCCDQCGFGCSGGFPAEAWDYW-RRSGLVTGGLY 167


>UNIPROTKB|E2QV47 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0097208 "alveolar lamellar body"
            evidence=IEA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0070371 "ERK1 and ERK2 cascade"
            evidence=IEA] [GO:0070324 "thyroid hormone binding" evidence=IEA]
            [GO:0060448 "dichotomous subdivision of terminal units involved in
            lung branching" evidence=IEA] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IEA] [GO:0043129 "surfactant homeostasis"
            evidence=IEA] [GO:0043066 "negative regulation of apoptotic
            process" evidence=IEA] [GO:0033619 "membrane protein proteolysis"
            evidence=IEA] [GO:0032526 "response to retinoic acid" evidence=IEA]
            [GO:0031648 "protein destabilization" evidence=IEA] [GO:0031638
            "zymogen activation" evidence=IEA] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=IEA] [GO:0016505
            "apoptotic protease activator activity" evidence=IEA] [GO:0010815
            "bradykinin catabolic process" evidence=IEA] [GO:0010813
            "neuropeptide catabolic process" evidence=IEA] [GO:0010634
            "positive regulation of epithelial cell migration" evidence=IEA]
            [GO:0010628 "positive regulation of gene expression" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=IEA] [GO:0005829 "cytosol" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0004252 "serine-type endopeptidase activity"
            evidence=IEA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IEA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 GO:GO:0016505 GO:GO:0010634
            GO:GO:0004197 GO:GO:0042599 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 Ensembl:ENSCAFT00000036196 Uniprot:E2QV47
        Length = 136

 Score = 245 (91.3 bits), Expect = 8.0e-21, P = 8.0e-21
 Identities = 49/133 (36%), Positives = 77/133 (57%)

Query:   220 EDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSM-QPVSIAIAAYSTEFQ 278
             ED YPY+   G C      A A + +   +   DEQA+++AV++  PVS A    S +F 
Sbjct:     3 EDSYPYKGQDGDCKYQPSKAIAFVKDVANITINDEQAMVEAVALYNPVSFAFEVTS-DFM 61

Query:   279 SYKEGIFNGV-CGT---QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE 334
              Y++GI++   C     +++HAV  VG+G  ++G  YW++KNSWG  WG  GY  + R +
Sbjct:    62 MYRKGIYSSTSCHKTPDKVNHAVLAVGYGE-QNGIPYWIVKNSWGPQWGMNGYFLMERGK 120

Query:   335 GLCGIGTRSSYPL 347
              +CG+   +SYP+
Sbjct:   121 NMCGLAACASYPI 133


>WB|WBGene00000784 [details] [associations]
            symbol:cpr-4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39895 EMBL:L39926 EMBL:FO081381
            PIR:T37280 RefSeq:NP_504682.1 UniGene:Cel.5404
            ProteinModelPortal:P43508 SMR:P43508 DIP:DIP-25376N
            MINT:MINT-1069892 STRING:P43508 MEROPS:C01.A34 PaxDb:P43508
            EnsemblMetazoa:F44C4.3 GeneID:179053 KEGG:cel:CELE_F44C4.3
            UCSC:F44C4.3 CTD:179053 WormBase:F44C4.3 InParanoid:P43508
            OMA:CCGFLCG NextBio:903704 Uniprot:P43508
        Length = 335

 Score = 173 (66.0 bits), Expect = 9.1e-21, Sum P(2) = 9.1e-21
 Identities = 33/86 (38%), Positives = 49/86 (56%)

Query:   255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLD-HAVTIVGFGTTEDGANYWL 313
             Q   + ++  PV  A   Y  +F  YK G++    G +L  HA+ I+G+GT ++G  YWL
Sbjct:   241 QIQAEIIAHGPVEAAFTVYE-DFYQYKTGVYVHTTGQELGGHAIRILGWGT-DNGTPYWL 298

Query:   314 IKNSWGNTWGDAGYMKIVRDEGLCGI 339
             + NSW   WG+ GY +I+R    CGI
Sbjct:   299 VANSWNVNWGENGYFRIIRGTNECGI 324

 Score = 133 (51.9 bits), Expect = 9.1e-21, Sum P(2) = 9.1e-21
 Identities = 31/97 (31%), Positives = 48/97 (49%)

Query:   136 VPTSLD----WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ--LSEQQLL 189
             +P + D    W +  ++  I++Q +CG CWAFAA  A      I S   +   LS + +L
Sbjct:    81 IPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVL 140

Query:   190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQ 226
              C +N   GC GG    A+ Y++++ G  T   Y  Q
Sbjct:   141 SCCSNCGYGCEGGYPINAWKYLVKS-GFCTGGSYEAQ 176


>FB|FBgn0033873 [details] [associations]
            symbol:CG6337 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 EMBL:AE013599
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 HSSP:P80067 EMBL:AY084123
            RefSeq:NP_610905.1 UniGene:Dm.5230 SMR:Q7JYA0 IntAct:Q7JYA0
            EnsemblMetazoa:FBtr0087646 GeneID:36530 KEGG:dme:Dmel_CG6337
            UCSC:CG6337-RA FlyBase:FBgn0033873 eggNOG:NOG310593
            InParanoid:Q7JYA0 OMA:NRTTYRE OrthoDB:EOG4MCVFZ GenomeRNAi:36530
            NextBio:799041 Uniprot:Q7JYA0
        Length = 340

 Score = 245 (91.3 bits), Expect = 1.5e-20, P = 1.5e-20
 Identities = 84/330 (25%), Positives = 142/330 (43%)

Query:    24 LLVSCASQVVSS--RSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
             LL+ C    + S     H Q +V+  + +     ++Y     +      F  N   + + 
Sbjct:     4 LLLCCLLLTIDSGWAFNHGQDLVDF-QTYEDNFNKTYASTSARNFANYYFIYNRNQVAQH 62

Query:    82 NKEGNR---TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPT 138
             N + +R   TY+   NQFSD+   +F AL     +P   +           + + +    
Sbjct:    63 NAQADRNRTTYREAVNQFSDIRLIQFAAL-----LPKAVNTVTSAASDPPASQAAS---A 114

Query:   139 SLDW-RDKGAVTPIKNQK-ECGCCWAFAAVAAVEGITKIRSGNLI--QLSEQQLLDCSTN 194
             S D   D G    +++Q   C   WA+A   AVE +  +++ N +   LS QQLLDC+  
Sbjct:   115 SFDIITDFGLTVAVEDQGVNCSSSWAYATAKAVEIMNAVQTANPLPSSLSAQQLLDCAGM 174

Query:   195 GNNGCLGGSREKAFAYIIQ--NQGIATEDEYPYQ---AVPGTCSAAQKPAAA-KISNYEE 248
             G  GC   +   A  Y+ Q  +  +  E +YP       PG C      +   K++ Y  
Sbjct:   175 GT-GCSTQTPLAALNYLTQLTDAYLYPEVDYPNNNSLKTPGMCQPPSSVSVGVKLAGYST 233

Query:   249 VPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGIF----NGVCGTQLDHAVTIVGFG 303
             V   D+ A+++ VS   PV +     +  F  Y  G++      +   +    + +VG+ 
Sbjct:   234 VADNDDAAVMRYVSNGFPVIVEYNPATFGFMQYSSGVYVQETRALTNPKSSQFLVVVGYD 293

Query:   304 TTEDG-ANYWLIKNSWGNTWGDAGYMKIVR 332
                D   +YW   NS+G+TWG+ GY++IVR
Sbjct:   294 HDVDSNLDYWRCLNSFGDTWGEEGYIRIVR 323


>DICTYBASE|DDB_G0283921 [details] [associations]
            symbol:ctsB "cathepsin B precursor" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0283921 GO:GO:0005615
            GenomeReviews:CM000153_GR GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:AAFI02000058
            eggNOG:NOG315657 PANTHER:PTHR12411:SF16 OMA:CSLSCQS
            RefSeq:XP_638805.1 HSSP:P07688 MEROPS:C01.A59
            EnsemblProtists:DDB0233997 GeneID:8624329 KEGG:ddi:DDB_G0283921
            Uniprot:Q54QD9
        Length = 311

 Score = 160 (61.4 bits), Expect = 2.9e-20, Sum P(2) = 2.9e-20
 Identities = 41/128 (32%), Positives = 63/128 (49%)

Query:   128 YQNLSMTDVPTSLD----WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
             Y  L +  +PTS +    W +   ++ I+NQ  CG CWAF A  +      I +   +QL
Sbjct:    72 YDPLGV-QIPTSFNAQTNWPNCTTISQIQNQARCGSCWAFGATESATDRLCIHNNENVQL 130

Query:   184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKI 243
             S   ++ C    +NGC GG    A+ ++ + QG  +E+  PY  +P TC  AQ+P     
Sbjct:   131 SFMDMVTCDET-DNGCEGGDAFSAWNWL-RKQGAVSEECLPY-TIP-TCPPAQQPCL--- 183

Query:   244 SNYEEVPS 251
              N+   PS
Sbjct:   184 -NFVNTPS 190

 Score = 142 (55.0 bits), Expect = 2.9e-20, Sum P(2) = 2.9e-20
 Identities = 36/104 (34%), Positives = 52/104 (50%)

Query:   237 KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLD-H 295
             K   AKI +++     DE  + + V+  PV      +  +F +YK G++    G  L  H
Sbjct:   207 KHKMAKIYSFDS----DEAIMQEIVTNGPVEACFTVFE-DFLAYKSGVYVHTTGKDLGGH 261

Query:   296 AVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
              V +VGFGT  +G +Y+   N W  +WGD G   I R  G CGI
Sbjct:   262 CVKLVGFGTL-NGVDYYAANNQWTTSWGDNGTFLIKR--GDCGI 302


>WB|WBGene00000785 [details] [associations]
            symbol:cpr-5 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39896 EMBL:L39927 EMBL:FO081739
            PIR:T37277 RefSeq:NP_503383.1 UniGene:Cel.19730
            ProteinModelPortal:P43509 SMR:P43509 DIP:DIP-25329N IntAct:P43509
            MINT:MINT-1051285 STRING:P43509 MEROPS:C01.A35 PaxDb:P43509
            EnsemblMetazoa:W07B8.5 GeneID:178612 KEGG:cel:CELE_W07B8.5
            UCSC:W07B8.5.1 CTD:178612 WormBase:W07B8.5 InParanoid:P43509
            OMA:DAIPDHF NextBio:901840 Uniprot:P43509
        Length = 344

 Score = 168 (64.2 bits), Expect = 3.6e-20, Sum P(2) = 3.6e-20
 Identities = 32/87 (36%), Positives = 48/87 (55%)

Query:   254 EQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLD-HAVTIVGFGTTEDGANYW 312
             EQ   + ++  P+ +A   Y  +F  Y  G++    G  L  HAV I+G+G  ++G  YW
Sbjct:   245 EQIQTEILTNGPIEVAFTVYE-DFYQYTTGVYVHTAGASLGGHAVKILGWGV-DNGTPYW 302

Query:   313 LIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             L+ NSW   WG+ GY +I+R    CGI
Sbjct:   303 LVANSWNVAWGEKGYFRIIRGLNECGI 329

 Score = 134 (52.2 bits), Expect = 3.6e-20, Sum P(2) = 3.6e-20
 Identities = 34/100 (34%), Positives = 52/100 (52%)

Query:   136 VPTSLDWRDKG----AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ--LSEQQLL 189
             +P   D RD+     ++  I++Q +CG CWAFAA  A+   T I S   +   LS + LL
Sbjct:    82 IPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLL 141

Query:   190 DCST---NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQ 226
              C T   +  NGC GG   +A+ + +++ G+ T   Y  Q
Sbjct:   142 SCCTGMFSCGNGCEGGYPIQAWKWWVKH-GLVTGGSYETQ 180


>DICTYBASE|DDB_G0283401 [details] [associations]
            symbol:ctsZ "cathepsin Z precursor" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            dictyBase:DDB_G0283401 GO:GO:0005615 GenomeReviews:CM000153_GR
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 EMBL:AAFI02000055 KO:K08568 OMA:QCGTCTE
            eggNOG:NOG275763 RefSeq:XP_639036.1 ProteinModelPortal:Q54R55
            IntAct:Q54R55 MEROPS:C01.A60 PRIDE:Q54R55
            EnsemblProtists:DDB0233836 GeneID:8624061 KEGG:ddi:DDB_G0283401
            InParanoid:Q54R55 Uniprot:Q54R55
        Length = 296

 Score = 237 (88.5 bits), Expect = 5.7e-20, P = 5.7e-20
 Identities = 64/195 (32%), Positives = 101/195 (51%)

Query:   157 CGCCWAFAAVAAVEGITKI-RSGNL--IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
             CG CWAFA+ +++    KI R      + ++ Q L+DC  NG   C GG    AFA+I +
Sbjct:    85 CGGCWAFASTSSISDRIKIQRKAAFPDVNVAPQHLIDC--NGGGTCDGGDPGDAFAFINE 142

Query:   214 NQGIATEDEYPYQA--VPGTCSAAQK-----------PAAAKIS--NYEEVPSGDEQALL 258
             N GI  E   PYQA  +P  CS A K           P    I+   Y  V  G +  + 
Sbjct:   143 N-GIVDETCKPYQAKNLPDECSPACKTCNPDGTCQAIPVHTNITVTEYGSV-RGAKDMMA 200

Query:   259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQL-DHAVTIVGFGTTEDGANYWLIKNS 317
             +  +  P++ +I A S + ++Y  GIF       L +H ++++G+G  +D   YW+++NS
Sbjct:   201 EIYARGPIACSIDATS-KLEAYTSGIFKEFKLDPLPNHIISVIGWGV-QDSTPYWIVRNS 258

Query:   318 WGNTWGDAGYMKIVR 332
             WG+ +G+ G+  IV+
Sbjct:   259 WGSYYGEGGFFNIVQ 273

 Score = 165 (63.1 bits), Expect = 6.2e-10, P = 6.2e-10
 Identities = 46/114 (40%), Positives = 61/114 (53%)

Query:   135 DVPTSLDWRDKGAV---TPIKNQ---KECGCCWAFAAVAAVEGITKI-RSGNL--IQLSE 185
             +VP S DWR+   V   T  +NQ   + CG CWAFA+ +++    KI R      + ++ 
Sbjct:    57 EVPQSWDWRNVSGVNYLTMNRNQHIPQYCGGCWAFASTSSISDRIKIQRKAAFPDVNVAP 116

Query:   186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA--VPGTCSAAQK 237
             Q L+DC  NG   C GG    AFA+I +N GI  E   PYQA  +P  CS A K
Sbjct:   117 QHLIDC--NGGGTCDGGDPGDAFAFINEN-GIVDETCKPYQAKNLPDECSPACK 167


>ZFIN|ZDB-GENE-040426-2650 [details] [associations]
            symbol:ctsba "cathepsin B, a" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0031101 "fin regeneration"
            evidence=IEP] [GO:0008233 "peptidase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 ZFIN:ZDB-GENE-040426-2650 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790
            GO:GO:0004197 GO:GO:0031101 MEROPS:C01.060 HOVERGEN:HBG003480
            PANTHER:PTHR12411:SF16 HSSP:P07688 EMBL:BC044517 IPI:IPI00485996
            UniGene:Dr.3374 ProteinModelPortal:Q803E4 SMR:Q803E4 STRING:Q803E4
            PRIDE:Q803E4 InParanoid:Q803E4 ArrayExpress:Q803E4 Bgee:Q803E4
            Uniprot:Q803E4
        Length = 330

 Score = 186 (70.5 bits), Expect = 8.6e-20, Sum P(2) = 8.6e-20
 Identities = 38/92 (41%), Positives = 50/92 (54%)

Query:   249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLD-HAVTIVGFGTTED 307
             VPS     + +     PV  A   Y  +F  YK G++  + G+ L  HA+ I+G+G  E+
Sbjct:   231 VPSNQNGIMAELFKNGPVEAAFTVYE-DFLLYKSGVYQHMSGSALGGHAIKILGWGE-EN 288

Query:   308 GANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             G  YWL  NSW   WGD GY KI+R E  CGI
Sbjct:   289 GVPYWLAANSWNTDWGDNGYFKILRGEDHCGI 320

 Score = 107 (42.7 bits), Expect = 8.6e-20, Sum P(2) = 8.6e-20
 Identities = 26/84 (30%), Positives = 40/84 (47%)

Query:   142 WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLI--QLSEQQLLDCSTNGNNGC 199
             W +   +  I++Q  CG CWAF A  A+     I+S   +  ++S Q LL C  +   GC
Sbjct:    89 WPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIQSNAKVSVEISSQDLLTCCDSCGMGC 148

Query:   200 LGGSREKAFAYIIQNQGIATEDEY 223
              GG    A+ +   + G+ T   Y
Sbjct:   149 NGGYPSAAWDFWTTD-GLVTGGLY 171


>UNIPROTKB|F1N9D7 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9031 "Gallus gallus"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005730 "nucleolus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005739 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0005764
            GO:GO:0004197 GeneTree:ENSGT00560000076599 OMA:GYPSGAW
            GO:GO:0097067 PANTHER:PTHR12411:SF16 IPI:IPI00573387
            EMBL:AADN02018292 Ensembl:ENSGALT00000026896
            Ensembl:ENSGALT00000036723 Uniprot:F1N9D7
        Length = 340

 Score = 190 (71.9 bits), Expect = 8.7e-20, Sum P(2) = 8.7e-20
 Identities = 41/112 (36%), Positives = 61/112 (54%)

Query:   229 PGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV 288
             PG   + ++     I++Y  VP  +++ + +     PV  A   Y  +F  YK G++  V
Sbjct:   214 PGYSPSYKEDKHYGITSYG-VPRSEKEIMAEIYKNGPVEGAFIVYE-DFLMYKSGVYQHV 271

Query:   289 CGTQLD-HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
              G Q+  HA+ I+G+G  E+G  YWL  NSW   WGD G+ KI+R E  CGI
Sbjct:   272 SGEQVGGHAIRILGWGV-ENGTPYWLAANSWNTDWGDNGFFKILRGEDHCGI 322

 Score = 103 (41.3 bits), Expect = 8.7e-20, Sum P(2) = 8.7e-20
 Identities = 25/83 (30%), Positives = 40/83 (48%)

Query:   135 DVPTSLD----WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLI--QLSEQQL 188
             D+P + D    W +   ++ I++Q  CG CWAF AV A+     + +   +  ++S + L
Sbjct:    79 DLPDTFDSRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDL 138

Query:   189 LDC-STNGNNGCLGGSREKAFAY 210
             L C       GC GG    A+ Y
Sbjct:   139 LSCCGFECGMGCNGGYPSGAWRY 161


>WB|WBGene00021072 [details] [associations]
            symbol:W07B8.4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:FO081739 PIR:T31728 RefSeq:NP_503382.1
            HSSP:P53634 ProteinModelPortal:O16288 SMR:O16288 STRING:O16288
            MEROPS:C01.A39 PaxDb:O16288 EnsemblMetazoa:W07B8.4 GeneID:178611
            KEGG:cel:CELE_W07B8.4 UCSC:W07B8.4 CTD:178611 WormBase:W07B8.4
            InParanoid:O16288 OMA:ESQYGCK NextBio:901836 Uniprot:O16288
        Length = 335

 Score = 166 (63.5 bits), Expect = 1.1e-19, Sum P(2) = 1.1e-19
 Identities = 32/76 (42%), Positives = 44/76 (57%)

Query:   265 PVSIAIAAYSTEFQSYKEGIFNGVCGTQLD-HAVTIVGFGTTEDGANYWLIKNSWGNTWG 323
             PV +    Y  +F  YK GI+  V G +L  HAV ++G+G  ++G  YWL  NSW   WG
Sbjct:   247 PVEVGFIVYE-DFYLYKTGIYTHVAGGELGGHAVKMLGWGV-DNGTPYWLAANSWNTVWG 304

Query:   324 DAGYMKIVRDEGLCGI 339
             + GY +I+R    CGI
Sbjct:   305 EKGYFRILRGVDECGI 320

 Score = 131 (51.2 bits), Expect = 1.1e-19, Sum P(2) = 1.1e-19
 Identities = 39/110 (35%), Positives = 54/110 (49%)

Query:   136 VPTSLDWRDKG----AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ--LSEQQLL 189
             +P S D RD      +V  I++Q  CG CWA AA  A+   T I S   +   LS + +L
Sbjct:    73 IPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVNTLLSAEDIL 132

Query:   190 DCST---NGNNGCLGGSREKAFAYIIQNQGIAT----EDEY---PYQAVP 229
              C T   N  +GC GG   +A+ Y ++N G+ T    E +Y   PY   P
Sbjct:   133 TCCTGKFNCGDGCEGGYPIQAWRYWVKN-GLVTGGSFESQYGCKPYSIAP 181


>WB|WBGene00000788 [details] [associations]
            symbol:cpz-1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] [GO:0009792 "embryo development ending in
            birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0010171 "body morphogenesis" evidence=IMP]
            [GO:0018996 "molting cycle, collagen and cuticulin-based cuticle"
            evidence=IMP] [GO:0031012 "extracellular matrix" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0009792 GO:GO:0040007 GO:GO:0002119
            GO:GO:0018996 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0010171 GO:GO:0031012
            GeneTree:ENSGT00560000076599 KO:K08568 OMA:QCGTCTE EMBL:FO081275
            EMBL:BK001409 PIR:T29872 RefSeq:NP_491023.2 HSSP:Q9UBR2
            ProteinModelPortal:G5EGP8 SMR:G5EGP8 IntAct:G5EGP8 MEROPS:C01.A38
            EnsemblMetazoa:F32B5.8 GeneID:171829 KEGG:cel:CELE_F32B5.8
            CTD:171829 WormBase:F32B5.8 NextBio:872879 Uniprot:G5EGP8
        Length = 306

 Score = 234 (87.4 bits), Expect = 1.2e-19, P = 1.2e-19
 Identities = 63/197 (31%), Positives = 95/197 (48%)

Query:   157 CGCCWAFAAVAAVEGITKIRSGNL---IQLSEQQLLDCSTNGNNGC-LGGSREKAFAYII 212
             CG CWAF A +A+     I+  N      LS Q+++DCS  G   C +GG     + Y  
Sbjct:    92 CGSCWAFGATSALADRINIKRKNAWPQAYLSVQEVIDCSGAGT--CVMGGEPGGVYKYAH 149

Query:   213 QNQGIATE--DEY--------PYQAV----PGTCSAAQKPAAAKISNYEEVPSGDEQALL 258
             ++ GI  E  + Y        PY       PG C + +     K+S Y  V  G E+   
Sbjct:   150 EH-GIPHETCNNYQARDGKCDPYNRCGSCWPGECFSIKNYTLYKVSEYGTV-HGYEKMKA 207

Query:   259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT-EDGANYWLIKNS 317
             +     P++  IAA +  F++Y  GI+  V    +DH +++ G+G   E G  YW+ +NS
Sbjct:   208 EIYHKGPIACGIAA-TKAFETYAGGIYKEVTDEDIDHIISVHGWGVDHESGVEYWIGRNS 266

Query:   318 WGNTWGDAGYMKIVRDE 334
             WG  WG+ G+ KIV  +
Sbjct:   267 WGEPWGEHGWFKIVTSQ 283

 Score = 131 (51.2 bits), Expect = 6.1e-06, P = 6.1e-06
 Identities = 37/117 (31%), Positives = 57/117 (48%)

Query:   126 FKYQNLSMTDVPTSLDWRDKGAV---TPIKNQ---KECGCCWAFAAVAAVEGITKIRSGN 179
             ++ ++    D+P + DWRD   +   +  +NQ   + CG CWAF A +A+     I+  N
Sbjct:    55 YETEDFDSEDLPKTWDWRDANGINYASADRNQHIPQYCGSCWAFGATSALADRINIKRKN 114

Query:   180 L---IQLSEQQLLDCSTNGNNGC-LGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC 232
                   LS Q+++DCS  G   C +GG     + Y  ++ GI  E    YQA  G C
Sbjct:   115 AWPQAYLSVQEVIDCSGAGT--CVMGGEPGGVYKYAHEH-GIPHETCNNYQARDGKC 168


>WB|WBGene00013072 [details] [associations]
            symbol:Y51A2D.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            EMBL:AL021497 RefSeq:NP_001256811.1 ProteinModelPortal:O62484
            SMR:O62484 MEROPS:C01.A37 EnsemblMetazoa:Y51A2D.1 GeneID:180204
            KEGG:cel:CELE_Y51A2D.1 UCSC:Y51A2D.1 CTD:180204 WormBase:Y51A2D.1a
            HOGENOM:HOG000019851 NextBio:908416 Uniprot:O62484
        Length = 314

 Score = 152 (58.6 bits), Expect = 1.4e-19, Sum P(2) = 1.4e-19
 Identities = 44/155 (28%), Positives = 69/155 (44%)

Query:    39 HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQ 95
             H + V +   ++  +  R+YK E E ++RL+ F ++   + + NK   +  R      NQ
Sbjct:    36 HPEKVYQEFVEFKKKFSRTYKSEAENQLRLQNFVKSRNNVVRLNKNAQKAGRNSNFAVNQ 95

Query:    96 FSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMT-------DVPTSLDWRDKGA- 147
             FSDLT  E     + +  P+ +        FK + L  T       +   + D R +   
Sbjct:    96 FSDLTTSELHQRLSRFP-PNLTENSVFHKNFK-KLLGKTRTKRQNSEFARNFDLRSQKVN 153

Query:   148 ----VTPIKNQKECGCCWAFAAVAAVEGITKIRSG 178
                 V PIKNQ +C CCW FA  A +E I  +  G
Sbjct:   154 GRYIVGPIKNQGQCACCWGFAVTAMLETIYAVNVG 188

 Score = 145 (56.1 bits), Expect = 1.4e-19, Sum P(2) = 1.4e-19
 Identities = 35/101 (34%), Positives = 54/101 (53%)

Query:   250 PSGDEQALLKAVSM--QPVSIAIAAYSTEFQSYKEGIFN----GVCGTQLDHAVTIVGFG 303
             P   E  +++ ++    PV++  AA  T F  YK G+       + GT + HA  IVG+G
Sbjct:   201 PENAESEIIEILNTWKTPVAVYFAA-GTAFLQYKSGVLVTEDCDLAGT-VWHAGAIVGYG 258

Query:   304 TTED----GANYWLIKNSWG-NTWGDAGYMKIVRDEGLCGI 339
                D       +W++KNSWG + WG  GY+K++R +  CGI
Sbjct:   259 EENDLRGRSQRFWIMKNSWGVSGWGTGGYVKLIRGKNWCGI 299


>TAIR|locus:2204873 [details] [associations]
            symbol:AT1G02300 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002684 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 KO:K01363
            PANTHER:PTHR12411:SF16 OMA:ADDINAC IPI:IPI00534431
            RefSeq:NP_563647.1 UniGene:At.43952 ProteinModelPortal:F4HVZ1
            SMR:F4HVZ1 MEROPS:C01.A10 EnsemblPlants:AT1G02300.1 GeneID:839576
            KEGG:ath:AT1G02300 ArrayExpress:F4HVZ1 Uniprot:F4HVZ1
        Length = 379

 Score = 204 (76.9 bits), Expect = 1.7e-19, Sum P(2) = 1.7e-19
 Identities = 37/89 (41%), Positives = 54/89 (60%)

Query:   253 DEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGIFNGVCGTQLD-HAVTIVGFGTTEDGAN 310
             D Q ++  V    PV +A   Y  +F  YK G++  + GT++  HAV ++G+GT++DG +
Sbjct:   263 DPQDIMAEVYKNGPVEVAFTVYE-DFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGED 321

Query:   311 YWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             YWL+ N W  +WGD GY KI R    CGI
Sbjct:   322 YWLLANQWNRSWGDDGYFKIRRGTNECGI 350

 Score = 86 (35.3 bits), Expect = 1.7e-19, Sum P(2) = 1.7e-19
 Identities = 25/78 (32%), Positives = 36/78 (46%)

Query:   157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC-STNGNNGCLGGSREKAFAYIIQNQ 215
             CG CWAF AV ++     I+    + LS   ++ C       GC GG    A+ Y  +  
Sbjct:   148 CGSCWAFGAVESLSDRFCIKYNLNVSLSANDVIACCGLLCGFGCNGGFPMGAWLYF-KYH 206

Query:   216 GIATEDEYPYQAVPGTCS 233
             G+ T++  PY    G CS
Sbjct:   207 GVVTQECDPYFDNTG-CS 223


>DICTYBASE|DDB_G0280187 [details] [associations]
            symbol:DDB_G0280187 "cathepsin Z-like protein"
            species:44689 "Dictyostelium discoideum" [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            dictyBase:DDB_G0280187 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            EMBL:AAFI02000035 KO:K08568 RefSeq:XP_641294.1
            ProteinModelPortal:Q54VR1 MEROPS:C01.A61 PRIDE:Q54VR1
            EnsemblProtists:DDB0233838 GeneID:8622427 KEGG:ddi:DDB_G0280187
            InParanoid:Q54VR1 OMA:VWKVGDY Uniprot:Q54VR1
        Length = 291

 Score = 148 (57.2 bits), Expect = 2.9e-19, Sum P(2) = 2.9e-19
 Identities = 39/108 (36%), Positives = 59/108 (54%)

Query:   136 VPTSLDWRD-KGA--VTPIKNQ---KECGCCWAFAAVAAVEGITKI-RSGNL--IQLSEQ 186
             +PT  DWR+  G+  +T  +NQ   + CG CWA    +A+    KI R G    + L+ Q
Sbjct:    49 LPTQYDWRNISGSSYITITRNQHLPQYCGSCWAHGTTSALGDRIKIGRKGTFPEVVLAPQ 108

Query:   187 QLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA 234
              LL+C+   +N C GG   +A+AY+   +GI  E   PY+A+   C+A
Sbjct:   109 VLLNCA-GPDNTCDGGDPTEAYAYMAA-KGITDETCAPYEAIDNECNA 154

 Score = 145 (56.1 bits), Expect = 2.9e-19, Sum P(2) = 2.9e-19
 Identities = 32/127 (25%), Positives = 63/127 (49%)

Query:   215 QGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
             +GI     +        C A        +  + +V +G    + +  +  P++  +   +
Sbjct:   155 EGICKNCNFDLSNPTADCFAQPTYTTYFVEEHGQV-NGSVAMMQEIFARGPIACGMEV-T 212

Query:   275 TEFQSYKEGIFNGVCGT--QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
               F+SY  G+F    G+  +++H ++I+G+GT E+G +YW+ +NSWG  +G+ G+ +I R
Sbjct:   213 DAFESYTSGVFTSSVGSTGEINHEISIIGWGT-ENGVDYWIGRNSWGTYFGELGFFRIQR 271

Query:   333 DEGLCGI 339
                L  I
Sbjct:   272 GIDLLSI 278


>UNIPROTKB|P07858 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9606 "Homo sapiens"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0005739 "mitochondrion" evidence=IEA] [GO:0042470 "melanosome"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042981 "regulation of apoptotic process" evidence=TAS]
            [GO:0006508 "proteolysis" evidence=IDA] [GO:0005764 "lysosome"
            evidence=IDA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEP] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IDA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IDA] [GO:0005622 "intracellular" evidence=TAS]
            [GO:0036021 "endolysosome lumen" evidence=TAS] [GO:0045087 "innate
            immune response" evidence=TAS] [GO:0008233 "peptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0005730 "nucleolus" evidence=IDA] [GO:0043231 "intracellular
            membrane-bounded organelle" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_118779 GO:GO:0005739
            GO:GO:0042470 GO:GO:0048471 Reactome:REACT_6900 GO:GO:0005615
            GO:GO:0009612 GO:GO:0009611 GO:GO:0005730 GO:GO:0042981
            GO:GO:0009897 GO:GO:0045471 GO:GO:0016324 GO:GO:0009749
            GO:GO:0006914 GO:GO:0043434 eggNOG:COG4870 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087
            GO:GO:0050790 GO:GO:0042383 GO:GO:0014070 GO:GO:0042277
            GO:GO:0060548 GO:GO:0005901 GO:GO:0014075 GO:GO:0004197
            GO:GO:0070670 EMBL:CH471157 GO:GO:0007519 MEROPS:C01.060 CTD:1508
            HOGENOM:HOG000241341 HOVERGEN:HBG003480 KO:K01363 OMA:GYPSGAW
            OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 GO:GO:0097067
            PANTHER:PTHR12411:SF16 EMBL:M14221 EMBL:L16510 EMBL:AK092070
            EMBL:AK075393 EMBL:BC010240 EMBL:BC095408 EMBL:M13230
            IPI:IPI00295741 PIR:A26498 RefSeq:NP_001899.1 RefSeq:NP_680090.1
            RefSeq:NP_680091.1 RefSeq:NP_680092.1 RefSeq:NP_680093.1
            UniGene:Hs.520898 PDB:1CSB PDB:1GMY PDB:1HUC PDB:1PBH PDB:2IPP
            PDB:2PBH PDB:3AI8 PDB:3CBJ PDB:3CBK PDB:3K9M PDB:3PBH PDBsum:1CSB
            PDBsum:1GMY PDBsum:1HUC PDBsum:1PBH PDBsum:2IPP PDBsum:2PBH
            PDBsum:3AI8 PDBsum:3CBJ PDBsum:3CBK PDBsum:3K9M PDBsum:3PBH
            ProteinModelPortal:P07858 SMR:P07858 DIP:DIP-42785N IntAct:P07858
            MINT:MINT-1397666 STRING:P07858 PhosphoSite:P07858 DMDM:68067549
            SWISS-2DPAGE:P07858 UCD-2DPAGE:P07858 PaxDb:P07858
            PeptideAtlas:P07858 PRIDE:P07858 DNASU:1508 Ensembl:ENST00000345125
            Ensembl:ENST00000353047 Ensembl:ENST00000434271
            Ensembl:ENST00000453527 Ensembl:ENST00000530640
            Ensembl:ENST00000531089 Ensembl:ENST00000533455
            Ensembl:ENST00000534510 GeneID:1508 KEGG:hsa:1508 UCSC:uc003wum.3
            GeneCards:GC08M011700 H-InvDB:HIX0007320 HGNC:HGNC:2527
            HPA:CAB000457 HPA:HPA018156 MIM:116810 neXtProt:NX_P07858
            PharmGKB:PA27027 InParanoid:P07858 PhylomeDB:P07858
            BindingDB:P07858 ChEMBL:CHEMBL4072 ChiTaRS:CTSB
            EvolutionaryTrace:P07858 GenomeRNAi:1508 NextBio:6235
            PMAP-CutDB:P07858 ArrayExpress:P07858 Bgee:P07858 CleanEx:HS_CTSB
            Genevestigator:P07858 GermOnline:ENSG00000164733 GO:GO:0036021
            Uniprot:P07858
        Length = 339

 Score = 182 (69.1 bits), Expect = 3.6e-19, Sum P(2) = 3.6e-19
 Identities = 36/96 (37%), Positives = 54/96 (56%)

Query:   245 NYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLD-HAVTIVGFG 303
             N   V + ++  + +     PV  A + YS +F  YK G++  V G  +  HA+ I+G+G
Sbjct:   228 NSYSVSNSEKDIMAEIYKNGPVEGAFSVYS-DFLLYKSGVYQHVTGEMMGGHAIRILGWG 286

Query:   304 TTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
               E+G  YWL+ NSW   WGD G+ KI+R +  CGI
Sbjct:   287 V-ENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGI 321

 Score = 107 (42.7 bits), Expect = 3.6e-19, Sum P(2) = 3.6e-19
 Identities = 40/143 (27%), Positives = 64/143 (44%)

Query:    76 EYIEKANKEGNRTYKLGTNQFS-DLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMT 134
             E +   NK  N T++ G N ++ D++  + R   T    P P  R       K       
Sbjct:    29 ELVNYVNKR-NTTWQAGHNFYNVDMSYLK-RLCGTFLGGPKPPQRVMFTEDLK------- 79

Query:   135 DVPTSLDWRDKGAVTP----IKNQKECGCCWAFAAVAAVEGITKIRSGN--LIQLSEQQL 188
              +P S D R++    P    I++Q  CG CWAF AV A+     I +     +++S + L
Sbjct:    80 -LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDL 138

Query:   189 LDC-STNGNNGCLGGSREKAFAY 210
             L C  +   +GC GG   +A+ +
Sbjct:   139 LTCCGSMCGDGCNGGYPAEAWNF 161


>UNIPROTKB|A1E295 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9823 "Sus scrofa"
            [GO:0005615 "extracellular space" evidence=IDA] [GO:0042470
            "melanosome" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730
            "nucleolus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0042470
            GO:GO:0005615 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 MEROPS:C01.060 CTD:1508
            GeneTree:ENSGT00560000076599 HOVERGEN:HBG003480 KO:K01363
            OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16 EMBL:EF095956
            RefSeq:NP_001090927.1 UniGene:Ssc.53773 ProteinModelPortal:A1E295
            SMR:A1E295 PRIDE:A1E295 Ensembl:ENSSSCT00000026923 GeneID:100037961
            KEGG:ssc:100037961 Uniprot:A1E295
        Length = 335

 Score = 181 (68.8 bits), Expect = 4.4e-19, Sum P(2) = 4.4e-19
 Identities = 38/112 (33%), Positives = 60/112 (53%)

Query:   229 PGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV 288
             PG   + ++      S+Y  +   +++ + +     PV  A   YS +F  YK G++  V
Sbjct:   213 PGYTPSYKEDKHFGCSSYS-ISRNEKEIMAEIYKNGPVEGAFTVYS-DFLQYKSGVYQHV 270

Query:   289 CGTQLD-HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
              G  +  HA+ I+G+G  E+G  YWL+ NSW   WGD G+ KI+R +  CGI
Sbjct:   271 TGDLMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGI 321

 Score = 107 (42.7 bits), Expect = 4.4e-19, Sum P(2) = 4.4e-19
 Identities = 26/74 (35%), Positives = 40/74 (54%)

Query:   136 VPTSLD----WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS-GNL-IQLSEQQLL 189
             +P S D    W +   +  I++Q  CG CWAF AV A+     IRS G + +++S + +L
Sbjct:    80 LPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDML 139

Query:   190 DCSTNG-NNGCLGG 202
              C  +   +GC GG
Sbjct:   140 TCCGDECGDGCNGG 153


>UNIPROTKB|P07688 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9913 "Bos taurus"
            [GO:0042470 "melanosome" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730
            "nucleolus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0042470
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 EMBL:L06075 EMBL:M64620
            EMBL:U16336 EMBL:U16337 EMBL:U16338 EMBL:U16339 EMBL:U16341
            EMBL:U16342 EMBL:U16343 EMBL:BC102997 IPI:IPI00692061 PIR:S38328
            RefSeq:NP_776456.1 UniGene:Bt.393 PDB:1ITO PDB:1QDQ PDB:1SP4
            PDB:2DC6 PDB:2DC7 PDB:2DC8 PDB:2DC9 PDB:2DCA PDB:2DCB PDB:2DCC
            PDB:2DCD PDBsum:1ITO PDBsum:1QDQ PDBsum:1SP4 PDBsum:2DC6
            PDBsum:2DC7 PDBsum:2DC8 PDBsum:2DC9 PDBsum:2DCA PDBsum:2DCB
            PDBsum:2DCC PDBsum:2DCD ProteinModelPortal:P07688 SMR:P07688
            STRING:P07688 MEROPS:C01.060 PRIDE:P07688
            Ensembl:ENSBTAT00000036795 GeneID:281105 KEGG:bta:281105 CTD:1508
            eggNOG:NOG315657 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 InParanoid:P07688 KO:K01363 OMA:GYPSGAW
            OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 BindingDB:P07688
            ChEMBL:CHEMBL2323 EvolutionaryTrace:P07688 NextBio:20805177
            ArrayExpress:P07688 GO:GO:0097067 PANTHER:PTHR12411:SF16
            Uniprot:P07688
        Length = 335

 Score = 182 (69.1 bits), Expect = 6.8e-19, Sum P(2) = 6.8e-19
 Identities = 39/112 (34%), Positives = 62/112 (55%)

Query:   229 PGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV 288
             PG   + ++      S+Y  V + +++ + +     PV  A + YS +F  YK G++  V
Sbjct:   213 PGYSPSYKEDKHFGCSSYS-VANNEKEIMAEIYKNGPVEGAFSVYS-DFLLYKSGVYQHV 270

Query:   289 CGTQLD-HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
              G  +  HA+ I+G+G  E+G  YWL+ NSW   WGD G+ KI+R +  CGI
Sbjct:   271 SGEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGI 321

 Score = 104 (41.7 bits), Expect = 6.8e-19, Sum P(2) = 6.8e-19
 Identities = 37/133 (27%), Positives = 59/133 (44%)

Query:    76 EYIEKANKEGNRTYKLGTNQFS-DLTNDE--FRALYTGYKMPSPSHRXXXXXXFKYQNLS 132
             E +   NK+ N T+K G N ++ DL+  +    A+  G K+P           F    + 
Sbjct:    29 ELVNFVNKQ-NTTWKAGHNFYNVDLSYVKKLCGAILGGPKLPQRD-------AFAADVVL 80

Query:   133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS-GNL-IQLSEQQLLD 190
                      W +   +  I++Q  CG CWAF AV A+     I S G + +++S + +L 
Sbjct:    81 PESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLT 140

Query:   191 CSTNG-NNGCLGG 202
             C      +GC GG
Sbjct:   141 CCGGECGDGCNGG 153


>UNIPROTKB|P43233 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9031 "Gallus gallus"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 MEROPS:C01.060 CTD:1508
            HOGENOM:HOG000241341 HOVERGEN:HBG003480 KO:K01363 OrthoDB:EOG4K6G4C
            PANTHER:PTHR12411:SF16 EMBL:U18083 IPI:IPI00573387 PIR:S58770
            RefSeq:NP_990702.1 UniGene:Gga.3854 ProteinModelPortal:P43233
            SMR:P43233 STRING:P43233 PRIDE:P43233 GeneID:396329 KEGG:gga:396329
            InParanoid:P43233 NextBio:20816377 Uniprot:P43233
        Length = 340

 Score = 183 (69.5 bits), Expect = 7.0e-19, Sum P(2) = 7.0e-19
 Identities = 40/112 (35%), Positives = 60/112 (53%)

Query:   229 PGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV 288
             PG   + ++     I++Y  VP  +++ + +     PV  A   Y  +F  YK G++  V
Sbjct:   214 PGYSPSYKEDKHYGITSYG-VPRSEKEIMAEIYKNGPVEGAFIVYE-DFLMYKSGVYQHV 271

Query:   289 CGTQLD-HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
              G Q+  HA+ I+G+G  E+G  YWL  NSW   WG  G+ KI+R E  CGI
Sbjct:   272 SGEQVGGHAIRILGWGV-ENGTPYWLAANSWNTDWGITGFFKILRGEDHCGI 322

 Score = 103 (41.3 bits), Expect = 7.0e-19, Sum P(2) = 7.0e-19
 Identities = 25/83 (30%), Positives = 40/83 (48%)

Query:   135 DVPTSLD----WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLI--QLSEQQL 188
             D+P + D    W +   ++ I++Q  CG CWAF AV A+     + +   +  ++S + L
Sbjct:    79 DLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDL 138

Query:   189 LDC-STNGNNGCLGGSREKAFAY 210
             L C       GC GG    A+ Y
Sbjct:   139 LSCCGFECGMGCNGGYPSGAWRY 161


>MGI|MGI:88561 [details] [associations]
            symbol:Ctsb "cathepsin B" species:10090 "Mus musculus"
            [GO:0004175 "endopeptidase activity" evidence=ISO] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISO] [GO:0005576
            "extracellular region" evidence=ISO] [GO:0005615 "extracellular
            space" evidence=ISO] [GO:0005737 "cytoplasm" evidence=ISO]
            [GO:0005739 "mitochondrion" evidence=ISO;IDA] [GO:0005764
            "lysosome" evidence=ISO;IDA] [GO:0005901 "caveola" evidence=ISO]
            [GO:0006508 "proteolysis" evidence=ISO] [GO:0008233 "peptidase
            activity" evidence=ISO] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISO] [GO:0009897 "external side of plasma
            membrane" evidence=ISO] [GO:0009986 "cell surface" evidence=ISO]
            [GO:0016324 "apical plasma membrane" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0030984 "kininogen binding"
            evidence=ISO] [GO:0032403 "protein complex binding" evidence=ISO]
            [GO:0042277 "peptide binding" evidence=ISO] [GO:0042383
            "sarcolemma" evidence=ISO] [GO:0043621 "protein self-association"
            evidence=ISO] [GO:0048471 "perinuclear region of cytoplasm"
            evidence=ISO] [GO:0050790 "regulation of catalytic activity"
            evidence=IEA] [GO:0060548 "negative regulation of cell death"
            evidence=ISO] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 MGI:MGI:88561
            GO:GO:0005739 GO:GO:0042470 GO:GO:0048471 GO:GO:0005615
            GO:GO:0009612 GO:GO:0009611 GO:GO:0009897 GO:GO:0045471
            GO:GO:0016324 GO:GO:0009749 GO:GO:0006914 GO:GO:0043434
            eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0042383 GO:GO:0014070
            GO:GO:0042277 GO:GO:0060548 GO:GO:0005901 GO:GO:0014075
            GO:GO:0004197 GO:GO:0070670 GO:GO:0007519 MEROPS:C01.060 CTD:1508
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 KO:K01363 OMA:GYPSGAW OrthoDB:EOG4K6G4C
            BRENDA:3.4.22.1 GO:GO:0097067 PANTHER:PTHR12411:SF16 ChiTaRS:CTSB
            EMBL:M65270 EMBL:M65263 EMBL:M65264 EMBL:M65265 EMBL:M65266
            EMBL:M65267 EMBL:M65268 EMBL:M65269 EMBL:M14222 EMBL:X54966
            EMBL:S69034 EMBL:AK083393 EMBL:AK147192 EMBL:AK149884 EMBL:AK151790
            EMBL:AK167361 EMBL:BC006656 IPI:IPI00113517 PIR:A38458
            RefSeq:NP_031824.1 UniGene:Mm.236553 UniGene:Mm.489070
            ProteinModelPortal:P10605 SMR:P10605 IntAct:P10605 STRING:P10605
            PhosphoSite:P10605 SWISS-2DPAGE:P10605 PaxDb:P10605 PRIDE:P10605
            Ensembl:ENSMUST00000006235 GeneID:13030 KEGG:mmu:13030
            UCSC:uc007uhh.1 InParanoid:P10605 BioCyc:MetaCyc:MONOMER-14810
            BindingDB:P10605 ChEMBL:CHEMBL5187 NextBio:282900 Bgee:P10605
            CleanEx:MM_CTSB Genevestigator:P10605 GermOnline:ENSMUSG00000021939
            Uniprot:P10605
        Length = 339

 Score = 170 (64.9 bits), Expect = 2.2e-18, Sum P(2) = 2.2e-18
 Identities = 37/114 (32%), Positives = 59/114 (51%)

Query:   231 TCSAAQKPAAAKISNYE----EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN 286
             +C A   P+  +  ++      V +  ++ + +     PV  A   +S +F +YK G++ 
Sbjct:   210 SCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFS-DFLTYKSGVYK 268

Query:   287 GVCGTQLD-HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
                G  +  HA+ I+G+G  E+G  YWL  NSW   WGD G+ KI+R E  CGI
Sbjct:   269 HEAGDMMGGHAIRILGWGV-ENGVPYWLAANSWNLDWGDNGFFKILRGENHCGI 321

 Score = 114 (45.2 bits), Expect = 2.2e-18, Sum P(2) = 2.2e-18
 Identities = 28/83 (33%), Positives = 46/83 (55%)

Query:   135 DVPTSLDWRDKGAVTP----IKNQKECGCCWAFAAVAAVEGITKIRS-GNL-IQLSEQQL 188
             D+P + D R++ +  P    I++Q  CG CWAF AV A+   T I + G + +++S + L
Sbjct:    79 DLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDL 138

Query:   189 LDC-STNGNNGCLGGSREKAFAY 210
             L C      +GC GG    A+++
Sbjct:   139 LTCCGIQCGDGCNGGYPSGAWSF 161


>UNIPROTKB|H0YDT2 [details] [associations]
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            EMBL:AP001201 HGNC:HGNC:2546 Ensembl:ENST00000526034 Bgee:H0YDT2
            Uniprot:H0YDT2
        Length = 211

 Score = 199 (75.1 bits), Expect = 3.7e-18, Sum P(2) = 3.7e-18
 Identities = 49/143 (34%), Positives = 68/143 (47%)

Query:    45 EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
             E  + +  Q  RSY    E   RL IF  NL   ++  +E   T + G   FSDLT +EF
Sbjct:    39 EAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEF 98

Query:   105 RALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLDWRD-KGAVTPIKNQKECGCCWAF 163
               LY GY+  +             +      VP S DWR    A++PIK+QK C CCWA 
Sbjct:    99 GQLY-GYRRAAGGVPSMGREIRSEE--PEESVPFSCDWRKVASAISPIKDQKNCNCCWAM 155

Query:   164 AAVAAVEGITKIRSGNLIQLSEQ 186
             AA   +E + +I   + + +S Q
Sbjct:   156 AAAGNIETLWRISFWDFVDVSVQ 178

 Score = 41 (19.5 bits), Expect = 3.7e-18, Sum P(2) = 3.7e-18
 Identities = 6/11 (54%), Positives = 10/11 (90%)

Query:   216 GIATEDEYPYQ 226
             G+A+E +YP+Q
Sbjct:   180 GLASEKDYPFQ 190


>WB|WBGene00000782 [details] [associations]
            symbol:cpr-2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 eggNOG:NOG315657 GeneTree:ENSGT00560000076599
            HOGENOM:HOG000241341 PANTHER:PTHR12411:SF16 EMBL:Z81531
            RefSeq:NP_507186.3 ProteinModelPortal:O45466 SMR:O45466
            MEROPS:C01.A40 PaxDb:O45466 EnsemblMetazoa:F36D3.9 GeneID:185355
            KEGG:cel:CELE_F36D3.9 CTD:185355 WormBase:F36D3.9 OMA:FDARLRW
            Uniprot:O45466
        Length = 326

 Score = 173 (66.0 bits), Expect = 6.3e-18, Sum P(2) = 6.3e-18
 Identities = 34/79 (43%), Positives = 49/79 (62%)

Query:   265 PVSIAIAAYSTEFQSYKEGIFNGVCG-TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWG 323
             PV  A   Y  +F+ YK GI+  + G ++  HAV ++G+GT E G  YWL  NSWG+ WG
Sbjct:   243 PVVAAFIVYE-DFEKYKSGIYRHIAGRSKGGHAVKLIGWGT-ERGTPYWLAVNSWGSQWG 300

Query:   324 DAGYMKIVRDEGLCGIGTR 342
             ++G  +I+R    CGI +R
Sbjct:   301 ESGTFRILRGVDECGIESR 319

 Score = 105 (42.0 bits), Expect = 6.3e-18, Sum P(2) = 6.3e-18
 Identities = 32/123 (26%), Positives = 50/123 (40%)

Query:   105 RALYTGYKMPSPSHRXXXXXXFKYQNLSMT-DVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
             R+++  +  P P         F      +  D  T   W    ++  I+ Q  CG CWAF
Sbjct:    57 RSMHEKFNAPFPDEFRATEREFVLDATPLNFDARTR--WPQCKSMKLIREQSNCGSCWAF 114

Query:   164 AAVAAVEGITKIRSGNLIQ--LSEQQLLDC-STNGNNGCLGGSREKAFAYIIQNQGIATE 220
             +    +   T I S    Q  +S   LL C   +   GC GG   +AF +  + +G+ T 
Sbjct:   115 STAEVISDRTCIASNGTQQPIISPTDLLTCCGMSCGEGCDGGFPYRAFQWWAR-RGVVTG 173

Query:   221 DEY 223
              +Y
Sbjct:   174 GDY 176


>WB|WBGene00000786 [details] [associations]
            symbol:cpr-6 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            PANTHER:PTHR12411:SF16 EMBL:L39894 EMBL:L39939 EMBL:FO080666
            PIR:T37274 RefSeq:NP_741818.1 UniGene:Cel.18138
            ProteinModelPortal:P43510 SMR:P43510 DIP:DIP-25139N
            MINT:MINT-1074025 STRING:P43510 MEROPS:C01.A51 PaxDb:P43510
            PRIDE:P43510 EnsemblMetazoa:C25B8.3a GeneID:180931
            KEGG:cel:CELE_C25B8.3 UCSC:C25B8.3a CTD:180931 WormBase:C25B8.3a
            InParanoid:P43510 OMA:KAKWGLM NextBio:911608 ArrayExpress:P43510
            Uniprot:P43510
        Length = 379

 Score = 146 (56.5 bits), Expect = 6.4e-18, Sum P(2) = 6.4e-18
 Identities = 32/90 (35%), Positives = 50/90 (55%)

Query:   253 DEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGIFNGVCGTQLD--HAVTIVGFGTTEDGA 309
             D +A+ K +    P+ IA   Y  +F +Y  G++    G +L   HAV ++G+G  +DG 
Sbjct:   262 DVEAIQKELMTHGPLEIAFEVYE-DFLNYDGGVYVHT-GGKLGGGHAVKLIGWGI-DDGI 318

Query:   310 NYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
              YW + NSW   WG+ G+ +I+R    CGI
Sbjct:   319 PYWTVANSWNTDWGEDGFFRILRGVDECGI 348

 Score = 140 (54.3 bits), Expect = 6.4e-18, Sum P(2) = 6.4e-18
 Identities = 38/103 (36%), Positives = 53/103 (51%)

Query:   127 KYQNLSMTDVPTSLDWRDK----GAVTPIKNQKECGCCWAFAAVAAVEGITKIRS-GNL- 180
             K ++L + D+P S D RD      ++  I++Q  CG CWAF AV A+     I S G L 
Sbjct:    97 KTKDLDL-DIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQ 155

Query:   181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
             + LS   LL C  +   GC GG    A+ Y +++ GI T   Y
Sbjct:   156 VTLSADDLLSCCKSCGFGCNGGDPLAAWRYWVKD-GIVTGSNY 197


>DICTYBASE|DDB_G0286055 [details] [associations]
            symbol:DDB_G0286055 "peptidase C1A family protein"
            species:44689 "Dictyostelium discoideum" [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 dictyBase:DDB_G0286055 Pfam:PF00188 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411 EMBL:AAFI02000085
            Gene3D:3.40.33.10 InterPro:IPR001283 InterPro:IPR014044
            PRINTS:PR00837 SMART:SM00198 SUPFAM:SSF55797
            ProtClustDB:CLSZ2429919 RefSeq:XP_637918.1
            ProteinModelPortal:Q54MB6 EnsemblProtists:DDB0186794 GeneID:8625429
            KEGG:ddi:DDB_G0286055 InParanoid:Q54MB6 OMA:GENGFAR Uniprot:Q54MB6
        Length = 435

 Score = 235 (87.8 bits), Expect = 7.4e-18, P = 7.4e-18
 Identities = 73/233 (31%), Positives = 102/233 (43%)

Query:   136 VPT--SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
             VPT  S DWRD G V   K+   C   WAF A    E  + +R+ +    S QQL+DC  
Sbjct:   206 VPTDGSFDWRDNGVVGFPKDSSNCASGWAFTAAGIFESRSAMRTRHRYDYSAQQLIDCIN 265

Query:   193 ------TN---GN-NGC--LGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPA 239
                   +N   GN   C    G   KA  Y  Q  G+     YPY       CS  Q   
Sbjct:   266 VCIIIFSNFSIGNYTKCSRFSGELNKALMYA-QAYGLQATSTYPYVGASSIGCSYNQSSI 324

Query:   240 AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF--NG--VCGTQLDH 295
             A +  + E    G +  + K     PV + I   + EF  Y  GIF  N   +    ++H
Sbjct:   325 AVEGGDVEYSQVGRDSIVEKCRKQGPVGVGIYV-TNEFLYYAGGIFECNNTLIDNANINH 383

Query:   296 AVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGL-CGIGTRSSYPL 347
              V +VG+   +   NY++IKN++G TWG+ G+ +I  D    C I    +Y +
Sbjct:   384 NVLLVGYNEKD---NYYIIKNNFGRTWGENGFARITADVNKDCLIAKNPAYSI 433


>FB|FBgn0030521 [details] [associations]
            symbol:CtsB1 "Cathepsin B1" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS] [GO:0035071 "salivary gland cell autophagic cell
            death" evidence=IEP] [GO:0048102 "autophagic cell death"
            evidence=IEP] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:AE014298 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0035071
            GO:GO:0004197 MEROPS:C01.060 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 KO:K01363 PANTHER:PTHR12411:SF16
            HSSP:P07688 EMBL:AY060640 RefSeq:NP_572920.1 UniGene:Dm.3926
            SMR:Q9VY87 IntAct:Q9VY87 MINT:MINT-932864 STRING:Q9VY87
            EnsemblMetazoa:FBtr0073838 GeneID:32341 KEGG:dme:Dmel_CG10992
            UCSC:CG10992-RA FlyBase:FBgn0030521 InParanoid:Q9VY87 OMA:TEGHIRR
            OrthoDB:EOG48W9HM ChiTaRS:CG10992 GenomeRNAi:32341 NextBio:778020
            Uniprot:Q9VY87
        Length = 340

 Score = 151 (58.2 bits), Expect = 8.5e-18, Sum P(2) = 8.5e-18
 Identities = 31/77 (40%), Positives = 44/77 (57%)

Query:   265 PVSIAIAAYSTEFQSYKEGIFNGVCGTQLD-HAVTIVGFGTT-EDGANYWLIKNSWGNTW 322
             PV  A   Y  +   YK+G++    G +L  HA+ I+G+G   E+   YWLI NSW   W
Sbjct:   254 PVEGAFTVYE-DLILYKDGVYQHEHGKELGGHAIRILGWGVWGEEKIPYWLIGNSWNTDW 312

Query:   323 GDAGYMKIVRDEGLCGI 339
             GD G+ +I+R +  CGI
Sbjct:   313 GDHGFFRILRGQDHCGI 329

 Score = 131 (51.2 bits), Expect = 8.5e-18, Sum P(2) = 8.5e-18
 Identities = 44/158 (27%), Positives = 67/158 (42%)

Query:    76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY----TGYKMPSPSHRXXXXXXFKYQNL 131
             E+IE    +  +T+ +G N  + +T    R L       +K   P  R        Y N 
Sbjct:    27 EFIEVVRSKA-KTWTVGRNFDASVTEGHIRRLMGVHPDAHKFALPDKREVLGDL--YVN- 82

Query:   132 SMTDVPTSLD----WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL--SE 185
             S+ ++P   D    W +   +  I++Q  CG CWAF AV A+     I SG  +    S 
Sbjct:    83 SVDELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSA 142

Query:   186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
               L+ C      GC GG    A++Y  + +GI +   Y
Sbjct:   143 DDLVSCCHTCGFGCNGGFPGAAWSYWTR-KGIVSGGPY 179


>MGI|MGI:1891190 [details] [associations]
            symbol:Ctsz "cathepsin Z" species:10090 "Mus musculus"
            [GO:0005615 "extracellular space" evidence=ISO] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1891190 GO:GO:0005783 GO:GO:0005615 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0060441 GeneTree:ENSGT00560000076599 MEROPS:C01.013 CTD:1522
            HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN OMA:QCGTCTE
            ChiTaRS:CTSZ EMBL:AJ242663 EMBL:AF136277 EMBL:AF136278
            EMBL:BC008619 IPI:IPI00986833 RefSeq:NP_071720.1 UniGene:Mm.156919
            ProteinModelPortal:Q9WUU7 SMR:Q9WUU7 IntAct:Q9WUU7 STRING:Q9WUU7
            PaxDb:Q9WUU7 PRIDE:Q9WUU7 Ensembl:ENSMUST00000016400 GeneID:64138
            KEGG:mmu:64138 InParanoid:Q9WUU7 NextBio:319927 Bgee:Q9WUU7
            CleanEx:MM_CTSZ Genevestigator:Q9WUU7 GermOnline:ENSMUSG00000016256
            Uniprot:Q9WUU7
        Length = 306

 Score = 221 (82.9 bits), Expect = 2.1e-17, P = 2.1e-17
 Identities = 68/212 (32%), Positives = 100/212 (47%)

Query:   157 CGCCWAFAAVAAVEGITKI-RSGNL--IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
             CG CWA  + +A+     I R G    I LS Q ++DC   G+  C GG+    + Y   
Sbjct:    91 CGSCWAHGSTSAMADRINIKRKGAWPSILLSVQNVIDCGNAGS--CEGGNDLPVWEYA-H 147

Query:   214 NQGIATEDEYPYQAVP---------GTCS------AAQKPAAAKISNYEEVPSGDEQALL 258
               GI  E    YQA           GTC+        Q     ++ +Y  + SG E+ + 
Sbjct:   148 KHGIPDETCNNYQAKDQDCDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSL-SGREKMMA 206

Query:   259 KAVSMQPVSIAIAAYSTEFQS-YKEGIF-NGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
             +  +  P+S  I A  TE  S Y  GI+        ++H +++ G+G + DG  YW+++N
Sbjct:   207 EIYANGPISCGIMA--TEMMSNYTGGIYAEHQDQAVINHIISVAGWGVSNDGIEYWIVRN 264

Query:   317 SWGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 348
             SWG  WG+ G+M+IV      G GT  SY LA
Sbjct:   265 SWGEPWGEKGWMRIVTST-YKG-GTGDSYNLA 294

 Score = 116 (45.9 bits), Expect = 0.00031, P = 0.00031
 Identities = 37/114 (32%), Positives = 53/114 (46%)

Query:   128 YQNLSMTDVPTSLDWRDKGAV---TPIKNQ---KECGCCWAFAAVAAVEGITKI-RSGNL 180
             ++ LS  D+P + DWR+   V   +  +NQ   + CG CWA  + +A+     I R G  
Sbjct:    56 HEYLSPADLPKNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAW 115

Query:   181 --IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC 232
               I LS Q ++DC   G+  C GG+    + Y     GI  E    YQA    C
Sbjct:   116 PSILLSVQNVIDCGNAGS--CEGGNDLPVWEYA-HKHGIPDETCNNYQAKDQDC 166


>RGD|708479 [details] [associations]
            symbol:Ctsz "cathepsin Z" species:10116 "Rattus norvegicus"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=TAS]
            [GO:0005615 "extracellular space" evidence=IEA;ISO] [GO:0005783
            "endoplasmic reticulum" evidence=IEA;ISO] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0060441 "epithelial tube branching involved in
            lung morphogenesis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:708479 GO:GO:0005576 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004197 MEROPS:C01.013 CTD:1522 HOVERGEN:HBG004456 KO:K08568
            EMBL:AB023781 EMBL:BC091110 IPI:IPI00207663 RefSeq:NP_899159.1
            UniGene:Rn.1475 ProteinModelPortal:Q9R1T3 SMR:Q9R1T3 PRIDE:Q9R1T3
            GeneID:252929 KEGG:rno:252929 BindingDB:Q9R1T3 NextBio:624097
            Genevestigator:Q9R1T3 Uniprot:Q9R1T3
        Length = 306

 Score = 221 (82.9 bits), Expect = 2.1e-17, P = 2.1e-17
 Identities = 65/211 (30%), Positives = 99/211 (46%)

Query:   157 CGCCWAFAAVAAVEGITKI-RSGNLIQ--LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
             CG CWA  + +A+     I R G      LS Q ++DC   G+  C GG+    + Y   
Sbjct:    91 CGSCWAHGSTSALADRINIKRKGAWPSTLLSVQNVIDCGNAGS--CEGGNDLPVWEYA-H 147

Query:   214 NQGIATEDEYPYQAVP---------GTCS------AAQKPAAAKISNYEEVPSGDEQALL 258
               GI  E    YQA           GTC+        Q     ++ +Y  + SG E+ + 
Sbjct:   148 KHGIPDETCNNYQAKDQECDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSL-SGREKMMA 206

Query:   259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQL-DHAVTIVGFGTTEDGANYWLIKNS 317
             +  +  P+S  I A +    +Y  GI+       + +H +++ G+G + DG  YW+++NS
Sbjct:   207 EIYANGPISCGIMA-TERMSNYTGGIYTEYQNQAIINHIISVAGWGVSNDGIEYWIVRNS 265

Query:   318 WGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 348
             WG  WG+ G+M+IV      G GT SSY LA
Sbjct:   266 WGEPWGERGWMRIVTST-YKG-GTGSSYNLA 294


>RGD|621509 [details] [associations]
            symbol:Ctsb "cathepsin B" species:10116 "Rattus norvegicus"
            [GO:0004175 "endopeptidase activity" evidence=IMP;IDA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA;ISO;IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005576
            "extracellular region" evidence=IDA] [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0005730 "nucleolus" evidence=IEA;ISO]
            [GO:0005737 "cytoplasm" evidence=IDA] [GO:0005739 "mitochondrion"
            evidence=IEA;ISO;IDA] [GO:0005764 "lysosome" evidence=IEA;ISO;IDA]
            [GO:0006508 "proteolysis" evidence=IEA;IEP;ISO;IMP;IDA;TAS]
            [GO:0006914 "autophagy" evidence=IEP] [GO:0006950 "response to
            stress" evidence=IEP] [GO:0007283 "spermatogenesis" evidence=IEP]
            [GO:0007519 "skeletal muscle tissue development" evidence=IEP]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0009611
            "response to wounding" evidence=IEP] [GO:0009612 "response to
            mechanical stimulus" evidence=IEP] [GO:0009749 "response to glucose
            stimulus" evidence=IEP] [GO:0009897 "external side of plasma
            membrane" evidence=IDA] [GO:0009986 "cell surface" evidence=IDA]
            [GO:0014070 "response to organic cyclic compound" evidence=IEP]
            [GO:0014075 "response to amine stimulus" evidence=IEP] [GO:0016324
            "apical plasma membrane" evidence=IDA] [GO:0030984 "kininogen
            binding" evidence=IPI] [GO:0032403 "protein complex binding"
            evidence=IPI] [GO:0034097 "response to cytokine stimulus"
            evidence=IEP] [GO:0042277 "peptide binding" evidence=IDA]
            [GO:0042383 "sarcolemma" evidence=IDA] [GO:0042470 "melanosome"
            evidence=IEA] [GO:0043231 "intracellular membrane-bounded
            organelle" evidence=ISO] [GO:0043434 "response to peptide hormone
            stimulus" evidence=IEP] [GO:0043621 "protein self-association"
            evidence=IDA] [GO:0045471 "response to ethanol" evidence=IEP]
            [GO:0048471 "perinuclear region of cytoplasm" evidence=ISO;IDA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0060548 "negative regulation of cell death" evidence=IMP]
            [GO:0070670 "response to interleukin-4" evidence=IEP] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA;ISO]
            [GO:0005901 "caveola" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:621509 GO:GO:0005739 GO:GO:0042470
            GO:GO:0005615 GO:GO:0009612 GO:GO:0009611 GO:GO:0009897
            GO:GO:0045471 GO:GO:0016324 GO:GO:0009749 GO:GO:0006914
            GO:GO:0043434 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0007283
            GO:GO:0005764 GO:GO:0042383 GO:GO:0043621 GO:GO:0014070
            GO:GO:0042277 GO:GO:0060548 GO:GO:0014075 GO:GO:0004197
            GO:GO:0070670 GO:GO:0007519 MEROPS:C01.060 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 GO:GO:0097067
            PANTHER:PTHR12411:SF16 EMBL:X82396 EMBL:M11305 IPI:IPI00212811
            PIR:S51041 UniGene:Rn.100909 PDB:1CPJ PDB:1CTE PDB:1MIR PDB:1THE
            PDBsum:1CPJ PDBsum:1CTE PDBsum:1MIR PDBsum:1THE
            ProteinModelPortal:P00787 SMR:P00787 STRING:P00787 PRIDE:P00787
            UCSC:RGD:621509 InParanoid:P00787 SABIO-RK:P00787 BindingDB:P00787
            ChEMBL:CHEMBL2602 EvolutionaryTrace:P00787 ArrayExpress:P00787
            Genevestigator:P00787 GermOnline:ENSRNOG00000010331 Uniprot:P00787
        Length = 339

 Score = 172 (65.6 bits), Expect = 2.2e-17, Sum P(2) = 2.2e-17
 Identities = 34/92 (36%), Positives = 52/92 (56%)

Query:   249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLD-HAVTIVGFGTTED 307
             V   +++ + +     PV  A   +S +F +YK G++    G  +  HA+ I+G+G  E+
Sbjct:   232 VSDSEKEIMAEIYKNGPVEGAFTVFS-DFLTYKSGVYKHEAGDVMGGHAIRILGWGI-EN 289

Query:   308 GANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             G  YWL+ NSW   WGD G+ KI+R E  CGI
Sbjct:   290 GVPYWLVANSWNVDWGDNGFFKILRGENHCGI 321

 Score = 102 (41.0 bits), Expect = 2.2e-17, Sum P(2) = 2.2e-17
 Identities = 25/75 (33%), Positives = 39/75 (52%)

Query:   135 DVPTSLD----WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS-GNL-IQLSEQQL 188
             ++P S D    W +   +  I++Q  CG CWAF AV A+     I + G + +++S + L
Sbjct:    79 NLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDL 138

Query:   189 LDC-STNGNNGCLGG 202
             L C      +GC GG
Sbjct:   139 LTCCGIQCGDGCNGG 153


>UNIPROTKB|Q6IN22 [details] [associations]
            symbol:Ctsb "Cathepsin B" species:10116 "Rattus norvegicus"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 RGD:621509 GO:GO:0005739
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 CTD:1508
            GeneTree:ENSGT00560000076599 HOVERGEN:HBG003480 KO:K01363
            OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16 UniGene:Rn.100909
            EMBL:CH474023 HSSP:P00785 EMBL:BC072490 IPI:IPI00562653
            RefSeq:NP_072119.2 SMR:Q6IN22 IntAct:Q6IN22 STRING:Q6IN22
            Ensembl:ENSRNOT00000014177 GeneID:64529 KEGG:rno:64529
            InParanoid:Q6IN22 NextBio:613362 Genevestigator:Q6IN22
            Uniprot:Q6IN22
        Length = 339

 Score = 172 (65.6 bits), Expect = 2.2e-17, Sum P(2) = 2.2e-17
 Identities = 34/92 (36%), Positives = 52/92 (56%)

Query:   249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLD-HAVTIVGFGTTED 307
             V   +++ + +     PV  A   +S +F +YK G++    G  +  HA+ I+G+G  E+
Sbjct:   232 VSDSEKEIMAEIYKNGPVEGAFTVFS-DFLTYKSGVYKHEAGDVMGGHAIRILGWGI-EN 289

Query:   308 GANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             G  YWL+ NSW   WGD G+ KI+R E  CGI
Sbjct:   290 GVPYWLVANSWNVDWGDNGFFKILRGENHCGI 321

 Score = 102 (41.0 bits), Expect = 2.2e-17, Sum P(2) = 2.2e-17
 Identities = 25/75 (33%), Positives = 39/75 (52%)

Query:   135 DVPTSLD----WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS-GNL-IQLSEQQL 188
             ++P S D    W +   +  I++Q  CG CWAF AV A+     I + G + +++S + L
Sbjct:    79 NLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDL 138

Query:   189 LDC-STNGNNGCLGG 202
             L C      +GC GG
Sbjct:   139 LTCCGIQCGDGCNGG 153


>UNIPROTKB|F1PIF2 [details] [associations]
            symbol:CTSZ "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0060441 "epithelial tube branching involved
            in lung morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0005783 GO:GO:0005615 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GO:GO:0060441 GeneTree:ENSGT00560000076599 OMA:QCGTCTE
            EMBL:AAEX03014054 Ensembl:ENSCAFT00000019357 Uniprot:F1PIF2
        Length = 261

 Score = 213 (80.0 bits), Expect = 3.1e-17, P = 3.1e-17
 Identities = 66/211 (31%), Positives = 104/211 (49%)

Query:   157 CGCCWAFAAVAAVEGITKI-RSGNLIQ--LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
             CG CWA  + +A+     I R G      LS Q +LDC+  G+  C GG+    ++Y  +
Sbjct:    47 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVLDCANAGS--CEGGNDLPVWSYAHE 104

Query:   214 NQGIATEDEYPYQAVP---------GTCS------AAQKPAAAKISNYEEVPSGDEQALL 258
             + GI  E    YQA           GTC+      A Q     ++ +Y  + SG E+ + 
Sbjct:   105 H-GIPDETCNNYQAKDQECNKFNQCGTCTEFKECHAIQNYTLWRVGDYGSL-SGREKMMA 162

Query:   259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCG-TQLDHAVTIVGFGTTEDGANYWLIKNS 317
             +  +  P+S  I A + +  +Y  GI         ++H +++VG+G + DG  YW+++NS
Sbjct:   163 EIYANGPISCGIMA-TEKMVNYTGGIHAEYQEQAYINHVISVVGWGVS-DGTEYWIVRNS 220

Query:   318 WGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 348
             WG  WG+ G+M+IV      G G  +SY LA
Sbjct:   221 WGEPWGERGWMRIVTSTYKDGKG--ASYNLA 249

 Score = 122 (48.0 bits), Expect = 4.4e-05, P = 4.4e-05
 Identities = 38/115 (33%), Positives = 58/115 (50%)

Query:   128 YQNLSMTDVPTSLDWRDKGAV---TPIKNQ---KECGCCWAFAAVAAVEGITKI-RSGNL 180
             ++ LS +D+P S DWR+   V   +  +NQ   + CG CWA  + +A+     I R G  
Sbjct:    12 HEYLSPSDLPKSWDWRNVNGVNYASATRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAW 71

Query:   181 IQ--LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS 233
                 LS Q +LDC+  G+  C GG+    ++Y  ++ GI  E    YQA    C+
Sbjct:    72 PSTLLSVQHVLDCANAGS--CEGGNDLPVWSYAHEH-GIPDETCNNYQAKDQECN 123


>WB|WBGene00000789 [details] [associations]
            symbol:cpz-2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GeneTree:ENSGT00560000076599 KO:K08568 EMBL:Z81103
            HSSP:P80067 PIR:T23720 RefSeq:NP_506318.1 ProteinModelPortal:P92005
            SMR:P92005 STRING:P92005 MEROPS:C01.A41 PaxDb:P92005
            EnsemblMetazoa:M04G12.2 GeneID:179818 KEGG:cel:CELE_M04G12.2
            UCSC:M04G12.2 CTD:179818 WormBase:M04G12.2 eggNOG:NOG275763
            InParanoid:P92005 OMA:VEYWIAR NextBio:906990 Uniprot:P92005
        Length = 467

 Score = 229 (85.7 bits), Expect = 5.8e-17, P = 5.8e-17
 Identities = 59/212 (27%), Positives = 100/212 (47%)

Query:   149 TPIKNQK---ECGCCWAFAAVAAVEGITKI-RSGN--LIQLSEQQLLDCSTNGNNGCLGG 202
             +P +NQ     CG CW F    A+     + R G   + QLS Q+++DC  NG   C GG
Sbjct:   237 SPTRNQHIPVYCGSCWVFGTTGALNDRFNVARKGRWPMTQLSPQEIIDC--NGKGNCQGG 294

Query:   203 SREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAA------KISNYEEV---PSGD 253
                    +  + QG+  E    Y+A  G C+   +  +        ++NY        G 
Sbjct:   295 EIGNVLEHA-KIQGLVEEGCNVYRATNGECNPYHRCGSCWPNECFSLTNYTRYYVKDYGQ 353

Query:   254 EQALLKAVSM----QPVSIAIAAYSTEFQ-SYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
              Q   K +S      P++ AI A + +F+  Y +G+++     + +H +++ G+G  E+G
Sbjct:   354 VQGRDKIMSEIKKGGPIACAIGA-TKKFEYEYVKGVYSEKSDLESNHIISLTGWGVDENG 412

Query:   309 ANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIG 340
               YW+ +NSWG  WG+ G+ ++V  +   G G
Sbjct:   413 VEYWIARNSWGEAWGELGWFRVVTSKFKDGQG 444

 Score = 142 (55.0 bits), Expect = 8.1e-07, P = 8.1e-07
 Identities = 36/108 (33%), Positives = 51/108 (47%)

Query:   135 DVPTSLDWRDKGAV---TPIKNQK---ECGCCWAFAAVAAVEGITKI-RSGN--LIQLSE 185
             D+PT  DWR+   V   +P +NQ     CG CW F    A+     + R G   + QLS 
Sbjct:   220 DLPTGWDWRNVSGVNYCSPTRNQHIPVYCGSCWVFGTTGALNDRFNVARKGRWPMTQLSP 279

Query:   186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS 233
             Q+++DC  NG   C GG       +  + QG+  E    Y+A  G C+
Sbjct:   280 QEIIDC--NGKGNCQGGEIGNVLEHA-KIQGLVEEGCNVYRATNGECN 324


>WB|WBGene00000783 [details] [associations]
            symbol:cpr-3 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0009792 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39890 EMBL:L39925 EMBL:Z81119
            EMBL:Z82057 PIR:T37282 RefSeq:NP_506790.1 UniGene:Cel.23503
            ProteinModelPortal:P43507 SMR:P43507 MEROPS:C01.A33
            EnsemblMetazoa:T10H4.12 GeneID:180033 KEGG:cel:CELE_T10H4.12
            UCSC:T10H4.12 CTD:180033 WormBase:T10H4.12 eggNOG:NOG240190
            InParanoid:P43507 OMA:PVEASYK NextBio:907824 Uniprot:P43507
        Length = 370

 Score = 154 (59.3 bits), Expect = 2.0e-16, Sum P(2) = 2.0e-16
 Identities = 31/76 (40%), Positives = 45/76 (59%)

Query:   265 PVSIAIAAYSTEFQSYKEGIFNGVCGTQLD-HAVTIVGFGTTEDGANYWLIKNSWGNTWG 323
             PV  +   Y  +F  YK G+++   G  +  HAV I+G+G  E+G +YWLI NSWG ++G
Sbjct:   254 PVEASYKVYE-DFYHYKSGVYHYTSGKLVGGHAVKIIGWGV-ENGVDYWLIANSWGTSFG 311

Query:   324 DAGYMKIVRDEGLCGI 339
             + G+ KI R    C I
Sbjct:   312 EKGFFKIRRGTNECQI 327

 Score = 116 (45.9 bits), Expect = 2.0e-16, Sum P(2) = 2.0e-16
 Identities = 40/161 (24%), Positives = 62/161 (38%)

Query:    94 NQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDVPTSLD----WRDKGAVT 149
             N+ S+    +F+ +   +  P           F    +    +P + D    W D   + 
Sbjct:    51 NEISEFEM-KFKVMDVKFAEPLEKDSDVASELFVRGEIVPEPLPDTFDAREKWPDCNTIK 109

Query:   150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ--LSEQQLLDC-STNGNNGCLGGSREK 206
              I+NQ  CG CWAF A   +     I+S    Q  +S + +L C  T    GC GG   +
Sbjct:   110 LIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDILSCCGTTCGYGCKGGYSIE 169

Query:   207 AFAYIIQNQGIATEDEY------PYQAVPGT--CSAAQKPA 239
             A  +   + G  T  +Y      PY   P T  C  +  P+
Sbjct:   170 ALRFWASS-GAVTGGDYGGHGCMPYSFAPCTKNCPESTTPS 209

 Score = 41 (19.5 bits), Expect = 1.2e-08, Sum P(2) = 1.2e-08
 Identities = 11/32 (34%), Positives = 18/32 (56%)

Query:    46 IHEKWMAQHGRSYKDELE-KEMRLKIFKENLE 76
             +   W+A+H    + E++ K M +K F E LE
Sbjct:    42 VQTSWVAEHNEISEFEMKFKVMDVK-FAEPLE 72


>UNIPROTKB|Q9UBR2 [details] [associations]
            symbol:CTSZ "Cathepsin Z" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0060441 "epithelial tube
            branching involved in lung morphogenesis" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            Reactome:REACT_11123 Reactome:REACT_17015 InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 EMBL:CH471077 GO:GO:0005615 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AL109840 GO:GO:0060441 MEROPS:C01.013 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN
            BRENDA:3.4.18.1 EMBL:AF073890 EMBL:AF032906 EMBL:AF136273
            EMBL:AF136276 EMBL:AF136274 EMBL:AF136275 EMBL:AK314931
            EMBL:BC042168 EMBL:AF009923 IPI:IPI00002745 RefSeq:NP_001327.2
            UniGene:Hs.252549 PDB:1DEU PDB:1EF7 PDBsum:1DEU PDBsum:1EF7
            ProteinModelPortal:Q9UBR2 SMR:Q9UBR2 STRING:Q9UBR2 DMDM:12643324
            PaxDb:Q9UBR2 PeptideAtlas:Q9UBR2 PRIDE:Q9UBR2 DNASU:1522
            Ensembl:ENST00000217131 GeneID:1522 KEGG:hsa:1522 UCSC:uc002yai.2
            GeneCards:GC20M057570 HGNC:HGNC:2547 HPA:CAB025114 MIM:603169
            neXtProt:NX_Q9UBR2 PharmGKB:PA27043 InParanoid:Q9UBR2 OMA:QCGTCTE
            PhylomeDB:Q9UBR2 BindingDB:Q9UBR2 ChEMBL:CHEMBL4160 ChiTaRS:CTSZ
            EvolutionaryTrace:Q9UBR2 GenomeRNAi:1522 NextBio:6299 Bgee:Q9UBR2
            CleanEx:HS_CTSZ Genevestigator:Q9UBR2 GermOnline:ENSG00000101160
            Uniprot:Q9UBR2
        Length = 303

 Score = 214 (80.4 bits), Expect = 2.8e-16, P = 2.8e-16
 Identities = 66/211 (31%), Positives = 103/211 (48%)

Query:   157 CGCCWAFAAVAAVEGITKI-RSGNLIQ--LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
             CG CWA A+ +A+     I R G      LS Q ++DC   G+  C GG+    + Y  Q
Sbjct:    89 CGSCWAHASTSAMADRINIKRKGAWPSTLLSVQNVIDCGNAGS--CEGGNDLSVWDYAHQ 146

Query:   214 NQGIATEDEYPYQAVP---------GTCSAAQKPAAAK------ISNYEEVPSGDEQALL 258
             + GI  E    YQA           GTC+  ++  A +      + +Y  + SG E+ + 
Sbjct:   147 H-GIPDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSL-SGREKMMA 204

Query:   259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNS 317
             +  +  P+S  I A +    +Y  GI+     T  ++H V++ G+G + DG  YW+++NS
Sbjct:   205 EIYANGPISCGIMA-TERLANYTGGIYAEYQDTTYINHVVSVAGWGIS-DGTEYWIVRNS 262

Query:   318 WGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 348
             WG  WG+ G+++IV      G G R  Y LA
Sbjct:   263 WGEPWGERGWLRIVTSTYKDGKGAR--YNLA 291

 Score = 119 (46.9 bits), Expect = 0.00014, P = 0.00014
 Identities = 39/114 (34%), Positives = 55/114 (48%)

Query:   128 YQNLSMTDVPTSLDWRDKGAV---TPIKNQ---KECGCCWAFAAVAAVEGITKI-RSGNL 180
             ++ LS  D+P S DWR+   V   +  +NQ   + CG CWA A+ +A+     I R G  
Sbjct:    54 HEYLSPADLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAW 113

Query:   181 IQ--LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC 232
                 LS Q ++DC   G+  C GG+    + Y  Q+ GI  E    YQA    C
Sbjct:   114 PSTLLSVQNVIDCGNAGS--CEGGNDLSVWDYAHQH-GIPDETCNNYQAKDQEC 164


>ZFIN|ZDB-GENE-041010-139 [details] [associations]
            symbol:ctsz "cathepsin Z" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0001525 "angiogenesis"
            evidence=IMP] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 ZFIN:ZDB-GENE-041010-139 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0001525
            CTD:1522 HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568
            OrthoDB:EOG42Z4QN UniGene:Dr.935 eggNOG:NOG275763 EMBL:BC083369
            IPI:IPI00483065 RefSeq:NP_001006043.1 ProteinModelPortal:Q5XJD4
            SMR:Q5XJD4 STRING:Q5XJD4 GeneID:450022 KEGG:dre:450022
            InParanoid:Q5XJD4 NextBio:20833005 ArrayExpress:Q5XJD4
            Uniprot:Q5XJD4
        Length = 301

 Score = 211 (79.3 bits), Expect = 7.4e-16, P = 7.4e-16
 Identities = 61/211 (28%), Positives = 99/211 (46%)

Query:   157 CGCCWAFAAVAAVEGITKIR---SGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
             CG CWA  + +A+     I+   +     LS Q ++DC   G+  C GG     + Y   
Sbjct:    81 CGSCWAHGSTSALADRINIKRKAAWPSAYLSVQNVIDCGDAGS--CSGGDHSGVWEYA-H 137

Query:   214 NQGIATEDEYPYQAV-----P----------GTCSAAQKPAAAKISNYEEVPSGDEQALL 258
             N+GI  E    YQA      P          G C+  +     K+ +Y    SG ++   
Sbjct:   138 NKGIPDETCNNYQAKDQDCKPFNQCGTCTTFGVCNIVKNFTLWKVGDYGSA-SGLDKMKA 196

Query:   259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
             +  S  P+S  I A + +  +Y  G+++  V    ++H V++ G+G  E+G  +W+++NS
Sbjct:   197 EIYSGGPISCGIMA-TDKLDAYTGGLYSEYVQEPYINHIVSVAGWGVDENGVEFWVVRNS 255

Query:   318 WGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 348
             WG  WG+ G+++IV      G G  S Y LA
Sbjct:   256 WGEPWGEKGWLRIVTSAYKGGSG--SQYNLA 284

 Score = 127 (49.8 bits), Expect = 1.7e-05, P = 1.7e-05
 Identities = 35/114 (30%), Positives = 56/114 (49%)

Query:   128 YQNLSMTDVPTSLDWRD-KGA--VTPIKNQ---KECGCCWAFAAVAAVEGITKIR---SG 178
             Y+++++ ++P   DWR+ KG   V+  +NQ   + CG CWA  + +A+     I+   + 
Sbjct:    46 YESMNLKELPKEWDWRNIKGVNYVSTTRNQHIPQYCGSCWAHGSTSALADRINIKRKAAW 105

Query:   179 NLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC 232
                 LS Q ++DC   G+  C GG     + Y   N+GI  E    YQA    C
Sbjct:   106 PSAYLSVQNVIDCGDAGS--CSGGDHSGVWEYA-HNKGIPDETCNNYQAKDQDC 156


>FB|FBgn0034709 [details] [associations]
            symbol:Swim "Secreted Wg-interacting molecule" species:7227
            "Drosophila melanogaster" [GO:0004197 "cysteine-type endopeptidase
            activity" evidence=ISS] [GO:0006955 "immune response" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044 "scavenger
            receptor activity" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0042600 "chorion" evidence=IDA]
            [GO:0035593 "positive regulation of Wnt receptor signaling pathway
            by establishment of Wnt protein localization to extracellular
            region" evidence=IMP] [GO:0030177 "positive regulation of Wnt
            receptor signaling pathway" evidence=IDA] [GO:0005615
            "extracellular space" evidence=IDA] [GO:0017147 "Wnt-protein
            binding" evidence=IDA] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR013128 Pfam:PF00112 PROSITE:PS50958 SMART:SM00201
            SMART:SM00645 EMBL:AE013599 GO:GO:0005615 GO:GO:0006955
            GO:GO:0030247 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GO:GO:0017147 GO:GO:0005044
            GeneTree:ENSGT00560000076599 GO:GO:0042600 eggNOG:NOG310046
            OMA:DNCNRCT HSSP:P80067 EMBL:AY113377 RefSeq:NP_611652.2
            RefSeq:NP_726176.1 UniGene:Dm.732 SMR:Q7JWQ7 IntAct:Q7JWQ7
            EnsemblMetazoa:FBtr0071784 EnsemblMetazoa:FBtr0071785 GeneID:37537
            KEGG:dme:Dmel_CG3074 UCSC:CG3074-RA FlyBase:FBgn0034709
            HOGENOM:HOG000264150 InParanoid:Q7JWQ7 OrthoDB:EOG48CZ9P
            GenomeRNAi:37537 NextBio:804155 GO:GO:0035593 Uniprot:Q7JWQ7
        Length = 431

 Score = 141 (54.7 bits), Expect = 8.9e-16, Sum P(2) = 8.9e-16
 Identities = 27/79 (34%), Positives = 42/79 (53%)

Query:   265 PVSIAIAAYSTEFQSYKEGIFNGVCGTQLD----HAVTIVGFGTTEDGANYWLIKNSWGN 320
             PV  A    + +F +Y  G++      +      H+V +VG+G   +G  YW+  NSWG+
Sbjct:   334 PVQ-ATMRVNRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGS 392

Query:   321 TWGDAGYMKIVRDEGLCGI 339
              WG+ GY +I+R    CGI
Sbjct:   393 WWGEHGYFRILRGSNECGI 411

 Score = 127 (49.8 bits), Expect = 8.9e-16, Sum P(2) = 8.9e-16
 Identities = 34/129 (26%), Positives = 60/129 (46%)

Query:   134 TD-VPTSLDWRDKGA--VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL--IQLSEQQL 188
             TD +P+S +  DK +  ++ + +Q  CG  W  +  +       I+S     +QLS Q +
Sbjct:   184 TDGLPSSFNALDKWSSYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNI 243

Query:   189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
             L C T    GC GG  + A+ Y+   +G+  E+ YPY     TC       + + +  ++
Sbjct:   244 LSC-TRRQQGCEGGHLDAAWRYL-HKKGVVDENCYPYTQHRDTCKIRHNSRSLRANGCQK 301

Query:   249 VPSGDEQAL 257
               + D  +L
Sbjct:   302 PVNVDRDSL 310


>WB|WBGene00010204 [details] [associations]
            symbol:F57F5.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0006898
            "receptor-mediated endocytosis" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0009792 GO:GO:0006898
            GO:GO:0040007 GO:GO:0002119 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0040011
            EMBL:Z75953 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            PANTHER:PTHR12411:SF16 RefSeq:NP_506011.2 ProteinModelPortal:Q20950
            SMR:Q20950 DIP:DIP-24447N IntAct:Q20950 MINT:MINT-211137
            STRING:Q20950 MEROPS:C01.A42 EnsemblMetazoa:F57F5.1 GeneID:179645
            KEGG:cel:CELE_F57F5.1 UCSC:F57F5.1 CTD:179645 WormBase:F57F5.1
            OMA:ADDINAC Uniprot:Q20950
        Length = 351

 Score = 162 (62.1 bits), Expect = 1.2e-15, Sum P(2) = 1.2e-15
 Identities = 30/76 (39%), Positives = 43/76 (56%)

Query:   265 PVSIAIAAYSTEFQSYKEGIFNGVCGTQLD-HAVTIVGFGTTEDGANYWLIKNSWGNTWG 323
             PV +A   Y  +F+ Y  G++    G  L  HAV ++G+G  ++G  YWL  NSW   WG
Sbjct:   267 PVEVAFTVYE-DFEHYSGGVYVHTAGASLGGHAVKMLGWGV-DNGTPYWLCANSWNEDWG 324

Query:   324 DAGYMKIVRDEGLCGI 339
             + GY +I+R    CGI
Sbjct:   325 ENGYFRIIRGVNECGI 340

 Score = 98 (39.6 bits), Expect = 1.2e-15, Sum P(2) = 1.2e-15
 Identities = 39/171 (22%), Positives = 69/171 (40%)

Query:    60 DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR 119
             D +  E ++   +E ++Y+ K         +LG+  FS    D  +    G KM      
Sbjct:    26 DAIPVEAQMLRGQELVDYVNKVQTSFKA--ELGS-YFSSYP-DTIKKQLMGAKMVEIPEE 81

Query:   120 XXXXXXFKYQNLSMTDVPTSLD----WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKI 175
                     +  +    VP S D    W +  +++ I++Q  CG CWA +A   +     I
Sbjct:    82 YRVFE-MTHPEVEDAAVPDSFDSRTAWPNCPSISKIRDQSSCGSCWAVSAAETISDRICI 140

Query:   176 RSG--NLIQLSEQQL-LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
              S    ++ +S   +   C     NGC GG   +A+ + ++ +G  T   Y
Sbjct:   141 ASNAKTILSISADDINACCGMVCGNGCNGGYPIEAWRHYVK-KGYVTGGSY 190


>WB|WBGene00021070 [details] [associations]
            symbol:W07B8.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:FO081739 HSSP:P07688 PIR:T31730
            RefSeq:NP_503384.1 ProteinModelPortal:O16289 SMR:O16289
            EnsemblMetazoa:W07B8.1 GeneID:178613 KEGG:cel:CELE_W07B8.1
            UCSC:W07B8.1 CTD:178613 WormBase:W07B8.1 eggNOG:NOG245289
            InParanoid:O16289 OMA:TTGIYVH NextBio:901844 Uniprot:O16289
        Length = 335

 Score = 138 (53.6 bits), Expect = 1.9e-15, Sum P(2) = 1.9e-15
 Identities = 27/94 (28%), Positives = 47/94 (50%)

Query:   247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDH-AVTIVGFGTT 305
             +++P+   +     +   P+      Y  +F  Y  GI+  + G +  H +V I+G+G  
Sbjct:   232 DQLPNSQIEIQSDVMLNGPIQATFEVYD-DFLQYTTGIYVHLTGNKQGHLSVRIIGWGVW 290

Query:   306 EDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
             + G  YWL  NSWG  WG+ G  +++R    CG+
Sbjct:   291 Q-GVPYWLCANSWGRQWGENGTFRVLRGTNECGL 323

 Score = 123 (48.4 bits), Expect = 1.9e-15, Sum P(2) = 1.9e-15
 Identities = 41/115 (35%), Positives = 55/115 (47%)

Query:   126 FKYQNLSMT----DVPTSLD----WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS 177
             FK QN  ++    D+  S D    W +  ++  I +  EC   WAFAA  ++     I S
Sbjct:    62 FKIQNFGVSQANSDLSPSFDARERWPECMSIPQINDISECKTSWAFAAAESMSDRLCINS 121

Query:   178 G---NLIQLSEQQLLDCST---NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQ 226
             G   N I LS ++LL C T   +   GC GG+  KA+ YI Q  GI T   Y  Q
Sbjct:   122 GGFKNTI-LSAEELLSCCTGMFSCGEGCEGGNPFKAWQYI-QKHGIPTGGSYESQ 174


>UNIPROTKB|F1MW68 [details] [associations]
            symbol:CTSZ "Cathepsin Z" species:9913 "Bos taurus"
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0060441
            GeneTree:ENSGT00560000076599 IPI:IPI00708474 UniGene:Bt.4902
            OMA:QCGTCTE EMBL:DAAA02036315 PRIDE:F1MW68
            Ensembl:ENSBTAT00000025007 Uniprot:F1MW68
        Length = 304

 Score = 207 (77.9 bits), Expect = 3.6e-15, P = 3.6e-15
 Identities = 65/211 (30%), Positives = 100/211 (47%)

Query:   157 CGCCWAFAAVAAVEGITKI-RSGNLIQ--LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
             CG CWA  + +A+     I R G      LS Q +LDC   G+  C GG+    + Y   
Sbjct:    90 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVLDCGDAGS--CEGGNDLPVWEYA-H 146

Query:   214 NQGIATEDEYPYQAVP---------GTCSAAQKPAAAK------ISNYEEVPSGDEQALL 258
               GI  E    YQA           GTC+  ++    K      + +Y  + SG E+ + 
Sbjct:   147 RHGIPDETCNNYQAKDQECDKFNQCGTCTEFKECHVIKNYTLWKVGDYGSL-SGREKMMA 205

Query:   259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNS 317
             +  +  P+S  I A + +  +Y  GI++       ++H V++ G+G + DG  YW+++NS
Sbjct:   206 EIYTNGPISCGIMA-TEKMSNYTGGIYSEYNDQAFINHIVSVAGWGVS-DGMEYWIVRNS 263

Query:   318 WGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 348
             WG  WG+ G+M+IV      G G R  Y LA
Sbjct:   264 WGEPWGEHGWMRIVTSTYKGGEGAR--YNLA 292

 Score = 116 (45.9 bits), Expect = 0.00030, P = 0.00030
 Identities = 38/114 (33%), Positives = 53/114 (46%)

Query:   128 YQNLSMTDVPTSLDWRDKGAV---TPIKNQ---KECGCCWAFAAVAAVEGITKI-RSGNL 180
             ++ LS +D+P S DWR+   V   +  +NQ   + CG CWA  + +A+     I R G  
Sbjct:    55 HEYLSPSDLPKSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAW 114

Query:   181 IQ--LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC 232
                 LS Q +LDC   G+  C GG+    + Y     GI  E    YQA    C
Sbjct:   115 PSTLLSVQHVLDCGDAGS--CEGGNDLPVWEYA-HRHGIPDETCNNYQAKDQEC 165


>ZFIN|ZDB-GENE-060503-240 [details] [associations]
            symbol:tinagl1 "tubulointerstitial nephritis
            antigen-like 1" species:7955 "Danio rerio" [GO:0005044 "scavenger
            receptor activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0030414 "peptidase inhibitor activity" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0002040 "sprouting
            angiogenesis" evidence=IMP] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR008037 InterPro:IPR013128 Pfam:PF00112 Pfam:PF05375
            PROSITE:PS00524 PROSITE:PS50958 SMART:SM00201 SMART:SM00645
            ZFIN:ZDB-GENE-060503-240 GO:GO:0006955 GO:GO:0030247 GO:GO:0030414
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 GO:GO:0002040
            GO:GO:0005044 GeneTree:ENSGT00560000076599 GO:GO:0010466
            SUPFAM:SSF57283 HOVERGEN:HBG053961 MEROPS:C01.975 OMA:DNCNRCT
            EMBL:BX950864 IPI:IPI00609339 UniGene:Dr.103937
            Ensembl:ENSDART00000087096 Ensembl:ENSDART00000126228
            InParanoid:Q1LUC6 Uniprot:Q1LUC6
        Length = 471

 Score = 146 (56.5 bits), Expect = 4.5e-15, Sum P(2) = 4.5e-15
 Identities = 51/186 (27%), Positives = 82/186 (44%)

Query:    76 EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTD 135
             + I++ N+          +QF  +T DE      G K P+   R          N++  D
Sbjct:   142 DMIQEINRRDYGWRAANYSQFWGMTLDEGLRFRLGTKRPT---RTIMNMNEMQMNMNGND 198

Query:   136 -VPTSLDWRDK--GAVTPIKNQKECGCCWAFAAVAAVEGITKIRS-GNLI-QLSEQQLLD 190
              +P+  +  DK  G +    +Q  C   WAF+  A       I+S G++  QLS Q L+ 
Sbjct:   199 HLPSYFNAVDKWPGKIHEPLDQGNCNASWAFSTAAVASDRISIQSMGHMTPQLSPQNLIS 258

Query:   191 CSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVP 250
             C T   +GC GG  + A+ + ++ +G+ T+D YP+   P   SA +   A  +     V 
Sbjct:   259 CDTRHQDGCAGGRIDGAW-WFMRRRGVVTQDCYPFS--PPEQSAVE--VARCMMQSRAVG 313

Query:   251 SGDEQA 256
              G  QA
Sbjct:   314 RGKRQA 319

 Score = 116 (45.9 bits), Expect = 4.5e-15, Sum P(2) = 4.5e-15
 Identities = 36/105 (34%), Positives = 48/105 (45%)

Query:   251 SGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGIF-----NGVCGTQL----DHAVTIV 300
             S +E  ++K +    PV  AI     +F  YK GIF     N    +Q      H+V I 
Sbjct:   343 STNENEIMKEIMDNGPVQ-AIMEVHEDFFVYKSGIFRHTDVNYHKPSQYRKHATHSVRIT 401

Query:   301 GFGTTEDGAN----YWLIKNSWGNTWGDAGYMKIVRDEGLCGIGT 341
             G+G   D +     YW+  NSWG  WG+ GY +I R    C I T
Sbjct:   402 GWGEERDYSGRTRKYWIGANSWGKNWGEDGYFRIARGVNECDIET 446


>WB|WBGene00009158 [details] [associations]
            symbol:F26E4.3 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005576
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005044
            GeneTree:ENSGT00560000076599 HSSP:P07711 EMBL:Z81070
            eggNOG:NOG310046 HOGENOM:HOG000241342 OMA:DNCNRCT PIR:T21421
            RefSeq:NP_492593.2 ProteinModelPortal:P90850 SMR:P90850
            PaxDb:P90850 EnsemblMetazoa:F26E4.3.1 EnsemblMetazoa:F26E4.3.2
            GeneID:172827 KEGG:cel:CELE_F26E4.3 UCSC:F26E4.3.1 CTD:172827
            WormBase:F26E4.3 InParanoid:P90850 NextBio:877161 Uniprot:P90850
        Length = 452

 Score = 135 (52.6 bits), Expect = 6.7e-15, Sum P(2) = 6.7e-15
 Identities = 34/106 (32%), Positives = 55/106 (51%)

Query:   135 DVPTSLDWRDKGA--VTPIKNQKECGCCWAFAAVA-AVEGITKIRSGNLIQ-LSEQQLLD 190
             ++P   D RDK    + P+ +Q +CG  W+ +  A + + +  I  G +   LS QQLL 
Sbjct:   183 ELPEHFDARDKWGPLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLLS 242

Query:   191 CSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA----VPGTC 232
             C+ +   GC GG  ++A+ YI +  G+  +  YPY +     PG C
Sbjct:   243 CNQHRQKGCEGGYLDRAWWYI-RKLGVVGDHCYPYVSGQSREPGHC 287

 Score = 126 (49.4 bits), Expect = 6.7e-15, Sum P(2) = 6.7e-15
 Identities = 34/119 (28%), Positives = 54/119 (45%)

Query:   233 SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN------ 286
             S +Q   A K++   +V S +E    + ++  PV      +  +F  Y  G++       
Sbjct:   304 SGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHE-DFFMYAGGVYQHSDLAA 362

Query:   287 --GVCGTQLD-HAVTIVGFG---TTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
               G        H+V ++G+G   +T     YWL  NSWG  WG+ GY K++R E  C I
Sbjct:   363 QKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGYFKVLRGENHCEI 421


>UNIPROTKB|A5GFX7 [details] [associations]
            symbol:CTSZ "Cathepsin Z" species:9823 "Sus scrofa"
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GO:GO:0060441 GeneTree:ENSGT00560000076599 MEROPS:C01.013 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN
            OMA:QCGTCTE EMBL:CR956646 RefSeq:NP_001116576.1 UniGene:Ssc.16769
            ProteinModelPortal:A5GFX7 SMR:A5GFX7 STRING:A5GFX7
            Ensembl:ENSSSCT00000008249 GeneID:100141405 KEGG:ssc:100141405
            ArrayExpress:A5GFX7 Uniprot:A5GFX7
        Length = 304

 Score = 205 (77.2 bits), Expect = 7.0e-15, P = 7.0e-15
 Identities = 65/211 (30%), Positives = 96/211 (45%)

Query:   157 CGCCWAFAAVAAVEGITKI-RSGNLIQ--LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
             CG CWA  + +A+     I R G      LS Q ++DC   G+  C GG     +AY   
Sbjct:    90 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGNAGS--CEGGDDLPVWAYA-H 146

Query:   214 NQGIATEDEYPYQAVP---------GTCS------AAQKPAAAKISNYEEVPSGDEQALL 258
               GI  E    YQA           GTC+        Q     K+ +Y  V SG E+ + 
Sbjct:   147 RHGIPDETCNNYQAKDQVCDKFNQCGTCTEFKECHVIQNYTLWKVGDYGSV-SGREKMMA 205

Query:   259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCG-TQLDHAVTIVGFGTTEDGANYWLIKNS 317
             +  +  P+S  I A + +  +Y  GI+        ++H V++ G+G +  G  YW+++NS
Sbjct:   206 EIYANGPISCGIMA-TEKMSNYTGGIYAEYKDQAYINHIVSVAGWGVS-GGTEYWIVRNS 263

Query:   318 WGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 348
             WG  WG+ G+M+IV      G G    Y LA
Sbjct:   264 WGEPWGERGWMRIVTSTYKDGRGAH--YNLA 292

 Score = 120 (47.3 bits), Expect = 0.00011, P = 0.00011
 Identities = 38/114 (33%), Positives = 53/114 (46%)

Query:   128 YQNLSMTDVPTSLDWRDKGAV---TPIKNQ---KECGCCWAFAAVAAVEGITKI-RSGNL 180
             ++ LS +D+P S DWR+   V   +  +NQ   + CG CWA  + +A+     I R G  
Sbjct:    55 HEYLSPSDLPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAW 114

Query:   181 IQ--LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC 232
                 LS Q ++DC   G+  C GG     +AY     GI  E    YQA    C
Sbjct:   115 PSTLLSVQHVIDCGNAGS--CEGGDDLPVWAYA-HRHGIPDETCNNYQAKDQVC 165


>UNIPROTKB|P05689 [details] [associations]
            symbol:CTSZ "Cathepsin Z" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:BC122603
            EMBL:X01809 IPI:IPI00708474 PIR:A29172 RefSeq:NP_001071303.1
            UniGene:Bt.4902 ProteinModelPortal:P05689 SMR:P05689 MEROPS:C01.013
            PRIDE:P05689 GeneID:404187 KEGG:bta:404187 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 InParanoid:P05689 KO:K08568
            OrthoDB:EOG42Z4QN BRENDA:3.4.18.1 NextBio:20817615 Uniprot:P05689
        Length = 304

 Score = 205 (77.2 bits), Expect = 7.0e-15, P = 7.0e-15
 Identities = 64/211 (30%), Positives = 100/211 (47%)

Query:   157 CGCCWAFAAVAAVEGITKI-RSGNLIQ--LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
             CG CWA  + +A+     I R G      LS Q ++DC   G+  C GG+    + Y   
Sbjct:    90 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGDAGS--CEGGNDLPVWEYA-H 146

Query:   214 NQGIATEDEYPYQAVP---------GTCSAAQKPAAAK------ISNYEEVPSGDEQALL 258
               GI  E    YQA           GTC+  ++    K      + +Y  + SG E+ + 
Sbjct:   147 RHGIPDETCNNYQAKDQECDKFNQCGTCTEFKECHVIKNYTLWKVGDYGSL-SGREKMMA 205

Query:   259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNS 317
             +  +  P+S  I A + +  +Y  GI++       ++H V++ G+G + DG  YW+++NS
Sbjct:   206 EIYTNGPISCGIMA-TEKMSNYTGGIYSEYNDQAFINHIVSVAGWGVS-DGMEYWIVRNS 263

Query:   318 WGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 348
             WG  WG+ G+M+IV      G G R  Y LA
Sbjct:   264 WGEPWGEHGWMRIVTSTYKGGEGAR--YNLA 292

 Score = 114 (45.2 bits), Expect = 0.00051, P = 0.00051
 Identities = 37/114 (32%), Positives = 53/114 (46%)

Query:   128 YQNLSMTDVPTSLDWRDKGAV---TPIKNQ---KECGCCWAFAAVAAVEGITKI-RSGNL 180
             ++ LS +D+P S DWR+   V   +  +NQ   + CG CWA  + +A+     I R G  
Sbjct:    55 HEYLSPSDLPKSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAW 114

Query:   181 IQ--LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC 232
                 LS Q ++DC   G+  C GG+    + Y     GI  E    YQA    C
Sbjct:   115 PSTLLSVQHVIDCGDAGS--CEGGNDLPVWEYA-HRHGIPDETCNNYQAKDQEC 165


>UNIPROTKB|E1C4M3 [details] [associations]
            symbol:CTSZ "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0060441 "epithelial tube branching
            involved in lung morphogenesis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GeneTree:ENSGT00560000076599 CTD:1522 KO:K08568 OMA:QCGTCTE
            EMBL:AADN02019004 IPI:IPI00596430 RefSeq:XP_417483.3
            Ensembl:ENSGALT00000012067 GeneID:419311 KEGG:gga:419311
            Uniprot:E1C4M3
        Length = 305

 Score = 203 (76.5 bits), Expect = 1.3e-14, P = 1.3e-14
 Identities = 61/211 (28%), Positives = 99/211 (46%)

Query:   157 CGCCWAFAAVAAVEGITKI-RSGNL--IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
             CG CWA  + +A+     I R G      LS Q ++DC+  G+  C GG     + Y   
Sbjct:    90 CGSCWAHGSTSALADRINIKRKGAWPSAYLSVQNVIDCANAGS--CEGGDHTGVWMYA-H 146

Query:   214 NQGIATEDEYPYQAVP---------------GTCSAAQKPAAAKISNYEEVPSGDEQALL 258
             + GI  E    YQA                 G C   +     K+++Y  V SG E+ + 
Sbjct:   147 DHGIPDETCNNYQAKNQKCKKFNQCGTCVTFGECHVIKNYTLWKVADYGAV-SGREKMMA 205

Query:   259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNS 317
             +  +  P+S  I A + +  +Y  G++     +  ++H V++ G+G  E+G  YW+++NS
Sbjct:   206 EIYANGPISCGIMA-TEKLDAYTGGLYTEYNPSPTVNHIVSVAGWGV-ENGTEYWIVRNS 263

Query:   318 WGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 348
             WG  WG+ G+++IV      G G  + Y LA
Sbjct:   264 WGEPWGERGWLRIVTSAYKGGRG--AEYNLA 292

 Score = 118 (46.6 bits), Expect = 0.00018, P = 0.00018
 Identities = 36/114 (31%), Positives = 53/114 (46%)

Query:   128 YQNLSMTDVPTSLDWRDKGAV---TPIKNQ---KECGCCWAFAAVAAVEGITKI-RSGNL 180
             ++ L M ++P S DWR+   V   +  +NQ   + CG CWA  + +A+     I R G  
Sbjct:    55 HEYLDMAELPQSWDWRNVNGVNYASTTRNQHIPQYCGSCWAHGSTSALADRINIKRKGAW 114

Query:   181 --IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC 232
                 LS Q ++DC+  G+  C GG     + Y   + GI  E    YQA    C
Sbjct:   115 PSAYLSVQNVIDCANAGS--CEGGDHTGVWMYA-HDHGIPDETCNNYQAKNQKC 165


>WB|WBGene00022026 [details] [associations]
            symbol:Y65B4A.2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 SMART:SM00645 GO:GO:0008234
            GO:GO:0006508 PANTHER:PTHR12411 GeneTree:ENSGT00560000076599
            PANTHER:PTHR12411:SF16 HSSP:P07688 EMBL:FO081482 RefSeq:NP_490763.1
            ProteinModelPortal:Q9BL59 MEROPS:C01.A46 PaxDb:Q9BL59
            EnsemblMetazoa:Y65B4A.2.1 EnsemblMetazoa:Y65B4A.2.2 GeneID:171655
            KEGG:cel:CELE_Y65B4A.2 UCSC:Y65B4A.2 CTD:171655 WormBase:Y65B4A.2
            eggNOG:NOG311760 HOGENOM:HOG000017674 InParanoid:Q9BL59 OMA:DRIVYWH
            NextBio:872169 Uniprot:Q9BL59
        Length = 421

 Score = 130 (50.8 bits), Expect = 2.0e-14, Sum P(2) = 2.0e-14
 Identities = 28/76 (36%), Positives = 39/76 (51%)

Query:   265 PVSIAIAAYSTEFQSYKEGIFNGVCGTQLD------HAVTIVGFGTTEDGANYWLIKNSW 318
             P ++A      EF  Y  G+F        D      H V ++G+G ++DG +YWL  NS+
Sbjct:   334 PTTMAFPV-PEEFLHYSSGVFRPYPTDGFDDRIVYWHVVRLIGWGESDDGTHYWLAVNSF 392

Query:   319 GNTWGDAGYMKIVRDE 334
             GN WGD G  KI  D+
Sbjct:   393 GNHWGDNGLFKINTDD 408

 Score = 126 (49.4 bits), Expect = 2.0e-14, Sum P(2) = 2.0e-14
 Identities = 58/211 (27%), Positives = 91/211 (43%)

Query:    21 IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
             +I LL++    V  S   + + V + ++K      R   + L K +R         +  K
Sbjct:    32 VILLLLAVLGLVYGSFYLYRRYVTDANDK------RDNDEYLRKLVRQVNDSPETTWKAK 85

Query:    81 ANKEG--NRTY--KLGTNQFS-DLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTD 135
              NK G  NR+Y  K   NQ + +   ++ R  +    M    H        + +N + +D
Sbjct:    86 FNKFGVKNRSYGFKYTRNQTAVEEYVEQIRKFFESDAMKR--HLD------ELENFNSSD 137

Query:   136 VPTSLDWRDKG----AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ--LSEQQLL 189
             VP + D R K     +++ + NQ  CG C+A AA         I S    +  LSE+ ++
Sbjct:   138 VPKNFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGTFKSLLSEEDII 197

Query:   190 DC-STNGNNGCLGGSREKAFAYIIQNQGIAT 219
              C S  GN  C GG   KA  Y + NQG+ T
Sbjct:   198 GCCSVCGN--CYGGDPLKALTYWV-NQGLVT 225


>UNIPROTKB|E1BTI7 [details] [associations]
            symbol:TINAG "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0005044 "scavenger receptor activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0006955 "immune
            response" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0030247 "polysaccharide binding"
            evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
            [GO:0007155 "cell adhesion" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GO:GO:0007155 GO:GO:0005604 GO:GO:0005044
            GeneTree:ENSGT00560000076599 CTD:27283 OMA:WGQLTSS
            EMBL:AADN02002720 EMBL:AADN02002721 IPI:IPI00581566
            RefSeq:XP_419905.3 UniGene:Gga.11215 Ensembl:ENSGALT00000026295
            GeneID:421888 KEGG:gga:421888 Uniprot:E1BTI7
        Length = 467

 Score = 135 (52.6 bits), Expect = 8.1e-14, Sum P(2) = 8.1e-14
 Identities = 37/136 (27%), Positives = 67/136 (49%)

Query:   213 QNQGIATEDEYPYQAVPGTCSAAQKPAAA--KISNYEEVPSGDEQALLKAVSMQPVSIAI 270
             +NQ   +  EY      G C  A + +    +  ++  V S +   + + ++  PV   +
Sbjct:   325 ENQCYVSS-EYGKNHTNGPCPNALEDSNRLYRCGSHYRVSSKETDIMEEIMAKGPVQAIM 383

Query:   271 AAYSTEFQSYKEGIFNGV--CGTQLD-HAVTIVGFGTT--EDGAN--YWLIKNSWGNTWG 323
               Y  +F  YKEGI+      G++   H+V ++G+G+   ++G    +W+  NSWG  WG
Sbjct:   384 KVYE-DFFLYKEGIYRHSYKAGSKWKTHSVKLLGWGSLPGKNGQKQKFWIAANSWGKYWG 442

Query:   324 DAGYMKIVRDEGLCGI 339
             + GY +I+R +  C I
Sbjct:   443 ENGYFRILRGQNECDI 458

 Score = 116 (45.9 bits), Expect = 8.1e-14, Sum P(2) = 8.1e-14
 Identities = 27/74 (36%), Positives = 39/74 (52%)

Query:   153 NQKECGCCWAFA-AVAAVEGITKIRSGNLIQ-LSEQQLLDCSTNGNNGCLGGSREKAFAY 210
             +Q+ CG  WAF+ A  A + IT    G +   LS Q L+ C T    GC GGS + A+ Y
Sbjct:   241 DQRNCGASWAFSTASVAADRITIHSDGQITDNLSVQNLISCDTGNQRGCNGGSIDGAWRY 300

Query:   211 IIQNQGIATEDEYP 224
             +  + G+ +   YP
Sbjct:   301 LTTH-GVVSYACYP 313


>TAIR|locus:2060420 [details] [associations]
            symbol:AT2G22160 "AT2G22160" species:3702 "Arabidopsis
            thaliana" [GO:0005575 "cellular_component" evidence=ND] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] EMBL:CP002685
            GenomeReviews:CT485783_GR InterPro:IPR013201 Pfam:PF08246
            SMART:SM00848 EMBL:AC007168 IPI:IPI00544896 PIR:F84609
            RefSeq:NP_179806.1 UniGene:At.66231 HSSP:P25774
            ProteinModelPortal:Q9SIE8 SMR:Q9SIE8 EnsemblPlants:AT2G22160.1
            GeneID:816750 KEGG:ath:AT2G22160 TAIR:At2g22160 eggNOG:NOG297278
            InParanoid:Q9SIE8 OMA:HRCITLA PhylomeDB:Q9SIE8 ArrayExpress:Q9SIE8
            Genevestigator:Q9SIE8 Uniprot:Q9SIE8
        Length = 105

 Score = 180 (68.4 bits), Expect = 1.8e-13, P = 1.8e-13
 Identities = 41/92 (44%), Positives = 56/92 (60%)

Query:    63 EKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRXXX 122
             + E    +FK+N EYI K NKE  + YKL  N+F++LT+ EF   +T + M S   +   
Sbjct:    10 QTESSFDVFKKNAEYIVKTNKE-RKPYKLKLNKFANLTDVEFVNAHTCFDM-SDHKKILD 67

Query:   123 XXXFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
                F Y+N  MT  P SLDWR+KGAVT +K+Q
Sbjct:    68 SKPFFYEN--MTQAPDSLDWREKGAVTNVKDQ 97


>WB|WBGene00000781 [details] [associations]
            symbol:cpr-1 species:6239 "Caenorhabditis elegans"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008340 "determination
            of adult lifespan" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008340 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            eggNOG:NOG315657 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            KO:K01363 PANTHER:PTHR12411:SF16 EMBL:M74797 EMBL:Z78012 PIR:T20148
            RefSeq:NP_506002.2 ProteinModelPortal:P25807 SMR:P25807
            DIP:DIP-25619N MINT:MINT-1058393 STRING:P25807 MEROPS:C01.A32
            PaxDb:P25807 EnsemblMetazoa:C52E4.1 GeneID:179637
            KEGG:cel:CELE_C52E4.1 UCSC:C52E4.1 CTD:179637 WormBase:C52E4.1
            InParanoid:P25807 OMA:CSLSCQS NextBio:906250 Uniprot:P25807
        Length = 329

 Score = 192 (72.6 bits), Expect = 5.5e-13, P = 5.5e-13
 Identities = 42/115 (36%), Positives = 60/115 (52%)

Query:   231 TCSAAQKPAAAK-----ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF 285
             +C +    A AK     +S Y  VP        +  +  PV  A + Y  +F  YK G++
Sbjct:   207 SCQSGYSTAYAKDKHFGVSAYA-VPKNAASIQAEIYANGPVEAAFSVYE-DFYKYKSGVY 264

Query:   286 NGVCGTQLD-HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGI 339
                 G  L  HA+ I+G+GT E G+ YWL+ NSWG  WG++G+ KI R +  CGI
Sbjct:   265 KHTAGKYLGGHAIKIIGWGT-ESGSPYWLVANSWGVNWGESGFFKIYRGDDQCGI 318

 Score = 129 (50.5 bits), Expect = 1.2e-05, P = 1.2e-05
 Identities = 42/162 (25%), Positives = 72/162 (44%)

Query:   129 QNLSMTDVPTSLD----WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ-- 182
             Q + +  VP + D    W +  ++  I++Q  CG CWAF A   +   T I +    Q  
Sbjct:    78 QEVVLASVPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPI 137

Query:   183 LSEQQLLDC-STNGNNGCLGGSREKAFAYIIQNQGIATEDEY------PYQAVP---GTC 232
             +S   LL C  ++  NGC GG   +A  +   ++G+ T  +Y      PY   P   G C
Sbjct:   138 ISPDDLLSCCGSSCGNGCEGGYPIQALRWW-DSKGVVTGGDYHGAGCKPYPIAPCTSGNC 196

Query:   233 SAAQKPAAAKI--SNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
               ++ P+ +    S Y    + D+   + A ++   + +I A
Sbjct:   197 PESKTPSCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQA 238


>RGD|70956 [details] [associations]
            symbol:Tinagl1 "tubulointerstitial nephritis antigen-like 1"
           species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
           activity" evidence=IEA] [GO:0005576 "extracellular region"
           evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA;ISO] [GO:0006508
           "proteolysis" evidence=IEA] [GO:0006955 "immune response"
           evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
           evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
           [GO:0031012 "extracellular matrix" evidence=IEA;ISO] [GO:0043236
           "laminin binding" evidence=IEA;ISO] InterPro:IPR000668
           InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
           PROSITE:PS50958 SMART:SM00201 SMART:SM00645 RGD:70956 GO:GO:0005737
           GO:GO:0005576 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
           GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
           GO:GO:0031012 GO:GO:0005044 eggNOG:NOG310046 HOGENOM:HOG000241342
           HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OrthoDB:EOG4BG8W0
           EMBL:AB050717 IPI:IPI00190428 RefSeq:NP_446034.1 UniGene:Rn.1256
           ProteinModelPortal:Q9EQT5 PRIDE:Q9EQT5 GeneID:94174 KEGG:rno:94174
           UCSC:RGD:70956 InParanoid:Q9EQT5 NextBio:617830 ArrayExpress:Q9EQT5
           Genevestigator:Q9EQT5 GermOnline:ENSRNOG00000013179 Uniprot:Q9EQT5
        Length = 467

 Score = 127 (49.8 bits), Expect = 1.4e-12, Sum P(2) = 1.4e-12
 Identities = 45/165 (27%), Positives = 71/165 (43%)

Query:    80 KANKEGNRTYKLGTNQ-FSDLTNDEFRALYTGYKMPSPSHRXXXXXXFKYQNLSMTDV-P 137
             KA   GN  ++ G +  F  +T DE      G   PS S          Y  L   +V P
Sbjct:   147 KAINRGNYGWQAGNHSAFWGMTLDEGIRYRLGTIRPSSSVMNMNEI---YTVLGQGEVLP 203

Query:   138 TSLDWRDK--GAVTPIKNQKECGCCWAFAAVAAVEGITKIRS-GNLIQ-LSEQQLLDCST 193
             T+ +  +K    +    +Q  C   WAF+  A       I S G++   LS Q LL C T
Sbjct:   204 TAFEASEKWPNLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSCDT 263

Query:   194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP 238
             +   GC GG  + A+ + ++ +G+ +++ YP+        A+  P
Sbjct:   264 HHQKGCRGGRLDGAW-WFLRRRGVVSDNCYPFSGREQNDEASPTP 307

 Score = 113 (44.8 bits), Expect = 1.4e-12, Sum P(2) = 1.4e-12
 Identities = 34/105 (32%), Positives = 50/105 (47%)

Query:   251 SGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGIFNGVCGTQL---------DHAVTIV 300
             + DE+ ++K +    PV  A+     +F  Y+ GI++    +Q           H+V I 
Sbjct:   347 ASDEKEIMKELMENGPVQ-ALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKIT 405

Query:   301 GFG--TTEDGAN--YWLIKNSWGNTWGDAGYMKIVRDEGLCGIGT 341
             G+G  T  DG    YW   NSWG  WG+ G+ +IVR    C I T
Sbjct:   406 GWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGINECDIET 450

WARNING:  HSPs involving 48 database sequences were not reported due to the
          limiting value of parameter B = 250.


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.316   0.131   0.392    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      348       342   0.00096  116 3  11 22  0.42    34
                                                     33  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  298
  No. of states in DFA:  616 (65 KB)
  Total size of DFA:  245 KB (2132 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  28.95u 0.12s 29.07t   Elapsed:  00:00:01
  Total cpu time:  29.00u 0.12s 29.12t   Elapsed:  00:00:01
  Start:  Tue May 21 02:28:02 2013   End:  Tue May 21 02:28:03 2013
WARNINGS ISSUED:  2

Back to top