BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>017419
MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG
KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK
RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI
VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS
RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD
HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS
QNSAKPKPHSSA

High Scoring Gene Products

Symbol, full name Information P value
RD21B
esponsive to dehydration 21B
protein from Arabidopsis thaliana 2.7e-123
RD21A
responsive to dehydration 21A
protein from Arabidopsis thaliana 4.6e-121
AT3G19390 protein from Arabidopsis thaliana 4.4e-116
AT3G19400 protein from Arabidopsis thaliana 4.0e-106
AT4G23520 protein from Arabidopsis thaliana 6.9e-102
XBCP3
xylem bark cysteine peptidase 3
protein from Arabidopsis thaliana 2.4e-92
CEP1
cysteine endopeptidase 1
protein from Arabidopsis thaliana 2.7e-91
XCP1
xylem cysteine peptidase 1
protein from Arabidopsis thaliana 5.7e-91
XCP2
AT1G20850
protein from Arabidopsis thaliana 4.0e-90
CP1
cysteine protease 1
protein from Arabidopsis thaliana 1.4e-89
CP2
cysteine protease 2
protein from Arabidopsis thaliana 1.1e-87
AT3G43960 protein from Arabidopsis thaliana 6.0e-87
CEP3
cysteine endopeptidase 3
protein from Arabidopsis thaliana 3.0e-85
SAG12
senescence-associated gene 12
protein from Arabidopsis thaliana 1.2e-83
AT1G06260 protein from Arabidopsis thaliana 3.9e-83
AT2G27420 protein from Arabidopsis thaliana 8.1e-76
AT3G49340 protein from Arabidopsis thaliana 2.0e-72
AT2G34080 protein from Arabidopsis thaliana 1.8e-71
AT1G29080 protein from Arabidopsis thaliana 2.1e-68
AT1G29090 protein from Arabidopsis thaliana 3.5e-68
CTSL2
Uncharacterized protein
protein from Gallus gallus 9.4e-66
cprC
cysteine proteinase 3
gene from Dictyostelium discoideum 3.2e-65
ctsl1a
cathepsin L, 1 a
gene_product from Danio rerio 2.3e-64
ctsl.1
cathepsin L.1
gene_product from Danio rerio 2.9e-64
CTSL1
Cathepsin L1
protein from Sus scrofa 7.6e-64
CTSL1
Cathepsin L1
protein from Canis lupus familiaris 1.6e-63
CTSL2
Cathepsin L2
protein from Homo sapiens 5.4e-63
CTSL1
Cathepsin L1
protein from Bos taurus 6.9e-63
Cp1
Cysteine proteinase-1
protein from Drosophila melanogaster 1.1e-62
wu:fb37b09 gene_product from Danio rerio 1.1e-62
Ssc.54235
Uncharacterized protein
protein from Sus scrofa 1.4e-62
zgc:174855 gene_product from Danio rerio 1.8e-62
CTSL1
Cathepsin L1
protein from Homo sapiens 2.3e-62
ctssb.2
cathepsin S, b.2
gene_product from Danio rerio 3.8e-62
CTSL2
Cathepsin L2
protein from Bos taurus 6.2e-62
P83654
Ervatamin-C
protein from Tabernaemontana divaricata 6.2e-62
Ctsl
cathepsin L
protein from Mus musculus 7.9e-62
Ctsl1
cathepsin L1
gene from Rattus norvegicus 1.3e-61
cpl-1 gene from Caenorhabditis elegans 1.6e-61
cprD
cysteine proteinase 4
gene from Dictyostelium discoideum 2.0e-61
Ctss
cathepsin S
protein from Mus musculus 4.3e-61
Ctsll3
cathepsin L-like 3
gene from Rattus norvegicus 7.1e-61
ctsl1b
cathepsin L, 1 b
gene_product from Danio rerio 7.1e-61
zgc:174153 gene_product from Danio rerio 9.0e-61
cprF
cysteine proteinase 6
gene from Dictyostelium discoideum 1.8e-60
cprB
cysteine proteinase 2
gene from Dictyostelium discoideum 3.8e-60
AT1G29110 protein from Arabidopsis thaliana 1.0e-59
ctsll
cathepsin L, like
gene_product from Danio rerio 1.0e-59
Cys
Crustapain
protein from Pandalus borealis 1.7e-59
CTSL1
CTSL1 protein
protein from Bos taurus 4.5e-59
CTSS
Uncharacterized protein
protein from Sus scrofa 7.3e-59
RGD1308751
similar to Cathepsin L precursor (Major excreted protein) (MEP)
gene from Rattus norvegicus 7.3e-59
Ctsk
cathepsin K
gene from Rattus norvegicus 1.2e-58
CTSK
Cathepsin K
protein from Homo sapiens 5.1e-58
CTSK
Cathepsin K
protein from Bos taurus 6.6e-58
CTSS
Cathepsin S
protein from Canis lupus familiaris 6.6e-58
CTSS
Cathepsin S
protein from Canis lupus familiaris 8.4e-58
ALP
aleurain-like protease
protein from Arabidopsis thaliana 8.4e-58
ctssb.1
cathepsin S, b.1
gene_product from Danio rerio 1.4e-57
cfaD
peptidase C1A family protein
gene from Dictyostelium discoideum 3.6e-57
CTSL2
Uncharacterized protein
protein from Gallus gallus 3.6e-57
Ctsk
cathepsin K
protein from Mus musculus 5.9e-57
AT3G45310 protein from Arabidopsis thaliana 5.9e-57
Ctsh
cathepsin H
protein from Mus musculus 9.6e-57
CTSL1
Cathepsin L1
protein from Gallus gallus 1.2e-56
CTSS
Cathepsin S
protein from Bos taurus 2.5e-56
Ctsh
cathepsin H
gene from Rattus norvegicus 3.3e-56
ctsk
cathepsin K
gene_product from Danio rerio 3.3e-56
CTSK
Cathepsin K
protein from Sus scrofa 5.3e-56
CTSH
Uncharacterized protein
protein from Callithrix jacchus 5.3e-56
CTSH
Uncharacterized protein
protein from Callithrix jacchus 5.3e-56
Ctss
cathepsin S
gene from Rattus norvegicus 5.3e-56
CTSK
Cathepsin K
protein from Canis lupus familiaris 8.6e-56
CTSK
Cathepsin K
protein from Canis lupus familiaris 8.6e-56
cprH
cysteine proteinase 8
gene from Dictyostelium discoideum 1.4e-55
CTSS
Cathepsin S
protein from Homo sapiens 1.4e-55
D3ZZR3
Uncharacterized protein
protein from Rattus norvegicus 1.4e-55
Cat-1
Cathepsin L-like proteinase
protein from Fasciola hepatica 1.8e-55
CTSH
Uncharacterized protein
protein from Macaca mulatta 2.9e-55
cprE
cysteine proteinase 5
gene from Dictyostelium discoideum 4.8e-55
DDB_G0272298 gene from Dictyostelium discoideum 6.1e-55
ctsh
cathepsin H
gene_product from Danio rerio 6.1e-55
CTSH
Uncharacterized protein
protein from Nomascus leucogenys 9.9e-55
CTSH
Uncharacterized protein
protein from Oryctolagus cuniculus 9.9e-55
CTSL1
Cathepsin L1
protein from Canis lupus familiaris 1.3e-54
CTSH
Pro-cathepsin H
protein from Homo sapiens 1.3e-54
CTSH
Uncharacterized protein
protein from Gorilla gorilla gorilla 1.6e-54
CTSH
Pro-cathepsin H
protein from Sus scrofa 2.1e-54
RD19
RESPONSIVE TO DEHYDRATION 19
protein from Arabidopsis thaliana 1.9e-53
CTSH
Pro-cathepsin H
protein from Bos taurus 3.0e-53
CTSL
Cathepsin L1
protein from Ovis aries 3.0e-53
CTSH
Uncharacterized protein
protein from Ailuropoda melanoleuca 3.8e-53
CTSH
Uncharacterized protein
protein from Equus caballus 6.3e-53
LOC100662496
Uncharacterized protein
protein from Loxodonta africana 8.0e-53
J9P7C5
Uncharacterized protein
protein from Canis lupus familiaris 2.7e-52
ctskl
cathepsin K, like
gene_product from Danio rerio 3.5e-52
CTSK
Cathepsin K
protein from Gallus gallus 4.4e-52
P83443
Macrodontain-1
protein from Pseudananas sagenarius 4.4e-52
LOC420160
Uncharacterized protein
protein from Gallus gallus 7.2e-52
ctsf
cathepsin F
gene_product from Danio rerio 7.2e-52

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  017419
        (372 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2167821 - symbol:RD21B "esponsive to dehydrati...  1212  2.7e-123  1
TAIR|locus:2825832 - symbol:RD21A "responsive to dehydrat...  1191  4.6e-121  1
TAIR|locus:2090614 - symbol:AT3G19390 species:3702 "Arabi...  1144  4.4e-116  1
TAIR|locus:2090629 - symbol:AT3G19400 species:3702 "Arabi...  1050  4.0e-106  1
TAIR|locus:2117979 - symbol:AT4G23520 species:3702 "Arabi...  1010  6.9e-102  1
TAIR|locus:2024362 - symbol:XBCP3 "xylem bark cysteine pe...   920  2.4e-92   1
TAIR|locus:2157712 - symbol:CEP1 "cysteine endopeptidase ...   910  2.7e-91   1
TAIR|locus:2122113 - symbol:XCP1 "xylem cysteine peptidas...   907  5.7e-91   1
TAIR|locus:2030427 - symbol:XCP2 "xylem cysteine peptidas...   899  4.0e-90   1
TAIR|locus:2128243 - symbol:AT4G11310 species:3702 "Arabi...   894  1.4e-89   1
TAIR|locus:2128253 - symbol:AT4G11320 species:3702 "Arabi...   876  1.1e-87   1
TAIR|locus:2097104 - symbol:AT3G43960 species:3702 "Arabi...   869  6.0e-87   1
TAIR|locus:505006391 - symbol:CEP3 "cysteine endopeptidas...   853  3.0e-85   1
TAIR|locus:2152445 - symbol:SAG12 "senescence-associated ...   838  1.2e-83   1
TAIR|locus:2038515 - symbol:AT1G06260 species:3702 "Arabi...   833  3.9e-83   1
TAIR|locus:2038588 - symbol:AT2G27420 species:3702 "Arabi...   764  8.1e-76   1
TAIR|locus:2082881 - symbol:AT3G49340 species:3702 "Arabi...   732  2.0e-72   1
TAIR|locus:2055440 - symbol:AT2G34080 species:3702 "Arabi...   723  1.8e-71   1
TAIR|locus:2029934 - symbol:AT1G29080 species:3702 "Arabi...   694  2.1e-68   1
TAIR|locus:2029924 - symbol:AT1G29090 species:3702 "Arabi...   692  3.5e-68   1
UNIPROTKB|F1NYJ1 - symbol:CTSL2 "Uncharacterized protein"...   669  9.4e-66   1
DICTYBASE|DDB_G0283867 - symbol:cprC "cysteine proteinase...   664  3.2e-65   1
ZFIN|ZDB-GENE-030131-106 - symbol:ctsl1a "cathepsin L, 1 ...   656  2.3e-64   1
ZFIN|ZDB-GENE-040718-61 - symbol:ctsl.1 "cathepsin L.1" s...   655  2.9e-64   1
UNIPROTKB|Q28944 - symbol:CTSL1 "Cathepsin L1" species:98...   651  7.6e-64   1
UNIPROTKB|Q9GL24 - symbol:CTSL1 "Cathepsin L1" species:96...   648  1.6e-63   1
UNIPROTKB|O60911 - symbol:CTSL2 "Cathepsin L2" species:96...   643  5.4e-63   1
UNIPROTKB|P25975 - symbol:CTSL1 "Cathepsin L1" species:99...   642  6.9e-63   1
FB|FBgn0013770 - symbol:Cp1 "Cysteine proteinase-1" speci...   640  1.1e-62   1
ZFIN|ZDB-GENE-030131-572 - symbol:wu:fb37b09 "wu:fb37b09"...   640  1.1e-62   1
UNIPROTKB|F1S4J6 - symbol:Ssc.54235 "Cathepsin L1" specie...   639  1.4e-62   1
ZFIN|ZDB-GENE-071004-74 - symbol:zgc:174855 "zgc:174855" ...   638  1.8e-62   1
UNIPROTKB|P07711 - symbol:CTSL1 "Cathepsin L1" species:96...   637  2.3e-62   1
ZFIN|ZDB-GENE-050626-55 - symbol:ctssb.2 "cathepsin S, b....   635  3.8e-62   1
UNIPROTKB|Q5E998 - symbol:CTSL2 "Cathepsin L2" species:99...   633  6.2e-62   1
UNIPROTKB|P83654 - symbol:P83654 "Ervatamin-C" species:52...   633  6.2e-62   1
MGI|MGI:88564 - symbol:Ctsl "cathepsin L" species:10090 "...   632  7.9e-62   1
RGD|2448 - symbol:Ctsl1 "cathepsin L1" species:10116 "Rat...   630  1.3e-61   1
WB|WBGene00000776 - symbol:cpl-1 species:6239 "Caenorhabd...   629  1.6e-61   1
DICTYBASE|DDB_G0278721 - symbol:cprD "cysteine proteinase...   522  2.0e-61   2
MGI|MGI:107341 - symbol:Ctss "cathepsin S" species:10090 ...   625  4.3e-61   1
RGD|1560071 - symbol:Ctsll3 "cathepsin L-like 3" species:...   623  7.1e-61   1
ZFIN|ZDB-GENE-980526-285 - symbol:ctsl1b "cathepsin L, 1 ...   623  7.1e-61   1
ZFIN|ZDB-GENE-080215-7 - symbol:zgc:174153 "zgc:174153" s...   622  9.0e-61   1
DICTYBASE|DDB_G0279185 - symbol:cprF "cysteine proteinase...   520  1.8e-60   2
DICTYBASE|DDB_G0279799 - symbol:cprB "cysteine proteinase...   522  3.8e-60   2
TAIR|locus:2030027 - symbol:AT1G29110 species:3702 "Arabi...   612  1.0e-59   1
ZFIN|ZDB-GENE-041010-76 - symbol:ctsll "cathepsin L, like...   612  1.0e-59   1
UNIPROTKB|Q86GF7 - symbol:Cys "Crustapain" species:6703 "...   610  1.7e-59   1
UNIPROTKB|A4IFS7 - symbol:CTSL1 "CTSL1 protein" species:9...   606  4.5e-59   1
UNIPROTKB|F1SS93 - symbol:CTSS "Uncharacterized protein" ...   604  7.3e-59   1
RGD|1308751 - symbol:RGD1308751 "similar to Cathepsin L p...   604  7.3e-59   1
RGD|61810 - symbol:Ctsk "cathepsin K" species:10116 "Ratt...   602  1.2e-58   1
UNIPROTKB|P43235 - symbol:CTSK "Cathepsin K" species:9606...   596  5.1e-58   1
UNIPROTKB|Q5E968 - symbol:CTSK "Cathepsin K" species:9913...   595  6.6e-58   1
UNIPROTKB|F1PAK0 - symbol:CTSS "Cathepsin S" species:9615...   595  6.6e-58   1
UNIPROTKB|Q8HY81 - symbol:CTSS "Cathepsin S" species:9615...   594  8.4e-58   1
TAIR|locus:2175088 - symbol:ALP "aleurain-like protease" ...   594  8.4e-58   1
ZFIN|ZDB-GENE-050522-559 - symbol:ctssb.1 "cathepsin S, b...   592  1.4e-57   1
DICTYBASE|DDB_G0281605 - symbol:cfaD "peptidase C1A famil...   588  3.6e-57   1
UNIPROTKB|F1NEC8 - symbol:CTSL2 "Uncharacterized protein"...   588  3.6e-57   1
MGI|MGI:107823 - symbol:Ctsk "cathepsin K" species:10090 ...   586  5.9e-57   1
TAIR|locus:2078312 - symbol:AT3G45310 species:3702 "Arabi...   586  5.9e-57   1
MGI|MGI:107285 - symbol:Ctsh "cathepsin H" species:10090 ...   584  9.6e-57   1
UNIPROTKB|P09648 - symbol:CTSL1 "Cathepsin L1" species:90...   583  1.2e-56   1
UNIPROTKB|P25326 - symbol:CTSS "Cathepsin S" species:9913...   580  2.5e-56   1
RGD|2447 - symbol:Ctsh "cathepsin H" species:10116 "Rattu...   579  3.3e-56   1
ZFIN|ZDB-GENE-001205-4 - symbol:ctsk "cathepsin K" specie...   579  3.3e-56   1
UNIPROTKB|Q9GLE3 - symbol:CTSK "Cathepsin K" species:9823...   577  5.3e-56   1
UNIPROTKB|F7B939 - symbol:CTSH "Uncharacterized protein" ...   577  5.3e-56   1
UNIPROTKB|F7BRD4 - symbol:CTSH "Uncharacterized protein" ...   577  5.3e-56   1
RGD|621513 - symbol:Ctss "cathepsin S" species:10116 "Rat...   577  5.3e-56   1
UNIPROTKB|G1K2A7 - symbol:CTSK "Cathepsin K" species:9615...   575  8.6e-56   1
UNIPROTKB|Q3ZKN1 - symbol:CTSK "Cathepsin K" species:9615...   575  8.6e-56   1
DICTYBASE|DDB_G0278401 - symbol:cprH "cysteine proteinase...   573  1.4e-55   1
UNIPROTKB|P25774 - symbol:CTSS "Cathepsin S" species:9606...   573  1.4e-55   1
UNIPROTKB|D3ZZR3 - symbol:D3ZZR3 "Uncharacterized protein...   573  1.4e-55   1
UNIPROTKB|Q24940 - symbol:Cat-1 "Cathepsin L-like protein...   572  1.8e-55   1
UNIPROTKB|F6R7P5 - symbol:CTSH "Uncharacterized protein" ...   570  2.9e-55   1
DICTYBASE|DDB_G0272815 - symbol:cprE "cysteine proteinase...   568  4.8e-55   1
DICTYBASE|DDB_G0272298 - symbol:DDB_G0272298 species:4468...   567  6.1e-55   1
ZFIN|ZDB-GENE-030131-3539 - symbol:ctsh "cathepsin H" spe...   567  6.1e-55   1
UNIPROTKB|G1RBY1 - symbol:CTSH "Uncharacterized protein" ...   565  9.9e-55   1
UNIPROTKB|G1SQF0 - symbol:CTSH "Uncharacterized protein" ...   565  9.9e-55   1
UNIPROTKB|F1PMM9 - symbol:CTSL1 "Cathepsin L1" species:96...   564  1.3e-54   1
UNIPROTKB|P09668 - symbol:CTSH "Pro-cathepsin H" species:...   564  1.3e-54   1
UNIPROTKB|G3R9A7 - symbol:CTSH "Uncharacterized protein" ...   563  1.6e-54   1
UNIPROTKB|O46427 - symbol:CTSH "Pro-cathepsin H" species:...   562  2.1e-54   1
TAIR|locus:2120222 - symbol:RD19 "RESPONSIVE TO DEHYDRATI...   553  1.9e-53   1
UNIPROTKB|Q3T0I2 - symbol:CTSH "Pro-cathepsin H" species:...   551  3.0e-53   1
UNIPROTKB|Q10991 - symbol:CTSL "Cathepsin L1" species:994...   551  3.0e-53   1
UNIPROTKB|G1M0X4 - symbol:CTSH "Uncharacterized protein" ...   550  3.8e-53   1
UNIPROTKB|F7BJD8 - symbol:CTSH "Uncharacterized protein" ...   548  6.3e-53   1
UNIPROTKB|G3SSC1 - symbol:CTSH "Uncharacterized protein" ...   547  8.0e-53   1
UNIPROTKB|J9P7C5 - symbol:J9P7C5 "Uncharacterized protein...   542  2.7e-52   1
ZFIN|ZDB-GENE-050208-336 - symbol:ctskl "cathepsin K, lik...   541  3.5e-52   1
UNIPROTKB|Q90686 - symbol:CTSK "Cathepsin K" species:9031...   540  4.4e-52   1
UNIPROTKB|P83443 - symbol:P83443 "Macrodontain-1" species...   540  4.4e-52   1
UNIPROTKB|F1NZ37 - symbol:LOC420160 "Uncharacterized prot...   538  7.2e-52   1
ZFIN|ZDB-GENE-030131-9831 - symbol:ctsf "cathepsin F" spe...   538  7.2e-52   1

WARNING:  Descriptions of 198 database sequences were not reported due to the
          limiting value of parameter V = 100.


>TAIR|locus:2167821 [details] [associations]
            symbol:RD21B "esponsive to dehydration 21B" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS] [GO:0005773
            "vacuole" evidence=IDA] [GO:0009651 "response to salt stress"
            evidence=IEP] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005829 "cytosol" evidence=IDA] [GO:0052541 "plant-type cell
            wall cellulose metabolic process" evidence=RCA] [GO:0052546 "cell
            wall pectin metabolic process" evidence=RCA] [GO:0005783
            "endoplasmic reticulum" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005783 GO:GO:0005829 EMBL:CP002688
            GO:GO:0005773 GO:GO:0009651 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AB008267 HSSP:O65039
            HOGENOM:HOG000230773 KO:K01376 InterPro:IPR000118 Pfam:PF00396
            SMART:SM00277 ProtClustDB:CLSN2688498 EMBL:AY062608 EMBL:AY114661
            IPI:IPI00520971 RefSeq:NP_568620.1 UniGene:At.24130 SMR:Q9FMH8
            IntAct:Q9FMH8 STRING:Q9FMH8 MEROPS:C01.A12
            EnsemblPlants:AT5G43060.1 GeneID:834321 KEGG:ath:AT5G43060
            TAIR:At5g43060 InParanoid:Q9FMH8 OMA:ENSEASL Genevestigator:Q9FMH8
            Uniprot:Q9FMH8
        Length = 463

 Score = 1212 (431.7 bits), Expect = 2.7e-123, P = 2.7e-123
 Identities = 229/368 (62%), Positives = 278/368 (75%)

Query:     7 FLAISTLVFLF-FXXXXXXXXXXXXXYDNNHDHSSSW-RTDDEVMTIYQTWLAKHGK--- 61
             FL +S ++ L                YD NH  ++   R+D EV  IY+ W+ +HGK   
Sbjct:     3 FLKLSPMILLLAMIGVSYAMDMSIISYDENHHITTETSRSDSEVERIYEAWMVEHGKKKM 62

Query:    62 TSNGMG-HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
               NG+G   ++RF+IFKDNLRFIDEHN+ N +YK+GL +FADLTNEEYR+MYLG +    
Sbjct:    63 NQNGLGAEKDQRFEIFKDNLRFIDEHNTKNLSYKLGLTRFADLTNEEYRSMYLGAKPT-- 120

Query:   121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
             +R++K+   S RY  + GD LP+SVDWR++GAV  VKDQGSCGSCWAFST+ AVEGINKI
Sbjct:   121 KRVLKT---SDRYQARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKI 177

Query:   181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
             VTG+LISLSEQELVDCD   N GCNGGLMDYAF+FII+NGG+D+E DYPY  A+ +CD +
Sbjct:   178 VTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADGRCDQN 237

Query:   241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
             R+NAKVV+ID YEDV    E SLKKA+A QP+SVAIEAGGRAFQ Y SGVF G CG+ LD
Sbjct:   238 RKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYSSGVFDGLCGTELD 297

Query:   301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNS 360
             HGVVAVGYGTENG DYW+VRNSWG+ WGE+GY+K+ RN+ +  TGKCGIAMEASYP+K  
Sbjct:   298 HGVVAVGYGTENGKDYWIVRNSWGNRWGESGYIKMARNI-EAPTGKCGIAMEASYPIKKG 356

Query:   361 QNSAKPKP 368
             QN   P P
Sbjct:   357 QNPPNPGP 364


>TAIR|locus:2825832 [details] [associations]
            symbol:RD21A "responsive to dehydration 21A" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;IMP]
            [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISS;IDA;IMP] [GO:0009507 "chloroplast" evidence=IDA]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0008233 "peptidase
            activity" evidence=IDA] [GO:0048046 "apoplast" evidence=IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005829 "cytosol"
            evidence=RCA] [GO:0009506 "plasmodesma" evidence=IDA] [GO:0050832
            "defense response to fungus" evidence=IMP] [GO:0006096 "glycolysis"
            evidence=RCA] [GO:0006833 "water transport" evidence=RCA]
            [GO:0006972 "hyperosmotic response" evidence=RCA] [GO:0007030
            "Golgi organization" evidence=RCA] [GO:0009266 "response to
            temperature stimulus" evidence=RCA] [GO:0009651 "response to salt
            stress" evidence=RCA] [GO:0015996 "chlorophyll catabolic process"
            evidence=RCA] [GO:0042744 "hydrogen peroxide catabolic process"
            evidence=RCA] [GO:0046686 "response to cadmium ion" evidence=RCA]
            [GO:0009414 "response to water deprivation" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0009506 GO:GO:0009507 GO:GO:0005773
            GO:GO:0050832 GO:GO:0048046 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC083835
            HOGENOM:HOG000230773 KO:K01376 InterPro:IPR000118 Pfam:PF00396
            SMART:SM00277 UniGene:At.43549 EMBL:D13043 EMBL:AY072130
            EMBL:AY133781 IPI:IPI00530094 PIR:JN0719 RefSeq:NP_564497.1
            UniGene:At.47599 UniGene:At.71705 ProteinModelPortal:P43297
            SMR:P43297 IntAct:P43297 STRING:P43297 MEROPS:C01.064 PaxDb:P43297
            PRIDE:P43297 ProMEX:P43297 EnsemblPlants:AT1G47128.1 GeneID:841122
            KEGG:ath:AT1G47128 TAIR:At1g47128 InParanoid:P43297 OMA:EAWLVKH
            PhylomeDB:P43297 ProtClustDB:CLSN2688498 Genevestigator:P43297
            GermOnline:AT1G47128 Uniprot:P43297
        Length = 462

 Score = 1191 (424.3 bits), Expect = 4.6e-121, P = 4.6e-121
 Identities = 215/359 (59%), Positives = 272/359 (75%)

Query:    13 LVFLFFXXXXXXXXXXXXXYDNNHDHSSSW-RTDDEVMTIYQTWLAKHGK--TSNGMGHN 69
             ++FL               YD  H  S++  R++ EVM+IY+ WL KHGK  + N +   
Sbjct:    10 ILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEK 69

Query:    70 EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
             ++RF+IFKDNLRF+DEHN  N +Y++GL +FADLTN+EYR+ YLG + + K      +  
Sbjct:    70 DRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGE----RRT 125

Query:   130 SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
             S RY  + GDELPES+DWR+KGAV  VKDQG CGSCWAFST+ AVEGIN+IVTG+LI+LS
Sbjct:   126 SLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLS 185

Query:   190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
             EQELVDCD   N GCNGGLMDYAF+FII+NGG+D+++DYPY G +  CD  R+NAKVV+I
Sbjct:   186 EQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTI 245

Query:   250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
             D YEDV  + E SLKKAVA QP+S+AIEAGGRAFQ Y+SG+F G CG+ LDHGVVAVGYG
Sbjct:   246 DSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYG 305

Query:   310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
             TENG DYW+VRNSWG  WGE+GY+++ RN+  +++GKCGIA+E SYP+KN +N   P P
Sbjct:   306 TENGKDYWIVRNSWGKSWGESGYLRMARNIA-SSSGKCGIAIEPSYPIKNGENPPNPGP 363


>TAIR|locus:2090614 [details] [associations]
            symbol:AT3G19390 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA;ISS] [GO:0000041 "transition metal ion
            transport" evidence=RCA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:CP002686 GenomeReviews:BA000014_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AB025624
            MEROPS:I29.003 OMA:KAMDQKC HSSP:O65039 HOGENOM:HOG000230773
            InterPro:IPR000118 Pfam:PF00396 SMART:SM00277 EMBL:AY062725
            EMBL:AY093350 IPI:IPI00520189 RefSeq:NP_566633.1 UniGene:At.27473
            ProteinModelPortal:Q9LT78 SMR:Q9LT78 IntAct:Q9LT78 STRING:Q9LT78
            PaxDb:Q9LT78 PRIDE:Q9LT78 EnsemblPlants:AT3G19390.1 GeneID:821473
            KEGG:ath:AT3G19390 TAIR:At3g19390 InParanoid:Q9LT78
            PhylomeDB:Q9LT78 ProtClustDB:CLSN2917188 Genevestigator:Q9LT78
            Uniprot:Q9LT78
        Length = 452

 Score = 1144 (407.8 bits), Expect = 4.4e-116, P = 4.4e-116
 Identities = 210/328 (64%), Positives = 265/328 (80%)

Query:    43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFA 101
             R + E   +Y+ WL ++ K  NG+G  E+RF+IFKDNL+F++EH+S+ NRTY+VGL +FA
Sbjct:    34 RNEAEARRMYERWLVENRKNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFA 93

Query:   102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
             DLTN+E+RA+YL  RS  +R   +  V  ++Y  K GD LP+++DWR KGAVNPVKDQGS
Sbjct:    94 DLTNDEFRAIYL--RSKMERT--RVPVKGEKYLYKVGDSLPDAIDWRAKGAVNPVKDQGS 149

Query:   162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
             CGSCWAFS + AVEGIN+I TGELISLSEQELVDCD   N GC GGLMDYAF+FII+NGG
Sbjct:   150 CGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCDTSYNDGCGGGLMDYAFKFIIENGG 209

Query:   222 MDSEQDYPYLGAE-NKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
             +D+E+DYPY+  + N C+  ++N +VV+IDGYEDV   DE SLKKA+A+QP+SVAIEAGG
Sbjct:   210 IDTEEDYPYIATDVNVCNSDKKNTRVVTIDGYEDVPQNDEKSLKKALANQPISVAIEAGG 269

Query:   281 RAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
             RAFQ Y SGVFTG CG++LDHGVVAVGYG+E G DYW+VRNSWGS+WGE+GY KL+RN+ 
Sbjct:   270 RAFQLYTSGVFTGTCGTSLDHGVVAVGYGSEGGQDYWIVRNSWGSNWGESGYFKLERNIK 329

Query:   341 DTNTGKCGIAMEASYPVKNSQNSAKPKP 368
             +++ GKCG+AM ASYP K+S  S  PKP
Sbjct:   330 ESS-GKCGVAMMASYPTKSS-GSNPPKP 355


>TAIR|locus:2090629 [details] [associations]
            symbol:AT3G19400 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0019344 "cysteine biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            EMBL:CP002686 GenomeReviews:BA000014_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AB025624
            MEROPS:I29.003 HOGENOM:HOG000230773 EMBL:AK118509 IPI:IPI00543468
            RefSeq:NP_566634.2 UniGene:At.38409 ProteinModelPortal:Q9LT77
            SMR:Q9LT77 PaxDb:Q9LT77 PRIDE:Q9LT77 EnsemblPlants:AT3G19400.1
            GeneID:821474 KEGG:ath:AT3G19400 TAIR:At3g19400 InParanoid:Q9LT77
            OMA:IGEHERR ProtClustDB:CLSN2679975 Genevestigator:Q9LT77
            Uniprot:Q9LT77
        Length = 362

 Score = 1050 (374.7 bits), Expect = 4.0e-106, P = 4.0e-106
 Identities = 200/322 (62%), Positives = 251/322 (77%)

Query:    43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFA 101
             R + EV  +Y+ WL ++ K  NG+G  E+RF+IFKDNL+F+DEHNS+ +RT++VGL +FA
Sbjct:    35 RNETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFA 94

Query:   102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
             DLTNEE+RA+YL  R   +R   K  V ++RY  K GD LP+ VDWR  GAV  VKDQG+
Sbjct:    95 DLTNEEFRAIYL--RKKMERT--KDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGN 150

Query:   162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNG 220
             CGSCWAFS V AVEGIN+I TGELISLSEQELVDCDR  +NAGC+GG+M+YAF+FI++NG
Sbjct:   151 CGSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNG 210

Query:   221 GMDSEQDYPYLGAE-NKCDPSRRN-AKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
             G++++QDYPY   +   C+  + N  +VV+IDGYEDV   DE SLKKAVA QPVSVAIEA
Sbjct:   211 GIETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEA 270

Query:   279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
               +AFQ Y+SGV TG CG +LDHGVV VGYG+ +G DYW++RNSWG +WG++GYVKLQRN
Sbjct:   271 SSQAFQLYKSGVMTGTCGISLDHGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRN 330

Query:   339 LLDTNTGKCGIAMEASYPVKNS 360
             + D   GKCGIAM  SYP K+S
Sbjct:   331 I-DDPFGKCGIAMMPSYPTKSS 351


>TAIR|locus:2117979 [details] [associations]
            symbol:AT4G23520 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002687 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            KO:K01376 IPI:IPI00527171 RefSeq:NP_567686.2 UniGene:At.32421
            ProteinModelPortal:F4JNL3 SMR:F4JNL3 MEROPS:C01.A22 PRIDE:F4JNL3
            EnsemblPlants:AT4G23520.1 GeneID:828452 KEGG:ath:AT4G23520
            OMA:PANDEIS ArrayExpress:F4JNL3 Uniprot:F4JNL3
        Length = 356

 Score = 1010 (360.6 bits), Expect = 6.9e-102, P = 6.9e-102
 Identities = 191/325 (58%), Positives = 250/325 (76%)

Query:    43 RTDDEVMTIYQTWLAKHGKT-SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFA 101
             R+++EV  I+Q W++KHGKT +N +G  E+RFQ FKDNLRFID+HN+ N +Y++GL +FA
Sbjct:    38 RSNEEVEFIFQMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHNAKNLSYQLGLTRFA 97

Query:   102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
             DLT +EYR ++ G+    K+R +K+   S+RY   AGD+LPESVDWR++GAV+ +KDQG+
Sbjct:    98 DLTVQEYRDLFPGSPKP-KQRNLKT---SRRYVPLAGDQLPESVDWRQEGAVSEIKDQGT 153

Query:   162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNG-GLMDYAFQFIIQNG 220
             C SCWAFSTVAAVEG+NKIVTGELISLSEQELVDC+  +N GC G GLMD AFQF+I N 
Sbjct:   154 CNSCWAFSTVAAVEGLNKIVTGELISLSEQELVDCNL-VNNGCYGSGLMDTAFQFLINNN 212

Query:   221 GMDSEQDYPYLGAENKCDPSRRNA-KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
             G+DSE+DYPY G +  C+  +  + KV++ID YEDV   DE+SL+KAVA QPVSV ++  
Sbjct:   213 GLDSEKDYPYQGTQGSCNRKQSTSNKVITIDSYEDVPANDEISLQKAVAHQPVSVGVDKK 272

Query:   280 GRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
              + F  Y S ++ G CG+ LDH +V VGYG+ENG DYW+VRNSWG+ WG+ GY+K+ RN 
Sbjct:   273 SQEFMLYRSCIYNGPCGTNLDHALVIVGYGSENGQDYWIVRNSWGTTWGDAGYIKIARNF 332

Query:   340 LDTNTGKCGIAMEASYPVKNSQNSA 364
              D   G CGIAM ASYP+KNS ++A
Sbjct:   333 EDPK-GLCGIAMLASYPIKNSASNA 356


>TAIR|locus:2024362 [details] [associations]
            symbol:XBCP3 "xylem bark cysteine peptidase 3"
            species:3702 "Arabidopsis thaliana" [GO:0005576 "extracellular
            region" evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0005783 "endoplasmic
            reticulum" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005783 EMBL:CP002684 GO:GO:0005773 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 MEROPS:I29.003
            HOGENOM:HOG000230773 InterPro:IPR000118 Pfam:PF00396 SMART:SM00277
            UniGene:At.10233 OMA:CEIESAV EMBL:BT026490 EMBL:AK226753
            IPI:IPI00536687 RefSeq:NP_563855.1 ProteinModelPortal:Q0WVJ5
            SMR:Q0WVJ5 PRIDE:Q0WVJ5 EnsemblPlants:AT1G09850.1 GeneID:837517
            KEGG:ath:AT1G09850 TAIR:At1g09850 InParanoid:Q0WVJ5
            PhylomeDB:Q0WVJ5 ProtClustDB:CLSN2687747 Genevestigator:Q0WVJ5
            Uniprot:Q0WVJ5
        Length = 437

 Score = 920 (328.9 bits), Expect = 2.4e-92, P = 2.4e-92
 Identities = 173/330 (52%), Positives = 226/330 (68%)

Query:    40 SSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLN 98
             SS  + D++  ++  W  KHGKT       ++R QIFKDN  F+ +HN + N TY + LN
Sbjct:    20 SSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLN 79

Query:    99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
              FADLT+ E++A  LG    A   +M SK  S   + K    +P+SVDWR+KGAV  VKD
Sbjct:    80 AFADLTHHEFKASRLGLSVSAPSVIMASKGQSLGGSVK----VPDSVDWRKKGAVTNVKD 135

Query:   159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
             QGSCG+CW+FS   A+EGIN+IVTG+LISLSEQEL+DCD+  NAGCNGGLMDYAF+F+I+
Sbjct:   136 QGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIK 195

Query:   219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
             N G+D+E+DYPY   +  C   +   KVV+ID Y  V   DE +L +AVA QPVSV I  
Sbjct:   196 NHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICG 255

Query:   279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
               RAFQ Y SG+F+G C ++LDH V+ VGYG++NGVDYW+V+NSWG  WG +G++ +QRN
Sbjct:   256 SERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRN 315

Query:   339 LLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
               + + G CGI M ASYP+K   N   P P
Sbjct:   316 T-ENSDGVCGINMLASYPIKTHPNPPPPSP 344


>TAIR|locus:2157712 [details] [associations]
            symbol:CEP1 "cysteine endopeptidase 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005783 EMBL:CP002688
            GenomeReviews:BA000015_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 PROSITE:PS00014
            EMBL:AB024031 MEROPS:I29.003 EMBL:HM367092 EMBL:AY091087
            IPI:IPI00516991 RefSeq:NP_568722.1 UniGene:At.7918 HSSP:O65039
            ProteinModelPortal:Q9FGR9 SMR:Q9FGR9 PaxDb:Q9FGR9 PRIDE:Q9FGR9
            EnsemblPlants:AT5G50260.1 GeneID:835091 KEGG:ath:AT5G50260
            TAIR:At5g50260 HOGENOM:HOG000230773 InParanoid:Q9FGR9 KO:K16292
            OMA:WHSKKYH PhylomeDB:Q9FGR9 ProtClustDB:CLSN2689970
            Genevestigator:Q9FGR9 Uniprot:Q9FGR9
        Length = 361

 Score = 910 (325.4 bits), Expect = 2.7e-91, P = 2.7e-91
 Identities = 171/327 (52%), Positives = 222/327 (67%)

Query:    38 HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
             H+    +++ +  +Y+ W + H   +  +    KRF +FK N++ I E N  +++YK+ L
Sbjct:    24 HNKDVESENSLWELYERWRSHH-TVARSLEEKAKRFNVFKHNVKHIHETNKKDKSYKLKL 82

Query:    98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
             NKF D+T+EE+R  Y G+     R     K A++ +     + LP SVDWR+ GAV PVK
Sbjct:    83 NKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVK 142

Query:   158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
             +QG CGSCWAFSTV AVEGIN+I T +L SLSEQELVDCD   N GCNGGLMD AF+FI 
Sbjct:   143 NQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCDTNQNQGCNGGLMDLAFEFIK 202

Query:   218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
             + GG+ SE  YPY  ++  CD ++ NA VVSIDG+EDV    E  L KAVA+QPVSVAI+
Sbjct:   203 EKGGLTSELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAID 262

Query:   278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
             AGG  FQ Y  GVFTG CG+ L+HGV  VGYGT  +G  YW+V+NSWG +WGE GY+++Q
Sbjct:   263 AGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQ 322

Query:   337 RNLLDTNTGKCGIAMEASYPVKNSQNS 363
             R +     G CGIAMEASYP+KNS  +
Sbjct:   323 RGIRHKE-GLCGIAMEASYPLKNSNTN 348


>TAIR|locus:2122113 [details] [associations]
            symbol:XCP1 "xylem cysteine peptidase 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0000325 "plant-type vacuole" evidence=IDA] [GO:0005634
            "nucleus" evidence=IDA] [GO:0010623 "developmental programmed cell
            death" evidence=IMP] [GO:0010413 "glucuronoxylan metabolic process"
            evidence=RCA] [GO:0045492 "xylan biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005886
            GO:GO:0005634 EMBL:CP002687 GenomeReviews:CT486007_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0000325
            EMBL:AL022604 EMBL:AL161587 GO:GO:0010623 MEROPS:I29.003
            HOGENOM:HOG000230773 EMBL:AF191027 EMBL:AK117394 EMBL:BT005179
            IPI:IPI00532220 PIR:T06122 RefSeq:NP_567983.1 UniGene:At.2280
            UniGene:At.67622 ProteinModelPortal:O65493 SMR:O65493 STRING:O65493
            PaxDb:O65493 PRIDE:O65493 EnsemblPlants:AT4G35350.1 GeneID:829688
            KEGG:ath:AT4G35350 GeneFarm:5033 TAIR:At4g35350 InParanoid:O65493
            KO:K16290 OMA:FEVFREN PhylomeDB:O65493 ProtClustDB:CLSN2689772
            Genevestigator:O65493 Uniprot:O65493
        Length = 355

 Score = 907 (324.3 bits), Expect = 5.7e-91, P = 5.7e-91
 Identities = 168/313 (53%), Positives = 221/313 (70%)

Query:    46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
             D+++ ++++W+++H K    +     RF++F++NL  ID+ N+   +Y +GLN+FADLT+
Sbjct:    45 DKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLTH 104

Query:   106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
             EE++  YLG    AK +  + +  S  +  +   +LP+SVDWR+KGAV PVKDQG CGSC
Sbjct:   105 EEFKGRYLGL---AKPQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSC 161

Query:   166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
             WAFSTVAAVEGIN+I TG L SLSEQEL+DCD   N+GCNGGLMDYAFQ+II  GG+  E
Sbjct:   162 WAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLHKE 221

Query:   226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH 285
              DYPYL  E  C   + + + V+I GYEDV   D+ SL KA+A QPVSVAIEA GR FQ 
Sbjct:   222 DDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQF 281

Query:   286 YESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
             Y+ GVF G+CG+ LDHGV AVGYG+  G DY +V+NSWG  WGE G+++++RN      G
Sbjct:   282 YKGGVFNGKCGTDLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRNT-GKPEG 340

Query:   346 KCGIAMEASYPVK 358
              CGI   ASYP K
Sbjct:   341 LCGINKMASYPTK 353


>TAIR|locus:2030427 [details] [associations]
            symbol:XCP2 "xylem cysteine peptidase 2" species:3702
            "Arabidopsis thaliana" [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009507 "chloroplast" evidence=ISM] [GO:0008233 "peptidase
            activity" evidence=ISS] [GO:0005618 "cell wall" evidence=IDA]
            [GO:0010623 "developmental programmed cell death" evidence=IMP]
            [GO:0010075 "regulation of meristem growth" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0005886 GO:GO:0005618 GO:GO:0005773
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AC069251 EMBL:AC007369 GO:GO:0010623
            OMA:YKEIPEG HOGENOM:HOG000230773 KO:K16290 EMBL:AF191028
            EMBL:BT004822 IPI:IPI00526722 PIR:A86341 RefSeq:NP_564126.1
            UniGene:At.21316 ProteinModelPortal:Q9LM66 SMR:Q9LM66 IntAct:Q9LM66
            STRING:Q9LM66 MEROPS:C01.120 PaxDb:Q9LM66 PRIDE:Q9LM66
            ProMEX:Q9LM66 EnsemblPlants:AT1G20850.1 GeneID:838677
            KEGG:ath:AT1G20850 GeneFarm:5034 TAIR:At1g20850 InParanoid:Q9LM66
            PhylomeDB:Q9LM66 ProtClustDB:CLSN2917031 Genevestigator:Q9LM66
            GermOnline:AT1G20850 Uniprot:Q9LM66
        Length = 356

 Score = 899 (321.5 bits), Expect = 4.0e-90, P = 4.0e-90
 Identities = 170/333 (51%), Positives = 231/333 (69%)

Query:    32 YDNNHDHS------SSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
             + ++HD+S          + D+++ +++ W++   K    +     RF++FKDNL+ IDE
Sbjct:    25 FASSHDYSIVGYSPEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDE 84

Query:    86 HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESV 145
              N   ++Y +GLN+FADL++EE++ MYLG ++D  RR  +   A   +A +  + +P+SV
Sbjct:    85 TNKKGKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAE--FAYRDVEAVPKSV 142

Query:   146 DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCN 205
             DWR+KGAV  VK+QGSCGSCWAFSTVAAVEGINKIVTG L +LSEQEL+DCD   N GCN
Sbjct:   143 DWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCN 202

Query:   206 GGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
             GGLMDYAF++I++NGG+  E+DYPY   E  C+  +  ++ V+I+G++DV   DE SL K
Sbjct:   203 GGLMDYAFEYIVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLK 262

Query:   266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGS 325
             A+A QP+SVAI+A GR FQ Y  GVF G CG  LDHGV AVGYG+  G DY +V+NSWG 
Sbjct:   263 ALAHQPLSVAIDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYIIVKNSWGP 322

Query:   326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
              WGE GY++L+RN      G CGI   AS+P K
Sbjct:   323 KWGEKGYIRLKRNT-GKPEGLCGINKMASFPTK 354


>TAIR|locus:2128243 [details] [associations]
            symbol:AT4G11310 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005618 "cell wall"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 GO:GO:0005618 EMBL:CP002687
            GenomeReviews:CT486007_GR EMBL:AL096882 EMBL:AL161531
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            HOGENOM:HOG000230773 KO:K01376 EMBL:AY093066 EMBL:BT000099
            IPI:IPI00520496 PIR:T13022 RefSeq:NP_567376.1 UniGene:At.43189
            ProteinModelPortal:Q9SUT0 SMR:Q9SUT0 IntAct:Q9SUT0 STRING:Q9SUT0
            MEROPS:C01.A20 PaxDb:Q9SUT0 PRIDE:Q9SUT0 EnsemblPlants:AT4G11310.1
            GeneID:826733 KEGG:ath:AT4G11310 TAIR:At4g11310 InParanoid:Q9SUT0
            OMA:EVCHGAD PhylomeDB:Q9SUT0 ProtClustDB:CLSN2689395
            Genevestigator:Q9SUT0 GermOnline:AT4G11310 Uniprot:Q9SUT0
        Length = 364

 Score = 894 (319.8 bits), Expect = 1.4e-89, P = 1.4e-89
 Identities = 174/334 (52%), Positives = 228/334 (68%)

Query:    33 DNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT 92
             DNN  HS     D E   I+++W+ KHGK    +   E+R  IF+DNLRFI+  N+ N +
Sbjct:    33 DNNRLHSVF---DAEASLIFESWMVKHGKVYGSVAEKERRLTIFEDNLRFINNRNAENLS 89

Query:    93 YKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGA 152
             Y++GL  FADL+  EY+ +  G      R  +    +S RY   A D LP+SVDWR +GA
Sbjct:    90 YRLGLTGFADLSLHEYKEVCHGADPRPPRNHV-FMTSSDRYKTSADDVLPKSVDWRNEGA 148

Query:   153 VNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYA 212
             V  VKDQG C SCWAFSTV AVEG+NKIVTGEL++LSEQ+L++C+++ N GC GG ++ A
Sbjct:   149 VTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKLETA 207

Query:   213 FQFIIQNGGMDSEQDYPYLGAENKCDPS-RRNAKVVSIDGYEDVSPFDEMSLKKAVADQP 271
             ++FI++NGG+ ++ DYPY      CD   + N K V IDGYE++   DE +L KAVA QP
Sbjct:   208 YEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNVMIDGYENLPANDESALMKAVAHQP 267

Query:   272 VSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENG 331
             V+  I++  R FQ YESGVF G CG+ L+HGVV VGYGTENG DYWLV+NS G  WGE G
Sbjct:   268 VTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGYGTENGRDYWLVKNSRGITWGEAG 327

Query:   332 YVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
             Y+K+ RN+ +   G CGIAM ASYP+KNS ++ K
Sbjct:   328 YMKMARNIANPR-GLCGIAMRASYPLKNSFSTDK 360


>TAIR|locus:2128253 [details] [associations]
            symbol:AT4G11320 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002687 GenomeReviews:CT486007_GR EMBL:AL096882 EMBL:AL161531
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 OMA:ICHGADP
            HOGENOM:HOG000230773 KO:K01376 ProtClustDB:CLSN2689395
            EMBL:AY035055 EMBL:AY051062 IPI:IPI00520480 PIR:T13023
            RefSeq:NP_567377.1 UniGene:At.25206 ProteinModelPortal:Q9SUS9
            SMR:Q9SUS9 STRING:Q9SUS9 MEROPS:C01.A21 PaxDb:Q9SUS9 PRIDE:Q9SUS9
            EnsemblPlants:AT4G11320.1 GeneID:826734 KEGG:ath:AT4G11320
            TAIR:At4g11320 InParanoid:Q9SUS9 PhylomeDB:Q9SUS9
            Genevestigator:Q9SUS9 GermOnline:AT4G11320 Uniprot:Q9SUS9
        Length = 371

 Score = 876 (313.4 bits), Expect = 1.1e-87, P = 1.1e-87
 Identities = 169/338 (50%), Positives = 229/338 (67%)

Query:    33 DNNHDHSSSWRT----DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS 88
             DN+H  +   R     D E   ++++W+ KHGK  + +   E+R  IF+DNLRFI   N+
Sbjct:    33 DNHHVTAGPGRRQGIFDAEATLMFESWMVKHGKVYDSVAEKERRLTIFEDNLRFITNRNA 92

Query:    89 LNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWR 148
              N +Y++GLN+FADL+  EY  +  G      R  +    +S RY    GD LP+SVDWR
Sbjct:    93 ENLSYRLGLNRFADLSLHEYGEICHGADPRPPRNHV-FMTSSNRYKTSDGDVLPKSVDWR 151

Query:   149 EKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGL 208
              +GAV  VKDQG C SCWAFSTV AVEG+NKIVTGEL++LSEQ+L++C+++ N GC GG 
Sbjct:   152 NEGAVTEVKDQGLCRSCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGK 210

Query:   209 MDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS-RRNAKVVSIDGYEDVSPFDEMSLKKAV 267
             ++ A++FI+ NGG+ ++ DYPY      C+   + + K V IDGYE++   DE +L KAV
Sbjct:   211 VETAYEFIMNNGGLGTDNDYPYKALNGVCEGRLKEDNKNVMIDGYENLPANDEAALMKAV 270

Query:   268 ADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDW 327
             A QPV+  +++  R FQ YESGVF G CG+ L+HGVV VGYGTENG DYW+V+NS G  W
Sbjct:   271 AHQPVTAVVDSSSREFQLYESGVFDGTCGTNLNHGVVVVGYGTENGRDYWIVKNSRGDTW 330

Query:   328 GENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
             GE GY+K+ RN+ +   G CGIAM ASYP+KNS ++ K
Sbjct:   331 GEAGYMKMARNIANPR-GLCGIAMRASYPLKNSFSTDK 367


>TAIR|locus:2097104 [details] [associations]
            symbol:AT3G43960 species:3702 "Arabidopsis thaliana"
            [GO:0005886 "plasma membrane" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0031225 "anchored to
            membrane" evidence=TAS] [GO:0048767 "root hair elongation"
            evidence=IMP] [GO:0016132 "brassinosteroid biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:CP002686
            GenomeReviews:BA000014_GR GO:GO:0031225 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0048767 MEROPS:I29.003 HOGENOM:HOG000230773
            EMBL:AL163975 EMBL:AK118634 IPI:IPI00526842 PIR:T48950
            RefSeq:NP_566867.1 UniGene:At.43352 ProteinModelPortal:Q9LXW3
            SMR:Q9LXW3 STRING:Q9LXW3 PaxDb:Q9LXW3 PRIDE:Q9LXW3
            EnsemblPlants:AT3G43960.1 GeneID:823513 KEGG:ath:AT3G43960
            TAIR:At3g43960 eggNOG:NOG286334 InParanoid:Q9LXW3 KO:K01376
            OMA:MAISFRT PhylomeDB:Q9LXW3 ProtClustDB:CLSN2917367
            Genevestigator:Q9LXW3 GermOnline:AT3G43960 Uniprot:Q9LXW3
        Length = 376

 Score = 869 (311.0 bits), Expect = 6.0e-87, P = 6.0e-87
 Identities = 177/332 (53%), Positives = 235/332 (70%)

Query:    39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGL 97
             + S R + EV+T+Y+ WL ++GK  NG+G  E+RF+IFKDNL+ I+EHNS  NR+Y+ GL
Sbjct:    28 TESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGL 87

Query:    98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNP-V 156
             NKF+DLT +E++A YLG + + K+ L  S VA +RY  K GD LP+ VDWRE+GAV P V
Sbjct:    88 NKFSDLTADEFQASYLGGKME-KKSL--SDVA-ERYQYKEGDVLPDEVDWRERGAVVPRV 143

Query:   157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
             K QG CGSCWAF+   AVEGIN+I TGEL+SLSEQEL+DCDR   N GC GG   +AF+F
Sbjct:   144 KRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEF 203

Query:   216 IIQNGGMDSEQDYPYLGAENK-CDP-SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
             I +NGG+ S++ Y Y G +   C     +  +VV+I+G+E V   DEMSLKKAVA QP+S
Sbjct:   204 IKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPIS 263

Query:   274 VAIEAGGRAFQHYESGVFTGECGSAL-DHGVVAVGYGTENGV-DYWLVRNSWGSDWGENG 331
             V I A   +   Y+SGV+ G C +   DH V+ VGYGT +   DYWL+RNSWG +WGE G
Sbjct:   264 VMISAANMS--DYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGG 321

Query:   332 YVKLQRNLLDTNTGKCGIAMEASYPVKNSQNS 363
             Y++LQRN  +  TGKC +A+   YP+K++ +S
Sbjct:   322 YLRLQRNFHEP-TGKCAVAVAPVYPIKSNSSS 352


>TAIR|locus:505006391 [details] [associations]
            symbol:CEP3 "cysteine endopeptidase 3" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005783 EMBL:CP002686
            GenomeReviews:BA000014_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 PROSITE:PS00014
            EMBL:AL049659 HSSP:O65039 HOGENOM:HOG000230773 KO:K16292
            EMBL:AK119026 IPI:IPI00525150 PIR:T06707 RefSeq:NP_566901.1
            UniGene:At.3162 ProteinModelPortal:Q9STL5 SMR:Q9STL5 MEROPS:C01.A02
            PRIDE:Q9STL5 EnsemblPlants:AT3G48350.1 GeneID:823993
            KEGG:ath:AT3G48350 TAIR:At3g48350 InParanoid:Q9STL5 OMA:DITHHEF
            PhylomeDB:Q9STL5 ProtClustDB:CLSN2917387 Genevestigator:Q9STL5
            Uniprot:Q9STL5
        Length = 364

 Score = 853 (305.3 bits), Expect = 3.0e-85, P = 3.0e-85
 Identities = 169/330 (51%), Positives = 217/330 (65%)

Query:    44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
             T++ V  +Y+ W   H   S       KRF +F+ N+  +   N  N+ YK+ +N+FAD+
Sbjct:    30 TEENVWKLYERWRGHHS-VSRASHEAIKRFNVFRHNVLHVHRTNKKNKPYKLKINRFADI 88

Query:   104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
             T+ E+R+ Y G+     R L   K  S  +  +    +P SVDWREKGAV  VK+Q  CG
Sbjct:    89 THHEFRSSYAGSNVKHHRMLRGPKRGSGGFMYENVTRVPSSVDWREKGAVTEVKNQQDCG 148

Query:   164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
             SCWAFSTVAAVEGINKI T +L+SLSEQELVDCD + N GC GGLM+ AF+FI  NGG+ 
Sbjct:   149 SCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQGCAGGLMEPAFEFIKNNGGIK 208

Query:   224 SEQDYPYLGAENK-CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
             +E+ YPY  ++ + C  +    + V+IDG+E V   DE  L KAVA QPVSVAI+AG   
Sbjct:   209 TEETYPYDSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAVAHQPVSVAIDAGSSD 268

Query:   283 FQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
             FQ Y  GVF GECG+ L+HGVV VGYG T+NG  YW+VRNSWG +WGE GYV+++R + +
Sbjct:   269 FQLYSEGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIERGISE 328

Query:   342 TNTGKCGIAMEASYPVKNSQNSAKPKPHSS 371
              N G+CGIAMEASYP K S     P  H S
Sbjct:   329 -NEGRCGIAMEASYPTKLSST---PSTHES 354


>TAIR|locus:2152445 [details] [associations]
            symbol:SAG12 "senescence-associated gene 12" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
            [GO:0007568 "aging" evidence=IEP;TAS] [GO:0010150 "leaf senescence"
            evidence=IEP;TAS] [GO:0010282 "senescence-associated vacuole"
            evidence=IDA] [GO:0009817 "defense response to fungus, incompatible
            interaction" evidence=IEP] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:CP002688 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0010150 GO:GO:0009817 EMBL:AB016870
            HSSP:O65039 OMA:NDEQALM EMBL:AF370131 EMBL:AY040073 IPI:IPI00544181
            RefSeq:NP_568651.1 UniGene:At.75256 UniGene:At.7710
            ProteinModelPortal:Q9FJ47 SMR:Q9FJ47 IntAct:Q9FJ47 STRING:Q9FJ47
            MEROPS:C01.117 PRIDE:Q9FJ47 ProMEX:Q9FJ47 EnsemblPlants:AT5G45890.1
            GeneID:834629 KEGG:ath:AT5G45890 TAIR:At5g45890 InParanoid:Q9FJ47
            PhylomeDB:Q9FJ47 ProtClustDB:CLSN2917735 ArrayExpress:Q9FJ47
            Genevestigator:Q9FJ47 GO:GO:0010282 Uniprot:Q9FJ47
        Length = 346

 Score = 838 (300.0 bits), Expect = 1.2e-83, P = 1.2e-83
 Identities = 158/306 (51%), Positives = 212/306 (69%)

Query:    55 WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL--NRTYKVGLNKFADLTNEEYRAMY 112
             W+ KHG+    +     R+ +FK+N+  I+  NS+   RT+K+ +N+FADLTN+E+R+MY
Sbjct:    41 WMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFRSMY 100

Query:   113 LGTRS-DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
              G +   A     ++K++  RY   +   LP SVDWR+KGAV P+K+QGSCG CWAFS V
Sbjct:   101 TGFKGVSALSSQSQTKMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAV 160

Query:   172 AAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL 231
             AA+EG  +I  G+LISLSEQ+LVDCD   + GC GGLMD AF+ I   GG+ +E +YPY 
Sbjct:   161 AAIEGATQIKKGKLISLSEQQLVDCDTN-DFGCEGGLMDTAFEHIKATGGLTTESNYPYK 219

Query:   232 GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
             G +  C+  + N K  SI GYEDV   DE +L KAVA QPVSV IE GG  FQ Y SGVF
Sbjct:   220 GEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVF 279

Query:   292 TGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
             TGEC + LDH V A+GYG + NG  YW+++NSWG+ WGE+GY+++Q+++ D   G CG+A
Sbjct:   280 TGECTTYLDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGYMRIQKDVKDKQ-GLCGLA 338

Query:   351 MEASYP 356
             M+ASYP
Sbjct:   339 MKASYP 344


>TAIR|locus:2038515 [details] [associations]
            symbol:AT1G06260 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0048046 "apoplast"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0048046 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC025290
            MEROPS:I29.003 HSSP:O65039 HOGENOM:HOG000230773 OMA:METAFEF
            IPI:IPI00525965 PIR:D86198 RefSeq:NP_563764.1 UniGene:At.24617
            ProteinModelPortal:Q9LNC1 SMR:Q9LNC1 PaxDb:Q9LNC1 PRIDE:Q9LNC1
            EnsemblPlants:AT1G06260.1 GeneID:837137 KEGG:ath:AT1G06260
            TAIR:At1g06260 InParanoid:Q9LNC1 PhylomeDB:Q9LNC1
            ProtClustDB:CLSN2916975 Genevestigator:Q9LNC1 Uniprot:Q9LNC1
        Length = 343

 Score = 833 (298.3 bits), Expect = 3.9e-83, P = 3.9e-83
 Identities = 158/308 (51%), Positives = 211/308 (68%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
             ++ WL  H K   G      RF I++ N++ ID  NSL+  +K+  N+FAD+TN E++A 
Sbjct:    43 FEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEFKAH 102

Query:   112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
             +LG  + + R L K     QR  C     +P++VDWR +GAV P+++QG CG CWAFS V
Sbjct:   103 FLGLNTSSLR-LHKK----QRPVCDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAV 157

Query:   172 AAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
             AA+EGINKI TG L+SLSEQ+L+DCD    N GC+GGLM+ AF+FI  NGG+ +E DYPY
Sbjct:   158 AAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKTNGGLATETDYPY 217

Query:   231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
              G E  CD  +   KVV+I GY+ V+  +E SL+ A A QPVSV I+AGG  FQ Y SGV
Sbjct:   218 TGIEGTCDQEKSKNKVVTIQGYQKVAQ-NEASLQIAAAQQPVSVGIDAGGFIFQLYSSGV 276

Query:   291 FTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
             FT  CG+ L+HGV  VGYG E    YW+V+NSWG+ WGE GY++++R + + +TGKCGIA
Sbjct:   277 FTNYCGTNLNHGVTVVGYGVEGDQKYWIVKNSWGTGWGEEGYIRMERGVSE-DTGKCGIA 335

Query:   351 MEASYPVK 358
             M ASYP++
Sbjct:   336 MMASYPLQ 343


>TAIR|locus:2038588 [details] [associations]
            symbol:AT2G27420 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002685
            GenomeReviews:CT485783_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC006232
            MEROPS:I29.003 OMA:EEFRATH HOGENOM:HOG000230773 HSSP:P53634
            ProtClustDB:CLSN2688476 EMBL:AY064033 EMBL:AY096388 IPI:IPI00539752
            PIR:F84672 RefSeq:NP_565649.1 UniGene:At.27094
            ProteinModelPortal:Q9ZQH7 SMR:Q9ZQH7 PRIDE:Q9ZQH7
            EnsemblPlants:AT2G27420.1 GeneID:817287 KEGG:ath:AT2G27420
            TAIR:At2g27420 InParanoid:Q9ZQH7 PhylomeDB:Q9ZQH7
            ArrayExpress:Q9ZQH7 Genevestigator:Q9ZQH7 Uniprot:Q9ZQH7
        Length = 348

 Score = 764 (274.0 bits), Expect = 8.1e-76, P = 8.1e-76
 Identities = 148/314 (47%), Positives = 203/314 (64%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNEEYRA 110
             ++ W+A+  +  +       RF IFK NL F+   N  N+ TYKV +N+F+DLT+EE+RA
Sbjct:    35 HEQWMARFNRVYSDETEKRNRFNIFKKNLEFVQNFNMNNKITYKVDINEFSDLTDEEFRA 94

Query:   111 MYLG-TRSDAKRRLMKSKVASQRYACKAGD--ELPESVDWREKGAVNPVKDQGSCGSCWA 167
              + G    +A  R+            + G+  +  ES+DWR++GAV PVK QG CG CWA
Sbjct:    95 THTGLVVPEAITRISTLSSGKNTVPFRYGNVSDNGESMDWRQEGAVTPVKYQGRCGGCWA 154

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
             FS VAAVEGI KI  GEL+SLSEQ+L+DCDR  N GC GG+M  AF++II+N G+ +E +
Sbjct:   155 FSAVAAVEGITKITKGELVSLSEQQLLDCDRDYNQGCRGGIMSKAFEYIIKNQGITTEDN 214

Query:   228 YPYLGAENKCDPSRR---NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
             YPY  ++  C  S     + +  +I GYE V   +E +L +AV+ QPVSV IE  G AF+
Sbjct:   215 YPYQESQQTCSSSTTLSSSFRAATISGYETVPMNNEEALLQAVSQQPVSVGIEGTGAAFR 274

Query:   285 HYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
             HY  GVF GECG+ L H V  VGYG +E G  YW+V+NSWG  WGENGY++++R++ D  
Sbjct:   275 HYSGGVFNGECGTDLHHAVTIVGYGMSEEGTKYWVVKNSWGETWGENGYMRIKRDV-DAP 333

Query:   344 TGKCGIAMEASYPV 357
              G CG+A+ A YP+
Sbjct:   334 QGMCGLAILAFYPL 347


>TAIR|locus:2082881 [details] [associations]
            symbol:AT3G49340 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002686
            GenomeReviews:BA000014_GR EMBL:AC012329 EMBL:AL132956
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 MEROPS:I29.003 HOGENOM:HOG000230773 HSSP:P07711
            KO:K01376 IPI:IPI00520642 PIR:T45839 RefSeq:NP_566920.1
            UniGene:At.53854 ProteinModelPortal:Q9SG15 SMR:Q9SG15
            EnsemblPlants:AT3G49340.1 GeneID:824096 KEGG:ath:AT3G49340
            TAIR:At3g49340 InParanoid:Q9SG15 OMA:PQNDEEA PhylomeDB:Q9SG15
            ProtClustDB:CLSN2688476 Genevestigator:Q9SG15 Uniprot:Q9SG15
        Length = 341

 Score = 732 (262.7 bits), Expect = 2.0e-72, P = 2.0e-72
 Identities = 144/312 (46%), Positives = 202/312 (64%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN-SLNRTYKVGLNKFADLTNEEYRA 110
             ++ W+++  +  +       RF+IF +NL+F++  N + N+TY + +N+F+DLT+EE++A
Sbjct:    35 HEQWMSRFNRVYSDDSEKTSRFEIFTNNLKFVESINMNTNKTYTLDVNEFSDLTDEEFKA 94

Query:   111 MYLG-TRSDAKRRLMKS---KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
              Y G    +   R+  +   +  S RY    G E  ES+DW ++GAV  VK Q  CG CW
Sbjct:    95 RYTGLVVPEGMTRISTTDSHETVSFRYE-NVG-ETGESMDWIQEGAVTSVKHQQQCGCCW 152

Query:   167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
             AFS VAAVEG+ KI  GEL+SLSEQ+L+DC  + N GC GG+M  AF +I +N G+ +E 
Sbjct:   153 AFSAVAAVEGMTKIANGELVSLSEQQLLDCSTE-NNGCGGGIMWKAFDYIKENQGITTED 211

Query:   227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
             +YPY GA+  C+ +   A  +S  GYE V   DE +L KAV+ QPVSVAIE  G  F HY
Sbjct:   212 NYPYQGAQQTCESNHLAAATIS--GYETVPQNDEEALLKAVSQQPVSVAIEGSGYEFIHY 269

Query:   287 ESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
               G+F GECG+ L H V  VGYG +E G+ YWL++NSWG  WGENGY+++ R++ D+  G
Sbjct:   270 SGGIFNGECGTQLTHAVTIVGYGVSEEGIKYWLLKNSWGESWGENGYMRIMRDV-DSPQG 328

Query:   346 KCGIAMEASYPV 357
              CG+A  A YPV
Sbjct:   329 MCGLASLAYYPV 340


>TAIR|locus:2055440 [details] [associations]
            symbol:AT2G34080 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002685 GenomeReviews:CT485783_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 MEROPS:I29.003 EMBL:AC002341
            HOGENOM:HOG000230773 HSSP:P53634 IPI:IPI00530325 PIR:B84752
            RefSeq:NP_565780.1 UniGene:At.28613 UniGene:At.37859
            ProteinModelPortal:O22961 SMR:O22961 EnsemblPlants:AT2G34080.1
            GeneID:817969 KEGG:ath:AT2G34080 TAIR:At2g34080 InParanoid:O22961
            OMA:SENDYSY PhylomeDB:O22961 ProtClustDB:CLSN2688064
            ArrayExpress:O22961 Genevestigator:O22961 Uniprot:O22961
        Length = 345

 Score = 723 (259.6 bits), Expect = 1.8e-71, P = 1.8e-71
 Identities = 138/312 (44%), Positives = 203/312 (65%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
             ++ W+A+  +          R  +FK NL+FI+  N   N++YK+G+N+FAD TNEE+ A
Sbjct:    39 HEQWMARFSREYRDELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLA 98

Query:   111 MYLGTRSDAK---RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
             ++ G +   +    +++   ++SQ +     D + ES DWR +GAV PVK QG CG CWA
Sbjct:    99 IHTGLKGLTEVSPSKVVAKTISSQTW--NVSDMVVESKDWRAEGAVTPVKYQGQCGCCWA 156

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
             FS VAAVEG+ KI  G L+SLSEQ+L+DCDR+ + GC+GG+M  AF +++QN G+ SE D
Sbjct:   157 FSAVAAVEGVAKIAGGNLVSLSEQQLLDCDREYDRGCDGGIMSDAFNYVVQNRGIASEND 216

Query:   228 YPYLGAENKCDPSRRNAKVVS-IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
             Y Y G++  C   R NA+  + I G++ V   +E +L +AV+ QPVSV+++A G  F HY
Sbjct:   217 YSYQGSDGGC---RSNARPAARISGFQTVPSNNERALLEAVSRQPVSVSMDATGDGFMHY 273

Query:   287 ESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTG 345
               GV+ G CG++ +H V  VGYGT ++G  YWL +NSWG  WGE GY++++R++     G
Sbjct:   274 SGGVYDGPCGTSSNHAVTFVGYGTSQDGTKYWLAKNSWGETWGEKGYIRIRRDVAWPQ-G 332

Query:   346 KCGIAMEASYPV 357
              CG+A  A YPV
Sbjct:   333 MCGVAQYAFYPV 344


>TAIR|locus:2029934 [details] [associations]
            symbol:AT1G29080 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002684 GenomeReviews:CT485782_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AC021043 MEROPS:I29.003 HOGENOM:HOG000230773
            HSSP:P53634 ProtClustDB:CLSN2688064 EMBL:DQ056468 IPI:IPI00521747
            PIR:C86413 RefSeq:NP_564320.1 UniGene:At.51814
            ProteinModelPortal:Q9LP39 SMR:Q9LP39 EnsemblPlants:AT1G29080.1
            GeneID:839783 KEGG:ath:AT1G29080 TAIR:At1g29080 InParanoid:Q9LP39
            OMA:KTWGENG PhylomeDB:Q9LP39 Genevestigator:Q9LP39 Uniprot:Q9LP39
        Length = 346

 Score = 694 (249.4 bits), Expect = 2.1e-68, P = 2.1e-68
 Identities = 135/315 (42%), Positives = 194/315 (61%)

Query:    48 VMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNE 106
             ++  +Q W+ +  +  +     + R Q+  +NL+FI+  N++ N++YK+G+N+F D T E
Sbjct:    35 IVDYHQQWMIQFSRVYDDEFEKQLRLQVLTENLKFIESFNNMGNQSYKLGVNEFTDWTKE 94

Query:   107 EYRAMYLGTRS-DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
             E+ A Y G R  +              +     D L  + DWR +GAV PVK QG CG C
Sbjct:    95 EFLATYTGLRGVNVTSPFEVVNETKPAWNWTVSDVLGTNKDWRNEGAVTPVKSQGECGGC 154

Query:   166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
             WAFS +AAVEG+ KI  G LISLSEQ+L+DC R+ N GC GG    AF +II++ G+ SE
Sbjct:   155 WAFSAIAAVEGLTKIARGNLISLSEQQLLDCTREQNNGCKGGTFVNAFNYIIKHRGISSE 214

Query:   226 QDYPYLGAENKCDPSRRNAK-VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
              +YPY   E  C   R NA+  + I G+E+V   +E +L +AV+ QPV+VAI+A    F 
Sbjct:   215 NEYPYQVKEGPC---RSNARPAILIRGFENVPSNNERALLEAVSRQPVAVAIDASEAGFV 271

Query:   285 HYESGVFTGE-CGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
             HY  GV+    CG++++H V  VGYGT   G+ YWL +NSWG  WGENGY++++R++ + 
Sbjct:   272 HYSGGVYNARNCGTSVNHAVTLVGYGTSPEGMKYWLAKNSWGKTWGENGYIRIRRDV-EW 330

Query:   343 NTGKCGIAMEASYPV 357
               G CG+A  ASYPV
Sbjct:   331 PQGMCGVAQYASYPV 345


>TAIR|locus:2029924 [details] [associations]
            symbol:AT1G29090 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002684 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            HOGENOM:HOG000230773 HSSP:P53634 ProtClustDB:CLSN2688064
            EMBL:BT004146 IPI:IPI00545702 RefSeq:NP_564321.2 UniGene:At.40814
            ProteinModelPortal:Q84W75 SMR:Q84W75 MEROPS:C01.A15
            EnsemblPlants:AT1G29090.1 GeneID:839784 KEGG:ath:AT1G29090
            TAIR:At1g29090 InParanoid:Q84W75 OMA:SIRGHED PhylomeDB:Q84W75
            ArrayExpress:Q84W75 Genevestigator:Q84W75 Uniprot:Q84W75
        Length = 355

 Score = 692 (248.7 bits), Expect = 3.5e-68, P = 3.5e-68
 Identities = 142/315 (45%), Positives = 197/315 (62%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADLTNEEYRA 110
             +Q W+ +  +  +     + RF +FK NL+FI++ N   +RTYK+G+N+FAD T EE+ A
Sbjct:    47 HQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTREEFIA 106

Query:   111 MYLGTRSD---AKRRLMKSKVASQRYACK--AGDELPESVDWREKGAVNPVKDQGSCGSC 165
              + G +          +   + S  +     AG E   + DWR +GAV PVK QG CG C
Sbjct:   107 THTGLKGVNGIPSSEFVDEMIPSWNWNVSDVAGRE---TKDWRYEGAVTPVKYQGQCGCC 163

Query:   166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
             WAFS+VAAVEG+ KIV   L+SLSEQ+L+DCDR+ + GCNGG+M  AF +II+N G+ SE
Sbjct:   164 WAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMSDAFSYIIKNRGIASE 223

Query:   226 QDYPYLGAENKCDPSRRNAKVVS-IDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
               YPY  AE  C   R N K  + I G++ V   +E +L +AV+ QPVSV+I+A G  F 
Sbjct:   224 ASYPYQAAEGTC---RYNGKPSAWIRGFQTVPSNNERALLEAVSKQPVSVSIDADGPGFM 280

Query:   285 HYESGVFTGE-CGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
             HY  GV+    CG+ ++H V  VGYGT   G+ YWL +NSWG  WGENGY++++R++   
Sbjct:   281 HYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAWP 340

Query:   343 NTGKCGIAMEASYPV 357
               G CG+A  A YPV
Sbjct:   341 Q-GMCGVAQYAFYPV 354


>UNIPROTKB|F1NYJ1 [details] [associations]
            symbol:CTSL2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 IPI:IPI00602255
            OMA:DITHHEF EMBL:AADN02067812 Ensembl:ENSGALT00000020588
            ArrayExpress:F1NYJ1 Uniprot:F1NYJ1
        Length = 339

 Score = 669 (240.6 bits), Expect = 9.4e-66, P = 9.4e-66
 Identities = 143/327 (43%), Positives = 201/327 (61%)

Query:    43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLN 98
             R D ++ + +Q W + H K  +    + +R  +++ NL+ I+ HN   SL + +YK+G+N
Sbjct:    21 RVDPDLDSHWQLWKSWHSKDYHEREESWRRV-VWEKNLKMIELHNLDHSLGKHSYKLGMN 79

Query:    99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
             +F D+T EE+R +  G +     R    K    ++   +  E P SVDWREKG V PVKD
Sbjct:    80 QFGDMTAEEFRQLMNGYKHKKSER----KYRGSQFLEPSFLEAPRSVDWREKGYVTPVKD 135

Query:   159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFII 217
             QG CGSCWAFST  A+EG +   TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ++ 
Sbjct:   136 QGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQ 195

Query:   218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAI 276
              NGG+DSE+ YPY   +++    +      +  G+ D+    E +L KAVA   PVSVAI
Sbjct:   196 DNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAI 255

Query:   277 EAGGRAFQHYESGVF-TGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGEN 330
             +AG  +FQ Y+SG++   +C S  LDHGV+ VGYG E    +G  YW+V+NSWG  WG+ 
Sbjct:   256 DAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDK 315

Query:   331 GYVKLQRNLLDTNTGKCGIAMEASYPV 357
             GY+ + ++        CGIA  ASYP+
Sbjct:   316 GYIYMAKD----RKNHCGIATAASYPL 338


>DICTYBASE|DDB_G0283867 [details] [associations]
            symbol:cprC "cysteine proteinase 3" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0283867 GenomeReviews:CM000153_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:AAFI02000057
            KO:K01365 EMBL:X03930 RefSeq:XP_638859.1 ProteinModelPortal:Q23894
            SMR:Q23894 MEROPS:C01.114 EnsemblProtists:DDB0220784 GeneID:8624257
            KEGG:ddi:DDB_G0283867 OMA:NNVEHIN Uniprot:Q23894
        Length = 337

 Score = 664 (238.8 bits), Expect = 3.2e-65, P = 3.2e-65
 Identities = 139/309 (44%), Positives = 191/309 (61%)

Query:    55 WLAKHGKTSNGMGHNE--KRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMY 112
             W+  + K      H E   R++ FK N+ ++   NS      +GLN+ ADL+NEEYR  Y
Sbjct:    37 WMRSNNKAYT---HKEFMPRYEEFKKNMDYVHNWNSKGSKTVLGLNQHADLSNEEYRLNY 93

Query:   113 LGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVA 172
             LGTR+  K      +    R   +   + P +VDWREK AV PVKDQG CGSC++FST  
Sbjct:    94 LGTRAHIKLNGYHKRNLGLRLN-RPQFKQPLNVDWREKDAVTPVKDQGQCGSCYSFSTTG 152

Query:   173 AVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPY- 230
             +VEG+  I TG+L+SLSEQ ++DC     N GCNGGLM  AF++II+N G++SE+ YPY 
Sbjct:   153 SVEGVTAIKTGKLVSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYE 212

Query:   231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
             +   ++C   +  +    I  Y+++   DE  L+ A+   PVSVAI+A   +FQ Y +GV
Sbjct:   213 MKVNDECK-FQEGSVAAKITSYKEIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGV 271

Query:   291 F-TGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCG 348
             +    C S  LDHGV+AVG GT+NG DY++V+NSWG  WG NGY+ + RN  D N   CG
Sbjct:   272 YYEPACSSEDLDHGVLAVGMGTDNGEDYYIVKNSWGPSWGLNGYIHMARNK-DNN---CG 327

Query:   349 IAMEASYPV 357
             I+  ASYP+
Sbjct:   328 ISTMASYPI 336


>ZFIN|ZDB-GENE-030131-106 [details] [associations]
            symbol:ctsl1a "cathepsin L, 1 a" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030131-106 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 HSSP:P43235
            KO:K01365 EMBL:BC066490 IPI:IPI00495935 RefSeq:NP_997749.1
            UniGene:Dr.104499 ProteinModelPortal:Q6NYR5 SMR:Q6NYR5
            MEROPS:C01.074 PRIDE:Q6NYR5 GeneID:321453 KEGG:dre:321453
            CTD:321453 InParanoid:Q6NYR5 NextBio:20807387 ArrayExpress:Q6NYR5
            Bgee:Q6NYR5 Uniprot:Q6NYR5
        Length = 337

 Score = 656 (236.0 bits), Expect = 2.3e-64, P = 2.3e-64
 Identities = 142/326 (43%), Positives = 197/326 (60%)

Query:    45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLN-RTYKVGLNKF 100
             D ++   +  W   H K  +      +R  I++ NL+ I+ HN   S+   TY++G+N F
Sbjct:    22 DQQLNDHWDQWKKWHSKKYHATEEGWRRV-IWEKNLKKIEMHNLEHSMGIHTYRLGMNHF 80

Query:   101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
              D+T+EE+R +  G +    RR   S      +      E+P  +DWREKG V PVKDQG
Sbjct:    81 GDMTHEEFRQVMNGFKHKKDRRFRGSLFMEPNFI-----EVPNKLDWREKGYVTPVKDQG 135

Query:   161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQN 219
              CGSCWAFST  A+EG     TG+L+SLSEQ LVDC R + N GCNGGLMD AFQ++   
Sbjct:   136 ECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDQ 195

Query:   220 GGMDSEQDYPYLGAENK-CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIE 277
              G+DSE+ YPYLG +++ C    +N+   +  G+ D+    E +L KA+A   PVSVAI+
Sbjct:   196 NGLDSEESYPYLGTDDQPCHFDPKNS-AANDTGFVDIPSGKERALMKAIAAVGPVSVAID 254

Query:   278 AGGRAFQHYESGVF-TGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENG 331
             AG  +FQ Y+SG++   EC S  LDHGV+AVGYG E    +G  YW+V+NSW  +WG+ G
Sbjct:   255 AGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVDGKKYWIVKNSWSENWGDKG 314

Query:   332 YVKLQRNLLDTNTGKCGIAMEASYPV 357
             Y+ + ++        CGIA  ASYP+
Sbjct:   315 YIYMAKD----RHNHCGIATAASYPL 336


>ZFIN|ZDB-GENE-040718-61 [details] [associations]
            symbol:ctsl.1 "cathepsin L.1" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-040718-61
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            GeneTree:ENSGT00660000095458 MEROPS:C01.092 EMBL:FP015965
            EMBL:BC075887 IPI:IPI00513499 RefSeq:NP_001002368.1
            UniGene:Dr.85174 SMR:Q6DHT0 Ensembl:ENSDART00000017756
            GeneID:436641 KEGG:dre:436641 CTD:436641 InParanoid:Q6DHT0
            OMA:GGQMENA OrthoDB:EOG41ZFB9 NextBio:20831086 Uniprot:Q6DHT0
        Length = 334

 Score = 655 (235.6 bits), Expect = 2.9e-64, P = 2.9e-64
 Identities = 137/317 (43%), Positives = 193/317 (60%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
             +  W  K GK+         R   +  N + +  HN +     ++Y++G+  FAD++NEE
Sbjct:    26 FHAWKLKFGKSYRSAEEESHRQLTWLTNRKLVLVHNMMADQGLKSYRLGMTYFADMSNEE 85

Query:   108 YRAM-YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
             YR + + G          K++  S  +  +    +P++VDWR+KG V  +KDQ  CGSCW
Sbjct:    86 YRQLVFRGCLGSMNNT--KARGGSTFFRLRKAAVVPDTVDWRDKGYVTDIKDQKQCGSCW 143

Query:   167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
             AFS   ++EG     TG+L+SLSEQ+LVDC     N GC+GGLMD AFQ+I  N G+D+E
Sbjct:   144 AFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGSYGNYGCDGGLMDQAFQYIEANKGLDTE 203

Query:   226 QDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRA 282
               YPY   + +C  +PS   A   S  GY D++  DE +L++AVA   P+SVAI+AG  +
Sbjct:   204 DSYPYEAQDGECRFNPSTVGA---SCTGYVDIASGDESALQEAVATIGPISVAIDAGHSS 260

Query:   283 FQHYESGVFTG-ECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
             FQ Y SGV+   +C S+ LDHGV+AVGYG+ NG DYW+V+NSWG DWG  GY+ + RN  
Sbjct:   261 FQLYSSGVYNEPDCSSSELDHGVLAVGYGSSNGDDYWIVKNSWGLDWGVQGYILMSRN-- 318

Query:   341 DTNTGKCGIAMEASYPV 357
                + +CGIA  ASYP+
Sbjct:   319 --KSNQCGIATAASYPL 333


>UNIPROTKB|Q28944 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9823 "Sus scrofa"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 KO:K01365 OrthoDB:EOG48PMKF MEROPS:C01.032
            CTD:1514 EMBL:D37917 EMBL:AJ315771 PIR:A58195 RefSeq:NP_999057.1
            UniGene:Ssc.54036 ProteinModelPortal:Q28944 SMR:Q28944
            STRING:Q28944 Ensembl:ENSSSCT00000012233 GeneID:396926
            KEGG:ssc:396926 OMA:DASETGK ArrayExpress:Q28944 Uniprot:Q28944
        Length = 334

 Score = 651 (234.2 bits), Expect = 7.6e-64, P = 7.6e-64
 Identities = 145/331 (43%), Positives = 199/331 (60%)

Query:    39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYK 94
             S++ + D  +   +  W A HG+   GM     R  +++ N++ I+ HN         + 
Sbjct:    16 SAAPKLDQNLDADWYKWKATHGRLY-GMNEEGWRRAVWEKNMKMIELHNQEYSQGKHGFS 74

Query:    95 VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
             + +N F D+TNEE+R +  G ++   +   K KV  +        E+P+SVDWREKG V 
Sbjct:    75 MAMNAFGDMTNEEFRQVMNGFQNQKHK---KGKVFHESLVL----EVPKSVDWREKGYVT 127

Query:   155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAF 213
              VK+QG CGSCWAFS   A+EG     TG+L+SLSEQ LVDC R + N GCNGGLMD AF
Sbjct:   128 AVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAF 187

Query:   214 QFIIQNGGMDSEQDYPYLGAE-NKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-P 271
             Q++  NGG+D+E+ YPYLG E N C   +      +  G+ D+ P  E +L KAVA   P
Sbjct:   188 QYVKDNGGLDTEESYPYLGRETNSCT-YKPECSAANDTGFVDI-PQREKALMKAVATVGP 245

Query:   272 VSVAIEAGGRAFQHYESGVFTG-ECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGS 325
             +SVAI+AG  +FQ Y+SG++   +C S  LDHGV+ VGYG E    N   +W+V+NSWG 
Sbjct:   246 ISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGP 305

Query:   326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
             +WG NGYVK+ +   D N   CGI+  ASYP
Sbjct:   306 EWGWNGYVKMAK---DQNN-HCGISTAASYP 332


>UNIPROTKB|Q9GL24 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 CTD:1515 KO:K01365
            OrthoDB:EOG48PMKF EMBL:AJ279008 RefSeq:NP_001239115.1
            UniGene:Cfa.3571 ProteinModelPortal:Q9GL24 SMR:Q9GL24
            MEROPS:C01.032 Ensembl:ENSCAFT00000001770
            Ensembl:ENSCAFT00000023837 GeneID:100684364 KEGG:cfa:100684364
            InParanoid:Q9GL24 OMA:FDQNLDT NextBio:20817211 Uniprot:Q9GL24
        Length = 333

 Score = 648 (233.2 bits), Expect = 1.6e-63, P = 1.6e-63
 Identities = 145/330 (43%), Positives = 202/330 (61%)

Query:    39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNRT-YK 94
             S++ + D  +   +  W A H +   GM     R  +++ N++ I+ HN   S  +  + 
Sbjct:    16 SAAPKFDQSLNAQWYQWKATHRRLY-GMNEEGWRRAVWEKNMKMIELHNREYSQGKHGFT 74

Query:    95 VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
             + +N F D+TNEE+R +  G ++   +   K K+  +        E+P+SVDWREKG V 
Sbjct:    75 MAMNAFGDMTNEEFRQVMNGFQNQKHK---KGKMFQEPLFA----EIPKSVDWREKGYVT 127

Query:   155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAF 213
             PVK+QG CGSCWAFS   A+EG     TG+L+SLSEQ LVDC R + N GCNGGLMD AF
Sbjct:   128 PVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAF 187

Query:   214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PV 272
             +++  NGG+DSE+ YPYLG + +    +      +  G+ D+ P  E +L KAVA   P+
Sbjct:   188 RYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDL-PQREKALMKAVATLGPI 246

Query:   273 SVAIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTENGVD----YWLVRNSWGSD 326
             SVAI+AG ++FQ Y+SG+ F  +C S  LDHGV+ VGYG E G D    +W+V+NSWG +
Sbjct:   247 SVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFE-GTDSNNKFWIVKNSWGPE 305

Query:   327 WGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
             WG NGYVK+ +   D N   CGIA  ASYP
Sbjct:   306 WGWNGYVKMAK---DQNN-HCGIATAASYP 331


>UNIPROTKB|O60911 [details] [associations]
            symbol:CTSL2 "Cathepsin L2" species:9606 "Homo sapiens"
            [GO:0004177 "aminopeptidase activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA] [GO:0005902
            "microvillus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0007283 "spermatogenesis" evidence=IEA] [GO:0009267 "cellular
            response to starvation" evidence=IEA] [GO:0009749 "response to
            glucose stimulus" evidence=IEA] [GO:0009897 "external side of
            plasma membrane" evidence=IEA] [GO:0010259 "multicellular
            organismal aging" evidence=IEA] [GO:0021675 "nerve development"
            evidence=IEA] [GO:0030984 "kininogen binding" evidence=IEA]
            [GO:0032403 "protein complex binding" evidence=IEA] [GO:0034698
            "response to gonadotropin stimulus" evidence=IEA] [GO:0042277
            "peptide binding" evidence=IEA] [GO:0043005 "neuron projection"
            evidence=IEA] [GO:0043204 "perikaryon" evidence=IEA] [GO:0046697
            "decidualization" evidence=IEA] [GO:0048102 "autophagic cell death"
            evidence=IEA] [GO:0051384 "response to glucocorticoid stimulus"
            evidence=IEA] [GO:0060008 "Sertoli cell differentiation"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=TAS] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_118779 Reactome:REACT_6900
            GO:GO:0009897 GO:GO:0019886 GO:GO:0034698 GO:GO:0043204
            GO:GO:0009749 GO:GO:0030141 GO:GO:0051384 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0045177 GO:GO:0043005 GO:GO:0007283
            GO:GO:0004177 GO:GO:0042277 GO:GO:0009267 GO:GO:0021675
            GO:GO:0043202 GO:GO:0005902 GO:GO:0010259 GO:GO:0004197
            GO:GO:0048102 GO:GO:0046697 HOVERGEN:HBG011513 CTD:1515
            OrthoDB:EOG48PMKF OMA:FDQNLDT GO:GO:0060008 EMBL:Y14734
            EMBL:AB001928 EMBL:AF070448 EMBL:AB019534 EMBL:AY358641
            EMBL:AL445670 EMBL:BC023504 EMBL:BC110512 IPI:IPI00000013
            RefSeq:NP_001188504.1 RefSeq:NP_001324.2 UniGene:Hs.610096 PDB:1FH0
            PDB:3H6S PDB:3KFQ PDBsum:1FH0 PDBsum:3H6S PDBsum:3KFQ
            ProteinModelPortal:O60911 SMR:O60911 IntAct:O60911 STRING:O60911
            MEROPS:I29.010 PhosphoSite:O60911 PaxDb:O60911 PeptideAtlas:O60911
            PRIDE:O60911 Ensembl:ENST00000259470 Ensembl:ENST00000538255
            GeneID:1515 KEGG:hsa:1515 UCSC:uc004awt.3 GeneCards:GC09M099794
            HGNC:HGNC:2538 HPA:CAB017112 MIM:603308 neXtProt:NX_O60911
            PharmGKB:PA27036 InParanoid:O60911 KO:K01375 PhylomeDB:O60911
            BRENDA:3.4.22.43 SABIO-RK:O60911 BindingDB:O60911 ChEMBL:CHEMBL3272
            ChiTaRS:CTSL2 EvolutionaryTrace:O60911 GenomeRNAi:1515 NextBio:6277
            Bgee:O60911 CleanEx:HS_CTSL2 Genevestigator:O60911
            GermOnline:ENSG00000136943 Uniprot:O60911
        Length = 334

 Score = 643 (231.4 bits), Expect = 5.4e-63, P = 5.4e-63
 Identities = 149/326 (45%), Positives = 197/326 (60%)

Query:    45 DDEVMTIYQTWLAKHGKTSNGMGHNEK--RFQIFKDNLRFIDEHNSLNRTYKVG----LN 98
             D  + T +  W A H +     G NE+  R  +++ N++ I+ HN      K G    +N
Sbjct:    22 DQNLDTKWYQWKATHRRL---YGANEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMAMN 78

Query:    99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
              F D+TNEE+R M +G   + K R  K KV  +        +LP+SVDWR+KG V PVK+
Sbjct:    79 AFGDMTNEEFRQM-MGCFRNQKFR--KGKVFREPLFL----DLPKSVDWRKKGYVTPVKN 131

Query:   159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFII 217
             Q  CGSCWAFS   A+EG     TG+L+SLSEQ LVDC R + N GCNGG M  AFQ++ 
Sbjct:   132 QKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVK 191

Query:   218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAI 276
             +NGG+DSE+ YPY+  +  C     N+ V +  G+  V+P  E +L KAVA   P+SVA+
Sbjct:   192 ENGGLDSEESYPYVAVDEICKYRPENS-VANDTGFTVVAPGKEKALMKAVATVGPISVAM 250

Query:   277 EAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGEN 330
             +AG  +FQ Y+SG+ F  +C S  LDHGV+ VGYG E    N   YWLV+NSWG +WG N
Sbjct:   251 DAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSN 310

Query:   331 GYVKLQRNLLDTNTGKCGIAMEASYP 356
             GYVK+ +   D N   CGIA  ASYP
Sbjct:   311 GYVKIAK---DKNN-HCGIATAASYP 332


>UNIPROTKB|P25975 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 EMBL:X91755 EMBL:BC102312 EMBL:AB017648
            IPI:IPI00687440 PIR:S15845 RefSeq:NP_776457.1 UniGene:Bt.3987
            ProteinModelPortal:P25975 SMR:P25975 STRING:P25975
            Ensembl:ENSBTAT00000022710 Ensembl:ENSBTAT00000036427 GeneID:281108
            KEGG:bta:281108 CTD:1515 InParanoid:P25975 KO:K01365 OMA:EEFRATH
            OrthoDB:EOG48PMKF BindingDB:P25975 ChEMBL:CHEMBL2113
            NextBio:20805179 ArrayExpress:P25975 Uniprot:P25975
        Length = 334

 Score = 642 (231.1 bits), Expect = 6.9e-63, P = 6.9e-63
 Identities = 144/331 (43%), Positives = 199/331 (60%)

Query:    39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYK 94
             S++ + D  +   +  W A H +   GM   E R  +++ N + ID HN         ++
Sbjct:    16 SAAPKLDPNLDAHWHQWKATHRRLY-GMNEEEWRRAVWEKNKKIIDLHNQEYSEGKHGFR 74

Query:    95 VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
             + +N F D+TNEE+R +  G ++   +   K K+  +        ++P+SVDW +KG V 
Sbjct:    75 MAMNAFGDMTNEEFRQVMNGFQNQKHK---KGKLFHEPLLV----DVPKSVDWTKKGYVT 127

Query:   155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAF 213
             PVK+QG CGSCWAFS   A+EG     TG+L+SLSEQ LVDC R + N GCNGGLMD AF
Sbjct:   128 PVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAF 187

Query:   214 QFIIQNGGMDSEQDYPYLGAE-NKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-P 271
             Q+I  NGG+DSE+ YPYL  + N C+  +      +  G+ D+ P  E +L KAVA   P
Sbjct:   188 QYIKDNGGLDSEESYPYLATDTNSCN-YKPECSAANDTGFVDI-PQREKALMKAVATVGP 245

Query:   272 VSVAIEAGGRAFQHYESGVFTG-ECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGS 325
             +SVAI+AG  +FQ Y+SG++   +C S  LDHGV+ VGYG E    N   +W+V+NSWG 
Sbjct:   246 ISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGP 305

Query:   326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
             +WG NGYVK+ +   D N   CGIA  ASYP
Sbjct:   306 EWGWNGYVKMAK---DQNN-HCGIATAASYP 332


>FB|FBgn0013770 [details] [associations]
            symbol:Cp1 "Cysteine proteinase-1" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS;NAS] [GO:0030163 "protein catabolic process"
            evidence=NAS] [GO:0005764 "lysosome" evidence=NAS] [GO:0048102
            "autophagic cell death" evidence=IEP] [GO:0035071 "salivary gland
            cell autophagic cell death" evidence=IEP] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0045169 "fusome" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:AE013599 GO:GO:0007586 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0035071 GO:GO:0045169 GeneTree:ENSGT00660000095458 KO:K01365
            EMBL:U75652 EMBL:AF012089 EMBL:BT016071 EMBL:D31970
            RefSeq:NP_523735.2 RefSeq:NP_725347.1 RefSeq:NP_725348.1
            UniGene:Dm.7400 ProteinModelPortal:Q95029 SMR:Q95029 IntAct:Q95029
            MINT:MINT-814156 STRING:Q95029 MEROPS:C01.092 PaxDb:Q95029
            EnsemblMetazoa:FBtr0087593 GeneID:36546 KEGG:dme:Dmel_CG6692
            CTD:36546 FlyBase:FBgn0013770 InParanoid:Q95029 OMA:ICHGADP
            OrthoDB:EOG46M91C PhylomeDB:Q95029 GenomeRNAi:36546 NextBio:799136
            Bgee:Q95029 GermOnline:CG6692 Uniprot:Q95029
        Length = 371

 Score = 640 (230.4 bits), Expect = 1.1e-62, P = 1.1e-62
 Identities = 142/326 (43%), Positives = 200/326 (61%)

Query:    46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQ--IFKDNLRFIDEHNSL----NRTYKVGLNK 99
             D VM  + T+  +H K  N     E+RF+  IF +N   I +HN        ++K+ +NK
Sbjct:    53 DVVMEEWHTFKLEHRK--NYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNK 110

Query:   100 FADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ--RYACKAGDELPESVDWREKGAVNPVK 157
             +ADL + E+R +  G      ++L  +  + +   +   A   LP+SVDWR KGAV  VK
Sbjct:   111 YADLLHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVK 170

Query:   158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFI 216
             DQG CGSCWAFS+  A+EG +   +G L+SLSEQ LVDC  K  N GCNGGLMD AF++I
Sbjct:   171 DQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI 230

Query:   217 IQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVADQ-PVSV 274
               NGG+D+E+ YPY   ++ C  ++    V + D G+ D+   DE  + +AVA   PVSV
Sbjct:   231 KDNGGIDTEKSYPYEAIDDSCHFNK--GTVGATDRGFTDIPQGDEKKMAEAVATVGPVSV 288

Query:   275 AIEAGGRAFQHYESGVFTG-ECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENG 331
             AI+A   +FQ Y  GV+   +C +  LDHGV+ VG+GT E+G DYWLV+NSWG+ WG+ G
Sbjct:   289 AIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKG 348

Query:   332 YVKLQRNLLDTNTGKCGIAMEASYPV 357
             ++K+ RN       +CGIA  +SYP+
Sbjct:   349 FIKMLRN----KENQCGIASASSYPL 370


>ZFIN|ZDB-GENE-030131-572 [details] [associations]
            symbol:wu:fb37b09 "wu:fb37b09" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030131-572 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:BX465840 IPI:IPI00866294 RefSeq:XP_001923796.1
            UniGene:Dr.25683 PRIDE:E9QBE2 Ensembl:ENSDART00000133962
            GeneID:321853 KEGG:dre:321853 NextBio:20807556 Uniprot:E9QBE2
        Length = 335

 Score = 640 (230.4 bits), Expect = 1.1e-62, P = 1.1e-62
 Identities = 142/320 (44%), Positives = 197/320 (61%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SL-NRTYKVGLNKFADLTNEE 107
             + +W ++HGK+ +      +R  I+++NLR I++HN   SL N T+K+G+N+F D+TNEE
Sbjct:    28 WNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSLGNHTFKMGMNQFGDMTNEE 86

Query:   108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
             +R    G + D  R          ++        P+ VDWR++G V PVKDQ  CGSCW+
Sbjct:    87 FRQAMNGYKHDPNRTSQGPLFMEPKFFAA-----PQQVDWRQRGYVTPVKDQKQCGSCWS 141

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
             FS+  A+EG     TG+LIS+SEQ LVDC R   N GCNGGLMD AFQ++ +N G+DSEQ
Sbjct:   142 FSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHGNQGCNGGLMDQAFQYVKENKGLDSEQ 201

Query:   227 DYPYLGAEN-KC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRA 282
              YPYL  ++  C  DP R N  V  I G+ D+   +E++L  AVA   PVSVAI+A  ++
Sbjct:   202 SYPYLARDDLPCRYDP-RFN--VAKITGFVDIPKGNELALMNAVAAVGPVSVAIDASHQS 258

Query:   283 FQHYESGVFTGE-CGSALDHGVVAVGYGTEN----GVDYWLVRNSWGSDWGENGYVKLQR 337
              Q Y+SG++    C S LDH V+ VGYG +     G  YW+V+NSW   WG+ GY+ + +
Sbjct:   259 LQFYQSGIYYERACTSQLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAK 318

Query:   338 NLLDTNTGKCGIAMEASYPV 357
                D N   CGIA  ASYP+
Sbjct:   319 ---DKNN-HCGIATMASYPL 334


>UNIPROTKB|F1S4J6 [details] [associations]
            symbol:Ssc.54235 "Cathepsin L1" species:9823 "Sus scrofa"
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197
            GO:GO:0002250 GeneTree:ENSGT00560000076577 GO:GO:0097067
            EMBL:CU571031 RefSeq:XP_003130681.1 Ensembl:ENSSSCT00000011983
            GeneID:100515919 KEGG:ssc:100515919 OMA:IAICATK Uniprot:F1S4J6
        Length = 332

 Score = 639 (230.0 bits), Expect = 1.4e-62, P = 1.4e-62
 Identities = 147/331 (44%), Positives = 194/331 (58%)

Query:    39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYK 94
             S++ R D  +   +  W A H K   G+    +R  I++ N++ I+ HN  +R    ++ 
Sbjct:    16 SAAPRHDHSLDADWYKWKATHRKLY-GLNEEGRRRAIWEKNMKMIERHNWEHRQGKHSFT 74

Query:    95 VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL-PESVDWREKGAV 153
             + +N F D+TNEE+R    G ++   +   K KV        AG  L P SVDWREKG V
Sbjct:    75 MAMNAFGDMTNEEFRKTMNGFQNQKHK---KGKVFLD-----AGSALTPHSVDWREKGYV 126

Query:   154 NPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYA 212
               VK+QG CGSCWAFS   A+EG     T +LISLSEQ LVDC   + N GCNGGLMD A
Sbjct:   127 TAVKNQGHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSWPEGNEGCNGGLMDNA 186

Query:   213 FQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-P 271
             FQ+I  NGG+DSE+ YPY G +  C   +  +   +  GY D+ P  E +L KAVA   P
Sbjct:   187 FQYIKDNGGLDSEESYPYFGKDGSCK-YKPQSSAANDTGYVDI-PKQEKALMKAVATVGP 244

Query:   272 VSVAIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTENGVD---YWLVRNSWGSD 326
             +SV I+A   +FQ Y +G+ F  +C S  LDHGV+ VGYG E       YWLV+NSWG+ 
Sbjct:   245 ISVGIDASHESFQFYSTGIYFEPQCSSEDLDHGVLVVGYGVEGAHSNNKYWLVKNSWGNT 304

Query:   327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
             WG +GY+K+ +   D N   CGIA  ASYPV
Sbjct:   305 WGMDGYIKMTK---DQNN-HCGIATMASYPV 331


>ZFIN|ZDB-GENE-071004-74 [details] [associations]
            symbol:zgc:174855 "zgc:174855" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-071004-74
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 MEROPS:C01.032 EMBL:BX000534 EMBL:BC152282
            IPI:IPI00773140 RefSeq:NP_001096592.1 UniGene:Dr.104905 SMR:A7MCR6
            STRING:A7MCR6 Ensembl:ENSDART00000109968 GeneID:569326
            KEGG:dre:569326 NextBio:20889622 Uniprot:A7MCR6
        Length = 335

 Score = 638 (229.6 bits), Expect = 1.8e-62, P = 1.8e-62
 Identities = 144/320 (45%), Positives = 201/320 (62%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SL-NRTYKVGLNKFADLTNEE 107
             + +W ++HGK+ +      +R  I+++NLR I++HN   SL N T+K+G+N+F D+TNEE
Sbjct:    28 WNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSLGNHTFKMGMNQFGDMTNEE 86

Query:   108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
             +R    G + D  R    SK A   +   +    P+ VDWR++G V PVKDQ  CGSCW+
Sbjct:    87 FRQAMNGYKQDPNRT---SKGAL--FMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWS 141

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
             FS+  A+EG     TG+LIS+SEQ LVDC R + N GCNGG+MD AFQ++ +N G+DSEQ
Sbjct:   142 FSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQAFQYVKENKGLDSEQ 201

Query:   227 DYPYLGAEN-KC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRA 282
              YPYL  ++  C  DP R N  V  I G+ D+   +E++L  AVA   PVSVAI+A  ++
Sbjct:   202 SYPYLARDDLPCRYDP-RFN--VAKITGFVDIPRGNELALMNAVAAVGPVSVAIDASHQS 258

Query:   283 FQHYESGVFTGE-CGSALDHGVVAVGYGTEN----GVDYWLVRNSWGSDWGENGYVKLQR 337
              Q Y+SG++    C S LDH V+ VGYG +     G  YW+V+NSW   WG+ GY+ + +
Sbjct:   259 LQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAK 318

Query:   338 NLLDTNTGKCGIAMEASYPV 357
                D N   CGIA  ASYP+
Sbjct:   319 ---DKNN-HCGIATMASYPL 334


>UNIPROTKB|P07711 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9606 "Homo sapiens"
            [GO:0005576 "extracellular region" evidence=NAS] [GO:0005764
            "lysosome" evidence=IDA;NAS] [GO:0006508 "proteolysis"
            evidence=IDA] [GO:0002250 "adaptive immune response" evidence=IEP]
            [GO:0019882 "antigen processing and presentation" evidence=TAS]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0036021 "endolysosome lumen" evidence=TAS] [GO:0043202
            "lysosomal lumen" evidence=TAS] [GO:0045087 "innate immune
            response" evidence=TAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042393 "histone binding" evidence=IDA] [GO:0005634 "nucleus"
            evidence=TAS] [GO:0071888 "macrophage apoptotic process"
            evidence=NAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            Reactome:REACT_118779 EMBL:X12451 GO:GO:0005634 Reactome:REACT_6900
            GO:GO:0005576 GO:GO:0019886 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087
            GO:GO:0042393 GO:GO:0004197 GO:GO:0002250 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0036021 KO:K01365 OrthoDB:EOG48PMKF EMBL:M20496
            EMBL:CR457053 EMBL:BX537395 EMBL:AL160279 EMBL:BC012612 EMBL:X05256
            IPI:IPI00012887 PIR:S01002 RefSeq:NP_001244900.1
            RefSeq:NP_001244901.1 RefSeq:NP_001903.1 RefSeq:NP_666023.1
            UniGene:Hs.731507 UniGene:Hs.731952 PDB:1CJL PDB:1CS8 PDB:1ICF
            PDB:1MHW PDB:2NQD PDB:2VHS PDB:2XU1 PDB:2XU3 PDB:2XU4 PDB:2XU5
            PDB:2YJ2 PDB:2YJ8 PDB:2YJ9 PDB:2YJB PDB:2YJC PDB:3BC3 PDB:3H89
            PDB:3H8B PDB:3H8C PDB:3HHA PDB:3HWN PDB:3IV2 PDB:3K24 PDB:3KSE
            PDB:3OF8 PDB:3OF9 PDBsum:1CJL PDBsum:1CS8 PDBsum:1ICF PDBsum:1MHW
            PDBsum:2NQD PDBsum:2VHS PDBsum:2XU1 PDBsum:2XU3 PDBsum:2XU4
            PDBsum:2XU5 PDBsum:2YJ2 PDBsum:2YJ8 PDBsum:2YJ9 PDBsum:2YJB
            PDBsum:2YJC PDBsum:3BC3 PDBsum:3H89 PDBsum:3H8B PDBsum:3H8C
            PDBsum:3HHA PDBsum:3HWN PDBsum:3IV2 PDBsum:3K24 PDBsum:3KSE
            PDBsum:3OF8 PDBsum:3OF9 ProteinModelPortal:P07711 SMR:P07711
            IntAct:P07711 STRING:P07711 MEROPS:I29.001 PhosphoSite:P07711
            DMDM:115741 PaxDb:P07711 PeptideAtlas:P07711 PRIDE:P07711
            DNASU:1514 Ensembl:ENST00000340342 Ensembl:ENST00000343150
            GeneID:1514 KEGG:hsa:1514 UCSC:uc004aph.3 CTD:1514
            GeneCards:GC09P090341 H-InvDB:HIX0058839 H-InvDB:HIX0170314
            HGNC:HGNC:2537 HPA:CAB000459 MIM:116880 neXtProt:NX_P07711
            PharmGKB:PA162382890 InParanoid:P07711 OMA:REPLFAQ PhylomeDB:P07711
            BRENDA:3.4.22.15 BindingDB:P07711 ChEMBL:CHEMBL3837 ChiTaRS:CTSL1
            DrugBank:DB00040 EvolutionaryTrace:P07711 GenomeRNAi:1514
            NextBio:6271 PMAP-CutDB:P07711 ArrayExpress:P07711 Bgee:P07711
            CleanEx:HS_CTSL1 Genevestigator:P07711 GermOnline:ENSG00000135047
            GO:GO:0071888 Uniprot:P07711
        Length = 333

 Score = 637 (229.3 bits), Expect = 2.3e-62, P = 2.3e-62
 Identities = 144/330 (43%), Positives = 197/330 (59%)

Query:    39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYK 94
             S++   D  +   +  W A H +   GM     R  +++ N++ I+ HN   R    ++ 
Sbjct:    16 SATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFT 74

Query:    95 VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
             + +N F D+T+EE+R +  G ++   R+  K KV  +    +A    P SVDWREKG V 
Sbjct:    75 MAMNAFGDMTSEEFRQVMNGFQN---RKPRKGKVFQEPLFYEA----PRSVDWREKGYVT 127

Query:   155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAF 213
             PVK+QG CGSCWAFS   A+EG     TG LISLSEQ LVDC   + N GCNGGLMDYAF
Sbjct:   128 PVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAF 187

Query:   214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PV 272
             Q++  NGG+DSE+ YPY   E  C  + + + V +  G+ D+ P  E +L KAVA   P+
Sbjct:   188 QYVQDNGGLDSEESYPYEATEESCKYNPKYS-VANDTGFVDI-PKQEKALMKAVATVGPI 245

Query:   273 SVAIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYG---TENGVD-YWLVRNSWGSD 326
             SVAI+AG  +F  Y+ G+ F  +C S  +DHGV+ VGYG   TE+  + YWLV+NSWG +
Sbjct:   246 SVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEE 305

Query:   327 WGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
             WG  GYVK+ ++        CGIA  ASYP
Sbjct:   306 WGMGGYVKMAKD----RRNHCGIASAASYP 331


>ZFIN|ZDB-GENE-050626-55 [details] [associations]
            symbol:ctssb.2 "cathepsin S, b.2" species:7955
            "Danio rerio" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-050626-55
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            KO:K01368 EMBL:BC093339 IPI:IPI00507098 RefSeq:NP_001017661.1
            UniGene:Dr.132688 ProteinModelPortal:Q566T8 SMR:Q566T8
            GeneID:337572 KEGG:dre:337572 CTD:337572 InParanoid:Q566T8
            NextBio:20812306 ArrayExpress:Q566T8 Uniprot:Q566T8
        Length = 330

 Score = 635 (228.6 bits), Expect = 3.8e-62, P = 3.8e-62
 Identities = 138/315 (43%), Positives = 189/315 (60%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLN-RTYKVGLNKFADLTNEE 107
             ++ W  KH K  +       R ++++ NL  I  HN   S+   +Y + +N  AD+T EE
Sbjct:    27 WELWKKKHVKLYSCEDEEVGRRELWERNLELIAIHNLEASMGMHSYDLAINHMADMTTEE 86

Query:   108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
                    TR     +   ++  S  +A      +P+++DWR+KG V  VK+QG+CGSCWA
Sbjct:    87 ILQTLAVTRVPPGFKRPTAEYVSSSFAV-----VPDTLDWRDKGYVTSVKNQGACGSCWA 141

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
             FS+V A+EG     TG+L+ LS Q LVDC  K  N GCNGG M  AFQ++I NGG+DSE 
Sbjct:   142 FSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGYMSQAFQYVIDNGGIDSES 201

Query:   227 DYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAF 283
              YPY G +  C  DPS+R A   S   Y+ VS  DE +LK+A+A+  PVSVAI+A    F
Sbjct:   202 SYPYQGTQGSCRYDPSQRAANCTS---YKFVSQGDEQALKEALANIGPVSVAIDATRPQF 258

Query:   284 QHYESGVFTG-ECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
               Y SGV+    C   ++HGV+AVGYGT +G DYWLV+NSWG+ +G+ GY+++ RN    
Sbjct:   259 IFYRSGVYDDPSCTQKVNHGVLAVGYGTLSGQDYWLVKNSWGAGFGDGGYIRIARN---- 314

Query:   343 NTGKCGIAMEASYPV 357
                 CGIA EA YP+
Sbjct:   315 KNNMCGIASEACYPI 329


>UNIPROTKB|Q5E998 [details] [associations]
            symbol:CTSL2 "Cathepsin L2" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            HOVERGEN:HBG011513 UniGene:Bt.3987 MEROPS:C01.032 EMBL:BT021022
            IPI:IPI00711962 ProteinModelPortal:Q5E998 SMR:Q5E998 STRING:Q5E998
            InParanoid:Q5E998 Uniprot:Q5E998
        Length = 334

 Score = 633 (227.9 bits), Expect = 6.2e-62, P = 6.2e-62
 Identities = 143/331 (43%), Positives = 198/331 (59%)

Query:    39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYK 94
             S++ + D  +   +  W A H +   GM   E R  +++ N + ID HN         ++
Sbjct:    16 SAAPKLDPNLDAHWHQWKATHRRLY-GMNEEEWRRAVWEKNKKIIDLHNQEYSEGKHGFR 74

Query:    95 VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
             + +N F D+TNEE+R +  G ++   +   K K+  +        ++P+SVDW +KG V 
Sbjct:    75 MAMNAFGDMTNEEFRQVMNGFQNQKHK---KGKLFHEPLLV----DVPKSVDWTKKGYVT 127

Query:   155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAF 213
             PVK+QG CGSCWAFS   A+EG     TG+L+SLSEQ LVDC R + N GCNGGLMD AF
Sbjct:   128 PVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAF 187

Query:   214 QFIIQNGGMDSEQDYPYLGAE-NKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-P 271
             Q+I  NG +DSE+ YPYL  + N C+  +      +  G+ D+ P  E +L KAVA   P
Sbjct:   188 QYIKDNGCLDSEESYPYLATDTNSCN-YKPECSAANDTGFVDI-PQREKALMKAVATVGP 245

Query:   272 VSVAIEAGGRAFQHYESGVFTG-ECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGS 325
             +SVAI+AG  +FQ Y+SG++   +C S  LDHGV+ VGYG E    N   +W+V+NSWG 
Sbjct:   246 ISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGP 305

Query:   326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
             +WG NGYVK+ +   D N   CGIA  ASYP
Sbjct:   306 EWGWNGYVKMAK---DQNN-HCGIATAASYP 332


>UNIPROTKB|P83654 [details] [associations]
            symbol:P83654 "Ervatamin-C" species:52861 "Tabernaemontana
            divaricata" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0006508 "proteolysis" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005576 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197 PDB:1O0E PDB:2PNS
            PDBsum:1O0E PDBsum:2PNS MEROPS:C01.116 EvolutionaryTrace:P83654
            Uniprot:P83654
        Length = 208

 Score = 633 (227.9 bits), Expect = 6.2e-62, P = 6.2e-62
 Identities = 125/219 (57%), Positives = 155/219 (70%)

Query:   141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
             LPE +DWR+KGAV PVK+QGSCGSCWAFSTV+ VE IN+I TG LISLSEQELVDCD+K 
Sbjct:     1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKK- 59

Query:   201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF-D 259
             N GC GG   +A+Q+II NGG+D++ +YPY   +  C  +   +KVVSIDGY  V PF +
Sbjct:    60 NHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPCQAA---SKVVSIDGYNGV-PFCN 115

Query:   260 EMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLV 319
             E +LK+AVA QP +VAI+A    FQ Y SG+F+G CG+ L+HGV  VGY      +YW+V
Sbjct:   116 EXALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQA----NYWIV 171

Query:   320 RNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
             RNSWG  WGE GY+++ R       G CGIA    YP K
Sbjct:   172 RNSWGRYWGEKGYIRMLRV---GGCGLCGIARLPYYPTK 207


>MGI|MGI:88564 [details] [associations]
            symbol:Ctsl "cathepsin L" species:10090 "Mus musculus"
            [GO:0004177 "aminopeptidase activity" evidence=ISO] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISO] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005730 "nucleolus"
            evidence=NAS] [GO:0005737 "cytoplasm" evidence=ISO] [GO:0005764
            "lysosome" evidence=ISO] [GO:0005773 "vacuole" evidence=ISO]
            [GO:0005902 "microvillus" evidence=ISO] [GO:0006508 "proteolysis"
            evidence=ISO;IDA] [GO:0007154 "cell communication" evidence=ISO]
            [GO:0008152 "metabolic process" evidence=TAS] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISO;TAS] [GO:0009897 "external side of
            plasma membrane" evidence=ISO] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0030141 "secretory granule" evidence=ISO]
            [GO:0030984 "kininogen binding" evidence=ISO] [GO:0032403 "protein
            complex binding" evidence=ISO] [GO:0042277 "peptide binding"
            evidence=ISO] [GO:0042393 "histone binding" evidence=ISO;NAS]
            [GO:0043005 "neuron projection" evidence=ISO] [GO:0043204
            "perikaryon" evidence=ISO] [GO:0045177 "apical part of cell"
            evidence=ISO] [GO:0048863 "stem cell differentiation" evidence=NAS]
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IMP] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:88564 GO:GO:0005730 GO:GO:0009897 GO:GO:0034698
            GO:GO:0043204 GO:GO:0009749 GO:GO:0030141 GO:GO:0048863
            GO:GO:0051384 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045177 GO:GO:0043005
            GO:GO:0007283 GO:GO:0004177 GO:GO:0005764 GO:GO:0042277
            GO:GO:0009267 GO:GO:0021675 GO:GO:0042393 GO:GO:0005902
            GO:GO:0010259 GO:GO:0004197 GO:GO:0048102 GO:GO:0046697
            HOVERGEN:HBG011513 KO:K01365 OMA:EEFRATH OrthoDB:EOG48PMKF
            MEROPS:C01.032 BRENDA:3.4.22.15 ChiTaRS:CTSL1 EMBL:X06086
            EMBL:J02583 EMBL:M20495 EMBL:AF121837 EMBL:AF121838 EMBL:AF121839
            EMBL:BC068163 EMBL:X04392 IPI:IPI00128154 PIR:S01177
            RefSeq:NP_034114.1 UniGene:Mm.930 PDB:1MVV PDBsum:1MVV
            ProteinModelPortal:P06797 SMR:P06797 STRING:P06797
            PhosphoSite:P06797 PaxDb:P06797 PRIDE:P06797
            Ensembl:ENSMUST00000021933 GeneID:13039 KEGG:mmu:13039 CTD:13039
            InParanoid:P06797 BioCyc:MetaCyc:MONOMER-14812 BindingDB:P06797
            ChEMBL:CHEMBL5291 NextBio:282928 Bgee:P06797 CleanEx:MM_CTSL
            Genevestigator:P06797 GermOnline:ENSMUSG00000021477 GO:GO:0060008
            Uniprot:P06797
        Length = 334

 Score = 632 (227.5 bits), Expect = 7.9e-62, P = 7.9e-62
 Identities = 142/321 (44%), Positives = 190/321 (59%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
             +  W + H +   G    E R  I++ N+R I  HN         + + +N F D+TNEE
Sbjct:    29 WHQWKSTHRRLY-GTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEE 87

Query:   108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
             +R +  G R    +   K ++  +    K    +P+SVDWREKG V PVK+QG CGSCWA
Sbjct:    88 FRQVVNGYRHQKHK---KGRLFQEPLMLK----IPKSVDWREKGCVTPVKNQGQCGSCWA 140

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
             FS    +EG   + TG+LISLSEQ LVDC   + N GCNGGLMD+AFQ+I +NGG+DSE+
Sbjct:   141 FSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEE 200

Query:   227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQH 285
              YPY   +  C   R    V +  G+ D+ P  E +L KAVA   P+SVA++A   + Q 
Sbjct:   201 SYPYEAKDGSCK-YRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVAMDASHPSLQF 258

Query:   286 YESGVF-TGECGSA-LDHGVVAVGYGTENGVD-----YWLVRNSWGSDWGENGYVKLQRN 338
             Y SG++    C S  LDHGV+ VGYG E G D     YWLV+NSWGS+WG  GY+K+ ++
Sbjct:   259 YSSGIYYEPNCSSKNLDHGVLLVGYGYE-GTDSNKNKYWLVKNSWGSEWGMEGYIKIAKD 317

Query:   339 LLDTNTGKCGIAMEASYPVKN 359
               D +   CG+A  ASYPV N
Sbjct:   318 R-DNH---CGLATAASYPVVN 334


>RGD|2448 [details] [associations]
            symbol:Ctsl1 "cathepsin L1" species:10116 "Rattus norvegicus"
          [GO:0002250 "adaptive immune response" evidence=ISO] [GO:0004177
          "aminopeptidase activity" evidence=IDA] [GO:0004197 "cysteine-type
          endopeptidase activity" evidence=ISO;IDA] [GO:0005576 "extracellular
          region" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IDA]
          [GO:0005764 "lysosome" evidence=ISO;IDA] [GO:0005773 "vacuole"
          evidence=IDA] [GO:0005902 "microvillus" evidence=IDA] [GO:0006508
          "proteolysis" evidence=IEP;ISO] [GO:0007154 "cell communication"
          evidence=IDA] [GO:0007283 "spermatogenesis" evidence=IEP] [GO:0008234
          "cysteine-type peptidase activity" evidence=ISO] [GO:0008584 "male
          gonad development" evidence=IEP] [GO:0009267 "cellular response to
          starvation" evidence=IEP] [GO:0009749 "response to glucose stimulus"
          evidence=IEP] [GO:0009897 "external side of plasma membrane"
          evidence=IDA] [GO:0010259 "multicellular organismal aging"
          evidence=IEP] [GO:0014070 "response to organic cyclic compound"
          evidence=IEP] [GO:0021675 "nerve development" evidence=IEP]
          [GO:0030984 "kininogen binding" evidence=IPI] [GO:0032403 "protein
          complex binding" evidence=IPI] [GO:0034698 "response to gonadotropin
          stimulus" evidence=IEP] [GO:0042277 "peptide binding" evidence=IDA]
          [GO:0042393 "histone binding" evidence=ISO] [GO:0043005 "neuron
          projection" evidence=IDA] [GO:0043204 "perikaryon" evidence=IDA]
          [GO:0046697 "decidualization" evidence=IEP] [GO:0048102 "autophagic
          cell death" evidence=IEP] [GO:0051384 "response to glucocorticoid
          stimulus" evidence=IEP] [GO:0060008 "Sertoli cell differentiation"
          evidence=IEP] [GO:0097067 "cellular response to thyroid hormone
          stimulus" evidence=ISO] [GO:0030141 "secretory granule" evidence=IDA]
          [GO:0045177 "apical part of cell" evidence=IDA] [GO:0060441
          "epithelial tube branching involved in lung morphogenesis"
          evidence=ISO] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
          PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:Y00697 RGD:2448
          GO:GO:0005576 GO:GO:0009897 GO:GO:0034698 GO:GO:0043204 GO:GO:0009749
          GO:GO:0051384 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
          InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
          PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
          PROSITE:PS00139 PROSITE:PS00639 GO:GO:0043005 GO:GO:0007283
          GO:GO:0004177 GO:GO:0005764 GO:GO:0042277 GO:GO:0009267 GO:GO:0021675
          GO:GO:0005902 GO:GO:0010259 GO:GO:0004197 GO:GO:0048102 GO:GO:0046697
          GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 KO:K01365
          OrthoDB:EOG48PMKF MEROPS:C01.032 OMA:FDQNLDT CTD:1514
          BRENDA:3.4.22.15 GO:GO:0060008 EMBL:AF025476 EMBL:BC063175
          EMBL:S85184 IPI:IPI00326070 PIR:S07098 RefSeq:NP_037288.1
          UniGene:Rn.1294 ProteinModelPortal:P07154 SMR:P07154 IntAct:P07154
          STRING:P07154 PhosphoSite:P07154 PRIDE:P07154
          Ensembl:ENSRNOT00000025462 GeneID:25697 KEGG:rno:25697 UCSC:RGD:2448
          InParanoid:P07154 SABIO-RK:P07154 BindingDB:P07154 ChEMBL:CHEMBL2305
          NextBio:607715 Genevestigator:P07154 GermOnline:ENSRNOG00000018566
          Uniprot:P07154
        Length = 334

 Score = 630 (226.8 bits), Expect = 1.3e-61, P = 1.3e-61
 Identities = 142/322 (44%), Positives = 190/322 (59%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVG----LNKFADLTNEE 107
             +  W + H +   G    E R  +++ N+R I  HN      K G    +N F D+TNEE
Sbjct:    29 WHQWKSTHRRLY-GTNEEEWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTNEE 87

Query:   108 YRAMYLGTRSDA--KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
             +R +  G R     K RL +  +  Q         +P++VDWREKG V PVK+QG CGSC
Sbjct:    88 FRQIVNGYRHQKHKKGRLFQEPLMLQ---------IPKTVDWREKGCVTPVKNQGQCGSC 138

Query:   166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDS 224
             WAFS    +EG   + TG+LISLSEQ LVDC   + N GCNGGLMD+AFQ+I +NGG+DS
Sbjct:   139 WAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDS 198

Query:   225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAF 283
             E+ YPY   +  C   R    V +  G+ D+ P  E +L KAVA   P+SVA++A   + 
Sbjct:   199 EESYPYEAKDGSCK-YRAEYAVANDTGFVDI-PQQEKALMKAVATVGPISVAMDASHPSL 256

Query:   284 QHYESGVF-TGECGSA-LDHGVVAVGYG---TENGVD-YWLVRNSWGSDWGENGYVKLQR 337
             Q Y SG++    C S  LDHGV+ VGYG   T++  D YWLV+NSWG +WG +GY+K+ +
Sbjct:   257 QFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAK 316

Query:   338 NLLDTNTGKCGIAMEASYPVKN 359
                D N   CG+A  ASYP+ N
Sbjct:   317 ---DRNN-HCGLATAASYPIVN 334


>WB|WBGene00000776 [details] [associations]
            symbol:cpl-1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0040010 "positive regulation
            of growth rate" evidence=IMP] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] [GO:0040011
            "locomotion" evidence=IMP] [GO:0070265 "necrotic cell death"
            evidence=IMP] [GO:0031983 "vesicle lumen" evidence=IDA] [GO:0042718
            "yolk granule" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0009792 GO:GO:0040010 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0040011
            GO:GO:0031983 GO:GO:0070265 GeneTree:ENSGT00660000095458 KO:K01365
            GO:GO:0042718 MEROPS:I29.009 EMBL:Z92812 GeneID:180111
            KEGG:cel:CELE_T03E6.7 CTD:180111 PIR:T24387 RefSeq:NP_001256718.1
            HSSP:P80067 ProteinModelPortal:O45734 SMR:O45734 DIP:DIP-26616N
            IntAct:O45734 MINT:MINT-211563 STRING:O45734 PaxDb:O45734
            EnsemblMetazoa:T03E6.7.1 EnsemblMetazoa:T03E6.7.2 UCSC:T03E6.7.1
            WormBase:T03E6.7a InParanoid:O45734 OMA:HIENHNR NextBio:908128
            Uniprot:O45734
        Length = 337

 Score = 629 (226.5 bits), Expect = 1.6e-61, P = 1.6e-61
 Identities = 139/298 (46%), Positives = 184/298 (61%)

Query:    74 QIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM-KSKV 128
             + F  N+  I+ HN  +R    T+++GLN  ADL   +YR +      +  RRL   S++
Sbjct:    53 EAFVKNMIHIENHNRDHRLGRKTFEMGLNHIADLPFSQYRKL------NGYRRLFGDSRI 106

Query:   129 A-SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
               S  +      ++P+ VDWR+   V  VK+QG CGSCWAFS   A+EG +    G+L+S
Sbjct:   107 KNSSSFLAPFNVQVPDEVDWRDTHLVTDVKNQGMCGSCWAFSATGALEGQHARKLGQLVS 166

Query:   188 LSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKV 246
             LSEQ LVDC  K  N GCNGGLMD AF++I  N G+D+E+ YPY G + KC     N K 
Sbjct:   167 LSEQNLVDCSTKYGNHGCNGGLMDQAFEYIRDNHGVDTEESYPYKGRDMKC---HFNKKT 223

Query:   247 VSID--GYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFTGE-CGSA-LDH 301
             V  D  GY D    DE  LK AVA Q P+S+AI+AG R+FQ Y+ GV+  E C S  LDH
Sbjct:   224 VGADDKGYVDTPEGDEEQLKIAVATQGPISIAIDAGHRSFQLYKKGVYYDEECSSEELDH 283

Query:   302 GVVAVGYGT--ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
             GV+ VGYGT  E+G DYW+V+NSWG+ WGE GY+++ RN        CG+A +ASYP+
Sbjct:   284 GVLLVGYGTDPEHG-DYWIVKNSWGAGWGEKGYIRIARN----RNNHCGVATKASYPL 336


>DICTYBASE|DDB_G0278721 [details] [associations]
            symbol:cprD "cysteine proteinase 4" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0278721 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000024 EMBL:L36204 RefSeq:XP_641963.1
            ProteinModelPortal:P54639 SMR:P54639 MEROPS:C01.A57 PRIDE:P54639
            EnsemblProtists:DDB0214999 GeneID:8621695 KEGG:ddi:DDB_G0278721
            OMA:NAFADIT ProtClustDB:CLSZ2846820 Uniprot:P54639
        Length = 442

 Score = 522 (188.8 bits), Expect = 2.0e-61, Sum P(2) = 2.0e-61
 Identities = 115/268 (42%), Positives = 162/268 (60%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
             +  W+  H +T +    N  R+QIFK N+ ++ + NS      +GLN FAD+TN+EYR  
Sbjct:    30 FTNWMQAHQRTYSSEEFNA-RYQIFKSNMDYVHQWNSKGGETVLGLNVFADITNQEYRTT 88

Query:   112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
             YLGT  D    +   +   + ++  A    P +VDWR +GAV P+K+QG CG CW+FST 
Sbjct:    89 YLGTPFDGSALIGTEE--EKIFSTPA----P-TVDWRAQGAVTPIKNQGQCGGCWSFSTT 141

Query:   172 AAVEGINKIVTG---ELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQD 227
              + EG + I +G   +L+SLSEQ L+DC +   N GC GGLM  AF++II N G+D+E  
Sbjct:   142 GSTEGAHFIASGTKKDLVSLSEQNLIDCSKSYGNNGCEGGLMTLAFEYIINNKGIDTESS 201

Query:   228 YPYL---GAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQ 284
             YPY    G E K   S   A++VS   Y++V+   E SL+ A  + PVSVAI+A   +FQ
Sbjct:   202 YPYTAEDGKECKFKTSNIGAQIVS---YQNVTSGSEASLQSASNNAPVSVAIDASNESFQ 258

Query:   285 HYESGVF-TGECG-SALDHGVVAVGYGT 310
              YESG++    C  + LDHGV+ VGYG+
Sbjct:   259 LYESGIYYEPACSPTQLDHGVLVVGYGS 286

 Score = 124 (48.7 bits), Expect = 2.0e-61, Sum P(2) = 2.0e-61
 Identities = 29/73 (39%), Positives = 40/73 (54%)

Query:   288 SGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
             SG  +G  GS    G V    G     +YW+V+NSWG+ WG +GY+ + +   D N   C
Sbjct:   379 SGSGSGS-GSGSGSGAVEASSG-----NYWIVKNSWGTSWGMDGYIFMSK---DRNNN-C 428

Query:   348 GIAMEASYPVKNS 360
             GIA  AS+P  +S
Sbjct:   429 GIATMASFPTASS 441


>MGI|MGI:107341 [details] [associations]
            symbol:Ctss "cathepsin S" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=ISO;IDA] [GO:0006508 "proteolysis"
            evidence=IDA] [GO:0008152 "metabolic process" evidence=ISO]
            [GO:0008233 "peptidase activity" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0009986 "cell
            surface" evidence=ISO] [GO:0016020 "membrane" evidence=IDA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0045453 "bone
            resorption" evidence=ISO] [GO:0051930 "regulation of sensory
            perception of pain" evidence=ISO] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:107341 GO:GO:0016020 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0008233 GO:GO:0031905 Reactome:REACT_102124
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 BRENDA:3.4.22.27
            ChiTaRS:CTSS EMBL:AF051732 EMBL:AF051727 EMBL:AF051728
            EMBL:AF051729 EMBL:AF051726 EMBL:AF051730 EMBL:AF051731
            EMBL:AF038546 EMBL:AJ002386 EMBL:AC092203 EMBL:Y18466 EMBL:AJ223208
            IPI:IPI00309520 UniGene:Mm.3619 PDB:1M0H PDBsum:1M0H
            ProteinModelPortal:O70370 SMR:O70370 STRING:O70370
            PhosphoSite:O70370 PaxDb:O70370 PRIDE:O70370
            Ensembl:ENSMUST00000116304 BindingDB:O70370 ChEMBL:CHEMBL4098
            NextBio:282932 Bgee:O70370 CleanEx:MM_CTSS Genevestigator:O70370
            GermOnline:ENSMUSG00000038642 Uniprot:O70370
        Length = 340

 Score = 625 (225.1 bits), Expect = 4.3e-61, P = 4.3e-61
 Identities = 141/315 (44%), Positives = 190/315 (60%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLN-RTYKVGLNKFADLTNEE 107
             +  W   H K        E R  I++ NL+FI  HN   S+   TY+VG+N   D+TNEE
Sbjct:    36 WDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEE 95

Query:   108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
                     R    R+  K+ V  + Y+ +    LP++VDWREKG V  VK QGSCG+CWA
Sbjct:    96 ILCRMGALR--IPRQSPKT-VTFRSYSNRT---LPDTVDWREKGCVTEVKYQGSCGACWA 149

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDC--DRKI-NAGCNGGLMDYAFQFIIQNGGMDS 224
             FS V A+EG  K+ TG+LISLS Q LVDC  + K  N GC GG M  AFQ+II NGG+++
Sbjct:   150 FSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEA 209

Query:   225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF-DEMSLKKAVADQ-PVSVAIEAGGRA 282
             +  YPY   + KC  + +N +  +   Y  + PF DE +LK+AVA + PVSV I+A   +
Sbjct:   210 DASYPYKATDEKCHYNSKN-RAATCSRYIQL-PFGDEDALKEAVATKGPVSVGIDASHSS 267

Query:   283 FQHYESGVFTG-ECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
             F  Y+SGV+    C   ++HGV+ VGYGT +G DYWLV+NSWG ++G+ GY+++ RN   
Sbjct:   268 FFFYKSGVYDDPSCTGNVNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARN--- 324

Query:   342 TNTGKCGIAMEASYP 356
              N   CGIA   SYP
Sbjct:   325 -NKNHCGIASYCSYP 338


>RGD|1560071 [details] [associations]
            symbol:Ctsll3 "cathepsin L-like 3" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1560071 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:CH474032 IPI:IPI00560469 RefSeq:XP_001065834.2
            RefSeq:XP_573976.3 UniGene:Rn.104851 MEROPS:C01.107
            Ensembl:ENSRNOT00000061398 GeneID:498691 KEGG:rno:498691
            UCSC:RGD:1560071 CTD:70202 OMA:NCGIASD OrthoDB:EOG4HDSTZ
            NextBio:700548 Uniprot:D3ZJV2
        Length = 330

 Score = 623 (224.4 bits), Expect = 7.1e-61, P = 7.1e-61
 Identities = 139/324 (42%), Positives = 197/324 (60%)

Query:    45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS--LN--RTYKVGLNKF 100
             D    T+++ W  KHGKT N     +KR  ++++N++ I+ HN   L     + + +N F
Sbjct:    22 DPSFDTVWEEWKTKHGKTYNTNEEGQKR-AVWENNMKMINLHNEDYLKGKHGFSLEMNAF 80

Query:   101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
              DLTN E+R +  G +   K ++MK  V  + +    GD +P++VDWR+ G V PVK+QG
Sbjct:    81 GDLTNTEFRELMTGFQGQ-KTKMMK--VFPEPFL---GD-VPKTVDWRKHGYVTPVKNQG 133

Query:   161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQN 219
              CGSCWAFS V ++EG     TG+L+ LSEQ LVDC     N GC+GGL D+AFQ++  N
Sbjct:   134 PCGSCWAFSAVGSLEGQVFRKTGKLVPLSEQNLVDCSWSHGNKGCDGGLPDFAFQYVKDN 193

Query:   220 GGMDSEQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAI 276
             GG+D+   YPY      C  +P    AKVV   G+  + P  E +L KAVA   P+SV I
Sbjct:   194 GGLDTSVSYPYEALNGTCRYNPKYSAAKVV---GFMSIPP-SENALMKAVATVGPISVGI 249

Query:   277 EAGGRAFQHYESGVF-TGECGSA-LDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYV 333
             +   ++FQ Y+ G++   +C S  L+H V+ VGYG E+ G  YWLV+NSWG DWG +GY+
Sbjct:   250 DIKHKSFQFYKGGMYYEPDCSSTNLNHAVLVVGYGEESDGRKYWLVKNSWGRDWGMDGYI 309

Query:   334 KLQRNLLDTNTGKCGIAMEASYPV 357
             K+ +   D N   CGIA +ASYP+
Sbjct:   310 KMAK---DWNNN-CGIASDASYPI 329


>ZFIN|ZDB-GENE-980526-285 [details] [associations]
            symbol:ctsl1b "cathepsin L, 1 b" species:7955
            "Danio rerio" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005576
            "extracellular region" evidence=IDA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-980526-285 GO:GO:0005576 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:BX465840 IPI:IPI00498443 Ensembl:ENSDART00000145570
            Bgee:F1R7B3 Uniprot:F1R7B3
        Length = 352

 Score = 623 (224.4 bits), Expect = 7.1e-61, P = 7.1e-61
 Identities = 139/321 (43%), Positives = 197/321 (61%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SL-NRTYKVGLNKFADLTNEE 107
             + +W ++HGK+ +      +R  I+++NLR I++HN   S  N T+K+G+N+F D+TNEE
Sbjct:    44 WNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSYGNHTFKMGMNQFGDMTNEE 102

Query:   108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
             +R    G   D  +           +        P+ VDWR++G V PVKDQ  CGSCW+
Sbjct:   103 FRQAMNGYTHDPNQTSQGPLFMEPSFFAA-----PQQVDWRQRGYVTPVKDQKQCGSCWS 157

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
             FS+  A+EG     TG+LIS+SEQ LVDC R + N GCNGGLMD AFQ++ +N G+DSEQ
Sbjct:   158 FSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGLMDQAFQYVKENKGLDSEQ 217

Query:   227 DYPYLGAEN-KC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRA 282
              YPYL  ++  C  DP R N  V  I G+ D+   +E++L  AVA   PVSVAI+A  ++
Sbjct:   218 SYPYLARDDLPCRYDP-RFN--VAKITGFVDIPSGNELALMNAVAAVGPVSVAIDASHQS 274

Query:   283 FQHYESGVFTGE-CGSA-LDHGVVAVGYGTEN----GVDYWLVRNSWGSDWGENGYVKLQ 336
              Q Y+SG++    C S+ LDH V+ VGYG +     G  YW+V+NSW   WG+ GY+ + 
Sbjct:   275 LQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMA 334

Query:   337 RNLLDTNTGKCGIAMEASYPV 357
             +   D N   CG+A +ASYP+
Sbjct:   335 K---DKNN-HCGVATKASYPL 351


>ZFIN|ZDB-GENE-080215-7 [details] [associations]
            symbol:zgc:174153 "zgc:174153" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-080215-7
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 EMBL:BX000534 EMBL:BX322603
            IPI:IPI00483644 Ensembl:ENSDART00000113654 OMA:ITLCISA Bgee:F1R8Y0
            Uniprot:F1R8Y0
        Length = 336

 Score = 622 (224.0 bits), Expect = 9.0e-61, P = 9.0e-61
 Identities = 139/321 (43%), Positives = 197/321 (61%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SL-NRTYKVGLNKFADLTNEE 107
             + +W ++HGK+ +      +R  I+++NLR I++HN   S  N T+K+G+N+F D+TNEE
Sbjct:    28 WNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSYGNHTFKMGMNQFGDMTNEE 86

Query:   108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
             +R    G + D  +           +        P+ VDWR++G V PVKDQ  CGSCW+
Sbjct:    87 FRQAMNGYKHDPNQTSQGPLFMEPSFFAA-----PQQVDWRQRGYVTPVKDQKQCGSCWS 141

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
             FS+  A+EG     TG+LIS+SEQ LVDC R + N GCNGGLMD AFQ++ +N G+DSEQ
Sbjct:   142 FSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGLMDQAFQYVKENKGLDSEQ 201

Query:   227 DYPYLGAEN-KC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRA 282
              YPYL  ++  C  DP R N  V  I G+ D+   +E +L  AVA   PVSVAI+A  ++
Sbjct:   202 SYPYLARDDLPCRYDP-RFN--VAKITGFVDIPSGNEPALMNAVAAVGPVSVAIDASHQS 258

Query:   283 FQHYESGVFTGE-CGSA-LDHGVVAVGYGTEN----GVDYWLVRNSWGSDWGENGYVKLQ 336
              Q Y+SG++    C S+ LDH V+ VGYG +     G  YW+V+NSW   WG+ GY+ + 
Sbjct:   259 LQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMA 318

Query:   337 RNLLDTNTGKCGIAMEASYPV 357
             +   D N   CG+A +ASYP+
Sbjct:   319 K---DKNN-HCGVATKASYPL 335


>DICTYBASE|DDB_G0279185 [details] [associations]
            symbol:cprF "cysteine proteinase 6" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0279185 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000030 HSSP:P07711 ProtClustDB:CLSZ2846820 EMBL:U72745
            RefSeq:XP_641725.1 ProteinModelPortal:Q94503 SMR:Q94503
            MEROPS:C01.081 PRIDE:Q94503 EnsemblProtists:DDB0215002
            GeneID:8621921 KEGG:ddi:DDB_G0279185 Uniprot:Q94503
        Length = 434

 Score = 520 (188.1 bits), Expect = 1.8e-60, Sum P(2) = 1.8e-60
 Identities = 117/266 (43%), Positives = 153/266 (57%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
             +  W+  H +  +    N  RF IFK N+ +I+E N+      +GLN FAD+TNEEYRA 
Sbjct:    30 FTNWMIAHQRHYSSEEFNG-RFNIFKANMDYINEWNTKGSETVLGLNVFADITNEEYRAT 88

Query:   112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
             YLGT  DA    M     S++     G     SVDWR KGAV P+K+QG CG CW+FS  
Sbjct:    89 YLGTPFDASSLEM---TPSEKVF---GGVQANSVDWRAKGAVTPIKNQGECGGCWSFSAT 142

Query:   172 AAVEGINKIVTGE--LISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
              A EG   I  G+  L S+SEQ+L+DC     N GC GGLM  AF++II NGG+D+E  Y
Sbjct:   143 GATEGAQYIANGDSDLTSVSEQQLIDCSGSYGNNGCEGGLMTLAFEYIINNGGIDTESSY 202

Query:   229 PYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
             P+     KC  +PS   A++ S   Y +V+   E  L   V   P SVAI+A   +FQ Y
Sbjct:   203 PFTANTEKCKYNPSNIGAELSS---YVNVTSGSESDLAAKVTQGPTSVAIDASQPSFQFY 259

Query:   287 ESGVFTGE-CGSA-LDHGVVAVGYGT 310
              SG++    C S  LDHGV+AVG+G+
Sbjct:   260 SSGIYNEPACSSTQLDHGVLAVGFGS 285

 Score = 117 (46.2 bits), Expect = 1.8e-60, Sum P(2) = 1.8e-60
 Identities = 22/49 (44%), Positives = 31/49 (63%)

Query:   308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
             Y T+   +YW+V+NSWG DWG NGY+ + ++       +CGIA  AS P
Sbjct:   383 YPTDG--NYWIVKNSWGLDWGINGYILMSKD----KDNQCGIATMASIP 425


>DICTYBASE|DDB_G0279799 [details] [associations]
            symbol:cprB "cysteine proteinase 2" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0279799 GenomeReviews:CM000152_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            MEROPS:I29.003 KO:K01365 EMBL:AAFI02000033 EMBL:M16039 EMBL:X03344
            PIR:A25439 RefSeq:XP_641494.1 ProteinModelPortal:P04989 SMR:P04989
            EnsemblProtists:DDB0214998 GeneID:8622234 KEGG:ddi:DDB_G0279799
            OMA:YVNITAG Uniprot:P04989
        Length = 376

 Score = 522 (188.8 bits), Expect = 3.8e-60, Sum P(2) = 3.8e-60
 Identities = 113/282 (40%), Positives = 161/282 (57%)

Query:    44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKV-GLNKFAD 102
             ++ +  T +  W  K  +  +    +  R+ IFK N+ ++D  NS   +  V GLN FAD
Sbjct:    28 SESQYRTAFTEWTLKFNRQYSSSEFSN-RYSIFKSNMDYVDNWNSKGDSQTVLGLNNFAD 86

Query:   103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
             +TNEEYR  YLGTR +A           +    +     P+S+DWR K AV P+KDQG C
Sbjct:    87 ITNEEYRKTYLGTRVNAHS--YNGYDGREVLNVEDLQTNPKSIDWRTKNAVTPIKDQGQC 144

Query:   163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGG 221
             GSCW+FST  + EG + + T +L+SLSEQ LVDC   + N GC+GGLM+ AF +II+N G
Sbjct:   145 GSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAFDYIIKNKG 204

Query:   222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
             +D+E  YPY           ++    +I GY +++   E+SL+      PVSVAI+A   
Sbjct:   205 IDTESSYPYTAETGSTCLFNKSDIGATIKGYVNITAGSEISLENGAQHGPVSVAIDASHN 264

Query:   282 AFQHYESGVF-TGECG-SALDHGVVAVGYGTENGVDYWLVRN 321
             +FQ Y SG++   +C  + LDHGV+ VGYG +   D   V N
Sbjct:   265 SFQLYTSGIYYEPKCSPTELDHGVLVVGYGVQGKDDEGPVLN 306

 Score = 112 (44.5 bits), Expect = 3.8e-60, Sum P(2) = 3.8e-60
 Identities = 18/43 (41%), Positives = 28/43 (65%)

Query:   315 DYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
             +YW+V+NSWG+ WG  GY+ + ++        CGIA  +SYP+
Sbjct:   337 NYWIVKNSWGTSWGIKGYILMSKD----RKNNCGIASVSSYPL 375


>TAIR|locus:2030027 [details] [associations]
            symbol:AT1G29110 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            EMBL:CP002684 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            IPI:IPI00544534 RefSeq:NP_564322.1 UniGene:At.51816
            ProteinModelPortal:F4HZW2 SMR:F4HZW2 EnsemblPlants:AT1G29110.1
            GeneID:839786 KEGG:ath:AT1G29110 OMA:SCRANAR Uniprot:F4HZW2
        Length = 334

 Score = 612 (220.5 bits), Expect = 1.0e-59, P = 1.0e-59
 Identities = 122/317 (38%), Positives = 190/317 (59%)

Query:    45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFADL 103
             +  ++  +Q W+ +  +        E R ++FK NL+FI+  N++ N++Y +G+N+F D 
Sbjct:    31 EQSIVDYHQQWMTQFSRVYKDESEKEMRLKVFKKNLKFIENFNNMGNQSYTLGVNEFTDW 90

Query:   104 TNEEYRAMYLGTRSDAKR--RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
               EE+ A + G R +      L      S+ +     D   ES DWR++GAV PVK QG+
Sbjct:    91 KTEEFLATHTGLRVNVTSLSELFNKTKPSRNWNMSDIDMEDESKDWRDEGAVTPVKYQGA 150

Query:   162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
             C        +  + G N      L++LSEQ+L+DCD + N GCNGG  + AF++II+NGG
Sbjct:   151 C-------RLTKISGKN------LLTLSEQQLIDCDIEKNGGCNGGEFEEAFKYIIKNGG 197

Query:   222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
             +  E +YPY   +  C  + R A    I G++ V   +E +L +AV  QPVSV I+A   
Sbjct:   198 VSLETEYPYQVKKESCRANARRAPHTQIRGFQMVPSHNERALLEAVRRQPVSVLIDARAD 257

Query:   282 AFQHYESGVFTG-ECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
             +F HY+ GV+ G +CG+ ++H V  VGYGT +G++YW+++NSWG  WGENGY++++R++ 
Sbjct:   258 SFGHYKGGVYAGLDCGTDVNHAVTIVGYGTMSGLNYWVLKNSWGESWGENGYMRIRRDV- 316

Query:   341 DTNTGKCGIAMEASYPV 357
             +   G CGIA  A+YPV
Sbjct:   317 EWPQGMCGIAQVAAYPV 333


>ZFIN|ZDB-GENE-041010-76 [details] [associations]
            symbol:ctsll "cathepsin L, like" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-041010-76
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            OrthoDB:EOG48PMKF MEROPS:C01.032 EMBL:BX119902 IPI:IPI00616622
            UniGene:Dr.79994 SMR:A2BEM8 Ensembl:ENSDART00000144226
            InParanoid:A2BEM8 OMA:PRYSAAN Uniprot:A2BEM8
        Length = 337

 Score = 612 (220.5 bits), Expect = 1.0e-59, P = 1.0e-59
 Identities = 139/329 (42%), Positives = 203/329 (61%)

Query:    45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNKF 100
             D ++   +  W   H K+ +      +R  +++ NL+ I+ HN   S+ + T+++G+N+F
Sbjct:    22 DQKLDDHWHLWKRWHEKSYHEKEEGWRRM-VWEKNLKKIELHNLEHSVGKHTFRLGMNQF 80

Query:   101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
              D+TNEE+R    G   D  R   KSK     +   +    P+ +DWR+KG V P+KDQ 
Sbjct:    81 GDMTNEEFRQAMNGYNRDPNR---KSK--GSLFIEPSFFTAPQQIDWRQKGYVTPIKDQK 135

Query:   161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQN 219
              CGSCWAFS+  A+EG     TG+L+SLSEQ L+DC R + N GC+GGLMD AFQ++  N
Sbjct:   136 RCGSCWAFSSTGALEGQVFRKTGKLVSLSEQNLMDCSRPQGNNGCDGGLMDQAFQYVQDN 195

Query:   220 GGMDSEQDYPYLGAENK-C--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVA 275
              G+DSE+ YPYL  +++ C  DP R +A   ++ G+ D+    E +L KAVA   PV+VA
Sbjct:   196 NGLDSEESYPYLATDDQPCHYDP-RYSA--ANVTGFVDIPSGKEHALMKAVAAVGPVAVA 252

Query:   276 IEAGGRAFQHYESGVFTGE-CGSA-LDHGVVAVGYGTENGVD-----YWLVRNSWGSDWG 328
             I+AG  +FQ Y+SG++  + C +  LDHGV+ VGYG E GVD     YW+V+NSW   WG
Sbjct:   253 IDAGHESFQFYQSGIYYEKACSTEELDHGVLVVGYGYE-GVDVAGRRYWIVKNSWTDRWG 311

Query:   329 ENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
             + GY+ + ++L +     CGIA  ASYP+
Sbjct:   312 DKGYIYMAKDLKN----HCGIATSASYPL 336


>UNIPROTKB|Q86GF7 [details] [associations]
            symbol:Cys "Crustapain" species:6703 "Pandalus borealis"
            [GO:0005576 "extracellular region" evidence=IC] [GO:0007586
            "digestion" evidence=NAS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IDA] [GO:0030163 "protein catabolic process"
            evidence=NAS] [GO:0030574 "collagen catabolic process"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005576
            GO:GO:0007586 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0030163 GO:GO:0030574 EMBL:AB091669
            ProteinModelPortal:Q86GF7 SMR:Q86GF7 MEROPS:C01.030 Uniprot:Q86GF7
        Length = 323

 Score = 610 (219.8 bits), Expect = 1.7e-59, P = 1.7e-59
 Identities = 137/315 (43%), Positives = 181/315 (57%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEE 107
             ++ +  K GK          R  +F D L+FI EHN        TY + +N F+DLT+EE
Sbjct:    20 WENFKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHEE 79

Query:   108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
                  L T++   RR     V  +         +   VDWR KGAV PVKDQG CGSCWA
Sbjct:    80 V----LATKTGMTRRRHPLSVLPKSAPTTP---MAADVDWRNKGAVTPVKDQGQCGSCWA 132

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
             FS VAA+EG + + TG+L+SLSEQ LVDC     N GCNGG    A+Q+II N G+D+E 
Sbjct:   133 FSAVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRGIDTES 192

Query:   227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQH 285
              YPY   ++ C     N    ++  Y + +  DE +L+ AV ++ PVSV I+AG  +F  
Sbjct:   193 SYPYKAIDDNCRYDAGNIGA-TVSSYVEPASGDESALQHAVQNEGPVSVCIDAGQSSFGS 251

Query:   286 YESGVF-TGECGS-ALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
             Y  GV+    C S   +H V AVGYGT+ NG DYW+V+NSWG+ WGE+GY+K+ RN  D 
Sbjct:   252 YGGGVYYEPNCDSWYANHAVTAVGYGTDANGGDYWIVKNSWGAWWGESGYIKMARNR-DN 310

Query:   343 NTGKCGIAMEASYPV 357
             N   C IA  + YPV
Sbjct:   311 N---CAIATYSVYPV 322


>UNIPROTKB|A4IFS7 [details] [associations]
            symbol:CTSL1 "CTSL1 protein" species:9913 "Bos taurus"
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197 GO:GO:0002250
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 GO:GO:0097067
            OrthoDB:EOG48PMKF MEROPS:C01.032 CTD:1514 EMBL:DAAA02023987
            EMBL:BC134741 IPI:IPI00708619 RefSeq:NP_001077155.1
            UniGene:Bt.23199 SMR:A4IFS7 Ensembl:ENSBTAT00000000962
            GeneID:515200 KEGG:bta:515200 InParanoid:A4IFS7 OMA:NDEQALM
            NextBio:20871707 Uniprot:A4IFS7
        Length = 333

 Score = 606 (218.4 bits), Expect = 4.5e-59, P = 4.5e-59
 Identities = 138/331 (41%), Positives = 198/331 (59%)

Query:    39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYK 94
             S++ + D  + T ++ W A H K  + +     R  ++K N++ I+ HN        ++ 
Sbjct:    16 SAAPKFDHSLDTQWKLWKAAHRKPYD-LNEEGWRKAVWKKNMKMIELHNQEYSQGKHSFS 74

Query:    95 VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
             + +N F D+TNEE+R     T +  +R+  K+K   + +       +P SVDWREKG V 
Sbjct:    75 MAMNAFGDMTNEEFRH----TMNGFQRQ--KNKKGKEFHETIFAS-IPPSVDWREKGYVT 127

Query:   155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAF 213
             PVK+QG CGSCWAFS   A+EG     TG+L+SLSEQ LVDC + + N GC+GG +D AF
Sbjct:   128 PVKNQGKCGSCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQPEGNRGCHGGFIDNAF 187

Query:   214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PV 272
             Q+++  GG+DSE+ YPY G    C  +  N+   +  G+ D+ P  E +L KAVA+  P+
Sbjct:   188 QYVLDVGGLDSEESYPYTGLVGTCLYNPNNS-AANETGFVDL-PKQEKALMKAVANLGPI 245

Query:   273 SVAIEAGGRAFQHYESGVF-TGECGS-ALDHGVVAVGYGTENGVD-----YWLVRNSWGS 325
             SVA++A   +FQ Y+SG++    C S ++DH V+ VGYG E G D     YWLV+NSWG 
Sbjct:   246 SVAVDAHNPSFQFYKSGIYYEPNCSSESVDHAVLVVGYGFE-GADSDDNKYWLVKNSWGE 304

Query:   326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
              WG NGY+K+ +   D N   CGIA  ASYP
Sbjct:   305 HWGMNGYIKMAK---DRNN-HCGIATMASYP 331


>UNIPROTKB|F1SS93 [details] [associations]
            symbol:CTSS "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0002250 "adaptive immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0016020 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0002250
            GeneTree:ENSGT00560000076577 GO:GO:0097067 EMBL:CU463875
            Ensembl:ENSSSCT00000007284 OMA:CEIESAV Uniprot:F1SS93
        Length = 342

 Score = 604 (217.7 bits), Expect = 7.3e-59, P = 7.3e-59
 Identities = 134/314 (42%), Positives = 184/314 (58%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLN-RTYKVGLNKFADLTNEE 107
             +  W   +GK          R  I++ NL+ +  HN   S+   +Y +G+N   D+T+EE
Sbjct:    39 WDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSYDLGMNHLGDMTSEE 98

Query:   108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
               ++    R  ++       V    Y      +LP+S+DWREKG V  VK QGSCGSCWA
Sbjct:    99 VISLMSCVRVPSQ---WPRNVT---YKSNPNQKLPDSMDWREKGCVTEVKYQGSCGSCWA 152

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDC--DRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
             FS V A+E   K+ TG L+SLS Q LVDC  ++  N GCNGG M  AFQ+II N G+DSE
Sbjct:   153 FSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDSE 212

Query:   226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF-DEMSLKKAVADQ-PVSVAIEAGGRAF 283
               YPY   + KC    +N +  +   Y ++ PF DE +LK+AVA++ PVSVAI+A   +F
Sbjct:   213 ASYPYKAVDGKCKYDSKN-RAATCSRYTEL-PFADEYALKEAVANKGPVSVAIDAKHSSF 270

Query:   284 QHYESGVFTG-ECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
               Y SGV+    C   ++HGV+ VGYG  NG DYWLV+NSWG ++G+ GY+++ RN    
Sbjct:   271 FFYRSGVYYDPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDGGYIRMARN---- 326

Query:   343 NTGKCGIAMEASYP 356
             +   CGIA   SYP
Sbjct:   327 SENHCGIANYPSYP 340


>RGD|1308751 [details] [associations]
            symbol:RGD1308751 "similar to Cathepsin L precursor (Major
            excreted protein) (MEP)" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1308751 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:CH474032 IPI:IPI00365697 RefSeq:XP_001065885.2
            RefSeq:XP_225137.5 MEROPS:C01.069 Ensembl:ENSRNOT00000061391
            GeneID:290981 KEGG:rno:290981 UCSC:RGD:1308751 CTD:290981
            OMA:ESYAYEA OrthoDB:EOG42823G NextBio:631921 Uniprot:D3ZKC3
        Length = 330

 Score = 604 (217.7 bits), Expect = 7.3e-59, P = 7.3e-59
 Identities = 134/323 (41%), Positives = 193/323 (59%)

Query:    45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS--LN--RTYKVGLNKF 100
             D    T+++ W  KHGKT N     +KR  ++++N++ I+ HN   L     + + +N F
Sbjct:    22 DPSFDTVWEEWKTKHGKTYNTNEEGQKR-AVWENNMKMINLHNEDYLKGKHGFSLEMNAF 80

Query:   101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
              DLTN E+R +  G +S   +   ++ +  + +    GD +P+S+DWRE G V PVK+QG
Sbjct:    81 GDLTNTEFRELMTGFQSMGPK---ETTIFREPFL---GD-IPKSLDWREHGYVTPVKNQG 133

Query:   161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQN 219
              CGSCWAFS V ++EG     TG+L+SLSEQ LVDC     N GCNGGLM++AFQ++ +N
Sbjct:   134 QCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDCSWSYGNLGCNGGLMEFAFQYVKEN 193

Query:   220 GGMDSEQDYPYLGAENKCDPSRRNAK--VVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAI 276
              G+D+ + Y Y   +  C   R N K    ++ G+  V P  E  L  AVA   PVSV I
Sbjct:   194 RGLDTGESYAYEAQDGLC---RYNPKYSAANVTGFVKV-PLSEDDLMSAVASVGPVSVGI 249

Query:   277 EAGGRAFQHYESGVF-TGECGSA-LDHGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYV 333
             ++  ++F+ Y  G++   +C S  +DH V+ VGYG E+ G  YWLV+NSWG DWG +GY+
Sbjct:   250 DSHHQSFRFYSGGMYYEPDCSSTEMDHAVLVVGYGEESDGGKYWLVKNSWGEDWGMDGYI 309

Query:   334 KLQRNLLDTNTGKCGIAMEASYP 356
             K+ +   D N   CGIA  A YP
Sbjct:   310 KMAK---DQNNN-CGIATYAIYP 328


>RGD|61810 [details] [associations]
            symbol:Ctsk "cathepsin K" species:10116 "Rattus norvegicus"
           [GO:0001957 "intramembranous ossification" evidence=IEP] [GO:0005615
           "extracellular space" evidence=IDA] [GO:0005737 "cytoplasm"
           evidence=IDA] [GO:0005764 "lysosome" evidence=IDA] [GO:0006508
           "proteolysis" evidence=TAS] [GO:0008234 "cysteine-type peptidase
           activity" evidence=TAS] [GO:0045453 "bone resorption" evidence=IMP]
           InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
           Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
           RGD:61810 GO:GO:0005615 eggNOG:COG4870 HOGENOM:HOG000230774
           GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
           InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
           PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
           GO:GO:0045453 GO:GO:0001957 GeneTree:ENSGT00560000076577
           HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
           OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:AF010306 EMBL:BC078793
           IPI:IPI00206378 RefSeq:NP_113748.1 UniGene:Rn.5598
           ProteinModelPortal:O35186 SMR:O35186 STRING:O35186
           PhosphoSite:O35186 PRIDE:O35186 Ensembl:ENSRNOT00000028730
           GeneID:29175 KEGG:rno:29175 UCSC:RGD:61810 InParanoid:O35186
           OMA:YKEIPEG BindingDB:O35186 ChEMBL:CHEMBL3034 NextBio:608248
           Genevestigator:O35186 GermOnline:ENSRNOG00000021155 Uniprot:O35186
        Length = 329

 Score = 602 (217.0 bits), Expect = 1.2e-58, P = 1.2e-58
 Identities = 132/319 (41%), Positives = 181/319 (56%)

Query:    45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLN-RTYKVGLNKF 100
             ++ + T ++ W   HGK  N       R  I++ NL+ I  HN   SL   TY++ +N  
Sbjct:    19 EETLDTQWELWKKTHGKQYNSKVDEISRRLIWEKNLKKISVHNLEASLGAHTYELAMNHL 78

Query:   101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
              D+T+EE      G R    R      +    Y  +    +P+S+D+R+KG V PVK+QG
Sbjct:    79 GDMTSEEVVQKMTGLRVPPSRSFSNDTL----YTPEWEGRVPDSIDYRKKGYVTPVKNQG 134

Query:   161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
              CGSCWAFS+  A+EG  K  TG+L++LS Q LVDC  + N GC GG M  AFQ++ QNG
Sbjct:   135 QCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVSE-NYGCGGGYMTTAFQYVQQNG 193

Query:   221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAG 279
             G+DSE  YPY+G +  C      AK     GY ++   +E +LK+AVA   PVSV+I+A 
Sbjct:   194 GIDSEDAYPYVGQDESC-MYNATAKAAKCRGYREIPVGNEKALKRAVARVGPVSVSIDAS 252

Query:   280 GRAFQHYESGVFTGE-CG-SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
               +FQ Y  GV+  E C    ++H V+ VGYGT+ G  YW+++NSWG  WG  GYV L R
Sbjct:   253 LTSFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQKGNKYWIIKNSWGESWGNKGYVLLAR 312

Query:   338 NLLDTNTGKCGIAMEASYP 356
             N        CGI   AS+P
Sbjct:   313 N----KNNACGITNLASFP 327


>UNIPROTKB|P43235 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0001957
            "intramembranous ossification" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0045453 "bone resorption"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=TAS] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=TAS] [GO:0036021 "endolysosome lumen" evidence=TAS]
            [GO:0045087 "innate immune response" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            Reactome:REACT_118779 Reactome:REACT_6900 GO:GO:0005615
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087 GO:GO:0045453
            EMBL:CH471121 EMBL:AL355860 GO:GO:0004197 GO:GO:0001957
            HOVERGEN:HBG011513 GO:GO:0036021 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:U13665 EMBL:X82153
            EMBL:U20280 EMBL:S79895 EMBL:CR541675 EMBL:AL356292 EMBL:BC016058
            IPI:IPI00300599 PIR:JC2476 RefSeq:NP_000387.1 UniGene:Hs.632466
            PDB:1ATK PDB:1AU0 PDB:1AU2 PDB:1AU3 PDB:1AU4 PDB:1AYU PDB:1AYV
            PDB:1AYW PDB:1BGO PDB:1BY8 PDB:1MEM PDB:1NL6 PDB:1NLJ PDB:1Q6K
            PDB:1SNK PDB:1TU6 PDB:1U9V PDB:1U9W PDB:1U9X PDB:1VSN PDB:1YK7
            PDB:1YK8 PDB:1YT7 PDB:2ATO PDB:2AUX PDB:2AUZ PDB:2BDL PDB:2R6N
            PDB:3C9E PDB:3H7D PDB:3KW9 PDB:3KWB PDB:3KWZ PDB:3KX1 PDB:3O0U
            PDB:3O1G PDB:3OVZ PDB:4DMX PDB:4DMY PDB:7PCK PDBsum:1ATK
            PDBsum:1AU0 PDBsum:1AU2 PDBsum:1AU3 PDBsum:1AU4 PDBsum:1AYU
            PDBsum:1AYV PDBsum:1AYW PDBsum:1BGO PDBsum:1BY8 PDBsum:1MEM
            PDBsum:1NL6 PDBsum:1NLJ PDBsum:1Q6K PDBsum:1SNK PDBsum:1TU6
            PDBsum:1U9V PDBsum:1U9W PDBsum:1U9X PDBsum:1VSN PDBsum:1YK7
            PDBsum:1YK8 PDBsum:1YT7 PDBsum:2ATO PDBsum:2AUX PDBsum:2AUZ
            PDBsum:2BDL PDBsum:2R6N PDBsum:3C9E PDBsum:3H7D PDBsum:3KW9
            PDBsum:3KWB PDBsum:3KWZ PDBsum:3KX1 PDBsum:3O0U PDBsum:3O1G
            PDBsum:3OVZ PDBsum:4DMX PDBsum:4DMY PDBsum:7PCK
            ProteinModelPortal:P43235 SMR:P43235 DIP:DIP-39993N IntAct:P43235
            STRING:P43235 PhosphoSite:P43235 DMDM:1168793 PaxDb:P43235
            PRIDE:P43235 DNASU:1513 Ensembl:ENST00000271651 GeneID:1513
            KEGG:hsa:1513 UCSC:uc001evp.2 GeneCards:GC01M150768 HGNC:HGNC:2536
            MIM:265800 MIM:601105 neXtProt:NX_P43235 Orphanet:763
            PharmGKB:PA27034 InParanoid:P43235 OMA:LKVPPSH PhylomeDB:P43235
            BindingDB:P43235 ChEMBL:CHEMBL268 EvolutionaryTrace:P43235
            GenomeRNAi:1513 NextBio:6267 ArrayExpress:P43235 Bgee:P43235
            CleanEx:HS_CTSK CleanEx:HS_CTSO Genevestigator:P43235
            GermOnline:ENSG00000143387 Uniprot:P43235
        Length = 329

 Score = 596 (214.9 bits), Expect = 5.1e-58, P = 5.1e-58
 Identities = 136/321 (42%), Positives = 189/321 (58%)

Query:    46 DEVM-TIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLN-RTYKVGLNKF 100
             +E++ T ++ W   H K  N       R  I++ NL++I  HN   SL   TY++ +N  
Sbjct:    19 EEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHL 78

Query:   101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
              D+T+EE      G     K  L  S+     Y  +     P+SVD+R+KG V PVK+QG
Sbjct:    79 GDMTSEEVVQKMTGL----KVPLSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTPVKNQG 134

Query:   161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
              CGSCWAFS+V A+EG  K  TG+L++LS Q LVDC  + N GC GG M  AFQ++ +N 
Sbjct:   135 QCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE-NDGCGGGYMTNAFQYVQKNR 193

Query:   221 GMDSEQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIE 277
             G+DSE  YPY+G E  C  +P+ + AK     GY ++   +E +LK+AVA   PVSVAI+
Sbjct:   194 GIDSEDAYPYVGQEESCMYNPTGKAAKC---RGYREIPEGNEKALKRAVARVGPVSVAID 250

Query:   278 AGGRAFQHYESGVFTGE-CGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
             A   +FQ Y  GV+  E C S  L+H V+AVGYG + G  +W+++NSWG +WG  GY+ +
Sbjct:   251 ASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILM 310

Query:   336 QRNLLDTNTGKCGIAMEASYP 356
              RN        CGIA  AS+P
Sbjct:   311 ARN----KNNACGIANLASFP 327


>UNIPROTKB|Q5E968 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:BT021052
            EMBL:BC109853 IPI:IPI00709374 RefSeq:NP_001029607.1
            UniGene:Bt.23218 ProteinModelPortal:Q5E968 SMR:Q5E968 STRING:Q5E968
            MEROPS:I29.007 PRIDE:Q5E968 Ensembl:ENSBTAT00000028016
            GeneID:513038 KEGG:bta:513038 CTD:1513 InParanoid:Q5E968 KO:K01371
            OrthoDB:EOG4SJ5FC NextBio:20870669 PANTHER:PTHR12411:SF55
            Uniprot:Q5E968
        Length = 329

 Score = 595 (214.5 bits), Expect = 6.6e-58, P = 6.6e-58
 Identities = 134/321 (41%), Positives = 189/321 (58%)

Query:    46 DEVM-TIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLN-RTYKVGLNKF 100
             +E++ T ++ W   + K  N  G    R  I++ NL+ I  HN   SL   TY++ +N  
Sbjct:    19 EEILDTQWELWKKTYRKQYNSKGDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHL 78

Query:   101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
              D+T+EE      G +  A R    S+     Y        P+SVD+R+KG V PVK+QG
Sbjct:    79 GDMTSEEVVQKMTGLKVPASR----SRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQG 134

Query:   161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
              CGSCWAFS+V A+EG  K  TG+L++LS Q LVDC  + N GC GG M  AFQ++ +N 
Sbjct:   135 QCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE-NDGCGGGYMTNAFQYVQKNR 193

Query:   221 GMDSEQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIE 277
             G+DSE  YPY+G +  C  +P+ + AK     GY ++   +E +LK+AVA   P+SVAI+
Sbjct:   194 GIDSEDAYPYVGQDENCMYNPTGKAAKC---RGYREIPEGNEKALKRAVARVGPISVAID 250

Query:   278 AGGRAFQHYESGVFTGE-CGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
             A   +FQ Y  GV+  E C S  L+H V+AVGYG + G  +W+++NSWG +WG  GY+ +
Sbjct:   251 ASLTSFQFYRKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILM 310

Query:   336 QRNLLDTNTGKCGIAMEASYP 356
              RN        CGIA  AS+P
Sbjct:   311 ARN----KNNACGIANLASFP 327


>UNIPROTKB|F1PAK0 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9615 "Canis lupus
            familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:AAEX03011051 Ensembl:ENSCAFT00000019176 OMA:YEPACTQ
            Uniprot:F1PAK0
        Length = 339

 Score = 595 (214.5 bits), Expect = 6.6e-58, P = 6.6e-58
 Identities = 135/315 (42%), Positives = 186/315 (59%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLN-RTYKVGLNKFADLTNEE 107
             +  W   + K          R  I++ NL+F+  HN   S+   +Y +G+N   D+T EE
Sbjct:    36 WNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTGEE 95

Query:   108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
               ++ +G+      R+      +  Y   +  +LP+SVDWREKG V  VK QGSCG+CWA
Sbjct:    96 VISL-MGSL-----RVPSQWQRNVTYRSNSNQKLPDSVDWREKGCVTEVKYQGSCGACWA 149

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDC--DRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
             FS V A+E   K+ TG+L+SLS Q LVDC  ++  N GCNGG M  AFQ+II N G+DSE
Sbjct:   150 FSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGIDSE 209

Query:   226 QDYPYLGAENKCD-PSRRNAKVVSIDGYEDVSPF-DEMSLKKAVADQ-PVSVAIEAGGRA 282
               YPY     KC   S++ A   S   Y ++ PF  E +LK+AVA++ PVSVAI+A   +
Sbjct:   210 ASYPYKAVNGKCRYDSKKRAATCS--KYTEL-PFGSEDALKEAVANKGPVSVAIDASHYS 266

Query:   283 FQHYESGVF-TGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
             F  Y SGV+    C   ++HGV+ VGYG  NG DYWLV+NSWG ++G+ GY+++ RN   
Sbjct:   267 FFLYRSGVYYEPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARN--- 323

Query:   342 TNTGKCGIAMEASYP 356
              +   CGIA   SYP
Sbjct:   324 -SGNHCGIASYPSYP 337


>UNIPROTKB|Q8HY81 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            CTD:1520 KO:K01368 OrthoDB:EOG4JM7Q2 EMBL:AY156692
            RefSeq:NP_001002938.2 UniGene:Cfa.1661 ProteinModelPortal:Q8HY81
            SMR:Q8HY81 STRING:Q8HY81 MEROPS:C01.034 GeneID:403400
            KEGG:cfa:403400 InParanoid:Q8HY81 NextBio:20816922 Uniprot:Q8HY81
        Length = 331

 Score = 594 (214.2 bits), Expect = 8.4e-58, P = 8.4e-58
 Identities = 135/315 (42%), Positives = 186/315 (59%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLN-RTYKVGLNKFADLTNEE 107
             +  W   + K          R  I++ NL+F+  HN   S+   +Y +G+N   D+T EE
Sbjct:    28 WNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTGEE 87

Query:   108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
               ++ +G+      R+      +  Y   +  +LP+SVDWREKG V  VK QGSCG+CWA
Sbjct:    88 VISL-MGSL-----RVPSQWQRNVTYRSNSNQKLPDSVDWREKGCVTEVKYQGSCGACWA 141

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDC--DRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
             FS V A+E   K+ TG+L+SLS Q LVDC  ++  N GCNGG M  AFQ+II N G+DSE
Sbjct:   142 FSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGIDSE 201

Query:   226 QDYPYLGAENKCD-PSRRNAKVVSIDGYEDVSPF-DEMSLKKAVADQ-PVSVAIEAGGRA 282
               YPY     KC   S++ A   S   Y ++ PF  E +LK+AVA++ PVSVAI+A   +
Sbjct:   202 ASYPYKAMNGKCRYDSKKRAATCS--KYTEL-PFGSEDALKEAVANKGPVSVAIDASHYS 258

Query:   283 FQHYESGVF-TGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
             F  Y SGV+    C   ++HGV+ VGYG  NG DYWLV+NSWG ++G+ GY+++ RN   
Sbjct:   259 FFLYRSGVYYEPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARN--- 315

Query:   342 TNTGKCGIAMEASYP 356
              +   CGIA   SYP
Sbjct:   316 -SGNHCGIASYPSYP 329


>TAIR|locus:2175088 [details] [associations]
            symbol:ALP "aleurain-like protease" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0005829 "cytosol"
            evidence=RCA] [GO:0006096 "glycolysis" evidence=RCA] [GO:0006816
            "calcium ion transport" evidence=RCA] [GO:0006833 "water transport"
            evidence=RCA] [GO:0006972 "hyperosmotic response" evidence=RCA]
            [GO:0007030 "Golgi organization" evidence=RCA] [GO:0009266
            "response to temperature stimulus" evidence=RCA] [GO:0009651
            "response to salt stress" evidence=RCA] [GO:0009750 "response to
            fructose stimulus" evidence=RCA] [GO:0042744 "hydrogen peroxide
            catabolic process" evidence=RCA] [GO:0046686 "response to cadmium
            ion" evidence=RCA] [GO:0007568 "aging" evidence=IEP]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002688 GO:GO:0005773
            GO:GO:0007568 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AB011483 KO:K01366
            ProtClustDB:CLSN2689015 UniGene:At.25414 IPI:IPI00846287
            RefSeq:NP_001078774.1 ProteinModelPortal:A8MQZ1 SMR:A8MQZ1
            STRING:A8MQZ1 PRIDE:A8MQZ1 EnsemblPlants:AT5G60360.3 GeneID:836158
            KEGG:ath:AT5G60360 OMA:CGSTPMD Genevestigator:A8MQZ1 Uniprot:A8MQZ1
        Length = 361

 Score = 594 (214.2 bits), Expect = 8.4e-58, P = 8.4e-58
 Identities = 126/285 (44%), Positives = 173/285 (60%)

Query:    58 KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS 117
             ++GK    +   + RF IFK+NL  I   N    +YK+G+N+FADLT +E++   LG   
Sbjct:    65 RYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQ 124

Query:   118 DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGI 177
             +    L  S   ++         LPE+ DWRE G V+PVKDQG CGSCW FST  A+E  
Sbjct:   125 NCSATLKGSHKVTEA-------ALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAA 177

Query:   178 NKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
                  G+ ISLSEQ+LVDC    N  GCNGGL   AF++I  NGG+D+E+ YPY G +  
Sbjct:   178 YHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKDET 237

Query:   237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYESGVFT-GE 294
             C  S  N  V  ++   +++   E  LK AV   +PVS+A E    +F+ Y+SGV+T   
Sbjct:   238 CKFSAENVGVQVLNSV-NITLGAEDELKHAVGLVRPVSIAFEVI-HSFRLYKSGVYTDSH 295

Query:   295 CGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQ 336
             CGS    ++H V+AVGYG E+GV YWL++NSWG+DWG+ GY K++
Sbjct:   296 CGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKGYFKME 340


>ZFIN|ZDB-GENE-050522-559 [details] [associations]
            symbol:ctssb.1 "cathepsin S, b.1" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-050522-559 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 MEROPS:C01.034
            EMBL:BC095694 IPI:IPI00607338 UniGene:Dr.75553
            ProteinModelPortal:Q502H6 SMR:Q502H6 InParanoid:Q502H6
            ArrayExpress:Q502H6 Uniprot:Q502H6
        Length = 330

 Score = 592 (213.5 bits), Expect = 1.4e-57, P = 1.4e-57
 Identities = 132/314 (42%), Positives = 184/314 (58%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLN-RTYKVGLNKFADLTNEE 107
             ++ W   +GK          R Q+++ NL+ I  HN   S+   +Y + +N   DLT EE
Sbjct:    27 WELWKKTYGKIYTTEVEEFGRRQLWERNLQLITVHNLEASMGMHSYDLSMNHMGDLTTEE 86

Query:   108 Y-RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
               + + L       +R + + V S      +GD +P+S+DWREKG V+ VK QG+CGSCW
Sbjct:    87 ILQTLALTHVPSGFKRQIANIVGS------SGDAVPDSLDWREKGYVSSVKMQGACGSCW 140

Query:   167 AFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
             AFS+V A+EG  K  TG+L+ LS Q LVDC  K  N GCNGG M  AFQ++I NGG+ S+
Sbjct:   141 AFSSVGALEGQLKKTTGKLVDLSPQNLVDCSSKYGNKGCNGGFMSDAFQYVIDNGGIASD 200

Query:   226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQ 284
               YPY G + +C  S  + +  +   Y  V   DE +LK+AVA   P+SVAI+A    F 
Sbjct:   201 SAYPYRGVQQQCSYSS-SQRAANCTKYYFVRQGDENALKQAVASVGPISVAIDATRPQFV 259

Query:   285 HYESGVFTGE-CGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
              Y SGV+    C   ++H V+ VGYGT +G D+WLV+NSWG+ +G+ GY+++ RN     
Sbjct:   260 LYHSGVYNDPTCSKRVNHAVLVVGYGTLSGQDHWLVKNSWGTRFGDGGYIRMARN----K 315

Query:   344 TGKCGIAMEASYPV 357
                CGIA  A YPV
Sbjct:   316 NNMCGIASYACYPV 329


>DICTYBASE|DDB_G0281605 [details] [associations]
            symbol:cfaD "peptidase C1A family protein"
            species:44689 "Dictyostelium discoideum" [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IDA] [GO:0006508 "proteolysis" evidence=IDA] [GO:0031410
            "cytoplasmic vesicle" evidence=IDA] [GO:0031288 "sorocarp
            morphogenesis" evidence=IMP] [GO:0008285 "negative regulation of
            cell proliferation" evidence=IGI;IDA] [GO:0005576 "extracellular
            region" evidence=IEA;IDA] [GO:0005515 "protein binding"
            evidence=IPI] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0281605
            GO:GO:0008285 GO:GO:0005615 GenomeReviews:CM000152_GR
            eggNOG:COG4870 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0031410 EMBL:AAFI02000042
            GO:GO:0031288 RefSeq:XP_640530.1 HSSP:P07711
            ProteinModelPortal:Q54TR1 STRING:Q54TR1 PRIDE:Q54TR1
            EnsemblProtists:DDB0229857 GeneID:8623140 KEGG:ddi:DDB_G0281605
            InParanoid:Q54TR1 OMA:PSAHEHE ProtClustDB:CLSZ2430523
            Uniprot:Q54TR1
        Length = 531

 Score = 588 (212.0 bits), Expect = 3.6e-57, P = 3.6e-57
 Identities = 120/321 (37%), Positives = 191/321 (59%)

Query:    45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLT 104
             +++   +++ + A++ K  +    +++RF  FK   + I  HN+   +YK+G+N +ADL+
Sbjct:   218 EEQASNLFKEYKAQYNKEYSSQDEHDERFINFKAARKIIATHNAKESSYKLGMNHYADLS 277

Query:   105 NEEYRAMYLGTRSDAKRRLMKSKV--ASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
             N+E+  +        K ++ +  V  A   +  ++   +P +VDWR +  V PVKDQG C
Sbjct:   278 NKEFNTL-------VKPKVARPSVTGADSVHDDESLRSIPSTVDWRNQNCVTPVKDQGIC 330

Query:   163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGG 221
             GSCW F +  ++EG N +  GEL+SLSEQ+LVDC     + GC GG    AFQ++++ G 
Sbjct:   331 GSCWTFGSTGSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGS 390

Query:   222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGG 280
             + +E +YPYL     C         VSI GY +V+   E +L+ A+A   PV++AI+A  
Sbjct:   391 LATESNYPYLMQNGLCRDRTVTPSGVSITGYVNVTSGSESALQNAIATTGPVAIAIDASV 450

Query:   281 RAFQHYESGVFTGE-CGSALD---HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQ 336
               F++Y SGV+    C + LD   H V+A+GYGT  G DY+LV+NSW ++WG +GYV + 
Sbjct:   451 DDFRYYMSGVYNNPACKNGLDDLDHEVLAIGYGTYQGQDYFLVKNSWSTNWGMDGYVYMA 510

Query:   337 RNLLDTNTGKCGIAMEASYPV 357
             RN  D N   CG++ +A+YP+
Sbjct:   511 RN--DNNL--CGVSSQATYPI 527


>UNIPROTKB|F1NEC8 [details] [associations]
            symbol:CTSL2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:AADN02067812 IPI:IPI00820956 Ensembl:ENSGALT00000037988
            ArrayExpress:F1NEC8 Uniprot:F1NEC8
        Length = 218

 Score = 588 (212.0 bits), Expect = 3.6e-57, P = 3.6e-57
 Identities = 115/220 (52%), Positives = 148/220 (67%)

Query:   142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KI 200
             P SVDWREKG V PVKDQG CGSCWAFST  A+EG +   TG+L+SLSEQ LVDC R + 
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEG 61

Query:   201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
             N GCNGGLMD AFQ++  NGG+DSE+ YPY   +++    +      +  G+ D+    E
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHE 121

Query:   261 MSLKKAVADQ-PVSVAIEAGGRAFQHYESGVF-TGECGSA-LDHGVVAVGYGTENGVDYW 317
              +L KAVA   PVSVAI+AG  +FQ Y+SG++   +C S  LDHGV+ VGYG E+G  YW
Sbjct:   122 RALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEDGKKYW 181

Query:   318 LVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
             +V+NSWG  WG+ GY+ + ++        CGIA  ASYP+
Sbjct:   182 IVKNSWGEKWGDKGYIYMAKD----RKNHCGIATAASYPL 217


>MGI|MGI:107823 [details] [associations]
            symbol:Ctsk "cathepsin K" species:10090 "Mus musculus"
            [GO:0005615 "extracellular space" evidence=ISO] [GO:0005737
            "cytoplasm" evidence=ISO] [GO:0005764 "lysosome" evidence=ISO]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0045453 "bone resorption" evidence=ISO]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:107823 GO:GO:0005615 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0045453
            GO:GO:0001957 HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 OMA:LKVPPSH EMBL:X94444
            EMBL:AJ006033 EMBL:BC046320 IPI:IPI00316575 PIR:S74227
            RefSeq:NP_031828.2 UniGene:Mm.272085 ProteinModelPortal:P55097
            SMR:P55097 MINT:MINT-3089515 STRING:P55097 PhosphoSite:P55097
            PRIDE:P55097 Ensembl:ENSMUST00000015664 GeneID:13038 KEGG:mmu:13038
            InParanoid:P55097 BioCyc:MetaCyc:MONOMER-14811 ChEMBL:CHEMBL1075277
            NextBio:282924 Bgee:P55097 CleanEx:MM_CTSK Genevestigator:P55097
            GermOnline:ENSMUSG00000028111 Uniprot:P55097
        Length = 329

 Score = 586 (211.3 bits), Expect = 5.9e-57, P = 5.9e-57
 Identities = 129/319 (40%), Positives = 180/319 (56%)

Query:    46 DEVM-TIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLN-RTYKVGLNKF 100
             +E++ T ++ W   H K  N       R  I++ NL+ I  HN   SL   TY++ +N  
Sbjct:    19 EEMLDTQWELWKKTHQKQYNSKVDEISRRLIWEKNLKQISAHNLEASLGVHTYELAMNHL 78

Query:   101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
              D+T+EE      G R    R      +    Y  +    +P+S+D+R+KG V PVK+QG
Sbjct:    79 GDMTSEEVVQKMTGLRIPPSRSYSNDTL----YTPEWEGRVPDSIDYRKKGYVTPVKNQG 134

Query:   161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
              CGSCWAFS+  A+EG  K  TG+L++LS Q LVDC  + N GC GG M  AFQ++ QNG
Sbjct:   135 QCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVTE-NYGCGGGYMTTAFQYVQQNG 193

Query:   221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAG 279
             G+DSE  YPY+G +  C      AK     GY ++   +E +LK+AVA   P+SV+I+A 
Sbjct:   194 GIDSEDAYPYVGQDESC-MYNATAKAAKCRGYREIPVGNEKALKRAVARVGPISVSIDAS 252

Query:   280 GRAFQHYESGVFTGE-CG-SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
               +FQ Y  GV+  E C    ++H V+ VGYGT+ G  +W+++NSWG  WG  GY  L R
Sbjct:   253 LASFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQKGSKHWIIKNSWGESWGNKGYALLAR 312

Query:   338 NLLDTNTGKCGIAMEASYP 356
             N        CGI   AS+P
Sbjct:   313 N----KNNACGITNMASFP 327


>TAIR|locus:2078312 [details] [associations]
            symbol:AT3G45310 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005773 EMBL:CP002686
            GenomeReviews:BA000014_GR eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AL132953
            EMBL:AY091771 IPI:IPI00540369 PIR:T47471 RefSeq:NP_566880.1
            UniGene:At.25239 ProteinModelPortal:Q8RWQ9 SMR:Q8RWQ9
            MEROPS:C01.162 PaxDb:Q8RWQ9 PRIDE:Q8RWQ9 EnsemblPlants:AT3G45310.1
            GeneID:823669 KEGG:ath:AT3G45310 GeneFarm:5032 TAIR:At3g45310
            InParanoid:Q8RWQ9 KO:K01366 OMA:AFEVVHE PhylomeDB:Q8RWQ9
            ProtClustDB:CLSN2689015 Genevestigator:Q8RWQ9 Uniprot:Q8RWQ9
        Length = 358

 Score = 586 (211.3 bits), Expect = 5.9e-57, P = 5.9e-57
 Identities = 129/306 (42%), Positives = 180/306 (58%)

Query:    58 KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS 117
             ++GK    +   + RF +FK+NL  I   N    +YK+ LN+FADLT +E++   LG   
Sbjct:    65 RYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAAQ 124

Query:   118 DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGI 177
             +    L  S   ++         +P++ DWRE G V+PVK+QG CGSCW FST  A+E  
Sbjct:   125 NCSATLKGSHKITEA-------TVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAA 177

Query:   178 NKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
                  G+ ISLSEQ+LVDC    N  GC+GGL   AF++I  NGG+D+E+ YPY G +  
Sbjct:   178 YHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGG 237

Query:   237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYESGVFTGE- 294
             C  S +N  V   D   +++   E  LK AV   +PVSVA E     F+ Y+ GVFT   
Sbjct:   238 CKFSAKNIGVQVRDSV-NITLGAEDELKHAVGLVRPVSVAFEVV-HEFRFYKKGVFTSNT 295

Query:   295 CGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAM 351
             CG+    ++H V+AVGYG E+ V YWL++NSWG +WG+NGY K++   +  N   CG+A 
Sbjct:   296 CGNTPMDVNHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKME---MGKNM--CGVAT 350

Query:   352 EASYPV 357
              +SYPV
Sbjct:   351 CSSYPV 356


>MGI|MGI:107285 [details] [associations]
            symbol:Ctsh "cathepsin H" species:10090 "Mus musculus"
            [GO:0001520 "outer dense fiber" evidence=ISO] [GO:0001669
            "acrosomal vesicle" evidence=ISO] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=IGI] [GO:0002764 "immune response-regulating
            signaling pathway" evidence=ISO] [GO:0004175 "endopeptidase
            activity" evidence=ISO;IMP] [GO:0004177 "aminopeptidase activity"
            evidence=ISO] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISO;IDA] [GO:0004252 "serine-type endopeptidase activity"
            evidence=IMP] [GO:0005615 "extracellular space" evidence=ISO]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0005829 "cytosol"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=ISO;IMP]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0008284
            "positive regulation of cell proliferation" evidence=IMP]
            [GO:0010628 "positive regulation of gene expression" evidence=ISO]
            [GO:0010634 "positive regulation of epithelial cell migration"
            evidence=IMP] [GO:0010813 "neuropeptide catabolic process"
            evidence=ISO] [GO:0010815 "bradykinin catabolic process"
            evidence=ISO] [GO:0010952 "positive regulation of peptidase
            activity" evidence=IGI;ISO] [GO:0016505 "apoptotic protease
            activator activity" evidence=IGI;ISO] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0030108 "HLA-A specific activating MHC
            class I receptor activity" evidence=ISO] [GO:0030335 "positive
            regulation of cell migration" evidence=ISO] [GO:0030984 "kininogen
            binding" evidence=ISO] [GO:0031638 "zymogen activation"
            evidence=ISO;IMP] [GO:0031648 "protein destabilization"
            evidence=ISO;IMP] [GO:0032403 "protein complex binding"
            evidence=ISO] [GO:0032526 "response to retinoic acid" evidence=IDA]
            [GO:0033619 "membrane protein proteolysis" evidence=ISO;IMP]
            [GO:0035085 "cilium axoneme" evidence=ISO] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IMP] [GO:0043129
            "surfactant homeostasis" evidence=ISO] [GO:0043621 "protein
            self-association" evidence=ISO] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IMP] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=IMP]
            [GO:0070324 "thyroid hormone binding" evidence=ISO] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISO] [GO:0097208 "alveolar
            lamellar body" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:107285 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 EMBL:CH466560 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            HOVERGEN:HBG011513 GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT OrthoDB:EOG4W9J43
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 BRENDA:3.4.22.16
            EMBL:U06119 EMBL:AK149949 EMBL:AK150583 EMBL:AK157376 EMBL:AK160026
            EMBL:Y18464 IPI:IPI00118987 RefSeq:NP_031827.2 UniGene:Mm.2277
            ProteinModelPortal:P49935 SMR:P49935 STRING:P49935 MEROPS:I29.003
            PhosphoSite:P49935 PaxDb:P49935 PRIDE:P49935
            Ensembl:ENSMUST00000034915 GeneID:13036 KEGG:mmu:13036
            InParanoid:Q3UCD6 ChEMBL:CHEMBL1949491 NextBio:282920 Bgee:P49935
            CleanEx:MM_CTSH Genevestigator:P49935 GermOnline:ENSMUSG00000032359
            Uniprot:P49935
        Length = 333

 Score = 584 (210.6 bits), Expect = 9.6e-57, P = 9.6e-57
 Identities = 124/313 (39%), Positives = 185/313 (59%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
             +++W+ +H KT + + +N  R Q+F +N R I  HN  N T+K+ LN+F+D++  E +  
Sbjct:    33 FKSWMKQHQKTYSSVEYNH-RLQMFANNWRKIQAHNQRNHTFKMALNQFSDMSFAEIKHK 91

Query:   112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKG-AVNPVKDQGSCGSCWAFST 170
             +L +                 Y    G   P S+DWR+KG  V+PVK+QG+CGSCW FST
Sbjct:    92 FLWSEPQ------NCSATKSNYLRGTGP-YPSSMDWRKKGNVVSPVKNQGACGSCWTFST 144

Query:   171 VAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYP 229
               A+E    I +G+++SL+EQ+LVDC +  N  GC GGL   AF++I+ N G+  E  YP
Sbjct:   145 TGALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYILYNKGIMEEDSYP 204

Query:   230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYES 288
             Y+G ++ C  + + A V  +    +++  DE ++ +AVA   PVS A E     F  Y+S
Sbjct:   205 YIGKDSSCRFNPQKA-VAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVT-EDFLMYKS 262

Query:   289 GVFTGE-CGSALD---HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
             GV++ + C    D   H V+AVGYG +NG+ YW+V+NSWGS WGENGY  ++R       
Sbjct:   263 GVYSSKSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSQWGENGYFLIERG-----K 317

Query:   345 GKCGIAMEASYPV 357
               CG+A  ASYP+
Sbjct:   318 NMCGLAACASYPI 330


>UNIPROTKB|P09648 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9031 "Gallus gallus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            OrthoDB:EOG48PMKF MEROPS:C01.032 IPI:IPI00602255 PIR:S00081
            UniGene:Gga.523 ProteinModelPortal:P09648 SMR:P09648 Uniprot:P09648
        Length = 218

 Score = 583 (210.3 bits), Expect = 1.2e-56, P = 1.2e-56
 Identities = 114/220 (51%), Positives = 146/220 (66%)

Query:   142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KI 200
             P SVDWREKG V PVKDQG CGSCWAFST  A+EG +    G+L+SLSEQ LVDC R + 
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRPEG 61

Query:   201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
             N GCNGGLMD AFQ++  NGG+DSE+ YPY   +++    +      +  G+ D+    E
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHE 121

Query:   261 MSLKKAVADQ-PVSVAIEAGGRAFQHYESGVF-TGECGSA-LDHGVVAVGYGTENGVDYW 317
              +L KAVA   PVSVAI+AG  +FQ Y+SG++   +C S  LDHGV+ VGYG E G  YW
Sbjct:   122 RALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGGKKYW 181

Query:   318 LVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
             +V+NSWG  WG+ GY+ + ++        CGIA  ASYP+
Sbjct:   182 IVKNSWGEKWGDKGYIYMAKD----RKNHCGIATAASYPL 217


>UNIPROTKB|P25326 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9913 "Bos taurus"
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0016020 "membrane" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0002250 "adaptive
            immune response" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0016020 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0002250 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            GO:GO:0097067 EMBL:BC102245 EMBL:M95211 EMBL:X62001 IPI:IPI00702008
            PIR:S15844 RefSeq:NP_001028787.1 UniGene:Bt.7938
            ProteinModelPortal:P25326 SMR:P25326 STRING:P25326 PRIDE:P25326
            Ensembl:ENSBTAT00000022774 GeneID:327711 KEGG:bta:327711 CTD:1520
            InParanoid:P25326 KO:K01368 OMA:KAMDQKC OrthoDB:EOG4JM7Q2
            NextBio:20810175 Uniprot:P25326
        Length = 331

 Score = 580 (209.2 bits), Expect = 2.5e-56, P = 2.5e-56
 Identities = 130/314 (41%), Positives = 183/314 (58%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLN-RTYKVGLNKFADLTNEE 107
             +  W   +GK          R  I++ NL+ +  HN   S+   +Y++G+N   D+T+EE
Sbjct:    28 WDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVTLHNLEHSMGMHSYELGMNHLGDMTSEE 87

Query:   108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
               ++    R  ++       V    Y      +LP+S+DWREKG V  VK QG+CGSCWA
Sbjct:    88 VISLMSSLRVPSQ---WPRNVT---YKSDPNQKLPDSMDWREKGCVTEVKYQGACGSCWA 141

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDCDR-KI-NAGCNGGLMDYAFQFIIQNGGMDSE 225
             FS V A+E   K+ TG+L+SLS Q LVDC   K  N GCNGG M  AFQ+II N G+DSE
Sbjct:   142 FSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNNGIDSE 201

Query:   226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF-DEMSLKKAVADQ-PVSVAIEAGGRAF 283
               YPY   + KC    +N +  +   Y ++ PF  E +LK+AVA++ PVSV I+A   +F
Sbjct:   202 ASYPYKAMDGKCQYDVKN-RAATCSRYIEL-PFGSEEALKEAVANKGPVSVGIDASHSSF 259

Query:   284 QHYESGVFTG-ECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
               Y++GV+    C   ++HGV+ VGYG  +G DYWLV+NSWG  +G+ GY+++ RN    
Sbjct:   260 FLYKTGVYYDPSCTQNVNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQGYIRMARN---- 315

Query:   343 NTGKCGIAMEASYP 356
             +   CGIA   SYP
Sbjct:   316 SGNHCGIANYPSYP 329


>RGD|2447 [details] [associations]
            symbol:Ctsh "cathepsin H" species:10116 "Rattus norvegicus"
          [GO:0001520 "outer dense fiber" evidence=IDA] [GO:0001656
          "metanephros development" evidence=IEP] [GO:0001669 "acrosomal
          vesicle" evidence=IDA] [GO:0001913 "T cell mediated cytotoxicity"
          evidence=ISO;ISS] [GO:0002250 "adaptive immune response"
          evidence=ISO] [GO:0002764 "immune response-regulating signaling
          pathway" evidence=ISO;ISS] [GO:0004175 "endopeptidase activity"
          evidence=ISO] [GO:0004177 "aminopeptidase activity" evidence=ISO;IDA]
          [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISO;ISS]
          [GO:0004252 "serine-type endopeptidase activity" evidence=ISO;ISS]
          [GO:0005615 "extracellular space" evidence=ISO;ISS;IDA] [GO:0005764
          "lysosome" evidence=ISO;ISS;IDA] [GO:0005829 "cytosol"
          evidence=ISO;ISS] [GO:0006508 "proteolysis" evidence=IEP;ISO]
          [GO:0007283 "spermatogenesis" evidence=IEP] [GO:0008233 "peptidase
          activity" evidence=ISO] [GO:0008234 "cysteine-type peptidase
          activity" evidence=ISO] [GO:0008284 "positive regulation of cell
          proliferation" evidence=ISO;ISS] [GO:0010628 "positive regulation of
          gene expression" evidence=ISO;ISS] [GO:0010634 "positive regulation
          of epithelial cell migration" evidence=ISO;ISS] [GO:0010813
          "neuropeptide catabolic process" evidence=ISO;ISS] [GO:0010815
          "bradykinin catabolic process" evidence=ISO;ISS] [GO:0010952
          "positive regulation of peptidase activity" evidence=ISO;ISS]
          [GO:0016505 "apoptotic protease activator activity" evidence=ISO;ISS]
          [GO:0030108 "HLA-A specific activating MHC class I receptor activity"
          evidence=ISO;ISS] [GO:0030335 "positive regulation of cell migration"
          evidence=ISO;ISS] [GO:0030984 "kininogen binding" evidence=IPI]
          [GO:0031638 "zymogen activation" evidence=ISO;ISS] [GO:0031648
          "protein destabilization" evidence=ISO;ISS] [GO:0032403 "protein
          complex binding" evidence=IPI] [GO:0032526 "response to retinoic
          acid" evidence=ISO;ISS] [GO:0033619 "membrane protein proteolysis"
          evidence=ISO;ISS] [GO:0035085 "cilium axoneme" evidence=IDA]
          [GO:0043066 "negative regulation of apoptotic process"
          evidence=ISO;ISS] [GO:0043129 "surfactant homeostasis"
          evidence=ISO;ISS] [GO:0043621 "protein self-association"
          evidence=IDA] [GO:0045766 "positive regulation of angiogenesis"
          evidence=ISO;ISS] [GO:0060448 "dichotomous subdivision of terminal
          units involved in lung branching" evidence=ISO;ISS] [GO:0070324
          "thyroid hormone binding" evidence=ISO;ISS] [GO:0070371 "ERK1 and
          ERK2 cascade" evidence=ISO;ISS] [GO:0097067 "cellular response to
          thyroid hormone stimulus" evidence=ISO;IEP] [GO:0097208 "alveolar
          lamellar body" evidence=ISO;ISS;IDA] InterPro:IPR000668
          InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
          InterPro:IPR000169 RGD:2447 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
          GO:GO:0008284 GO:GO:0070371 GO:GO:0001669 eggNOG:COG4870
          HOGENOM:HOG000230774 InterPro:IPR025661 InterPro:IPR025660
          InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
          PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0007283
          GO:GO:0045766 GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
          GO:GO:0043621 GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 KO:K01366
          GO:GO:0016505 GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
          HOVERGEN:HBG011513 GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
          GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT OrthoDB:EOG4W9J43
          GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
          GO:GO:0010813 GO:GO:0043129 MEROPS:I29.003 EMBL:Y00708 EMBL:BC085352
          EMBL:M38135 IPI:IPI00212809 PIR:S00211 RefSeq:NP_037071.1
          UniGene:Rn.1997 ProteinModelPortal:P00786 SMR:P00786 STRING:P00786
          PRIDE:P00786 Ensembl:ENSRNOT00000019285 GeneID:25425 KEGG:rno:25425
          UCSC:RGD:2447 InParanoid:P00786 BindingDB:P00786 NextBio:606599
          Genevestigator:P00786 GermOnline:ENSRNOG00000014064 GO:GO:0035086
          GO:GO:0001520 Uniprot:P00786
        Length = 333

 Score = 579 (208.9 bits), Expect = 3.3e-56, P = 3.3e-56
 Identities = 123/313 (39%), Positives = 181/313 (57%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
             + +W+ +H KT +   ++  R Q+F +N R I  HN  N T+K+GLN+F+D++  E +  
Sbjct:    33 FTSWMKQHQKTYSSREYSH-RLQVFANNWRKIQAHNQRNHTFKMGLNQFSDMSFAEIKHK 91

Query:   112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKG-AVNPVKDQGSCGSCWAFST 170
             YL +                 Y    G   P S+DWR+KG  V+PVK+QG+CGSCW FST
Sbjct:    92 YLWSEPQ------NCSATKSNYLRGTGP-YPSSMDWRKKGNVVSPVKNQGACGSCWTFST 144

Query:   171 VAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYP 229
               A+E    I +G++++L+EQ+LVDC +  N  GC GGL   AF++I+ N G+  E  YP
Sbjct:   145 TGALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYP 204

Query:   230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYES 288
             Y+G   +C  +   A V  +    +++  DE ++ +AVA   PVS A E     F  Y+S
Sbjct:   205 YIGKNGQCKFNPEKA-VAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVT-EDFMMYKS 262

Query:   289 GVFTGE-CGSALD---HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
             GV++   C    D   H V+AVGYG +NG+ YW+V+NSWGS+WG NGY  ++R       
Sbjct:   263 GVYSSNSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSNWGNNGYFLIERG-----K 317

Query:   345 GKCGIAMEASYPV 357
               CG+A  ASYP+
Sbjct:   318 NMCGLAACASYPI 330


>ZFIN|ZDB-GENE-001205-4 [details] [associations]
            symbol:ctsk "cathepsin K" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-001205-4 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            CTD:1513 KO:K01371 OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55
            EMBL:BC092901 IPI:IPI00512751 RefSeq:NP_001017778.1
            UniGene:Dr.76224 ProteinModelPortal:Q568D6 SMR:Q568D6 GeneID:550475
            KEGG:dre:550475 InParanoid:Q568D6 NextBio:20879718
            ArrayExpress:Q568D6 Uniprot:Q568D6
        Length = 333

 Score = 579 (208.9 bits), Expect = 3.3e-56, P = 3.3e-56
 Identities = 127/314 (40%), Positives = 181/314 (57%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS---LN-RTYKVGLNKFADLTNEE 107
             +++W   H +  NG+     R  I++ N+ FI+ HN    L   TY +G+N F D+T EE
Sbjct:    30 WESWKITHKREYNGLNEESIRRTIWEKNMLFIEAHNKEYELGIHTYDLGMNHFGDMTLEE 89

Query:   108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
                  +G +    R    + V   R       +LP+S+D+R+ G V  VK+QGSCGSCWA
Sbjct:    90 VAEKVMGLQMPMYRDPANTFVPDDRVG-----KLPKSIDYRKLGYVTSVKNQGSCGSCWA 144

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
             FS+V A+EG      G+L+ LS Q LVDC  + N GC GG M  AF+++  N G+DSE+ 
Sbjct:   145 FSSVGALEGQLMKTKGQLVDLSPQNLVDCVTE-NDGCGGGYMTNAFRYVSNNQGIDSEES 203

Query:   228 YPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHY 286
             YPY+G + +C     +    S  GY+++   +E +L  AVA+  PVSV I+A    F +Y
Sbjct:   204 YPYVGTDQQC-AYNTSGVAASCRGYKEIPQGNERALTAAVANVGPVSVGIDAMQSTFLYY 262

Query:   287 ESGVFTG-ECGSA-LDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
             +SGV+    C    ++H V+AVGYG T  G  YW+V+NSWG +WG+ GYV + RN     
Sbjct:   263 KSGVYYDPNCNKEDVNHAVLAVGYGATPRGKKYWIVKNSWGEEWGKKGYVLMARN----R 318

Query:   344 TGKCGIAMEASYPV 357
                CGIA  AS+PV
Sbjct:   319 NNACGIANLASFPV 332


>UNIPROTKB|Q9GLE3 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9823 "Sus scrofa"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0005576 "extracellular
            region" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005576 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 MEROPS:I29.007
            CTD:1513 KO:K01371 OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55
            OMA:LKVPPSH EMBL:AF292030 RefSeq:NP_999467.1 UniGene:Ssc.1020
            ProteinModelPortal:Q9GLE3 SMR:Q9GLE3 STRING:Q9GLE3
            Ensembl:ENSSSCT00000007283 GeneID:397569 KEGG:ssc:397569
            ArrayExpress:Q9GLE3 Uniprot:Q9GLE3
        Length = 330

 Score = 577 (208.2 bits), Expect = 5.3e-56, P = 5.3e-56
 Identities = 131/321 (40%), Positives = 186/321 (57%)

Query:    46 DEVM-TIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLN-RTYKVGLNKF 100
             +E++ T ++ W   + K  N       R  I++ NL+ I  HN   SL   TY++ +N  
Sbjct:    20 EEILDTQWELWKKTYRKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHL 79

Query:   101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
              D+T+EE      G +         S+     Y        P+S+D+R+KG V PVK+QG
Sbjct:    80 GDMTSEEVVQKMTGLKVPPSH----SRSNDTLYIPDWEGRTPDSIDYRKKGYVTPVKNQG 135

Query:   161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
              CGSCWAFS+V A+EG  K  TG+L++LS Q LVDC  + N GC GG M  AFQ++ +N 
Sbjct:   136 QCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE-NDGCGGGYMTNAFQYVQKNR 194

Query:   221 GMDSEQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIE 277
             G+DSE  YPY+G +  C  +P+ + AK     GY ++   +E +LK+AVA   PVSVAI+
Sbjct:   195 GIDSEDAYPYVGQDENCMYNPTGKAAKC---RGYREIPEGNEKALKRAVARVGPVSVAID 251

Query:   278 AGGRAFQHYESGVFTGE-CGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
             A   +FQ Y  GV+  E C S  L+H V+AVGYG + G  +W+++NSWG +WG  GY+ +
Sbjct:   252 ASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGKKHWIIKNSWGENWGNKGYILM 311

Query:   336 QRNLLDTNTGKCGIAMEASYP 356
              RN        CGIA  AS+P
Sbjct:   312 ARN----KNNACGIANLASFP 328


>UNIPROTKB|F7B939 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458 CTD:1512
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 EMBL:ACFV01158341
            EMBL:ACFV01158342 EMBL:ACFV01158343 RefSeq:XP_002753411.1
            Ensembl:ENSCJAT00000004397 GeneID:100413104 Uniprot:F7B939
        Length = 336

 Score = 577 (208.2 bits), Expect = 5.3e-56, P = 5.3e-56
 Identities = 124/313 (39%), Positives = 179/313 (57%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
             +++W+AKH KT +      +R Q F  N R I+ HN+ N T+K+ +N+F+D++  E +  
Sbjct:    35 FKSWMAKHHKTYSREEEYHQRLQTFASNWRKINAHNNGNHTFKMAVNQFSDMSFAEIKRK 94

Query:   112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGA-VNPVKDQGSCGSCWAFST 170
             YL +                 Y    G   P SVDWR+KG  V+PVK+QG+CGSCW FST
Sbjct:    95 YLWSEPQ------NCSATKSNYLRGTGP-YPPSVDWRKKGHFVSPVKNQGACGSCWTFST 147

Query:   171 VAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYP 229
               A+E    I TG+++SL+EQ+LVDC +  N  GC GGL   AF++I+ N G+  E  YP
Sbjct:   148 TGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNNGIMGEDTYP 207

Query:   230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYES 288
             Y G ++ C      A +  +    +++ +DE ++ +AVA   PVS A E   + F  Y+ 
Sbjct:   208 YQGKDSDCKFQPGKA-IGFVKDVANITIYDEDAMVEAVALYNPVSFAFEVT-QDFMMYKR 265

Query:   289 GVFTG-ECGSALD---HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
             G+++   C    D   H V+AVGYG ENG+ YW+V+NSWG  WG NGY  ++R       
Sbjct:   266 GIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERG-----K 320

Query:   345 GKCGIAMEASYPV 357
               CG+A  ASYPV
Sbjct:   321 NMCGLAACASYPV 333


>UNIPROTKB|F7BRD4 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001656 "metanephros development"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0001656
            GeneTree:ENSGT00660000095458 EMBL:ACFV01158341 EMBL:ACFV01158342
            EMBL:ACFV01158343 Ensembl:ENSCJAT00000004396 Uniprot:F7BRD4
        Length = 336

 Score = 577 (208.2 bits), Expect = 5.3e-56, P = 5.3e-56
 Identities = 124/313 (39%), Positives = 179/313 (57%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
             +++W+AKH KT +      +R Q F  N R I+ HN+ N T+K+ +N+F+D++  E +  
Sbjct:    35 FKSWMAKHHKTYSREEEYHQRLQTFASNWRKINAHNNGNHTFKMAVNQFSDMSFAEIKRK 94

Query:   112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGA-VNPVKDQGSCGSCWAFST 170
             YL +                 Y    G   P SVDWR+KG  V+PVK+QG+CGSCW FST
Sbjct:    95 YLWSEPQ------NCSATKSNYLRGTGP-YPPSVDWRKKGHFVSPVKNQGACGSCWTFST 147

Query:   171 VAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYP 229
               A+E    I TG+++SL+EQ+LVDC +  N  GC GGL   AF++I+ N G+  E  YP
Sbjct:   148 TGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNNGIMGEDTYP 207

Query:   230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYES 288
             Y G ++ C      A +  +    +++ +DE ++ +AVA   PVS A E   + F  Y+ 
Sbjct:   208 YQGKDSDCKFQPGKA-IGFVKDVANITIYDEDAMVEAVALYNPVSFAFEVT-QDFMMYKR 265

Query:   289 GVFTG-ECGSALD---HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
             G+++   C    D   H V+AVGYG ENG+ YW+V+NSWG  WG NGY  ++R       
Sbjct:   266 GIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERG-----K 320

Query:   345 GKCGIAMEASYPV 357
               CG+A  ASYPV
Sbjct:   321 NMCGLAACASYPV 333


>RGD|621513 [details] [associations]
            symbol:Ctss "cathepsin S" species:10116 "Rattus norvegicus"
            [GO:0001656 "metanephros development" evidence=IEP] [GO:0002250
            "adaptive immune response" evidence=ISO] [GO:0005764 "lysosome"
            evidence=IEA;ISO] [GO:0006508 "proteolysis" evidence=IEA;ISO]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0009986 "cell
            surface" evidence=IDA] [GO:0016020 "membrane" evidence=ISO]
            [GO:0043231 "intracellular membrane-bounded organelle"
            evidence=ISO] [GO:0045453 "bone resorption" evidence=IMP]
            [GO:0051930 "regulation of sensory perception of pain"
            evidence=IMP] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            RGD:621513 GO:GO:0009986 GO:GO:0051930 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0045453
            GO:GO:0001656 HOVERGEN:HBG011513 CTD:1520 KO:K01368 MEROPS:I29.004
            BRENDA:3.4.22.27 EMBL:L03201 IPI:IPI00210228 PIR:A45087
            RefSeq:NP_059016.1 UniGene:Rn.11347 ProteinModelPortal:Q02765
            PhosphoSite:Q02765 PRIDE:Q02765 GeneID:50654 KEGG:rno:50654
            UCSC:RGD:621513 ChEMBL:CHEMBL1075217 NextBio:610462
            Genevestigator:Q02765 Uniprot:Q02765
        Length = 330

 Score = 577 (208.2 bits), Expect = 5.3e-56, P = 5.3e-56
 Identities = 138/318 (43%), Positives = 183/318 (57%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLN-RTYKVGLNKFADLTNEE 107
             +  W     + +      + R  I++ NL+FI  HN   S+   +Y VG+N   D+T EE
Sbjct:    26 WDLWKKTRMRRNTDQNEEDVRRLIWEKNLKFIMLHNLEHSMGMHSYSVGMNHMGDMTPEE 85

Query:   108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
                 Y+G+      R+ +    S      +   LP+SVDWREKG V  VK QGSCGSCWA
Sbjct:    86 VIG-YMGSL-----RIPRPWNRSGTLKSSSNQTLPDSVDWREKGCVTNVKYQGSCGSCWA 139

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDC--DRKI-NAGCNGGLMDYAFQFIIQNGGMDS 224
             FS   A+EG  K+ TG+L+SLS Q LVDC  + K  N GC GG M  AFQ+II    +DS
Sbjct:   140 FSAEGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEAFQYIIDTS-IDS 198

Query:   225 EQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPF-DEMSLKKAVADQ-PVSVAIE-AG 279
             E  YPY   + KC  DP  R A   +   Y ++ PF DE +LK+AVA + PVSV I+ A 
Sbjct:   199 EASYPYKAMDEKCLYDPKNRAA---TCSRYIEL-PFGDEEALKEAVATKGPVSVGIDDAS 254

Query:   280 GRAFQHYESGVFTG-ECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
               +F  Y+SGV+    C   ++HGV+ VGYGT +G DYWLV+NSWG  +G+ GY+++ RN
Sbjct:   255 HSSFFLYQSGVYDDPSCTENMNHGVLVVGYGTLDGKDYWLVKNSWGLHFGDQGYIRMARN 314

Query:   339 LLDTNTGKCGIAMEASYP 356
                 N   CGIA   SYP
Sbjct:   315 ----NKNHCGIASYCSYP 328


>UNIPROTKB|G1K2A7 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9615 "Canis lupus
            familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 PANTHER:PTHR12411:SF55 OMA:LKVPPSH
            EMBL:AAEX03011051 Ensembl:ENSCAFT00000019202 Uniprot:G1K2A7
        Length = 333

 Score = 575 (207.5 bits), Expect = 8.6e-56, P = 8.6e-56
 Identities = 130/321 (40%), Positives = 186/321 (57%)

Query:    46 DEVM-TIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLN-RTYKVGLNKF 100
             +E++ T +  W   + K  N       R  I++ NL+ I  HN   SL   TY++ +N  
Sbjct:    23 EEILDTQWDLWKKTYRKQYNSKVDELSRRLIWEKNLKHISIHNLEASLGVHTYELAMNHL 82

Query:   101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
              D+T+EE      G +           +    +  +A    P+SVD+R+KG V PVK+QG
Sbjct:    83 GDMTSEEVVQKMTGLKVPPSHSRSNDTLYIPDWESRA----PDSVDYRKKGYVTPVKNQG 138

Query:   161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
              CGSCWAFS+V A+EG  K  TG+L++LS Q LVDC  + N GC GG M  AFQ++ +N 
Sbjct:   139 QCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE-NDGCGGGYMTNAFQYVQKNR 197

Query:   221 GMDSEQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIE 277
             G+DSE  YPY+G +  C  +P+ + AK     GY ++   +E +LK+AVA   P+SVAI+
Sbjct:   198 GIDSEDAYPYVGQDESCMYNPTGKAAKC---RGYREIPEGNEKALKRAVARVGPISVAID 254

Query:   278 AGGRAFQHYESGVFTGE-CGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
             A   +FQ Y  GV+  E C S  L+H V+AVGYG + G  +W+++NSWG +WG  GY+ +
Sbjct:   255 ASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILM 314

Query:   336 QRNLLDTNTGKCGIAMEASYP 356
              RN        CGIA  AS+P
Sbjct:   315 ARN----KNNACGIANLASFP 331


>UNIPROTKB|Q3ZKN1 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:AY738221
            RefSeq:NP_001029168.1 UniGene:Cfa.588 HSSP:P43235
            ProteinModelPortal:Q3ZKN1 SMR:Q3ZKN1 STRING:Q3ZKN1 GeneID:608843
            KEGG:cfa:608843 InParanoid:Q3ZKN1 NextBio:20894470 Uniprot:Q3ZKN1
        Length = 330

 Score = 575 (207.5 bits), Expect = 8.6e-56, P = 8.6e-56
 Identities = 130/321 (40%), Positives = 186/321 (57%)

Query:    46 DEVM-TIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLN-RTYKVGLNKF 100
             +E++ T +  W   + K  N       R  I++ NL+ I  HN   SL   TY++ +N  
Sbjct:    20 EEILDTQWDLWKKTYRKQYNSKVDELSRRLIWEKNLKHISIHNLEASLGVHTYELAMNHL 79

Query:   101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
              D+T+EE      G +           +    +  +A    P+SVD+R+KG V PVK+QG
Sbjct:    80 GDMTSEEVVQKMTGLKVPPSHSRSNDTLYIPDWESRA----PDSVDYRKKGYVTPVKNQG 135

Query:   161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
              CGSCWAFS+V A+EG  K  TG+L++LS Q LVDC  + N GC GG M  AFQ++ +N 
Sbjct:   136 QCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE-NDGCGGGYMTNAFQYVQKNR 194

Query:   221 GMDSEQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIE 277
             G+DSE  YPY+G +  C  +P+ + AK     GY ++   +E +LK+AVA   P+SVAI+
Sbjct:   195 GIDSEDAYPYVGQDESCMYNPTGKAAKC---RGYREIPEGNEKALKRAVARVGPISVAID 251

Query:   278 AGGRAFQHYESGVFTGE-CGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKL 335
             A   +FQ Y  GV+  E C S  L+H V+AVGYG + G  +W+++NSWG +WG  GY+ +
Sbjct:   252 ASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILM 311

Query:   336 QRNLLDTNTGKCGIAMEASYP 356
              RN        CGIA  AS+P
Sbjct:   312 ARN----KNNACGIANLASFP 328


>DICTYBASE|DDB_G0278401 [details] [associations]
            symbol:cprH "cysteine proteinase 8" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0278401 EMBL:AAFI02000023
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 ProtClustDB:CLSZ2430780 RefSeq:XP_642342.1
            ProteinModelPortal:Q54Y60 MEROPS:C01.A62 EnsemblProtists:DDB0205428
            GeneID:8621547 KEGG:ddi:DDB_G0278401 InParanoid:Q54Y60 OMA:FANMENE
            Uniprot:Q54Y60
        Length = 337

 Score = 573 (206.8 bits), Expect = 1.4e-55, P = 1.4e-55
 Identities = 128/331 (38%), Positives = 184/331 (55%)

Query:    44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
             ++ +    +  W+  + K S        R+ IFK N  +I+E NS      +GLNK AD+
Sbjct:    22 SESQYRDAFTDWMISNQK-SYSSSEFITRYNIFKTNFDYIEEWNSKGSETVLGLNKMADI 80

Query:   104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
             TNEEYR++YLG   DA      S +   +      ++   +VDWR+KGAV  VK+Q SC 
Sbjct:    81 TNEEYRSLYLGKPFDA------SSLIGTKEEILFSNKFSSTVDWRKKGAVTHVKNQQSCS 134

Query:   164 SCWAFSTVAAVEGINKIV---TGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQN 219
              CW+FS   A EG +K+    T EL+SLSEQ L+DC     N GCNGG++ YAF++II N
Sbjct:   135 GCWSFSATGATEGAHKLANNGTNELVSLSEQNLIDCSTPFGNTGCNGGVITYAFEYIISN 194

Query:   220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAG 279
             GG+D+E+ YP+ G +  C     N+   +I  Y +V+   E SL+ AV   PV+ +I+A 
Sbjct:   195 GGIDTEKSYPFEGTDGTCRYKSENSGA-TISSYVNVTFGSESSLESAVNVNPVACSIDAS 253

Query:   280 GRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTENGV-----------DYWLVRNSWGSD 326
               +F  Y+SG+ F   C    LDHGV+ VGYGTEN             +YW+ +NSWG  
Sbjct:   254 HSSFLFYKSGIYFEPACSRTNLDHGVLVVGYGTENSQSQDSSSEPNHSNYWIAKNSWGI- 312

Query:   327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
                NGY+ + ++        CGI+  AS+P+
Sbjct:   313 ---NGYILMSKD----RDNMCGISTLASFPI 336


>UNIPROTKB|P25774 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0016020 "membrane"
            evidence=IEA] [GO:0005576 "extracellular region" evidence=NAS]
            [GO:0005764 "lysosome" evidence=IDA;NAS] [GO:0097067 "cellular
            response to thyroid hormone stimulus" evidence=IEP] [GO:0002250
            "adaptive immune response" evidence=IEP] [GO:0019882 "antigen
            processing and presentation" evidence=TAS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=TAS] [GO:0006955
            "immune response" evidence=TAS] [GO:0002474 "antigen processing and
            presentation of peptide antigen via MHC class I" evidence=TAS]
            [GO:0002480 "antigen processing and presentation of exogenous
            peptide antigen via MHC class I, TAP-independent" evidence=TAS]
            [GO:0019886 "antigen processing and presentation of exogenous
            peptide antigen via MHC class II" evidence=TAS] [GO:0036021
            "endolysosome lumen" evidence=TAS] [GO:0042590 "antigen processing
            and presentation of exogenous peptide antigen via MHC class I"
            evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
            [GO:0045087 "innate immune response" evidence=TAS] [GO:0043231
            "intracellular membrane-bounded organelle" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 Reactome:REACT_118779
            Reactome:REACT_6900 GO:GO:0005576 GO:GO:0002480 GO:GO:0016020
            GO:GO:0019886 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087 EMBL:CH471121
            GO:GO:0004197 GO:GO:0002250 HOVERGEN:HBG011513 GO:GO:0097067
            GO:GO:0036021 EMBL:AL356292 CTD:1520 KO:K01368 OMA:KAMDQKC
            OrthoDB:EOG4JM7Q2 EMBL:S93414 EMBL:M86553 EMBL:M90696 EMBL:U07374
            EMBL:U07370 EMBL:U07371 EMBL:U07372 EMBL:U07373 EMBL:CR541676
            EMBL:AK301472 EMBL:AK314482 EMBL:BC002642 IPI:IPI00299150
            IPI:IPI00910216 PIR:A42482 RefSeq:NP_001186668.1 RefSeq:NP_004070.3
            UniGene:Hs.181301 PDB:1BXF PDB:1GLO PDB:1MS6 PDB:1NPZ PDB:1NQC
            PDB:2C0Y PDB:2F1G PDB:2FQ9 PDB:2FRA PDB:2FRQ PDB:2FT2 PDB:2FUD
            PDB:2FYE PDB:2G6D PDB:2G7Y PDB:2H7J PDB:2HH5 PDB:2HHN PDB:2HXZ
            PDB:2OP3 PDB:2R9M PDB:2R9N PDB:2R9O PDB:3IEJ PDB:3KWN PDB:3MPE
            PDB:3MPF PDB:3N3G PDB:3N4C PDB:3OVX PDBsum:1BXF PDBsum:1GLO
            PDBsum:1MS6 PDBsum:1NPZ PDBsum:1NQC PDBsum:2C0Y PDBsum:2F1G
            PDBsum:2FQ9 PDBsum:2FRA PDBsum:2FRQ PDBsum:2FT2 PDBsum:2FUD
            PDBsum:2FYE PDBsum:2G6D PDBsum:2G7Y PDBsum:2H7J PDBsum:2HH5
            PDBsum:2HHN PDBsum:2HXZ PDBsum:2OP3 PDBsum:2R9M PDBsum:2R9N
            PDBsum:2R9O PDBsum:3IEJ PDBsum:3KWN PDBsum:3MPE PDBsum:3MPF
            PDBsum:3N3G PDBsum:3N4C PDBsum:3OVX ProteinModelPortal:P25774
            SMR:P25774 IntAct:P25774 STRING:P25774 MEROPS:I29.004
            PhosphoSite:P25774 DMDM:88984046 PaxDb:P25774 PeptideAtlas:P25774
            PRIDE:P25774 DNASU:1520 Ensembl:ENST00000368985
            Ensembl:ENST00000448301 GeneID:1520 KEGG:hsa:1520 UCSC:uc001evn.3
            GeneCards:GC01M150702 HGNC:HGNC:2545 HPA:CAB000460 HPA:HPA002988
            MIM:116845 neXtProt:NX_P25774 PharmGKB:PA27041 InParanoid:P25774
            PhylomeDB:P25774 BRENDA:3.4.22.27 BindingDB:P25774
            ChEMBL:CHEMBL2954 ChiTaRS:CTSS EvolutionaryTrace:P25774
            GenomeRNAi:1520 NextBio:6291 PMAP-CutDB:P25774 ArrayExpress:P25774
            Bgee:P25774 CleanEx:HS_CTSS Genevestigator:P25774
            GermOnline:ENSG00000163131 Uniprot:P25774
        Length = 331

 Score = 573 (206.8 bits), Expect = 1.4e-55, P = 1.4e-55
 Identities = 127/313 (40%), Positives = 179/313 (57%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLN-RTYKVGLNKFADLTNEE 107
             +  W   +GK          R  I++ NL+F+  HN   S+   +Y +G+N   D+T+EE
Sbjct:    28 WHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEE 87

Query:   108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
               ++    R  ++    +  +    Y       LP+SVDWREKG V  VK QGSCG+CWA
Sbjct:    88 VMSLMSSLRVPSQ---WQRNIT---YKSNPNRILPDSVDWREKGCVTEVKYQGSCGACWA 141

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDC--DRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
             FS V A+E   K+ TG+L+SLS Q LVDC  ++  N GCNGG M  AFQ+II N G+DS+
Sbjct:   142 FSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSD 201

Query:   226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQ 284
               YPY   + KC    +  +  +   Y ++    E  LK+AVA++ PVSV ++A   +F 
Sbjct:   202 ASYPYKAMDQKCQYDSKY-RAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFF 260

Query:   285 HYESGVF-TGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
              Y SGV+    C   ++HGV+ VGYG  NG +YWLV+NSWG ++GE GY+++ RN     
Sbjct:   261 LYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARN----K 316

Query:   344 TGKCGIAMEASYP 356
                CGIA   SYP
Sbjct:   317 GNHCGIASFPSYP 329


>UNIPROTKB|D3ZZR3 [details] [associations]
            symbol:D3ZZR3 "Uncharacterized protein" species:10116
            "Rattus norvegicus" [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0016020 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0002250 GeneTree:ENSGT00560000076577 GO:GO:0097067
            OrthoDB:EOG4JM7Q2 IPI:IPI00210228 PRIDE:D3ZZR3
            Ensembl:ENSRNOT00000028732 Uniprot:D3ZZR3
        Length = 331

 Score = 573 (206.8 bits), Expect = 1.4e-55, P = 1.4e-55
 Identities = 136/319 (42%), Positives = 178/319 (55%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLN-RTYKVGLNKFADLTNEE 107
             +  W   H K        + R  I++ NL+FI  HN   S+   +Y VG+N   D+  E 
Sbjct:    25 WDLWKKTHEKEYKDQNEEDVRRLIWEKNLKFIMLHNLEHSMGMHSYSVGMNHMGDMVAET 84

Query:   108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWRE--KGAVNPVKDQGSCGSC 165
                  +G       RL + + A           LP  V W+E  KG    +  QGSCGSC
Sbjct:    85 I----IGEMGS--ERLPRKRKALGLIPSSVNQNLPAGVKWKERTKGCWKNLVFQGSCGSC 138

Query:   166 WAFSTVAAVEGINKIVTGELISLSEQELVDC--DRKI-NAGCNGGLMDYAFQFIIQNGGM 222
             WAFS V A+EG  K+ TG+L+SLS Q LVDC  + K  N GC GG M  AFQ+II NGG+
Sbjct:   139 WAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEAFQYIIDNGGI 198

Query:   223 DSEQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPF-DEMSLKKAVADQ-PVSVAIEA 278
             DSE  YPY   + KC  DP  R A   +   Y ++ PF DE +LK+AVA + PVSV I+A
Sbjct:   199 DSEASYPYKAMDEKCHYDPKNRAA---TCSRYIEL-PFGDEEALKEAVATKGPVSVGIDA 254

Query:   279 GGRAFQHYESGVFTG-ECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
                +F  Y+SGV+    C   ++HGV+ VGYGT +G DYWLV+NSWG  +G+ GY+++ R
Sbjct:   255 SHSSFFLYQSGVYDDPSCTENVNHGVLVVGYGTLDGKDYWLVKNSWGLHFGDQGYIRMAR 314

Query:   338 NLLDTNTGKCGIAMEASYP 356
             N    N   CGIA   SYP
Sbjct:   315 N----NKNHCGIASYCSYP 329


>UNIPROTKB|Q24940 [details] [associations]
            symbol:Cat-1 "Cathepsin L-like proteinase" species:6192
            "Fasciola hepatica" [GO:0004175 "endopeptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005576 "extracellular region" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005576 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0004197 EMBL:L33771 PIR:S43991 PDB:2O6X
            PDBsum:2O6X ProteinModelPortal:Q24940 SMR:Q24940 MEROPS:C01.033
            EvolutionaryTrace:Q24940 Uniprot:Q24940
        Length = 326

 Score = 572 (206.4 bits), Expect = 1.8e-55, P = 1.8e-55
 Identities = 127/316 (40%), Positives = 181/316 (57%)

Query:    51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNE 106
             ++  W   + K  NG     +R  I++ N++ I EHN  +     TY +GLN+F D+T E
Sbjct:    20 LWHQWKRMYNKEYNGADDQHRR-NIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFE 78

Query:   107 EYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDE-LPESVDWREKGAVNPVKDQGSCGSC 165
             E++A YL   S A      S + S     +A +  +P+ +DWRE G V  VKDQG+CGSC
Sbjct:    79 EFKAKYLTEMSRA------SDILSHGVPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSC 132

Query:   166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDS 224
             WAFST   +EG         IS SEQ+LVDC     N GC+GGLM+ A+Q++ Q G +++
Sbjct:   133 WAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQFG-LET 191

Query:   225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAV-ADQPVSVAIEAGGRAF 283
             E  YPY   E +C  +++   V  + GY  V    E+ LK  V A +P +VA++     F
Sbjct:   192 ESSYPYTAVEGQCRYNKQLG-VAKVTGYYTVHSGSEVELKNLVGARRPAAVAVDVESD-F 249

Query:   284 QHYESGVFTGECGSAL--DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
               Y SG++  +  S L  +H V+AVGYGT+ G DYW+V+NSWG+ WGE GY+++ RN   
Sbjct:   250 MMYRSGIYQSQTCSPLRVNHAVLAVGYGTQGGTDYWIVKNSWGTYWGERGYIRMARN--- 306

Query:   342 TNTGKCGIAMEASYPV 357
                  CGIA  AS P+
Sbjct:   307 -RGNMCGIASLASLPM 321


>UNIPROTKB|F6R7P5 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9544 "Macaca
            mulatta" [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=ISS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0004252
            "serine-type endopeptidase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005829 "cytosol" evidence=ISS] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0010815 "bradykinin catabolic
            process" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0016505 "apoptotic protease
            activator activity" evidence=ISS] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=ISS] [GO:0030335
            "positive regulation of cell migration" evidence=ISS] [GO:0031638
            "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0032526 "response to retinoic
            acid" evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            GO:GO:0043066 GO:GO:0005615 GO:GO:0008284 GO:GO:0070371
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458 CTD:1512
            OMA:STSCHKT GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 RefSeq:XP_001108862.1
            UniGene:Mmu.3000 Ensembl:ENSMMUT00000014095 GeneID:711437
            KEGG:mcc:711437 NextBio:19969972 Uniprot:F6R7P5
        Length = 335

 Score = 570 (205.7 bits), Expect = 2.9e-55, P = 2.9e-55
 Identities = 123/314 (39%), Positives = 181/314 (57%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
             +++W++KH KT +   ++  R Q F  N R I+ HN+ N T+K+ LN+F+D++  E +  
Sbjct:    35 FKSWMSKHHKTYSTEEYHH-RMQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKHK 93

Query:   112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGA-VNPVKDQGSCGSCWAFST 170
             YL +                 Y    G   P S+DWR+KG  V+PVK+QG+CGSCW FST
Sbjct:    94 YLWSEPQ------NCSATKSNYLRGTGP-YPPSMDWRKKGNFVSPVKNQGACGSCWTFST 146

Query:   171 VAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYP 229
               A+E    I TG+++SL+EQ+LVDC +  N  GC GGL   AF++I+ N G+  E  YP
Sbjct:   147 TGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYP 206

Query:   230 YLGAENKCDPSRRNAKVVS-IDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYE 287
             Y G +  C    R  K +  +    +++ +DE ++ +AVA   PVS A E   + F  Y+
Sbjct:   207 YQGKDGDC--KFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVT-QDFMIYK 263

Query:   288 SGVFTG-ECGSALD---HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
             +G+++   C    D   H V+AVGYG ENG+ YW+V+NSWG  WG NGY  ++R      
Sbjct:   264 TGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERG----- 318

Query:   344 TGKCGIAMEASYPV 357
                CG+A  ASYP+
Sbjct:   319 KNMCGLAACASYPI 332


>DICTYBASE|DDB_G0272815 [details] [associations]
            symbol:cprE "cysteine proteinase 5" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0272815 GO:GO:0005615
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GenomeReviews:CM000151_GR GO:GO:0005764
            EMBL:AAFI02000008 MEROPS:I29.003 KO:K01376 EMBL:L36205
            RefSeq:XP_644977.1 ProteinModelPortal:P54640 SMR:P54640
            PRIDE:P54640 EnsemblProtists:DDB0185092 GeneID:8618654
            KEGG:ddi:DDB_G0272815 OMA:METAFEF ProtClustDB:CLSZ2430780
            Uniprot:P54640
        Length = 344

 Score = 568 (205.0 bits), Expect = 4.8e-55, P = 4.8e-55
 Identities = 134/327 (40%), Positives = 183/327 (55%)

Query:    52 YQTWLAKHGK--TSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYR 109
             +  W+  H K  TS   G    R+ IFK N+ ++ + NS      +GLN FAD+TNEEYR
Sbjct:    30 FTDWMITHQKSYTSEEFG---ARYNIFKANMDYVQQWNSKGSETVLGLNNFADITNEEYR 86

Query:   110 AMYLGTRSDAKRRL--MKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
               YLGT+ DA   +   + KV +   A         S DWR +GAV PVK+QG CG CW+
Sbjct:    87 NTYLGTKFDASSLIGTQEEKVFTTSSAA--------SKDWRSEGAVTPVKNQGQCGGCWS 138

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
             FST  + EG +    GEL+SLSEQ L+DC  + N+GC+GGLM YAF++II N G+D+E  
Sbjct:   139 FSTTGSTEGAHFQSKGELVSLSEQNLIDCSTE-NSGCDGGLMTYAFEYIINNNGIDTESS 197

Query:   228 YPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYE 287
             YPY     KC+    N+   ++  Y+ V+   E SL+ AV   PVSVAI+A  ++FQ Y 
Sbjct:   198 YPYKAENGKCEYKSENSGA-TLSSYKTVTAGSESSLESAVNVNPVSVAIDASHQSFQLYT 256

Query:   288 SGVF-TGECGSA-LDHGVVAVGYGTENGVDYW---------LVRNS----W--GSDWGEN 330
             SG++   EC S  LDHGV+AVGYG+ +G             L  +S    W   + WG +
Sbjct:   257 SGIYYEPECSSENLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNSWGTS 316

Query:   331 GYVKLQRNLLDTNTGKCGIAMEASYPV 357
               ++    +       CGIA  AS+PV
Sbjct:   317 WGIEGYILMSRNRDNNCGIASSASFPV 343


>DICTYBASE|DDB_G0272298 [details] [associations]
            symbol:DDB_G0272298 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0272298 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246
            SMART:SM00848 EMBL:AAFI02000008 KO:K01365 RefSeq:XP_645281.1
            ProteinModelPortal:Q559Q3 MEROPS:C01.A53 EnsemblProtists:DDB0203746
            GeneID:8618447 KEGG:ddi:DDB_G0272298 InParanoid:Q559Q3 OMA:PANINWR
            Uniprot:Q559Q3
        Length = 305

 Score = 567 (204.7 bits), Expect = 6.1e-55, P = 6.1e-55
 Identities = 119/308 (38%), Positives = 174/308 (56%)

Query:    56 LAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRAMYLG 114
             + K+ K         KRF IF+DN  FI  H + N    ++ LN+++DLT +E+   +  
Sbjct:     1 MVKYNKHYKNNKEYLKRFDIFQDNYNFILNHRNKNGENIEMDLNEYSDLTQKEFADKFFE 60

Query:   115 TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAV 174
                   R    + + +  +       +P+S DWR+ GAV  VK+QGSC SCW+FS + A+
Sbjct:    61 KLVPEPRSGPINDIKATPFKHNVNATIPKSFDWRDHGAVGKVKNQGSCASCWSFSALGAL 120

Query:   175 EGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA 233
             EG   I  GEL+ LSEQ LVDC       GC  G M  AF++II +GG++ E  YPY G 
Sbjct:   121 EGHYYIKYGELLDLSEQNLVDCATPFGPKGCKTGWMHDAFKYIISSGGVNLESQYPYTGK 180

Query:   234 ENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFT 292
             +  C    ++ K   + G+  +  FDE +L +A+A   PV+V I+   + FQH   G++ 
Sbjct:   181 DEVCK-FNQSEKEAKVSGFVMIPKFDESALMEAIALYGPVAVPIDTSTKEFQHLSGGIYY 239

Query:   293 GECGSALD--HGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
              +     +  H V+A+GYGT ENGVDY+L++NSWG  WG NG+ K++R +     GKCGI
Sbjct:   240 SDSCDPWNTIHAVLAIGYGTDENGVDYFLMKNSWGKSWGTNGFFKVKRGV----KGKCGI 295

Query:   350 AMEASYPV 357
                ASYP+
Sbjct:   296 VTAASYPI 303


>ZFIN|ZDB-GENE-030131-3539 [details] [associations]
            symbol:ctsh "cathepsin H" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-030131-3539
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 KO:K01366 HOVERGEN:HBG011513
            CTD:1512 OrthoDB:EOG4W9J43 MEROPS:I29.003 HSSP:P43235 EMBL:BC067615
            IPI:IPI00506892 RefSeq:NP_997853.1 UniGene:Dr.14176
            ProteinModelPortal:Q6NWF2 SMR:Q6NWF2 PRIDE:Q6NWF2 GeneID:324818
            KEGG:dre:324818 InParanoid:Q6NWF2 NextBio:20808976 Bgee:Q6NWF2
            Uniprot:Q6NWF2
        Length = 330

 Score = 567 (204.7 bits), Expect = 6.1e-55, P = 6.1e-55
 Identities = 120/315 (38%), Positives = 180/315 (57%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
             +++W++++ K    +    +R QIF +N + ID+HN  N  + +GLN+F+D+T  E++  
Sbjct:    30 FKSWMSQYNKKYE-INEFYQRLQIFLENKKRIDQHNEGNHKFSMGLNQFSDMTFAEFKKT 88

Query:   112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGA-VNPVKDQGSCGSCWAFST 170
             YL T          + V+S           P+++DWR KG  +  VK+QG CGSCW FST
Sbjct:    89 YLLTEPQNCSATRGNHVSSNGL-------YPDAIDWRTKGHYITDVKNQGPCGSCWTFST 141

Query:   171 VAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
                +E +  I TG+L+ L+EQ+L+DC     N GCNGGL  +AF++I+ N G+ +E DYP
Sbjct:   142 TGCLESVTAIATGKLLQLAEQQLIDCAGDFDNHGCNGGLPSHAFEYIMYNKGLMTEDDYP 201

Query:   230 YLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHY 286
             Y     +C   P    A V  +    +++ +DEM +  AVA   PVS A E     F HY
Sbjct:   202 YQAKGGQCRFKPQLAAAFVKEV---VNITKYDEMGMVDAVARLNPVSFAYEVTSD-FMHY 257

Query:   287 ESGVFTG-ECGSALD---HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
             + G++T  EC +  D   H V+AVGY  ENG  YW+V+NSWG++WG  GY  ++R     
Sbjct:   258 KDGIYTSTECHNTTDMVNHAVLAVGYAEENGTPYWIVKNSWGTNWGIKGYFYIERG---- 313

Query:   343 NTGKCGIAMEASYPV 357
                 CG+A  +SYP+
Sbjct:   314 -KNMCGLAACSSYPI 327


>UNIPROTKB|G1RBY1 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:61853
            "Nomascus leucogenys" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 EMBL:ADFV01087552 RefSeq:XP_003275518.1
            Ensembl:ENSNLET00000011249 GeneID:100584322 Uniprot:G1RBY1
        Length = 335

 Score = 565 (203.9 bits), Expect = 9.9e-55, P = 9.9e-55
 Identities = 122/314 (38%), Positives = 180/314 (57%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
             +++W++KH KT +   ++  R Q+F  N R I+ HN+ N T+K+ LN+F+D++  E +  
Sbjct:    35 FKSWMSKHHKTYSTEEYHH-RLQMFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKHK 93

Query:   112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGA-VNPVKDQGSCGSCWAFST 170
             YL +                 Y    G   P S+DWR+KG  V+PVK+QG+CGSCW FST
Sbjct:    94 YLWSEPQ------NCSATKSNYLRGTGP-YPPSMDWRKKGNFVSPVKNQGACGSCWTFST 146

Query:   171 VAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYP 229
               A+E    I TG+++SL+EQ+LVDC +  N  GC GGL   AF++I+ N G+  E  YP
Sbjct:   147 TGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYP 206

Query:   230 YLGAENKCDPSRRNAKVVS-IDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYE 287
             Y G +  C    R  K +  +    +++ +DE ++ +AVA   PVS A E   + F  Y 
Sbjct:   207 YQGKDGYC--KFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVT-QDFMMYR 263

Query:   288 SGVFTG-ECGSALD---HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
              G+++   C    D   H V+AVGYG +NG+ YW+V+NSWG  WG NGY  ++R      
Sbjct:   264 RGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERG----- 318

Query:   344 TGKCGIAMEASYPV 357
                CG+A  ASYP+
Sbjct:   319 KNMCGLAACASYPI 332


>UNIPROTKB|G1SQF0 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9986
            "Oryctolagus cuniculus" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 RefSeq:XP_002721635.1 UniGene:Ocu.7137
            Ensembl:ENSOCUT00000006138 GeneID:100101597 Uniprot:G1SQF0
        Length = 333

 Score = 565 (203.9 bits), Expect = 9.9e-55, P = 9.9e-55
 Identities = 123/313 (39%), Positives = 177/313 (56%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
             +++W+++H K  +   +  +R Q F  N R I+ HN+ N T+++GLN+F+D++  E +  
Sbjct:    33 FKSWMSQHHKKYSAEEY-PRRLQTFVRNWRKINAHNNGNHTFQMGLNQFSDMSFAEIKHK 91

Query:   112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGA-VNPVKDQGSCGSCWAFST 170
             YL T                 Y    G   P SVDWR+KG  V+PVK+QG+CGSCW FST
Sbjct:    92 YLWTEPQ------NCSATKSNYLRGTGP-YPSSVDWRKKGNFVSPVKNQGACGSCWTFST 144

Query:   171 VAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYP 229
               A+E    I  G+++SL+EQ+LVDC +  N  GC GGL   AF++I+ N G+  E  YP
Sbjct:   145 TGALESAVAIAGGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQAFEYILYNKGIMGEDSYP 204

Query:   230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYES 288
             Y   E +C    + A +  +    +++  DE ++ +AVA   PVS A E     F  Y  
Sbjct:   205 YRAMEGRCKFQPQKA-IAFVKDVANITLNDEEAMVEAVALYNPVSFAFEVT-EDFMQYRK 262

Query:   289 GVFTG-ECGSALD---HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
             G+++   C    D   H V+AVGYG ENGV YW+V+NSWGS WG NGY  ++R       
Sbjct:   263 GIYSSTSCHKTPDKVNHAVLAVGYGEENGVPYWIVKNSWGSHWGMNGYFYIERG-----K 317

Query:   345 GKCGIAMEASYPV 357
               CG+A  ASYP+
Sbjct:   318 NMCGLAACASYPI 330


>UNIPROTKB|F1PMM9 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9615 "Canis lupus
            familiaris" [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00639 GO:GO:0004197 GO:GO:0002250
            GeneTree:ENSGT00560000076577 GO:GO:0097067 EMBL:AAEX03000499
            Ensembl:ENSCAFT00000002029 OMA:EFKQVLN Uniprot:F1PMM9
        Length = 341

 Score = 564 (203.6 bits), Expect = 1.3e-54, P = 1.3e-54
 Identities = 128/332 (38%), Positives = 189/332 (56%)

Query:    39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYK 94
             S++ + D  +   +  W   HGK  +      +R  +++ N+  I++HN        ++ 
Sbjct:    24 SAAPQQDHSLDAHWSQWKEAHGKLYDKDEEGWRR-TVWERNMEMIEQHNQEYSQGEHSFT 82

Query:    95 VGLNKFADLTNEEYRAMYLGTRSDAK-RRLMKSKVASQRYACKAGDELPESVDWREKGAV 153
             + +N F D+TNEE++ +     +D K ++  K KV    +      E+P SVDWRE+G V
Sbjct:    83 LAMNAFGDMTNEEFKQVL----NDFKIQKHKKGKV----FPAPLFAEVPSSVDWREQGYV 134

Query:   154 NPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYA 212
              PVKDQG C  CWAFS   A+EG     TG+L+SLSEQ LVDC   + N GCNGGLM+YA
Sbjct:   135 TPVKDQGQCLGCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWSQGNRGCNGGLMEYA 194

Query:   213 FQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-P 271
             FQ++  NGG+DSE+ YPYL     C   R      ++  +  +   +E  L   VA   P
Sbjct:   195 FQYVKDNGGLDSEESYPYLARNEPCK-YRPEKSAANVTAFWPILN-EEDGLMTTVATVGP 252

Query:   272 VSVAIEAGGRAFQHYESGVFTG-ECGSAL-DHGVVAVGYGTENGVD----YWLVRNSWGS 325
             VS A+++  ++FQ Y+ G++   +C + L +HGV+ VGYG E        YW+V+NSWG+
Sbjct:   253 VSAAVDSSPQSFQFYKKGIYYDPKCSNKLLNHGVLVVGYGFEGAESDNKKYWIVKNSWGT 312

Query:   326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
             +WG  GY+ L ++  D +   CGIA  ASYPV
Sbjct:   313 NWGMQGYMLLAKDR-DNH---CGIATRASYPV 340


>UNIPROTKB|P09668 [details] [associations]
            symbol:CTSH "Pro-cathepsin H" species:9606 "Homo sapiens"
            [GO:0001520 "outer dense fiber" evidence=IEA] [GO:0001669
            "acrosomal vesicle" evidence=IEA] [GO:0007283 "spermatogenesis"
            evidence=IEA] [GO:0030984 "kininogen binding" evidence=IEA]
            [GO:0032403 "protein complex binding" evidence=IEA] [GO:0043621
            "protein self-association" evidence=IEA] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0031648 "protein destabilization"
            evidence=IMP] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=ISS] [GO:0032526 "response to retinoic acid"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0030108 "HLA-A
            specific activating MHC class I receptor activity" evidence=IDA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEP] [GO:0010813 "neuropeptide catabolic process"
            evidence=IDA] [GO:0010815 "bradykinin catabolic process"
            evidence=IDA] [GO:0030335 "positive regulation of cell migration"
            evidence=IDA] [GO:0070371 "ERK1 and ERK2 cascade" evidence=IDA]
            [GO:0010628 "positive regulation of gene expression" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IDA;TAS] [GO:0031638 "zymogen
            activation" evidence=IDA] [GO:0016505 "apoptotic protease activator
            activity" evidence=IDA] [GO:0010952 "positive regulation of
            peptidase activity" evidence=IDA] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=ISS] [GO:0004252 "serine-type endopeptidase
            activity" evidence=ISS] [GO:0043066 "negative regulation of
            apoptotic process" evidence=ISS] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0033619 "membrane protein proteolysis"
            evidence=IDA] [GO:0004175 "endopeptidase activity" evidence=IDA]
            [GO:0004177 "aminopeptidase activity" evidence=IDA] [GO:0005764
            "lysosome" evidence=IDA] [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0002250 "adaptive immune response" evidence=IEP]
            [GO:0019882 "antigen processing and presentation" evidence=TAS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0070324 "thyroid hormone binding" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0008284
            "positive regulation of cell proliferation" evidence=ISS]
            [GO:0045766 "positive regulation of angiogenesis" evidence=ISS]
            [GO:0008233 "peptidase activity" evidence=IDA] [GO:0097208
            "alveolar lamellar body" evidence=IDA] [GO:0043129 "surfactant
            homeostasis" evidence=IDA] [GO:0004197 "cysteine-type endopeptidase
            activity" evidence=IDA;TAS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 Reactome:REACT_6900 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913 MEROPS:C01.040 CTD:1512
            OMA:STSCHKT OrthoDB:EOG4W9J43 GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 EMBL:X16832 EMBL:AF426247 EMBL:AK314698 EMBL:AC011944
            EMBL:BC002479 EMBL:X07549 IPI:IPI00297487 PIR:S12486
            RefSeq:NP_004381.2 UniGene:Hs.148641 PDB:1BZN PDBsum:1BZN
            ProteinModelPortal:P09668 SMR:P09668 IntAct:P09668 STRING:P09668
            PhosphoSite:P09668 DMDM:288558851 PaxDb:P09668 PRIDE:P09668
            DNASU:1512 Ensembl:ENST00000220166 GeneID:1512 KEGG:hsa:1512
            UCSC:uc021srk.1 GeneCards:GC15M079213 H-InvDB:HIX0012481
            HGNC:HGNC:2535 HPA:CAB000458 HPA:HPA003524 MIM:116820
            neXtProt:NX_P09668 PharmGKB:PA27033 InParanoid:P09668
            PhylomeDB:P09668 BRENDA:3.4.22.16 ChEMBL:CHEMBL2225 GenomeRNAi:1512
            NextBio:6261 ArrayExpress:P09668 Bgee:P09668 CleanEx:HS_CTSH
            Genevestigator:P09668 GermOnline:ENSG00000103811 GO:GO:0019882
            Uniprot:P09668
        Length = 335

 Score = 564 (203.6 bits), Expect = 1.3e-54, P = 1.3e-54
 Identities = 122/313 (38%), Positives = 179/313 (57%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
             +++W++KH KT +   ++  R Q F  N R I+ HN+ N T+K+ LN+F+D++  E +  
Sbjct:    35 FKSWMSKHRKTYSTEEYHH-RLQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKHK 93

Query:   112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGA-VNPVKDQGSCGSCWAFST 170
             YL +                 Y    G   P SVDWR+KG  V+PVK+QG+CGSCW FST
Sbjct:    94 YLWSEPQ------NCSATKSNYLRGTGP-YPPSVDWRKKGNFVSPVKNQGACGSCWTFST 146

Query:   171 VAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYP 229
               A+E    I TG+++SL+EQ+LVDC +  N  GC GGL   AF++I+ N G+  E  YP
Sbjct:   147 TGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYP 206

Query:   230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYES 288
             Y G +  C      A +  +    +++ +DE ++ +AVA   PVS A E   + F  Y +
Sbjct:   207 YQGKDGYCKFQPGKA-IGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVT-QDFMMYRT 264

Query:   289 GVFTG-ECGSALD---HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
             G+++   C    D   H V+AVGYG +NG+ YW+V+NSWG  WG NGY  ++R       
Sbjct:   265 GIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERG-----K 319

Query:   345 GKCGIAMEASYPV 357
               CG+A  ASYP+
Sbjct:   320 NMCGLAACASYPI 332


>UNIPROTKB|G3R9A7 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9595 "Gorilla
            gorilla gorilla" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 OMA:STSCHKT GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 RefSeq:XP_004056662.1 Ensembl:ENSGGOT00000012331
            GeneID:101144312 Uniprot:G3R9A7
        Length = 335

 Score = 563 (203.2 bits), Expect = 1.6e-54, P = 1.6e-54
 Identities = 122/313 (38%), Positives = 179/313 (57%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
             +++W++KH KT +   ++  R Q F  N R I+ HN+ N T+K+ LN+F+D++  E +  
Sbjct:    35 FRSWMSKHRKTYSTEEYHH-RLQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKHK 93

Query:   112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGA-VNPVKDQGSCGSCWAFST 170
             YL +                 Y    G   P SVDWR+KG  V+PVK+QG+CGSCW FST
Sbjct:    94 YLWSEPQ------NCSATKSNYLRGTGP-YPPSVDWRKKGNFVSPVKNQGACGSCWTFST 146

Query:   171 VAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYP 229
               A+E    I TG+++SL+EQ+LVDC +  N  GC GGL   AF++I+ N G+  E  YP
Sbjct:   147 TGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYP 206

Query:   230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYES 288
             Y G +  C      A +  +    +++ +DE ++ +AVA   PVS A E   + F  Y +
Sbjct:   207 YQGKDGYCKFQPGKA-IGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVT-QDFMMYRT 264

Query:   289 GVFTG-ECGSALD---HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
             G+++   C    D   H V+AVGYG +NG+ YW+V+NSWG  WG NGY  ++R       
Sbjct:   265 GIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPKWGMNGYFLIERG-----K 319

Query:   345 GKCGIAMEASYPV 357
               CG+A  ASYP+
Sbjct:   320 NMCGLAACASYPI 332


>UNIPROTKB|O46427 [details] [associations]
            symbol:CTSH "Pro-cathepsin H" species:9823 "Sus scrofa"
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0032526 "response to retinoic acid" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0043129
            "surfactant homeostasis" evidence=ISS] [GO:0010815 "bradykinin
            catabolic process" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0005829 "cytosol"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0030335 "positive regulation of cell
            migration" evidence=ISS] [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0016505 "apoptotic protease activator
            activity" evidence=ISS] [GO:0004252 "serine-type endopeptidase
            activity" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=ISS] [GO:0031638 "zymogen activation"
            evidence=ISS] [GO:0030108 "HLA-A specific activating MHC class I
            receptor activity" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0010628 "positive regulation of gene
            expression" evidence=ISS] [GO:0070324 "thyroid hormone binding"
            evidence=ISS] [GO:0006508 "proteolysis" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0060448
            "dichotomous subdivision of terminal units involved in lung
            branching" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] [GO:0005764 "lysosome" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0004177 "aminopeptidase
            activity" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=ISS] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 MEROPS:C01.040 CTD:1512 OMA:STSCHKT
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 EMBL:AF001169
            RefSeq:NP_999094.1 UniGene:Ssc.3593 PDB:1NB3 PDB:1NB5 PDB:8PCH
            PDBsum:1NB3 PDBsum:1NB5 PDBsum:8PCH ProteinModelPortal:O46427
            SMR:O46427 Ensembl:ENSSSCT00000001983 GeneID:396969 KEGG:ssc:396969
            EvolutionaryTrace:O46427 ArrayExpress:O46427 Uniprot:O46427
        Length = 335

 Score = 562 (202.9 bits), Expect = 2.1e-54, P = 2.1e-54
 Identities = 123/313 (39%), Positives = 176/313 (56%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
             +++W+ +H K  + +     R Q+F  N R I+ HN+ N T+K+GLN+F+D++ +E R  
Sbjct:    35 FKSWMVQHQKKYS-LEEYHHRLQVFVSNWRKINAHNAGNHTFKLGLNQFSDMSFDEIRHK 93

Query:   112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGA-VNPVKDQGSCGSCWAFST 170
             YL +                 Y    G   P S+DWR+KG  V+PVK+QGSCGSCW FST
Sbjct:    94 YLWSEPQ------NCSATKGNYLRGTGP-YPPSMDWRKKGNFVSPVKNQGSCGSCWTFST 146

Query:   171 VAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYP 229
               A+E    I TG+++SL+EQ+LVDC +  N  GC GGL   AF++I  N G+  E  YP
Sbjct:   147 TGALESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYP 206

Query:   230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYES 288
             Y G ++ C      A +  +    +++  DE ++ +AVA   PVS A E     F  Y  
Sbjct:   207 YKGQDDHCKFQPDKA-IAFVKDVANITMNDEEAMVEAVALYNPVSFAFEVTND-FLMYRK 264

Query:   289 GVFTG-ECGSALD---HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
             G+++   C    D   H V+AVGYG ENG+ YW+V+NSWG  WG NGY  ++R       
Sbjct:   265 GIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERG-----K 319

Query:   345 GKCGIAMEASYPV 357
               CG+A  ASYP+
Sbjct:   320 NMCGLAACASYPI 332


>TAIR|locus:2120222 [details] [associations]
            symbol:RD19 "RESPONSIVE TO DEHYDRATION 19" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009269 "response to desiccation" evidence=IEP] [GO:0006970
            "response to osmotic stress" evidence=IGI] [GO:0005634 "nucleus"
            evidence=IDA] [GO:0005773 "vacuole" evidence=IDA] [GO:0042742
            "defense response to bacterium" evidence=IMP] [GO:0006096
            "glycolysis" evidence=RCA] [GO:0006833 "water transport"
            evidence=RCA] [GO:0006972 "hyperosmotic response" evidence=RCA]
            [GO:0007030 "Golgi organization" evidence=RCA] [GO:0009266
            "response to temperature stimulus" evidence=RCA] [GO:0009651
            "response to salt stress" evidence=IEP;RCA] [GO:0046686 "response
            to cadmium ion" evidence=RCA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISS] [GO:0009414 "response to
            water deprivation" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005634 GO:GO:0005773 EMBL:CP002687
            GenomeReviews:CT486007_GR GO:GO:0009651 GO:GO:0042742
            eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            ProtClustDB:CLSN2688311 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AL035679 EMBL:AL161594 GO:GO:0004197
            MEROPS:C01.022 EMBL:D13042 EMBL:AY080598 EMBL:AY133844
            IPI:IPI00544363 PIR:JN0718 RefSeq:NP_568052.1 UniGene:At.2850
            UniGene:At.74924 ProteinModelPortal:P43296 SMR:P43296 STRING:P43296
            PaxDb:P43296 PRIDE:P43296 EnsemblPlants:AT4G39090.1 GeneID:830064
            KEGG:ath:AT4G39090 TAIR:At4g39090 InParanoid:P43296 OMA:EDFDWRD
            PhylomeDB:P43296 Genevestigator:P43296 GermOnline:AT4G39090
            Uniprot:P43296
        Length = 368

 Score = 553 (199.7 bits), Expect = 1.9e-53, P = 1.9e-53
 Identities = 123/315 (39%), Positives = 175/315 (55%)

Query:    58 KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS 117
             K GK       ++ RF +FK NLR    H  L+ +   G+ +F+DLT  E+R  +LG RS
Sbjct:    57 KFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFRKKHLGVRS 116

Query:   118 DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGI 177
               K  L K    + +      + LPE  DWR+ GAV PVK+QGSCGSCW+FS   A+EG 
Sbjct:   117 GFK--LPKD---ANKAPILPTENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGA 171

Query:   178 NKIVTGELISLSEQELVDCDRKIN--------AGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
             N + TG+L+SLSEQ+LVDCD + +        +GCNGGLM+ AF++ ++ GG+  E+DYP
Sbjct:   172 NFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYP 231

Query:   230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
             Y G + K     ++  V S+  +  +S  +E      V + P++VAI AG    Q Y  G
Sbjct:   232 YTGKDGKTCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAG--YMQTYIGG 289

Query:   290 VFTGE-CGSALDHGVVAVGYGTENGVD-------YWLVRNSWGSDWGENGYVKL--QRNL 339
             V     C   L+HGV+ VGYG             YW+++NSWG  WGENG+ K+   RN+
Sbjct:   290 VSCPYICTRRLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGETWGENGFYKICKGRNI 349

Query:   340 LDTNTGKCGIAMEAS 354
                ++    +A   S
Sbjct:   350 CGVDSMVSTVAATVS 364


>UNIPROTKB|Q3T0I2 [details] [associations]
            symbol:CTSH "Pro-cathepsin H" species:9913 "Bos taurus"
            [GO:0031638 "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISS] [GO:0010815 "bradykinin
            catabolic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0010813 "neuropeptide catabolic
            process" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0045766 "positive regulation of angiogenesis" evidence=ISS]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=ISS] [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0033619 "membrane protein proteolysis" evidence=ISS]
            [GO:0043066 "negative regulation of apoptotic process"
            evidence=ISS] [GO:0004252 "serine-type endopeptidase activity"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0016505 "apoptotic protease activator activity"
            evidence=ISS] [GO:0010952 "positive regulation of peptidase
            activity" evidence=ISS] [GO:0030108 "HLA-A specific activating MHC
            class I receptor activity" evidence=ISS] [GO:0002764 "immune
            response-regulating signaling pathway" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0070324 "thyroid
            hormone binding" evidence=ISS] [GO:0006508 "proteolysis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0097208
            "alveolar lamellar body" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004175
            "endopeptidase activity" evidence=ISS] [GO:0032526 "response to
            retinoic acid" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 EMBL:BC102386 IPI:IPI00693034
            RefSeq:NP_001029557.1 UniGene:Bt.52393 ProteinModelPortal:Q3T0I2
            SMR:Q3T0I2 STRING:Q3T0I2 MEROPS:C01.040 PRIDE:Q3T0I2
            Ensembl:ENSBTAT00000014593 GeneID:510524 KEGG:bta:510524 CTD:1512
            InParanoid:Q3T0I2 OMA:STSCHKT OrthoDB:EOG4W9J43 NextBio:20869490
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 Uniprot:Q3T0I2
        Length = 335

 Score = 551 (199.0 bits), Expect = 3.0e-53, P = 3.0e-53
 Identities = 124/315 (39%), Positives = 174/315 (55%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
             +Q+W+ +H K  +   +   R Q F  NLR I+ HN+ N T+K+GLN+F+D++ +E +  
Sbjct:    35 FQSWMVQHQKKYSSEEYYH-RLQAFASNLREINAHNARNHTFKMGLNQFSDMSFDELKRK 93

Query:   112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGA-VNPVKDQGSCGSCWAFST 170
             YL +                 Y    G   P S+DWR+KG  V PVK+QGSCGSCW FST
Sbjct:    94 YLWSEPQ------NCSATKSNYLRGTGP-YPPSMDWRKKGNFVTPVKNQGSCGSCWTFST 146

Query:   171 VAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYP 229
               A+E    I TG+L  L+EQ+LVDC +  N  GC GGL   AF++I  N G+  E  YP
Sbjct:   147 TGALESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYP 206

Query:   230 YLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHY 286
             Y G +  C   PS+  A V  +    +++  DE ++ +AVA   PVS A E     F  Y
Sbjct:   207 YRGQDGDCKYQPSKAIAFVKDV---ANITLNDEEAMVEAVALHNPVSFAFEVTAD-FMMY 262

Query:   287 ESGVFTG-ECGSALD---HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
               G+++   C    D   H V+AVGYG E G+ YW+V+NSWG +WG  GY  ++R     
Sbjct:   263 RKGIYSSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPNWGMKGYFLIERG---- 318

Query:   343 NTGKCGIAMEASYPV 357
                 CG+A  AS+P+
Sbjct:   319 -KNMCGLAACASFPI 332


>UNIPROTKB|Q10991 [details] [associations]
            symbol:CTSL "Cathepsin L1" species:9940 "Ovis aries"
            [GO:0005515 "protein binding" evidence=IPI] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            MEROPS:C01.032 ProteinModelPortal:Q10991 SMR:Q10991 Uniprot:Q10991
        Length = 217

 Score = 551 (199.0 bits), Expect = 3.0e-53, P = 3.0e-53
 Identities = 117/223 (52%), Positives = 149/223 (66%)

Query:   141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-K 199
             +P+SVDW +KG V PVK+QG CGSCWAFS   A+EG     TG+L+SLSEQ LVD  R +
Sbjct:     1 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQ 60

Query:   200 INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD--PSRRNAKVVSIDGYEDVSP 257
              N GCNGGLMD AFQ+I +NGG+DSE+ YPY   +  C+  P    AK     G+ D+ P
Sbjct:    61 GNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKDT---GFVDI-P 116

Query:   258 FDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFTG-ECGSA-LDHGVVAVGYGTENGV 314
               E +L KAVA   P+SVAI+AG  +FQ Y+SG++   +C S  LDHGV+ VGYG E   
Sbjct:   117 QREKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTN 176

Query:   315 D-YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
             + +W+V+NSWG +WG  GYVK+ +   D N   CGIA  ASYP
Sbjct:   177 NKFWIVKNSWGPEWGNKGYVKMAK---DQNN-HCGIATAASYP 215


>UNIPROTKB|G1M0X4 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9646
            "Ailuropoda melanoleuca" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 EMBL:ACTA01057330 EMBL:ACTA01065330
            Ensembl:ENSAMET00000013529 Uniprot:G1M0X4
        Length = 337

 Score = 550 (198.7 bits), Expect = 3.8e-53, P = 3.8e-53
 Identities = 125/315 (39%), Positives = 176/315 (55%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
             +++W+ +H K  +   + + R + F  N R I+ HN+ N T+K+GLN+F+D++  E +  
Sbjct:    37 FKSWMVQHQKKYSSEEY-QHRLRTFVGNWRKINAHNAGNHTFKMGLNQFSDMSFAEIKRK 95

Query:   112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGA-VNPVKDQGSCGSCWAFST 170
             YL +                 Y    G   P  VDWR+KG  V+PVK+QG CGSCW FST
Sbjct:    96 YLWSEPQ------NCSATKGNYLRGTGP-YPPFVDWRKKGKFVSPVKNQGGCGSCWTFST 148

Query:   171 VAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYP 229
               A+E    I TG+L+SL+EQ+LVDC +  N  GC GGL   AF++I  N G+  E  YP
Sbjct:   149 TGALESAIAIKTGKLLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYIRYNRGIMGEDSYP 208

Query:   230 YLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHY 286
             Y G +  C   PS+  A V  +    +++  DE ++ +AVA   PVS A E  G  F  Y
Sbjct:   209 YKGQDGDCKFQPSKAIAFVKDV---ANITINDEQAMVEAVALFNPVSFAFEVTGD-FMMY 264

Query:   287 ESGVFTG-ECGSALD---HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
               GV++   C    D   H V+AVGYG +NGV YW+V+NSWG  WG +GY  ++R     
Sbjct:   265 RKGVYSSTSCHKTPDKVNHAVLAVGYGEQNGVPYWIVKNSWGPQWGMHGYFLIERG---- 320

Query:   343 NTGKCGIAMEASYPV 357
                 CG+A  ASYP+
Sbjct:   321 -KNMCGLAACASYPI 334


>UNIPROTKB|F7BJD8 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9796 "Equus
            caballus" [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=ISS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0004252
            "serine-type endopeptidase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005829 "cytosol" evidence=ISS] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0010815 "bradykinin catabolic
            process" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0016505 "apoptotic protease
            activator activity" evidence=ISS] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=ISS] [GO:0030335
            "positive regulation of cell migration" evidence=ISS] [GO:0031638
            "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0032526 "response to retinoic
            acid" evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            GO:GO:0043066 GO:GO:0005615 GO:GO:0008284 GO:GO:0070371
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458
            OMA:STSCHKT GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129
            Ensembl:ENSECAT00000013967 Uniprot:F7BJD8
        Length = 305

 Score = 548 (198.0 bits), Expect = 6.3e-53, P = 6.3e-53
 Identities = 121/313 (38%), Positives = 173/313 (55%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
             +++W+ +H K  +   ++  R Q F  N R I+ HN+ N T+++GLN+F+ +   E +  
Sbjct:     5 FKSWMVQHQKKYSSEEYHH-RLQTFVSNWRKINAHNTGNHTFRMGLNQFSAMNFAELKHK 63

Query:   112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGA-VNPVKDQGSCGSCWAFST 170
             YL +                 Y   AG   P SVDWR+KG  V+PVK+QG CGSCW FST
Sbjct:    64 YLWSEPQ------NCSATKGNYLRGAGP-YPPSVDWRKKGNFVSPVKNQGGCGSCWTFST 116

Query:   171 VAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYP 229
               A+E    I +G+L+SL+EQ+LVDC +  N  GC GGL   AF++I  N G+  E  YP
Sbjct:   117 TGALESAVAIASGKLLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYP 176

Query:   230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYES 288
             Y G +  C   + N  +  +    +++  DE ++ +AVA   PVS A E     F  Y  
Sbjct:   177 YKGQDGDCK-FQPNKAIAFVKDVANITLNDEKAMVEAVALYNPVSFAFEVT-EDFMMYRK 234

Query:   289 GVFTG-ECGSALD---HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
             G+++   C    D   H V+AVGYG ENG+ YW+V+NSWG  WG NGY  ++R       
Sbjct:   235 GIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPHWGMNGYFLIERG-----K 289

Query:   345 GKCGIAMEASYPV 357
               CG+A  ASYP+
Sbjct:   290 NMCGLAACASYPI 302


>UNIPROTKB|G3SSC1 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9785
            "Loxodonta africana" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 RefSeq:XP_003413898.1
            Ensembl:ENSLAFT00000003415 GeneID:100662496 Uniprot:G3SSC1
        Length = 335

 Score = 547 (197.6 bits), Expect = 8.0e-53, P = 8.0e-53
 Identities = 120/313 (38%), Positives = 176/313 (56%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
             +Q+W+A+H K  +   +++++ Q F  N R I+ HN+ N T+K+ LN+F+D+T  E +  
Sbjct:    35 FQSWMAQHQKKYSSEEYHQRQ-QTFVSNWRKINAHNARNHTFKMALNQFSDMTFAEIKQK 93

Query:   112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGA-VNPVKDQGSCGSCWAFST 170
             YL +                 Y    G   P  VDWR+KG  V+PVK+QG+CGSCW FST
Sbjct:    94 YLWSEPQ------NCSATKGNYLRGTGP-YPPFVDWRKKGHFVSPVKNQGACGSCWTFST 146

Query:   171 VAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYP 229
               A+E    I  G+L+SL+EQ+LVDC +  N  GC GGL   AF++I+ N G+  E  YP
Sbjct:   147 TGALESAIAIAGGKLLSLAEQQLVDCAKDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYP 206

Query:   230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYES 288
             Y G ++ C    + A +  +    +++  DE ++ +AVA   PVS A E     F  Y  
Sbjct:   207 YKGQDDVCKFQPKKA-IAFVKDVANITLNDEEAMVEAVALYNPVSFAFEVTDD-FMKYSK 264

Query:   289 GVFTG-ECGSALD---HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
             G+++   C    D   H V+AVGYG E G+ YW+V+NSWG  WG +GY  ++R       
Sbjct:   265 GIYSSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPYWGMDGYFLIERG-----K 319

Query:   345 GKCGIAMEASYPV 357
               CG+A  ASYP+
Sbjct:   320 NMCGLAACASYPI 332


>UNIPROTKB|J9P7C5 [details] [associations]
            symbol:J9P7C5 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 EMBL:AAEX03010953
            Ensembl:ENSCAFT00000012925 Uniprot:J9P7C5
        Length = 321

 Score = 542 (195.9 bits), Expect = 2.7e-52, P = 2.7e-52
 Identities = 131/315 (41%), Positives = 176/315 (55%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNRT-YKVGLNKFADLTNEE 107
             YQ W A H +   GM     R  +++ N++ I+ HN   S  +  + + +N F D+TNEE
Sbjct:    25 YQ-WKAMHRRLY-GMNEEGWRRAVWEKNMKMIELHNREYSQGKHGFTMAMNAFGDMTNEE 82

Query:   108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
             +R +  G ++   +   K KV  +        E+P+SVDWREKG V PVK+QG CGSCWA
Sbjct:    83 FRQVINGFQNQKHK---KGKVFQEPLFA----EIPKSVDWREKGYVTPVKNQGQCGSCWA 135

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
             FS   A EG     TG L+ LSEQ L       N GCNGGLMD AFQ++  N  +DSE+ 
Sbjct:   136 FSATGAFEGQMFWKTGNLVPLSEQNLAQG----NEGCNGGLMDNAFQYVKDNRCLDSEES 191

Query:   228 YPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHY 286
             YPYLG +      +         G+ D+ P  E +L KA+A    ++VAI+AG + FQ Y
Sbjct:   192 YPYLGRDTDTCNYKPECSAAHDSGFVDL-PQREKALMKAMATLGSITVAIDAGHQYFQFY 250

Query:   287 ESGV-FTGECGSA-LDHGVVAVGYGTENGVDY---WLVRNSWGSDWGENGYVKLQRNLLD 341
             +S + F  +C S  LDHGV+ VGYG E G D    W+V+NSW  +WG N YVK+ +    
Sbjct:   251 KSSIYFDPDCSSKDLDHGVLVVGYGFE-GTDSNNKWIVKNSWSPEWGWNSYVKMAKG--- 306

Query:   342 TNTGKCGIAMEASYP 356
                  CGI   ASYP
Sbjct:   307 -QNNHCGITA-ASYP 319


>ZFIN|ZDB-GENE-050208-336 [details] [associations]
            symbol:ctskl "cathepsin K, like" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-050208-336 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:BX465190
            GeneTree:ENSGT00660000095458 IPI:IPI00491185 RefSeq:XP_695425.1
            UniGene:Dr.110795 Ensembl:ENSDART00000062749 GeneID:567046
            KEGG:dre:567046 CTD:567046 NextBio:20888499 Bgee:F1QCP8
            Uniprot:F1QCP8
        Length = 349

 Score = 541 (195.5 bits), Expect = 3.5e-52, P = 3.5e-52
 Identities = 123/323 (38%), Positives = 183/323 (56%)

Query:    44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNRT-YKVGLNK 99
             +++E  T +  W  KH  + +    +  R  I++ N++ I ++N   S   + +K+ +NK
Sbjct:    33 SEEEAPTEWNLWKKKHEISYDEESEDVHRKTIWETNMQKIWKNNNDFSFGLSMFKMAMNK 92

Query:   100 FADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELP-ESVDWREKGAVNPVKD 158
             + DLT+ EY+ + LG++        K K+ S +        L   ++D+R KG V  VKD
Sbjct:    93 YGDLTSVEYKRL-LGSKIKGTGN-RKGKITSAQMLRLNAKRLGVTNIDYRAKGYVTEVKD 150

Query:   159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFII 217
             QG CGSCW+FST  A+EG     TG L+SLSEQ+LVDC R     GC+G  M  A+ ++I
Sbjct:   151 QGYCGSCWSFSTTGAIEGQMYKHTGRLVSLSEQQLVDCSRSYGTYGCSGAWMANAYDYVI 210

Query:   218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAI 276
              N  ++S   YPY   + +     +N  +  I  Y  V   +E +L  AVA   PVSVAI
Sbjct:   211 NNA-LESSDTYPYTSVDTQPCFYEKNLAMAGISDYRFVPAGNEQALADAVATVGPVSVAI 269

Query:   277 EAGGRAFQHYESGVFT-GECG-SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVK 334
             +A   +F  Y SG++    C  + L+H V+ VGYG+E G DYW+++NSWG+ WGE GY++
Sbjct:   270 DADNPSFLFYSSGIYKESNCNPNNLNHAVLVVGYGSEEGTDYWIIKNSWGTGWGEGGYMR 329

Query:   335 LQRNLLDTNTGKCGIAMEASYPV 357
             + RN    NT  CGIA  A YP+
Sbjct:   330 MIRN--GKNT--CGIASYALYPI 348


>UNIPROTKB|Q90686 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9031 "Gallus gallus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            CTD:1513 KO:K01371 PANTHER:PTHR12411:SF55 EMBL:U37691
            IPI:IPI00575213 RefSeq:NP_990302.1 UniGene:Gga.51509
            ProteinModelPortal:Q90686 SMR:Q90686 MEROPS:C01.036 GeneID:395818
            KEGG:gga:395818 NextBio:20815886 Uniprot:Q90686
        Length = 334

 Score = 540 (195.1 bits), Expect = 4.4e-52, P = 4.4e-52
 Identities = 116/268 (43%), Positives = 158/268 (58%)

Query:    92 TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKG 151
             ++++ +N   D+T+EE      G R    R      +    ++ +A    P +VDWR KG
Sbjct:    75 SFQLAMNYLGDMTSEEVVRTMTGLRVPRSRPRPNGTLYVPDWSSRA----PAAVDWRRKG 130

Query:   152 AVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDY 211
              V PVKDQG CGSCWAFS+V A+EG  K  TG+L+SLS Q LV C    N GC GG M  
Sbjct:   131 YVTPVKDQGQCGSCWAFSSVGALEGQLKRRTGKLLSLSPQNLVYCVSN-NNGCGGGYMTN 189

Query:   212 AFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-Q 270
             AF+++  N G+DSE  YPY+G +  C  S    K     GY ++   +E +LK+AVA   
Sbjct:   190 AFEYVRLNRGIDSEDAYPYIGQDESCMYSP-TGKAAKCRGYREIPEDNEKALKRAVARIG 248

Query:   271 PVSVAIEAGGRAFQHYESGVF--TGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWG 328
             PVSV I+A   +FQ Y  GV+  TG     ++H V+AVGYG + G  +W+++NSWG++WG
Sbjct:   249 PVSVGIDASLPSFQFYSRGVYYDTGCNPENINHAVLAVGYGAQKGTKHWIIKNSWGTEWG 308

Query:   329 ENGYVKLQRNLLDTNTGKCGIAMEASYP 356
               GYV L RN+  T    CGIA  AS+P
Sbjct:   309 NKGYVLLARNMKQT----CGIANLASFP 332


>UNIPROTKB|P83443 [details] [associations]
            symbol:P83443 "Macrodontain-1" species:203992 "Pseudananas
            sagenarius" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0006508 "proteolysis" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197
            ProteinModelPortal:P83443 SMR:P83443 MEROPS:C01.028 Uniprot:P83443
        Length = 213

 Score = 540 (195.1 bits), Expect = 4.4e-52, P = 4.4e-52
 Identities = 97/217 (44%), Positives = 148/217 (68%)

Query:   141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
             +P+S+DWR+ GAVN VK+QG CG CWAF+ +A VEGI KI  G L+ LSEQE++DC   +
Sbjct:     2 VPQSIDWRDYGAVNEVKNQGPCGGCWAFAAIATVEGIYKIRKGNLVYLSEQEVLDC--AV 59

Query:   201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR-RNAKVVSIDGYEDVSPFD 259
             + GC GG ++ A+ FII N G+ ++++YPY   +  C+ +   N+  ++  GY  V   D
Sbjct:    60 SYGCKGGWVNRAYDFIISNNGVTTDENYPYRAYQGTCNANYFPNSAYIT--GYSYVRRND 117

Query:   260 EMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLV 319
             E  +  AV++QP++  I+A G  FQ+Y+ GV++G CG +L+H +  +GYG ++   YW+V
Sbjct:   118 ESHMMYAVSNQPIAALIDASGDNFQYYKGGVYSGPCGFSLNHAITIIGYGRDS---YWIV 174

Query:   320 RNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
             RNSWGS WG+ GYV+++R++  +  G CGIAM   +P
Sbjct:   175 RNSWGSSWGQGGYVRIRRDVSHSG-GVCGIAMSPLFP 210


>UNIPROTKB|F1NZ37 [details] [associations]
            symbol:LOC420160 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 EMBL:AADN02062018
            IPI:IPI00587784 Ensembl:ENSGALT00000006765 OMA:CGVANQA
            Uniprot:F1NZ37
        Length = 340

 Score = 538 (194.4 bits), Expect = 7.2e-52, P = 7.2e-52
 Identities = 119/320 (37%), Positives = 185/320 (57%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNKFADLTNEE 107
             ++ W + + K   G     +R +++++NLR I++HN   S  + T+++G+N + DL +EE
Sbjct:    34 WERWKSLYAKEYPGEAELIRR-EVWENNLRRIEQHNWEESQGQHTFRLGMNHYGDLMDEE 92

Query:   108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
             +  +  G         ++ +  +  +   A  + P  VDWR +G V PVK+QG CGSCWA
Sbjct:    93 FNQLLNGFAP------VQHEEPALTFQASAAQKTPAEVDWRMRGYVTPVKNQGHCGSCWA 146

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
             FS   A+EG+    TG+L  LSEQ L+DC  K+ N GC GG M  AFQ++  NGGM+SE 
Sbjct:   147 FSATGALEGLVFNWTGKLAVLSEQNLIDCSWKLGNNGCQGGYMTRAFQYVHDNGGMNSEH 206

Query:   227 DYPYLGAE-NKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRA 282
              YPY   + + C  +P+ R A   ++     V+   E +L++AVA   PVSVA++A    
Sbjct:   207 IYPYQATDTSSCRYNPADRAANCSTV---WLVAQGSEAALEQAVATVGPVSVAVDASSFF 263

Query:   283 FQHYESGVFTGE-CGSALDHGVVAVGYG----TENGVDYWLVRNSWGSDWGENGYVKLQR 337
             F  Y+SG+F    C   ++HG++AVGYG        V YW+++NSW   WGE GY++L +
Sbjct:   264 FHFYKSGIFNSMFCSQKVNHGMLAVGYGISQEARKNVSYWILKNSWSEVWGEKGYIRLLK 323

Query:   338 NLLDTNTGKCGIAMEASYPV 357
              +       CG+A +AS+P+
Sbjct:   324 GV----NNHCGVANQASFPL 339


>ZFIN|ZDB-GENE-030131-9831 [details] [associations]
            symbol:ctsf "cathepsin F" species:7955 "Danio
            rerio" [GO:0004869 "cysteine-type endopeptidase inhibitor activity"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000010 InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00031 Pfam:PF00112 PRINTS:PR00705 SMART:SM00043
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-030131-9831
            GO:GO:0004869 eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 HOVERGEN:HBG011513 CTD:8722 OrthoDB:EOG4CC41T
            MEROPS:I25.006 EMBL:BC124243 IPI:IPI00503226 RefSeq:NP_001071036.1
            UniGene:Dr.81265 ProteinModelPortal:Q08CH0 SMR:Q08CH0 GeneID:565588
            KEGG:dre:565588 InParanoid:Q08CH0 NextBio:20885952
            ArrayExpress:Q08CH0 Uniprot:Q08CH0
        Length = 473

 Score = 538 (194.4 bits), Expect = 7.2e-52, P = 7.2e-52
 Identities = 125/328 (38%), Positives = 180/328 (54%)

Query:    38 HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVG 96
             HS   +   E++T+++ ++  + +T +     EKR +IF+ N++      SL + + + G
Sbjct:   161 HSKPMKESVELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYG 220

Query:    97 LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
             + KF+DLT +E+R MYL     ++  L K      + A  A    P++ DWR+ GAV+PV
Sbjct:   221 ITKFSDLTEDEFRMMYLNPML-SQWSLKKE----MKPAIPASAPAPDTWDWRDHGAVSPV 275

Query:   157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFI 216
             K+QG CGSCWAFS    +EG     TG+L+SLSEQELVDCD K++  C GGL   A++ I
Sbjct:   276 KNQGMCGSCWAFSVTGNIEGQWFKKTGQLLSLSEQELVDCD-KLDQACGGGLPSNAYEAI 334

Query:   217 IQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVA 275
                GG+++E DY Y G +  CD S    KV +        P DE  +   +A+  PVS A
Sbjct:   335 ENLGGLETETDYSYTGHKQSCDFS--TGKVAAYINSSVELPKDEKEIAAFLAENGPVSAA 392

Query:   276 IEAGGRAFQHYESGV---FTGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENG 331
             + A   A Q Y  GV       C    +DH V+ VG+G  NGV +W ++NSWG D+GE G
Sbjct:   393 LNAF--AMQFYRKGVSHPLKIFCNPWMIDHAVLLVGFGQRNGVPFWAIKNSWGEDYGEQG 450

Query:   332 YVKLQRNLLDTNTGKCGIAMEASYPVKN 359
             Y  L R      +G CGI    S  + N
Sbjct:   451 YYYLYRG-----SGLCGIHKMCSSAIVN 473


>ZFIN|ZDB-GENE-040426-1583 [details] [associations]
            symbol:ctssa "cathepsin S, a" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-040426-1583
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            GeneTree:ENSGT00660000095458 EMBL:CR548627 IPI:IPI00491948
            UniGene:Dr.81560 SMR:Q1L8W8 Ensembl:ENSDART00000053638 OMA:RNTREER
            OrthoDB:EOG480HX9 Uniprot:Q1L8W8
        Length = 328

 Score = 537 (194.1 bits), Expect = 9.2e-52, P = 9.2e-52
 Identities = 122/313 (38%), Positives = 172/313 (54%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKFADLTNEE 107
             + TW ++H KT         R  ++K NL+ I  HN        +Y +GLN+ +D+T +E
Sbjct:    27 WTTWKSQHNKTYRNTREERLRRSVWKQNLQDILLHNEAAAVGLHSYTLGLNQLSDMTADE 86

Query:   108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
                M      D           +  ++  +   LP+ V+W E G V+PV++QG CGSCWA
Sbjct:    87 VNDMNGLLEEDFPD-------VNATFSPPSLQTLPQRVNWTEHGMVSPVQNQGPCGSCWA 139

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
             FS V ++E   K  T  L+ LS Q L+DC   + N GC GG +  AF ++IQN G+DS  
Sbjct:   140 FSAVGSLEAQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRAFLYVIQNRGIDSST 199

Query:   227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQH 285
              YPY   E  C  S  + +     G+  V   +E +L+ AVA+  PVSV I A   +F  
Sbjct:   200 FYPYEHKEGVCRYSV-SGRAGYCTGFRIVPRHNEAALQSAVANIGPVSVGINAKLLSFHR 258

Query:   286 YESGVFTG-ECGSAL-DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTN 343
             Y SG++   +C SAL +H V+ VGYG+ENG DYWLV+NSWG+ WGENGY+++ RN     
Sbjct:   259 YRSGIYNDPKCSSALINHAVLVVGYGSENGQDYWLVKNSWGTAWGENGYIRMARN----- 313

Query:   344 TGKCGIAMEASYP 356
                CGI+    YP
Sbjct:   314 KNMCGISSFGIYP 326


>UNIPROTKB|F6X9C1 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00660000095458
            OMA:STSCHKT Ensembl:ENSCAFT00000036196 EMBL:AAEX03002388
            Uniprot:F6X9C1
        Length = 305

 Score = 536 (193.7 bits), Expect = 1.2e-51, P = 1.2e-51
 Identities = 123/315 (39%), Positives = 173/315 (54%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
             +++W  +H K  +   + + R Q F  N R I+ HN+ N T+K+GLN+F+D+   E +  
Sbjct:     5 FKSWAVQHQKKYSSEEYLQ-RLQTFVGNWRKINAHNAGNHTFKMGLNQFSDMNFAEIKHK 63

Query:   112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGA-VNPVKDQGSCGSCWAFST 170
             YL +                 Y    G   P  VDWR+KG  V+PVK+QGSCGSCW FST
Sbjct:    64 YLWSEPQ------NCSATKGNYLRGTGP-YPPFVDWRKKGKFVSPVKNQGSCGSCWTFST 116

Query:   171 VAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYP 229
               A+E    I +G+L+SL+EQ+LVDC +  N  GC GG    AF++I  N G+  E  YP
Sbjct:   117 TGALESAIAIKSGKLLSLAEQQLVDCAQNFNNHGCQGGAPLQAFEYIRYNKGIMGEDSYP 176

Query:   230 YLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHY 286
             Y G +  C   PS+  A V  +    +++  DE ++ +AVA   PVS A E     F  Y
Sbjct:   177 YKGQDGDCKYQPSKAIAFVKDV---ANITINDEQAMVEAVALYNPVSFAFEVTSD-FMMY 232

Query:   287 ESGVFTG-ECGSALD---HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
               G+++   C    D   H V+AVGYG +NG+ YW+V+NSWG  WG NGY  ++R     
Sbjct:   233 RKGIYSSTSCHKTPDKVNHAVLAVGYGEQNGIPYWIVKNSWGPQWGMNGYFLMERG---- 288

Query:   343 NTGKCGIAMEASYPV 357
                 CG+A  ASYP+
Sbjct:   289 -KNMCGLAACASYPI 302


>TAIR|locus:2050145 [details] [associations]
            symbol:AT2G21430 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002685 GenomeReviews:CT485783_GR
            EMBL:AC006841 EMBL:X74359 IPI:IPI00519637 PIR:B84601
            RefSeq:NP_565512.1 UniGene:At.14069 ProteinModelPortal:P43295
            SMR:P43295 MEROPS:C01.A04 PRIDE:P43295 EnsemblPlants:AT2G21430.1
            GeneID:816682 KEGG:ath:AT2G21430 TAIR:At2g21430 eggNOG:COG4870
            HOGENOM:HOG000230774 InParanoid:P43295 KO:K01373 OMA:GSIEEHY
            PhylomeDB:P43295 ProtClustDB:CLSN2688311 Genevestigator:P43295
            GermOnline:AT2G21430 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 Uniprot:P43295
        Length = 361

 Score = 531 (192.0 bits), Expect = 4.0e-51, P = 4.0e-51
 Identities = 115/294 (39%), Positives = 167/294 (56%)

Query:    58 KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS 117
             K GK    +  +  RF +FK NL     H  ++ + + G+ +F+DLT  E+R  +LG + 
Sbjct:    54 KFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRSEFRRKHLGVKG 113

Query:   118 DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGI 177
               K  L K   A+Q         LPE  DWR++GAV PVK+QGSCGSCW+FST  A+EG 
Sbjct:   114 GFK--LPKD--ANQAPILPT-QNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGA 168

Query:   178 NKIVTGELISLSEQELVDCDRKIN--------AGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
             + + TG+L+SLSEQ+LVDCD + +        +GCNGGLM+ AF++ ++ GG+  E+DYP
Sbjct:   169 HFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMREKDYP 228

Query:   230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
             Y G +       R+  V S+  +  VS  ++      + + P++VAI A     Q Y  G
Sbjct:   229 YTGTDGGSCKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAA--YMQTYIGG 286

Query:   290 VFTGE-CGSALDHGVVAVGYGTENGVD-------YWLVRNSWGSDWGENGYVKL 335
             V     C   L+HGV+ VGYG+            YW+++NSWG  WGENG+ K+
Sbjct:   287 VSCPYICSRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYKI 340


>UNIPROTKB|H9KYW5 [details] [associations]
            symbol:CTSS "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0002250 "adaptive immune response" evidence=IEA]
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 OMA:YEPACTQ EMBL:AADN02010496
            Ensembl:ENSGALT00000001122 Uniprot:H9KYW5
        Length = 245

 Score = 529 (191.3 bits), Expect = 6.5e-51, P = 6.5e-51
 Identities = 105/218 (48%), Positives = 140/218 (64%)

Query:   142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI- 200
             P+++DWREKG V  VK+QG+CG+CWAFS V A+E   K+ TG+L+SLS Q LVDC     
Sbjct:    31 PDAMDWREKGCVTEVKNQGACGACWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSMMYG 90

Query:   201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
             N GC GG M  AFQ+II N G+DSE+ YPY+     C  +  + +  +   Y ++   DE
Sbjct:    91 NKGCGGGFMTRAFQYIIDNNGIDSEESYPYMAQNGTCQYNV-STRAATCSKYVELPYADE 149

Query:   261 MSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFTG-ECGSALDHGVVAVGYGTENGVDYWL 318
              +LK AVA+  PVSVAI+A    F  Y SGV+    C   ++HGV+ VGYGT N  D+WL
Sbjct:   150 AALKDAVANVGPVSVAIDATQPTFFLYRSGVYDDPRCTQEVNHGVLVVGYGTLNEKDFWL 209

Query:   319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
             V+NSWG  +G+ GY+++ RN    +   CGIA  ASYP
Sbjct:   210 VKNSWGERFGDGGYIRMSRN----HANHCGIASYASYP 243


>RGD|708447 [details] [associations]
            symbol:Testin "testin gene" species:10116 "Rattus norvegicus"
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0030054 "cell junction" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 RGD:708447 GO:GO:0005576 GO:GO:0030054 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            MEROPS:C01.972 OMA:RYHAENS OrthoDB:EOG4XWG0N EMBL:U16858
            IPI:IPI00207173 PIR:I52525 PIR:PC1251 RefSeq:NP_775155.1
            UniGene:Rn.10029 ProteinModelPortal:P15242 SMR:P15242
            Ensembl:ENSRNOT00000024467 GeneID:286916 KEGG:rno:286916
            UCSC:RGD:708447 CTD:286916 InParanoid:P15242 NextBio:625036
            Genevestigator:P15242 GermOnline:ENSRNOG00000018028 Uniprot:P15242
        Length = 333

 Score = 527 (190.6 bits), Expect = 1.1e-50, P = 1.1e-50
 Identities = 119/327 (36%), Positives = 180/327 (55%)

Query:    44 TDDEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN----SLNRTYKVGLN 98
             T D  + + +  W  KHGKT N M     +  +++ N + I+ HN         + + +N
Sbjct:    20 TPDPSLDVEWNEWRTKHGKTYN-MNEERLKRAVWEKNFKMIELHNWEYLEGRHDFTMAMN 78

Query:    99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
              F DLTN E+  M  G +   ++++ K+ +            +P+ VDWR+ G V PVK+
Sbjct:    79 AFGDLTNIEFVKMMTGFQ---RQKIKKTHIFQDHQFLY----VPKRVDWRQLGYVTPVKN 131

Query:   159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDC-DRKINAGCNGGLMDYAFQFII 217
             QG C S WAFS   ++EG     T  LI LSEQ L+DC    +  GC+GG M YAFQ++ 
Sbjct:   132 QGHCASSWAFSATGSLEGQMFRKTERLIPLSEQNLLDCMGSNVTHGCSGGFMQYAFQYVK 191

Query:   218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAI 276
              NGG+ +E+ YPY G   +C     N+   ++  +  + P  E +L KAVA   P+SVA+
Sbjct:   192 DNGGLATEESYPYRGQGRECRYHAENS-AANVRDFVQI-PGSEEALMKAVAKVGPISVAV 249

Query:   277 EAGGRAFQHYESGVF-TGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGEN 330
             +A   +FQ Y SG++   +C    L+H V+ VGYG E    +G  +WLV+NSWG +WG  
Sbjct:   250 DASHGSFQFYGSGIYYEPQCKRVHLNHAVLVVGYGFEGEESDGNSFWLVKNSWGEEWGMK 309

Query:   331 GYVKLQRNLLDTNTGKCGIAMEASYPV 357
             GY+KL ++     +  CGIA  ++YP+
Sbjct:   310 GYMKLAKDW----SNHCGIATYSTYPI 332


>MGI|MGI:1922258 [details] [associations]
            symbol:4930486L24Rik "RIKEN cDNA 4930486L24 gene"
            species:10090 "Mus musculus" [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030054 "cell
            junction" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 MGI:MGI:1922258
            GO:GO:0005576 GO:GO:0030054 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 HSSP:P07711
            EMBL:AY146988 EMBL:AK145933 EMBL:BC061218 IPI:IPI00280732
            RefSeq:NP_835199.1 UniGene:Mm.19839 ProteinModelPortal:Q80UB0
            SMR:Q80UB0 MEROPS:C01.972 PRIDE:Q80UB0 Ensembl:ENSMUST00000091569
            GeneID:214639 KEGG:mmu:214639 UCSC:uc007qvs.1 InParanoid:Q80UB0
            OMA:RYHAENS OrthoDB:EOG4XWG0N NextBio:374408 Bgee:Q80UB0
            CleanEx:MM_4930486L24RIK Genevestigator:Q80UB0 Uniprot:Q80UB0
        Length = 333

 Score = 524 (189.5 bits), Expect = 2.2e-50, P = 2.2e-50
 Identities = 122/320 (38%), Positives = 176/320 (55%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQ--IFKDNLRFIDEHN----SLNRTYKVGLNKFADLTN 105
             +  W  KHGK  N    NE+R +  +++ N + I+ HN         + + +N F DLTN
Sbjct:    29 WNEWRTKHGKAYNV---NEERLRRAVWEKNFKMIELHNWEYLEGKHDFTMTMNAFGDLTN 85

Query:   106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
              E+  M  G R    +R+   +     Y       +P+ VDWR  G V PVK+QG C S 
Sbjct:    86 TEFVKMMTGFRRQKIKRMHVFQDHQFLY-------VPKYVDWRMLGYVTPVKNQGYCASS 138

Query:   166 WAFSTVAAVEGINKIVTGELISLSEQELVDC-DRKINAGCNGGLMDYAFQFIIQNGGMDS 224
             WAFS   ++EG     TG L+ LSEQ L+DC    +   C+GG M  AFQ++  NGG+ +
Sbjct:   139 WAFSATGSLEGQMFKKTGRLVPLSEQNLLDCMGSNVTHDCSGGFMQNAFQYVKDNGGLAT 198

Query:   225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAF 283
             E+ YPY+G   KC     N+   ++  +  + P  E +L KAVA   P+SVA++A   +F
Sbjct:   199 EESYPYIGPGRKCRYHAENS-AANVRDFVQI-PGREEALMKAVAKVGPISVAVDASHDSF 256

Query:   284 QHYESGVF-TGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQR 337
             Q Y+SG++   +C    L+H V+ VGYG E    +G  YWLV+NSWG +WG  GY+K+ +
Sbjct:   257 QFYDSGIYYEPQCKRVHLNHAVLVVGYGFEGEESDGNSYWLVKNSWGEEWGMKGYIKIAK 316

Query:   338 NLLDTNTGKCGIAMEASYPV 357
                D N   CGIA  A+YP+
Sbjct:   317 ---DWNN-HCGIATLATYPI 332


>FB|FBgn0250848 [details] [associations]
            symbol:26-29-p "26-29kD-proteinase" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005811
            "lipid particle" evidence=IDA] [GO:0005875 "microtubule associated
            complex" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005875 EMBL:AE014296 GO:GO:0005811 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 MEROPS:I29.003 HSSP:O65039
            EMBL:AY122222 EMBL:AB011376 RefSeq:NP_620470.1 UniGene:Dm.3049
            SMR:Q9V3U6 MINT:MINT-890485 STRING:Q9V3U6
            EnsemblMetazoa:FBtr0075766 GeneID:39547 KEGG:dme:Dmel_CG8947
            UCSC:CG8947-RA CTD:39547 FlyBase:FBgn0250848 InParanoid:Q9V3U6
            OMA:IHSKNRA OrthoDB:EOG4BVQ8T GenomeRNAi:39547 NextBio:814210
            Uniprot:Q9V3U6
        Length = 549

 Score = 523 (189.2 bits), Expect = 2.8e-50, P = 2.8e-50
 Identities = 120/321 (37%), Positives = 175/321 (54%)

Query:    44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADL 103
             TD+ V   +  +  KHG   +    +E R  IF+ NLR+I   N    TY + +N  AD 
Sbjct:   237 TDEHVDKAFHHFKRKHGVAYHSDTEHEHRKNIFRQNLRYIHSKNRAKLTYTLAVNHLADK 296

Query:   104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
             T EE +A   G +S     +  +         K  DE+P+  DWR  GAV PVKDQ  CG
Sbjct:   297 TEEELKARR-GYKSSG---IYNTGKPFPYDVPKYKDEIPDQYDWRLYGAVTPVKDQSVCG 352

Query:   164 SCWAFSTVAAVEGINKIVTG-ELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGG 221
             SCW+F T+  +EG   +  G  L+ LS+Q L+DC     N GC+GG     +Q+++Q+GG
Sbjct:   353 SCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDCSWAYGNNGCDGGEDFRVYQWMLQSGG 412

Query:   222 MDSEQDY-PYLGAENKCDPSRRNAKVVS-IDGYEDVSPFDEMSLKKAVADQ-PVSVAIEA 278
             + +E++Y PYLG +  C  +  N  +V+ I G+ +V+  D  + K A+    P+SVAI+A
Sbjct:   413 VPTEEEYGPYLGQDGYCHVN--NVTLVAPIKGFVNVTSNDPNAFKLALLKHGPLSVAIDA 470

Query:   279 GGRAFQHYESGVF-TGECGS---ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVK 334
               + F  Y  GV+    C +    LDH V+AVGYG+ NG DYWLV+NSW + WG +GY+ 
Sbjct:   471 SPKTFSFYSHGVYYEPTCKNDVDGLDHAVLAVGYGSINGEDYWLVKNSWSTYWGNDGYI- 529

Query:   335 LQRNLLDTNTGKCGIAMEASY 355
                 L+      CG+    +Y
Sbjct:   530 ----LMSAKKNNCGVMTMPTY 546


>FB|FBgn0260462 [details] [associations]
            symbol:CG12163 species:7227 "Drosophila melanogaster"
            [GO:0035071 "salivary gland cell autophagic cell death"
            evidence=IEP] [GO:0048102 "autophagic cell death" evidence=IEP]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0004869 "cysteine-type
            endopeptidase inhibitor activity" evidence=IEA] [GO:0045169
            "fusome" evidence=IDA] [GO:0035220 "wing disc development"
            evidence=IGI] [GO:0022416 "chaeta development" evidence=IGI]
            InterPro:IPR000010 InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00043 SMART:SM00645
            InterPro:IPR000169 EMBL:AE014297 GO:GO:0004869 eggNOG:COG4870
            KO:K01373 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0022416 GO:GO:0035220 GO:GO:0035071
            GO:GO:0045169 GeneTree:ENSGT00660000095458 EMBL:AY121614
            EMBL:BT003231 RefSeq:NP_649521.1 RefSeq:NP_730901.1
            RefSeq:NP_730902.2 UniGene:Dm.7315 ProteinModelPortal:Q9VN93
            SMR:Q9VN93 DIP:DIP-17491N IntAct:Q9VN93 MINT:MINT-763966
            STRING:Q9VN93 MEROPS:C01.A27 PaxDb:Q9VN93
            EnsemblMetazoa:FBtr0078823 GeneID:40628 KEGG:dme:Dmel_CG12163
            UCSC:CG12163-RA FlyBase:FBgn0260462 InParanoid:Q9VN93 OMA:GPRWGEQ
            OrthoDB:EOG4CC2G9 PhylomeDB:Q9VN93 GenomeRNAi:40628 NextBio:819744
            Bgee:Q9VN93 GermOnline:CG12163 Uniprot:Q9VN93
        Length = 614

 Score = 523 (189.2 bits), Expect = 2.8e-50, P = 2.8e-50
 Identities = 122/325 (37%), Positives = 181/325 (55%)

Query:    46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLT 104
             D+V  ++  +  + G+        + R +IF+ NL+ I+E N+    + K G+ +FAD+T
Sbjct:   302 DKVDHLFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMT 361

Query:   105 NEEYRAMY-LGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
             + EY+    L  R +AK     + V    +      ELP+  DWR+K AV  VK+QGSCG
Sbjct:   362 SSEYKERTGLWQRDEAKATGGSAAVVPAYHG-----ELPKEFDWRQKDAVTQVKNQGSCG 416

Query:   164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
             SCWAFS    +EG+  + TGEL   SEQEL+DCD   +A CNGGLMD A++ I   GG++
Sbjct:   417 SCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSA-CNGGLMDNAYKAIKDIGGLE 475

Query:   224 SEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK-AVADQPVSVAIEAGGRA 282
              E +YPY   +N+C    R    V + G+ D+   +E ++++  +A+ P+S+ I A   A
Sbjct:   476 YEAEYPYKAKKNQCH-FNRTLSHVQVAGFVDLPKGNETAMQEWLLANGPISIGINAN--A 532

Query:   283 FQHYESGV---FTGECGSA-LDHGVVAVGYGTEN------GVDYWLVRNSWGSDWGENGY 332
              Q Y  GV   +   C    LDHGV+ VGYG  +       + YW+V+NSWG  WGE GY
Sbjct:   533 MQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGY 592

Query:   333 VKLQRNLLDTNTGKCGIAMEASYPV 357
              ++ R     NT  CG++  A+  V
Sbjct:   593 YRVYRG---DNT--CGVSEMATSAV 612


>RGD|69241 [details] [associations]
            symbol:Ctsj "cathepsin J" species:10116 "Rattus norvegicus"
           [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
           evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
           evidence=IEA] [GO:0048471 "perinuclear region of cytoplasm"
           evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
           PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:L14776
           RGD:69241 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
           InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246
           SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
           GO:GO:0005764 HOVERGEN:HBG011513 MEROPS:C01.038 CTD:26898 KO:K09599
           EMBL:AF310623 EMBL:BC097263 IPI:IPI00205027 PIR:I58002
           RefSeq:NP_058817.1 UniGene:Rn.34875 ProteinModelPortal:Q63088
           SMR:Q63088 PRIDE:Q63088 GeneID:29174 KEGG:rno:29174 NextBio:608244
           Genevestigator:Q63088 Uniprot:Q63088
        Length = 334

 Score = 522 (188.8 bits), Expect = 3.6e-50, P = 3.6e-50
 Identities = 122/317 (38%), Positives = 179/317 (56%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVG----LNKFADLTNEE 107
             +Q W  K+ K+ + +    KR  ++++NL+ I  HN  N   K G    +N FAD T EE
Sbjct:    29 WQDWKTKYAKSYSPVEEELKR-AVWEENLKMIQLHNKENGLGKNGFTMEMNAFADTTGEE 87

Query:   108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
             +R     + SD    L+ + V +     +    LP   DWR++G V PV++QG CGSCWA
Sbjct:    88 FRK----SLSDI---LIPAAVTNPSAQKQVSIGLPNFKDWRKEGYVTPVRNQGKCGSCWA 140

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQ 226
             F+ V A+EG     TG L  LS Q L+DC +   N GC  G    AF ++++N G+++E 
Sbjct:   141 FAAVGAIEGQMFSKTGNLTPLSVQNLLDCSKSEGNNGCRWGTAHQAFNYVLKNKGLEAEA 200

Query:   227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQH 285
              YPY G +  C     NA   +I G+ ++ P +E+ L  AVA   PVS AI+A   +F+ 
Sbjct:   201 TYPYEGKDGPCRYHSENASA-NITGFVNLPP-NELYLWVAVASIGPVSAAIDASHDSFRF 258

Query:   286 YESGVF-TGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQRNL 339
             Y  GV+    C S  ++H V+ VGYG E    +G +YWL++NSWG +WG NG++K+ +  
Sbjct:   259 YSGGVYHEPNCSSYVVNHAVLVVGYGFEGNETDGNNYWLIKNSWGEEWGINGFMKIAK-- 316

Query:   340 LDTNTGKCGIAMEASYP 356
              D N   CGIA +AS+P
Sbjct:   317 -DRNN-HCGIASQASFP 331


>UNIPROTKB|Q4QRC2 [details] [associations]
            symbol:Ctsql2 "Protein Ctsql2" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 HOVERGEN:HBG011513 EMBL:CH474032
            RGD:1303225 EMBL:BC097257 IPI:IPI00421946 RefSeq:NP_001002813.2
            UniGene:Rn.128678 SMR:Q4QRC2 MEROPS:C01.111
            Ensembl:ENSRNOT00000038758 GeneID:408201 KEGG:rno:408201 CTD:408201
            InParanoid:Q4QRC2 OMA:NDEGALM NextBio:696394 Genevestigator:Q4QRC2
            Uniprot:Q4QRC2
        Length = 343

 Score = 520 (188.1 bits), Expect = 5.8e-50, P = 5.8e-50
 Identities = 122/321 (38%), Positives = 183/321 (57%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNKFADLTNEE 107
             +Q W  K+ K  +      KR  ++++N++ I+ HN   SL + TY + +N FADLT+EE
Sbjct:    29 WQEWKMKYEKLYSPEEELLKRV-VWEENVKKIELHNRENSLGKNTYIMEINNFADLTDEE 87

Query:   108 YRAMYLGTR---SDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
             ++ M  G     ++  + L K  + S    +    D LP+S+DWR++G V  V++QG C 
Sbjct:    88 FKDMITGITLPINNTMKSLWKRALGSPFPNSWYWRDALPKSIDWRKEGYVTRVREQGKCK 147

Query:   164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGM 222
             SCWAF    A+EG     TG+L  LS Q LVDC + + N GC GG    AFQ+++QNGG+
Sbjct:   148 SCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVLQNGGL 207

Query:   223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGR 281
             +SE  YPY G E  C  + +NA    I  +  + P DE  L  A+A + PV+  I     
Sbjct:   208 ESEATYPYKGKEGLCKYNPKNA-YAKITRFVAL-PEDEDVLMDALATKGPVAAGIHVVYS 265

Query:   282 AFQHYESGVF-TGECGSALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQ 336
             + + Y+ G++   +C + ++H V+ VGYG E    +G +YWL++NSWG  WG  GY+K+ 
Sbjct:   266 SLRFYKKGIYHEPKCNNRVNHAVLVVGYGFEGNETDGNNYWLIKNSWGKQWGLKGYMKIA 325

Query:   337 RNLLDTNTGKCGIAMEASYPV 357
             +   D N   CGIA  A YP+
Sbjct:   326 K---DRNN-HCGIATFAQYPI 342


>UNIPROTKB|E2RR02 [details] [associations]
            symbol:CTSF "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            OMA:LAPPEWD GeneTree:ENSGT00660000095458 EMBL:AAEX03011628
            Ensembl:ENSCAFT00000019742 Uniprot:E2RR02
        Length = 460

 Score = 519 (187.8 bits), Expect = 7.4e-50, P = 7.4e-50
 Identities = 123/318 (38%), Positives = 174/318 (54%)

Query:    47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTN 105
             ++ ++++ ++  + +T       E R  +F +N+    +  +L+R T + G+ KF+DLT 
Sbjct:   157 KMASVFKEFVTTYNRTYETKEEAEWRMSVFSNNMVRAQKIQALDRGTAQYGITKFSDLTE 216

Query:   106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESV-DWREKGAVNPVKDQGSCGS 164
             EE+R +YL         L +++    R A    D  P    DWR KGAV  VKDQG CGS
Sbjct:   217 EEFRTIYLNPL------LRENRGKKMRLAKSISDHAPPPEWDWRSKGAVTKVKDQGMCGS 270

Query:   165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
             CWAFS    VEG   +  G L+SLSEQEL+DCD K++  C GGL   A+  I+  GG+++
Sbjct:   271 CWAFSVTGNVEGQWFLKEGTLLSLSEQELLDCD-KVDKACLGGLPSNAYSAIMTLGGLET 329

Query:   225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAF 283
             E DY Y G    C  S + A+V   D  E +S  +E  L   +A + P+SVAI A G  F
Sbjct:   330 EDDYSYQGHLQACSFSAKKARVYINDSME-LSQ-NEQKLAAWLAKKGPISVAINAFGMQF 387

Query:   284 -QHYESGVFTGECGSAL-DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
              +H  S      C   L DH V+ VGYG  +G+ +W ++NSWG+DWGE GY  L R    
Sbjct:   388 YRHGISHPLRPLCSPWLIDHAVLLVGYGNRSGIPFWAIKNSWGTDWGEEGYYYLHRG--- 444

Query:   342 TNTGKCGIAMEASYPVKN 359
               +G CG+   AS  V N
Sbjct:   445 --SGACGVNTMASSAVVN 460


>UNIPROTKB|E9PSK9 [details] [associations]
            symbol:Ctsql2 "Protein Ctsql2" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            IPI:IPI00562656 Ensembl:ENSRNOT00000045847 RGD:1303225
            ArrayExpress:E9PSK9 Uniprot:E9PSK9
        Length = 342

 Score = 519 (187.8 bits), Expect = 7.4e-50, P = 7.4e-50
 Identities = 124/321 (38%), Positives = 183/321 (57%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNKFADLTNEE 107
             +Q W  K+ K  +      KR  ++++N++ I+ HN   SL + TY + +N FADLT+EE
Sbjct:    29 WQEWKMKYEKLYSPEEELLKRV-VWEENVKKIELHNRENSLGKNTYIMEINNFADLTDEE 87

Query:   108 YRAMYLGTR---SDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
             ++ M  G     ++  + L K  + S    +    D LP+S+DWR++G V  V++QG C 
Sbjct:    88 FKDMITGITLPINNTMKSLWKRALGSPFPNSWYWRDALPKSIDWRKEGYVTRVREQGKCK 147

Query:   164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGM 222
             SCWAF    A+EG     TG+L  LS Q LVDC + + N GC GG    AFQ+++QNGG+
Sbjct:   148 SCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVLQNGGL 207

Query:   223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGR 281
             +SE  YPY G E  C  + +NA    I  +  + P DE  L  A+A + PV+  I     
Sbjct:   208 ESEATYPYKGKEGLCKYNPKNA-YAKITRFVAL-PEDEDVLMDALATKGPVAAGIHVVYS 265

Query:   282 AFQHYESGVF-TGECGSALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQ 336
              F H+ SG++   +C + ++H V+ VGYG E    +G +YWL++NSWG  WG  GY+K+ 
Sbjct:   266 YF-HFVSGIYHEPKCNNRVNHAVLVVGYGFEGNETDGNNYWLIKNSWGKQWGLKGYMKIA 324

Query:   337 RNLLDTNTGKCGIAMEASYPV 357
             +   D N   CGIA  A YP+
Sbjct:   325 K---DRNN-HCGIATFAQYPI 341


>WB|WBGene00007055 [details] [associations]
            symbol:tag-196 species:6239 "Caenorhabditis elegans"
            [GO:0004869 "cysteine-type endopeptidase inhibitor activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000010
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00031 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00043 SMART:SM00645 InterPro:IPR000169
            GO:GO:0004869 eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 EMBL:FO080488 PIR:T31871
            RefSeq:NP_505215.2 HSSP:Q9UBX1 ProteinModelPortal:O16454 SMR:O16454
            DIP:DIP-27400N IntAct:O16454 MINT:MINT-1044990 MEROPS:C01.A50
            PaxDb:O16454 EnsemblMetazoa:F41E6.6.1 EnsemblMetazoa:F41E6.6.2
            EnsemblMetazoa:F41E6.6.3 GeneID:179240 KEGG:cel:CELE_F41E6.6
            UCSC:F41E6.6.1 CTD:179240 WormBase:F41E6.6 InParanoid:O16454
            OMA:GGGLMTN NextBio:904514 Uniprot:O16454
        Length = 477

 Score = 519 (187.8 bits), Expect = 7.4e-50, P = 7.4e-50
 Identities = 124/321 (38%), Positives = 175/321 (54%)

Query:    45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE-HNSLNRTYKVGLNKFADL 103
             D  +   +  ++ +H K         KRF++FK N + I E   +   T   G  KF+D+
Sbjct:   167 DYVIWNSFLDFVDRHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVYGFTKFSDM 226

Query:   104 TNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
             T  E++ + L  + +     M+     +       ++LPES DWREKGAV  VK+QG+CG
Sbjct:   227 TTMEFKKIMLPYQWEQPVYPMEQANFEKHDVTINEEDLPESFDWREKGAVTQVKNQGNCG 286

Query:   164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
             SCWAFST   VEG   I   +L+SLSEQELVDCD  ++ GCNGGL   A++ II+ GG++
Sbjct:   287 SCWAFSTTGNVEGAWFIAKNKLVSLSEQELVDCD-SMDQGCNGGLPSNAYKEIIRMGGLE 345

Query:   224 SEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK-AVADQPVSVAIEAGGRA 282
              E  YPY G    C   R++  V  I+G  ++ P DE+ ++K  V   P+S+ + A    
Sbjct:   346 PEDAYPYDGRGETCHLVRKDIAVY-INGSVEL-PHDEVEMQKWLVTKGPISIGLNAN--T 401

Query:   283 FQHYESGV---FTGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
              Q Y  GV   F   C    L+HGV+ VGYG +    YW+V+NSWG +WGE GY KL R 
Sbjct:   402 LQFYRHGVVHPFKIFCEPFMLNHGVLIVGYGKDGRKPYWIVKNSWGPNWGEAGYFKLYRG 461

Query:   339 LLDTNTGKCGIAMEASYPVKN 359
                 N   CG+   A+  + N
Sbjct:   462 ---KNV--CGVQEMATSALVN 477


>UNIPROTKB|Q9UBX1 [details] [associations]
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=TAS] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_6900 GO:GO:0019886 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0043202
            GO:GO:0004197 HOVERGEN:HBG011513 EMBL:AJ007331 EMBL:AF088886
            EMBL:AF132894 EMBL:AF136279 EMBL:AF071748 EMBL:AF071749
            EMBL:AK313657 EMBL:BC011682 EMBL:BC036451 EMBL:AL137742
            IPI:IPI00002816 RefSeq:NP_003784.2 UniGene:Hs.11590 PDB:1D5U
            PDB:1M6D PDBsum:1D5U PDBsum:1M6D ProteinModelPortal:Q9UBX1
            SMR:Q9UBX1 STRING:Q9UBX1 MEROPS:C01.018 PhosphoSite:Q9UBX1
            DMDM:12643325 PaxDb:Q9UBX1 PeptideAtlas:Q9UBX1 PRIDE:Q9UBX1
            DNASU:8722 Ensembl:ENST00000310325 GeneID:8722 KEGG:hsa:8722
            UCSC:uc001oip.3 CTD:8722 GeneCards:GC11M066332 HGNC:HGNC:2531
            HPA:CAB002141 MIM:603539 neXtProt:NX_Q9UBX1 PharmGKB:PA27031
            InParanoid:Q9UBX1 OMA:LAPPEWD OrthoDB:EOG4CC41T PhylomeDB:Q9UBX1
            BindingDB:Q9UBX1 ChEMBL:CHEMBL2517 ChiTaRS:CTSF
            EvolutionaryTrace:Q9UBX1 GenomeRNAi:8722 NextBio:32715
            ArrayExpress:Q9UBX1 Bgee:Q9UBX1 CleanEx:HS_CTSF
            Genevestigator:Q9UBX1 GermOnline:ENSG00000174080 Uniprot:Q9UBX1
        Length = 484

 Score = 518 (187.4 bits), Expect = 9.5e-50, P = 9.5e-50
 Identities = 125/315 (39%), Positives = 170/315 (53%)

Query:    47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTN 105
             ++ +I++ ++  + +T         R  +F +N+    +  +L+R T + G+ KF+DLT 
Sbjct:   182 KMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTE 241

Query:   106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
             EE+R +YL T       L K      + A   GD  P   DWR KGAV  VKDQG CGSC
Sbjct:   242 EEFRTIYLNTL------LRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSC 295

Query:   166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
             WAFS    VEG   +  G L+SLSEQEL+DCD K++  C GGL   A+  I   GG+++E
Sbjct:   296 WAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD-KMDKACMGGLPSNAYSAIKNLGGLETE 354

Query:   226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAF- 283
              DY Y G    C+ S   AKV   D  E +S  +E  L   +A + P+SVAI A G  F 
Sbjct:   355 DDYSYQGHMQSCNFSAEKAKVYINDSVE-LSQ-NEQKLAAWLAKRGPISVAINAFGMQFY 412

Query:   284 QHYESGVFTGECGSAL-DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
             +H  S      C   L DH V+ VGYG  + V +W ++NSWG+DWGE GY  L R     
Sbjct:   413 RHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKGYYYLHRG---- 468

Query:   343 NTGKCGIAMEASYPV 357
              +G CG+   AS  V
Sbjct:   469 -SGACGVNTMASSAV 482


>GENEDB_PFALCIPARUM|PF11_0161 [details] [associations]
            symbol:PF11_0161 "falcipain-2 precursor,
            putative" species:5833 "Plasmodium falciparum" [GO:0020020 "food
            vacuole" evidence=TAS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014186 HSSP:O65039 OMA:NNVEHIN GO:GO:0020020
            MEROPS:C01.046 HOGENOM:HOG000065857 ProtClustDB:PTZ00021
            RefSeq:XP_001347832.1 ProteinModelPortal:Q8I6U5 SMR:Q8I6U5
            IntAct:Q8I6U5 MINT:MINT-1546851 EnsemblProtists:PF11_0161:mRNA
            GeneID:810708 KEGG:pfa:PF11_0161 EuPathDB:PlasmoDB:PF3D7_1115300
            Uniprot:Q8I6U5
        Length = 482

 Score = 517 (187.1 bits), Expect = 1.2e-49, P = 1.2e-49
 Identities = 125/343 (36%), Positives = 185/343 (53%)

Query:    34 NNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT- 92
             N  DH       + +   Y T++  + K  N     ++RFQ+F  N   +  HN+  ++ 
Sbjct:   146 NVFDHKFLMNNVEHINQFY-TFIKTNNKQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSL 204

Query:    93 YKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYAC-----KAGDELPESV-D 146
             YK  LN+FADLT  E+++ YL  RS    +  K  +    Y       K  +    +  D
Sbjct:   205 YKKELNRFADLTYHEFKSKYLTLRSSKPLKNSKYLLDQINYDAVIKKYKGNENFDHAAYD 264

Query:   147 WREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNG 206
             WR    V PVKDQ +CGSCWAFS++ +VE    I   +LI+LSEQELVDC  K N GCNG
Sbjct:   265 WRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFK-NYGCNG 323

Query:   207 GLMDYAFQFIIQNGGMDSEQDYPYLG-AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
             GL++ AF+ +I+ GG+ ++ DYPY+  A N C+  R   K   I  Y  V P  +  LK+
Sbjct:   324 GLINNAFEDMIELGGICTDDDYPYVSDAPNLCNIDRCTEKY-GIKNYLSV-P--DNKLKE 379

Query:   266 AVADQ-PVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG--------TENGVD- 315
             A+    P+S++I A    F  Y+ G+F GECG  L+H V+ VG+G        T+ G   
Sbjct:   380 ALRFLGPISISI-AVSDDFPFYKEGIFDGECGDELNHAVMLVGFGMKEIVNPLTKKGEKH 438

Query:   316 -YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
              Y++++NSWG  WGE G++ ++ +       KCG+  +A  P+
Sbjct:   439 YYYIIKNSWGQQWGERGFINIETDESGLMR-KCGLGTDAFIPL 480


>UNIPROTKB|Q8I6U5 [details] [associations]
            symbol:PF11_0161 "Falcipain-2B" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            EMBL:AE014186 HSSP:O65039 OMA:NNVEHIN GO:GO:0020020 MEROPS:C01.046
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347832.1
            ProteinModelPortal:Q8I6U5 SMR:Q8I6U5 IntAct:Q8I6U5
            MINT:MINT-1546851 EnsemblProtists:PF11_0161:mRNA GeneID:810708
            KEGG:pfa:PF11_0161 EuPathDB:PlasmoDB:PF3D7_1115300 Uniprot:Q8I6U5
        Length = 482

 Score = 517 (187.1 bits), Expect = 1.2e-49, P = 1.2e-49
 Identities = 125/343 (36%), Positives = 185/343 (53%)

Query:    34 NNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT- 92
             N  DH       + +   Y T++  + K  N     ++RFQ+F  N   +  HN+  ++ 
Sbjct:   146 NVFDHKFLMNNVEHINQFY-TFIKTNNKQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSL 204

Query:    93 YKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYAC-----KAGDELPESV-D 146
             YK  LN+FADLT  E+++ YL  RS    +  K  +    Y       K  +    +  D
Sbjct:   205 YKKELNRFADLTYHEFKSKYLTLRSSKPLKNSKYLLDQINYDAVIKKYKGNENFDHAAYD 264

Query:   147 WREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNG 206
             WR    V PVKDQ +CGSCWAFS++ +VE    I   +LI+LSEQELVDC  K N GCNG
Sbjct:   265 WRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFK-NYGCNG 323

Query:   207 GLMDYAFQFIIQNGGMDSEQDYPYLG-AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
             GL++ AF+ +I+ GG+ ++ DYPY+  A N C+  R   K   I  Y  V P  +  LK+
Sbjct:   324 GLINNAFEDMIELGGICTDDDYPYVSDAPNLCNIDRCTEKY-GIKNYLSV-P--DNKLKE 379

Query:   266 AVADQ-PVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG--------TENGVD- 315
             A+    P+S++I A    F  Y+ G+F GECG  L+H V+ VG+G        T+ G   
Sbjct:   380 ALRFLGPISISI-AVSDDFPFYKEGIFDGECGDELNHAVMLVGFGMKEIVNPLTKKGEKH 438

Query:   316 -YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
              Y++++NSWG  WGE G++ ++ +       KCG+  +A  P+
Sbjct:   439 YYYIIKNSWGQQWGERGFINIETDESGLMR-KCGLGTDAFIPL 480


>RGD|1562210 [details] [associations]
            symbol:MGC114246 "similar to cathepsin R" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1562210 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 EMBL:CH474032 MEROPS:C01.042 KO:K09601
            OMA:ASHESFK OrthoDB:EOG4ZCT6D EMBL:BC091563 IPI:IPI00555186
            RefSeq:NP_001017509.1 UniGene:Rn.198321 SMR:Q5BJA0
            Ensembl:ENSRNOT00000061470 GeneID:498688 KEGG:rno:498688
            UCSC:RGD:1562210 InParanoid:Q5BJA0 NextBio:700535
            Genevestigator:Q5BJA0 Uniprot:Q5BJA0
        Length = 334

 Score = 513 (185.6 bits), Expect = 3.2e-49, P = 3.2e-49
 Identities = 123/317 (38%), Positives = 173/317 (54%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVG----LNKFADLTNEE 107
             +Q W  K+ K S  +   E R  ++++NL+ I  HN  N   K G    +N+F D T EE
Sbjct:    29 WQEWKKKYDK-SYSLEEEELRRAVWEENLKMIKLHNGENGLGKNGFTMEINEFGDTTGEE 87

Query:   108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
             +R M +       R   + K   +R    AG   P+ VDWR+KG V PV+ QG+C +CWA
Sbjct:    88 FRKMMVEFPVQTHR---EGKSIMKR---AAGSIFPKFVDWRKKGYVTPVRRQGNCNACWA 141

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
             FS   A+E      +G+LI LS Q LVDC + + N GC GG    AFQ+++ NGG+ SE 
Sbjct:   142 FSVTGAIEAQTIWQSGKLIPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVLHNGGLQSEA 201

Query:   227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQH 285
              YPY G +  C  + +N+    I G+  + P  E  L  AVA   P+S  I+A   +F+ 
Sbjct:   202 TYPYEGKDGPCRYNPKNSSA-EITGFVSL-PESEDILMVAVATIGPISAGIDASHESFKF 259

Query:   286 YESGVF-TGECGS-ALDHGVVAVGYG---TENGVD-YWLVRNSWGSDWGENGYVKLQRNL 339
             Y+ G++    C S ++ HGV+ VGYG    + G D YWL++NSWG  WG  GY+K+ +  
Sbjct:   260 YKKGIYHEPNCSSNSVTHGVLVVGYGFKGNDTGGDHYWLIKNSWGKQWGIRGYMKITK-- 317

Query:   340 LDTNTGKCGIAMEASYP 356
              D N   C IA  A YP
Sbjct:   318 -DKNN-HCAIASYAHYP 332


>TAIR|locus:2130180 [details] [associations]
            symbol:AT4G16190 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0042744 "hydrogen peroxide catabolic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005773
            EMBL:CP002687 HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 EMBL:Z97340 EMBL:AL161543 UniGene:At.25555
            EMBL:AY039556 EMBL:AY129473 EMBL:AY136316 EMBL:BT000733
            EMBL:AK226366 IPI:IPI00543588 PIR:D71428 RefSeq:NP_567489.1
            HSSP:P25779 ProteinModelPortal:Q9SUL1 SMR:Q9SUL1 STRING:Q9SUL1
            MEROPS:C01.A06 PRIDE:Q9SUL1 EnsemblPlants:AT4G16190.1 GeneID:827311
            KEGG:ath:AT4G16190 TAIR:At4g16190 InParanoid:Q9SUL1 OMA:NACGINK
            PhylomeDB:Q9SUL1 ProtClustDB:CLSN2917559 Genevestigator:Q9SUL1
            Uniprot:Q9SUL1
        Length = 373

 Score = 512 (185.3 bits), Expect = 4.1e-49, P = 4.1e-49
 Identities = 111/297 (37%), Positives = 170/297 (57%)

Query:    57 AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTR 116
             +K+ KT      ++ RF++FK NLR    +  L+ +   G+ +F+DLT +E+R  +LG  
Sbjct:    60 SKYEKTYATQVEHDHRFRVFKANLRRARRNQLLDPSAVHGVTQFSDLTPKEFRRKFLGL- 118

Query:   117 SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
                KRR  +    +Q        +LP   DWRE+GAV PVK+QG CGSCW+FS + A+EG
Sbjct:   119 ---KRRGFRLPTDTQTAPILPTSDLPTEFDWREQGAVTPVKNQGMCGSCWSFSAIGALEG 175

Query:   177 INKIVTGELISLSEQELVDCDRKIN--------AGCNGGLMDYAFQFIIQNGGMDSEQDY 228
              + + T EL+SLSEQ+LVDCD + +        +GC+GGLM+ AF++ ++ GG+  E+DY
Sbjct:   176 AHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALKAGGLMKEEDY 235

Query:   229 PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
             PY G ++      ++  V S+  +  VS  ++      V   P+++AI A     Q Y  
Sbjct:   236 PYTGRDHTACKFDKSKIVASVSNFSVVSSDEDQIAANLVQHGPLAIAINA--MWMQTYIG 293

Query:   289 GVFTGE-CGSALDHGVVAVGYGTENGVD-------YWLVRNSWGSDWGENGYVKLQR 337
             GV     C  + DHGV+ VG+G+            YW+++NSWG+ WGE+GY K+ R
Sbjct:   294 GVSCPYVCSKSQDHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMWGEHGYYKICR 350


>DICTYBASE|DDB_G0279187 [details] [associations]
            symbol:cprG "cysteine proteinase 7" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0279187 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000030 ProtClustDB:CLSZ2846820 MEROPS:C01.081
            EMBL:U72746 RefSeq:XP_641720.2 ProteinModelPortal:Q94504 SMR:Q94504
            PRIDE:Q94504 EnsemblProtists:DDB0215005 GeneID:8621915
            KEGG:ddi:DDB_G0279187 OMA:INTETEK Uniprot:Q94504
        Length = 460

 Score = 511 (184.9 bits), Expect = 5.2e-49, P = 5.2e-49
 Identities = 112/273 (41%), Positives = 154/273 (56%)

Query:    47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNE 106
             E    +  W+  H +  +    N  R+ IFK N+ +++E N+      +GLN FAD++NE
Sbjct:    25 EYRNAFTNWMIAHQRHYSSEEFNG-RYNIFKANMDYVNEWNTKGSETVLGLNVFADISNE 83

Query:   107 EYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCW 166
             EYRA YLGT  DA    M    + + +   A       VDWR +GAV P+K+QG CG CW
Sbjct:    84 EYRATYLGTPFDASSLEMTE--SDKIFDASA------QVDWRTQGAVTPIKNQGQCGGCW 135

Query:   167 AFSTVAAVEGINKIVTGE--LISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMD 223
             +FST  A EG   +  G+  L+SLSEQ L+DC     N GC GGLM  AF++II N G+D
Sbjct:   136 SFSTTGATEGAQYLANGKKNLVSLSEQNLIDCSGSYGNNGCEGGLMTLAFEYIINNKGID 195

Query:   224 SEQDYPYLGAENK-CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRA 282
             +E  YPY   + K C  + +N     +  Y +V+   E  L   V   P SVAI+A  ++
Sbjct:   196 TESSYPYTAEDGKKCKFNPKNV-AAQLSSYVNVTSGSESDLAAKVTQGPTSVAIDASNQS 254

Query:   283 FQHYESGVFTGE-CGSA-LDHGVVAVGYGTENG 313
             FQ Y SG++    C S  LDHGV+AVG+GT +G
Sbjct:   255 FQLYVSGIYNEPACSSTQLDHGVLAVGFGTGSG 287

 Score = 122 (48.0 bits), Expect = 0.00015, P = 0.00015
 Identities = 32/84 (38%), Positives = 44/84 (52%)

Query:   273 SVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGY 332
             SV+  A G A     SG  +G    +  +G V   Y T    DYW+V+NSWG+ WG +GY
Sbjct:   385 SVSGSASGSA-----SGSASGSSSGSNSNGGV---YPTAG--DYWIVKNSWGTSWGMDGY 434

Query:   333 VKLQRNLLDTNTGKCGIAMEASYP 356
             + + +     N  +CGIA  AS P
Sbjct:   435 ILMTKG----NNNQCGIATMASRP 454


>DICTYBASE|DDB_G0291191 [details] [associations]
            symbol:DDB_G0291191 "cysteine protease" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0291191
            GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AAFI02000175 MEROPS:C01.022
            ProtClustDB:CLSZ2429603 RefSeq:XP_635374.1
            ProteinModelPortal:Q54F16 PRIDE:Q54F16 EnsemblProtists:DDB0252831
            GeneID:8628022 KEGG:ddi:DDB_G0291191 OMA:NETQIAS Uniprot:Q54F16
        Length = 352

 Score = 510 (184.6 bits), Expect = 6.7e-49, P = 6.7e-49
 Identities = 121/320 (37%), Positives = 171/320 (53%)

Query:    72 RFQIFKDNLRFIDEHNSLNRTY----KVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
             +F+ FK NL  ID  N    T     K G+NKFADL+ EE++  YL ++   + RL    
Sbjct:    46 KFETFKSNLLNIDALNKQATTIGSDTKFGVNKFADLSKEEFKKYYLSSK---EARLTDDL 102

Query:   128 VASQRYACKAGDELPESVDWREKGA---------VNPVKDQGSCGSCWAFSTVAAVEGIN 178
                   +       P + DWR  G          V  VK+QG CGSCW+FST   VEG +
Sbjct:   103 PMLPNLSDDIISATPAAFDWRNTGGSTKFPQGTPVTAVKNQGQCGSCWSFSTTGNVEGQH 162

Query:   179 KIVTGELISLSEQELVDCDRKI---------NAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
              + TG L+ LSEQ LVDCD            NAGC+GGL   A+ +II+NGG+ +E  YP
Sbjct:   163 YLSTGTLVGLSEQNLVDCDHTCMTYENENVCNAGCDGGLQPNAYNYIIKNGGIQTEATYP 222

Query:   230 YLGAENKCDPSRRNAKV-VSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYE 287
             Y   + +C     +A+V   I  +  V P +E  +   + +  P+++A +A    +Q Y 
Sbjct:   223 YTAVDGEC--KFNSAQVGAKISSFTMV-PQNETQIASYLFNNGPLAIAADA--EEWQFYM 277

Query:   288 SGVFTGECGSALDHGVVAVGYGTENGV-----DYWLVRNSWGSDWGENGYVKLQRNLLDT 342
              GVF   CG  LDHG++ VGYG ++ +      YW+++NSWG+DWGE GY+K++RN    
Sbjct:   278 GGVFDFPCGQTLDHGILIVGYGAQDTIVGKNTPYWIIKNSWGADWGEAGYLKVERN---- 333

Query:   343 NTGKCGIAMEASYPVKNSQN 362
              T KCG+A   S  +  S N
Sbjct:   334 -TDKCGVANFVSSSIVGSSN 352


>GENEDB_PFALCIPARUM|PF11_0165 [details] [associations]
            symbol:PF11_0165 "falcipain 2 precursor"
            species:5833 "Plasmodium falciparum" [GO:0020020 "food vacuole"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014186 HSSP:O65039 GO:GO:0020020
            RefSeq:XP_001347836.1 ProteinModelPortal:Q8I6U4 SMR:Q8I6U4
            IntAct:Q8I6U4 MINT:MINT-1559493 MEROPS:C01.046
            EnsemblProtists:PF11_0165:mRNA GeneID:810712 KEGG:pfa:PF11_0165
            EuPathDB:PlasmoDB:PF3D7_1115700 HOGENOM:HOG000065857 OMA:NESLHAN
            ProtClustDB:PTZ00021 BindingDB:Q8I6U4 ChEMBL:CHEMBL3470
            Uniprot:Q8I6U4
        Length = 484

 Score = 510 (184.6 bits), Expect = 6.7e-49, P = 6.7e-49
 Identities = 123/343 (35%), Positives = 183/343 (53%)

Query:    34 NNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRT 92
             N  D+       + +   Y  ++  + K  N     ++RFQ+F  N   ++ HN+  N  
Sbjct:   148 NFFDNKFLMNNAEHINQFYM-FIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSL 206

Query:    93 YKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYA-----CKAGDELPESV-D 146
             YK  LN+FADLT  E++  YL  RS    +  K  +    Y       K  +    +  D
Sbjct:   207 YKKELNRFADLTYHEFKNKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYKGNENFDHAAYD 266

Query:   147 WREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNG 206
             WR    V PVKDQ +CGSCWAFS++ +VE    I   +LI+LSEQELVDC  K N GCNG
Sbjct:   267 WRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFK-NYGCNG 325

Query:   207 GLMDYAFQFIIQNGGMDSEQDYPYLG-AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
             GL++ AF+ +I+ GG+ ++ DYPY+  A N C+  R   K   I  Y  V P  +  LK+
Sbjct:   326 GLINNAFEDMIELGGICTDDDYPYVSDAPNLCNIDRCTEKY-GIKNYLSV-P--DNKLKE 381

Query:   266 AVADQ-PVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG--------TENGVD- 315
             A+    P+S+++ A    F  Y+ G+F GECG  L+H V+ VG+G        T+ G   
Sbjct:   382 ALRFLGPISISV-AVSDDFAFYKEGIFDGECGDQLNHAVMLVGFGMKEIVNPLTKKGEKH 440

Query:   316 -YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
              Y++++NSWG  WGE G++ ++ +       KCG+  +A  P+
Sbjct:   441 YYYIIKNSWGQQWGERGFINIETDESGLMR-KCGLGTDAFIPL 482


>UNIPROTKB|Q8I6U4 [details] [associations]
            symbol:PF11_0165 "Falcipain-2A" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            EMBL:AE014186 HSSP:O65039 GO:GO:0020020 RefSeq:XP_001347836.1
            ProteinModelPortal:Q8I6U4 SMR:Q8I6U4 IntAct:Q8I6U4
            MINT:MINT-1559493 MEROPS:C01.046 EnsemblProtists:PF11_0165:mRNA
            GeneID:810712 KEGG:pfa:PF11_0165 EuPathDB:PlasmoDB:PF3D7_1115700
            HOGENOM:HOG000065857 OMA:NESLHAN ProtClustDB:PTZ00021
            BindingDB:Q8I6U4 ChEMBL:CHEMBL3470 Uniprot:Q8I6U4
        Length = 484

 Score = 510 (184.6 bits), Expect = 6.7e-49, P = 6.7e-49
 Identities = 123/343 (35%), Positives = 183/343 (53%)

Query:    34 NNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRT 92
             N  D+       + +   Y  ++  + K  N     ++RFQ+F  N   ++ HN+  N  
Sbjct:   148 NFFDNKFLMNNAEHINQFYM-FIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSL 206

Query:    93 YKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYA-----CKAGDELPESV-D 146
             YK  LN+FADLT  E++  YL  RS    +  K  +    Y       K  +    +  D
Sbjct:   207 YKKELNRFADLTYHEFKNKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYKGNENFDHAAYD 266

Query:   147 WREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNG 206
             WR    V PVKDQ +CGSCWAFS++ +VE    I   +LI+LSEQELVDC  K N GCNG
Sbjct:   267 WRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFK-NYGCNG 325

Query:   207 GLMDYAFQFIIQNGGMDSEQDYPYLG-AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
             GL++ AF+ +I+ GG+ ++ DYPY+  A N C+  R   K   I  Y  V P  +  LK+
Sbjct:   326 GLINNAFEDMIELGGICTDDDYPYVSDAPNLCNIDRCTEKY-GIKNYLSV-P--DNKLKE 381

Query:   266 AVADQ-PVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG--------TENGVD- 315
             A+    P+S+++ A    F  Y+ G+F GECG  L+H V+ VG+G        T+ G   
Sbjct:   382 ALRFLGPISISV-AVSDDFAFYKEGIFDGECGDQLNHAVMLVGFGMKEIVNPLTKKGEKH 440

Query:   316 -YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
              Y++++NSWG  WGE G++ ++ +       KCG+  +A  P+
Sbjct:   441 YYYIIKNSWGQQWGERGFINIETDESGLMR-KCGLGTDAFIPL 482


>MGI|MGI:1861723 [details] [associations]
            symbol:Ctsr "cathepsin R" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=ISA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=ISA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0030163 "protein
            catabolic process" evidence=ISA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1861723 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0030163
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF245399
            EMBL:AY014778 EMBL:AK014432 EMBL:AK005429 IPI:IPI00120321
            RefSeq:NP_064680.1 UniGene:Mm.315715 ProteinModelPortal:Q9JIA9
            SMR:Q9JIA9 MEROPS:C01.042 PRIDE:Q9JIA9 Ensembl:ENSMUST00000021889
            GeneID:56835 KEGG:mmu:56835 CTD:56835 InParanoid:Q9JIA9 KO:K09601
            OMA:ASHESFK OrthoDB:EOG4ZCT6D NextBio:313379 Bgee:Q9JIA9
            CleanEx:MM_CTSR Genevestigator:Q9JIA9 GermOnline:ENSMUSG00000055679
            Uniprot:Q9JIA9
        Length = 334

 Score = 509 (184.2 bits), Expect = 8.5e-49, P = 8.5e-49
 Identities = 123/324 (37%), Positives = 178/324 (54%)

Query:    45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVG----LNKF 100
             D  +   +Q W  K+ K+ +      KR  ++++ L+ I  HN  N   K G    +N+F
Sbjct:    22 DSSLDAEWQDWKIKYNKSYSLKEEKLKRV-VWEEKLKMIKLHNRENSLGKNGFTMKMNEF 80

Query:   101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
              D T+EE+R M +       R   + K   +R   +AG  LP+ VDWR+KG V PV+ QG
Sbjct:    81 GDQTDEEFRKMMIEISVWTHR---EGKSIMKR---EAGSILPKFVDWRKKGYVTPVRRQG 134

Query:   161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQN 219
              C +CWAF+   A+E      TG+L  LS Q LVDC + + N GC GG    AFQ+++ N
Sbjct:   135 DCDACWAFAVTGAIEAQAIWQTGKLTPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVLHN 194

Query:   220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEA 278
             GG++SE  YPY G +  C  + +N+K   I G+  + P  E  L  AVA   P++  I+A
Sbjct:   195 GGLESEATYPYEGKDGPCRYNPKNSKA-EITGFVSL-PQSEDILMAAVATIGPITAGIDA 252

Query:   279 GGRAFQHYESGVF-TGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENGY 332
                +F++Y+ G++    C S  + HGV+ VGYG +    +G  YWL++NSWG  WG  GY
Sbjct:   253 SHESFKNYKGGIYHEPNCSSDTVTHGVLVVGYGFKGIETDGNHYWLIKNSWGKRWGIRGY 312

Query:   333 VKLQRNLLDTNTGKCGIAMEASYP 356
             +KL +   D N   CGIA  A YP
Sbjct:   313 MKLAK---DKNN-HCGIASYAHYP 332


>RGD|1308181 [details] [associations]
            symbol:Ctsf "cathepsin F" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1308181 eggNOG:COG4870 HOGENOM:HOG000230774
            KO:K01373 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 MEROPS:C01.018
            CTD:8722 OMA:LAPPEWD OrthoDB:EOG4CC41T GeneTree:ENSGT00660000095458
            EMBL:CH473953 EMBL:BC099780 EMBL:EU253481 IPI:IPI00201100
            RefSeq:NP_001029282.1 UniGene:Rn.25087 SMR:Q499S6
            Ensembl:ENSRNOT00000026718 GeneID:361704 KEGG:rno:361704
            UCSC:RGD:1308181 InParanoid:Q499S6 NextBio:677325
            Genevestigator:Q499S6 Uniprot:Q499S6
        Length = 462

 Score = 509 (184.2 bits), Expect = 8.5e-49, P = 8.5e-49
 Identities = 123/319 (38%), Positives = 169/319 (52%)

Query:    47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTN 105
             ++ T+++ ++  + +T       + R  +F  N+    +  +L+R T + G+ KF+DLT 
Sbjct:   160 KMATLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTE 219

Query:   106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
             EE+  +YL         L K        A    D  P   DWR+KGAV  VKDQG CGSC
Sbjct:   220 EEFHTIYLNPL------LQKESGGKMSLAKSINDLAPPEWDWRKKGAVTEVKDQGMCGSC 273

Query:   166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
             WAFS    VEG   +  G L+SLSEQEL+DCD K++  C GGL   A+  I   GG+++E
Sbjct:   274 WAFSVTGNVEGQWFLNRGTLLSLSEQELLDCD-KMDKACMGGLPSNAYTAIKNLGGLETE 332

Query:   226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQ 284
              DY Y G    C+ S + AKV   D  E +S  DE  +   +A + P+SVAI A G  F 
Sbjct:   333 DDYGYQGHVQACNFSTQMAKVYINDSVE-LSR-DENKIAAWLAQKGPISVAINAFGMQF- 389

Query:   285 HYESGV---FTGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLL 340
              Y  G+   F   C    +DH V+ VGYG  + + YW ++NSWG DWGE GY  L R   
Sbjct:   390 -YRHGIAHPFRPLCSPWFIDHAVLLVGYGNRSNIPYWAIKNSWGRDWGEEGYYYLYRG-- 446

Query:   341 DTNTGKCGIAMEASYPVKN 359
                +G CG+   AS  V N
Sbjct:   447 ---SGACGVNTMASSAVVN 462


>MGI|MGI:1349426 [details] [associations]
            symbol:Ctsj "cathepsin J" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=ISO]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0048471 "perinuclear region
            of cytoplasm" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:1349426 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF136272
            EMBL:AF158182 EMBL:AY034579 EMBL:AK005526 EMBL:AK131661
            EMBL:BC103769 IPI:IPI00126770 RefSeq:NP_036137.1 UniGene:Mm.31948
            ProteinModelPortal:Q9R014 SMR:Q9R014 MEROPS:C01.038 PRIDE:Q9R014
            Ensembl:ENSMUST00000071526 GeneID:26898 KEGG:mmu:26898
            UCSC:uc007qwa.1 CTD:26898 InParanoid:Q9R014 KO:K09599
            NextBio:304745 Bgee:Q9R014 CleanEx:MM_CTSJ Genevestigator:Q9R014
            GermOnline:ENSMUSG00000055298 Uniprot:Q9R014
        Length = 334

 Score = 508 (183.9 bits), Expect = 1.1e-48, P = 1.1e-48
 Identities = 119/324 (36%), Positives = 179/324 (55%)

Query:    45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLNKF 100
             D ++   ++ W  K+ K+ +      +R  ++++N+R I  HN  N      + + +NKF
Sbjct:    22 DPKLDAEWKDWKTKYAKSYSPKEEALRR-AVWEENMRMIKLHNKENSLGKNNFTMKMNKF 80

Query:   101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
              D T+EE+R         A    M    A    +   G  LP+  DWRE+G V PV++QG
Sbjct:    81 GDQTSEEFRKSIDNIPIPAA---MTDPHAQNHVSI--G--LPDYKDWREEGYVTPVRNQG 133

Query:   161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQN 219
              CGSCWAF+   A+EG     TG L  LS Q L+DC + + N GC  G    AF+++++N
Sbjct:   134 KCGSCWAFAAAGAIEGQMFWKTGNLTPLSVQNLLDCSKTVGNKGCQSGTAHQAFEYVLKN 193

Query:   220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEA 278
              G+++E  YPY G +  C     NA   +I  Y ++ P +E+ L  AVA   PVS AI+A
Sbjct:   194 KGLEAEATYPYEGKDGPCRYRSENASA-NITDYVNLPP-NELYLWVAVASIGPVSAAIDA 251

Query:   279 GGRAFQHYESGVF-TGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENGY 332
                +F+ Y  G++    C S  ++H V+ VGYG+E    +G +YWL++NSWG +WG NGY
Sbjct:   252 SHDSFRFYNGGIYYEPNCSSYFVNHAVLVVGYGSEGDVKDGNNYWLIKNSWGEEWGMNGY 311

Query:   333 VKLQRNLLDTNTGKCGIAMEASYP 356
             +++ +   D N   CGIA  ASYP
Sbjct:   312 MQIAK---DHNN-HCGIASLASYP 331


>UNIPROTKB|Q0VCU3 [details] [associations]
            symbol:CTSF "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            HOVERGEN:HBG011513 MEROPS:C01.018 CTD:8722 OMA:LAPPEWD
            OrthoDB:EOG4CC41T GeneTree:ENSGT00660000095458 EMBL:DAAA02063594
            EMBL:BC120003 IPI:IPI00717812 RefSeq:NP_001068884.1 UniGene:Bt.7264
            SMR:Q0VCU3 Ensembl:ENSBTAT00000014587 GeneID:509715 KEGG:bta:509715
            InParanoid:Q0VCU3 NextBio:20869091 Uniprot:Q0VCU3
        Length = 460

 Score = 506 (183.2 bits), Expect = 1.8e-48, P = 1.8e-48
 Identities = 126/318 (39%), Positives = 170/318 (53%)

Query:    47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTN 105
             ++ +I++ ++  + +T +       R  +F +N+    +  +L+R T + G+ KF+DLT 
Sbjct:   158 KMASIFKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTARYGVTKFSDLTE 217

Query:   106 EEYRAMYLGTR-SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
             EE+R +YL     DA  R M       R A    D  P   DWR KGAV  VKDQG CGS
Sbjct:   218 EEFRTIYLNPLLKDAPGRNM-------RPAQPVTDVPPPQWDWRNKGAVTNVKDQGMCGS 270

Query:   165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
             CWAFS    VEG   +  G L+SLSEQEL+DCD K +  C GGL   A+  I   GG+++
Sbjct:   271 CWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCD-KTDKACLGGLPSNAYSAIRTLGGLET 329

Query:   225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAF 283
             E DY Y G    C  S   AKV   D  E +S  +E  L   +A   PVS+AI A G  F
Sbjct:   330 EDDYSYRGRLQTCSFSAEKAKVYINDSVE-LSK-NEQKLAAWLAKNGPVSIAINAFGMQF 387

Query:   284 -QHYESGVFTGECGSAL-DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
              +H  S      C   L DH V+ VGYG  + + +W ++NSWG+DWGE GY  L R    
Sbjct:   388 YRHGISHPLRPLCSPWLIDHAVLLVGYGNRSAIPFWAIKNSWGTDWGEEGYYYLHRG--- 444

Query:   342 TNTGKCGIAMEASYPVKN 359
               +G CG+ + AS  V N
Sbjct:   445 --SGACGVNIMASSAVIN 460


>GENEDB_PFALCIPARUM|PF11_0162 [details] [associations]
            symbol:PF11_0162 "falcipain-3" species:5833
            "Plasmodium falciparum" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 HSSP:P43235 EMBL:AE014186 GO:GO:0020020
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347833.1
            ProteinModelPortal:Q8IIL0 SMR:Q8IIL0 MEROPS:C01.063
            EnsemblProtists:PF11_0162:mRNA GeneID:810709 KEGG:pfa:PF11_0162
            EuPathDB:PlasmoDB:PF3D7_1115400 OMA:ENDEDYW ChEMBL:CHEMBL1250373
            Uniprot:Q8IIL0
        Length = 492

 Score = 505 (182.8 bits), Expect = 2.3e-48, P = 2.3e-48
 Identities = 131/350 (37%), Positives = 185/350 (52%)

Query:    32 YDNNHDHSSSWRTDD-EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-L 89
             Y N  D  + +  D+ E + ++  +L ++ K        +KRF IF +N R I+ HN   
Sbjct:   152 YSNLFD--TKFLMDNLETVNLFYIFLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKT 209

Query:    90 NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYA-----CKAGDELPE 143
             N  YK G+NKF DL+ EE+R+ YL  ++    + +   V+ +  Y       K  D   +
Sbjct:   210 NSLYKRGMNKFGDLSPEEFRSKYLNLKTHGPFKTLSPPVSYEANYEDVIKKYKPADAKLD 269

Query:   144 SV--DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
              +  DWR  G V PVKDQ  CGSCWAFS+V +VE    I    L   SEQELVDC  K N
Sbjct:   270 RIAYDWRLHGGVTPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCSVK-N 328

Query:   202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLG-AENKCDPSRRNAKVVSIDGYEDVSPFDE 260
              GC GG +  AF  +I  GG+ S+ DYPY+      C+  R N +  +I  Y  + P D+
Sbjct:   329 NGCYGGYITNAFDDMIDLGGLCSQDDYPYVSNLPETCNLKRCNERY-TIKSYVSI-PDDK 386

Query:   261 MSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN--GVD-- 315
                K+A+    P+S++I A    F  Y  G + GECG+A +H V+ VGYG ++    D  
Sbjct:   387 F--KEALRYLGPISISIAASDD-FAFYRGGFYDGECGAAPNHAVILVGYGMKDIYNEDTG 443

Query:   316 ------YWLVRNSWGSDWGENGYVKLQRNLLDTNTGK--CGIAMEASYPV 357
                   Y++++NSWGSDWGE GY+ L+    D N  K  C I  EA  P+
Sbjct:   444 RMEKFYYYIIKNSWGSDWGEGGYINLET---DENGYKKTCSIGTEAYVPL 490


>UNIPROTKB|F1RU48 [details] [associations]
            symbol:CTSF "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            OMA:LAPPEWD GeneTree:ENSGT00660000095458 EMBL:CU928034
            EMBL:FP565364 Ensembl:ENSSSCT00000014140 Ensembl:ENSSSCT00000014154
            Uniprot:F1RU48
        Length = 460

 Score = 505 (182.8 bits), Expect = 2.3e-48, P = 2.3e-48
 Identities = 121/318 (38%), Positives = 175/318 (55%)

Query:    47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTN 105
             ++ +I++ ++  + +T +       R  +F +N+    +  +L+  T + G+ KF+DLT 
Sbjct:   158 KMASIFKEFVTTYNRTYDTKEEARWRMSVFANNMVRAQKIQALDTGTARYGVTKFSDLTE 217

Query:   106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESV-DWREKGAVNPVKDQGSCGS 164
             EE+R +YL         L++ +   +    K+   LP    DWR+KGAV  VKDQG CGS
Sbjct:   218 EEFRTIYLNP-------LLQEEPGRKMRLAKSVSSLPPPEWDWRKKGAVTKVKDQGMCGS 270

Query:   165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDS 224
             CWAFS    VEG   +  G L+SLSEQEL+DCD K++ GC GGL   A+  I   GG+++
Sbjct:   271 CWAFSVTGNVEGQWFLKQGTLLSLSEQELLDCD-KVDKGCMGGLPSNAYSAIKTLGGLET 329

Query:   225 EQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAF 283
             E+DY Y G    C  +   AKV   D  E +S  +E  L   +A++ P+SVAI A G  F
Sbjct:   330 EEDYSYRGHLQTCSFNAEKAKVYINDSVE-LSQ-NEQKLAAWLAEKGPISVAINAFGMQF 387

Query:   284 -QHYESGVFTGECGSAL-DHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
              +H  S      C   L DH V+ VGYG  +   +W ++NSWG+DWGE GY  L R    
Sbjct:   388 YRHGISHPLRPLCSPWLIDHAVLLVGYGNRSATPFWAIKNSWGTDWGEEGYYYLYRG--- 444

Query:   342 TNTGKCGIAMEASYPVKN 359
               +G CG+ + AS  V N
Sbjct:   445 --SGACGVNIMASSAVVN 460


>UNIPROTKB|Q8IIL0 [details] [associations]
            symbol:PF11_0162 "Falcipain-3" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 HSSP:P43235 EMBL:AE014186 GO:GO:0020020
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347833.1
            ProteinModelPortal:Q8IIL0 SMR:Q8IIL0 MEROPS:C01.063
            EnsemblProtists:PF11_0162:mRNA GeneID:810709 KEGG:pfa:PF11_0162
            EuPathDB:PlasmoDB:PF3D7_1115400 OMA:ENDEDYW ChEMBL:CHEMBL1250373
            Uniprot:Q8IIL0
        Length = 492

 Score = 505 (182.8 bits), Expect = 2.3e-48, P = 2.3e-48
 Identities = 131/350 (37%), Positives = 185/350 (52%)

Query:    32 YDNNHDHSSSWRTDD-EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-L 89
             Y N  D  + +  D+ E + ++  +L ++ K        +KRF IF +N R I+ HN   
Sbjct:   152 YSNLFD--TKFLMDNLETVNLFYIFLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKT 209

Query:    90 NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ-RYA-----CKAGDELPE 143
             N  YK G+NKF DL+ EE+R+ YL  ++    + +   V+ +  Y       K  D   +
Sbjct:   210 NSLYKRGMNKFGDLSPEEFRSKYLNLKTHGPFKTLSPPVSYEANYEDVIKKYKPADAKLD 269

Query:   144 SV--DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
              +  DWR  G V PVKDQ  CGSCWAFS+V +VE    I    L   SEQELVDC  K N
Sbjct:   270 RIAYDWRLHGGVTPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCSVK-N 328

Query:   202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLG-AENKCDPSRRNAKVVSIDGYEDVSPFDE 260
              GC GG +  AF  +I  GG+ S+ DYPY+      C+  R N +  +I  Y  + P D+
Sbjct:   329 NGCYGGYITNAFDDMIDLGGLCSQDDYPYVSNLPETCNLKRCNERY-TIKSYVSI-PDDK 386

Query:   261 MSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTEN--GVD-- 315
                K+A+    P+S++I A    F  Y  G + GECG+A +H V+ VGYG ++    D  
Sbjct:   387 F--KEALRYLGPISISIAASDD-FAFYRGGFYDGECGAAPNHAVILVGYGMKDIYNEDTG 443

Query:   316 ------YWLVRNSWGSDWGENGYVKLQRNLLDTNTGK--CGIAMEASYPV 357
                   Y++++NSWGSDWGE GY+ L+    D N  K  C I  EA  P+
Sbjct:   444 RMEKFYYYIIKNSWGSDWGEGGYINLET---DENGYKKTCSIGTEAYVPL 490


>DICTYBASE|DDB_G0290957 [details] [associations]
            symbol:cprA "cysteine proteinase 1" species:44689
            "Dictyostelium discoideum" [GO:0006972 "hyperosmotic response"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0290957
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GenomeReviews:CM000154_GR GO:GO:0005764
            GO:GO:0006972 EMBL:AAFI02000174 KO:K01376 EMBL:X02407 PIR:A22827
            RefSeq:XP_635417.1 ProteinModelPortal:P04988 MEROPS:C01.022
            GlycoSuiteDB:P04988 SWISS-2DPAGE:P04988 EnsemblProtists:DDB0201647
            GeneID:8627918 KEGG:ddi:DDB_G0290957 OMA:KISNFTM
            ProtClustDB:CLSZ2429603 Uniprot:P04988
        Length = 343

 Score = 503 (182.1 bits), Expect = 3.7e-48, P = 3.7e-48
 Identities = 119/313 (38%), Positives = 175/313 (55%)

Query:    61 KTSNGMGHNE--KRFQIFKDNLRFIDEHN--SLNRTY--KVGLNKFADLTNEEYRAMYLG 114
             K +    H E  +RF+IFK NL  I+E N  ++N     K G+NKFADL+++E++  YL 
Sbjct:    35 KFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLN 94

Query:   115 TRSDAKRRLMKSKVASQRYACKAG-DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAA 173
                  K  +    +    Y      + +P + DWR +GAV PVK+QG CGSCW+FST   
Sbjct:    95 N----KEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGN 150

Query:   174 VEGINKIVTGELISLSEQELVDCDRKI---------NAGCNGGLMDYAFQFIIQNGGMDS 224
             VEG + I   +L+SLSEQ LVDCD +          + GCNGGL   A+ +II+NGG+ +
Sbjct:   151 VEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEQACDEGCNGGLQPNAYNYIIKNGGIQT 210

Query:   225 EQDYPYLGAENKCDPSRRNAKV-VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAF 283
             E  YPY  AE     +  +A +   I  +  +   + +     V+  P+++A +A    +
Sbjct:   211 ESSYPYT-AETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAV--EW 267

Query:   284 QHYESGVFTGECG-SALDHGVVAVGYGTENGV-----DYWLVRNSWGSDWGENGYVKLQR 337
             Q Y  GVF   C  ++LDHG++ VGY  +N +      YW+V+NSWG+DWGE GY+ L+R
Sbjct:   268 QFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR 327

Query:   338 NLLDTNTGKCGIA 350
                  NT  CG++
Sbjct:   328 G---KNT--CGVS 335


>UNIPROTKB|F1NHB8 [details] [associations]
            symbol:F1NHB8 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 EMBL:AADN02044011
            IPI:IPI00586027 Ensembl:ENSGALT00000021873 OMA:SELDHAV
            Uniprot:F1NHB8
        Length = 329

 Score = 502 (181.8 bits), Expect = 4.7e-48, P = 4.7e-48
 Identities = 120/315 (38%), Positives = 173/315 (54%)

Query:    51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRA 110
             ++  +  + GK  +    +E R + F  N+RF+   N    +Y + LN  AD T +E  A
Sbjct:    25 LFHHYKERFGKRYSSEEEHEHRKRTFIHNMRFVHSKNRAALSYSLALNHLADRTPQEMAA 84

Query:   111 MYLGTRS-DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
             +    RS D K        + Q YA      LPES+DWR  GAV PVKDQ  CGSCW+F+
Sbjct:    85 LRGRRRSGDPKSG---QPFSMQLYASLV---LPESLDWRLYGAVTPVKDQAVCGSCWSFA 138

Query:   170 TVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
             T  A+EG   + TG L  LS+Q L+DC     N  C+GG    A+++I ++GG+ S + Y
Sbjct:   139 TTGAMEGALFLKTGVLTPLSQQVLIDCSWGFGNYACDGGEEWRAYEWIKKHGGIASTESY 198

Query:   229 -PYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHY 286
              PYLG    C  ++    V  + GY  V   +  +LK A+    PV+V I+A  ++F  Y
Sbjct:   199 GPYLGQNGYCHYNQSEL-VAPLAGYVTVESGNAEALKAALFKHGPVAVNIDASHKSFTFY 257

Query:   287 ESGVFTG-ECG---SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
              +GV+    CG   S LDH V+AVGYG  +G  YWL++NSW + WG +GY+ +   + D 
Sbjct:   258 ANGVYEEPHCGNETSELDHAVLAVGYGVLHGKSYWLIKNSWSTYWGNDGYILMA--MKDN 315

Query:   343 NTGKCGIAMEASYPV 357
             N   CG+A  AS+P+
Sbjct:   316 N---CGVATAASFPI 327


>RGD|1588248 [details] [associations]
            symbol:Cts8 "cathepsin 8" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1588248 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 IPI:IPI00765053
            RefSeq:NP_001121688.1 UniGene:Rn.220599 Ensembl:ENSRNOT00000061486
            GeneID:680718 KEGG:rno:680718 UCSC:RGD:1588248 CTD:56094
            OMA:DSEWQEW OrthoDB:EOG4JT07C NextBio:719350 Uniprot:D3ZP54
        Length = 333

 Score = 501 (181.4 bits), Expect = 6.0e-48, P = 6.0e-48
 Identities = 125/330 (37%), Positives = 178/330 (53%)

Query:    44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN----SLNRTYKVGLNK 99
             +D  + + +Q W  K+ K  +     +KR  ++++N++ + +HN       + + + LN 
Sbjct:    21 SDPSLDSEWQEWKTKYEKNYSLEEEGQKR-AVWEENMKVVKQHNIEYDQEKKNFTMELNA 79

Query:   100 FADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ---RYACKAGDELPESVDWREKGAVNPV 156
             FAD+T EE+R M         + L K K   Q   RY       LP+ VDWR +G V  V
Sbjct:    80 FADMTGEEFRKMMTNI---PVQNLRKKKSIHQPIFRY-------LPKFVDWRRRGYVTSV 129

Query:   157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
             K+QG+C SCWAFS   A+EG     TG L+SLS Q LVDC R + N GC+ G   YA ++
Sbjct:   130 KNQGTCNSCWAFSVAGAIEGQMFRKTGRLVSLSPQNLVDCSRPEGNHGCHMGSTLYALKY 189

Query:   216 IIQNGGMDSEQDYPYLGAENKCD--PSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPV 272
             +  NGG+++E  YPY G E  C   P R  A+V    G+  V+  +E +L  AVA   P+
Sbjct:   190 VWSNGGLEAESTYPYEGKEGPCRYLPRRSAARVT---GFSTVARSEE-ALMHAVATIGPI 245

Query:   273 SVAIEAGGRAFQHYESGVF-TGECGS-ALDHGVVAVGYGTE----NGVDYWLVRNSWGSD 326
             SV I+A   +F+ Y  G++    C S  ++H V+ VGYG E    +G  YWL++NS G  
Sbjct:   246 SVGIDASHVSFRFYRRGIYYEPRCSSNRINHSVLVVGYGYEGRESDGRKYWLIKNSHGVG 305

Query:   327 WGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
             WG NGY+KL R         CGIA    YP
Sbjct:   306 WGMNGYMKLARGW----NNHCGIATYGFYP 331


>RGD|631421 [details] [associations]
            symbol:Ctsq "cathepsin Q" species:10116 "Rattus norvegicus"
            [GO:0005764 "lysosome" evidence=NAS] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 RGD:631421 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 UniGene:Rn.34875 EMBL:AF187323 IPI:IPI00214897
            PIR:JC7183 RefSeq:NP_640355.1 UniGene:Rn.35820
            ProteinModelPortal:Q9QZE3 SMR:Q9QZE3 STRING:Q9QZE3 MEROPS:C01.039
            PRIDE:Q9QZE3 Ensembl:ENSRNOT00000024208 GeneID:246147
            KEGG:rno:246147 UCSC:RGD:631421 CTD:104002 InParanoid:Q9QZE3
            OMA:ESEDVLM OrthoDB:EOG4HHP48 NextBio:623425 Genevestigator:Q9QZE3
            GermOnline:ENSRNOG00000017946 Uniprot:Q9QZE3
        Length = 343

 Score = 500 (181.1 bits), Expect = 7.7e-48, P = 7.7e-48
 Identities = 119/320 (37%), Positives = 180/320 (56%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLNR-TYKVGLNKFADLTNEE 107
             +Q W  K+ K  +      KR  ++++N++ I+ HN   SL + TY + +N FAD+T+EE
Sbjct:    29 WQEWKIKYEKLYSPEEEVLKRV-VWEENVKKIELHNRENSLGKNTYTMEINDFADMTDEE 87

Query:   108 YRAMYLGTR---SDAKRRLMKSKVASQ-RYACKAGDELPESVDWREKGAVNPVKDQGSCG 163
             ++ M +G +    + ++RL K  + S    +    D LP+ VDWR +G V  V+ QG C 
Sbjct:    88 FKDMIIGFQLPVHNTEKRLWKRALGSFFPNSWNWRDALPKFVDWRNEGYVTRVRKQGGCS 147

Query:   164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQNGGM 222
             SCWAF    A+EG     TG+LI LS Q L+DC + + N GC  G    AFQ+++ NGG+
Sbjct:   148 SCWAFPVTGAIEGQMFKKTGKLIPLSVQNLIDCSKPQGNRGCLWGNTYNAFQYVLHNGGL 207

Query:   223 DSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGR 281
             ++E  YPY   E  C  + +N+    I G+  V P  E  L  AVA + P++  +     
Sbjct:   208 EAEATYPYERKEGVCRYNPKNSSA-KITGFV-VLPESEDVLMDAVATKGPIATGVHVISS 265

Query:   282 AFQHYESGVF-TGECGSALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQ 336
             +F+ Y+ GV+   +C S ++H V+ VGYG E    +G +YWL++NSWG  WG  GY+K+ 
Sbjct:   266 SFRFYQKGVYHEPKCSSYVNHAVLVVGYGFEGNETDGNNYWLIKNSWGKRWGLRGYMKIA 325

Query:   337 RNLLDTNTGKCGIAMEASYP 356
             +   D N   C IA  A YP
Sbjct:   326 K---DRNN-HCAIASLAQYP 341


>MGI|MGI:1861434 [details] [associations]
            symbol:Ctsf "cathepsin F" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=ISS]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISS] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:1861434 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513 MEROPS:C01.018
            CTD:8722 OMA:LAPPEWD OrthoDB:EOG4CC41T EMBL:AF136280 EMBL:AF217224
            EMBL:AJ131851 EMBL:AK075862 EMBL:BC058758 IPI:IPI00126769
            RefSeq:NP_063914.1 UniGene:Mm.29561 ProteinModelPortal:Q9R013
            SMR:Q9R013 STRING:Q9R013 PhosphoSite:Q9R013 PaxDb:Q9R013
            PRIDE:Q9R013 Ensembl:ENSMUST00000119694 GeneID:56464 KEGG:mmu:56464
            UCSC:uc008gbc.1 GeneTree:ENSGT00660000095458 InParanoid:Q9R013
            NextBio:312722 Bgee:Q9R013 CleanEx:MM_CTSF Genevestigator:Q9R013
            GermOnline:ENSMUSG00000006458 Uniprot:Q9R013
        Length = 462

 Score = 496 (179.7 bits), Expect = 2.0e-47, P = 2.0e-47
 Identities = 121/315 (38%), Positives = 167/315 (53%)

Query:    51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNEEYR 109
             +++ ++  + +T       + R  +F  N+    +  +L+R T + G+ KF+DLT EE+ 
Sbjct:   164 LFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFH 223

Query:   110 AMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
              +YL         L K        A    D  P   DWR+KGAV  VK+QG CGSCWAFS
Sbjct:   224 TIYLNPL------LQKESGRKMSPAKSINDLAPPEWDWRKKGAVTEVKNQGMCGSCWAFS 277

Query:   170 TVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYP 229
                 VEG   +  G L+SLSEQEL+DCD K++  C GGL   A+  I   GG+++E DY 
Sbjct:   278 VTGNVEGQWFLNRGTLLSLSEQELLDCD-KVDKACLGGLPSNAYAAIKNLGGLETEDDYG 336

Query:   230 YLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYES 288
             Y G    C+ S + AKV   D  E +S  +E  +   +A + P+SVAI A G  F  Y  
Sbjct:   337 YQGHVQTCNFSAQMAKVYINDSVE-LSR-NENKIAAWLAQKGPISVAINAFGMQF--YRH 392

Query:   289 GV---FTGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
             G+   F   C    +DH V+ VGYG  + + YW ++NSWGSDWGE GY  L R      +
Sbjct:   393 GIAHPFRPLCSPWFIDHAVLLVGYGNRSNIPYWAIKNSWGSDWGEEGYYYLYRG-----S 447

Query:   345 GKCGIAMEASYPVKN 359
             G CG+   AS  V N
Sbjct:   448 GACGVNTMASSAVVN 462


>ZFIN|ZDB-GENE-050417-107 [details] [associations]
            symbol:zgc:110239 "zgc:110239" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-050417-107
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 MEROPS:I29.003 OrthoDB:EOG412M56 EMBL:BC092817
            IPI:IPI00503987 RefSeq:NP_001017633.1 UniGene:Dr.39081
            ProteinModelPortal:Q568K7 GeneID:550326 KEGG:dre:550326
            HOGENOM:HOG000007373 HOVERGEN:HBG105018 InParanoid:Q568K7
            NextBio:20879584 ArrayExpress:Q568K7 Uniprot:Q568K7
        Length = 546

 Score = 493 (178.6 bits), Expect = 4.2e-47, P = 4.2e-47
 Identities = 113/299 (37%), Positives = 168/299 (56%)

Query:    64 NGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRL 123
             N M H E+    F  N+R++   N    ++ + +N  AD + +E  +M  G +   K   
Sbjct:   256 NEMEHEEREHN-FVHNIRYVHSMNRAGLSFSLSVNHLADRSQKEL-SMMRGCQRTHKVHR 313

Query:   124 MKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTG 183
                   S+  +       P SVDWR  GAV PVKDQ  CGSCW+F+T   +EG   + TG
Sbjct:   314 KAQPFPSEIRSIAT----PNSVDWRLYGAVTPVKDQAVCGSCWSFATTGTLEGALFLKTG 369

Query:   184 ELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDY-PYLGAENKCDPSR 241
             +L SLS+Q LVDC     N GC+GG    AF++I+++GG+ + + Y  Y+G    C   +
Sbjct:   370 QLTSLSQQMLVDCTWGFGNNGCDGGEEWRAFEWIMKHGGISTAESYGAYMGMNGLCHYDK 429

Query:   242 RNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVF-TGECGSA- 298
              ++ V  + GY +V+  D ++LK A+    PV+V+I+A  R+F  Y +GV+   EC +  
Sbjct:   430 -SSMVAQLTGYTNVTSGDILALKAAIFKFGPVAVSIDAAHRSFAFYSNGVYYEPECKNGI 488

Query:   299 --LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASY 355
               LDH V+AVGYG  N   YWLV+NSW S WG +GY+ +  ++ D N   CG+A +A Y
Sbjct:   489 NDLDHAVLAVGYGIMNNESYWLVKNSWSSYWGNDGYILM--SMKDNN---CGVATDAIY 542


>MGI|MGI:1927229 [details] [associations]
            symbol:Ctsm "cathepsin M" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008152 "metabolic process" evidence=ISS] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015645 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1927229 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF202528
            EMBL:AY014777 EMBL:AY057446 EMBL:AK005550 EMBL:AK005428
            IPI:IPI00131133 RefSeq:NP_071721.2 UniGene:Mm.279933
            ProteinModelPortal:Q9JL96 SMR:Q9JL96 STRING:Q9JL96 MEROPS:C01.023
            PRIDE:Q9JL96 DNASU:64139 Ensembl:ENSMUST00000099451 GeneID:64139
            KEGG:mmu:64139 UCSC:uc007qwj.1 CTD:64139 InParanoid:Q9JL96
            KO:K09600 OrthoDB:EOG4TTGKR NextBio:319931 Bgee:Q9JL96
            CleanEx:MM_CTSM Genevestigator:Q9JL96 GermOnline:ENSMUSG00000074484
            GermOnline:ENSMUSG00000074871 PANTHER:PTHR12411:SF58 Uniprot:Q9JL96
        Length = 333

 Score = 489 (177.2 bits), Expect = 1.1e-46, P = 1.1e-46
 Identities = 120/324 (37%), Positives = 177/324 (54%)

Query:    46 DEVMTI-YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVG----LNKF 100
             D ++ + +Q W  K+GK  +     +KR  +++DN++ I  HN  N   K G    +N F
Sbjct:    22 DPILDVEWQKWKIKYGKAYSLEEEGQKR-AVWEDNMKKIKLHNGENGLGKHGFTMEMNAF 80

Query:   101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
              D+T EE+R + +       +   K K   +R +      LP+ ++W+++G V PV+ QG
Sbjct:    81 GDMTLEEFRKVMIEIPVPTVK---KGKSVQKRLSVN----LPKFINWKKRGYVTPVQTQG 133

Query:   161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQFIIQN 219
              C SCWAFS   A+EG     TG+LI LS Q LVDC R + N GC  G    A  ++++N
Sbjct:   134 RCNSCWAFSVTGAIEGQMFRKTGQLIPLSVQNLVDCSRPQGNWGCYLGNTYLALHYVMEN 193

Query:   220 GGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEA 278
             GG++SE  YPY   +  C  S  N+   +I G+E V P +E +L  AVA   P+SVAI+A
Sbjct:   194 GGLESEATYPYEEKDGSCRYSPENS-TANITGFEFV-PKNEDALMNAVASIGPISVAIDA 251

Query:   279 GGRAFQHYESGVF-TGECGSAL-DHGVVAVGYG----TENGVDYWLVRNSWGSDWGENGY 332
                +F  Y+ G++    C S +  H ++ VGYG      +G  YWLV+NS G+ WG  GY
Sbjct:   252 RHASFLFYKRGIYYEPNCSSCVVTHSMLLVGYGFTGRESDGRKYWLVKNSMGTQWGNKGY 311

Query:   333 VKLQRNLLDTNTGKCGIAMEASYP 356
             +K+ R+        CGIA  A YP
Sbjct:   312 MKISRD----KGNHCGIATYALYP 331


>TAIR|locus:2082687 [details] [associations]
            symbol:AT3G54940 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002686 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HSSP:P53634
            OMA:GGGLMTN EMBL:AY070063 IPI:IPI00528988 RefSeq:NP_567010.5
            UniGene:At.28412 ProteinModelPortal:Q8VYS0 SMR:Q8VYS0 PRIDE:Q8VYS0
            EnsemblPlants:AT3G54940.2 GeneID:824659 KEGG:ath:AT3G54940
            TAIR:At3g54940 PhylomeDB:Q8VYS0 ProtClustDB:CLSN2718801
            ArrayExpress:Q8VYS0 Genevestigator:Q8VYS0 Uniprot:Q8VYS0
        Length = 367

 Score = 489 (177.2 bits), Expect = 1.1e-46, P = 1.1e-46
 Identities = 115/325 (35%), Positives = 174/325 (53%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
             ++ +++ +GK  +       R  IF  N+    EH  ++ +   G+ +F+DLT EE++ M
Sbjct:    51 FRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAAEHQMMDPSAVHGVTQFSDLTEEEFKRM 110

Query:   112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTV 171
             Y G       R     V ++    +  D LPE  DWREKG V  VK+QG+CGSCWAFST 
Sbjct:   111 YTGVADVGGSR--GGTVGAEAPMVEV-DGLPEDFDWREKGGVTEVKNQGACGSCWAFSTT 167

Query:   172 AAVEGINKIVTGELISLSEQELVDCDRKINA--------GCNGGLMDYAFQFIIQNGGMD 223
              A EG + + TG+L+SLSEQ+LVDCD+  +         GC GGLM  A++++++ GG++
Sbjct:   168 GAAEGAHFVSTGKLLSLSEQQLVDCDQACDPKDKKACDNGCGGGLMTNAYEYLMEAGGLE 227

Query:   224 SEQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLK-KAVADQPVSVAIEAGG 280
              E+ YPY G    C  DP +   +V++        P DE  +    V   P++V + A  
Sbjct:   228 EERSYPYTGKRGHCKFDPEKVAVRVLNFT----TIPLDENQIAANLVRHGPLAVGLNAV- 282

Query:   281 RAFQHYESGVFTGE-CGSA-LDHGVVAVGYGTE-------NGVDYWLVRNSWGSDWGENG 331
                Q Y  GV     C    ++HGV+ VGYG++       +   YW+++NSWG  WGENG
Sbjct:   283 -FMQTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENG 341

Query:   332 YVKLQR--NLLDTNTGKCGIAMEAS 354
             Y KL R  ++   N+    +A + S
Sbjct:   342 YYKLCRGHDICGINSMVSAVATQVS 366


>FB|FBgn0034229 [details] [associations]
            symbol:CG4847 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0032504
            "multicellular organism reproduction" evidence=IEP] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0005615 "extracellular space"
            evidence=ISM;IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:AE013599 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 GO:GO:0032504 GeneTree:ENSGT00560000076599
            KO:K01371 EMBL:BT099507 RefSeq:NP_725686.1 UniGene:Dm.4677
            SMR:A1ZAU4 IntAct:A1ZAU4 MEROPS:C01.A28 EnsemblMetazoa:FBtr0086935
            GeneID:36973 KEGG:dme:Dmel_CG4847 UCSC:CG4847-RB
            FlyBase:FBgn0034229 InParanoid:A1ZAU4 OMA:GGFQEYA OrthoDB:EOG4J9KFC
            ChiTaRS:CG4847 GenomeRNAi:36973 NextBio:801302 Uniprot:A1ZAU4
        Length = 420

 Score = 488 (176.8 bits), Expect = 1.4e-46, P = 1.4e-46
 Identities = 115/278 (41%), Positives = 155/278 (55%)

Query:    92 TYKVGLNKFADLTNEEYRAMYLGT-RS-DAKRRLMKSKVASQRYACKAGDELPESVDWRE 149
             T+K  +N FADLT+ E+ +   G  RS +AK R      AS +        +P++ DWRE
Sbjct:   156 TFKQAVNAFADLTHSEFLSQLTGLKRSPEAKARA----AASLKLVNLPAKPIPDAFDWRE 211

Query:   150 KGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDC----DRKINAGCN 205
              G V PVK QG+CGSCWAF+T  A+EG     TG L +LSEQ LVDC    D  +N GC+
Sbjct:   212 HGGVTPVKFQGTCGSCWAFATTGAIEGHTFRKTGSLPNLSEQNLVDCGPVEDFGLN-GCD 270

Query:   206 GGLMDYAFQFIIQ-NGGMDSEQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMS 262
             GG  + AF FI +   G+  E  YPY+  +  C  D S+  A   ++ G+  + P DE  
Sbjct:   271 GGFQEAAFCFIDEVQKGVSQEGAYPYIDNKGTCKYDGSKSGA---TLQGFAAIPPKDEEQ 327

Query:   263 LKKAVADQ-PVSVAIEAGGRAFQHYESGVFTG-ECGSAL-DHGVVAVGYGTENGVDYWLV 319
             LKK VA   PV+ ++  G    ++Y  G++   EC     +H ++ VGYG+E G DYW+V
Sbjct:   328 LKKVVATLGPVACSVN-GLETLKNYAGGIYNDDECNKGEPNHSILVVGYGSEKGQDYWIV 386

Query:   320 RNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
             +NSW   WGE GY +L R         C IA E SYPV
Sbjct:   387 KNSWDDTWGEKGYFRLPRG-----KNYCFIAEECSYPV 419


>DICTYBASE|DDB_G0272742 [details] [associations]
            symbol:DDB_G0272742 species:44689 "Dictyostelium
            discoideum" [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0272742 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639 EMBL:AAFI02000008
            eggNOG:NOG331187 RefSeq:XP_644986.1 ProteinModelPortal:Q7KWP5
            PRIDE:Q7KWP5 EnsemblProtists:DDB0168242 GeneID:8618663
            KEGG:ddi:DDB_G0272742 InParanoid:Q7KWP5 OMA:ATESAHF Uniprot:Q7KWP5
        Length = 345

 Score = 487 (176.5 bits), Expect = 1.8e-46, P = 1.8e-46
 Identities = 119/337 (35%), Positives = 181/337 (53%)

Query:    39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLN 98
             S S  T+ +    +  W+  + +T         R+  FK NL FI++ NS      + LN
Sbjct:    16 SFSKLTEIQYRNEFTAWMTSNQRTY-ASSEFTNRYNTFKSNLDFINQWNSKGSKTVLALN 74

Query:    99 KFADLTNEEYRAMYLGTRSDAKRR---LMKSKVASQRYACKAGDELPESVDWREKGAVNP 155
             +FAD++NEEYR  YL   ++  +    L+  K   +  +  +       +DWR+KGAV  
Sbjct:    75 EFADISNEEYRKNYLRNDNNINKLSSLLINDKEDKEIKSSSSSGSGSSGIDWRKKGAVPS 134

Query:   156 VKDQ-GSCGSCWAFSTVAAVEGINKIVTGE--LISLSEQELVDCDRKINAGCNGGLMDYA 212
             VK Q G CGS W  + V A E  + +   +   ISLS Q L+DC   +N  C  G ++ A
Sbjct:   135 VKSQIGGCGS-WPITAVGATESAHFLANPKDPFISLSMQNLIDCSN-LNKQCYQGTVNEA 192

Query:   213 FQFIIQNGGMDSEQDYPYLGAE-NKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQP 271
             FQ+II+NGG+DSE+ Y + G E  KC  +  N+ V  I  YE V    E SL+ AV+ +P
Sbjct:   193 FQYIIENGGIDSEESYKFSGGEPGKCKYNSSNS-VAKITSYEKVKSGSESSLESAVSLKP 251

Query:   272 VSVAIEAGGRAFQHYESGVF-TGECGSA-LDHGVVAVGYGT---------ENGVDYWLVR 320
             V+  I+A   +FQ Y SG++    C S  L+H ++ VG+           ++  +YW+V+
Sbjct:   252 VAAYIDASLSSFQFYSSGIYYEPSCNSTDLNHSILIVGFSDFSTTPTDSLKHSSNYWIVQ 311

Query:   321 NSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
             NS+G +WGENGY+ + ++  D N   CGI+  ASY +
Sbjct:   312 NSFGKNWGENGYIFMSKDR-DDN---CGISKMASYVI 344


>FB|FBgn0033874 [details] [associations]
            symbol:CG6347 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE013599 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 HSSP:P53634 EMBL:AY069609
            RefSeq:NP_610906.1 UniGene:Dm.608 SMR:Q7K0S6 MEROPS:C01.A29
            EnsemblMetazoa:FBtr0087637 GeneID:36531 KEGG:dme:Dmel_CG6347
            UCSC:CG6347-RA FlyBase:FBgn0033874 InParanoid:Q7K0S6 OMA:FEYIRDH
            OrthoDB:EOG4FQZ74 GenomeRNAi:36531 NextBio:799046 Uniprot:Q7K0S6
        Length = 352

 Score = 486 (176.1 bits), Expect = 2.3e-46, P = 2.3e-46
 Identities = 106/276 (38%), Positives = 156/276 (56%)

Query:    93 YKVGLNKFADLTNEEYRAMYLGTR-SDAKRRLMKSKVASQRYACKAGDELPESVDWREKG 151
             +++G+N  AD+T +E  A  LG++ S+   R     +        A   LPE  DWREKG
Sbjct:    82 FRLGVNTLADMTRKEI-ATLLGSKISEFGERYTNGHINFVTARNPASANLPEMFDWREKG 140

Query:   152 AVNPVKDQG-SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDC-DRKINAGCNGGLM 209
              V P   QG  CG+CW+F+T  A+EG     TG L SLS+Q LVDC D   N GC+GG  
Sbjct:   141 GVTPPGFQGVGCGACWSFATTGALEGHLFRRTGVLASLSQQNLVDCADDYGNMGCDGGFQ 200

Query:   210 DYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK-----VVSIDGYEDVSPFDEMSLK 264
             +Y F++I ++ G+     YPY   E +C  +    +     +V I  Y  ++P DE  +K
Sbjct:   201 EYGFEYI-RDHGVTLANKYPYTQTEMQCRQNETAGRPPRESLVKIRDYATITPGDEEKMK 259

Query:   265 KAVADQ-PVSVAIEAGGRAFQHYESGVFTGE-CGSA-LDHGVVAVGYGTENGVDYWLVRN 321
             + +A   P++ ++ A   +F+ Y  G++  E C    L+H V  VGYGTENG DYW+++N
Sbjct:   260 EVIATLGPLACSMNADTISFEQYSGGIYEDEECNQGELNHSVTVVGYGTENGRDYWIIKN 319

Query:   322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
             S+  +WGE G++++ RN      G CGIA E SYP+
Sbjct:   320 SYSQNWGEGGFMRILRNA----GGFCGIASECSYPI 351


>UNIPROTKB|E9PTT3 [details] [associations]
            symbol:Ctsr "Protein Ctsr" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            IPI:IPI00627092 Ensembl:ENSRNOT00000024115 RGD:631422
            Uniprot:E9PTT3
        Length = 334

 Score = 483 (175.1 bits), Expect = 4.8e-46, P = 4.8e-46
 Identities = 117/304 (38%), Positives = 171/304 (56%)

Query:    67 GHNEKRFQIFKDNLRFIDEHNSLNRTYKVG----LNKFADLTNEEYRAMYLGTRSDAKRR 122
             GH   R  ++++N++ I  HN  N   K G    +N+F DLT EE+R M +     + R 
Sbjct:    46 GH---RRAVWEENMKMIKLHNRENSLGKNGFIMEMNEFGDLTAEEFRKMMVNIPIRSHR- 101

Query:   123 LMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG--INKI 180
               K K+  +R     G+ LP+ VDWR+KG V  V++Q  C SCWAF+   A+EG   NK 
Sbjct:   102 --KGKIIRKR---DVGNVLPKFVDWRKKGYVTRVQNQKFCNSCWAFAVTGAIEGQMFNK- 155

Query:   181 VTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
              TG+L  LS Q LVDC +   N GC  G    A+++++ NGG+++E  YPY G E  C  
Sbjct:   156 -TGQLTPLSVQNLVDCTKSQGNEGCQWGDPHIAYEYVLNNGGLEAEATYPYKGKEGVCRY 214

Query:   240 SRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTG-ECGS 297
             + +++K   I G+  + P  E  L +AVA   P+SVA++A   +F  Y+ G++    C +
Sbjct:   215 NPKHSKA-EITGFVSL-PESEDILMEAVATIGPISVAVDASFNSFGFYKKGLYDEPNCSN 272

Query:   298 -ALDHGVVAVGYGTE----NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAME 352
               ++H V+ VGYG E    +G  YWL++NSWG  WG  GY+K+ +   D N   C IA  
Sbjct:   273 NTVNHSVLVVGYGFEGNETDGNSYWLIKNSWGRKWGLRGYMKIPK---DQNNF-CAIASY 328

Query:   353 ASYP 356
             A YP
Sbjct:   329 AHYP 332


>UNIPROTKB|F1P3U9 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005829 "cytosol" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=IEA] [GO:0010628 "positive regulation of gene expression"
            evidence=IEA] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=IEA] [GO:0010813 "neuropeptide catabolic
            process" evidence=IEA] [GO:0010815 "bradykinin catabolic process"
            evidence=IEA] [GO:0016505 "apoptotic protease activator activity"
            evidence=IEA] [GO:0030108 "HLA-A specific activating MHC class I
            receptor activity" evidence=IEA] [GO:0031638 "zymogen activation"
            evidence=IEA] [GO:0031648 "protein destabilization" evidence=IEA]
            [GO:0032526 "response to retinoic acid" evidence=IEA] [GO:0033619
            "membrane protein proteolysis" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0043129
            "surfactant homeostasis" evidence=IEA] [GO:0045766 "positive
            regulation of angiogenesis" evidence=IEA] [GO:0060448 "dichotomous
            subdivision of terminal units involved in lung branching"
            evidence=IEA] [GO:0070324 "thyroid hormone binding" evidence=IEA]
            [GO:0070371 "ERK1 and ERK2 cascade" evidence=IEA] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA]
            [GO:0097208 "alveolar lamellar body" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066
            GO:GO:0005615 GO:GO:0008284 GO:GO:0070371 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0032526 GO:GO:0010628
            GO:GO:0070324 GO:GO:0016505 GO:GO:0010634 GO:GO:0004197
            GO:GO:0042599 GO:GO:0031648 GO:GO:0097067 GO:GO:0031638
            GO:GO:0001913 GeneTree:ENSGT00660000095458 OMA:STSCHKT
            GO:GO:0030108 GO:GO:0010815 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 EMBL:AADN02038832 EMBL:AADN02038831 IPI:IPI00594147
            Ensembl:ENSGALT00000013440 Uniprot:F1P3U9
        Length = 261

 Score = 478 (173.3 bits), Expect = 1.6e-45, P = 1.6e-45
 Identities = 107/272 (39%), Positives = 152/272 (55%)

Query:    93 YKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGA 152
             + V LN+F+D+T  E++ +YL +         ++  A++    ++    PE+VDWR+KG 
Sbjct:     1 FLVALNQFSDMTFAEFKKLYLWSEP-------QNCSATRGNFLRSDGPCPEAVDWRKKGN 53

Query:   153 -VNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMD 210
              V PVK+QG CGSCW FST   +E    I TG+L+SL+EQ LVDC +  N  GC+GGL  
Sbjct:    54 FVTPVKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQLLVDCAQAFNNHGCSGGLPS 113

Query:   211 YAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ 270
              AF++I+ N G+  E  YPY      C      A +  +    +++ +DE  + +AV   
Sbjct:   114 QAFEYILYNKGLMGEDAYPYRAQNGTCKFQPDKA-IAFVKDVINITQYDEAGMVEAVGKH 172

Query:   271 -PVSVAIEAGGRAFQHYESGVFTG-ECGSALD---HGVVAVGYGTENGVDYWLVRNSWGS 325
              PVS A E     F HY  GV++   C    D   H V+AVGYG E+G  YW+V+NSWG 
Sbjct:   173 NPVSFAFEVTSD-FMHYRKGVYSNPRCEHTPDKVNHAVLAVGYGEEDGRPYWIVKNSWGP 231

Query:   326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
              WG +GY  ++R         CG+A  ASYPV
Sbjct:   232 LWGMDGYFLIERG-----KNMCGLAACASYPV 258


>MGI|MGI:1860262 [details] [associations]
            symbol:Cts7 "cathepsin 7" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005576
            "extracellular region" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005768 "endosome" evidence=IEA]
            [GO:0005794 "Golgi apparatus" evidence=IEA] [GO:0006508
            "proteolysis" evidence=ISA] [GO:0007049 "cell cycle" evidence=IEA]
            [GO:0007067 "mitosis" evidence=IEA] [GO:0008152 "metabolic process"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=ISA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0051301 "cell
            division" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:1860262 GO:GO:0005634 GO:GO:0005794 GO:GO:0048471
            GO:GO:0005615 GO:GO:0051301 GO:GO:0007067 GO:GO:0005768
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GO:GO:0008233 EMBL:CH466546
            EMBL:AY014779 EMBL:CT030645 EMBL:BC064740 EMBL:AF250837
            IPI:IPI00131132 RefSeq:NP_062412.1 UniGene:Mm.3692 HSSP:O60911
            ProteinModelPortal:Q91ZF2 SMR:Q91ZF2 STRING:Q91ZF2 MEROPS:C01.016
            PRIDE:Q91ZF2 Ensembl:ENSMUST00000021892 GeneID:56092 KEGG:mmu:56092
            UCSC:uc007qwi.1 CTD:56092 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 InParanoid:Q91ZF2 OMA:ERRVIWE OrthoDB:EOG44QT2S
            NextBio:311908 Bgee:Q91ZF2 Genevestigator:Q91ZF2 Uniprot:Q91ZF2
        Length = 331

 Score = 469 (170.2 bits), Expect = 1.5e-44, P = 1.5e-44
 Identities = 109/316 (34%), Positives = 169/316 (53%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEH---NSL-NRTYKVGLNKFADLTNEE 107
             ++ W   + +T +     ++R  +++ N+++I +H   N L    + + +N+F D+T EE
Sbjct:    29 WEEWKRSNDRTYSPEEEKQRR-AVWEGNVKWIKQHIMENGLWMNNFTIEMNEFGDMTGEE 87

Query:   108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
              + +   T S +        + + ++  K   ++P ++DWR++G V PV+ QGSCG+CWA
Sbjct:    88 MKML---TESSSY------PLRNGKHIQKRNPKIPPTLDWRKEGYVTPVRRQGSCGACWA 138

Query:   168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQ 226
             FS  A +EG     TG+LI LS Q L+DC       GC+GG    AFQ++  NGG+++E 
Sbjct:   139 FSVTACIEGQLFKKTGKLIPLSVQNLMDCSVSYGTKGCDGGRPYDAFQYVKNNGGLEAEA 198

Query:   227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
              YPY      C   R    VV ++ +  V   +E  L+  V   P++VAI+    +F  Y
Sbjct:   199 TYPYEAKAKHCR-YRPERSVVKVNRFFVVPRNEEALLQALVTHGPIAVAIDGSHASFHSY 257

Query:   287 ESGVF-TGECGS-ALDHGVVAVGYGTENGVD----YWLVRNSWGSDWGENGYVKLQRNLL 340
               G++   +C    LDHG++ VGYG E        YWL++NS G  WGENGY+KL R   
Sbjct:   258 RGGIYHEPKCRKDTLDHGLLLVGYGYEGHESENRKYWLLKNSHGERWGENGYMKLPRG-- 315

Query:   341 DTNTGKCGIAMEASYP 356
                   CGIA  A YP
Sbjct:   316 --QNNYCGIASYAMYP 329


>RGD|1309226 [details] [associations]
            symbol:Cts7 "cathepsin 7" species:10116 "Rattus norvegicus"
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005768 "endosome" evidence=IEA] [GO:0005794 "Golgi apparatus"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0007067
            "mitosis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IEA] [GO:0051301 "cell division" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 RGD:1309226 GO:GO:0005634
            GO:GO:0005794 GO:GO:0048471 GO:GO:0005615 GO:GO:0051301
            GO:GO:0007067 GO:GO:0005768 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 MEROPS:C01.016 CTD:56092
            GeneTree:ENSGT00560000076577 OrthoDB:EOG44QT2S EMBL:CH474032
            IPI:IPI00870531 RefSeq:NP_001099569.1 UniGene:Rn.218615
            Ensembl:ENSRNOT00000043686 GeneID:290970 KEGG:rno:290970
            UCSC:RGD:1309226 OMA:VESFNAN Uniprot:D3ZZ07
        Length = 331

 Score = 468 (169.8 bits), Expect = 1.9e-44, P = 1.9e-44
 Identities = 117/327 (35%), Positives = 173/327 (52%)

Query:    43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLN----RTYKVGLN 98
             R D  +   ++ W   + KT +     ++R  ++++N++ I  H   N      + + +N
Sbjct:    20 RPDYSLDAEWEEWKRNNAKTYSPEEEKQRR-AVWEENVKMIKWHTMQNGLWMNNFTIEMN 78

Query:    99 KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKD 158
             +F D+T EE R M   T S A   L   K   +R       ++P+++DWR+ G V PV+ 
Sbjct:    79 EFGDMTGEEMRMM---TDSSALT-LRNGKHIQKRNV-----KIPKTLDWRDTGCVAPVRS 129

Query:   159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFII 217
             QG CG+CWAFS  A++E      TG+LI LS Q L+DC     N  C+GG    AFQ++ 
Sbjct:   130 QGGCGACWAFSVAASIESQLFKKTGKLIPLSVQNLIDCTVTYGNNDCSGGKPYTAFQYVK 189

Query:   218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAI 276
              NGG+++E  YPY      C   R    VV I  +  V P +E +L +A+    P++VAI
Sbjct:   190 NNGGLEAEATYPYEAKLRHCR-YRPERSVVKIARFF-VVPRNEEALMQALVTYGPIAVAI 247

Query:   277 EAGGRAFQHYESGVF-TGECG-SALDHGVVAVGYGTENGVD----YWLVRNSWGSDWGEN 330
             +    +F+ Y  G++   +C    LDHG++ VGYG E        YWL++NS G  WGE 
Sbjct:   248 DGSHASFKRYRGGIYHEPKCRRDTLDHGLLLVGYGYEGHESENRKYWLLKNSHGEQWGER 307

Query:   331 GYVKLQRNLLDTNTGKCGIAMEASYPV 357
             GY+KL R   D N   CGIA  A YP+
Sbjct:   308 GYMKLPR---DQNN-YCGIASYAMYPL 330


>WB|WBGene00019986 [details] [associations]
            symbol:R09F10.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            EMBL:FO081137 HSSP:P53634 PIR:D89588 RefSeq:NP_509408.1
            ProteinModelPortal:Q23030 SMR:Q23030 STRING:Q23030 MEROPS:C01.A44
            PaxDb:Q23030 EnsemblMetazoa:R09F10.1 GeneID:181087
            KEGG:cel:CELE_R09F10.1 UCSC:R09F10.1 CTD:181087 WormBase:R09F10.1
            InParanoid:Q23030 OMA:EYPYSAL NextBio:912346 Uniprot:Q23030
        Length = 383

 Score = 466 (169.1 bits), Expect = 3.1e-44, P = 3.1e-44
 Identities = 110/315 (34%), Positives = 169/315 (53%)

Query:    51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRA 110
             ++  ++ K  +    +   E R+QIF  N+   +     N    + +N+F D T+EE + 
Sbjct:    81 MFNDFILKFDRKYTSVEEFEYRYQIFLRNVIEFEAEEERNLGLDLDVNEFTDWTDEELQK 140

Query:   111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
             M    +   K      K     Y  + G   P S+DWRE+G + P+K+QG CGSCWAF+T
Sbjct:   141 MVQENKY-TKYDFDTPKFEGS-YL-ETGVIRPASIDWREQGKLTPIKNQGQCGSCWAFAT 197

Query:   171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
             VA+VE  N I  G+L+SLSEQE+VDCD + N GC+GG   YA +F+ +NG ++SE++YPY
Sbjct:   198 VASVEAQNAIKKGKLVSLSEQEMVDCDGR-NNGCSGGYRPYAMKFVKENG-LESEKEYPY 255

Query:   231 LGAEN-KCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESG 289
                ++ +C     + +V  ID +  +S  +E          PV+  +    +A   Y SG
Sbjct:   256 SALKHDQCFLKENDTRVF-IDDFRMLSNNEEDIANWVGTKGPVTFGMNVV-KAMYSYRSG 313

Query:   290 VFTG---ECG--SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
             +F     +C   S   H +  +GYG E    YW+V+NSWG+ WG +GY +L R +   N+
Sbjct:   314 IFNPSVEDCTEKSMGAHALTIIGYGGEGESAYWIVKNSWGTSWGASGYFRLARGV---NS 370

Query:   345 GKCGIAMEASYPVKN 359
               CG+A     P+ N
Sbjct:   371 --CGLANTVVAPIIN 383


>UNIPROTKB|F1NT07 [details] [associations]
            symbol:LOC100857883 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 EMBL:AADN02044012
            EMBL:AADN02044013 EMBL:AADN02044014 IPI:IPI00577314
            Ensembl:ENSGALT00000000192 OMA:IYKHGPV Uniprot:F1NT07
        Length = 317

 Score = 464 (168.4 bits), Expect = 5.0e-44, P = 5.0e-44
 Identities = 112/297 (37%), Positives = 165/297 (55%)

Query:    70 EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS-DAKRRLMKSKV 128
             E R +IF  ++RF+   N    +Y + LN  AD T +E  A+    RS D    L     
Sbjct:    30 EHRQRIFAHHMRFVHSKNRAALSYSLALNHLADRTPQEMAALRGRRRSGDPNHGL---PF 86

Query:   129 ASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
              ++ Y    G  LPES+DWR  GAV PVKDQ  CGSCW+F+T  A+EG   + TG L  L
Sbjct:    87 PAEHYT---GIILPESLDWRMYGAVTPVKDQAVCGSCWSFATTGAMEGALFLKTGVLTPL 143

Query:   189 SEQELVDCD-RKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY--LGAENKCDPSRRNAK 245
             S+Q L+DC   K N  C+GG    A  +I ++GG+ S +  P   L  +N      ++  
Sbjct:   144 SQQVLIDCSWGKGNYACDGGEEWRAKGWIKKHGGIASTESPPSFPLVLQNGLCHYNQSEM 203

Query:   246 VVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVF-TGECGSA---LD 300
             +  I GY +V+  +  ++K A+    PV+V+I+A  + F  Y +G++   +C +    LD
Sbjct:   204 LAKITGYVNVTSGNITAVKTAIYKHGPVAVSIDASHKTFSFYSNGIYYEPKCANKPGQLD 263

Query:   301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
             H V+AVGYG   G  YWL++NSW + WG +GY+ +   + D N   CG+A EA+YP+
Sbjct:   264 HAVLAVGYGVLQGETYWLIKNSWSTYWGNDGYILMA--MKDNN---CGVATEATYPI 315


>UNIPROTKB|G3V9F8 [details] [associations]
            symbol:Ctsm "RCG24133" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015645 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 EMBL:CH474032
            PANTHER:PTHR12411:SF58 Ensembl:ENSRNOT00000045830 RGD:631420
            Uniprot:G3V9F8
        Length = 333

 Score = 462 (167.7 bits), Expect = 8.1e-44, P = 8.1e-44
 Identities = 119/330 (36%), Positives = 174/330 (52%)

Query:    39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVG-- 96
             SSS   D  +   +Q W  K+ KT +     +KR  ++++N++ I  HN  N   K G  
Sbjct:    16 SSSPAPDPVLDAEWQKWKIKYEKTYSLEEEGQKR-AVWEENMKKIKLHNGENGLGKHGFT 74

Query:    97 --LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
               +N F D+T EE+R + +       +   K     +R A      +P  ++WR++G V 
Sbjct:    75 MEMNAFGDMTIEEFRKLMIEIPIPTVK---KENSVQKRQAVN----VPNFINWRKRGYVT 127

Query:   155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAF 213
             PV+ QG C  CWAFS   A+EG     TG+LI LS Q LVDC R + N GC  G    A 
Sbjct:   128 PVRRQGRCNVCWAFSVAGAIEGQMFQKTGQLIPLSVQNLVDCSRPQGNLGCYLGNTYLAL 187

Query:   214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PV 272
             Q++ +NGG++SE  YPY   E  C     N+   SI  +E V P +E +L  AVA   P+
Sbjct:   188 QYVKENGGLESEATYPYEEKEGSCRYHPDNS-TASITDFEFV-PKNEDALMNAVATLGPI 245

Query:   273 SVAIEAGGRAFQHYESGVF-TGECGSAL-DHGVVAVGYG----TENGVDYWLVRNSWGSD 326
             SVAI+A   +F  Y +G++    C S++  H ++ VGYG      +G  YW+++NS G+ 
Sbjct:   246 SVAIDARHESFLFYRNGIYHEPNCSSSVVTHAMLLVGYGFVGEESDGRKYWILKNSMGNK 305

Query:   327 WGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
             WG  GY+K+ ++        CGIA  A YP
Sbjct:   306 WGNRGYMKIAKD----QGNHCGIATYALYP 331


>DICTYBASE|DDB_G0282991 [details] [associations]
            symbol:DDB_G0282991 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0282991 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            EMBL:AAFI02000049 eggNOG:NOG331187 RefSeq:XP_639299.1
            ProteinModelPortal:Q54RQ2 EnsemblProtists:DDB0185304 GeneID:8623870
            KEGG:ddi:DDB_G0282991 InParanoid:Q54RQ2 OMA:PENGNEY Uniprot:Q54RQ2
        Length = 339

 Score = 458 (166.3 bits), Expect = 2.2e-43, P = 2.2e-43
 Identities = 113/318 (35%), Positives = 170/318 (53%)

Query:    47 EVMTIYQTWLAKHGKT-SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTN 105
             E+  ++  W  K+ K  SN   +   RF  FK N  ++D+ N       + LN FADL+ 
Sbjct:    22 EIENLFIEWTNKYNKIYSNKEFY--MRFNNFKKNKEYVDQWNEKQLETILELNFFADLSR 79

Query:   106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC-GS 164
              EY   YL +  D      K+            + + +S+DWR   AV PVK+QG C G+
Sbjct:    80 NEYINNYLASFIDISNIEQKNTKYEGNLKNNFNNSI-KSIDWRNFDAVTPVKNQGLCSGA 138

Query:   165 CWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMD 223
              ++FS +  +E  + I   ELI+LSEQ ++DC   + N GC GGL   AF +II+  G+D
Sbjct:   139 GYSFSAIGVIESSHFIKNKELITLSEQNIIDCTTDMGNNGCMGGLALIAFDYIIKQKGID 198

Query:   224 SEQDYPYLG-------AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAI 276
             SE +YPY G          +C  +   +K  SI  Y ++  F+E  L +++   PVSV I
Sbjct:   199 SEFNYPYEGYLIEPYEGRGRCRYNSFYSKA-SISSYIEIERFNENELTQSLIKSPVSVMI 257

Query:   277 EAGGRAFQHYESGVFTG-ECGSA-LDHGVVAVGYGT--ENGVDYWLVRNSWGSDWGENGY 332
             +A   +F  Y+SGV+    C S  L+HG++ +G+G   ENG +Y++++NS+GS WG  GY
Sbjct:   258 DASQLSFMLYKSGVYKDPSCSSTILNHGILNIGFGVTPENGNEYYILKNSFGSKWGMKGY 317

Query:   333 VKLQRNLLDTNTGKCGIA 350
             + L RN        CGI+
Sbjct:   318 IYLSRNF----NNHCGIS 331


>FB|FBgn0032228 [details] [associations]
            symbol:CG5367 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE014134 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 HSSP:P80067
            RefSeq:NP_609387.1 UniGene:Dm.26782 ProteinModelPortal:Q9VKY4
            SMR:Q9VKY4 MEROPS:C01.A30 EnsemblMetazoa:FBtr0080055 GeneID:34401
            KEGG:dme:Dmel_CG5367 UCSC:CG5367-RA FlyBase:FBgn0032228
            InParanoid:Q9VKY4 OMA:QIVDCSV OrthoDB:EOG4THT8X PhylomeDB:Q9VKY4
            GenomeRNAi:34401 NextBio:788324 ArrayExpress:Q9VKY4 Bgee:Q9VKY4
            Uniprot:Q9VKY4
        Length = 338

 Score = 443 (161.0 bits), Expect = 8.4e-42, P = 8.4e-42
 Identities = 102/305 (33%), Positives = 173/305 (56%)

Query:    68 HNEKR-FQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLGTRSDAKRR 122
             ++E R ++ F++N + I+EHN   +    ++++  N FAD++ + Y   +L        R
Sbjct:    51 YDEMRSYKAFEENFKVIEEHNQNYKEGQTSFRLKPNIFADMSTDGYLKGFL--------R 102

Query:   123 LMKSKV--ASQRYACKAGDEL----PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
             L+KS +  ++   A   G  L    PES+DWR KG + P  +Q SCGSC+AFS   ++ G
Sbjct:   103 LLKSNIEDSADNMAEIVGSPLMANVPESLDWRSKGFITPPYNQLSCGSCYAFSIAESIMG 162

Query:   177 INKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN 235
                  TG+++SLS+Q++VDC     N GC GG +     ++   GG+  +QDYPY+  + 
Sbjct:   163 QVFKRTGKILSLSKQQIVDCSVSHGNQGCVGGSLRNTLSYLQSTGGIMRDQDYPYVARKG 222

Query:   236 KCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTGE 294
             KC     +  VV++  +  +   DE +++ AV    PV+++I A  + FQ Y  G++   
Sbjct:   223 KCQ-FVPDLSVVNVTSWAILPVRDEQAIQAAVTHIGPVAISINASPKTFQLYSDGIYDDP 281

Query:   295 -CGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAME 352
              C SA ++H +V +G+G     DYW+++N WG +WGENGY+++++ +       CGIA  
Sbjct:   282 LCSSASVNHAMVVIGFGK----DYWILKNWWGQNWGENGYIRIRKGV-----NMCGIANY 332

Query:   353 ASYPV 357
             A+Y +
Sbjct:   333 AAYAI 337


>DICTYBASE|DDB_G0281077 [details] [associations]
            symbol:DDB_G0281077 species:44689 "Dictyostelium
            discoideum" [GO:0030246 "carbohydrate binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR008965 InterPro:IPR013128 InterPro:IPR019028
            Pfam:PF00112 Pfam:PF09478 PRINTS:PR00705 SMART:SM00645
            SMART:SM01063 InterPro:IPR000169 dictyBase:DDB_G0281077
            GO:GO:0030246 EMBL:AAFI02000040 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 SUPFAM:SSF49384 ProtClustDB:CLSZ2430562
            RefSeq:XP_640803.1 ProteinModelPortal:Q54UH3
            EnsemblProtists:DDB0203998 GeneID:8622857 KEGG:ddi:DDB_G0281077
            InParanoid:Q54UH3 OMA:LINDFNF Uniprot:Q54UH3
        Length = 662

 Score = 383 (139.9 bits), Expect = 9.3e-42, Sum P(2) = 9.3e-42
 Identities = 81/183 (44%), Positives = 109/183 (59%)

Query:   142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
             P S+DWR  G V+ VK+QGSCGSC+AFSTV A+E         +++LSEQ LVDC R   
Sbjct:   472 PISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALEAHYYRKNNRMLNLSEQNLVDCTRNYG 531

Query:   202 AG-CNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
              G C+GG M   F++I +NGG++ +  YPY G    C  +  +A+   I  Y  +   DE
Sbjct:   532 NGECSGGWMHNCFRYIKENGGINLQSTYPYEGRVGLCRYNSGDAQS-RISNYVMIKQHDE 590

Query:   261 MSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFTGE-CGS-ALDHGVVAVGYGTENGVDYW 317
               L  AVA   PVSVA +A  R F +Y SG++  + C      H VV VGYG ENGVD+W
Sbjct:   591 EDLANAVASVGPVSVAYDASTREFMYYSSGIYNSDSCDKYRTTHAVVVVGYGIENGVDFW 650

Query:   318 LVR 320
             +++
Sbjct:   651 IIK 653

 Score = 88 (36.0 bits), Expect = 9.3e-42, Sum P(2) = 9.3e-42
 Identities = 15/43 (34%), Positives = 32/43 (74%)

Query:    72 RFQIFKDNLRFIDEHN--SLNRTYKVGLNKFADLTNEEYRAMY 112
             +++ FKD+ RFI+++   + N T ++GL +F+D+T++E+  +Y
Sbjct:   181 KYEAFKDSSRFIEQYKRENQNSTMELGLTQFSDMTHDEFLNIY 223

 Score = 37 (18.1 bits), Expect = 2.1e-36, Sum P(2) = 2.1e-36
 Identities = 7/13 (53%), Positives = 8/13 (61%)

Query:    82 FIDEHNSLNRTYK 94
             FI   N  NRTY+
Sbjct:   162 FIQWSNQFNRTYR 174


>DICTYBASE|DDB_G0274385 [details] [associations]
            symbol:DDB_G0274385 "Cysteine proteinase 1,
            mitochondrial" species:44689 "Dictyostelium discoideum" [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0274385 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 EMBL:AAFI02000012 RefSeq:XP_644301.1
            ProteinModelPortal:Q86KD4 EnsemblProtists:DDB0167535 GeneID:8619729
            KEGG:ddi:DDB_G0274385 InParanoid:Q86KD4 OMA:SICVDAS Uniprot:Q86KD4
        Length = 358

 Score = 433 (157.5 bits), Expect = 9.6e-41, P = 9.6e-41
 Identities = 112/338 (33%), Positives = 163/338 (48%)

Query:    36 HDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYK 94
             H +     +D  +   +  W  KH K        E RF  FK+N++   E NS++    K
Sbjct:    28 HRNDGIIHSDSSMRDTFNHWAKKHSKIYKDSIEMENRFSNFKENMKKNIELNSMHAGKAK 87

Query:    95 VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR---------YACKAGDELPE-- 143
                N F+DL+ EE+   +L      K   +++ +  Q          Y      +L E  
Sbjct:    88 FESNGFSDLSEEEFSNFHLNKAFKGKPSHLRNSIKPQPTPHHSLINGYKEMENGDLNELY 147

Query:   144 SVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAG 203
             S+DWR+KG V PVKDQG CGSC+ FS V  +E        + I LSEQ+ VDCD   +  
Sbjct:   148 SIDWRKKGLVTPVKDQGQCGSCYIFSAVEQIETAWIKAGNKPILLSEQQAVDCD-PYDGQ 206

Query:   204 CNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF-DEMS 262
             C GG     +++  Q GG+ +   YPY   +  C    R   VVS   Y  V+   DE +
Sbjct:   207 CGGGDPYTVYEYFSQVGGVSTNAQYPYTATDGTCVNMSRAVPVVS---YHYVTQGGDENT 263

Query:   263 LKKAVA-DQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-----NGVDY 316
             L K +  D PVS+ ++A    +Q Y  G+ T  CG  +DH V  VG   +     N V Y
Sbjct:   264 LIKTIVNDGPVSICVDAS--TWQSYSGGIITTGCGKNIDHCVQVVGLEVDKTDPSNPVQY 321

Query:   317 WLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEAS 354
             +++RNSWG+DWG +GY+ +      T +  CGI  E++
Sbjct:   322 YIIRNSWGTDWGIDGYIYVA-----TGSDLCGITYEST 354


>DICTYBASE|DDB_G0281079 [details] [associations]
            symbol:DDB_G0281079 species:44689 "Dictyostelium
            discoideum" [GO:0030246 "carbohydrate binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR008965 InterPro:IPR013128 InterPro:IPR019028
            Pfam:PF00112 Pfam:PF09478 PRINTS:PR00705 SMART:SM00645
            SMART:SM01063 InterPro:IPR000169 dictyBase:DDB_G0281079
            GO:GO:0030246 EMBL:AAFI02000040 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 SUPFAM:SSF49384 RefSeq:XP_640804.1
            ProteinModelPortal:Q54UH2 EnsemblProtists:DDB0204000 GeneID:8622858
            KEGG:ddi:DDB_G0281079 InParanoid:Q54UH2 OMA:ALESHYY
            ProtClustDB:CLSZ2430562 Uniprot:Q54UH2
        Length = 664

 Score = 365 (133.5 bits), Expect = 1.0e-39, Sum P(2) = 1.0e-39
 Identities = 78/185 (42%), Positives = 107/185 (57%)

Query:   142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDC---DR 198
             P S+DWR  G V+ VK+QGSCGSC+AFSTV A+E         ++ LSEQ LVDC   ++
Sbjct:   471 PISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALESHYYRKNNRMLDLSEQNLVDCTASNK 530

Query:   199 KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF 258
               N GC+GG M   + +I +NGG++ E  YPY G   +C  +  +A+   I  +  +   
Sbjct:   531 YRNGGCSGGWMHNCYSYIQENGGINQESTYPYEGKFGQCRYNSGDAQS-RISKFVMIKQH 589

Query:   259 DEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFTGE-CGS-ALDHGVVAVGYGTENGVD 315
             DE  L   VA   PVSVA +A  R F +Y  G++  + C      H VV VGY  ENGVD
Sbjct:   590 DEEDLADTVASVGPVSVAYDASTREFMYYSRGIYYSDNCNKYRTTHAVVVVGYDNENGVD 649

Query:   316 YWLVR 320
             YW+++
Sbjct:   650 YWIIK 654

 Score = 88 (36.0 bits), Expect = 1.0e-39, Sum P(2) = 1.0e-39
 Identities = 15/43 (34%), Positives = 32/43 (74%)

Query:    72 RFQIFKDNLRFIDEHN--SLNRTYKVGLNKFADLTNEEYRAMY 112
             +++ FKD+ RFI+++   + N T ++GL +F+D+T++E+  +Y
Sbjct:   180 KYEAFKDSSRFIEQYKRENQNSTMELGLTQFSDMTHDEFLNVY 222

 Score = 37 (18.1 bits), Expect = 2.3e-34, Sum P(2) = 2.3e-34
 Identities = 7/13 (53%), Positives = 8/13 (61%)

Query:    82 FIDEHNSLNRTYK 94
             FI   N  NRTY+
Sbjct:   161 FIQWSNQFNRTYR 173


>FB|FBgn0037396 [details] [associations]
            symbol:CG11459 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE014297 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 KO:K01365 HSSP:P07711 EMBL:AY060710
            RefSeq:NP_649608.1 UniGene:Dm.3894 SMR:Q9VNK6 MEROPS:C01.A31
            EnsemblMetazoa:FBtr0078623 GeneID:40741 KEGG:dme:Dmel_CG11459
            UCSC:CG11459-RA FlyBase:FBgn0037396 InParanoid:Q9VNK6 OMA:NYDEREL
            OrthoDB:EOG4MGQPX ChiTaRS:CG11459 GenomeRNAi:40741 NextBio:820359
            Uniprot:Q9VNK6
        Length = 336

 Score = 423 (154.0 bits), Expect = 1.1e-39, P = 1.1e-39
 Identities = 110/320 (34%), Positives = 167/320 (52%)

Query:    50 TIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFADLTN 105
             T +  + AK+ K         +   +++  +  ++ HN L       +K+GLNKF+D T+
Sbjct:    28 TEWDQYKAKYNKQYRNRDKYHRA--LYEQRVLAVESHNQLYLQGKVAFKMGLNKFSD-TD 84

Query:   106 EEYRAMYLGTRSDAKRRLMKSKVA-SQRYACKAGDELPESVDWREKGAVNPVKDQGS-CG 163
             +     Y   RS     L  S  A ++    K  D++ E +DWR+ G ++PV DQG+ C 
Sbjct:    85 QRILFNY---RSSIPAPLETSTNALTETVNYKRYDQITEGIDWRQYGYISPVGDQGTECL 141

Query:   164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
             SCWAFST   +E       G L+ LS + LVDC    N GC+GG +  AF +  ++ G+ 
Sbjct:   142 SCWAFSTSGVLEAHMAKKYGNLVPLSPKHLVDCVPYPNNGCSGGWVSVAFNYT-RDHGIA 200

Query:   224 SEQDYPYLGAENKCD-PSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGR 281
             +++ YPY     +C   S R+A  +S  GY  +  +DE  L + V +  PV+V+I+    
Sbjct:   201 TKESYPYEPVSGECLWKSDRSAGTLS--GYVTLGNYDERELAEVVYNIGPVAVSIDHLHE 258

Query:   282 AFQHYESGVFT-GECGSA---LDHGVVAVGYGTENGV-DYWLVRNSWGSDWGENGYVKLQ 336
              F  Y  GV +   C S    L H V+ VG+GT     DYW+++NS+G+DWGE+GY+KL 
Sbjct:   259 EFDQYSGGVLSIPACRSKRQDLTHSVLLVGFGTHRKWGDYWIIKNSYGTDWGESGYLKLA 318

Query:   337 RNLLDTNTGKCGIAMEASYP 356
             RN        CG+A    YP
Sbjct:   319 RNA----NNMCGVASLPQYP 334


>UNIPROTKB|F1RU23 [details] [associations]
            symbol:CTSW "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 KO:K08569 EMBL:CU928325
            RefSeq:XP_003122571.1 UniGene:Ssc.28940 Ensembl:ENSSSCT00000014177
            GeneID:100525853 KEGG:ssc:100525853 OMA:CWAMAAV Uniprot:F1RU23
        Length = 367

 Score = 419 (152.6 bits), Expect = 2.9e-39, P = 2.9e-39
 Identities = 111/341 (32%), Positives = 171/341 (50%)

Query:    47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID--EHNSLNRTYKVGLNKFADLT 104
             EV T++Q    ++ ++ +    + +R  IF  NL      +   L  T + G+  F+DLT
Sbjct:    40 EVFTLFQI---QYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLG-TAEFGVTPFSDLT 95

Query:   105 NEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREK-GAVNPVKDQGSCG 163
              EE+  ++       K   M  KV S+    ++G+ +P+S DWR+K G ++ +K Q  C 
Sbjct:    96 EEEFGQLHGHHWGAGKAPSMGIKVGSE----ESGETVPQSCDWRKKPGVISAIKHQKDCN 151

Query:   164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
              CWA + V  VE    I   + + LS Q+++DCDR  N GCNGG +  AF  ++   G+ 
Sbjct:   152 CCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRCGN-GCNGGFVWDAFLTVLNTSGLA 210

Query:   224 SEQDYPYLGA--ENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGG 280
             SEQDYPY G    ++C  ++++ KV  I  +  +  F E S+ + +A + P++V I AG 
Sbjct:   211 SEQDYPYKGTVKTHRC-LAKQHRKVAWIQDFLMLQ-FCEQSIARYLATEGPITVTINAG- 267

Query:   281 RAFQHYESGVFTGE---CGSAL-DHGVVAVGYGTENGVD-----------YWLVRNSWGS 325
                Q Y+ GV       C   L +H V+ VG+G    V+           YW+++NSWG 
Sbjct:   268 -LLQQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGRRPRPGHSIPYWILKNSWGP 326

Query:   326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKP 366
             DWGE GY +L R        K  +      PVK  Q S  P
Sbjct:   327 DWGEEGYFRLHRGSNTCGITKYPVTARVDKPVKKHQISCPP 367


>RGD|1309354 [details] [associations]
            symbol:Ctsw "cathepsin W" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1309354 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 MEROPS:C01.037
            CTD:1521 eggNOG:NOG288820 HOVERGEN:HBG100117 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 EMBL:CH473953 EMBL:BC093401 IPI:IPI00371471
            RefSeq:NP_001019413.1 UniGene:Rn.34406 Ensembl:ENSRNOT00000037404
            GeneID:293676 KEGG:rno:293676 UCSC:RGD:1309354 InParanoid:Q561Q9
            NextBio:636716 Genevestigator:Q561Q9 Uniprot:Q561Q9
        Length = 371

 Score = 322 (118.4 bits), Expect = 3.2e-39, Sum P(2) = 3.2e-39
 Identities = 81/276 (29%), Positives = 142/276 (51%)

Query:    47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID--EHNSLNRTYKVGLNKFADLT 104
             E+  +++ +  +  ++ +      +R  IF  NL      +   L  T + G   F+DLT
Sbjct:    35 ELKEVFKLFQIQFNRSYSNPAEYTRRLGIFAHNLAQAQRLQEEDLG-TAEFGQTPFSDLT 93

Query:   105 NEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWRE-KGAVNPVKDQGSCG 163
              EE+  +Y   R+  +   M  KV S+R+    G+ +P + DWR+ K  ++ +K+QG+C 
Sbjct:    94 EEEFGQLYGHQRAPERILNMAKKVKSERW----GESVPPTCDWRKVKNIISSIKNQGNCR 149

Query:   164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
              CWA +    ++ + +I T + + +S QEL+DCDR  N GCNGG +  A+  ++ N G+ 
Sbjct:   150 CCWAIAAADNIQTLWRIKTQQFVDVSVQELLDCDRCGN-GCNGGFVWDAYITVLNNSGLA 208

Query:   224 SEQDYPYLGAE--NKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
             SE+DYP+ G +  ++C   +   KV  I  +  +S  +++         P++V I    +
Sbjct:   209 SEEDYPFQGHQKPHRCLADKYR-KVAWIQDFTMLSSNEQVIAGYLAIHGPITVTINM--K 265

Query:   282 AFQHYESGVFTGE---CGSAL-DHGVVAVGYGTENG 313
               Q+Y+ GV       C   L +H V+ VG+G E G
Sbjct:   266 LLQYYQKGVIKATPSTCDPHLVNHSVLLVGFGKEKG 301

 Score = 113 (44.8 bits), Expect = 3.2e-39, Sum P(2) = 3.2e-39
 Identities = 20/51 (39%), Positives = 27/51 (52%)

Query:   316 YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKP 366
             YW+++NSWG++WGE GY +L R        K  I      PVK +  S  P
Sbjct:   321 YWILKNSWGAEWGEKGYFRLYRGNNTCGIAKYPITARVDRPVKKAPVSCPP 371


>MGI|MGI:1338045 [details] [associations]
            symbol:Ctsw "cathepsin W" species:10090 "Mus musculus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 MGI:MGI:1338045 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 MEROPS:C01.037
            CTD:1521 eggNOG:NOG288820 HOVERGEN:HBG100117 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 EMBL:AF014941 EMBL:AC122861 IPI:IPI00111727
            RefSeq:NP_034115.2 UniGene:Mm.113590 ProteinModelPortal:P56203
            SMR:P56203 PhosphoSite:P56203 PRIDE:P56203 DNASU:13041
            Ensembl:ENSMUST00000025844 GeneID:13041 KEGG:mmu:13041
            InParanoid:P56203 NextBio:282936 Bgee:P56203 CleanEx:MM_CTSW
            Genevestigator:P56203 GermOnline:ENSMUSG00000024910 Uniprot:P56203
        Length = 371

 Score = 318 (117.0 bits), Expect = 1.1e-38, Sum P(2) = 1.1e-38
 Identities = 81/274 (29%), Positives = 137/274 (50%)

Query:    47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID--EHNSLNRTYKVGLNKFADLT 104
             E+  +++ +  +  ++        +R  IF  NL      +   L  T + G   F+DLT
Sbjct:    35 ELKEVFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLG-TAEFGETPFSDLT 93

Query:   105 NEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWRE-KGAVNPVKDQGSCG 163
              EE+  +Y   RS  +   M  KV S  +    G+ +P + DWR+ K  ++ VK+QGSC 
Sbjct:    94 EEEFGQLYGQERSPERTPNMTKKVESNTW----GESVPRTCDWRKAKNIISSVKNQGSCK 149

Query:   164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
              CWA +    ++ + +I   + + +S QEL+DC+R  N GCNGG +  A+  ++ N G+ 
Sbjct:   150 CCWAMAAADNIQALWRIKHQQFVDVSVQELLDCERCGN-GCNGGFVWDAYLTVLNNSGLA 208

Query:   224 SEQDYPYLGAE--NKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
             SE+DYP+ G    ++C  +++  KV  I  +  +S  ++          P++V I    +
Sbjct:   209 SEKDYPFQGDRKPHRC-LAKKYKKVAWIQDFTMLSNNEQAIAHYLAVHGPITVTINM--K 265

Query:   282 AFQHYESGVFTGECGSA----LDHGVVAVGYGTE 311
               QHY+ GV      S     +DH V+ VG+G E
Sbjct:   266 LLQHYQKGVIKATPSSCDPRQVDHSVLLVGFGKE 299

 Score = 112 (44.5 bits), Expect = 1.1e-38, Sum P(2) = 1.1e-38
 Identities = 19/51 (37%), Positives = 27/51 (52%)

Query:   316 YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKP 366
             YW+++NSWG+ WGE GY +L R        K     +   PVK ++ S  P
Sbjct:   321 YWILKNSWGAHWGEKGYFRLYRGNNTCGVTKYPFTAQVDSPVKKARTSCPP 371


>UNIPROTKB|F1MHV4 [details] [associations]
            symbol:CTSW "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 OMA:GRCGDGC EMBL:DAAA02063574
            IPI:IPI00716321 Ensembl:ENSBTAT00000027681 Uniprot:F1MHV4
        Length = 375

 Score = 311 (114.5 bits), Expect = 2.4e-37, Sum P(2) = 2.4e-37
 Identities = 77/272 (28%), Positives = 139/272 (51%)

Query:    47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID--EHNSLNRTYKVGLNKFADLT 104
             E+  +++ +  ++ ++        +R  IF  NL      +   L  T + G+ +F+DLT
Sbjct:    37 ELKEVFRLFQMQYNRSYPNPAEYARRLDIFAQNLAKAQRLQEEDLG-TAEFGVTQFSDLT 95

Query:   105 NEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGS 164
              EE+  +Y G++   +   +  KV S+ +    G+  P++ DWR+ G ++PV+DQ +C  
Sbjct:    96 EEEFVQLY-GSQVAGEALGVSRKVGSEEW----GESEPQTCDWRKVGTISPVRDQRNCNC 150

Query:   165 CWAFSTVAAVEGINKIVTGELISLSEQ-ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
             CWA +    +E +  I     + +S Q EL+DCDR  N GC GG +  AF  ++ N G+ 
Sbjct:   151 CWAMAAAGNIEALWAIKFRHFVEVSVQPELLDCDRCGN-GCRGGFVWDAFLTVLNNSGLA 209

Query:   224 SEQDYPYLGA--ENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
             SE+DYP+ G+   ++C  +++  KV  I  +  +   ++   +    + P++V I     
Sbjct:   210 SEKDYPFNGSGKTHRC-LAKKYKKVAWIQDFIILQACEQSMARHLATEGPITVTINM--T 266

Query:   282 AFQHYESGVFTGE---CG-SALDHGVVAVGYG 309
               Q Y+ GV       C  + +DH V+ VG+G
Sbjct:   267 LLQQYQKGVIKATPTTCDPTQVDHSVLLVGFG 298

 Score = 106 (42.4 bits), Expect = 2.4e-37, Sum P(2) = 2.4e-37
 Identities = 19/51 (37%), Positives = 24/51 (47%)

Query:   316 YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKP 366
             YW+++NSWG  WGE GY +L R        K  +      P K  Q S  P
Sbjct:   325 YWILKNSWGPQWGEEGYFRLHRGSNTCGITKFPVTARVDKPKKQHQVSCPP 375


>UNIPROTKB|Q5T8F0 [details] [associations]
            symbol:CTSL1 "Cathepsin L1 light chain" species:9606 "Homo
            sapiens" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            EMBL:AL160279 UniGene:Hs.731507 UniGene:Hs.731952 HGNC:HGNC:2537
            ChiTaRS:CTSL1 IPI:IPI00640540 SMR:Q5T8F0 Ensembl:ENST00000342020
            ChEMBL:CHEMBL1293261 Uniprot:Q5T8F0
        Length = 225

 Score = 398 (145.2 bits), Expect = 4.9e-37, P = 4.9e-37
 Identities = 87/197 (44%), Positives = 118/197 (59%)

Query:    39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYK 94
             S++   D  +   +  W A H +   GM     R  +++ N++ I+ HN   R    ++ 
Sbjct:    16 SATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFT 74

Query:    95 VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
             + +N F D+T+EE+R +  G ++   R+  K KV  +    +A    P SVDWREKG V 
Sbjct:    75 MAMNAFGDMTSEEFRQVMNGFQN---RKPRKGKVFQEPLFYEA----PRSVDWREKGYVT 127

Query:   155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAF 213
             PVK+QG CGSCWAFS   A+EG     TG LISLSEQ LVDC   + N GCNGGLMDYAF
Sbjct:   128 PVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAF 187

Query:   214 QFIIQNGGMDSEQDYPY 230
             Q++  NGG+DSE+ YPY
Sbjct:   188 QYVQDNGGLDSEESYPY 204


>ZFIN|ZDB-GENE-080724-8 [details] [associations]
            symbol:ctso "cathepsin O" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            ZFIN:ZDB-GENE-080724-8 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 CTD:1519 KO:K01374 EMBL:CR931784
            IPI:IPI00513613 RefSeq:XP_695717.3 UniGene:Dr.88386
            Ensembl:ENSDART00000074786 GeneID:567333 KEGG:dre:567333
            NextBio:20888622 Uniprot:E7FA09
        Length = 334

 Score = 398 (145.2 bits), Expect = 4.9e-37, P = 4.9e-37
 Identities = 90/265 (33%), Positives = 144/265 (54%)

Query:    90 NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWRE 149
             N++ + G+N+F+ L+ ++++  YL  R++A  +  +SK        KA +  P   DWR+
Sbjct:    75 NQSAQYGVNQFSYLSQKQFKEQYLTARAEAAPKFDQSK---SEIKVKANN--PPRFDWRD 129

Query:   150 KGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLM 209
              G V PV +QGSCG CWAFS V A+E ++     +L  LS Q+++DC  + N GCNGG  
Sbjct:   130 HGVVGPVHNQGSCGGCWAFSIVEAIESVSAKGGEKLQQLSVQQVIDCSYQ-NQGCNGGSP 188

Query:   210 DYAFQFIIQNG-GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYE--DVSPFDEMSLKKA 266
               A  ++ Q+   + SE +YP+ GA+  C    +    V++  Y   D S  +E+ +   
Sbjct:   189 VEALYWLTQSKLKLVSEAEYPFKGADGVCQFFPQAHAGVAVRNYSAYDFSGQEEVMMSAL 248

Query:   267 VADQPVSVAIEAGGRAFQHYESGVFTGECGS-ALDHGVVAVGYGTENGVDYWLVRNSWGS 325
             V   P+ V ++A   ++Q Y  G+    C S   +H V+  GY T   V YW+VRNSWG+
Sbjct:   249 VDFGPLVVIVDA--ISWQDYLGGIIQHHCSSHKANHAVLITGYDTTGEVPYWIVRNSWGT 306

Query:   326 DWGENGYVKLQRNLLDTNTGKCGIA 350
              WG++GY  ++          CG+A
Sbjct:   307 SWGDDGYAYIK-----IGNDVCGVA 326


>RGD|1564827 [details] [associations]
            symbol:RGD1564827 "similar to cathepsin M" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 IPI:IPI00192321
            Ensembl:ENSRNOT00000023990 ArrayExpress:D3ZY04 Uniprot:D3ZY04
        Length = 338

 Score = 397 (144.8 bits), Expect = 6.3e-37, P = 6.3e-37
 Identities = 88/213 (41%), Positives = 123/213 (57%)

Query:   153 VNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDY 211
             V+    QG C SCWAF  V A+EG     TG+L  LS Q LVDC + + N GC GG    
Sbjct:   133 VHTASTQGRCNSCWAFPVVGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYN 192

Query:   212 AFQFIIQNGGMDSEQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD 269
             AFQ+++QNGG++SE  YPY G E  C  +P+  +AK+  I      +   E  L  AVA 
Sbjct:   193 AFQYVLQNGGLESEATYPYEGKEGLCRYNPNS-SAKITXICAPPQKN---EDVLMDAVAT 248

Query:   270 QPVSVAIEAGGRAFQHYESGVF-TGECGSALDHGVVAVGYGTE----NGVDYWLVRNSWG 324
             +PV+  I     + + Y+ G++   +C + ++H V+ VGYG E    +G +YWL++NSWG
Sbjct:   249 KPVAAGIHVVHSSLRFYKKGIYHEPKCNNYVNHAVLVVGYGFEGNETDGNNYWLIQNSWG 308

Query:   325 SDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
               WG NGY+K+ +   D N   CGIA  A YP+
Sbjct:   309 ERWGLNGYMKIAK---DRNN-HCGIATFAQYPI 337


>UNIPROTKB|E2RPX3 [details] [associations]
            symbol:CTSW "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 CTD:1521 KO:K08569 OMA:GRCGDGC
            EMBL:AAEX03011632 RefSeq:XP_540846.2 Ensembl:ENSCAFT00000020910
            GeneID:483725 KEGG:cfa:483725 Uniprot:E2RPX3
        Length = 374

 Score = 296 (109.3 bits), Expect = 2.7e-36, Sum P(2) = 2.7e-36
 Identities = 80/278 (28%), Positives = 137/278 (49%)

Query:    47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFID--EHNSLNRTYKVGLNKFADLT 104
             E+  ++  +  ++ ++ +      +R  IF  NL      E   L  T + G+  F+DLT
Sbjct:    37 ELKQVFALFQIQYNRSYSNPEEYARRLDIFAHNLAQAQQLEDEDLG-TAEFGVTPFSDLT 95

Query:   105 NEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWRE-KGAVNPVKDQGSCG 163
              EE+   Y   R   +   +  KV S+ +    G+ +P + DWR+  G ++P+K QG+C 
Sbjct:    96 EEEFGQFYGHQRMAGEAPSVGRKVESEEW----GEPVPPTCDWRKLPGIISPIKQQGNCR 151

Query:   164 SCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMD 223
              CWA +    +E +  I   + + +S QEL+DC R    GC GG    AF  ++ N G+ 
Sbjct:   152 CCWAMAAAGNIEALWGIRYHQPVEVSVQELLDCGR-CGDGCKGGFTWDAFITVLNNSGLA 210

Query:   224 SEQDYPYLG--AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGG 280
             S +DYP+LG    ++C  +++  KV  I  +  +   +E ++   +A + P++V I    
Sbjct:   211 SAKDYPFLGNTKPHRC-LAKKYKKVAWIQDFIMLQG-NEQAIAWYLATKGPITVTINM-- 266

Query:   281 RAFQHYESGVFTGE---CG-SALDHGVVAVGYGTENGV 314
             +  QHY+ GV       C    +DH V+ VG+G    V
Sbjct:   267 KLLQHYQKGVIQATHTTCDPQRVDHSVLLVGFGKSKSV 304

 Score = 111 (44.1 bits), Expect = 2.7e-36, Sum P(2) = 2.7e-36
 Identities = 21/44 (47%), Positives = 28/44 (63%)

Query:   314 VDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
             + YW+++NSWG++WGE GY +L R     NT  CGI     YPV
Sbjct:   322 IPYWILKNSWGAEWGEEGYFRLHRG---NNT--CGIT---KYPV 357


>UNIPROTKB|H0YD65 [details] [associations]
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 EMBL:AP002748
            HGNC:HGNC:2531 ChiTaRS:CTSF Ensembl:ENST00000524994 Uniprot:H0YD65
        Length = 283

 Score = 382 (139.5 bits), Expect = 2.4e-35, P = 2.4e-35
 Identities = 101/262 (38%), Positives = 139/262 (53%)

Query:    47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTN 105
             ++ +I++ ++  + +T         R  +F +N+    +  +L+R T + G+ KF+DLT 
Sbjct:    31 KMASIFKNFVITYNRTYESK-EARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTE 89

Query:   106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSC 165
             EE+R +YL T       L K      + A   GD  P   DWR KGAV  VKDQG CGSC
Sbjct:    90 EEFRTIYLNTL------LRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSC 143

Query:   166 WAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSE 225
             WAFS    VEG   +  G L+SLSEQEL+DCD K++  C GGL   A+  I   GG+++E
Sbjct:   144 WAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD-KMDKACMGGLPSNAYSAIKNLGGLETE 202

Query:   226 QDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAF- 283
              DY Y G    C+ S   AKV   D  E +S  +E  L   +A + P+SVAI A G  F 
Sbjct:   203 DDYSYQGHMQSCNFSAEKAKVYINDSVE-LSQ-NEQKLAAWLAKRGPISVAINAFGMQFY 260

Query:   284 QHYESGVFTGECGSAL-DHGVV 304
             +H  S      C   L DH V+
Sbjct:   261 RHGISRPLRPLCSPWLIDHAVL 282


>WB|WBGene00012747 [details] [associations]
            symbol:Y40H7A.10 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000230773 EMBL:AL033510
            HSSP:P80067 MEROPS:C01.A48 PIR:T26792 RefSeq:NP_502836.1
            ProteinModelPortal:Q9XWA4 SMR:Q9XWA4 STRING:Q9XWA4
            EnsemblMetazoa:Y40H7A.10 GeneID:189809 KEGG:cel:CELE_Y40H7A.10
            UCSC:Y40H7A.10 CTD:189809 WormBase:Y40H7A.10 eggNOG:NOG286423
            InParanoid:Q9XWA4 OMA:NGPMIVC NextBio:943702 Uniprot:Q9XWA4
        Length = 343

 Score = 379 (138.5 bits), Expect = 5.1e-35, P = 5.1e-35
 Identities = 103/316 (32%), Positives = 155/316 (49%)

Query:    45 DDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR---TYKVGLNKFA 101
             D +    +Q +L K+ +         KRF IF  NL  ++ +N  +    TY+  LN F+
Sbjct:    44 DVKYTNAFQNFLVKYLREYPNEYEIVKRFTIFSRNLDLVERYNKEDAGKVTYE--LNDFS 101

Query:   102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV---KD 158
             DLT EE++   +  + D   + +K K    +        LP SVDWR     N V   K 
Sbjct:   102 DLTEEEWKKYLMTPKPDHSEKSLKPKTLIDK------KNLPNSVDWRNVNGTNHVTGIKY 155

Query:   159 QGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
             QG CGSCWAF+T AA+E    I  G L SLS Q+L+DC   ++  C GG    A ++  Q
Sbjct:   156 QGPCGSCWAFATAAAIESAVSISGGGLQSLSSQQLLDCT-VVSDKCGGGEPVEALKYA-Q 213

Query:   219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVS-IDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
             + G+ +  +YPY     KC   R     V+ I  +      DEM+   A+ + P+ V   
Sbjct:   214 SHGITTAHNYPYYFWTTKC---RETVPTVARISSWMKAESEDEMAQIVAL-NGPMIVCAN 269

Query:   278 AGGRAFQHYESGVFTG-ECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQ 336
                   + Y SG+    +CG+   H ++ +GYG     DYW+++N++   WGE GY++++
Sbjct:   270 FATNKNRFYHSGIAEDPDCGTEPTHALIVIGYGP----DYWILKNTYSKVWGEKGYMRVK 325

Query:   337 RNLLDTNTGKCGIAME 352
             R   D N   CGI  E
Sbjct:   326 R---DVNW--CGINTE 336


>GENEDB_PFALCIPARUM|PF14_0553 [details] [associations]
            symbol:PF14_0553 "cysteine proteinase
            falcipain-1" species:5833 "Plasmodium falciparum" [GO:0042540
            "hemoglobin catabolic process" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014187 KO:K01376 HSSP:P07688 GO:GO:0042540
            RefSeq:XP_001348727.1 ProteinModelPortal:Q8I6V0 PRIDE:Q8I6V0
            EnsemblProtists:PF14_0553:mRNA GeneID:812135 KEGG:pfa:PF14_0553
            EuPathDB:PlasmoDB:PF3D7_1458000 HOGENOM:HOG000065906
            ProtClustDB:CLSZ2457715 ChEMBL:CHEMBL1250371 Uniprot:Q8I6V0
        Length = 569

 Score = 274 (101.5 bits), Expect = 3.1e-34, Sum P(2) = 3.1e-34
 Identities = 81/279 (29%), Positives = 138/279 (49%)

Query:    37 DHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNE-KRFQIFKDNL-RFID-EHNSLNRTY 93
             +H+  ++  DE M  ++ +   +    N   HN+  +  ++K  + +F D     L   +
Sbjct:   231 EHNKVYKNIDEQMRKFEIFKINYISIKN---HNKLNKNAMYKKKVNQFSDYSEEELKEYF 287

Query:    94 KVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS--QRYACKAGDELPESVDWREKG 151
             K  L+    +  E+Y   +    +  K  ++ S+  +  +R       ++PE +D+REKG
Sbjct:   288 KTLLHVPNHMI-EKYSKPF---ENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKG 343

Query:   152 AVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDY 211
              V+  KDQG CGSCWAF++V  +E +       ++S SEQE+VDC  K N GC+GG   Y
Sbjct:   344 IVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCS-KDNFGCDGGHPFY 402

Query:   212 AFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ- 270
             +F +++QN  +    +Y Y   ++    + R  + VS+     +    E  L  A+ +  
Sbjct:   403 SFLYVLQNE-LCLGDEYKYKAKDDMFCLNYRCKRKVSLSS---IGAVKENQLILALNEVG 458

Query:   271 PVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
             P+SV +      F  Y  GV+ G C   L+H V+ VGYG
Sbjct:   459 PLSVNVGVNND-FVAYSEGVYNGTCSEELNHSVLLVGYG 496

 Score = 128 (50.1 bits), Expect = 3.1e-34, Sum P(2) = 3.1e-34
 Identities = 21/47 (44%), Positives = 30/47 (63%)

Query:   311 ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
             +N + YW+++NSW   WGENG+++L RN    N   CGI  E  YP+
Sbjct:   523 DNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVF-CGIGEEVFYPI 568

 Score = 95 (38.5 bits), Expect = 1.6e-10, Sum P(2) = 1.6e-10
 Identities = 19/60 (31%), Positives = 33/60 (55%)

Query:    55 WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT--YKVGLNKFADLTNEEYRAMY 112
             ++ +H K    +    ++F+IFK N   I  HN LN+   YK  +N+F+D + EE +  +
Sbjct:   228 FMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYF 287

 Score = 41 (19.5 bits), Expect = 5.8e-05, Sum P(2) = 5.8e-05
 Identities = 15/65 (23%), Positives = 31/65 (47%)

Query:    81 RFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK--RRLMKSKVASQRYACKAG 138
             +F+ EHN + +     + KF     E ++  Y+  ++  K  +  M  K  +Q ++  + 
Sbjct:   227 KFMKEHNKVYKNIDEQMRKF-----EIFKINYISIKNHNKLNKNAMYKKKVNQ-FSDYSE 280

Query:   139 DELPE 143
             +EL E
Sbjct:   281 EELKE 285

 Score = 37 (18.1 bits), Expect = 0.00015, Sum P(2) = 0.00015
 Identities = 15/51 (29%), Positives = 23/51 (45%)

Query:    62 TSNGMGHNEKR--FQIFKDNLRFI-DEHNSLNRTYKVGLNKFADLTNEEYR 109
             T N   +N K     I  D+++   +E+ +L R       KF +  NEE R
Sbjct:   110 TLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENR 160


>UNIPROTKB|Q8I6V0 [details] [associations]
            symbol:PF14_0553 "Cysteine proteinase falcipain-1"
            species:36329 "Plasmodium falciparum 3D7" [GO:0042540 "hemoglobin
            catabolic process" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014187 KO:K01376 HSSP:P07688 GO:GO:0042540
            RefSeq:XP_001348727.1 ProteinModelPortal:Q8I6V0 PRIDE:Q8I6V0
            EnsemblProtists:PF14_0553:mRNA GeneID:812135 KEGG:pfa:PF14_0553
            EuPathDB:PlasmoDB:PF3D7_1458000 HOGENOM:HOG000065906
            ProtClustDB:CLSZ2457715 ChEMBL:CHEMBL1250371 Uniprot:Q8I6V0
        Length = 569

 Score = 274 (101.5 bits), Expect = 3.1e-34, Sum P(2) = 3.1e-34
 Identities = 81/279 (29%), Positives = 138/279 (49%)

Query:    37 DHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNE-KRFQIFKDNL-RFID-EHNSLNRTY 93
             +H+  ++  DE M  ++ +   +    N   HN+  +  ++K  + +F D     L   +
Sbjct:   231 EHNKVYKNIDEQMRKFEIFKINYISIKN---HNKLNKNAMYKKKVNQFSDYSEEELKEYF 287

Query:    94 KVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS--QRYACKAGDELPESVDWREKG 151
             K  L+    +  E+Y   +    +  K  ++ S+  +  +R       ++PE +D+REKG
Sbjct:   288 KTLLHVPNHMI-EKYSKPF---ENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKG 343

Query:   152 AVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDY 211
              V+  KDQG CGSCWAF++V  +E +       ++S SEQE+VDC  K N GC+GG   Y
Sbjct:   344 IVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCS-KDNFGCDGGHPFY 402

Query:   212 AFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ- 270
             +F +++QN  +    +Y Y   ++    + R  + VS+     +    E  L  A+ +  
Sbjct:   403 SFLYVLQNE-LCLGDEYKYKAKDDMFCLNYRCKRKVSLSS---IGAVKENQLILALNEVG 458

Query:   271 PVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
             P+SV +      F  Y  GV+ G C   L+H V+ VGYG
Sbjct:   459 PLSVNVGVNND-FVAYSEGVYNGTCSEELNHSVLLVGYG 496

 Score = 128 (50.1 bits), Expect = 3.1e-34, Sum P(2) = 3.1e-34
 Identities = 21/47 (44%), Positives = 30/47 (63%)

Query:   311 ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
             +N + YW+++NSW   WGENG+++L RN    N   CGI  E  YP+
Sbjct:   523 DNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVF-CGIGEEVFYPI 568

 Score = 95 (38.5 bits), Expect = 1.6e-10, Sum P(2) = 1.6e-10
 Identities = 19/60 (31%), Positives = 33/60 (55%)

Query:    55 WLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT--YKVGLNKFADLTNEEYRAMY 112
             ++ +H K    +    ++F+IFK N   I  HN LN+   YK  +N+F+D + EE +  +
Sbjct:   228 FMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYF 287

 Score = 41 (19.5 bits), Expect = 5.8e-05, Sum P(2) = 5.8e-05
 Identities = 15/65 (23%), Positives = 31/65 (47%)

Query:    81 RFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK--RRLMKSKVASQRYACKAG 138
             +F+ EHN + +     + KF     E ++  Y+  ++  K  +  M  K  +Q ++  + 
Sbjct:   227 KFMKEHNKVYKNIDEQMRKF-----EIFKINYISIKNHNKLNKNAMYKKKVNQ-FSDYSE 280

Query:   139 DELPE 143
             +EL E
Sbjct:   281 EELKE 285

 Score = 37 (18.1 bits), Expect = 0.00015, Sum P(2) = 0.00015
 Identities = 15/51 (29%), Positives = 23/51 (45%)

Query:    62 TSNGMGHNEKR--FQIFKDNLRFI-DEHNSLNRTYKVGLNKFADLTNEEYR 109
             T N   +N K     I  D+++   +E+ +L R       KF +  NEE R
Sbjct:   110 TLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENR 160


>UNIPROTKB|P56202 [details] [associations]
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 GO:GO:0006955 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AF013611
            EMBL:AF015954 EMBL:AF055903 EMBL:AP001201 EMBL:BC048255
            IPI:IPI00328978 RefSeq:NP_001326.2 UniGene:Hs.416848
            ProteinModelPortal:P56202 SMR:P56202 STRING:P56202 MEROPS:C01.037
            PhosphoSite:P56202 DMDM:259016196 PaxDb:P56202 PRIDE:P56202
            Ensembl:ENST00000307886 GeneID:1521 KEGG:hsa:1521 UCSC:uc001ogc.1
            CTD:1521 GeneCards:GC11P065647 HGNC:HGNC:2546 HPA:CAB016345
            MIM:602364 neXtProt:NX_P56202 PharmGKB:PA27042 eggNOG:NOG288820
            HOVERGEN:HBG100117 InParanoid:P56202 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 PhylomeDB:P56202 GenomeRNAi:1521 NextBio:6295
            ArrayExpress:P56202 Bgee:P56202 CleanEx:HS_CTSW
            Genevestigator:P56202 GermOnline:ENSG00000172543 Uniprot:P56202
        Length = 376

 Score = 279 (103.3 bits), Expect = 7.1e-34, Sum P(2) = 7.1e-34
 Identities = 77/255 (30%), Positives = 125/255 (49%)

Query:    72 RFQIFKDNLRFID--EHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
             R  IF  NL      +   L  T + G+  F+DLT EE+  +Y   R+      M  ++ 
Sbjct:    62 RLDIFAHNLAQAQRLQEEDLG-TAEFGVTPFSDLTEEEFGQLYGYRRAAGGVPSMGREIR 120

Query:   130 SQRYACKAGDELPESVDWRE-KGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
             S+    +  + +P S DWR+   A++P+KDQ +C  CWA +    +E + +I   + + +
Sbjct:   121 SE----EPEESVPFSCDWRKVASAISPIKDQKNCNCCWAMAAAGNIETLWRISFWDFVDV 176

Query:   189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA--ENKCDPSRRNAKV 246
             S QEL+DC R    GC+GG +  AF  ++ N G+ SE+DYP+ G    ++C P ++  KV
Sbjct:   177 SVQELLDCGR-CGDGCHGGFVWDAFITVLNNSGLASEKDYPFQGKVRAHRCHP-KKYQKV 234

Query:   247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE---CGSAL-DHG 302
               I  +  +   +    +      P++V I    +  Q Y  GV       C   L DH 
Sbjct:   235 AWIQDFIMLQNNEHRIAQYLATYGPITVTINM--KPLQLYRKGVIKATPTTCDPQLVDHS 292

Query:   303 VVAVGYGT---ENGV 314
             V+ VG+G+   E G+
Sbjct:   293 VLLVGFGSVKSEEGI 307

 Score = 105 (42.0 bits), Expect = 7.1e-34, Sum P(2) = 7.1e-34
 Identities = 22/56 (39%), Positives = 30/56 (53%)

Query:   316 YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHSS 371
             YW+++NSWG+ WGE GY +L R    +NT  CGI     +P+         KP  S
Sbjct:   326 YWILKNSWGAQWGEKGYFRLHRG---SNT--CGIT---KFPLTARVQKPDMKPRVS 373


>WB|WBGene00019314 [details] [associations]
            symbol:K02E7.10 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 SMART:SM00645 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 HSSP:P07711
            EMBL:FO080411 PIR:T32392 RefSeq:NP_493904.1 UniGene:Cel.14828
            ProteinModelPortal:O17255 SMR:O17255 EnsemblMetazoa:K02E7.10
            GeneID:186889 KEGG:cel:CELE_K02E7.10 UCSC:K02E7.10 CTD:186889
            WormBase:K02E7.10 eggNOG:NOG331187 HOGENOM:HOG000114005
            InParanoid:O17255 OMA:GNANEAR NextBio:933344 Uniprot:O17255
        Length = 299

 Score = 366 (133.9 bits), Expect = 1.2e-33, P = 1.2e-33
 Identities = 82/235 (34%), Positives = 127/235 (54%)

Query:   132 RYACKAGDELPES-VDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN-KIVTGELISLS 189
             +Y  K    + +  +DWREKG V PVKDQG C + +AF+ +AA+E +  K   G+L+S S
Sbjct:    70 QYQTKLSHHMTQDFLDWREKGIVGPVKDQGKCNASYAFAAIAAIESMYAKANNGKLLSFS 129

Query:   190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN--KCDPSRRNAKVV 247
             EQ+++DC    N  C   L +      ++  G+ +E DYPY+G EN  KC+      K+ 
Sbjct:   130 EQQIIDCANFTNP-CQENLENVLSNRFLKENGVGTEADYPYVGKENVGKCEYDSSKMKLR 188

Query:   248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTG---ECGSALD-HGV 303
                 Y DV P +E + +  +              +F HY++G++     ECG+A +   +
Sbjct:   189 PT--YIDVYPNEEWA-RAHITTFGTGYFRMRSPPSFFHYKTGIYNPTKEECGNANEARSL 245

Query:   304 VAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
               VGYG +    YW+V+ S+G+ WGE+GY+KL RN+       CG+A   S P+K
Sbjct:   246 AIVGYGKDGAEKYWIVKGSFGTSWGEHGYMKLARNV-----NACGMAESISIPIK 295


>MGI|MGI:2139628 [details] [associations]
            symbol:Ctso "cathepsin O" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:2139628 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 GeneTree:ENSGT00560000076599 MEROPS:C01.035 CTD:1519
            HOVERGEN:HBG105050 KO:K01374 OMA:SNVCGIA OrthoDB:EOG4V6ZH1
            EMBL:AK034490 EMBL:AK049470 EMBL:AK165930 EMBL:AK166103
            EMBL:BC044664 IPI:IPI00453524 RefSeq:NP_808330.1 UniGene:Mm.254642
            ProteinModelPortal:Q8BM88 SMR:Q8BM88 STRING:Q8BM88
            PhosphoSite:Q8BM88 PRIDE:Q8BM88 Ensembl:ENSMUST00000029649
            GeneID:229445 KEGG:mmu:229445 UCSC:uc008pon.1 InParanoid:Q8BM88
            NextBio:379433 Bgee:Q8BM88 CleanEx:MM_CTSO Genevestigator:Q8BM88
            GermOnline:ENSMUSG00000028015 Uniprot:Q8BM88
        Length = 312

 Score = 358 (131.1 bits), Expect = 8.5e-33, P = 8.5e-33
 Identities = 93/303 (30%), Positives = 147/303 (48%)

Query:    56 LAKHGKTSNGMGHNEKRFQIFKDNL---RFIDEHNSLNRTYKVGLNKFADLTNEEYRAMY 112
             L +HG        +++     +++L   R+++     N T   G+N+F+ L  EE++A+Y
Sbjct:    16 LGRHGVAGTWSWSHQREAAALRESLHRHRYLNSFPHENSTAFYGVNQFSYLFPEEFKALY 75

Query:   113 LGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVA 172
             LG++     R        QR        LP   DWR+K  VNPV++Q  CG CWAFS V+
Sbjct:    76 LGSKYAWAPRY---PAEGQRPIPNVS--LPLRFDWRDKHVVNPVRNQEMCGGCWAFSVVS 130

Query:   173 AVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG-GMDSEQDYPYL 231
             A+E    I    L  LS Q+++DC    N+GC GG    A +++ +    + ++  YP+ 
Sbjct:   131 AIESARAIQGKSLDYLSVQQVIDCSFN-NSGCLGGSPLCALRWLNETQLKLVADSQYPFK 189

Query:   232 GAENKCD--P-SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYES 288
                 +C   P S+    V     Y      DEM+ +  ++  P+ V ++A   ++Q Y  
Sbjct:   190 AVNGQCRHFPQSQAGVSVKDFSAYNFRGQEDEMA-RALLSFGPLVVIVDA--MSWQDYLG 246

Query:   289 GVFTGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
             G+    C S   +H V+  G+       YW+VRNSWGS WG  GY  ++   +  N   C
Sbjct:   247 GIIQHHCSSGEANHAVLITGFDRTGNTPYWMVRNSWGSSWGVEGYAHVK---MGGNV--C 301

Query:   348 GIA 350
             GIA
Sbjct:   302 GIA 304


>UNIPROTKB|E1BPI9 [details] [associations]
            symbol:CTSO "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 CTD:1519 KO:K01374 OMA:SNVCGIA
            EMBL:DAAA02044933 IPI:IPI01004081 RefSeq:XP_002694471.2
            RefSeq:XP_874012.4 Ensembl:ENSBTAT00000014691 GeneID:616804
            KEGG:bta:616804 Uniprot:E1BPI9
        Length = 313

 Score = 355 (130.0 bits), Expect = 1.8e-32, P = 1.8e-32
 Identities = 94/287 (32%), Positives = 144/287 (50%)

Query:    76 FKDNLRFIDEHNSL----NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
             F+++L      NSL    N T   G+N+F+ L  EE++A+YL  RS   R     +  ++
Sbjct:    36 FRESLNRQRYLNSLFPYENSTAVYGINQFSYLFPEEFKAIYL--RSSPSRF---PRFPAE 90

Query:   132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
              Y   +   LP   DWR+K  V  V++Q +CG CWAFS V AVE +  I    L  LS Q
Sbjct:    91 EYTSISNLSLPLRFDWRDKHVVTQVRNQKTCGGCWAFSVVGAVESVCAIKGQPLEVLSVQ 150

Query:   192 ELVDCDRKINAGCNGGLMDYAFQFIIQ-NGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
             +++DC    N GCNGG    A  ++ +    +  + +YP+      C     +    SI 
Sbjct:   151 QVIDCSYS-NYGCNGGSPLSALYWLNKLQVKLVRDSEYPFQAQNGLCRYFSDSHSGSSIK 209

Query:   251 GYE--DVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA-LDHGVVAVG 307
             GY   D S  ++   +  +A  P+ V ++A   ++Q Y  G+    C S   +H V+  G
Sbjct:   210 GYSAYDFSGQEDKMAEALLALGPLIVVVDA--MSWQDYLGGIIQHHCSSGEANHAVLVTG 267

Query:   308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEAS 354
             +     + YW+VRNSWG+ WG +GYV+++   +  N   CGIA   S
Sbjct:   268 FDKTGSIPYWIVRNSWGTSWGIDGYVRVK---MGGNV--CGIADSVS 309


>WB|WBGene00013764 [details] [associations]
            symbol:Y113G7B.15 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 GeneTree:ENSGT00560000076599
            EMBL:AL110477 HOGENOM:HOG000019851 RefSeq:NP_507904.2
            ProteinModelPortal:Q9U2X1 SMR:Q9U2X1 DIP:DIP-25339N IntAct:Q9U2X1
            MINT:MINT-1058673 STRING:Q9U2X1 MEROPS:C01.A47
            EnsemblMetazoa:Y113G7B.15 GeneID:190976 KEGG:cel:CELE_Y113G7B.15
            UCSC:Y113G7B.15 CTD:190976 WormBase:Y113G7B.15 eggNOG:NOG302449
            OMA:AEEDIME Uniprot:Q9U2X1
        Length = 362

 Score = 311 (114.5 bits), Expect = 2.1e-32, Sum P(2) = 2.1e-32
 Identities = 90/276 (32%), Positives = 130/276 (47%)

Query:    91 RTYKVGLNKFADLTNEEYRA----MYLGTRSDAK----RRLMKSKVASQRYACKAGDELP 142
             R    G NKFAD   +E  A    ++    +D      R    S+    + + +   ++P
Sbjct:    73 RNVTFGWNKFADKNRQELSARNSKIHPKNHTDLPIYKPRHPRGSRNHHNKRSKRQSGDIP 132

Query:   143 ESVDWRE---KGA--VNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDC- 196
             +  D R+    G+  V PVKDQ  CG CWAF+T A  E  N + +    SLS+QE+ DC 
Sbjct:   133 DYFDLRDIYVDGSPVVGPVKDQEQCGCCWAFATTAITEAANTLYSKSFTSLSDQEICDCA 192

Query:   197 DRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY----LGAENKCDPSRRNAKVV--SID 250
             D     GC GG      + ++   G  S+ DYPY          C    ++  +   +++
Sbjct:   193 DSGDTPGCVGGDPRNGLK-MVHLRGQSSDGDYPYEEYRANTTGNCVGDEKSTVIQPETLN 251

Query:   251 GYE-DVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFTGE-CGSALD---HGVV 304
              Y  D    +E  ++    +  P +V    G   F+ Y SGV   E C        H V 
Sbjct:   252 VYRFDQDYAEEDIMENLYLNHIPTAVYFRVGEN-FEWYTSGVLQSEDCYQMTPAEWHSVA 310

Query:   305 AVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
              VGYGT ++GV YWLVRNSW SDWG +GYVK++R +
Sbjct:   311 IVGYGTSDDGVPYWLVRNSWNSDWGLHGYVKIRRGV 346

 Score = 59 (25.8 bits), Expect = 2.1e-32, Sum P(2) = 2.1e-32
 Identities = 19/68 (27%), Positives = 28/68 (41%)

Query:    47 EVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFAD 102
             EV++ +  +   H K        ++R   F  N + I E N+      R    G NKFAD
Sbjct:    25 EVLSHFNNFTMHHKKHYRTPAEKDRRLAHFAKNHQKIQELNAKARREGRNVTFGWNKFAD 84

Query:   103 LTNEEYRA 110
                +E  A
Sbjct:    85 KNRQELSA 92


>WB|WBGene00011102 [details] [associations]
            symbol:R07E3.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            EMBL:Z49207 HSSP:P53634 PIR:T24030 RefSeq:NP_001041280.1
            ProteinModelPortal:Q21810 SMR:Q21810 STRING:Q21810 MEROPS:C01.A43
            PaxDb:Q21810 EnsemblMetazoa:R07E3.1a GeneID:181242
            KEGG:cel:CELE_R07E3.1 UCSC:R07E3.1a CTD:181242 WormBase:R07E3.1a
            HOGENOM:HOG000021028 InParanoid:Q21810 OMA:ACKNEVI NextBio:913066
            ArrayExpress:Q21810 Uniprot:Q21810
        Length = 402

 Score = 345 (126.5 bits), Expect = 2.0e-31, P = 2.0e-31
 Identities = 101/290 (34%), Positives = 148/290 (51%)

Query:    79 NLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVAS---QRYAC 135
             N    +EH S    Y  G N  +D T+EE+    L  +S  KR   +++      +    
Sbjct:   123 NWNIQNEHGSAE--Y--GHNDMSDWTDEEFEKTLL-PKSFYKRLHKEAEFIEPIPESLTA 177

Query:   136 KAGDE---LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
             K G+     P+  DWR+K  + PVK QG CGSCWAF++ A VE    I  GE  +LSEQ 
Sbjct:   178 KKGESSSPFPDFFDWRDKNVITPVKAQGQCGSCWAFASTATVEAAWAIAHGEKRNLSEQT 237

Query:   193 LVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLG-AENKCDPSRR-NAKVVSID 250
             L+DCD   NA C+GG  D AF++I +NG + +  D PY+   +N C  +   N   +   
Sbjct:   238 LLDCDLVDNA-CDGGDEDKAFRYIHRNG-LANAVDLPYVAHRQNGCAVNDHWNTTRIKA- 294

Query:   251 GYEDVSPFDEMSLKKAVAD-QPVSVAIEAGGRAFQHYESGVFTGE---CGSALD--HGVV 304
              Y      DE S+   + +  PV++ + A  +  + Y+ GVFT     C + +   H ++
Sbjct:   295 AY--FLHHDEDSIINWLVNFGPVNIGM-AVIQPMRAYKGGVFTPSEYACKNEVIGLHALL 351

Query:   305 AVGYGT-ENGVDYWLVRNSWGSDWG-ENGYVKLQRNLLDTNTGKCGIAME 352
               GYGT + G  YW+V+NSWG+ WG E+GY+   R +       CGI  E
Sbjct:   352 ITGYGTSKTGEKYWIVKNSWGNTWGVEHGYIYFARGI-----NACGIEDE 396


>WB|WBGene00013076 [details] [associations]
            symbol:Y51A2D.8 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            EMBL:AL021497 HSSP:P53634 HOGENOM:HOG000019851 PIR:T27079
            RefSeq:NP_507627.1 ProteinModelPortal:Q9XXQ7 SMR:Q9XXQ7
            MEROPS:C01.A49 EnsemblMetazoa:Y51A2D.8 GeneID:180208
            KEGG:cel:CELE_Y51A2D.8 UCSC:Y51A2D.8 CTD:180208 WormBase:Y51A2D.8
            eggNOG:NOG307864 InParanoid:Q9XXQ7 OMA:VAVYFKV NextBio:908434
            Uniprot:Q9XXQ7
        Length = 386

 Score = 291 (107.5 bits), Expect = 3.8e-31, Sum P(2) = 3.8e-31
 Identities = 82/276 (29%), Positives = 130/276 (47%)

Query:    96 GLNKFADLTNEEYRAMYLGT-----------RSDAKRRLMKSKVASQRYACKAGDELPES 144
             G+NKF+DL+  E+                    D K+   ++   ++    +     P+ 
Sbjct:    91 GINKFSDLSTAEFHGRLSNVVPSNNTGLPMLNFDKKKPDFRAADMNKTRHKRRSTRYPDY 150

Query:   145 VDWR-EK--GA--VNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK 199
              D R EK  G   V P+KDQG C  CW F+  A VE +    +G+  SLS+QE+ DC  +
Sbjct:   151 FDLRNEKINGRYIVGPIKDQGQCACCWGFAVTALVETVYAAHSGKFKSLSDQEVCDCGTE 210

Query:   200 INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD 259
                GC GG +    Q++ +  G+  ++DYPY   +N+ +  RR  ++   D       F+
Sbjct:   211 GTPGCKGGSLTLGVQYV-KKYGLSGDEDYPY--DQNRANQGRR-CRLRETDRIVPARAFN 266

Query:   260 EMSLKKAVADQ-----------PVSVAIEAGGRAFQHYESGVFT-GECGSALD-HGVVAV 306
                +    A++           PV+V  + G + F+ Y+ GV    +C  A   H    V
Sbjct:   267 FAVINPRRAEEQIIQVLTEWKVPVAVYFKVGDQ-FKEYKEGVIIEDDCRRATQWHAGAIV 325

Query:   307 GYGT-ENGV----DYWLVRNSWGSDWGENGYVKLQR 337
             GY T E+      DYW+++NSWG DW E+GYV++ R
Sbjct:   326 GYDTVEDSRGRSHDYWIIKNSWGGDWAESGYVRVVR 361

 Score = 67 (28.6 bits), Expect = 3.8e-31, Sum P(2) = 3.8e-31
 Identities = 15/67 (22%), Positives = 36/67 (53%)

Query:    46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRT--Y--KVGLNKFA 101
             +++   ++ +  K+ +       N++RF  F  +   +D+ N+ ++   Y  + G+NKF+
Sbjct:    37 EKLYKAFEDFKKKYNRKYKDESENQQRFNNFVKSYNNVDKLNAKSKAAGYDTQFGINKFS 96

Query:   102 DLTNEEY 108
             DL+  E+
Sbjct:    97 DLSTAEF 103


>UNIPROTKB|P43234 [details] [associations]
            symbol:CTSO "Cathepsin O" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 Reactome:REACT_6900
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0004197
            CleanEx:HS_CTSO EMBL:X77383 EMBL:BC049206 IPI:IPI00017257
            PIR:A55090 RefSeq:NP_001325.1 UniGene:Hs.75262
            ProteinModelPortal:P43234 SMR:P43234 IntAct:P43234 STRING:P43234
            MEROPS:C01.035 PhosphoSite:P43234 DMDM:1168795 PRIDE:P43234
            DNASU:1519 Ensembl:ENST00000433477 GeneID:1519 KEGG:hsa:1519
            UCSC:uc003ipg.3 CTD:1519 GeneCards:GC04M156845 HGNC:HGNC:2542
            HPA:HPA002041 MIM:600550 neXtProt:NX_P43234 PharmGKB:PA27040
            HOVERGEN:HBG105050 InParanoid:P43234 KO:K01374 OMA:SNVCGIA
            OrthoDB:EOG4V6ZH1 PhylomeDB:P43234 BindingDB:P43234
            ChEMBL:CHEMBL3035 GenomeRNAi:1519 NextBio:6287 Bgee:P43234
            Genevestigator:P43234 GermOnline:ENSG00000151792 Uniprot:P43234
        Length = 321

 Score = 342 (125.4 bits), Expect = 4.2e-31, P = 4.2e-31
 Identities = 92/293 (31%), Positives = 143/293 (48%)

Query:    70 EKRFQIFKDNL---RFIDE-HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
             E+    F+++L   R+++    S N T   G+N+F+ L  EE++A+YL  RS   +    
Sbjct:    38 EREAAAFRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYL--RSKPSKF--- 92

Query:   126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
              + +++ +       LP   DWR+K  V  V++Q  CG CWAFS V AVE    I    L
Sbjct:    93 PRYSAEVHMSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPL 152

Query:   186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ-NGGMDSEQDYPYLGAENKCDPSRRNA 244
               LS Q+++DC    N GCNGG    A  ++ +    +  + +YP+      C     + 
Sbjct:   153 EDLSVQQVIDCSYN-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSGSH 211

Query:   245 KVVSIDGYE--DVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA-LDH 301
                SI GY   D S  ++   K  +   P+ V ++A   ++Q Y  G+    C S   +H
Sbjct:   212 SGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAV--SWQDYLGGIIQHHCSSGEANH 269

Query:   302 GVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEAS 354
              V+  G+       YW+VRNSWGS WG +GY  ++   + +N   CGIA   S
Sbjct:   270 AVLITGFDKTGSTPYWIVRNSWGSSWGVDGYAHVK---MGSNV--CGIADSVS 317


>WB|WBGene00008861 [details] [associations]
            symbol:F15D4.4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 SMART:SM00848 PROSITE:PS00640
            GeneTree:ENSGT00560000076599 EMBL:Z80344 HSSP:P53634
            eggNOG:NOG310593 PIR:T20981 ProteinModelPortal:Q93512 SMR:Q93512
            MEROPS:C01.A45 EnsemblMetazoa:F15D4.4 KEGG:cel:CELE_F15D4.4
            UCSC:F15D4.4 CTD:184530 WormBase:F15D4.4 InParanoid:Q93512
            OMA:ITMEQNI NextBio:925068 Uniprot:Q93512
        Length = 608

 Score = 345 (126.5 bits), Expect = 8.6e-31, P = 8.6e-31
 Identities = 89/288 (30%), Positives = 146/288 (50%)

Query:    71 KRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKS 126
             KRF ++    + +DEHN +      +YK+  N+F+   + E   + L   +      +  
Sbjct:   153 KRFNVYSKVKKEVDEHNIMYELGMSSYKMSTNQFSVALDGEVAPLTLNLDALTPTATVIP 212

Query:   127 KVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELI 186
                S R   K  D  P +VDWR    + P+ DQ +CG CWAFS ++ +E    I      
Sbjct:   213 ATISSR---KKRDTEP-TVDWRP--FLKPILDQSTCGGCWAFSMISMIESFFAIQGYNTS 266

Query:   187 SLSEQELVDCDRKI-------NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
             SLS Q+L+ CD K+       N GC GG    A  ++  +   D+    P+   +  CD 
Sbjct:   267 SLSVQQLLTCDTKVDSTYGLANVGCKGGYFQIAGSYLEVSAARDASL-IPFDLEDTSCDS 325

Query:   240 SRRNAKVVSI----DGY--EDVSPFDEMSLKKAVADQ----PVSVAIEAGGRAFQHYESG 289
             S     V +I    DGY   + +    +++++ + D+    P++V + AG   ++ Y  G
Sbjct:   326 SFFPPVVPTILLFDDGYISGNFTAAQLITMEQNIEDKVRKGPIAVGMAAGPDIYK-YSEG 384

Query:   290 VFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
             V+ G+CG+ ++H VV VG+ T+   DYW++RNSWG+ WGE GY +++R
Sbjct:   385 VYDGDCGTIINHAVVIVGF-TD---DYWIIRNSWGASWGEAGYFRVKR 428


>ZFIN|ZDB-GENE-030619-9 [details] [associations]
            symbol:ctsc "cathepsin C" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030619-9 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 MEROPS:C01.070 HSSP:P43235
            EMBL:BC064286 IPI:IPI00486570 RefSeq:NP_999887.1 UniGene:Dr.32463
            ProteinModelPortal:Q6P2V1 SMR:Q6P2V1 PRIDE:Q6P2V1 GeneID:368704
            KEGG:dre:368704 InParanoid:Q6P2V1 NextBio:20813127
            ArrayExpress:Q6P2V1 Bgee:Q6P2V1 Uniprot:Q6P2V1
        Length = 455

 Score = 339 (124.4 bits), Expect = 8.8e-31, P = 8.8e-31
 Identities = 92/294 (31%), Positives = 149/294 (50%)

Query:    76 FKDNLRFIDEHNSLNRTYKVGLNKFAD-LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYA 134
             + +N+ F+DE NS+ +++      F + L+  E      G  S   RR+    VA+    
Sbjct:   161 YTNNMMFVDEINSVQKSWTATAYSFHETLSIHEMLRRSGGPASRIPRRVRPVTVAADS-- 218

Query:   135 CKAGDELPESVDWREKGAVN---PVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS--LS 189
              KA   LP+  DWR    VN   PV++Q  CGSC++F+T+  +E   +I T        S
Sbjct:   219 -KAASGLPQHWDWRNVNGVNFVSPVRNQAQCGSCYSFATMGMLEARVRIQTNNTQQPVFS 277

Query:   190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
              Q++V C +  + GC+GG   Y     IQ+ G+  E  +PY G+++ C+   +  K  + 
Sbjct:   278 PQQVVSCSQ-YSQGCDGGF-PYLIGKYIQDFGIVEEDCFPYTGSDSPCNLPAKCTKYYAS 335

Query:   250 DGYEDVSPF-----DEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF--TG--ECGSALD 300
             D Y  V  F     +   + + V + P+ VA+E     F +Y+ G++  TG  +  +  +
Sbjct:   336 D-YHYVGGFYGGCSESAMMLELVKNGPMGVALEVYPD-FMNYKEGIYHHTGLRDANNPFE 393

Query:   301 ---HGVVAVGYGT--ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
                H V+ VGYG   + G  YW+V+NSWGS WGENG+ +++R      T +C I
Sbjct:   394 LTNHAVLLVGYGQCHKTGEKYWIVKNSWGSGWGENGFFRIRRG-----TDECAI 442


>UNIPROTKB|E9PI30 [details] [associations]
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            EMBL:AP001201 HGNC:HGNC:2546 IPI:IPI00984532
            ProteinModelPortal:E9PI30 SMR:E9PI30 Ensembl:ENST00000528419
            ArrayExpress:E9PI30 Bgee:E9PI30 Uniprot:E9PI30
        Length = 364

 Score = 279 (103.3 bits), Expect = 1.6e-30, Sum P(2) = 1.6e-30
 Identities = 77/255 (30%), Positives = 125/255 (49%)

Query:    72 RFQIFKDNLRFID--EHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
             R  IF  NL      +   L  T + G+  F+DLT EE+  +Y   R+      M  ++ 
Sbjct:    62 RLDIFAHNLAQAQRLQEEDLG-TAEFGVTPFSDLTEEEFGQLYGYRRAAGGVPSMGREIR 120

Query:   130 SQRYACKAGDELPESVDWRE-KGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL 188
             S+    +  + +P S DWR+   A++P+KDQ +C  CWA +    +E + +I   + + +
Sbjct:   121 SE----EPEESVPFSCDWRKVASAISPIKDQKNCNCCWAMAAAGNIETLWRISFWDFVDV 176

Query:   189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGA--ENKCDPSRRNAKV 246
             S QEL+DC R    GC+GG +  AF  ++ N G+ SE+DYP+ G    ++C P ++  KV
Sbjct:   177 SVQELLDCGR-CGDGCHGGFVWDAFITVLNNSGLASEKDYPFQGKVRAHRCHP-KKYQKV 234

Query:   247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGE---CGSAL-DHG 302
               I  +  +   +    +      P++V I    +  Q Y  GV       C   L DH 
Sbjct:   235 AWIQDFIMLQNNEHRIAQYLATYGPITVTINM--KPLQLYRKGVIKATPTTCDPQLVDHS 292

Query:   303 VVAVGYGT---ENGV 314
             V+ VG+G+   E G+
Sbjct:   293 VLLVGFGSVKSEEGI 307

 Score = 73 (30.8 bits), Expect = 1.6e-30, Sum P(2) = 1.6e-30
 Identities = 9/14 (64%), Positives = 13/14 (92%)

Query:   316 YWLVRNSWGSDWGE 329
             YW+++NSWG+ WGE
Sbjct:   326 YWILKNSWGAQWGE 339


>UNIPROTKB|F1PGK4 [details] [associations]
            symbol:CTSO "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 OMA:SNVCGIA
            EMBL:AAEX03010073 Ensembl:ENSCAFT00000013638 Uniprot:F1PGK4
        Length = 316

 Score = 333 (122.3 bits), Expect = 3.8e-30, P = 3.8e-30
 Identities = 87/271 (32%), Positives = 130/271 (47%)

Query:    90 NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWRE 149
             N +   G+N+F+ L+ EE++A+YL ++     R       S R        LP   DWR+
Sbjct:    57 NSSAVYGINQFSYLSPEEFKAIYLRSKPSRSPRYPAEVRTSIRNV-----SLPLRFDWRD 111

Query:   150 KGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLM 209
             K  V  V++Q +CG CWAFS V AVE    I    L  +S Q+++DC    N GC+GG  
Sbjct:   112 KRVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGKPLADISVQQVIDCSYN-NYGCSGGST 170

Query:   210 DYAFQFIIQNG-GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYE--DVSPFDEMSLKKA 266
               A  ++ +    +  + +YP+      C     +    SI GY   D S  ++   K  
Sbjct:   171 LNALNWLNKTQVKLVRDSEYPFKAQNGLCHYFSDSYSGFSIRGYSAYDFSDQEDEMAKVL 230

Query:   267 VADQPVSVAIEAGGRAFQHYESGVFTGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGS 325
             +   P+ V ++A   ++Q Y  G+    C S   +H V+  G+       YW+VRNSWGS
Sbjct:   231 LTFGPLVVVVDAV--SWQDYLGGIIQHHCSSGEANHAVLITGFDKIGSTPYWIVRNSWGS 288

Query:   326 DWGENGY--VKLQRNLLDTNTGKCGIAMEAS 354
              WG +GY  VK+  N+       CGIA   S
Sbjct:   289 SWGVDGYAHVKMGGNI-------CGIADSVS 312


>UNIPROTKB|F1P0K2 [details] [associations]
            symbol:CTSO "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            OMA:SNVCGIA EMBL:AADN02016534 IPI:IPI00651180
            Ensembl:ENSGALT00000015270 Uniprot:F1P0K2
        Length = 320

 Score = 328 (120.5 bits), Expect = 1.3e-29, P = 1.3e-29
 Identities = 86/293 (29%), Positives = 146/293 (49%)

Query:    64 NGMGHNEKRFQIFKD--NLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKR 121
             +G G  E+   + +    +R ++  ++ N +   G N+F+ L  EE++A+YL +      
Sbjct:    35 DGGGREEEAAALRESAKRIRLLNSPSNDNGSAFYGKNQFSHLFPEEFKAIYLRSIPYKLP 94

Query:   122 RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIV 181
             R +K     ++        LP+  DWR+K  +  V++Q +CG CWAFS V  +E    I 
Sbjct:    95 RYIKVPKGEEK-------PLPKKFDWRDKKVIAEVRNQQTCGGCWAFSVVGGIESAYAIK 147

Query:   182 TGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG-GMDSEQDYPYLGAENKCDPS 240
                L  LS Q+++DC    N GC+GG    A  ++ Q    +  + +Y +      C   
Sbjct:   148 GHNLEELSVQQVIDCSYS-NYGCSGGSTITALSWLNQTKVKLVRDSEYTFKAQTGLCHYF 206

Query:   241 RRNAKVVSIDGYE--DVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
               +   VSI G+   D S  +E  ++  V   P++V ++A   ++Q Y  G+    C S 
Sbjct:   207 PHSDFGVSITGFAAYDFSGQEEEMMRVLVDWGPLAVTVDAV--SWQDYLGGIIQYHCSSG 264

Query:   299 -LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
               +H V+  G+ T   + YW+V+NSWG  WG +GYV+++   + +N   CGIA
Sbjct:   265 KANHAVLITGFDTTGIIPYWIVQNSWGRTWGIDGYVRVK---IGSNV--CGIA 312


>UNIPROTKB|Q5QP40 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015644 Pfam:PF00112
            InterPro:IPR000169 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 EMBL:AL355860 HOVERGEN:HBG011513
            PANTHER:PTHR12411:SF55 EMBL:AL356292 UniGene:Hs.632466
            HGNC:HGNC:2536 IPI:IPI00514633 SMR:Q5QP40 STRING:Q5QP40
            Ensembl:ENST00000443913 Uniprot:Q5QP40
        Length = 258

 Score = 326 (119.8 bits), Expect = 2.1e-29, P = 2.1e-29
 Identities = 77/185 (41%), Positives = 107/185 (57%)

Query:    46 DEVM-TIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHN---SLN-RTYKVGLNKF 100
             +E++ T ++ W   H K  N       R  I++ NL++I  HN   SL   TY++ +N  
Sbjct:    78 EEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHL 137

Query:   101 ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
              D+T+EE      G     K  L  S+     Y  +     P+SVD+R+KG V PVK+QG
Sbjct:   138 GDMTSEEVVQKMTGL----KVPLSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTPVKNQG 193

Query:   161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
              CGSCWAFS+V A+EG  K  TG+L++LS Q LVDC  + N GC GG M  AFQ++ +N 
Sbjct:   194 QCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE-NDGCGGGYMTNAFQYVQKNR 252

Query:   221 GMDSE 225
             G+DSE
Sbjct:   253 GIDSE 257


>WB|WBGene00044760 [details] [associations]
            symbol:Y71H2AM.25 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0004181
            "metallocarboxypeptidase activity" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            GeneTree:ENSGT00560000076599 EMBL:FO081822 eggNOG:NOG331187
            HOGENOM:HOG000114005 RefSeq:NP_001040887.1
            ProteinModelPortal:Q2AAB9 SMR:Q2AAB9 EnsemblMetazoa:Y71H2AM.25
            GeneID:4363054 KEGG:cel:CELE_Y71H2AM.25 UCSC:Y71H2AM.25 CTD:4363054
            WormBase:Y71H2AM.25 InParanoid:Q2AAB9 NextBio:959635 Uniprot:Q2AAB9
        Length = 299

 Score = 317 (116.6 bits), Expect = 1.9e-28, P = 1.9e-28
 Identities = 81/223 (36%), Positives = 117/223 (52%)

Query:   143 ESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN-KIVTGELISLSEQELVDCDRKIN 201
             E +DWR+KG V PVKDQG C +  AF+  +++E +  K   G L+S SEQ+L+DCD    
Sbjct:    84 EFLDWRDKGIVGPVKDQGKCNASHAFAISSSIESMYAKATNGSLLSFSEQQLIDCDDHGF 143

Query:   202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN-KCDPSRRNAKVVSIDGYEDVSPFDE 260
              GC       A  + I +G +++E DYPY G EN KC      +K+   D    VS  +E
Sbjct:   144 KGCEEQPAINAVSYFIFHG-IETEADYPYAGKENGKCTFDSTKSKIQLKDAEFVVS--NE 200

Query:   261 MSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFTG---ECGSALD-HGVVAVGYGTENGVD 315
                K+ V +  P    + A    +  Y+ G++     EC S  +   +V VGYG E    
Sbjct:   201 TQGKELVTNYGPAFFTMRAPPSLYD-YKIGIYNPSIEECTSTHEIRSMVIVGYGIEGVQK 259

Query:   316 YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
             YW+V+ S+G+ WGE GY+KL R   D N   C +A   + P +
Sbjct:   260 YWIVKGSFGTSWGEQGYMKLAR---DVNA--CAMADFITVPTE 297


>UNIPROTKB|O97578 [details] [associations]
            symbol:CTSC "Dipeptidyl peptidase 1" species:9615 "Canis
            lupus familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 EMBL:AF060171 RefSeq:NP_001182763.1
            UniGene:Cfa.28653 ProteinModelPortal:O97578 SMR:O97578
            MEROPS:C01.070 PRIDE:O97578 GeneID:403458 KEGG:cfa:403458
            InParanoid:O97578 NextBio:20816976 Uniprot:O97578
        Length = 435

 Score = 315 (115.9 bits), Expect = 3.1e-28, P = 3.1e-28
 Identities = 89/296 (30%), Positives = 152/296 (51%)

Query:    74 QIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKR--RLMKSKVASQ 131
             +++K N  F+   N++ +++     ++ +      R M   TR   ++  R   + + ++
Sbjct:   141 RLYKYNYEFVKAINTIQKSWTA--TRYIEYETLTLRDMM--TRVGGRKIPRPKPTPLTAE 196

Query:   132 RYACKAGDELPESVDWRE-KGA--VNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS- 187
              +  +    LP S DWR  +G   V+PV++Q SCGSC+AF++ A +E   +I+T    + 
Sbjct:   197 IH--EEISRLPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTP 254

Query:   188 -LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR----R 242
              LS QE+V C +    GC GG          Q+ G+  E  +PY G+++ C P+      
Sbjct:   255 ILSPQEIVSCSQYAQ-GCEGGFPYLIAGKYAQDFGLVEEACFPYAGSDSPCKPNDCFRYY 313

Query:   243 NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF--TG-----EC 295
             +++   + G+        M L+  V   P++VA E     F HY+ G++  TG       
Sbjct:   314 SSEYYYVGGFYGACNEALMKLE-LVRHGPMAVAFEVYDDFF-HYQKGIYYHTGLRDPFNP 371

Query:   296 GSALDHGVVAVGYGTEN--GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
                 +H V+ VGYGT++  G+DYW+V+NSWGS WGE+GY +++R      T +C I
Sbjct:   372 FELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRG-----TDECAI 422


>UNIPROTKB|F1N455 [details] [associations]
            symbol:CTSC "Dipeptidyl peptidase 1 exclusion domain chain"
            species:9913 "Bos taurus" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 IPI:IPI00697314 UniGene:Bt.49573
            InterPro:IPR014882 Pfam:PF08773 OMA:YDDFLHY GO:GO:0001913
            EMBL:DAAA02062487 EMBL:DAAA02062488 Ensembl:ENSBTAT00000014735
            Uniprot:F1N455
        Length = 463

 Score = 313 (115.2 bits), Expect = 5.7e-28, P = 5.7e-28
 Identities = 81/228 (35%), Positives = 120/228 (52%)

Query:   141 LPESVDWREKGAVN---PVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS--LSEQELVD 195
             LP S DWR    +N   PV++QGSCGSC++F+++  +E   +I+T    +  LS QE+V 
Sbjct:   231 LPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVS 290

Query:   196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDV 255
             C +    GC GG          Q+ G+  E  +PY G ++ C       +  S + Y  V
Sbjct:   291 CSQYAQ-GCEGGFPYLIAGKYAQDFGLVEEDCFPYTGTDSPCRLKEGCFRYYSSE-YHYV 348

Query:   256 SPF----DEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVF--TG-----ECGSALDHGV 303
               F    +E  +K  +  Q P++VA E     F HY  GV+  TG           +H V
Sbjct:   349 GGFYGGCNEALMKLELVHQGPMAVAFEVYDD-FLHYRKGVYHHTGLRDPFNPFELTNHAV 407

Query:   304 VAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
             + VGYGT+  +G+DYW+V+NSWG+ WGENGY +++R      T +C I
Sbjct:   408 LLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRG-----TDECAI 450


>UNIPROTKB|Q3ZCJ8 [details] [associations]
            symbol:CTSC "Dipeptidyl peptidase 1" species:9913 "Bos
            taurus" [GO:0031638 "zymogen activation" evidence=IDA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 EMBL:BC102115 IPI:IPI00697314 RefSeq:NP_001028789.1
            UniGene:Bt.49573 ProteinModelPortal:Q3ZCJ8 SMR:Q3ZCJ8 STRING:Q3ZCJ8
            PRIDE:Q3ZCJ8 GeneID:352958 KEGG:bta:352958 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 InParanoid:Q3ZCJ8 KO:K01275
            OrthoDB:EOG4H19VZ BindingDB:Q3ZCJ8 ChEMBL:CHEMBL1075050
            NextBio:20812686 GO:GO:0031638 InterPro:IPR014882 Pfam:PF08773
            Uniprot:Q3ZCJ8
        Length = 463

 Score = 313 (115.2 bits), Expect = 5.7e-28, P = 5.7e-28
 Identities = 81/228 (35%), Positives = 120/228 (52%)

Query:   141 LPESVDWREKGAVN---PVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS--LSEQELVD 195
             LP S DWR    +N   PV++QGSCGSC++F+++  +E   +I+T    +  LS QE+V 
Sbjct:   231 LPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVS 290

Query:   196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDV 255
             C +    GC GG          Q+ G+  E  +PY G ++ C       +  S + Y  V
Sbjct:   291 CSQYAQ-GCEGGFPYLIAGKYAQDFGLVEEDCFPYTGTDSPCRLKEGCFRYYSSE-YHYV 348

Query:   256 SPF----DEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVF--TG-----ECGSALDHGV 303
               F    +E  +K  +  Q P++VA E     F HY  GV+  TG           +H V
Sbjct:   349 GGFYGGCNEALMKLELVHQGPMAVAFEVYDD-FLHYRKGVYHHTGLRDPFNPFELTNHAV 407

Query:   304 VAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
             + VGYGT+  +G+DYW+V+NSWG+ WGENGY +++R      T +C I
Sbjct:   408 LLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRG-----TDECAI 450


>UNIPROTKB|J9P219 [details] [associations]
            symbol:J9P219 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 InterPro:IPR014882
            Pfam:PF08773 OMA:YDDFLHY EMBL:AAEX03012741
            Ensembl:ENSCAFT00000050015 Uniprot:J9P219
        Length = 406

 Score = 309 (113.8 bits), Expect = 1.3e-27, P = 1.3e-27
 Identities = 91/307 (29%), Positives = 153/307 (49%)

Query:    63 SNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYL-GTRSDAKR 121
             + G   N  R  ++K N  F+   N++ +++     ++ +      R M   G      R
Sbjct:   101 ARGFFSNSNR--LYKYNYEFVKAINTIQKSWTA--TRYIEYETLTLRDMMTRGGGRKIPR 156

Query:   122 RLMKSKVASQRYACKAGDELPESVDWRE-KGA--VNPVKDQG-SCGSCWAFSTVAAVEGI 177
             +   + + ++ +  +    LP S DWR  +G   V+PV++Q  SCGSC+AF++ A +E  
Sbjct:   157 KPKPTPLTAEIH--EEISRLPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEAR 214

Query:   178 NKIVTGELIS--LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN 235
              +I+T    +  LS QE+V C +    GC GG          Q+ G+  E  +PY G+++
Sbjct:   215 IRILTNNTQTPILSPQEIVSCSQYAQ-GCEGGFPYLIAGKYAQDFGLVEEACFPYAGSDS 273

Query:   236 KCDPSR----RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF 291
              C P+      +++   + G+        M L+  V   P++VA E     F HY+ G++
Sbjct:   274 PCKPNDCFRYYSSEYYYVGGFYGACNEALMKLE-LVRHGPMAVAFEVYDDFF-HYQKGIY 331

Query:   292 --TG-----ECGSALDHGVVAVGYGTEN--GVDYWLVRNSWGSDWGENGYVKLQRNLLDT 342
               TG           +H V+ VGYGT++  G+DYW+V+NSWGS WGE+GY +++R     
Sbjct:   332 YHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRG---- 387

Query:   343 NTGKCGI 349
              T +C I
Sbjct:   388 -TDECAI 393


>UNIPROTKB|F1PSK8 [details] [associations]
            symbol:F1PSK8 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 InterPro:IPR014882
            Pfam:PF08773 EMBL:AAEX03012741 Ensembl:ENSCAFT00000007054
            Uniprot:F1PSK8
        Length = 405

 Score = 308 (113.5 bits), Expect = 1.7e-27, P = 1.7e-27
 Identities = 89/297 (29%), Positives = 152/297 (51%)

Query:    74 QIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKR--RLMKSKVASQ 131
             +++K N  F+   N++ +++     ++ +      R M   TR   ++  R   + + ++
Sbjct:   110 RLYKYNYEFVKAINTIQKSWTA--TRYIEYETLTLRDMM--TRGGGRKIPRPKPTPLTAE 165

Query:   132 RYACKAGDELPESVDWRE-KGA--VNPVKDQG-SCGSCWAFSTVAAVEGINKIVTGELIS 187
              +  +    LP S DWR  +G   V+PV++Q  SCGSC+AF++ A +E   +I+T    +
Sbjct:   166 IH--EEISRLPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNNTQT 223

Query:   188 --LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR---- 241
               LS QE+V C +    GC GG          Q+ G+  E  +PY G+++ C P+     
Sbjct:   224 PILSPQEIVSCSQYAQ-GCEGGFPYLIAGKYAQDFGLVEEACFPYAGSDSPCKPNDCFRY 282

Query:   242 RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF--TG-----E 294
              +++   + G+        M L+  V   P++VA E     F HY+ G++  TG      
Sbjct:   283 YSSEYYYVGGFYGACNEALMKLE-LVRHGPMAVAFEVYDDFF-HYQKGIYYHTGLRDPFN 340

Query:   295 CGSALDHGVVAVGYGTEN--GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
                  +H V+ VGYGT++  G+DYW+V+NSWGS WGE+GY +++R      T +C I
Sbjct:   341 PFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRG-----TDECAI 392


>WB|WBGene00008231 [details] [associations]
            symbol:tag-329 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            eggNOG:NOG288820 EMBL:Z70750 HSSP:P53634 HOGENOM:HOG000019851
            PIR:T20110 RefSeq:NP_505458.1 ProteinModelPortal:Q18740 SMR:Q18740
            MEROPS:C01.A36 EnsemblMetazoa:C50F4.3 GeneID:183677
            KEGG:cel:CELE_C50F4.3 UCSC:C50F4.3 CTD:183677 WormBase:C50F4.3
            InParanoid:Q18740 OMA:WIFRNSW NextBio:921986 Uniprot:Q18740
        Length = 374

 Score = 308 (113.5 bits), Expect = 1.7e-27, P = 1.7e-27
 Identities = 84/268 (31%), Positives = 129/268 (48%)

Query:    94 KVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDE-LPESVDWREK-- 150
             K G+NKF+DL+ +E   MY       K      K   +    K   E LP++ D R K  
Sbjct:    93 KYGINKFSDLSKKEIHGMY-SKFGPPKNNTNVPKFNLKNLRVKRQMEGLPKTFDLRNKKV 151

Query:   151 GA---VNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGG 207
             G    + P+K Q SC  CW F+  A  E    +   + ++LSEQE+ DC  K   GCNGG
Sbjct:   152 GGHYIIGPIKTQDSCACCWGFAATAVAEAALTVHLKKAMNLSEQEVCDCAPKHGPGCNGG 211

Query:   208 LMDYAFQFIIQNGGMDSEQDYPYLGAEN----KCDPSR--RNAKVVSIDGYEDVSPFD-- 259
                   ++I +  G+   ++YP+    +    +C+  +  R    + +D Y  + PF+  
Sbjct:   212 DPVDGLEYI-KEMGLTGGKEYPFNVNRSTQLGRCESEKYDRELNPLELDYYA-IDPFNAE 269

Query:   260 -EMSLKKAVADQPVSVAIEAGGRAFQHYESGVFT-GECGSALD---HGVVAVGYGT-ENG 313
              +M+    + + P+SVA   G  +   Y SG+    +C        H    VGYGT +N 
Sbjct:   270 YQMTHHLYLLNLPISVAFRTGA-SLSSYLSGILELADCDDEKGGHWHSGAIVGYGTTKNS 328

Query:   314 ----VDYWLVRNSWGSDWGENGYVKLQR 337
                 VDYW+ RNSW +DWG++GY ++ R
Sbjct:   329 AGRTVDYWIFRNSWWTDWGDDGYARIVR 356


>MGI|MGI:109553 [details] [associations]
            symbol:Ctsc "cathepsin C" species:10090 "Mus musculus"
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IGI]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IMP]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0005783 "endoplasmic
            reticulum" evidence=ISO] [GO:0005794 "Golgi apparatus"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=ISO;IMP]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0010033
            "response to organic substance" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0031404 "chloride ion
            binding" evidence=ISO] [GO:0042802 "identical protein binding"
            evidence=ISO] [GO:0043621 "protein self-association" evidence=ISO]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 MGI:MGI:109553 GO:GO:0005783
            GO:GO:0005794 GO:GO:0007568 GO:GO:0010033 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004252 GO:GO:0005764 GO:GO:0031404 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 MEROPS:C01.070 OMA:YDDFLHY
            GO:GO:0001913 EMBL:U89269 EMBL:U74683 EMBL:BC067063 IPI:IPI00130015
            RefSeq:NP_034112.3 UniGene:Mm.322945 ProteinModelPortal:P97821
            SMR:P97821 STRING:P97821 PhosphoSite:P97821 PaxDb:P97821
            PRIDE:P97821 Ensembl:ENSMUST00000032779 GeneID:13032 KEGG:mmu:13032
            InParanoid:P97821 BindingDB:P97821 ChEMBL:CHEMBL3454 ChiTaRS:CTSC
            NextBio:282904 Bgee:P97821 CleanEx:MM_CTSC Genevestigator:P97821
            Uniprot:P97821
        Length = 462

 Score = 308 (113.5 bits), Expect = 2.2e-27, P = 2.2e-27
 Identities = 80/228 (35%), Positives = 120/228 (52%)

Query:   141 LPESVDWREKGAVN---PVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS--LSEQELVD 195
             LPES DWR    VN   PV++Q SCGSC++F+++  +E   +I+T    +  LS QE+V 
Sbjct:   230 LPESWDWRNVQGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVS 289

Query:   196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDV 255
             C      GC+GG          Q+ G+  E  +PY   ++ C P     +  S D Y  V
Sbjct:   290 CSPYAQ-GCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDSPCKPRENCLRYYSSDYYY-V 347

Query:   256 SPF----DEMSLK-KAVADQPVSVAIEAGGRAFQHYESGVF--TGECG-----SALDHGV 303
               F    +E  +K + V   P++VA E     F HY SG++  TG           +H V
Sbjct:   348 GGFYGGCNEALMKLELVKHGPMAVAFEVHDD-FLHYHSGIYHHTGLSDPFNPFELTNHAV 406

Query:   304 VAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
             + VGYG +   G++YW+++NSWGS+WGE+GY +++R      T +C I
Sbjct:   407 LLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRG-----TDECAI 449


>RGD|2445 [details] [associations]
            symbol:Ctsc "cathepsin C" species:10116 "Rattus norvegicus"
          [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA;ISO]
          [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
          [GO:0004252 "serine-type endopeptidase activity" evidence=IEA;ISO]
          [GO:0005764 "lysosome" evidence=IDA;TAS] [GO:0005783 "endoplasmic
          reticulum" evidence=IDA] [GO:0005794 "Golgi apparatus" evidence=IDA]
          [GO:0006508 "proteolysis" evidence=IEP;ISO;TAS] [GO:0007568 "aging"
          evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
          evidence=ISO] [GO:0010033 "response to organic substance"
          evidence=IDA] [GO:0031404 "chloride ion binding" evidence=IDA]
          [GO:0042802 "identical protein binding" evidence=IDA] [GO:0043621
          "protein self-association" evidence=IDA] InterPro:IPR000668
          InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
          InterPro:IPR000169 RGD:2445 GO:GO:0005783 GO:GO:0005794 GO:GO:0007568
          GO:GO:0010033 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
          InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139
          PROSITE:PS00639 GO:GO:0004252 GO:GO:0005764 GO:GO:0043621
          GO:GO:0042802 GO:GO:0031404 GO:GO:0004197
          GeneTree:ENSGT00560000076599 CTD:1075 HOGENOM:HOG000068022
          HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ InterPro:IPR014882
          Pfam:PF08773 MEROPS:C01.070 OMA:YDDFLHY GO:GO:0001913 EMBL:D90404
          IPI:IPI00193765 PIR:A41158 RefSeq:NP_058793.1 UniGene:Rn.203177
          PDB:1JQP PDBsum:1JQP ProteinModelPortal:P80067 SMR:P80067
          STRING:P80067 PhosphoSite:P80067 PRIDE:P80067
          Ensembl:ENSRNOT00000022342 GeneID:25423 KEGG:rno:25423
          InParanoid:P80067 SABIO-RK:P80067 EvolutionaryTrace:P80067
          NextBio:606591 ArrayExpress:P80067 Genevestigator:P80067
          GermOnline:ENSRNOG00000016496 Uniprot:P80067
        Length = 462

 Score = 305 (112.4 bits), Expect = 5.1e-27, P = 5.1e-27
 Identities = 84/239 (35%), Positives = 124/239 (51%)

Query:   141 LPESVDWREKGAVN---PVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS--LSEQELVD 195
             LPES DWR    +N   PV++Q SCGSC++F+++  +E   +I+T    +  LS QE+V 
Sbjct:   230 LPESWDWRNVRGINFVSPVRNQESCGSCYSFASLGMLEARIRILTNNSQTPILSPQEVVS 289

Query:   196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDV 255
             C      GC+GG          Q+ G+  E  +PY   +  C P     +  S + Y  V
Sbjct:   290 CSPYAQ-GCDGGFPYLIAGKYAQDFGVVEENCFPYTATDAPCKPKENCLRYYSSEYYY-V 347

Query:   256 SPF----DEMSLK-KAVADQPVSVAIEAGGRAFQHYESGVF--TGECG-----SALDHGV 303
               F    +E  +K + V   P++VA E     F HY SG++  TG           +H V
Sbjct:   348 GGFYGGCNEALMKLELVKHGPMAVAFEVHDD-FLHYHSGIYHHTGLSDPFNPFELTNHAV 406

Query:   304 VAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI---AMEASYPV 357
             + VGYG +   G+DYW+V+NSWGS WGE+GY +++R      T +C I   AM A+ P+
Sbjct:   407 LLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRIRRG-----TDECAIESIAM-AAIPI 459


>WB|WBGene00022189 [details] [associations]
            symbol:Y71H2AR.2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0004181
            "metallocarboxypeptidase activity" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] [GO:0008340 "determination of adult lifespan"
            evidence=IMP] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0008340 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 HSSP:P07711
            eggNOG:NOG331187 HOGENOM:HOG000114005 EMBL:FO081570
            RefSeq:NP_497627.1 UniGene:Cel.28419 ProteinModelPortal:Q9BL26
            SMR:Q9BL26 EnsemblMetazoa:Y71H2AR.2 GeneID:190615
            KEGG:cel:CELE_Y71H2AR.2 UCSC:Y71H2AR.2 CTD:190615
            WormBase:Y71H2AR.2 InParanoid:Q9BL26 OMA:CAMATTI NextBio:946382
            Uniprot:Q9BL26
        Length = 345

 Score = 303 (111.7 bits), Expect = 5.7e-27, P = 5.7e-27
 Identities = 72/204 (35%), Positives = 111/204 (54%)

Query:   143 ESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGIN-KIVTGELISLSEQELVDCDRKIN 201
             E +DWREKG V PVKDQG C +  AF+  +++E +  K   G L+S SEQ+L+DC+ +  
Sbjct:    84 EFLDWREKGIVGPVKDQGKCNASHAFAITSSIESMYAKATNGTLLSFSEQQLIDCNDQGY 143

Query:   202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN-KCDPSRRNAKVVSIDGYEDVSPFDE 260
              GC       A  ++  +G +++E DYPY+   N KC      +K+    G   V+  +E
Sbjct:   144 KGCEEQFAMNAIGYLATHG-IETEADYPYVDKTNEKCTFDSTKSKIHLKKGV--VAEGNE 200

Query:   261 MSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFTG---ECGSALD-HGVVAVGYGTENGVD 315
             +  K  V +  P    + A    +  Y+ G++     EC S  +   +V VGYG E    
Sbjct:   201 VLGKVYVTNYGPAFFTMRAPPSLYD-YKIGIYNPSIEECTSTHEIRSMVIVGYGIEGEQK 259

Query:   316 YWLVRNSWGSDWGENGYVKLQRNL 339
             YW+V+ S+G+ WGE GY+KL R++
Sbjct:   260 YWIVKGSFGTSWGEQGYMKLARDV 283


>UNIPROTKB|P53634 [details] [associations]
            symbol:CTSC "Dipeptidyl peptidase 1" species:9606 "Homo
            sapiens" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0005794
            "Golgi apparatus" evidence=IEA] [GO:0007568 "aging" evidence=IEA]
            [GO:0010033 "response to organic substance" evidence=IEA]
            [GO:0031404 "chloride ion binding" evidence=IEA] [GO:0042802
            "identical protein binding" evidence=IEA] [GO:0043621 "protein
            self-association" evidence=IEA] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0006508 "proteolysis" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0006955
            "immune response" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005783 GO:GO:0005794 Reactome:REACT_6900
            GO:GO:0006955 GO:GO:0007568 GO:GO:0010033 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004252 GO:GO:0005764 GO:GO:0031404 CTD:1075
            HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ InterPro:IPR014882
            Pfam:PF08773 MEROPS:C01.070 EMBL:X87212 EMBL:U79415 EMBL:AF234263
            EMBL:AF234264 EMBL:AF254757 EMBL:AF525032 EMBL:AF525033
            EMBL:AK292117 EMBL:AK311923 EMBL:AK223038 EMBL:BX537913
            EMBL:AC011088 EMBL:CH471185 EMBL:BC054028 EMBL:BC100891
            EMBL:BC100892 EMBL:BC100893 EMBL:BC100894 EMBL:BC109386
            EMBL:BC110071 EMBL:BC113850 EMBL:BC113897 IPI:IPI00022810
            IPI:IPI00171323 IPI:IPI00872258 PIR:S23941 PIR:S66504
            RefSeq:NP_001107645.1 RefSeq:NP_001805.3 RefSeq:NP_680475.1
            UniGene:Hs.128065 PDB:1K3B PDB:2DJF PDB:2DJG PDB:3PDF PDBsum:1K3B
            PDBsum:2DJF PDBsum:2DJG PDBsum:3PDF ProteinModelPortal:P53634
            SMR:P53634 IntAct:P53634 MINT:MINT-4655964 STRING:P53634
            PhosphoSite:P53634 DMDM:1705632 PaxDb:P53634 PRIDE:P53634
            DNASU:1075 Ensembl:ENST00000227266 Ensembl:ENST00000524463
            Ensembl:ENST00000529974 GeneID:1075 KEGG:hsa:1075 UCSC:uc001pck.4
            UCSC:uc001pcm.4 GeneCards:GC11M088026 HGNC:HGNC:2528 HPA:CAB025364
            MIM:170650 MIM:245000 MIM:245010 MIM:602365 neXtProt:NX_P53634
            Orphanet:2342 Orphanet:678 PharmGKB:PA27028 HOGENOM:HOG000127503
            InParanoid:P53634 OMA:YDDFLHY PhylomeDB:P53634
            BioCyc:MetaCyc:HS03265-MONOMER SABIO-RK:P53634 BindingDB:P53634
            ChEMBL:CHEMBL2252 EvolutionaryTrace:P53634 GenomeRNAi:1075
            NextBio:4488 PMAP-CutDB:P53634 ArrayExpress:P53634 Bgee:P53634
            Genevestigator:P53634 GermOnline:ENSG00000109861 GO:GO:0001913
            Uniprot:P53634
        Length = 463

 Score = 303 (111.7 bits), Expect = 9.0e-27, P = 9.0e-27
 Identities = 79/228 (34%), Positives = 120/228 (52%)

Query:   141 LPESVDWREKGAVN---PVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS--LSEQELVD 195
             LP S DWR    +N   PV++Q SCGSC++F+++  +E   +I+T    +  LS QE+V 
Sbjct:   231 LPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVS 290

Query:   196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDV 255
             C +    GC GG          Q+ G+  E  +PY G ++ C       +  S + Y  V
Sbjct:   291 CSQYAQ-GCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCKMKEDCFRYYSSE-YHYV 348

Query:   256 SPF----DEMSLK-KAVADQPVSVAIEAGGRAFQHYESGVF--TG-----ECGSALDHGV 303
               F    +E  +K + V   P++VA E     F HY+ G++  TG           +H V
Sbjct:   349 GGFYGGCNEALMKLELVHHGPMAVAFEVYDD-FLHYKKGIYHHTGLRDPFNPFELTNHAV 407

Query:   304 VAVGYGTEN--GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
             + VGYGT++  G+DYW+V+NSWG+ WGENGY +++R      T +C I
Sbjct:   408 LLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRG-----TDECAI 450


>UNIPROTKB|F1STR1 [details] [associations]
            symbol:CTSC "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0004252
            "serine-type endopeptidase activity" evidence=IEA] [GO:0001913 "T
            cell mediated cytotoxicity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 KO:K01275 InterPro:IPR014882
            Pfam:PF08773 OMA:YDDFLHY GO:GO:0001913 EMBL:CU855751
            RefSeq:XP_003129789.1 UniGene:Ssc.6155 Ensembl:ENSSSCT00000016280
            GeneID:100522387 KEGG:ssc:100522387 Uniprot:F1STR1
        Length = 463

 Score = 297 (109.6 bits), Expect = 4.5e-26, P = 4.5e-26
 Identities = 79/228 (34%), Positives = 120/228 (52%)

Query:   141 LPESVDWRE-KGA--VNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS--LSEQELVD 195
             LP S DWR  +G   V PV++Q SCGSC++F+++  +E   +I+T    +  LS QE+V 
Sbjct:   231 LPASWDWRNVRGTNFVTPVRNQASCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVS 290

Query:   196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDV 255
             C +    GC GG          Q+ G+  E  +PY G ++ C       +  S + Y  V
Sbjct:   291 CSQYAQ-GCAGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCTVKEGCFRYYSSE-YHYV 348

Query:   256 SPF----DEMSLK-KAVADQPVSVAIEAGGRAFQHYESGVF--TG-----ECGSALDHGV 303
               F    +E  +K + V   P++VA E     F HY  G++  TG           +H V
Sbjct:   349 GGFYGGCNEALMKLELVHHGPMAVAFEVYDD-FLHYRKGIYHHTGLRDPFNPFELTNHAV 407

Query:   304 VAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
             + VGYGT+  +G+DYW+V+NSWG+ WGE+GY +++R      T +C I
Sbjct:   408 LLVGYGTDLASGMDYWIVKNSWGTSWGEDGYFRIRRG-----TDECAI 450


>UNIPROTKB|J9NSE7 [details] [associations]
            symbol:CTSC "Dipeptidyl peptidase 1" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 InterPro:IPR014882 Pfam:PF08773
            EMBL:AAEX03017125 Ensembl:ENSCAFT00000014269 OMA:INGQICH
            Uniprot:J9NSE7
        Length = 458

 Score = 296 (109.3 bits), Expect = 5.3e-26, P = 5.3e-26
 Identities = 87/295 (29%), Positives = 144/295 (48%)

Query:    74 QIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRY 133
             +++K N  F+   N++ +++     ++ +      R M    R    R++ + K      
Sbjct:   164 RLYKYNYEFVKAINTIQKSWTA--TRYIEYETLTLRDMM---RRAGGRKIPRPKPTPLTA 218

Query:   134 ACKAG-DELPESVDWRE-KGA--VNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS-- 187
                     LP S DWR  +G   V+PV++Q SCGSC+AF++   +E   +I+T    +  
Sbjct:   219 EIHEEISRLPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTVMLEARIRILTNNTQTPI 278

Query:   188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR----RN 243
             LS QE+V C +    GC GG          Q+ G+  E  + Y G+++ C P+      +
Sbjct:   279 LSPQEIVSCSQYAQ-GCEGGFPYLIAGKYAQDFGLVDEACFSYAGSDSPCKPNDCFHYYS 337

Query:   244 AKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF--TG-----ECG 296
             ++   + G+        M L+  V   P++VA E     F HY+ G++  TG        
Sbjct:   338 SEYHYVGGFYGACNEALMKLE-LVRHGPMAVAFEVYDDFF-HYQKGIYYHTGLRDPINPF 395

Query:   297 SALDHGVVAVGYGTEN--GVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
                +H V+ VGYGT++  G+DYW+V+NSWGS WGE+GY ++ R      T +C I
Sbjct:   396 ELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFQICRG-----TDECAI 445


>UNIPROTKB|E9PKT6 [details] [associations]
            symbol:CTSH "Cathepsin H" species:9606 "Homo sapiens"
            [GO:0001520 "outer dense fiber" evidence=IEA] [GO:0001656
            "metanephros development" evidence=IEA] [GO:0001669 "acrosomal
            vesicle" evidence=IEA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0007283 "spermatogenesis" evidence=IEA] [GO:0008284 "positive
            regulation of cell proliferation" evidence=IEA] [GO:0010634
            "positive regulation of epithelial cell migration" evidence=IEA]
            [GO:0016505 "apoptotic protease activator activity" evidence=IEA]
            [GO:0030984 "kininogen binding" evidence=IEA] [GO:0031638 "zymogen
            activation" evidence=IEA] [GO:0031648 "protein destabilization"
            evidence=IEA] [GO:0032403 "protein complex binding" evidence=IEA]
            [GO:0032526 "response to retinoic acid" evidence=IEA] [GO:0033619
            "membrane protein proteolysis" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0043621 "protein
            self-association" evidence=IEA] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IEA] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            InterPro:IPR000169 GO:GO:0043066 GO:GO:0008284 PANTHER:PTHR12411
            PROSITE:PS00139 GO:GO:0045766 GO:GO:0004252 GO:GO:0032526
            GO:GO:0016505 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 GO:GO:0060448 GO:GO:0033619
            EMBL:AC011944 HGNC:HGNC:2535 IPI:IPI00375426
            ProteinModelPortal:E9PKT6 SMR:E9PKT6 PRIDE:E9PKT6
            Ensembl:ENST00000528741 ArrayExpress:E9PKT6 Bgee:E9PKT6
            Uniprot:E9PKT6
        Length = 134

 Score = 284 (105.0 bits), Expect = 5.9e-25, P = 5.9e-25
 Identities = 60/140 (42%), Positives = 82/140 (58%)

Query:    95 VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGA-V 153
             + LN+F+D++  E +  YL +                 Y    G   P SVDWR+KG  V
Sbjct:     1 MALNQFSDMSFAEIKHKYLWSEPQ------NCSATKSNYLRGTGP-YPPSVDWRKKGNFV 53

Query:   154 NPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYA 212
             +PVK+QG+CGSCW FST  A+E    I TG+++SL+EQ+LVDC +  N  GC GGL   A
Sbjct:    54 SPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQA 113

Query:   213 FQFIIQNGGMDSEQDYPYLG 232
             F++I+ N G+  E  YPY G
Sbjct:   114 FEYILYNKGIMGEDTYPYQG 133


>UNIPROTKB|F1NWG2 [details] [associations]
            symbol:CTSC "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 InterPro:IPR014882 Pfam:PF08773
            OMA:YDDFLHY GO:GO:0001913 EMBL:AADN02004805 IPI:IPI00577371
            Ensembl:ENSGALT00000027869 Uniprot:F1NWG2
        Length = 463

 Score = 283 (104.7 bits), Expect = 1.8e-24, P = 1.8e-24
 Identities = 87/294 (29%), Positives = 140/294 (47%)

Query:    76 FKDNLRFIDEHNSLNRTYKVG-LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYA 134
             F  N  F++  N+  ++++     ++ + + EE      G  S   R   K    +    
Sbjct:   168 FVHNFDFVNAINAHQKSWRATRYEEYENFSLEELTRRAGGLYSRTSRP--KPAPLTPELL 225

Query:   135 CKAGDELPESVDWREKGAVN---PVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS--LS 189
              K    LPES DWR    VN   PV++Q SCGSC+AF+++  +E   +I+T        S
Sbjct:   226 KKVSG-LPESWDWRNVNGVNYVSPVRNQASCGSCYAFASMGMLEARIRILTNNTQKPVFS 284

Query:   190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
              Q++V C +  + GC+GG         +Q+ G+  E  +PY   +  C   +R+      
Sbjct:   285 PQQVVSCSQ-YSQGCDGGFPYLIAGKYVQDFGVVEEDCFPYTAKDTPC-LFKRSCYHYYT 342

Query:   250 DGYEDVSPF----DEMSLK-KAVADQPVSVAIEAGGRAFQHYESGVF--TG---ECG--S 297
               Y  V  F    +E  +K + V   P++VA E     F  Y+ G++  TG   E     
Sbjct:   343 SEYHYVGGFYGACNEALMKLELVLSGPMAVAFEVYND-FMFYKEGIYHHTGLKDEFNPFE 401

Query:   298 ALDHGVVAVGYGT--ENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
               +H V+ VGYG   E+G  +W+V+NSWG+ WGE+GY +++R      T +C I
Sbjct:   402 LTNHAVLLVGYGKDPESGEKFWIVKNSWGTSWGEDGYFRIRRG-----TDECAI 450


>TAIR|locus:505006093 [details] [associations]
            symbol:AT1G02305 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0005829 "cytosol" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684 GO:GO:0005773
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 HOGENOM:HOG000241341
            KO:K01363 PANTHER:PTHR12411:SF16 OMA:CCGFLCG UniGene:At.23486
            UniGene:At.42610 UniGene:At.43952 EMBL:AY039887 EMBL:AF428337
            EMBL:BT002227 IPI:IPI00524601 RefSeq:NP_563648.1 HSSP:P07858
            ProteinModelPortal:Q93VC9 SMR:Q93VC9 IntAct:Q93VC9 STRING:Q93VC9
            MEROPS:C01.049 PRIDE:Q93VC9 ProMEX:Q93VC9 EnsemblPlants:AT1G02305.1
            GeneID:839538 KEGG:ath:AT1G02305 TAIR:At1g02305 InParanoid:Q93VC9
            PhylomeDB:Q93VC9 ProtClustDB:CLSN2687619 Genevestigator:Q93VC9
            Uniprot:Q93VC9
        Length = 362

 Score = 172 (65.6 bits), Expect = 1.5e-23, Sum P(2) = 1.5e-23
 Identities = 43/130 (33%), Positives = 65/130 (50%)

Query:   225 EQDYPYLGAENKCDPSR---RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
             E  YP      KC       R +K   +  Y+  S  D++ + +   + PV VA      
Sbjct:   211 EPAYPTPKCARKCVSGNQLWRESKHYGVSAYKVRSHPDDI-MAEVYKNGPVEVAFTVY-E 268

Query:   282 AFQHYESGVFTGECGSALD-HGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
              F HY+SGV+    G+ +  H V  +G+GT ++G DYWL+ N W   WG++GY K++R  
Sbjct:   269 DFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRG- 327

Query:   340 LDTNTGKCGI 349
                 T +CGI
Sbjct:   328 ----TNECGI 333

 Score = 164 (62.8 bits), Expect = 1.5e-23, Sum P(2) = 1.5e-23
 Identities = 45/157 (28%), Positives = 73/157 (46%)

Query:    90 NRTYKVGLN-KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWR 148
             N  +K   N +FA+ T  E++ + LG +   K   +   + S   + K   E      W 
Sbjct:    59 NAGWKASFNDRFANATVAEFKRL-LGVKPTPKTEFLGVPIVSHDISLKLPKEFDARTAWS 117

Query:   149 EKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGG 207
             +  ++  + DQG CGSCWAF  V ++     I     +SLS  +L+ C   +   GCNGG
Sbjct:   118 QCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLLACCGFLCGQGCNGG 177

Query:   208 LMDYAFQFIIQNGGMDSEQDYPYL---GAENK-CDPS 240
                 A+++   +G +  E D PY    G  +  C+P+
Sbjct:   178 YPIAAWRYFKHHGVVTEECD-PYFDNTGCSHPGCEPA 213


>DICTYBASE|DDB_G0276111 [details] [associations]
            symbol:DDB_G0276111 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0276111 Pfam:PF00188
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            PROSITE:PS00139 EMBL:AAFI02000014 Gene3D:3.40.33.10
            InterPro:IPR001283 InterPro:IPR014044 PRINTS:PR00837 SMART:SM00198
            SUPFAM:SSF55797 ProtClustDB:CLSZ2429919 RefSeq:XP_643261.1
            ProteinModelPortal:Q75JH0 EnsemblProtists:DDB0169514 GeneID:8620304
            KEGG:ddi:DDB_G0276111 InParanoid:Q75JH0 OMA:GFVTSIK Uniprot:Q75JH0
        Length = 415

 Score = 270 (100.1 bits), Expect = 2.7e-23, P = 2.7e-23
 Identities = 64/197 (32%), Positives = 103/197 (52%)

Query:   145 VDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG---INKIVTGELISLSEQELVDCDRKIN 201
             VDW+  G V  +K+QG CG C++F+T AA+E    I   +    I LSEQ  V C   +N
Sbjct:   213 VDWKSLGFVTSIKNQGQCGGCYSFATCAALESAYLIKNNLPNTDIDLSEQNFVSC---VN 269

Query:   202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
              GC GG        +   G M  E  YPY      C    ++ +     GY ++    E 
Sbjct:   270 YGCGGGNGQSCLDKLKSTGIM-YETSYPYKAVTGSCPNVIQSPQPFKWTGYSNIQGNKEA 328

Query:   262 SLKKAVADQPV--SVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLV 319
              L  A+   P+  S+ +++G   FQ Y+SG+++    S  +H +  VGY + +  + +L+
Sbjct:   329 FLN-ALKSGPIYASLYVDSG---FQLYKSGIYSCSQSSTPNHAITIVGYSSAD--NSYLI 382

Query:   320 RNSWGSDWGENGYVKLQ 336
             +NSWG+ +GE+GY++L+
Sbjct:   383 KNSWGTIYGESGYIRLK 399


>TAIR|locus:2133402 [details] [associations]
            symbol:AT4G01610 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0005829 "cytosol" evidence=IDA] [GO:0000902 "cell
            morphogenesis" evidence=RCA] [GO:0006635 "fatty acid
            beta-oxidation" evidence=RCA] [GO:0010162 "seed dormancy process"
            evidence=RCA] [GO:0016049 "cell growth" evidence=RCA] [GO:0048193
            "Golgi vesicle transport" evidence=RCA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0005773 EMBL:CP002687
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 eggNOG:NOG315657
            HOGENOM:HOG000241341 KO:K01363 PANTHER:PTHR12411:SF16 OMA:DAIPDHF
            HSSP:P07858 ProtClustDB:CLSN2687619 EMBL:AF370193 EMBL:AY065167
            EMBL:AY114015 EMBL:AY086034 EMBL:AF083797 EMBL:BT001190
            EMBL:AK175280 EMBL:AK175481 EMBL:AK175539 EMBL:AK176165
            EMBL:AK176244 EMBL:AK176281 EMBL:AK176330 EMBL:AK176416
            EMBL:AK176433 EMBL:AK176487 EMBL:AK221398 EMBL:AK230235
            IPI:IPI00530811 RefSeq:NP_567215.1 UniGene:At.24471
            ProteinModelPortal:Q94K85 SMR:Q94K85 STRING:Q94K85 MEROPS:C01.144
            PaxDb:Q94K85 PRIDE:Q94K85 EnsemblPlants:AT4G01610.1 GeneID:826792
            KEGG:ath:AT4G01610 TAIR:At4g01610 InParanoid:Q94K85
            PhylomeDB:Q94K85 Genevestigator:Q94K85 Uniprot:Q94K85
        Length = 359

 Score = 166 (63.5 bits), Expect = 4.7e-23, Sum P(2) = 4.7e-23
 Identities = 47/144 (32%), Positives = 69/144 (47%)

Query:   225 EQDYPYLGAENKCDPSRR---NAKVVSIDGYEDVS-PFDEMSLKKAVADQPVSVAIEAGG 280
             E  YP      KC    +    +K  S+  Y   S P D M+  +   + PV V+     
Sbjct:   208 EPAYPTPKCSRKCVSDNKLWSESKHYSVSTYTVKSNPQDIMA--EVYKNGPVEVSFTVY- 264

Query:   281 RAFQHYESGVFTGECGSALD-HGVVAVGYGTEN-GVDYWLVRNSWGSDWGENGYVKLQRN 338
               F HY+SGV+    GS +  H V  +G+GT + G DYWL+ N W   WG++GY  ++R 
Sbjct:   265 EDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRRG 324

Query:   339 LLDTNTGKCGIAMEASYPVKNSQN 362
                  T +CGI  E    + +S+N
Sbjct:   325 -----TNECGIEDEPVAGLPSSKN 343

 Score = 166 (63.5 bits), Expect = 4.7e-23, Sum P(2) = 4.7e-23
 Identities = 48/174 (27%), Positives = 85/174 (48%)

Query:    74 QIFKDNL-RFIDEHNSLNRTYKVGLN-KFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
             +I +D + + ++E+   N  +K  +N +F++ T  E++ + LG +   K+  +   + S 
Sbjct:    41 KILQDEIVKKVNENP--NAGWKAAINDRFSNATVAEFKRL-LGVKPTPKKHFLGVPIVSH 97

Query:   132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
               + K          W +  ++  + DQG CGSCWAF  V ++     I  G  ISLS  
Sbjct:    98 DPSLKLPKAFDARTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIQFGMNISLSVN 157

Query:   192 ELVDC-DRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL---GAENK-CDPS 240
             +L+ C   +   GC+GG    A+Q+   +G +  E D PY    G  +  C+P+
Sbjct:   158 DLLACCGFRCGDGCDGGYPIAAWQYFSYSGVVTEECD-PYFDNTGCSHPGCEPA 210


>DICTYBASE|DDB_G0286015 [details] [associations]
            symbol:gmsA species:44689 "Dictyostelium discoideum"
            [GO:0005615 "extracellular space" evidence=IDA] [GO:0019953 "sexual
            reproduction" evidence=IEP] [GO:0009897 "external side of plasma
            membrane" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA;ISS] [GO:0000747 "conjugation with cellular
            fusion" evidence=IMP] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005576 "extracellular
            region" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0286015 Pfam:PF00188 GO:GO:0005615
            GenomeReviews:CM000153_GR GO:GO:0009897 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AAFI02000085 GO:GO:0000747
            Gene3D:3.40.33.10 InterPro:IPR001283 InterPro:IPR014044
            SMART:SM00198 SUPFAM:SSF55797 HSSP:P07688 RefSeq:XP_637893.1
            ProteinModelPortal:Q54ME1 MEROPS:C01.A52 EnsemblProtists:DDB0191145
            GeneID:8625403 KEGG:ddi:DDB_G0286015 InParanoid:Q54ME1 OMA:PGIAYEK
            ProtClustDB:CLSZ2429919 Uniprot:Q54ME1
        Length = 448

 Score = 266 (98.7 bits), Expect = 1.3e-22, P = 1.3e-22
 Identities = 69/200 (34%), Positives = 109/200 (54%)

Query:   144 SVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG--INKIVTGE--LISLSEQELVDCDRK 199
             +VDW       P++DQG CGSCWAF++ AA+E   + K  T +   + LS Q  V+C   
Sbjct:   243 TVDWTSYQT--PIRDQGQCGSCWAFASSAALESRYLIKYGTAQKSTLQLSNQNAVNC--- 297

Query:   200 INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN-KCDPSRRNAKVVSID-GYEDVSP 257
             I +GCNGG     F F  +  G+  E+D PY       C  +   A+    + GY + + 
Sbjct:   298 IASGCNGGWSGNYFNFF-KTPGIAYEKDDPYKAVTGTSCITTSSVARFKYTNYGYTEKTK 356

Query:   258 FDEMS-LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG-SALDHGVVAVGYGTENGVD 315
                ++ LKK     PV++A+     AFQ+Y+SG++      + ++H V+ VGY  +   D
Sbjct:   357 AALLAELKKG----PVTIAVYVDS-AFQNYKSGIYNSATKYTGINHLVLLVGY--DQATD 409

Query:   316 YWLVRNSWGSDWGENGYVKL 335
              + ++NSWGS WGE+GY+++
Sbjct:   410 AYKIKNSWGSWWGESGYMRI 429


>WB|WBGene00000786 [details] [associations]
            symbol:cpr-6 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            PANTHER:PTHR12411:SF16 EMBL:L39894 EMBL:L39939 EMBL:FO080666
            PIR:T37274 RefSeq:NP_741818.1 UniGene:Cel.18138
            ProteinModelPortal:P43510 SMR:P43510 DIP:DIP-25139N
            MINT:MINT-1074025 STRING:P43510 MEROPS:C01.A51 PaxDb:P43510
            PRIDE:P43510 EnsemblMetazoa:C25B8.3a GeneID:180931
            KEGG:cel:CELE_C25B8.3 UCSC:C25B8.3a CTD:180931 WormBase:C25B8.3a
            InParanoid:P43510 OMA:KAKWGLM NextBio:911608 ArrayExpress:P43510
            Uniprot:P43510
        Length = 379

 Score = 165 (63.1 bits), Expect = 4.5e-22, Sum P(2) = 4.5e-22
 Identities = 47/152 (30%), Positives = 73/152 (48%)

Query:   228 YPYLGAENKCDPSRRNAKVVSIDGYEDVSPF---DEMSL--KKAVADQPVSVAIEAGGRA 282
             YP    E KC  S    K  S D +   S +   D++    K+ +   P+ +A E     
Sbjct:   228 YPTPKCEKKC-VSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVY-ED 285

Query:   283 FQHYESGVFT---GECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
             F +Y+ GV+    G+ G    H V  +G+G ++G+ YW V NSW +DWGE+G+ ++ R +
Sbjct:   286 FLNYDGGVYVHTGGKLGGG--HAVKLIGWGIDDGIPYWTVANSWNTDWGEDGFFRILRGV 343

Query:   340 LDTNTGKCGI--AMEASYPVKNSQNSAKPKPH 369
              D    +CGI   +    P  NS  S   + H
Sbjct:   344 -D----ECGIESGVVGGIPKLNSLTSRLHRHH 370

 Score = 159 (61.0 bits), Expect = 4.5e-22, Sum P(2) = 4.5e-22
 Identities = 50/171 (29%), Positives = 87/171 (50%)

Query:    78 DNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKA 137
             D + +++E+ +L    K    +F+ +  E  +A + G       RL    V  +++  K 
Sbjct:    45 DLIDYVNENQNLWTAKKQ--RRFSSVYGENDKAKW-GLMGVNHVRL---SVKGKQHLSKT 98

Query:   138 GD---ELPESVD----WREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT-GEL-ISL 188
              D   ++PES D    W +  ++  ++DQ SCGSCWAF  V A+     I + GEL ++L
Sbjct:    99 KDLDLDIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTL 158

Query:   189 SEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
             S  +L+ C +    GCNGG    A+++ +++G + +  +Y    A N C P
Sbjct:   159 SADDLLSCCKSCGFGCNGGDPLAAWRYWVKDG-IVTGSNYT---ANNGCKP 205


>UNIPROTKB|F1RWA9 [details] [associations]
            symbol:CTSO "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 EMBL:CU855637
            Ensembl:ENSSSCT00000009707 OMA:WAFSIVG Uniprot:F1RWA9
        Length = 194

 Score = 255 (94.8 bits), Expect = 7.0e-22, P = 7.0e-22
 Identities = 68/199 (34%), Positives = 96/199 (48%)

Query:   162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG- 220
             CG CWAFS V+AVE    I    L  LS Q+++DC    N GCNGG    A  ++ +   
Sbjct:     2 CGGCWAFSVVSAVESAYAIKGQPLEVLSVQQVIDCSYN-NYGCNGGSTLNALYWLNKTQV 60

Query:   221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYE--DVSPFDEMSLKKAVADQPVSVAIEA 278
              + S+ +YP+      C     +   VSI  Y   D S  ++   K  +   P+ V ++A
Sbjct:    61 KVVSDSEYPFKAQNGLCHYFSCSHSGVSIKDYSAYDFSGQEDEMAKTLLTLGPLIVIVDA 120

Query:   279 GGRAFQHYESGVFTGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGY--VKL 335
                ++Q Y  G+    C S   +H V+  G+       YW+VRNSWGS WG +GY  VK+
Sbjct:   121 V--SWQDYLGGIIQHHCSSGEANHAVLVTGFDKTGSTPYWIVRNSWGSAWGIDGYALVKM 178

Query:   336 QRNLLDTNTGKCGIAMEAS 354
               N+       CGIA   S
Sbjct:   179 GGNI-------CGIADSVS 190


>FB|FBgn0033873 [details] [associations]
            symbol:CG6337 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 EMBL:AE013599
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 HSSP:P80067 EMBL:AY084123
            RefSeq:NP_610905.1 UniGene:Dm.5230 SMR:Q7JYA0 IntAct:Q7JYA0
            EnsemblMetazoa:FBtr0087646 GeneID:36530 KEGG:dme:Dmel_CG6337
            UCSC:CG6337-RA FlyBase:FBgn0033873 eggNOG:NOG310593
            InParanoid:Q7JYA0 OMA:NRTTYRE OrthoDB:EOG4MCVFZ GenomeRNAi:36530
            NextBio:799041 Uniprot:Q7JYA0
        Length = 340

 Score = 254 (94.5 bits), Expect = 9.0e-22, P = 9.0e-22
 Identities = 90/319 (28%), Positives = 148/319 (46%)

Query:    40 SSWRTDD-EVMTIYQTWLAKHGKT-SNGMGHNEKRFQIFKDNLRFIDEHNSL---NRT-Y 93
             S W  +  + +  +QT+     KT ++    N   +  F  N   + +HN+    NRT Y
Sbjct:    15 SGWAFNHGQDLVDFQTYEDNFNKTYASTSARNFANYY-FIYNRNQVAQHNAQADRNRTTY 73

Query:    94 KVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAV 153
             +  +N+F+D+   ++ A+ L    +          ASQ  A  A  ++    D+   G  
Sbjct:    74 REAVNQFSDIRLIQFAAL-LPKAVNTVTSAASDPPASQ--AASASFDI--ITDF---GLT 125

Query:   154 NPVKDQG-SCGSCWAFSTVAAVEGINKIVTGELI--SLSEQELVDCDRKINAGCNGGLMD 210
               V+DQG +C S WA++T  AVE +N + T   +  SLS Q+L+DC   +  GC+     
Sbjct:   126 VAVEDQGVNCSSSWAYATAKAVEIMNAVQTANPLPSSLSAQQLLDC-AGMGTGCSTQTPL 184

Query:   211 YAFQFIIQ--NGGMDSEQDYPY---LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
              A  ++ Q  +  +  E DYP    L     C P    +  V + GY  V+  D+ ++ +
Sbjct:   185 AALNYLTQLTDAYLYPEVDYPNNNSLKTPGMCQPPSSVSVGVKLAGYSTVADNDDAAVMR 244

Query:   266 AVADQ-PVSVAIEAGGRAFQHYESGVFTGECGSALD----HGVVAVGYG--TENGVDYWL 318
              V++  PV V        F  Y SGV+  E  +  +      +V VGY    ++ +DYW 
Sbjct:   245 YVSNGFPVIVEYNPATFGFMQYSSGVYVQETRALTNPKSSQFLVVVGYDHDVDSNLDYWR 304

Query:   319 VRNSWGSDWGENGYVKLQR 337
               NS+G  WGE GY+++ R
Sbjct:   305 CLNSFGDTWGEEGYIRIVR 323


>DICTYBASE|DDB_G0288221 [details] [associations]
            symbol:DDB_G0288221 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0288221 Pfam:PF00188 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 PANTHER:PTHR12411 EMBL:AAFI02000109 Gene3D:3.40.33.10
            InterPro:IPR001283 InterPro:IPR014044 SMART:SM00198 SUPFAM:SSF55797
            MEROPS:C01.A52 ProtClustDB:CLSZ2429919 RefSeq:XP_636852.1
            ProteinModelPortal:Q54J84 EnsemblProtists:DDB0187839 GeneID:8626520
            KEGG:ddi:DDB_G0288221 InParanoid:Q54J84 Uniprot:Q54J84
        Length = 395

 Score = 259 (96.2 bits), Expect = 2.4e-21, P = 2.4e-21
 Identities = 66/199 (33%), Positives = 105/199 (52%)

Query:   144 SVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTG----ELISLSEQELVDCDRK 199
             SVDW +     PV+DQG C SCW F ++AA+E    I  G      + LS Q  ++C   
Sbjct:   191 SVDWSDYQT--PVRDQGECKSCWVFGSLAALESRYLIKNGVSEKSTLHLSAQNAMNC--- 245

Query:   200 INAGCNGGLMDYAFQFIIQNGGMDSEQDYPY--LGAENKCDPSRRNAKVVSIDGYEDVSP 257
             I +GC  G     F +  ++ G+  E+DYPY  +G++N C  S    +     GY+ V  
Sbjct:   246 ITSGCESGWPANVFDYF-ESSGIAFEKDYPYDAIGSDN-CTSSSNKFEY---SGYDSVEN 300

Query:   258 FDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTG-ECGSALDHGVVAVGYGTENGVDY 316
               + SL + + + P+++A+ +   AFQ Y  G++   E    ++H V+ VGY  +   D 
Sbjct:   301 TKD-SLIQELKNGPITIALYSD-TAFQSYAGGIYDSVEEYKDVNHIVLLVGY--DKPTDS 356

Query:   317 WLVRNSWGSDWGENGYVKL 335
             W ++NS G+ WGE GY ++
Sbjct:   357 WKIKNSLGTKWGELGYARI 375


>FB|FBgn0030521 [details] [associations]
            symbol:CtsB1 "Cathepsin B1" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS] [GO:0035071 "salivary gland cell autophagic cell
            death" evidence=IEP] [GO:0048102 "autophagic cell death"
            evidence=IEP] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:AE014298 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0035071
            GO:GO:0004197 MEROPS:C01.060 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 KO:K01363 PANTHER:PTHR12411:SF16
            HSSP:P07688 EMBL:AY060640 RefSeq:NP_572920.1 UniGene:Dm.3926
            SMR:Q9VY87 IntAct:Q9VY87 MINT:MINT-932864 STRING:Q9VY87
            EnsemblMetazoa:FBtr0073838 GeneID:32341 KEGG:dme:Dmel_CG10992
            UCSC:CG10992-RA FlyBase:FBgn0030521 InParanoid:Q9VY87 OMA:TEGHIRR
            OrthoDB:EOG48W9HM ChiTaRS:CG10992 GenomeRNAi:32341 NextBio:778020
            Uniprot:Q9VY87
        Length = 340

 Score = 187 (70.9 bits), Expect = 4.0e-21, Sum P(2) = 4.0e-21
 Identities = 54/166 (32%), Positives = 77/166 (46%)

Query:    82 FIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLM--KSKVASQRYACKAGD 139
             FI+   S  +T+ VG N  A +T    R + +G   DA +  +  K +V    Y   + D
Sbjct:    28 FIEVVRSKAKTWTVGRNFDASVTEGHIRRL-MGVHPDAHKFALPDKREVLGDLYV-NSVD 85

Query:   140 ELPESVD----WREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL--SEQEL 193
             ELPE  D    W     +  ++DQGSCGSCWAF  V A+     I +G  ++   S  +L
Sbjct:    86 ELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDL 145

Query:   194 VDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDP 239
             V C      GCNGG    A+ +  + G +      PY G+   C P
Sbjct:   146 VSCCHTCGFGCNGGFPGAAWSYWTRKGIVSGG---PY-GSNQGCRP 187

 Score = 121 (47.7 bits), Expect = 4.0e-21, Sum P(2) = 4.0e-21
 Identities = 21/55 (38%), Positives = 34/55 (61%)

Query:   286 YESGVFTGECGSALD-HGVVAVGYGT--ENGVDYWLVRNSWGSDWGENGYVKLQR 337
             Y+ GV+  E G  L  H +  +G+G   E  + YWL+ NSW +DWG++G+ ++ R
Sbjct:   268 YKDGVYQHEHGKELGGHAIRILGWGVWGEEKIPYWLIGNSWNTDWGDHGFFRILR 322


>UNIPROTKB|Q6IN22 [details] [associations]
            symbol:Ctsb "Cathepsin B" species:10116 "Rattus norvegicus"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 RGD:621509 GO:GO:0005739
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 CTD:1508
            GeneTree:ENSGT00560000076599 HOVERGEN:HBG003480 KO:K01363
            OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16 UniGene:Rn.100909
            EMBL:CH474023 HSSP:P00785 EMBL:BC072490 IPI:IPI00562653
            RefSeq:NP_072119.2 SMR:Q6IN22 IntAct:Q6IN22 STRING:Q6IN22
            Ensembl:ENSRNOT00000014177 GeneID:64529 KEGG:rno:64529
            InParanoid:Q6IN22 NextBio:613362 Genevestigator:Q6IN22
            Uniprot:Q6IN22
        Length = 339

 Score = 167 (63.8 bits), Expect = 4.1e-21, Sum P(2) = 4.1e-21
 Identities = 41/115 (35%), Positives = 57/115 (49%)

Query:   251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH---YESGVFTGECGSALD-HGVVAV 306
             GY   S  D  S K+ +A+   +  +E     F     Y+SGV+  E G  +  H +  +
Sbjct:   226 GYTSYSVSD--SEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRIL 283

Query:   307 GYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
             G+G ENGV YWLV NSW  DWG+NG+ K+ R         CGI  E    +  +Q
Sbjct:   284 GWGIENGVPYWLVANSWNVDWGDNGFFKILRG-----ENHCGIESEIVAGIPRTQ 333

 Score = 145 (56.1 bits), Expect = 4.1e-21, Sum P(2) = 4.1e-21
 Identities = 49/161 (30%), Positives = 76/161 (47%)

Query:    68 HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKF-ADLTNEEYRAMYLGTRSDAKRRLMKS 126
             H++  F    D++  I+  N  N T++ G N +  D++   Y     GT       ++  
Sbjct:    18 HDKPSFHPLSDDM--INYINKQNTTWQAGRNFYNVDIS---YLKKLCGT-------VLGG 65

Query:   127 KVASQRYACKAGDELPESVDWREKGAVNP----VKDQGSCGSCWAFSTVAAVEGINKIVT 182
                 +R        LPES D RE+ +  P    ++DQGSCGSCWAF  V A+     I T
Sbjct:    66 PKLPERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHT 125

Query:   183 -GEL-ISLSEQELVDC-DRKINAGCNGGLMDYAFQFIIQNG 220
              G + + +S ++L+ C   +   GCNGG    A+ F  + G
Sbjct:   126 NGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKG 166


>MGI|MGI:88561 [details] [associations]
            symbol:Ctsb "cathepsin B" species:10090 "Mus musculus"
            [GO:0004175 "endopeptidase activity" evidence=ISO] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISO] [GO:0005576
            "extracellular region" evidence=ISO] [GO:0005615 "extracellular
            space" evidence=ISO] [GO:0005737 "cytoplasm" evidence=ISO]
            [GO:0005739 "mitochondrion" evidence=ISO;IDA] [GO:0005764
            "lysosome" evidence=ISO;IDA] [GO:0005901 "caveola" evidence=ISO]
            [GO:0006508 "proteolysis" evidence=ISO] [GO:0008233 "peptidase
            activity" evidence=ISO] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISO] [GO:0009897 "external side of plasma
            membrane" evidence=ISO] [GO:0009986 "cell surface" evidence=ISO]
            [GO:0016324 "apical plasma membrane" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0030984 "kininogen binding"
            evidence=ISO] [GO:0032403 "protein complex binding" evidence=ISO]
            [GO:0042277 "peptide binding" evidence=ISO] [GO:0042383
            "sarcolemma" evidence=ISO] [GO:0043621 "protein self-association"
            evidence=ISO] [GO:0048471 "perinuclear region of cytoplasm"
            evidence=ISO] [GO:0050790 "regulation of catalytic activity"
            evidence=IEA] [GO:0060548 "negative regulation of cell death"
            evidence=ISO] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 MGI:MGI:88561
            GO:GO:0005739 GO:GO:0042470 GO:GO:0048471 GO:GO:0005615
            GO:GO:0009612 GO:GO:0009611 GO:GO:0009897 GO:GO:0045471
            GO:GO:0016324 GO:GO:0009749 GO:GO:0006914 GO:GO:0043434
            eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0042383 GO:GO:0014070
            GO:GO:0042277 GO:GO:0060548 GO:GO:0005901 GO:GO:0014075
            GO:GO:0004197 GO:GO:0070670 GO:GO:0007519 MEROPS:C01.060 CTD:1508
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 KO:K01363 OMA:GYPSGAW OrthoDB:EOG4K6G4C
            BRENDA:3.4.22.1 GO:GO:0097067 PANTHER:PTHR12411:SF16 ChiTaRS:CTSB
            EMBL:M65270 EMBL:M65263 EMBL:M65264 EMBL:M65265 EMBL:M65266
            EMBL:M65267 EMBL:M65268 EMBL:M65269 EMBL:M14222 EMBL:X54966
            EMBL:S69034 EMBL:AK083393 EMBL:AK147192 EMBL:AK149884 EMBL:AK151790
            EMBL:AK167361 EMBL:BC006656 IPI:IPI00113517 PIR:A38458
            RefSeq:NP_031824.1 UniGene:Mm.236553 UniGene:Mm.489070
            ProteinModelPortal:P10605 SMR:P10605 IntAct:P10605 STRING:P10605
            PhosphoSite:P10605 SWISS-2DPAGE:P10605 PaxDb:P10605 PRIDE:P10605
            Ensembl:ENSMUST00000006235 GeneID:13030 KEGG:mmu:13030
            UCSC:uc007uhh.1 InParanoid:P10605 BioCyc:MetaCyc:MONOMER-14810
            BindingDB:P10605 ChEMBL:CHEMBL5187 NextBio:282900 Bgee:P10605
            CleanEx:MM_CTSB Genevestigator:P10605 GermOnline:ENSMUSG00000021939
            Uniprot:P10605
        Length = 339

 Score = 161 (61.7 bits), Expect = 6.6e-21, Sum P(2) = 6.6e-21
 Identities = 38/106 (35%), Positives = 54/106 (50%)

Query:   251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH---YESGVFTGECGSALD-HGVVAV 306
             GY   S  +  S+K+ +A+   +  +E     F     Y+SGV+  E G  +  H +  +
Sbjct:   226 GYTSYSVSN--SVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRIL 283

Query:   307 GYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAME 352
             G+G ENGV YWL  NSW  DWG+NG+ K+ R         CGI  E
Sbjct:   284 GWGVENGVPYWLAANSWNLDWGDNGFFKILRG-----ENHCGIESE 324

 Score = 150 (57.9 bits), Expect = 6.6e-21, Sum P(2) = 6.6e-21
 Identities = 49/161 (30%), Positives = 77/161 (47%)

Query:    68 HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKF-ADLTNEEYRAMYLGTRSDAKRRLMKS 126
             H++  F    D+L  I+  N  N T++ G N +  D++   Y     GT       ++  
Sbjct:    18 HDKPSFHPLSDDL--INYINKQNTTWQAGRNFYNVDIS---YLKKLCGT-------VLGG 65

Query:   127 KVASQRYACKAGDELPESVDWREKGA----VNPVKDQGSCGSCWAFSTVAAVEGINKIVT 182
                  R A     +LPE+ D RE+ +    +  ++DQGSCGSCWAF  V A+     I T
Sbjct:    66 PKLPGRVAFGEDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHT 125

Query:   183 -GEL-ISLSEQELVDC-DRKINAGCNGGLMDYAFQFIIQNG 220
              G + + +S ++L+ C   +   GCNGG    A+ F  + G
Sbjct:   126 NGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKG 166


>DICTYBASE|DDB_G0280187 [details] [associations]
            symbol:DDB_G0280187 "cathepsin Z-like protein"
            species:44689 "Dictyostelium discoideum" [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            dictyBase:DDB_G0280187 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            EMBL:AAFI02000035 KO:K08568 RefSeq:XP_641294.1
            ProteinModelPortal:Q54VR1 MEROPS:C01.A61 PRIDE:Q54VR1
            EnsemblProtists:DDB0233838 GeneID:8622427 KEGG:ddi:DDB_G0280187
            InParanoid:Q54VR1 OMA:VWKVGDY Uniprot:Q54VR1
        Length = 291

 Score = 184 (69.8 bits), Expect = 6.9e-21, Sum P(2) = 6.9e-21
 Identities = 34/79 (43%), Positives = 53/79 (67%)

Query:   263 LKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA--LDHGVVAVGYGTENGVDYWLVR 320
             +++  A  P++  +E    AF+ Y SGVFT   GS   ++H +  +G+GTENGVDYW+ R
Sbjct:   196 MQEIFARGPIACGMEVTD-AFESYTSGVFTSSVGSTGEINHEISIIGWGTENGVDYWIGR 254

Query:   321 NSWGSDWGENGYVKLQRNL 339
             NSWG+ +GE G+ ++QR +
Sbjct:   255 NSWGTYFGELGFFRIQRGI 273

 Score = 116 (45.9 bits), Expect = 6.9e-21, Sum P(2) = 6.9e-21
 Identities = 41/132 (31%), Positives = 65/132 (49%)

Query:   116 RSDAKRRLMKSKVASQRYACKAGDELPESVDWRE-KGA--VNPVKDQGS---CGSCWAFS 169
             R +A   ++KS++ S+ Y  +  D LP   DWR   G+  +   ++Q     CGSCWA  
Sbjct:    27 RVNAPTSIIKSQLPSE-YIDE--DTLPTQYDWRNISGSSYITITRNQHLPQYCGSCWAHG 83

Query:   170 TVAAVEGINKIV---TGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQ 226
             T +A+    KI    T   + L+ Q L++C    N  C+GG    A+ ++   G  D E 
Sbjct:    84 TTSALGDRIKIGRKGTFPEVVLAPQVLLNCAGPDNT-CDGGDPTEAYAYMAAKGITD-ET 141

Query:   227 DYPYLGAENKCD 238
               PY   +N+C+
Sbjct:   142 CAPYEAIDNECN 153


>ZFIN|ZDB-GENE-040426-2650 [details] [associations]
            symbol:ctsba "cathepsin B, a" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0031101 "fin regeneration"
            evidence=IEP] [GO:0008233 "peptidase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 ZFIN:ZDB-GENE-040426-2650 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790
            GO:GO:0004197 GO:GO:0031101 MEROPS:C01.060 HOVERGEN:HBG003480
            PANTHER:PTHR12411:SF16 HSSP:P07688 EMBL:BC044517 IPI:IPI00485996
            UniGene:Dr.3374 ProteinModelPortal:Q803E4 SMR:Q803E4 STRING:Q803E4
            PRIDE:Q803E4 InParanoid:Q803E4 ArrayExpress:Q803E4 Bgee:Q803E4
            Uniprot:Q803E4
        Length = 330

 Score = 168 (64.2 bits), Expect = 8.7e-21, Sum P(2) = 8.7e-21
 Identities = 34/71 (47%), Positives = 42/71 (59%)

Query:   283 FQHYESGVFTGECGSALD-HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
             F  Y+SGV+    GSAL  H +  +G+G ENGV YWL  NSW +DWG+NGY K+ R    
Sbjct:   258 FLLYKSGVYQHMSGSALGGHAIKILGWGEENGVPYWLAANSWNTDWGDNGYFKILRG--- 314

Query:   342 TNTGKCGIAME 352
                  CGI  E
Sbjct:   315 --EDHCGIESE 323

 Score = 140 (54.3 bits), Expect = 8.7e-21, Sum P(2) = 8.7e-21
 Identities = 47/140 (33%), Positives = 64/140 (45%)

Query:    87 NSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVD 146
             N  N T+  G N F D+ +  Y     GT         K  V  Q Y    G +LP++ D
Sbjct:    34 NKANTTWTAGHN-FRDV-DYSYVKRLCGTFLKGP----KLPVMVQ-YT--EGLKLPKNFD 84

Query:   147 WREKGAVNP----VKDQGSCGSCWAFSTVAAVEGINKIVTGELIS--LSEQELVDCDRKI 200
              RE+    P    ++DQGSCGSCWAF    A+     I +   +S  +S Q+L+ C    
Sbjct:    85 AREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIQSNAKVSVEISSQDLLTCCDSC 144

Query:   201 NAGCNGGLMDYAFQFIIQNG 220
               GCNGG    A+ F   +G
Sbjct:   145 GMGCNGGYPSAAWDFWTTDG 164


>RGD|621509 [details] [associations]
            symbol:Ctsb "cathepsin B" species:10116 "Rattus norvegicus"
            [GO:0004175 "endopeptidase activity" evidence=IMP;IDA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA;ISO;IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005576
            "extracellular region" evidence=IDA] [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0005730 "nucleolus" evidence=IEA;ISO]
            [GO:0005737 "cytoplasm" evidence=IDA] [GO:0005739 "mitochondrion"
            evidence=IEA;ISO;IDA] [GO:0005764 "lysosome" evidence=IEA;ISO;IDA]
            [GO:0006508 "proteolysis" evidence=IEA;IEP;ISO;IMP;IDA;TAS]
            [GO:0006914 "autophagy" evidence=IEP] [GO:0006950 "response to
            stress" evidence=IEP] [GO:0007283 "spermatogenesis" evidence=IEP]
            [GO:0007519 "skeletal muscle tissue development" evidence=IEP]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0009611
            "response to wounding" evidence=IEP] [GO:0009612 "response to
            mechanical stimulus" evidence=IEP] [GO:0009749 "response to glucose
            stimulus" evidence=IEP] [GO:0009897 "external side of plasma
            membrane" evidence=IDA] [GO:0009986 "cell surface" evidence=IDA]
            [GO:0014070 "response to organic cyclic compound" evidence=IEP]
            [GO:0014075 "response to amine stimulus" evidence=IEP] [GO:0016324
            "apical plasma membrane" evidence=IDA] [GO:0030984 "kininogen
            binding" evidence=IPI] [GO:0032403 "protein complex binding"
            evidence=IPI] [GO:0034097 "response to cytokine stimulus"
            evidence=IEP] [GO:0042277 "peptide binding" evidence=IDA]
            [GO:0042383 "sarcolemma" evidence=IDA] [GO:0042470 "melanosome"
            evidence=IEA] [GO:0043231 "intracellular membrane-bounded
            organelle" evidence=ISO] [GO:0043434 "response to peptide hormone
            stimulus" evidence=IEP] [GO:0043621 "protein self-association"
            evidence=IDA] [GO:0045471 "response to ethanol" evidence=IEP]
            [GO:0048471 "perinuclear region of cytoplasm" evidence=ISO;IDA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0060548 "negative regulation of cell death" evidence=IMP]
            [GO:0070670 "response to interleukin-4" evidence=IEP] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA;ISO]
            [GO:0005901 "caveola" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:621509 GO:GO:0005739 GO:GO:0042470
            GO:GO:0005615 GO:GO:0009612 GO:GO:0009611 GO:GO:0009897
            GO:GO:0045471 GO:GO:0016324 GO:GO:0009749 GO:GO:0006914
            GO:GO:0043434 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0007283
            GO:GO:0005764 GO:GO:0042383 GO:GO:0043621 GO:GO:0014070
            GO:GO:0042277 GO:GO:0060548 GO:GO:0014075 GO:GO:0004197
            GO:GO:0070670 GO:GO:0007519 MEROPS:C01.060 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 GO:GO:0097067
            PANTHER:PTHR12411:SF16 EMBL:X82396 EMBL:M11305 IPI:IPI00212811
            PIR:S51041 UniGene:Rn.100909 PDB:1CPJ PDB:1CTE PDB:1MIR PDB:1THE
            PDBsum:1CPJ PDBsum:1CTE PDBsum:1MIR PDBsum:1THE
            ProteinModelPortal:P00787 SMR:P00787 STRING:P00787 PRIDE:P00787
            UCSC:RGD:621509 InParanoid:P00787 SABIO-RK:P00787 BindingDB:P00787
            ChEMBL:CHEMBL2602 EvolutionaryTrace:P00787 ArrayExpress:P00787
            Genevestigator:P00787 GermOnline:ENSRNOG00000010331 Uniprot:P00787
        Length = 339

 Score = 167 (63.8 bits), Expect = 3.5e-20, Sum P(2) = 3.5e-20
 Identities = 41/115 (35%), Positives = 57/115 (49%)

Query:   251 GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQH---YESGVFTGECGSALD-HGVVAV 306
             GY   S  D  S K+ +A+   +  +E     F     Y+SGV+  E G  +  H +  +
Sbjct:   226 GYTSYSVSD--SEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRIL 283

Query:   307 GYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
             G+G ENGV YWLV NSW  DWG+NG+ K+ R         CGI  E    +  +Q
Sbjct:   284 GWGIENGVPYWLVANSWNVDWGDNGFFKILRG-----ENHCGIESEIVAGIPRTQ 333

 Score = 136 (52.9 bits), Expect = 3.5e-20, Sum P(2) = 3.5e-20
 Identities = 35/87 (40%), Positives = 49/87 (56%)

Query:   141 LPESVDWREKGAVNP----VKDQGSCGSCWAFSTVAAVEGINKIVT-GEL-ISLSEQELV 194
             LPES D RE+ +  P    ++DQGSCGSCWAF  V A+     I T G + + +S ++L+
Sbjct:    80 LPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLL 139

Query:   195 DC-DRKINAGCNGGLMDYAFQFIIQNG 220
              C   +   GCNGG    A+ F  + G
Sbjct:   140 TCCGIQCGDGCNGGYPSGAWNFWTRKG 166


>UNIPROTKB|E2R6Q7 [details] [associations]
            symbol:CTSB "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730 "nucleolus"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790
            GO:GO:0005764 GO:GO:0004197 CTD:1508 GeneTree:ENSGT00560000076599
            KO:K01363 OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16
            EMBL:AAEX03014318 RefSeq:XP_543203.3 Ensembl:ENSCAFT00000012692
            GeneID:486077 KEGG:cfa:486077 NextBio:20859923 Uniprot:E2R6Q7
        Length = 339

 Score = 156 (60.0 bits), Expect = 5.6e-20, Sum P(2) = 5.6e-20
 Identities = 39/115 (33%), Positives = 55/115 (47%)

Query:   239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
             PS +  K      Y  VS  ++  + +   + PV  A       F  Y+SGV+    G  
Sbjct:   217 PSYKEDKHYGCSSYS-VSDNEKEIMAEIYKNGPVEAAFTVYSD-FLLYKSGVYQHVTGEM 274

Query:   299 LD-HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAME 352
             +  H V  +G+G E+G  YWLV NSW +DWG+NG+ K+ R         CGI  E
Sbjct:   275 MGGHAVRILGWGVEDGTPYWLVGNSWNTDWGDNGFFKILRG-----RDHCGIESE 324

 Score = 147 (56.8 bits), Expect = 5.6e-20, Sum P(2) = 5.6e-20
 Identities = 49/155 (31%), Positives = 71/155 (45%)

Query:    73 FQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR 132
             F+   D L  +D  N  N T+K G N F ++ +  Y     GT        +      QR
Sbjct:    23 FRALSDEL--VDYVNKRNTTWKAGHN-FHNV-DPSYLRRLCGT-------FLGGPKLPQR 71

Query:   133 YACKAGDELPESVDWREKG----AVNPVKDQGSCGSCWAFSTVAAVEGINKIVT-GEL-I 186
                     LPES D RE+      +  ++DQGSCGSCWAF  V A+     I T G + +
Sbjct:    72 VQFAKNLILPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNV 131

Query:   187 SLSEQELVDC-DRKINAGCNGGLMDYAFQFIIQNG 220
              +S ++++ C   +   GCNGG    A+ F  + G
Sbjct:   132 EVSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQG 166


>ZFIN|ZDB-GENE-070323-1 [details] [associations]
            symbol:ctsbb "capthepsin B, b" species:7955 "Danio
            rerio" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-070323-1 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197
            GeneTree:ENSGT00560000076599 PANTHER:PTHR12411:SF16 OMA:CCGFLCG
            EMBL:CU207296 EMBL:CABZ01037785 IPI:IPI00877452
            Ensembl:ENSDART00000097263 Bgee:F1QZT5 Uniprot:F1QZT5
        Length = 326

 Score = 172 (65.6 bits), Expect = 9.1e-20, Sum P(2) = 9.1e-20
 Identities = 40/98 (40%), Positives = 52/98 (53%)

Query:   257 PFDEMSLKKAV-ADQPVSVAIEAGGRAFQHYESGVFTGECGSALD-HGVVAVGYGTENGV 314
             P D+  +   +  + PV  A       F  Y+SGV+    GSAL  H V  +G+G ENG 
Sbjct:   227 PSDQQQIMTELYTNGPVEAAFTVY-EDFPLYKSGVYQHLTGSALGGHAVKILGWGEENGT 285

Query:   315 DYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAME 352
              +WLV NSW SDWG+NGY K+ R        +CGI  E
Sbjct:   286 PFWLVANSWNSDWGDNGYFKILRG-----HDECGIESE 318

 Score = 125 (49.1 bits), Expect = 9.1e-20, Sum P(2) = 9.1e-20
 Identities = 29/87 (33%), Positives = 50/87 (57%)

Query:   140 ELPESVDWREKG----AVNPVKDQGSCGSCWAFSTVAAVEGINKIVT-GELI-SLSEQEL 193
             +LP+S D R++      +N ++DQGSCGSCWAF  V ++     I + G+    +S ++L
Sbjct:    74 KLPDSFDLRDQWPNCKTLNQIRDQGSCGSCWAFGAVESISDRICIHSKGKQSPEISAEDL 133

Query:   194 VDCDRKINAGCNGGLMDYAFQFIIQNG 220
             + C  +   GC+GG    A+ +  ++G
Sbjct:   134 LSCCDQCGFGCSGGFPAEAWDYWRRSG 160


>UNIPROTKB|A1E295 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9823 "Sus scrofa"
            [GO:0005615 "extracellular space" evidence=IDA] [GO:0042470
            "melanosome" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730
            "nucleolus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0042470
            GO:GO:0005615 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 MEROPS:C01.060 CTD:1508
            GeneTree:ENSGT00560000076599 HOVERGEN:HBG003480 KO:K01363
            OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16 EMBL:EF095956
            RefSeq:NP_001090927.1 UniGene:Ssc.53773 ProteinModelPortal:A1E295
            SMR:A1E295 PRIDE:A1E295 Ensembl:ENSSSCT00000026923 GeneID:100037961
            KEGG:ssc:100037961 Uniprot:A1E295
        Length = 335

 Score = 158 (60.7 bits), Expect = 1.3e-19, Sum P(2) = 1.3e-19
 Identities = 38/115 (33%), Positives = 55/115 (47%)

Query:   239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
             PS +  K      Y  +S  ++  + +   + PV  A       F  Y+SGV+    G  
Sbjct:   217 PSYKEDKHFGCSSYS-ISRNEKEIMAEIYKNGPVEGAFTVYSD-FLQYKSGVYQHVTGDL 274

Query:   299 LD-HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAME 352
             +  H +  +G+G ENG  YWLV NSW +DWG+NG+ K+ R         CGI  E
Sbjct:   275 MGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRG-----QDHCGIESE 324

 Score = 141 (54.7 bits), Expect = 1.3e-19, Sum P(2) = 1.3e-19
 Identities = 50/157 (31%), Positives = 74/157 (47%)

Query:    73 FQIFKDNL-RFIDEHNSLNRTYKVGLNKF-ADLTNEEYRAMYLGTRSDAKRRLMKSKVAS 130
             FQ   D L  FI++ N+   T+  G N +  DL+   Y     GT        +      
Sbjct:    23 FQPLSDELVNFINKQNT---TWTAGHNFYNVDLS---YVKKLCGT-------FLGGPKLP 69

Query:   131 QRYACKAGDELPESVDWREKG----AVNPVKDQGSCGSCWAFSTVAAV-EGINKIVTGEL 185
             QR A  A   LP+S D RE+      +  ++DQGSCGSCWAF  V A+ + I     G +
Sbjct:    70 QRAAFAADMILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRV 129

Query:   186 -ISLSEQELVDC-DRKINAGCNGGLMDYAFQFIIQNG 220
              + +S ++++ C   +   GCNGG    A+ F  + G
Sbjct:   130 NVEVSAEDMLTCCGDECGDGCNGGFPSGAWNFWTKKG 166


>UNIPROTKB|F1N9D7 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9031 "Gallus gallus"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005730 "nucleolus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005739 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0005764
            GO:GO:0004197 GeneTree:ENSGT00560000076599 OMA:GYPSGAW
            GO:GO:0097067 PANTHER:PTHR12411:SF16 IPI:IPI00573387
            EMBL:AADN02018292 Ensembl:ENSGALT00000026896
            Ensembl:ENSGALT00000036723 Uniprot:F1N9D7
        Length = 340

 Score = 150 (57.9 bits), Expect = 2.4e-19, Sum P(2) = 2.4e-19
 Identities = 38/115 (33%), Positives = 54/115 (46%)

Query:   239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
             PS +  K   I  Y  V   ++  + +   + PV  A       F  Y+SGV+    G  
Sbjct:   218 PSYKEDKHYGITSY-GVPRSEKEIMAEIYKNGPVEGAFIVY-EDFLMYKSGVYQHVSGEQ 275

Query:   299 LD-HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAME 352
             +  H +  +G+G ENG  YWL  NSW +DWG+NG+ K+ R         CGI  E
Sbjct:   276 VGGHAIRILGWGVENGTPYWLAANSWNTDWGDNGFFKILRG-----EDHCGIESE 325

 Score = 148 (57.2 bits), Expect = 2.4e-19, Sum P(2) = 2.4e-19
 Identities = 43/145 (29%), Positives = 69/145 (47%)

Query:    83 IDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELP 142
             ++  N LN T+K G N F + T+  Y     GT        +      +R    A  +LP
Sbjct:    31 VNHINKLNTTWKAGHN-FHN-TDMSYVKKLCGT-------FLGGPKLPERVDFAADMDLP 81

Query:   143 ESVD----WREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL--SEQELVDC 196
             ++ D    W     ++ ++DQGSCGSCWAF  V A+     + T   +S+  S ++L+ C
Sbjct:    82 DTFDSRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLLSC 141

Query:   197 -DRKINAGCNGGLMDYAFQFIIQNG 220
                +   GCNGG    A+++  + G
Sbjct:   142 CGFECGMGCNGGYPSGAWRYWTERG 166


>UNIPROTKB|E2QV47 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0097208 "alveolar lamellar body"
            evidence=IEA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0070371 "ERK1 and ERK2 cascade"
            evidence=IEA] [GO:0070324 "thyroid hormone binding" evidence=IEA]
            [GO:0060448 "dichotomous subdivision of terminal units involved in
            lung branching" evidence=IEA] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IEA] [GO:0043129 "surfactant homeostasis"
            evidence=IEA] [GO:0043066 "negative regulation of apoptotic
            process" evidence=IEA] [GO:0033619 "membrane protein proteolysis"
            evidence=IEA] [GO:0032526 "response to retinoic acid" evidence=IEA]
            [GO:0031648 "protein destabilization" evidence=IEA] [GO:0031638
            "zymogen activation" evidence=IEA] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=IEA] [GO:0016505
            "apoptotic protease activator activity" evidence=IEA] [GO:0010815
            "bradykinin catabolic process" evidence=IEA] [GO:0010813
            "neuropeptide catabolic process" evidence=IEA] [GO:0010634
            "positive regulation of epithelial cell migration" evidence=IEA]
            [GO:0010628 "positive regulation of gene expression" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=IEA] [GO:0005829 "cytosol" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0004252 "serine-type endopeptidase activity"
            evidence=IEA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IEA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 GO:GO:0016505 GO:GO:0010634
            GO:GO:0004197 GO:GO:0042599 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 Ensembl:ENSCAFT00000036196 Uniprot:E2QV47
        Length = 136

 Score = 232 (86.7 bits), Expect = 2.5e-19, P = 2.5e-19
 Identities = 54/140 (38%), Positives = 75/140 (53%)

Query:   225 EQDYPYLGAENKC--DPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGR 281
             E  YPY G +  C   PS+  A V  +    +++  DE ++ +AVA   PVS A E    
Sbjct:     3 EDSYPYKGQDGDCKYQPSKAIAFVKDV---ANITINDEQAMVEAVALYNPVSFAFEVTSD 59

Query:   282 AFQHYESGVFTG-ECGSALD---HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
              F  Y  G+++   C    D   H V+AVGYG +NG+ YW+V+NSWG  WG NGY  ++R
Sbjct:    60 -FMMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEQNGIPYWIVKNSWGPQWGMNGYFLMER 118

Query:   338 NLLDTNTGKCGIAMEASYPV 357
                      CG+A  ASYP+
Sbjct:   119 G-----KNMCGLAACASYPI 133


>UNIPROTKB|P07688 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9913 "Bos taurus"
            [GO:0042470 "melanosome" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730
            "nucleolus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0042470
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 EMBL:L06075 EMBL:M64620
            EMBL:U16336 EMBL:U16337 EMBL:U16338 EMBL:U16339 EMBL:U16341
            EMBL:U16342 EMBL:U16343 EMBL:BC102997 IPI:IPI00692061 PIR:S38328
            RefSeq:NP_776456.1 UniGene:Bt.393 PDB:1ITO PDB:1QDQ PDB:1SP4
            PDB:2DC6 PDB:2DC7 PDB:2DC8 PDB:2DC9 PDB:2DCA PDB:2DCB PDB:2DCC
            PDB:2DCD PDBsum:1ITO PDBsum:1QDQ PDBsum:1SP4 PDBsum:2DC6
            PDBsum:2DC7 PDBsum:2DC8 PDBsum:2DC9 PDBsum:2DCA PDBsum:2DCB
            PDBsum:2DCC PDBsum:2DCD ProteinModelPortal:P07688 SMR:P07688
            STRING:P07688 MEROPS:C01.060 PRIDE:P07688
            Ensembl:ENSBTAT00000036795 GeneID:281105 KEGG:bta:281105 CTD:1508
            eggNOG:NOG315657 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 InParanoid:P07688 KO:K01363 OMA:GYPSGAW
            OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 BindingDB:P07688
            ChEMBL:CHEMBL2323 EvolutionaryTrace:P07688 NextBio:20805177
            ArrayExpress:P07688 GO:GO:0097067 PANTHER:PTHR12411:SF16
            Uniprot:P07688
        Length = 335

 Score = 154 (59.3 bits), Expect = 3.1e-19, Sum P(2) = 3.1e-19
 Identities = 38/115 (33%), Positives = 55/115 (47%)

Query:   239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
             PS +  K      Y  V+  ++  + +   + PV  A       F  Y+SGV+    G  
Sbjct:   217 PSYKEDKHFGCSSYS-VANNEKEIMAEIYKNGPVEGAFSVYSD-FLLYKSGVYQHVSGEI 274

Query:   299 LD-HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAME 352
             +  H +  +G+G ENG  YWLV NSW +DWG+NG+ K+ R         CGI  E
Sbjct:   275 MGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRG-----QDHCGIESE 324

 Score = 142 (55.0 bits), Expect = 3.1e-19, Sum P(2) = 3.1e-19
 Identities = 46/142 (32%), Positives = 67/142 (47%)

Query:    87 NSLNRTYKVGLNKF-ADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESV 145
             N  N T+K G N +  DL+   Y     G        ++      QR A  A   LPES 
Sbjct:    35 NKQNTTWKAGHNFYNVDLS---YVKKLCGA-------ILGGPKLPQRDAFAADVVLPESF 84

Query:   146 DWREKG----AVNPVKDQGSCGSCWAFSTVAAV-EGINKIVTGEL-ISLSEQELVDC-DR 198
             D RE+      +  ++DQGSCGSCWAF  V A+ + I     G + + +S ++++ C   
Sbjct:    85 DAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCGG 144

Query:   199 KINAGCNGGLMDYAFQFIIQNG 220
             +   GCNGG    A+ F  + G
Sbjct:   145 ECGDGCNGGFPSGAWNFWTKKG 166


>WB|WBGene00000784 [details] [associations]
            symbol:cpr-4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39895 EMBL:L39926 EMBL:FO081381
            PIR:T37280 RefSeq:NP_504682.1 UniGene:Cel.5404
            ProteinModelPortal:P43508 SMR:P43508 DIP:DIP-25376N
            MINT:MINT-1069892 STRING:P43508 MEROPS:C01.A34 PaxDb:P43508
            EnsemblMetazoa:F44C4.3 GeneID:179053 KEGG:cel:CELE_F44C4.3
            UCSC:F44C4.3 CTD:179053 WormBase:F44C4.3 InParanoid:P43508
            OMA:CCGFLCG NextBio:903704 Uniprot:P43508
        Length = 335

 Score = 165 (63.1 bits), Expect = 8.2e-19, Sum P(2) = 8.2e-19
 Identities = 34/84 (40%), Positives = 46/84 (54%)

Query:   267 VADQPVSVAIEAGGRAFQHYESGVFTGECGSALD-HGVVAVGYGTENGVDYWLVRNSWGS 325
             +A  PV  A       F  Y++GV+    G  L  H +  +G+GT+NG  YWLV NSW  
Sbjct:   247 IAHGPVEAAFTVY-EDFYQYKTGVYVHTTGQELGGHAIRILGWGTDNGTPYWLVANSWNV 305

Query:   326 DWGENGYVKLQRNLLDTNTGKCGI 349
             +WGENGY ++ R      T +CGI
Sbjct:   306 NWGENGYFRIIRG-----TNECGI 324

 Score = 125 (49.1 bits), Expect = 8.2e-19, Sum P(2) = 8.2e-19
 Identities = 25/88 (28%), Positives = 46/88 (52%)

Query:   139 DELPESVD----WREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS--LSEQE 192
             D +P + D    W    ++N ++DQ  CGSCWAF+   A      I +   ++  LS ++
Sbjct:    79 DTIPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAED 138

Query:   193 LVDCDRKINAGCNGGLMDYAFQFIIQNG 220
             ++ C      GC GG    A+++++++G
Sbjct:   139 VLSCCSNCGYGCEGGYPINAWKYLVKSG 166


>WB|WBGene00010204 [details] [associations]
            symbol:F57F5.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0006898
            "receptor-mediated endocytosis" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0009792 GO:GO:0006898
            GO:GO:0040007 GO:GO:0002119 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0040011
            EMBL:Z75953 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            PANTHER:PTHR12411:SF16 RefSeq:NP_506011.2 ProteinModelPortal:Q20950
            SMR:Q20950 DIP:DIP-24447N IntAct:Q20950 MINT:MINT-211137
            STRING:Q20950 MEROPS:C01.A42 EnsemblMetazoa:F57F5.1 GeneID:179645
            KEGG:cel:CELE_F57F5.1 UCSC:F57F5.1 CTD:179645 WormBase:F57F5.1
            OMA:ADDINAC Uniprot:Q20950
        Length = 351

 Score = 176 (67.0 bits), Expect = 1.4e-18, Sum P(2) = 1.4e-18
 Identities = 35/87 (40%), Positives = 48/87 (55%)

Query:   264 KKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD-HGVVAVGYGTENGVDYWLVRNS 322
             K+ +   PV VA       F+HY  GV+    G++L  H V  +G+G +NG  YWL  NS
Sbjct:   260 KEIMTHGPVEVAFTVY-EDFEHYSGGVYVHTAGASLGGHAVKMLGWGVDNGTPYWLCANS 318

Query:   323 WGSDWGENGYVKLQRNLLDTNTGKCGI 349
             W  DWGENGY ++ R +      +CGI
Sbjct:   319 WNEDWGENGYFRIIRGV-----NECGI 340

 Score = 111 (44.1 bits), Expect = 1.4e-18, Sum P(2) = 1.4e-18
 Identities = 32/110 (29%), Positives = 53/110 (48%)

Query:   141 LPESVD----WREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGE--LISLSEQELV 194
             +P+S D    W    +++ ++DQ SCGSCWA S    +     I +    ++S+S  ++ 
Sbjct:    97 VPDSFDSRTAWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNAKTILSISADDIN 156

Query:   195 DCDRKI-NAGCNGGLMDYAFQFIIQNGGMD--SEQD------YPYLGAEN 235
              C   +   GCNGG    A++  ++ G +   S QD      YPY   E+
Sbjct:   157 ACCGMVCGNGCNGGYPIEAWRHYVKKGYVTGGSYQDKTGCKPYPYPPCEH 206


>DICTYBASE|DDB_G0288563 [details] [associations]
            symbol:DDB_G0288563 species:44689 "Dictyostelium
            discoideum" [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0288563
            GO:GO:0005615 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197
            EMBL:AAFI02000117 PANTHER:PTHR12411:SF16 RefSeq:XP_636643.1
            MEROPS:C01.A58 PRIDE:Q54IS1 EnsemblProtists:DDB0187993
            GeneID:8626689 KEGG:ddi:DDB_G0288563 InParanoid:Q54IS1 OMA:AWEYMEL
            Uniprot:Q54IS1
        Length = 314

 Score = 229 (85.7 bits), Expect = 1.7e-18, P = 1.7e-18
 Identities = 72/235 (30%), Positives = 116/235 (49%)

Query:   141 LPESVDWREK--GAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS---LSEQELVD 195
             +P S D R +    ++P+ +Q  CGSCWAFS+   +     I +    +   LS Q LV 
Sbjct:    88 IPTSFDSRVQWPDCIHPILNQEQCGSCWAFSSSEVLSDRLCIASNNKTNPGALSPQTLVA 147

Query:   196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYL---GAENKCDPSRRNAKVVSIDGY 252
             CD   N GC+GG+   A++++ +  G+ ++   PY    G    C  S  +++  S+  Y
Sbjct:   148 CDVYGNDGCSGGIPQLAWEYM-ELKGLPTDSCVPYTAGNGTVYSCQRSCSDSEDYSL--Y 204

Query:   253 EDVSPFDEMSLKKAVADQPVSVAIEAGG---------RAFQHYESGVFTGECGSAL--DH 301
                 PF   +LK   + Q +   I A G           F  Y SGV+    GS+L   H
Sbjct:   205 R-AKPF---TLKTCSSVQCIQENILAYGPIVGTMEVYEDFMSYSSGVYVMTPGSSLLGGH 260

Query:   302 GVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEAS 354
              +  VG+G +  + ++YW+V NSWG+DWG+ G+  +    ++T    C I+ +AS
Sbjct:   261 AIKIVGWGFDQTSQLNYWIVANSWGADWGQQGFFFIS---MET----CSISSDAS 308


>UNIPROTKB|P07858 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9606 "Homo sapiens"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0005739 "mitochondrion" evidence=IEA] [GO:0042470 "melanosome"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042981 "regulation of apoptotic process" evidence=TAS]
            [GO:0006508 "proteolysis" evidence=IDA] [GO:0005764 "lysosome"
            evidence=IDA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEP] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IDA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IDA] [GO:0005622 "intracellular" evidence=TAS]
            [GO:0036021 "endolysosome lumen" evidence=TAS] [GO:0045087 "innate
            immune response" evidence=TAS] [GO:0008233 "peptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0005730 "nucleolus" evidence=IDA] [GO:0043231 "intracellular
            membrane-bounded organelle" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_118779 GO:GO:0005739
            GO:GO:0042470 GO:GO:0048471 Reactome:REACT_6900 GO:GO:0005615
            GO:GO:0009612 GO:GO:0009611 GO:GO:0005730 GO:GO:0042981
            GO:GO:0009897 GO:GO:0045471 GO:GO:0016324 GO:GO:0009749
            GO:GO:0006914 GO:GO:0043434 eggNOG:COG4870 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087
            GO:GO:0050790 GO:GO:0042383 GO:GO:0014070 GO:GO:0042277
            GO:GO:0060548 GO:GO:0005901 GO:GO:0014075 GO:GO:0004197
            GO:GO:0070670 EMBL:CH471157 GO:GO:0007519 MEROPS:C01.060 CTD:1508
            HOGENOM:HOG000241341 HOVERGEN:HBG003480 KO:K01363 OMA:GYPSGAW
            OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 GO:GO:0097067
            PANTHER:PTHR12411:SF16 EMBL:M14221 EMBL:L16510 EMBL:AK092070
            EMBL:AK075393 EMBL:BC010240 EMBL:BC095408 EMBL:M13230
            IPI:IPI00295741 PIR:A26498 RefSeq:NP_001899.1 RefSeq:NP_680090.1
            RefSeq:NP_680091.1 RefSeq:NP_680092.1 RefSeq:NP_680093.1
            UniGene:Hs.520898 PDB:1CSB PDB:1GMY PDB:1HUC PDB:1PBH PDB:2IPP
            PDB:2PBH PDB:3AI8 PDB:3CBJ PDB:3CBK PDB:3K9M PDB:3PBH PDBsum:1CSB
            PDBsum:1GMY PDBsum:1HUC PDBsum:1PBH PDBsum:2IPP PDBsum:2PBH
            PDBsum:3AI8 PDBsum:3CBJ PDBsum:3CBK PDBsum:3K9M PDBsum:3PBH
            ProteinModelPortal:P07858 SMR:P07858 DIP:DIP-42785N IntAct:P07858
            MINT:MINT-1397666 STRING:P07858 PhosphoSite:P07858 DMDM:68067549
            SWISS-2DPAGE:P07858 UCD-2DPAGE:P07858 PaxDb:P07858
            PeptideAtlas:P07858 PRIDE:P07858 DNASU:1508 Ensembl:ENST00000345125
            Ensembl:ENST00000353047 Ensembl:ENST00000434271
            Ensembl:ENST00000453527 Ensembl:ENST00000530640
            Ensembl:ENST00000531089 Ensembl:ENST00000533455
            Ensembl:ENST00000534510 GeneID:1508 KEGG:hsa:1508 UCSC:uc003wum.3
            GeneCards:GC08M011700 H-InvDB:HIX0007320 HGNC:HGNC:2527
            HPA:CAB000457 HPA:HPA018156 MIM:116810 neXtProt:NX_P07858
            PharmGKB:PA27027 InParanoid:P07858 PhylomeDB:P07858
            BindingDB:P07858 ChEMBL:CHEMBL4072 ChiTaRS:CTSB
            EvolutionaryTrace:P07858 GenomeRNAi:1508 NextBio:6235
            PMAP-CutDB:P07858 ArrayExpress:P07858 Bgee:P07858 CleanEx:HS_CTSB
            Genevestigator:P07858 GermOnline:ENSG00000164733 GO:GO:0036021
            Uniprot:P07858
        Length = 339

 Score = 156 (60.0 bits), Expect = 2.0e-18, Sum P(2) = 2.0e-18
 Identities = 38/115 (33%), Positives = 56/115 (48%)

Query:   239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
             P+ +  K    + Y  VS  ++  + +   + PV  A       F  Y+SGV+    G  
Sbjct:   217 PTYKQDKHYGYNSYS-VSNSEKDIMAEIYKNGPVEGAFSVYSD-FLLYKSGVYQHVTGEM 274

Query:   299 LD-HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAME 352
             +  H +  +G+G ENG  YWLV NSW +DWG+NG+ K+ R         CGI  E
Sbjct:   275 MGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRG-----QDHCGIESE 324

 Score = 132 (51.5 bits), Expect = 2.0e-18, Sum P(2) = 2.0e-18
 Identities = 33/88 (37%), Positives = 44/88 (50%)

Query:   140 ELPESVDWREKG----AVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL---SEQE 192
             +LP S D RE+      +  ++DQGSCGSCWAF  V A+     I T   +S+   +E  
Sbjct:    79 KLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDL 138

Query:   193 LVDCDRKINAGCNGGLMDYAFQFIIQNG 220
             L  C      GCNGG    A+ F  + G
Sbjct:   139 LTCCGSMCGDGCNGGYPAEAWNFWTRKG 166


>WB|WBGene00021072 [details] [associations]
            symbol:W07B8.4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:FO081739 PIR:T31728 RefSeq:NP_503382.1
            HSSP:P53634 ProteinModelPortal:O16288 SMR:O16288 STRING:O16288
            MEROPS:C01.A39 PaxDb:O16288 EnsemblMetazoa:W07B8.4 GeneID:178611
            KEGG:cel:CELE_W07B8.4 UCSC:W07B8.4 CTD:178611 WormBase:W07B8.4
            InParanoid:O16288 OMA:ESQYGCK NextBio:901836 Uniprot:O16288
        Length = 335

 Score = 151 (58.2 bits), Expect = 9.9e-18, Sum P(2) = 9.9e-18
 Identities = 33/83 (39%), Positives = 46/83 (55%)

Query:   283 FQHYESGVFTGECGSALD-HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
             F  Y++G++T   G  L  H V  +G+G +NG  YWL  NSW + WGE GY ++ R + D
Sbjct:   258 FYLYKTGIYTHVAGGELGGHAVKMLGWGVDNGTPYWLAANSWNTVWGEKGYFRILRGV-D 316

Query:   342 TNTGKCGI--AMEASYPVKNSQN 362
                 +CGI  A  A  P  N +N
Sbjct:   317 ----ECGIESAAVAGMPDLNRRN 335

 Score = 131 (51.2 bits), Expect = 9.9e-18, Sum P(2) = 9.9e-18
 Identities = 36/110 (32%), Positives = 55/110 (50%)

Query:   136 KAGDELPESVD----WREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT-GELISL-- 188
             +  D +P+S D    W +  +VN ++DQ  CGSCWA +   A+     I + G++ +L  
Sbjct:    68 ETADSIPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVNTLLS 127

Query:   189 SEQELVDCDRKINAG--CNGGLMDYAFQFIIQNG---GMDSEQDY---PY 230
             +E  L  C  K N G  C GG    A+++ ++NG   G   E  Y   PY
Sbjct:   128 AEDILTCCTGKFNCGDGCEGGYPIQAWRYWVKNGLVTGGSFESQYGCKPY 177


>TAIR|locus:2204873 [details] [associations]
            symbol:AT1G02300 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002684 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 KO:K01363
            PANTHER:PTHR12411:SF16 OMA:ADDINAC IPI:IPI00534431
            RefSeq:NP_563647.1 UniGene:At.43952 ProteinModelPortal:F4HVZ1
            SMR:F4HVZ1 MEROPS:C01.A10 EnsemblPlants:AT1G02300.1 GeneID:839576
            KEGG:ath:AT1G02300 ArrayExpress:F4HVZ1 Uniprot:F4HVZ1
        Length = 379

 Score = 177 (67.4 bits), Expect = 1.8e-17, Sum P(2) = 1.8e-17
 Identities = 44/146 (30%), Positives = 70/146 (47%)

Query:   225 EQDYPYLGAENKCDPSRR---NAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
             E  YP    E KC    +    +K   +  Y  ++P  +  + +   + PV VA      
Sbjct:   228 EPTYPTPKCERKCVSRNQLWGESKHYGVGAYR-INPDPQDIMAEVYKNGPVEVAFTVY-E 285

Query:   282 AFQHYESGVFTGECGSALD-HGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
              F HY+SGV+    G+ +  H V  +G+GT ++G DYWL+ N W   WG++GY K++R  
Sbjct:   286 DFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRG- 344

Query:   340 LDTNTGKCGIAMEASYPVKNSQNSAK 365
                 T +CGI       + + +N  K
Sbjct:   345 ----TNECGIEQSVVAGLPSEKNVFK 366

 Score = 101 (40.6 bits), Expect = 1.8e-17, Sum P(2) = 1.8e-17
 Identities = 28/86 (32%), Positives = 42/86 (48%)

Query:   160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQ 218
             G CGSCWAF  V ++     I     +SLS  +++ C   +   GCNGG    A+ +   
Sbjct:   146 GHCGSCWAFGAVESLSDRFCIKYNLNVSLSANDVIACCGLLCGFGCNGGFPMGAWLYFKY 205

Query:   219 NGGMDSEQDYPYL---GAENK-CDPS 240
             +G +  E D PY    G  +  C+P+
Sbjct:   206 HGVVTQECD-PYFDNTGCSHPGCEPT 230


>WB|WBGene00000783 [details] [associations]
            symbol:cpr-3 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0009792 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39890 EMBL:L39925 EMBL:Z81119
            EMBL:Z82057 PIR:T37282 RefSeq:NP_506790.1 UniGene:Cel.23503
            ProteinModelPortal:P43507 SMR:P43507 MEROPS:C01.A33
            EnsemblMetazoa:T10H4.12 GeneID:180033 KEGG:cel:CELE_T10H4.12
            UCSC:T10H4.12 CTD:180033 WormBase:T10H4.12 eggNOG:NOG240190
            InParanoid:P43507 OMA:PVEASYK NextBio:907824 Uniprot:P43507
        Length = 370

 Score = 165 (63.1 bits), Expect = 2.5e-17, Sum P(3) = 2.5e-17
 Identities = 28/56 (50%), Positives = 39/56 (69%)

Query:   283 FQHYESGVFTGECGSALD-HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQR 337
             F HY+SGV+    G  +  H V  +G+G ENGVDYWL+ NSWG+ +GE G+ K++R
Sbjct:   265 FYHYKSGVYHYTSGKLVGGHAVKIIGWGVENGVDYWLIANSWGTSFGEKGFFKIRR 320

 Score = 103 (41.3 bits), Expect = 2.5e-17, Sum P(3) = 2.5e-17
 Identities = 28/97 (28%), Positives = 42/97 (43%)

Query:   139 DELPESVDWREK----GAVNPVKDQGSCGSCWAFSTVAAVEG---INKIVTGELISLSEQ 191
             + LP++ D REK      +  +++Q +CGSCWAF     +     I    T + +   E 
Sbjct:    90 EPLPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVED 149

Query:   192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDY 228
              L  C      GC GG    A +F   +G +    DY
Sbjct:   150 ILSCCGTTCGYGCKGGYSIEALRFWASSGAVTGG-DY 185

 Score = 46 (21.3 bits), Expect = 1.6e-11, Sum P(3) = 1.6e-11
 Identities = 13/37 (35%), Positives = 15/37 (40%)

Query:   133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFS 169
             Y CK G  +     W   GAV    D G  G C  +S
Sbjct:   160 YGCKGGYSIEALRFWASSGAVTG-GDYGGHG-CMPYS 194

 Score = 44 (20.5 bits), Expect = 2.5e-17, Sum P(3) = 2.5e-17
 Identities = 7/18 (38%), Positives = 12/18 (66%)

Query:    46 DEVMTIYQTWLAKHGKTS 63
             D V T+  +W+A+H + S
Sbjct:    37 DHVNTVQTSWVAEHNEIS 54


>UNIPROTKB|P43233 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9031 "Gallus gallus"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 MEROPS:C01.060 CTD:1508
            HOGENOM:HOG000241341 HOVERGEN:HBG003480 KO:K01363 OrthoDB:EOG4K6G4C
            PANTHER:PTHR12411:SF16 EMBL:U18083 IPI:IPI00573387 PIR:S58770
            RefSeq:NP_990702.1 UniGene:Gga.3854 ProteinModelPortal:P43233
            SMR:P43233 STRING:P43233 PRIDE:P43233 GeneID:396329 KEGG:gga:396329
            InParanoid:P43233 NextBio:20816377 Uniprot:P43233
        Length = 340

 Score = 140 (54.3 bits), Expect = 3.2e-17, Sum P(2) = 3.2e-17
 Identities = 42/145 (28%), Positives = 68/145 (46%)

Query:    83 IDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELP 142
             ++  N LN T + G N F + T+  Y     GT        +    A +R       +LP
Sbjct:    31 VNHINKLNTTGRAGHN-FHN-TDMSYVKKLCGT-------FLGGPKAPERVDFAEDMDLP 81

Query:   143 ESVD----WREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISL--SEQELVDC 196
             ++ D    W     ++ ++DQGSCGSCWAF  V A+     + T   +S+  S ++L+ C
Sbjct:    82 DTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLLSC 141

Query:   197 -DRKINAGCNGGLMDYAFQFIIQNG 220
                +   GCNGG    A+++  + G
Sbjct:   142 CGFECGMGCNGGYPSGAWRYWTERG 166

 Score = 139 (54.0 bits), Expect = 3.2e-17, Sum P(2) = 3.2e-17
 Identities = 37/115 (32%), Positives = 52/115 (45%)

Query:   239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
             PS +  K   I  Y  V   ++  + +   + PV  A       F  Y+SGV+    G  
Sbjct:   218 PSYKEDKHYGITSY-GVPRSEKEIMAEIYKNGPVEGAFIVY-EDFLMYKSGVYQHVSGEQ 275

Query:   299 LD-HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAME 352
             +  H +  +G+G ENG  YWL  NSW +DWG  G+ K+ R         CGI  E
Sbjct:   276 VGGHAIRILGWGVENGTPYWLAANSWNTDWGITGFFKILRG-----EDHCGIESE 325


>FB|FBgn0034709 [details] [associations]
            symbol:Swim "Secreted Wg-interacting molecule" species:7227
            "Drosophila melanogaster" [GO:0004197 "cysteine-type endopeptidase
            activity" evidence=ISS] [GO:0006955 "immune response" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044 "scavenger
            receptor activity" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0042600 "chorion" evidence=IDA]
            [GO:0035593 "positive regulation of Wnt receptor signaling pathway
            by establishment of Wnt protein localization to extracellular
            region" evidence=IMP] [GO:0030177 "positive regulation of Wnt
            receptor signaling pathway" evidence=IDA] [GO:0005615
            "extracellular space" evidence=IDA] [GO:0017147 "Wnt-protein
            binding" evidence=IDA] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR013128 Pfam:PF00112 PROSITE:PS50958 SMART:SM00201
            SMART:SM00645 EMBL:AE013599 GO:GO:0005615 GO:GO:0006955
            GO:GO:0030247 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GO:GO:0017147 GO:GO:0005044
            GeneTree:ENSGT00560000076599 GO:GO:0042600 eggNOG:NOG310046
            OMA:DNCNRCT HSSP:P80067 EMBL:AY113377 RefSeq:NP_611652.2
            RefSeq:NP_726176.1 UniGene:Dm.732 SMR:Q7JWQ7 IntAct:Q7JWQ7
            EnsemblMetazoa:FBtr0071784 EnsemblMetazoa:FBtr0071785 GeneID:37537
            KEGG:dme:Dmel_CG3074 UCSC:CG3074-RA FlyBase:FBgn0034709
            HOGENOM:HOG000264150 InParanoid:Q7JWQ7 OrthoDB:EOG48CZ9P
            GenomeRNAi:37537 NextBio:804155 GO:GO:0035593 Uniprot:Q7JWQ7
        Length = 431

 Score = 149 (57.5 bits), Expect = 4.0e-17, Sum P(2) = 4.0e-17
 Identities = 41/129 (31%), Positives = 66/129 (51%)

Query:   139 DELPESVDWREKGA--VNPVKDQGSCGSCWAFSTVAAVEGINKIVT-G-ELISLSEQELV 194
             D LP S +  +K +  ++ V DQG CG+ W  ST +       I + G E + LS Q ++
Sbjct:   185 DGLPSSFNALDKWSSYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNIL 244

Query:   195 DCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYED 254
              C R+   GC GG +D A++++ + G +D E  YPY    + C   R N++ +  +G + 
Sbjct:   245 SCTRR-QQGCEGGHLDAAWRYLHKKGVVD-ENCYPYTQHRDTCK-IRHNSRSLRANGCQK 301

Query:   255 VSPFDEMSL 263
                 D  SL
Sbjct:   302 PVNVDRDSL 310

 Score = 132 (51.5 bits), Expect = 4.0e-17, Sum P(2) = 4.0e-17
 Identities = 36/102 (35%), Positives = 49/102 (48%)

Query:   271 PVSVAIEAGGRAFQHYESGVFTGECGSALD----HGVVAVGYGTE-NGVDYWLVRNSWGS 325
             PV   +    R F  Y  GV+     +       H V  VG+G E NG  YW+  NSWGS
Sbjct:   334 PVQATMRVN-RDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGS 392

Query:   326 DWGENGYVKLQRNLLDTNTGKCGIA--MEASYPVKNSQNSAK 365
              WGE+GY ++ R      + +CGI   + AS+P   S  + K
Sbjct:   393 WWGEHGYFRILRG-----SNECGIEEYVLASWPYVYSYYNVK 429


>WB|WBGene00000785 [details] [associations]
            symbol:cpr-5 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39896 EMBL:L39927 EMBL:FO081739
            PIR:T37277 RefSeq:NP_503383.1 UniGene:Cel.19730
            ProteinModelPortal:P43509 SMR:P43509 DIP:DIP-25329N IntAct:P43509
            MINT:MINT-1051285 STRING:P43509 MEROPS:C01.A35 PaxDb:P43509
            EnsemblMetazoa:W07B8.5 GeneID:178612 KEGG:cel:CELE_W07B8.5
            UCSC:W07B8.5.1 CTD:178612 WormBase:W07B8.5 InParanoid:P43509
            OMA:DAIPDHF NextBio:901840 Uniprot:P43509
        Length = 344

 Score = 154 (59.3 bits), Expect = 5.4e-17, Sum P(2) = 5.4e-17
 Identities = 33/84 (39%), Positives = 44/84 (52%)

Query:   271 PVSVAIEAGGRAFQHYESGVFTGECGSALD-HGVVAVGYGTENGVDYWLVRNSWGSDWGE 329
             P+ VA       F  Y +GV+    G++L  H V  +G+G +NG  YWLV NSW   WGE
Sbjct:   256 PIEVAFTVY-EDFYQYTTGVYVHTAGASLGGHAVKILGWGVDNGTPYWLVANSWNVAWGE 314

Query:   330 NGYVKLQRNLLDTNTGKCGIAMEA 353
              GY ++ R L      +CGI   A
Sbjct:   315 KGYFRIIRGL-----NECGIEHSA 333

 Score = 121 (47.7 bits), Expect = 5.4e-17, Sum P(2) = 5.4e-17
 Identities = 27/96 (28%), Positives = 52/96 (54%)

Query:   134 ACKAGDELPESVDWREKG----AVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS-- 187
             A +  D +P+  D R++     ++N ++DQ  CGSCWAF+   A+     I +   ++  
Sbjct:    75 ATEVSDAIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTL 134

Query:   188 LSEQELVDCDRKINA---GCNGGLMDYAFQFIIQNG 220
             LS ++L+ C   + +   GC GG    A+++ +++G
Sbjct:   135 LSSEDLLSCCTGMFSCGNGCEGGYPIQAWKWWVKHG 170


>DICTYBASE|DDB_G0283921 [details] [associations]
            symbol:ctsB "cathepsin B precursor" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0283921 GO:GO:0005615
            GenomeReviews:CM000153_GR GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:AAFI02000058
            eggNOG:NOG315657 PANTHER:PTHR12411:SF16 OMA:CSLSCQS
            RefSeq:XP_638805.1 HSSP:P07688 MEROPS:C01.A59
            EnsemblProtists:DDB0233997 GeneID:8624329 KEGG:ddi:DDB_G0283921
            Uniprot:Q54QD9
        Length = 311

 Score = 142 (55.0 bits), Expect = 1.0e-16, Sum P(2) = 1.0e-16
 Identities = 34/93 (36%), Positives = 47/93 (50%)

Query:   259 DEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD-HGVVAVGYGTENGVDYW 317
             DE  +++ V + PV          F  Y+SGV+    G  L  H V  VG+GT NGVDY+
Sbjct:   219 DEAIMQEIVTNGPVEACFTVF-EDFLAYKSGVYVHTTGKDLGGHCVKLVGFGTLNGVDYY 277

Query:   318 LVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
                N W + WG+NG   ++R       G CGI+
Sbjct:   278 AANNQWTTSWGDNGTFLIKR-------GDCGIS 303

 Score = 130 (50.8 bits), Expect = 1.0e-16, Sum P(2) = 1.0e-16
 Identities = 39/142 (27%), Positives = 71/142 (50%)

Query:   106 EEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPES----VDWREKGAVNPVKDQGS 161
             +++  + +G     KR   + K+  + Y    G ++P S     +W     ++ +++Q  
Sbjct:    45 DQFDNIKVGQLLGFKRSPNRPKLQIKSYD-PLGVQIPTSFNAQTNWPNCTTISQIQNQAR 103

Query:   162 CGSCWAF-STVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
             CGSCWAF +T +A + +  I   E + LS  ++V CD   N GC GG    A+ ++ + G
Sbjct:   104 CGSCWAFGATESATDRLC-IHNNENVQLSFMDMVTCDETDN-GCEGGDAFSAWNWLRKQG 161

Query:   221 GMDSEQDYPYLGAENKCDPSRR 242
              + SE+  PY      C P+++
Sbjct:   162 AV-SEECLPY--TIPTCPPAQQ 180


>WB|WBGene00000781 [details] [associations]
            symbol:cpr-1 species:6239 "Caenorhabditis elegans"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008340 "determination
            of adult lifespan" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008340 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            eggNOG:NOG315657 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            KO:K01363 PANTHER:PTHR12411:SF16 EMBL:M74797 EMBL:Z78012 PIR:T20148
            RefSeq:NP_506002.2 ProteinModelPortal:P25807 SMR:P25807
            DIP:DIP-25619N MINT:MINT-1058393 STRING:P25807 MEROPS:C01.A32
            PaxDb:P25807 EnsemblMetazoa:C52E4.1 GeneID:179637
            KEGG:cel:CELE_C52E4.1 UCSC:C52E4.1 CTD:179637 WormBase:C52E4.1
            InParanoid:P25807 OMA:CSLSCQS NextBio:906250 Uniprot:P25807
        Length = 329

 Score = 162 (62.1 bits), Expect = 5.4e-16, Sum P(2) = 5.4e-16
 Identities = 37/95 (38%), Positives = 53/95 (55%)

Query:   257 PFDEMSLKKAV-ADQPVSVAIEAGGRAFQHYESGVFTGECGSALD-HGVVAVGYGTENGV 314
             P +  S++  + A+ PV  A       F  Y+SGV+    G  L  H +  +G+GTE+G 
Sbjct:   230 PKNAASIQAEIYANGPVEAAFSVY-EDFYKYKSGVYKHTAGKYLGGHAIKIIGWGTESGS 288

Query:   315 DYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
              YWLV NSWG +WGE+G+ K+ R   D    +CGI
Sbjct:   289 PYWLVANSWGVNWGESGFFKIYRG--DD---QCGI 318

 Score = 101 (40.6 bits), Expect = 5.4e-16, Sum P(2) = 5.4e-16
 Identities = 39/160 (24%), Positives = 64/160 (40%)

Query:    83 IDEHNSLNRTYKVGLNKFADLTNEE--YRAM---YLGTRSDAKRRLMKSKVASQRYACKA 137
             +D  NS    +K    +  ++T EE  ++ M   Y    SD  R   +  V +   A   
Sbjct:    34 VDYVNSAQSLFKT---EHVEITEEEMKFKLMDGKYAAAHSDEIRATEQEVVLASVPAT-- 88

Query:   138 GDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVT--GELISLSEQELVD 195
                      W E  ++  ++DQ +CGSCWAF     +     I T   +   +S  +L+ 
Sbjct:    89 ---FDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLLS 145

Query:   196 C-DRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAE 234
             C       GC GG    A ++   + G+ +  DY   G +
Sbjct:   146 CCGSSCGNGCEGGYPIQALRWW-DSKGVVTGGDYHGAGCK 184


>WB|WBGene00000788 [details] [associations]
            symbol:cpz-1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] [GO:0009792 "embryo development ending in
            birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0010171 "body morphogenesis" evidence=IMP]
            [GO:0018996 "molting cycle, collagen and cuticulin-based cuticle"
            evidence=IMP] [GO:0031012 "extracellular matrix" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0009792 GO:GO:0040007 GO:GO:0002119
            GO:GO:0018996 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0010171 GO:GO:0031012
            GeneTree:ENSGT00560000076599 KO:K08568 OMA:QCGTCTE EMBL:FO081275
            EMBL:BK001409 PIR:T29872 RefSeq:NP_491023.2 HSSP:Q9UBR2
            ProteinModelPortal:G5EGP8 SMR:G5EGP8 IntAct:G5EGP8 MEROPS:C01.A38
            EnsemblMetazoa:F32B5.8 GeneID:171829 KEGG:cel:CELE_F32B5.8
            CTD:171829 WormBase:F32B5.8 NextBio:872879 Uniprot:G5EGP8
        Length = 306

 Score = 213 (80.0 bits), Expect = 7.1e-16, P = 7.1e-16
 Identities = 62/194 (31%), Positives = 93/194 (47%)

Query:   162 CGSCWAF-STVAAVEGIN--KIVTGELISLSEQELVDCDRK---INAGCNGGLMDYAFQF 215
             CGSCWAF +T A  + IN  +        LS QE++DC      +  G  GG+  YA + 
Sbjct:    92 CGSCWAFGATSALADRINIKRKNAWPQAYLSVQEVIDCSGAGTCVMGGEPGGVYKYAHEH 151

Query:   216 IIQNGGMDSEQDYPYLGAENKCDPSRR-----NAKVVSIDGYE--DVSPFDEM----SLK 264
                  G+  E    Y   + KCDP  R       +  SI  Y    VS +  +     +K
Sbjct:   152 -----GIPHETCNNYQARDGKCDPYNRCGSCWPGECFSIKNYTLYKVSEYGTVHGYEKMK 206

Query:   265 KAVADQ-PVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT--ENGVDYWLVRN 321
               +  + P++  I A  +AF+ Y  G++       +DH +   G+G   E+GV+YW+ RN
Sbjct:   207 AEIYHKGPIACGI-AATKAFETYAGGIYKEVTDEDIDHIISVHGWGVDHESGVEYWIGRN 265

Query:   322 SWGSDWGENGYVKL 335
             SWG  WGE+G+ K+
Sbjct:   266 SWGEPWGEHGWFKI 279

 Score = 121 (47.7 bits), Expect = 5.7e-05, Sum P(2) = 5.7e-05
 Identities = 37/116 (31%), Positives = 54/116 (46%)

Query:   139 DELPESVDWREKGAVNPVK-DQGS-----CGSCWAF-STVAAVEGIN--KIVTGELISLS 189
             ++LP++ DWR+   +N    D+       CGSCWAF +T A  + IN  +        LS
Sbjct:    63 EDLPKTWDWRDANGINYASADRNQHIPQYCGSCWAFGATSALADRINIKRKNAWPQAYLS 122

Query:   190 EQELVDCDRK---INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRR 242
              QE++DC      +  G  GG+  YA +      G+  E    Y   + KCDP  R
Sbjct:   123 VQEVIDCSGAGTCVMGGEPGGVYKYAHEH-----GIPHETCNNYQARDGKCDPYNR 173

 Score = 40 (19.1 bits), Expect = 5.7e-05, Sum P(2) = 5.7e-05
 Identities = 9/32 (28%), Positives = 17/32 (53%)

Query:   325 SDWGE-NGYVKLQRNLLDTNTGKCGIAMEASY 355
             S++G  +GY K++  +       CGIA   ++
Sbjct:   194 SEYGTVHGYEKMKAEIYHKGPIACGIAATKAF 225


>UNIPROTKB|Q9UBR2 [details] [associations]
            symbol:CTSZ "Cathepsin Z" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0060441 "epithelial tube
            branching involved in lung morphogenesis" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            Reactome:REACT_11123 Reactome:REACT_17015 InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 EMBL:CH471077 GO:GO:0005615 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AL109840 GO:GO:0060441 MEROPS:C01.013 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN
            BRENDA:3.4.18.1 EMBL:AF073890 EMBL:AF032906 EMBL:AF136273
            EMBL:AF136276 EMBL:AF136274 EMBL:AF136275 EMBL:AK314931
            EMBL:BC042168 EMBL:AF009923 IPI:IPI00002745 RefSeq:NP_001327.2
            UniGene:Hs.252549 PDB:1DEU PDB:1EF7 PDBsum:1DEU PDBsum:1EF7
            ProteinModelPortal:Q9UBR2 SMR:Q9UBR2 STRING:Q9UBR2 DMDM:12643324
            PaxDb:Q9UBR2 PeptideAtlas:Q9UBR2 PRIDE:Q9UBR2 DNASU:1522
            Ensembl:ENST00000217131 GeneID:1522 KEGG:hsa:1522 UCSC:uc002yai.2
            GeneCards:GC20M057570 HGNC:HGNC:2547 HPA:CAB025114 MIM:603169
            neXtProt:NX_Q9UBR2 PharmGKB:PA27043 InParanoid:Q9UBR2 OMA:QCGTCTE
            PhylomeDB:Q9UBR2 BindingDB:Q9UBR2 ChEMBL:CHEMBL4160 ChiTaRS:CTSZ
            EvolutionaryTrace:Q9UBR2 GenomeRNAi:1522 NextBio:6299 Bgee:Q9UBR2
            CleanEx:HS_CTSZ Genevestigator:Q9UBR2 GermOnline:ENSG00000101160
            Uniprot:Q9UBR2
        Length = 303

 Score = 211 (79.3 bits), Expect = 1.1e-15, P = 1.1e-15
 Identities = 67/210 (31%), Positives = 102/210 (48%)

Query:   162 CGSCWAF-STVAAVEGINKIVTGELIS--LSEQELVDCDRKINAG-CNGG----LMDYAF 213
             CGSCWA  ST A  + IN    G   S  LS Q ++DC    NAG C GG    + DYA 
Sbjct:    89 CGSCWAHASTSAMADRINIKRKGAWPSTLLSVQNVIDCG---NAGSCEGGNDLSVWDYAH 145

Query:   214 QFIIQNG------GMDSEQD-YPYLGAENKCDPSR--RNAKVVSIDGYEDVSPFDEMSLK 264
             Q  I +         D E D +   G  N+       RN  +  +  Y  +S  ++M + 
Sbjct:   146 QHGIPDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLSGREKM-MA 204

Query:   265 KAVADQPVSVAIEAGGRAFQHYESGVFTG-ECGSALDHGVVAVGYGTENGVDYWLVRNSW 323
             +  A+ P+S  I A  R   +Y  G++   +  + ++H V   G+G  +G +YW+VRNSW
Sbjct:   205 EIYANGPISCGIMATER-LANYTGGIYAEYQDTTYINHVVSVAGWGISDGTEYWIVRNSW 263

Query:   324 GSDWGENGYVKLQRNLLDTNTG-KCGIAME 352
             G  WGE G++++  +      G +  +A+E
Sbjct:   264 GEPWGERGWLRIVTSTYKDGKGARYNLAIE 293

 Score = 112 (44.5 bits), Expect = 0.00094, P = 0.00094
 Identities = 42/113 (37%), Positives = 53/113 (46%)

Query:   140 ELPESVDWREKGAVNPV---KDQGS---CGSCWAF-STVAAVEGINKIVTGELIS--LSE 190
             +LP+S DWR    VN     ++Q     CGSCWA  ST A  + IN    G   S  LS 
Sbjct:    61 DLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTLLSV 120

Query:   191 QELVDCDRKINAG-CNGG----LMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
             Q ++DC    NAG C GG    + DYA Q      G+  E    Y   + +CD
Sbjct:   121 QNVIDCG---NAGSCEGGNDLSVWDYAHQH-----GIPDETCNNYQAKDQECD 165


>DICTYBASE|DDB_G0283401 [details] [associations]
            symbol:ctsZ "cathepsin Z precursor" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            dictyBase:DDB_G0283401 GO:GO:0005615 GenomeReviews:CM000153_GR
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 EMBL:AAFI02000055 KO:K08568 OMA:QCGTCTE
            eggNOG:NOG275763 RefSeq:XP_639036.1 ProteinModelPortal:Q54R55
            IntAct:Q54R55 MEROPS:C01.A60 PRIDE:Q54R55
            EnsemblProtists:DDB0233836 GeneID:8624061 KEGG:ddi:DDB_G0283401
            InParanoid:Q54R55 Uniprot:Q54R55
        Length = 296

 Score = 209 (78.6 bits), Expect = 1.4e-15, P = 1.4e-15
 Identities = 54/201 (26%), Positives = 98/201 (48%)

Query:   162 CGSCWAFSTVAAVEGINKIVTGEL---ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQ 218
             CG CWAF++ +++    KI        ++++ Q L+DC+      C+GG    AF FI +
Sbjct:    85 CGGCWAFASTSSISDRIKIQRKAAFPDVNVAPQHLIDCNG--GGTCDGGDPGDAFAFINE 142

Query:   219 NGGMDSE-QDYPYLGAENKCDPSRRNAKV------------VSIDGYEDVSPFDEMSLKK 265
             NG +D   + Y      ++C P+ +                +++  Y  V    +M + +
Sbjct:   143 NGIVDETCKPYQAKNLPDECSPACKTCNPDGTCQAIPVHTNITVTEYGSVRGAKDM-MAE 201

Query:   266 AVADQPVSVAIEAGGRAFQHYESGVFTG-ECGSALDHGVVAVGYGTENGVDYWLVRNSWG 324
               A  P++ +I+A  +  + Y SG+F   +     +H +  +G+G ++   YW+VRNSWG
Sbjct:   202 IYARGPIACSIDATSK-LEAYTSGIFKEFKLDPLPNHIISVIGWGVQDSTPYWIVRNSWG 260

Query:   325 SDWGENGYVKLQRNLLDTNTG 345
             S +GE G+  + +  L  N G
Sbjct:   261 SYYGEGGFFNIVQGSLFENLG 281

 Score = 131 (51.2 bits), Expect = 6.2e-06, P = 6.2e-06
 Identities = 40/121 (33%), Positives = 61/121 (50%)

Query:   140 ELPESVDWREKGAVNPV---KDQGS---CGSCWAFSTVAAVEGINKIVTGEL---ISLSE 190
             E+P+S DWR    VN +   ++Q     CG CWAF++ +++    KI        ++++ 
Sbjct:    57 EVPQSWDWRNVSGVNYLTMNRNQHIPQYCGGCWAFASTSSISDRIKIQRKAAFPDVNVAP 116

Query:   191 QELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSID 250
             Q L+DC+      C+GG    AF FI +NG +D E   PY  A+N  D      K  + D
Sbjct:   117 QHLIDCNG--GGTCDGGDPGDAFAFINENGIVD-ETCKPYQ-AKNLPDECSPACKTCNPD 172

Query:   251 G 251
             G
Sbjct:   173 G 173


>DICTYBASE|DDB_G0286055 [details] [associations]
            symbol:DDB_G0286055 "peptidase C1A family protein"
            species:44689 "Dictyostelium discoideum" [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 dictyBase:DDB_G0286055 Pfam:PF00188 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411 EMBL:AAFI02000085
            Gene3D:3.40.33.10 InterPro:IPR001283 InterPro:IPR014044
            PRINTS:PR00837 SMART:SM00198 SUPFAM:SSF55797
            ProtClustDB:CLSZ2429919 RefSeq:XP_637918.1
            ProteinModelPortal:Q54MB6 EnsemblProtists:DDB0186794 GeneID:8625429
            KEGG:ddi:DDB_G0286055 InParanoid:Q54MB6 OMA:GENGFAR Uniprot:Q54MB6
        Length = 435

 Score = 218 (81.8 bits), Expect = 1.5e-15, P = 1.5e-15
 Identities = 68/236 (28%), Positives = 110/236 (46%)

Query:   144 SVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR----- 198
             S DWR+ G V   KD  +C S WAF+     E  + + T      S Q+L+DC       
Sbjct:   211 SFDWRDNGVVGFPKDSSNCASGWAFTAAGIFESRSAMRTRHRYDYSAQQLIDCINVCIII 270

Query:   199 --KINAG----CN--GGLMDYAFQFIIQNGGMDSEQDYPYLGAEN-KCDPSRRNAKVVSI 249
                 + G    C+   G ++ A  +  Q  G+ +   YPY+GA +  C  ++ +  V   
Sbjct:   271 FSNFSIGNYTKCSRFSGELNKALMYA-QAYGLQATSTYPYVGASSIGCSYNQSSIAVEGG 329

Query:   250 D-GYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSAL------DHG 302
             D  Y  V   D + ++K     PV V I      F +Y  G+F  EC + L      +H 
Sbjct:   330 DVEYSQVGR-DSI-VEKCRKQGPVGVGIYVTNE-FLYYAGGIF--ECNNTLIDNANINHN 384

Query:   303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
             V+ VGY  ++  +Y++++N++G  WGENG+ ++     D N   C IA   +Y ++
Sbjct:   385 VLLVGYNEKD--NYYIIKNNFGRTWGENGFARITA---DVNKD-CLIAKNPAYSIQ 434


>RGD|708479 [details] [associations]
            symbol:Ctsz "cathepsin Z" species:10116 "Rattus norvegicus"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=TAS]
            [GO:0005615 "extracellular space" evidence=IEA;ISO] [GO:0005783
            "endoplasmic reticulum" evidence=IEA;ISO] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0060441 "epithelial tube branching involved in
            lung morphogenesis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:708479 GO:GO:0005576 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004197 MEROPS:C01.013 CTD:1522 HOVERGEN:HBG004456 KO:K08568
            EMBL:AB023781 EMBL:BC091110 IPI:IPI00207663 RefSeq:NP_899159.1
            UniGene:Rn.1475 ProteinModelPortal:Q9R1T3 SMR:Q9R1T3 PRIDE:Q9R1T3
            GeneID:252929 KEGG:rno:252929 BindingDB:Q9R1T3 NextBio:624097
            Genevestigator:Q9R1T3 Uniprot:Q9R1T3
        Length = 306

 Score = 209 (78.6 bits), Expect = 2.6e-15, P = 2.6e-15
 Identities = 65/212 (30%), Positives = 100/212 (47%)

Query:   162 CGSCWAF-STVAAVEGINKIVTGELIS--LSEQELVDCDRKINAG-CNGGLMDYAFQFII 217
             CGSCWA  ST A  + IN    G   S  LS Q ++DC    NAG C GG  D       
Sbjct:    91 CGSCWAHGSTSALADRINIKRKGAWPSTLLSVQNVIDCG---NAGSCEGG-NDLPVWEYA 146

Query:   218 QNGGMDSEQDYPYLGAENKCDPSRR--------------NAKVVSIDGYEDVSPFDEMSL 263
                G+  E    Y   + +CD   +              N  +  +  Y  +S  ++M +
Sbjct:   147 HKHGIPDETCNNYQAKDQECDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSLSGREKM-M 205

Query:   264 KKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV-GYGTEN-GVDYWLVRN 321
              +  A+ P+S  I A  R   +Y  G++T     A+ + +++V G+G  N G++YW+VRN
Sbjct:   206 AEIYANGPISCGIMATER-MSNYTGGIYTEYQNQAIINHIISVAGWGVSNDGIEYWIVRN 264

Query:   322 SWGSDWGENGYVKLQRNLLDTNTGKC-GIAME 352
             SWG  WGE G++++  +     TG    +A+E
Sbjct:   265 SWGEPWGERGWMRIVTSTYKGGTGSSYNLAIE 296


>DICTYBASE|DDB_G0292462 [details] [associations]
            symbol:DDB_G0292462 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0292462 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            EMBL:AAFI02000190 RefSeq:XP_629634.1 MEROPS:C01.A56
            EnsemblProtists:DDB0184413 GeneID:8628698 KEGG:ddi:DDB_G0292462
            InParanoid:Q54D62 OMA:NTQVESH Uniprot:Q54D62
        Length = 323

 Score = 209 (78.6 bits), Expect = 4.6e-15, P = 4.6e-15
 Identities = 62/210 (29%), Positives = 106/210 (50%)

Query:   146 DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS--LSEQELVDCDRKI--- 200
             +W +   ++PV++Q SCGSCWA  T   +     I + + I   LS Q L+DCD      
Sbjct:    55 NWGD--CMSPVREQQSCGSCWAQVTSGILADRMCIESDKNIKMLLSPQYLMDCDGSCVSD 112

Query:   201 -----NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRN--AKVVSIDGYE 253
                  N GC GG +  A   +I N G+ S++   Y  +++   P+  +  + + +   Y+
Sbjct:   113 GVSGCNNGCKGGFVGLALTRLI-NEGIVSDECLSYQASKDSSCPTTCDDGSPISNTTIYK 171

Query:   254 DVS----PFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD-HGVVAVGY 308
               S    P  + +  + + + PV +A       F+ ++  V+     + ++ H V  VG+
Sbjct:   172 ATSCRAFPTVQDAQYEIMTNGPV-IATFMLYSDFKPHKWDVYIKSSNTQVESHAVRVVGW 230

Query:   309 GT-ENGVDYWLVRNSWGSDWGENGYVKLQR 337
             GT  +GVDYW+  NSWG+ WG+ GY K++R
Sbjct:   231 GTTSDGVDYWIAANSWGTGWGDKGYFKIRR 260


>WB|WBGene00000782 [details] [associations]
            symbol:cpr-2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 eggNOG:NOG315657 GeneTree:ENSGT00560000076599
            HOGENOM:HOG000241341 PANTHER:PTHR12411:SF16 EMBL:Z81531
            RefSeq:NP_507186.3 ProteinModelPortal:O45466 SMR:O45466
            MEROPS:C01.A40 PaxDb:O45466 EnsemblMetazoa:F36D3.9 GeneID:185355
            KEGG:cel:CELE_F36D3.9 CTD:185355 WormBase:F36D3.9 OMA:FDARLRW
            Uniprot:O45466
        Length = 326

 Score = 147 (56.8 bits), Expect = 5.0e-15, Sum P(2) = 5.0e-15
 Identities = 30/68 (44%), Positives = 41/68 (60%)

Query:   283 FQHYESGVFTGECG-SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
             F+ Y+SG++    G S   H V  +G+GTE G  YWL  NSWGS WGE+G  ++ R + D
Sbjct:   254 FEKYKSGIYRHIAGRSKGGHAVKLIGWGTERGTPYWLAVNSWGSQWGESGTFRILRGV-D 312

Query:   342 TNTGKCGI 349
                 +CGI
Sbjct:   313 ----ECGI 316

 Score = 109 (43.4 bits), Expect = 5.0e-15, Sum P(2) = 5.0e-15
 Identities = 29/98 (29%), Positives = 46/98 (46%)

Query:   147 WREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTG--ELISLSEQELVDC-DRKINAG 203
             W +  ++  +++Q +CGSCWAFST   +     I +   +   +S  +L+ C       G
Sbjct:    93 WPQCKSMKLIREQSNCGSCWAFSTAEVISDRTCIASNGTQQPIISPTDLLTCCGMSCGEG 152

Query:   204 CNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSR 241
             C+GG    AFQ+  + G +       YLG   K  P R
Sbjct:   153 CDGGFPYRAFQWWARRGVVTGGD---YLGTGCKPYPIR 187


>MGI|MGI:1891190 [details] [associations]
            symbol:Ctsz "cathepsin Z" species:10090 "Mus musculus"
            [GO:0005615 "extracellular space" evidence=ISO] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1891190 GO:GO:0005783 GO:GO:0005615 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0060441 GeneTree:ENSGT00560000076599 MEROPS:C01.013 CTD:1522
            HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN OMA:QCGTCTE
            ChiTaRS:CTSZ EMBL:AJ242663 EMBL:AF136277 EMBL:AF136278
            EMBL:BC008619 IPI:IPI00986833 RefSeq:NP_071720.1 UniGene:Mm.156919
            ProteinModelPortal:Q9WUU7 SMR:Q9WUU7 IntAct:Q9WUU7 STRING:Q9WUU7
            PaxDb:Q9WUU7 PRIDE:Q9WUU7 Ensembl:ENSMUST00000016400 GeneID:64138
            KEGG:mmu:64138 InParanoid:Q9WUU7 NextBio:319927 Bgee:Q9WUU7
            CleanEx:MM_CTSZ Genevestigator:Q9WUU7 GermOnline:ENSMUSG00000016256
            Uniprot:Q9WUU7
        Length = 306

 Score = 206 (77.6 bits), Expect = 6.7e-15, P = 6.7e-15
 Identities = 64/213 (30%), Positives = 103/213 (48%)

Query:   162 CGSCWAF-STVAAVEGINKIVTGEL--ISLSEQELVDCDRKINAG-CNGG----LMDYAF 213
             CGSCWA  ST A  + IN    G    I LS Q ++DC    NAG C GG    + +YA 
Sbjct:    91 CGSCWAHGSTSAMADRINIKRKGAWPSILLSVQNVIDCG---NAGSCEGGNDLPVWEYAH 147

Query:   214 QFIIQ----NGGMDSEQDYPYLGAENKCDPSR-----RNAKVVSIDGYEDVSPFDEMSLK 264
             +  I     N     +QD         C   +     +N  +  +  Y  +S  ++M + 
Sbjct:   148 KHGIPDETCNNYQAKDQDCDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSLSGREKM-MA 206

Query:   265 KAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV-GYGTEN-GVDYWLVRNS 322
             +  A+ P+S  I A      +Y  G++      A+ + +++V G+G  N G++YW+VRNS
Sbjct:   207 EIYANGPISCGIMAT-EMMSNYTGGIYAEHQDQAVINHIISVAGWGVSNDGIEYWIVRNS 265

Query:   323 WGSDWGENGYVKLQRNLLDTNTGKC-GIAMEAS 354
             WG  WGE G++++  +     TG    +A+E++
Sbjct:   266 WGEPWGEKGWMRIVTSTYKGGTGDSYNLAIESA 298


>WB|WBGene00013072 [details] [associations]
            symbol:Y51A2D.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            EMBL:AL021497 RefSeq:NP_001256811.1 ProteinModelPortal:O62484
            SMR:O62484 MEROPS:C01.A37 EnsemblMetazoa:Y51A2D.1 GeneID:180204
            KEGG:cel:CELE_Y51A2D.1 UCSC:Y51A2D.1 CTD:180204 WormBase:Y51A2D.1a
            HOGENOM:HOG000019851 NextBio:908416 Uniprot:O62484
        Length = 314

 Score = 149 (57.5 bits), Expect = 1.6e-14, Sum P(2) = 1.6e-14
 Identities = 41/96 (42%), Positives = 50/96 (52%)

Query:   271 PVSVAIEAGGRAFQHYESGVF-TGECGSA--LDHGVVAVGYGTENGVD-----YWLVRNS 322
             PV+V   A G AF  Y+SGV  T +C  A  + H    VGYG EN +      +W+++NS
Sbjct:   218 PVAVYF-AAGTAFLQYKSGVLVTEDCDLAGTVWHAGAIVGYGEENDLRGRSQRFWIMKNS 276

Query:   323 WG-SDWGENGYVKLQR--NLLDTNTGKCGIAMEASY 355
             WG S WG  GYVKL R  N      G  G  ME  Y
Sbjct:   277 WGVSGWGTGGYVKLIRGKNWCGIERGAIGANMEEHY 312

 Score = 101 (40.6 bits), Expect = 1.6e-14, Sum P(2) = 1.6e-14
 Identities = 36/155 (23%), Positives = 62/155 (40%)

Query:    46 DEVMTIYQTWLA---KHGKTSNGMGHNEKRFQIF---KDNLRFIDEH-NSLNRTYKVGLN 98
             D    +YQ ++    K  +T      N+ R Q F   ++N+  ++++     R     +N
Sbjct:    35 DHPEKVYQEFVEFKKKFSRTYKSEAENQLRLQNFVKSRNNVVRLNKNAQKAGRNSNFAVN 94

Query:    99 KFADLTNEEYRAMYLG-----TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGA- 152
             +F+DLT  E            T +    +  K  +   R   +   E   + D R +   
Sbjct:    95 QFSDLTTSELHQRLSRFPPNLTENSVFHKNFKKLLGKTRTK-RQNSEFARNFDLRSQKVN 153

Query:   153 ----VNPVKDQGSCGSCWAFSTVAAVEGINKIVTG 183
                 V P+K+QG C  CW F+  A +E I  +  G
Sbjct:   154 GRYIVGPIKNQGQCACCWGFAVTAMLETIYAVNVG 188


>UNIPROTKB|F1M8U6 [details] [associations]
            symbol:F1M8U6 "Uncharacterized protein" species:10116
            "Rattus norvegicus" [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            IPI:IPI00782277 Ensembl:ENSRNOT00000055587 OMA:EREIAAW
            Uniprot:F1M8U6
        Length = 163

 Score = 190 (71.9 bits), Expect = 1.6e-14, P = 1.6e-14
 Identities = 59/175 (33%), Positives = 85/175 (48%)

Query:   190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
             ++EL+DCD K++  C GGL   A+  I   GG+++E  Y Y G    C+   +  KV   
Sbjct:     1 KKELLDCD-KMDKACLGGLPSNAYTAIKNLGGLETEDGYGYEGHFQACNFLAQMTKVYIS 59

Query:   250 DGYEDVSPFDEMSLKKAVADQP-VSVAIEAGGRAFQHYES-GVFTGECGSAL-DHGVVAV 306
             D  E +S  +E S+   +A +  +SVAI      F  Y +       C     DH V+ V
Sbjct:    60 DSVE-LSQ-NESSIAALLAQKGLISVAI----MQFHRYGTVHPLRPLCSPGFTDHSVLLV 113

Query:   307 GYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
             GYG    + + YW ++N  GSDWGE G+  L R      +G  G+   AS  V N
Sbjct:   114 GYGNRPRSNIPYWAIKNIQGSDWGEEGHYYLYRG-----SGDRGVNTMASSAVVN 163


>ZFIN|ZDB-GENE-060503-240 [details] [associations]
            symbol:tinagl1 "tubulointerstitial nephritis
            antigen-like 1" species:7955 "Danio rerio" [GO:0005044 "scavenger
            receptor activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0030414 "peptidase inhibitor activity" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0002040 "sprouting
            angiogenesis" evidence=IMP] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR008037 InterPro:IPR013128 Pfam:PF00112 Pfam:PF05375
            PROSITE:PS00524 PROSITE:PS50958 SMART:SM00201 SMART:SM00645
            ZFIN:ZDB-GENE-060503-240 GO:GO:0006955 GO:GO:0030247 GO:GO:0030414
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 GO:GO:0002040
            GO:GO:0005044 GeneTree:ENSGT00560000076599 GO:GO:0010466
            SUPFAM:SSF57283 HOVERGEN:HBG053961 MEROPS:C01.975 OMA:DNCNRCT
            EMBL:BX950864 IPI:IPI00609339 UniGene:Dr.103937
            Ensembl:ENSDART00000087096 Ensembl:ENSDART00000126228
            InParanoid:Q1LUC6 Uniprot:Q1LUC6
        Length = 471

 Score = 152 (58.6 bits), Expect = 1.9e-14, Sum P(2) = 1.9e-14
 Identities = 49/172 (28%), Positives = 83/172 (48%)

Query:    64 NGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVG-LNKFADLTNEEYRAMYLGTRSDAKRR 122
             NG    E+   + +D++  I E N  +  ++    ++F  +T +E     LGT+    R 
Sbjct:   127 NGRWECEQHACLIEDDM--IQEINRRDYGWRAANYSQFWGMTLDEGLRFRLGTKRPT-RT 183

Query:   123 LMKSKVASQRYACKAGDELPESVDWREK--GAVNPVKDQGSCGSCWAFSTVA-AVEGINK 179
             +M       +      D LP   +  +K  G ++   DQG+C + WAFST A A + I+ 
Sbjct:   184 IMNMN--EMQMNMNGNDHLPSYFNAVDKWPGKIHEPLDQGNCNASWAFSTAAVASDRISI 241

Query:   180 IVTGELI-SLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
                G +   LS Q L+ CD +   GC GG +D A+ F+ +  G+ ++  YP+
Sbjct:   242 QSMGHMTPQLSPQNLISCDTRHQDGCAGGRIDGAWWFM-RRRGVVTQDCYPF 292

 Score = 104 (41.7 bits), Expect = 1.9e-14, Sum P(2) = 1.9e-14
 Identities = 32/108 (29%), Positives = 46/108 (42%)

Query:   259 DEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFT---------GECGSALDHGVVAVGY 308
             +E  + K + D  PV   +E     F  Y+SG+F           +      H V   G+
Sbjct:   345 NENEIMKEIMDNGPVQAIMEVHEDFFV-YKSGIFRHTDVNYHKPSQYRKHATHSVRITGW 403

Query:   309 GTENGVD-----YWLVRNSWGSDWGENGYVKLQR--NLLDTNTGKCGI 349
             G E         YW+  NSWG +WGE+GY ++ R  N  D  T   G+
Sbjct:   404 GEERDYSGRTRKYWIGANSWGKNWGEDGYFRIARGVNECDIETFVIGV 451


>WB|WBGene00000789 [details] [associations]
            symbol:cpz-2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GeneTree:ENSGT00560000076599 KO:K08568 EMBL:Z81103
            HSSP:P80067 PIR:T23720 RefSeq:NP_506318.1 ProteinModelPortal:P92005
            SMR:P92005 STRING:P92005 MEROPS:C01.A41 PaxDb:P92005
            EnsemblMetazoa:M04G12.2 GeneID:179818 KEGG:cel:CELE_M04G12.2
            UCSC:M04G12.2 CTD:179818 WormBase:M04G12.2 eggNOG:NOG275763
            InParanoid:P92005 OMA:VEYWIAR NextBio:906990 Uniprot:P92005
        Length = 467

 Score = 208 (78.3 bits), Expect = 2.9e-14, P = 2.9e-14
 Identities = 57/191 (29%), Positives = 90/191 (47%)

Query:   162 CGSCWAFSTVAAV-EGINKIVTGE--LISLSEQELVDCDRKINAGCNGGLMDYAFQFI-I 217
             CGSCW F T  A+ +  N    G   +  LS QE++DC+ K N  C GG +    +   I
Sbjct:   248 CGSCWVFGTTGALNDRFNVARKGRWPMTQLSPQEIIDCNGKGN--CQGGEIGNVLEHAKI 305

Query:   218 QNGGMDSEQDYPYLGAENKCDPSRRNA-----KVVSIDGYED--VSPFDEMSLKKAVADQ 270
             Q  G+  E    Y     +C+P  R       +  S+  Y    V  + ++  +  +  +
Sbjct:   306 Q--GLVEEGCNVYRATNGECNPYHRCGSCWPNECFSLTNYTRYYVKDYGQVQGRDKIMSE 363

Query:   271 -----PVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWG 324
                  P++ AI A  +    Y  GV++ +     +H +   G+G  ENGV+YW+ RNSWG
Sbjct:   364 IKKGGPIACAIGATKKFEYEYVKGVYSEKSDLESNHIISLTGWGVDENGVEYWIARNSWG 423

Query:   325 SDWGENGYVKL 335
               WGE G+ ++
Sbjct:   424 EAWGELGWFRV 434

 Score = 137 (53.3 bits), Expect = 3.3e-06, P = 3.3e-06
 Identities = 41/133 (30%), Positives = 62/133 (46%)

Query:   122 RLMKSKVASQRYACKA--GDELPESVDWREKGAVN---PVKDQGS---CGSCWAFSTVAA 173
             ++ +SK A + +   +   ++LP   DWR    VN   P ++Q     CGSCW F T  A
Sbjct:   200 KVFESKTAPREWESSSFKSNDLPTGWDWRNVSGVNYCSPTRNQHIPVYCGSCWVFGTTGA 259

Query:   174 V-EGINKIVTGE--LISLSEQELVDCDRKINAGCNGGLMDYAFQFI-IQNGGMDSEQDYP 229
             + +  N    G   +  LS QE++DC+ K N  C GG +    +   IQ  G+  E    
Sbjct:   260 LNDRFNVARKGRWPMTQLSPQEIIDCNGKGN--CQGGEIGNVLEHAKIQ--GLVEEGCNV 315

Query:   230 YLGAENKCDPSRR 242
             Y     +C+P  R
Sbjct:   316 YRATNGECNPYHR 328


>UNIPROTKB|E1C4M3 [details] [associations]
            symbol:CTSZ "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0060441 "epithelial tube branching
            involved in lung morphogenesis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GeneTree:ENSGT00560000076599 CTD:1522 KO:K08568 OMA:QCGTCTE
            EMBL:AADN02019004 IPI:IPI00596430 RefSeq:XP_417483.3
            Ensembl:ENSGALT00000012067 GeneID:419311 KEGG:gga:419311
            Uniprot:E1C4M3
        Length = 305

 Score = 200 (75.5 bits), Expect = 4.1e-14, P = 4.1e-14
 Identities = 62/193 (32%), Positives = 90/193 (46%)

Query:   162 CGSCWAF-STVAAVEGINKIVTGELIS--LSEQELVDCDRKINAG-CNGGLMDYAFQFII 217
             CGSCWA  ST A  + IN    G   S  LS Q ++DC    NAG C GG     + +  
Sbjct:    90 CGSCWAHGSTSALADRINIKRKGAWPSAYLSVQNVIDC---ANAGSCEGGDHTGVWMYA- 145

Query:   218 QNGGMDSEQDYPYLGAENKCDPSR--------------RNAKVVSIDGYEDVSPFDEMSL 263
              + G+  E    Y     KC                  +N  +  +  Y  VS  ++M +
Sbjct:   146 HDHGIPDETCNNYQAKNQKCKKFNQCGTCVTFGECHVIKNYTLWKVADYGAVSGREKM-M 204

Query:   264 KKAVADQPVSVAIEAGGRAFQHYESGVFTGECGS-ALDHGVVAVGYGTENGVDYWLVRNS 322
              +  A+ P+S  I A  +    Y  G++T    S  ++H V   G+G ENG +YW+VRNS
Sbjct:   205 AEIYANGPISCGIMATEK-LDAYTGGLYTEYNPSPTVNHIVSVAGWGVENGTEYWIVRNS 263

Query:   323 WGSDWGENGYVKL 335
             WG  WGE G++++
Sbjct:   264 WGEPWGERGWLRI 276


>UNIPROTKB|F1PIF2 [details] [associations]
            symbol:CTSZ "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0060441 "epithelial tube branching involved
            in lung morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0005783 GO:GO:0005615 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GO:GO:0060441 GeneTree:ENSGT00560000076599 OMA:QCGTCTE
            EMBL:AAEX03014054 Ensembl:ENSCAFT00000019357 Uniprot:F1PIF2
        Length = 261

 Score = 188 (71.2 bits), Expect = 1.7e-13, P = 1.7e-13
 Identities = 59/191 (30%), Positives = 90/191 (47%)

Query:   162 CGSCWAF-STVAAVEGINKIVTGELIS--LSEQELVDCDRKINAG-CNGGLMDYAFQFII 217
             CGSCWA  ST A  + IN    G   S  LS Q ++DC    NAG C GG     + +  
Sbjct:    47 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVLDC---ANAGSCEGGNDLPVWSYAH 103

Query:   218 QNGGMDSE-QDYPYLGAE----NKCDPSRRNAKVVSIDGYE--DVSPFDEMS-----LKK 265
             ++G  D    +Y     E    N+C       +  +I  Y    V  +  +S     + +
Sbjct:   104 EHGIPDETCNNYQAKDQECNKFNQCGTCTEFKECHAIQNYTLWRVGDYGSLSGREKMMAE 163

Query:   266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWG 324
               A+ P+S  I A  +   +Y  G+       A ++H +  VG+G  +G +YW+VRNSWG
Sbjct:   164 IYANGPISCGIMATEKMV-NYTGGIHAEYQEQAYINHVISVVGWGVSDGTEYWIVRNSWG 222

Query:   325 SDWGENGYVKL 335
               WGE G++++
Sbjct:   223 EPWGERGWMRI 233


>UNIPROTKB|P05689 [details] [associations]
            symbol:CTSZ "Cathepsin Z" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:BC122603
            EMBL:X01809 IPI:IPI00708474 PIR:A29172 RefSeq:NP_001071303.1
            UniGene:Bt.4902 ProteinModelPortal:P05689 SMR:P05689 MEROPS:C01.013
            PRIDE:P05689 GeneID:404187 KEGG:bta:404187 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 InParanoid:P05689 KO:K08568
            OrthoDB:EOG42Z4QN BRENDA:3.4.18.1 NextBio:20817615 Uniprot:P05689
        Length = 304

 Score = 195 (73.7 bits), Expect = 1.8e-13, P = 1.8e-13
 Identities = 61/213 (28%), Positives = 99/213 (46%)

Query:   162 CGSCWAF-STVAAVEGINKIVTGELIS--LSEQELVDCDRKINAG-CNGGLMDYAFQFII 217
             CGSCWA  ST A  + IN    G   S  LS Q ++DC    +AG C GG  D       
Sbjct:    90 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCG---DAGSCEGG-NDLPVWEYA 145

Query:   218 QNGGMDSEQDYPYLGAENKCDPSR--------------RNAKVVSIDGYEDVSPFDEMSL 263
                G+  E    Y   + +CD                 +N  +  +  Y  +S  ++M +
Sbjct:   146 HRHGIPDETCNNYQAKDQECDKFNQCGTCTEFKECHVIKNYTLWKVGDYGSLSGREKM-M 204

Query:   264 KKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA-LDHGVVAVGYGTENGVDYWLVRNS 322
              +   + P+S  I A  +   +Y  G+++     A ++H V   G+G  +G++YW+VRNS
Sbjct:   205 AEIYTNGPISCGIMATEK-MSNYTGGIYSEYNDQAFINHIVSVAGWGVSDGMEYWIVRNS 263

Query:   323 WGSDWGENGYVKLQRNLLDTNTG-KCGIAMEAS 354
             WG  WGE+G++++  +      G +  +A+E S
Sbjct:   264 WGEPWGEHGWMRIVTSTYKGGEGARYNLAIEES 296


>UNIPROTKB|A5GFX7 [details] [associations]
            symbol:CTSZ "Cathepsin Z" species:9823 "Sus scrofa"
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GO:GO:0060441 GeneTree:ENSGT00560000076599 MEROPS:C01.013 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN
            OMA:QCGTCTE EMBL:CR956646 RefSeq:NP_001116576.1 UniGene:Ssc.16769
            ProteinModelPortal:A5GFX7 SMR:A5GFX7 STRING:A5GFX7
            Ensembl:ENSSSCT00000008249 GeneID:100141405 KEGG:ssc:100141405
            ArrayExpress:A5GFX7 Uniprot:A5GFX7
        Length = 304

 Score = 195 (73.7 bits), Expect = 1.8e-13, P = 1.8e-13
 Identities = 60/193 (31%), Positives = 87/193 (45%)

Query:   162 CGSCWAF-STVAAVEGINKIVTGELIS--LSEQELVDCDRKINAG-CNGGLMDYAFQFII 217
             CGSCWA  ST A  + IN    G   S  LS Q ++DC    NAG C GG  D       
Sbjct:    90 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCG---NAGSCEGG-DDLPVWAYA 145

Query:   218 QNGGMDSEQDYPYLGAENKCDPSRR--------------NAKVVSIDGYEDVSPFDEMSL 263
                G+  E    Y   +  CD   +              N  +  +  Y  VS  ++M +
Sbjct:   146 HRHGIPDETCNNYQAKDQVCDKFNQCGTCTEFKECHVIQNYTLWKVGDYGSVSGREKM-M 204

Query:   264 KKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA-LDHGVVAVGYGTENGVDYWLVRNS 322
              +  A+ P+S  I A  +   +Y  G++      A ++H V   G+G   G +YW+VRNS
Sbjct:   205 AEIYANGPISCGIMATEK-MSNYTGGIYAEYKDQAYINHIVSVAGWGVSGGTEYWIVRNS 263

Query:   323 WGSDWGENGYVKL 335
             WG  WGE G++++
Sbjct:   264 WGEPWGERGWMRI 276


>MGI|MGI:2137617 [details] [associations]
            symbol:Tinagl1 "tubulointerstitial nephritis antigen-like 1"
            species:10090 "Mus musculus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            [GO:0043236 "laminin binding" evidence=IPI] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 MGI:MGI:2137617
            GO:GO:0005737 GO:GO:0005576 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GO:GO:0031012 CleanEx:MM_ARG1 GO:GO:0005044
            GeneTree:ENSGT00560000076599 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OMA:DNCNRCT
            OrthoDB:EOG4BG8W0 EMBL:AB047402 EMBL:AB050626 EMBL:BC005738
            EMBL:BC018539 IPI:IPI00115458 RefSeq:NP_001161805.1
            RefSeq:NP_075965.2 UniGene:Mm.15801 ProteinModelPortal:Q99JR5
            SMR:Q99JR5 STRING:Q99JR5 PhosphoSite:Q99JR5 PaxDb:Q99JR5
            PRIDE:Q99JR5 Ensembl:ENSMUST00000030560 Ensembl:ENSMUST00000105998
            Ensembl:ENSMUST00000105999 GeneID:94242 KEGG:mmu:94242
            InParanoid:Q99JR5 NextBio:352247 Bgee:Q99JR5 Genevestigator:Q99JR5
            GermOnline:ENSMUSG00000028776 Uniprot:Q99JR5
        Length = 466

 Score = 154 (59.3 bits), Expect = 2.0e-13, Sum P(2) = 2.0e-13
 Identities = 39/110 (35%), Positives = 59/110 (53%)

Query:   138 GDELPESVDWREK--GAVNPVKDQGSCGSCWAFSTVA-AVEGINKIVTGELIS-LSEQEL 193
             G+ LP + +  EK    ++   DQG+C   WAFST A A + ++    G +   LS Q L
Sbjct:   199 GEVLPTAFEASEKWPNLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNL 258

Query:   194 VDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAE-NKCDPSRR 242
             + CD     GC GG +D A+ F+ +  G+ S+  YP+ G E N+  P+ R
Sbjct:   259 LSCDTHHQQGCRGGRLDGAWWFL-RRRGVVSDNCYPFSGREQNEASPTPR 307

 Score = 92 (37.4 bits), Expect = 2.0e-13, Sum P(2) = 2.0e-13
 Identities = 32/108 (29%), Positives = 48/108 (44%)

Query:   259 DEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFT------GECGSALDHGVVAV---GY 308
             DE  + K + +  PV   +E     F  Y+ G+++      G       HG  +V   G+
Sbjct:   348 DEKEIMKELMENGPVQALMEVHEDFFL-YQRGIYSHTPVSQGRPEQYRRHGTHSVKITGW 406

Query:   309 GTE---NG--VDYWLVRNSWGSDWGENGYVKLQR--NLLDTNTGKCGI 349
             G E   +G  + YW   NSWG  WGE G+ ++ R  N  D  T   G+
Sbjct:   407 GEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFVLGV 454


>WB|WBGene00016306 [details] [associations]
            symbol:C32B5.13 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 EMBL:FO080745
            PIR:T25581 RefSeq:NP_493866.1 UniGene:Cel.15740 HSSP:P00785
            ProteinModelPortal:P91110 SMR:P91110 EnsemblMetazoa:C32B5.13
            GeneID:183116 KEGG:cel:CELE_C32B5.13 UCSC:C32B5.13 CTD:183116
            WormBase:C32B5.13 eggNOG:KOG1543 HOGENOM:HOG000115376
            InParanoid:P91110 NextBio:919978 Uniprot:P91110
        Length = 150

 Score = 179 (68.1 bits), Expect = 2.8e-13, P = 2.8e-13
 Identities = 46/147 (31%), Positives = 78/147 (53%)

Query:   185 LISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAEN-KCDPSRRN 243
             ++S SEQ+++DC     + C   ++ + F   I+  G+ +E DYPY+G EN KC      
Sbjct:    10 VLSFSEQQIIDCGN-FTSPCQENILSHEF---IKKNGVVTEADYPYVGKENEKCKYDENK 65

Query:   244 AKVVSIDGYEDVSPFDEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFTG---ECGSAL 299
              K+   +    V    E  LK  + +  P    ++A   +F +Y++G+++    ECG A 
Sbjct:    66 IKLWPTNMLL-VGNLPETLLKLFIKEHGPGYFRMKAPP-SFFNYKTGIYSPTQEECGKAT 123

Query:   300 D-HGVVAVGYGTENGVDYWLVRNSWGS 325
             D   +  VGYG E G +YW+V+ S+G+
Sbjct:   124 DARSLTIVGYGIEGGQNYWIVKGSFGT 150


>UNIPROTKB|F1MW68 [details] [associations]
            symbol:CTSZ "Cathepsin Z" species:9913 "Bos taurus"
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0060441
            GeneTree:ENSGT00560000076599 IPI:IPI00708474 UniGene:Bt.4902
            OMA:QCGTCTE EMBL:DAAA02036315 PRIDE:F1MW68
            Ensembl:ENSBTAT00000025007 Uniprot:F1MW68
        Length = 304

 Score = 193 (73.0 bits), Expect = 3.2e-13, P = 3.2e-13
 Identities = 61/213 (28%), Positives = 99/213 (46%)

Query:   162 CGSCWAF-STVAAVEGINKIVTGELIS--LSEQELVDCDRKINAG-CNGGLMDYAFQFII 217
             CGSCWA  ST A  + IN    G   S  LS Q ++DC    +AG C GG  D       
Sbjct:    90 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVLDCG---DAGSCEGG-NDLPVWEYA 145

Query:   218 QNGGMDSEQDYPYLGAENKCDPSR--------------RNAKVVSIDGYEDVSPFDEMSL 263
                G+  E    Y   + +CD                 +N  +  +  Y  +S  ++M +
Sbjct:   146 HRHGIPDETCNNYQAKDQECDKFNQCGTCTEFKECHVIKNYTLWKVGDYGSLSGREKM-M 204

Query:   264 KKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA-LDHGVVAVGYGTENGVDYWLVRNS 322
              +   + P+S  I A  +   +Y  G+++     A ++H V   G+G  +G++YW+VRNS
Sbjct:   205 AEIYTNGPISCGIMATEK-MSNYTGGIYSEYNDQAFINHIVSVAGWGVSDGMEYWIVRNS 263

Query:   323 WGSDWGENGYVKLQRNLLDTNTG-KCGIAMEAS 354
             WG  WGE+G++++  +      G +  +A+E S
Sbjct:   264 WGEPWGEHGWMRIVTSTYKGGEGARYNLAIEES 296


>UNIPROTKB|F1SVA2 [details] [associations]
            symbol:TINAGL1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005615 "extracellular space" evidence=IDA] [GO:0043236
            "laminin binding" evidence=IEA] [GO:0031012 "extracellular matrix"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044
            "scavenger receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005737
            GO:GO:0005615 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0031012 GO:GO:0005044 GeneTree:ENSGT00560000076599
            OMA:DNCNRCT EMBL:CU856262 Ensembl:ENSSSCT00000003995 Uniprot:F1SVA2
        Length = 467

 Score = 153 (58.9 bits), Expect = 3.3e-13, Sum P(2) = 3.3e-13
 Identities = 39/110 (35%), Positives = 59/110 (53%)

Query:   138 GDELPESVDWREK--GAVNPVKDQGSCGSCWAFSTVA-AVEGINKIVTGELIS-LSEQEL 193
             G+ LP + +  EK    ++   DQG+C   WAFST A A + ++    G +   LS Q L
Sbjct:   200 GEVLPRAFEASEKWPNLIHDPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNL 259

Query:   194 VDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAE-NKCDPSRR 242
             + CD     GC GG +D A+ F+ +  G+ S+  YP+ G E N+  P+ R
Sbjct:   260 LSCDTHNQQGCQGGRLDGAWWFL-RRRGVVSDHCYPFSGHERNEAGPAPR 308

 Score = 91 (37.1 bits), Expect = 3.3e-13, Sum P(2) = 3.3e-13
 Identities = 27/89 (30%), Positives = 41/89 (46%)

Query:   263 LKKAVADQPVSVAIEAGGRAFQHYESGVFT------GECGSALDHGVVAV---GYGTENG 313
             +K+ + + PV   +E     F  Y+SG+++      G       HG  +V   G+G E  
Sbjct:   354 MKELMENGPVQALMEVHEDFFL-YQSGIYSHTPVSHGRPERYRRHGTHSVKITGWGEETL 412

Query:   314 VD-----YWLVRNSWGSDWGENGYVKLQR 337
              D     YW   NSWG  WGE G+ ++ R
Sbjct:   413 PDGRMLKYWTAANSWGPGWGERGHFRIVR 441


>ZFIN|ZDB-GENE-041010-139 [details] [associations]
            symbol:ctsz "cathepsin Z" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0001525 "angiogenesis"
            evidence=IMP] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 ZFIN:ZDB-GENE-041010-139 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0001525
            CTD:1522 HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568
            OrthoDB:EOG42Z4QN UniGene:Dr.935 eggNOG:NOG275763 EMBL:BC083369
            IPI:IPI00483065 RefSeq:NP_001006043.1 ProteinModelPortal:Q5XJD4
            SMR:Q5XJD4 STRING:Q5XJD4 GeneID:450022 KEGG:dre:450022
            InParanoid:Q5XJD4 NextBio:20833005 ArrayExpress:Q5XJD4
            Uniprot:Q5XJD4
        Length = 301

 Score = 191 (72.3 bits), Expect = 5.3e-13, P = 5.3e-13
 Identities = 60/212 (28%), Positives = 97/212 (45%)

Query:   162 CGSCWAF-STVAAVEGIN--KIVTGELISLSEQELVDCDRKINAG-CNGGLMDYAFQFII 217
             CGSCWA  ST A  + IN  +        LS Q ++DC    +AG C+GG     +++  
Sbjct:    81 CGSCWAHGSTSALADRINIKRKAAWPSAYLSVQNVIDCG---DAGSCSGGDHSGVWEYA- 136

Query:   218 QNGGMDSEQDYPYLGAENKCDPSR--------------RNAKVVSIDGYEDVSPFDEMSL 263
              N G+  E    Y   +  C P                +N  +  +  Y   S  D+M  
Sbjct:   137 HNKGIPDETCNNYQAKDQDCKPFNQCGTCTTFGVCNIVKNFTLWKVGDYGSASGLDKMKA 196

Query:   264 KKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA-LDHGVVAVGYGT-ENGVDYWLVRN 321
             +   +  P+S  I A  +    Y  G+++       ++H V   G+G  ENGV++W+VRN
Sbjct:   197 E-IYSGGPISCGIMATDK-LDAYTGGLYSEYVQEPYINHIVSVAGWGVDENGVEFWVVRN 254

Query:   322 SWGSDWGENGYVKLQRNLLDTNTG-KCGIAME 352
             SWG  WGE G++++  +     +G +  +A+E
Sbjct:   255 SWGEPWGEKGWLRIVTSAYKGGSGSQYNLAIE 286

 Score = 116 (45.9 bits), Expect = 0.00032, P = 0.00032
 Identities = 41/130 (31%), Positives = 58/130 (44%)

Query:   120 KRRLMKSKVASQRYACKAGDELPESVDWRE-KGA--VNPVKDQGS---CGSCWAF-STVA 172
             +R L   K   + Y      ELP+  DWR  KG   V+  ++Q     CGSCWA  ST A
Sbjct:    33 RRNLQGVKTGPRPYESMNLKELPKEWDWRNIKGVNYVSTTRNQHIPQYCGSCWAHGSTSA 92

Query:   173 AVEGIN--KIVTGELISLSEQELVDCDRKINAG-CNGGLMDYAFQFIIQNGGMDSEQDYP 229
               + IN  +        LS Q ++DC    +AG C+GG     +++   N G+  E    
Sbjct:    93 LADRINIKRKAAWPSAYLSVQNVIDCG---DAGSCSGGDHSGVWEYA-HNKGIPDETCNN 148

Query:   230 YLGAENKCDP 239
             Y   +  C P
Sbjct:   149 YQAKDQDCKP 158


>UNIPROTKB|F1RKR7 [details] [associations]
            symbol:CTSH "Cathepsin H light chain" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] InterPro:IPR013128 GO:GO:0008234 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            GeneTree:ENSGT00660000095458 EMBL:CU326382
            Ensembl:ENSSSCT00000001985 ArrayExpress:F1RKR7 Uniprot:F1RKR7
        Length = 197

 Score = 174 (66.3 bits), Expect = 9.9e-13, P = 9.9e-13
 Identities = 44/131 (33%), Positives = 70/131 (53%)

Query:    52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAM 111
             +++W+ +H K  + +     R Q+F  N R I+ HN+ N T+K+GLN+F+D++ +E R  
Sbjct:    35 FKSWMVQHQKKYS-LEEYHHRLQVFVSNWRKINAHNAGNHTFKLGLNQFSDMSFDEIRHK 93

Query:   112 YLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGA-VNPVKDQGSCGSCWAF-- 168
             YL +                 Y    G   P S+DWR+KG  V+PVK+Q S  S W    
Sbjct:    94 YLWSEPQ------NCSATKGNYLRGTGP-YPPSMDWRKKGNFVSPVKNQNS--SWWTAPR 144

Query:   169 -STVAAVEGIN 178
              ST+ A +G++
Sbjct:   145 TSTITAAKGVS 155


>UNIPROTKB|E2RNP9 [details] [associations]
            symbol:TINAG "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0007155 "cell adhesion" evidence=IEA]
            [GO:0005604 "basement membrane" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044
            "scavenger receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 Pfam:PF01033
            PROSITE:PS00524 PROSITE:PS50958 SMART:SM00201 SMART:SM00645
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0007155
            GO:GO:0005604 GO:GO:0005044 GeneTree:ENSGT00560000076599 CTD:27283
            OMA:WGQLTSS EMBL:AAEX03008403 RefSeq:XP_538969.2
            ProteinModelPortal:E2RNP9 Ensembl:ENSCAFT00000003638 GeneID:481848
            KEGG:cfa:481848 NextBio:20856579 Uniprot:E2RNP9
        Length = 476

 Score = 124 (48.7 bits), Expect = 1.1e-12, Sum P(2) = 1.1e-12
 Identities = 42/142 (29%), Positives = 67/142 (47%)

Query:    98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPES--VDWREKGAVNP 155
             ++F  +T EE     LGT   +   L  +++ +   +  A  +LPE     ++  G  + 
Sbjct:   177 SQFWGMTLEEGFKYRLGTLPPSPMLLSMNEMTA---SLPATTDLPEFFIASYKWPGWTHG 233

Query:   156 VKDQGSCGSCWAFSTVA-AVEGINKIVTGELIS-LSEQELVDCDRKINAGCNGGLMDYAF 213
               DQ +C + WAFST + A + I     G   + LS Q L+ C  K   GCN G +D A+
Sbjct:   234 PLDQKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQNLISCCAKNRHGCNSGSIDRAW 293

Query:   214 QFIIQNGGMDSEQDYPYLGAEN 235
              F+ +  G+ S   YP    +N
Sbjct:   294 WFL-RKRGLVSHACYPLFKDQN 314

 Score = 118 (46.6 bits), Expect = 1.1e-12, Sum P(2) = 1.1e-12
 Identities = 32/110 (29%), Positives = 52/110 (47%)

Query:   255 VSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVF-----TGECGSALD----HGVVA 305
             VS  +   +K+ + + PV   ++     F HY++G++     T E          H V  
Sbjct:   357 VSSNETEIMKEIMQNGPVQAIMQVH-EDFFHYKTGIYRHITRTNEESRKYQKLQTHAVKL 415

Query:   306 VGYGTENGVD-----YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIA 350
              G+GT  G       +W+  NSWG  WGENGY ++ R + +++  K  IA
Sbjct:   416 TGWGTLKGAQGQKEKFWIAANSWGISWGENGYFRILRGVNESDIEKLIIA 465

 Score = 37 (18.1 bits), Expect = 0.00025, Sum P(2) = 0.00025
 Identities = 8/30 (26%), Positives = 15/30 (50%)

Query:   341 DTNTGKCGIAMEASYPVKNSQNSAKPKPHS 370
             D N    G AM +    +  +++ KP P++
Sbjct:   312 DQNATNYGCAMASRSDGRGKRHATKPCPNN 341


>RGD|70956 [details] [associations]
            symbol:Tinagl1 "tubulointerstitial nephritis antigen-like 1"
           species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
           activity" evidence=IEA] [GO:0005576 "extracellular region"
           evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA;ISO] [GO:0006508
           "proteolysis" evidence=IEA] [GO:0006955 "immune response"
           evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
           evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
           [GO:0031012 "extracellular matrix" evidence=IEA;ISO] [GO:0043236
           "laminin binding" evidence=IEA;ISO] InterPro:IPR000668
           InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
           PROSITE:PS50958 SMART:SM00201 SMART:SM00645 RGD:70956 GO:GO:0005737
           GO:GO:0005576 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
           GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
           GO:GO:0031012 GO:GO:0005044 eggNOG:NOG310046 HOGENOM:HOG000241342
           HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OrthoDB:EOG4BG8W0
           EMBL:AB050717 IPI:IPI00190428 RefSeq:NP_446034.1 UniGene:Rn.1256
           ProteinModelPortal:Q9EQT5 PRIDE:Q9EQT5 GeneID:94174 KEGG:rno:94174
           UCSC:RGD:70956 InParanoid:Q9EQT5 NextBio:617830 ArrayExpress:Q9EQT5
           Genevestigator:Q9EQT5 GermOnline:ENSRNOG00000013179 Uniprot:Q9EQT5
        Length = 467

 Score = 147 (56.8 bits), Expect = 1.3e-12, Sum P(2) = 1.3e-12
 Identities = 37/107 (34%), Positives = 56/107 (52%)

Query:   138 GDELPESVDWREK--GAVNPVKDQGSCGSCWAFSTVA-AVEGINKIVTGELIS-LSEQEL 193
             G+ LP + +  EK    ++   DQG+C   WAFST A A + ++    G +   LS Q L
Sbjct:   199 GEVLPTAFEASEKWPNLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNL 258

Query:   194 VDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
             + CD     GC GG +D A+ F+ +  G+ S+  YP+ G E   + S
Sbjct:   259 LSCDTHHQKGCRGGRLDGAWWFL-RRRGVVSDNCYPFSGREQNDEAS 304

 Score = 92 (37.4 bits), Expect = 1.3e-12, Sum P(2) = 1.3e-12
 Identities = 32/108 (29%), Positives = 48/108 (44%)

Query:   259 DEMSLKKAVADQ-PVSVAIEAGGRAFQHYESGVFT------GECGSALDHGVVAV---GY 308
             DE  + K + +  PV   +E     F  Y+ G+++      G       HG  +V   G+
Sbjct:   349 DEKEIMKELMENGPVQALMEVHEDFFL-YQRGIYSHTPVSQGRPEQYRRHGTHSVKITGW 407

Query:   309 GTE---NG--VDYWLVRNSWGSDWGENGYVKLQR--NLLDTNTGKCGI 349
             G E   +G  + YW   NSWG  WGE G+ ++ R  N  D  T   G+
Sbjct:   408 GEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGINECDIETFVLGV 455

WARNING:  HSPs involving 48 database sequences were not reported due to the
          limiting value of parameter B = 250.


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.316   0.132   0.400    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      372       359   0.00081  117 3  11 22  0.43    34
                                                     34  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  298
  No. of states in DFA:  619 (66 KB)
  Total size of DFA:  266 KB (2140 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  28.42u 0.10s 28.52t   Elapsed:  00:00:01
  Total cpu time:  28.47u 0.10s 28.57t   Elapsed:  00:00:01
  Start:  Sat May 11 11:01:52 2013   End:  Sat May 11 11:01:53 2013
WARNINGS ISSUED:  2

Back to top