BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>042468
MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF
KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS
FRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQEL
VDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGY
EDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTAD
DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA
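
As a quick consistency check, the query record above can be parsed and its length compared with the "(346 letters)" that the raw BLAST report gives for the same query. This is a minimal Python sketch; the helper name `read_fasta` is illustrative and not part of any BLAST tooling.

```python
def read_fasta(text):
    """Return (identifier, sequence) for a single-record FASTA string."""
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("expected a FASTA header line starting with '>'")
    # Header text after '>' is the identifier; remaining lines are sequence.
    return lines[0][1:], "".join(lines[1:])


# Query record copied from the "Query Sequence" section of this report.
QUERY_FASTA = """>042468
MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF
KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS
FRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQEL
VDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGY
EDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTAD
DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA
"""

seq_id, seq = read_fasta(QUERY_FASTA)
print(seq_id, len(seq))
```

The printed length should agree with the letter count reported after `Query=` in the raw output.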

High Scoring Gene Products

Symbol | Full name | Information | P value
SAG12 | senescence-associated gene 12 | protein from Arabidopsis thaliana | 1.6e-102
CEP1 | cysteine endopeptidase 1 | protein from Arabidopsis thaliana | 1.1e-96
AT2G34080 | | protein from Arabidopsis thaliana | 3.8e-94
RD21B | responsive to dehydration 21B | protein from Arabidopsis thaliana | 3.0e-92
AT2G27420 | | protein from Arabidopsis thaliana | 5.0e-92
AT1G29090 | | protein from Arabidopsis thaliana | 1.9e-90
AT3G49340 | | protein from Arabidopsis thaliana | 2.5e-88
XCP1 | xylem cysteine peptidase 1 | protein from Arabidopsis thaliana | 5.3e-88
AT3G19390 | | protein from Arabidopsis thaliana | 1.8e-87
CEP3 | cysteine endopeptidase 3 | protein from Arabidopsis thaliana | 4.7e-87
RD21A | responsive to dehydration 21A | protein from Arabidopsis thaliana | 3.3e-86
XCP2 | AT1G20850 | protein from Arabidopsis thaliana | 1.4e-85
AT3G19400 | | protein from Arabidopsis thaliana | 4.4e-84
AT1G29080 | | protein from Arabidopsis thaliana | 7.4e-82
XBCP3 | xylem bark cysteine peptidase 3 | protein from Arabidopsis thaliana | 6.0e-80
AT1G06260 | | protein from Arabidopsis thaliana | 2.0e-79
CP2 | cysteine protease 2 | protein from Arabidopsis thaliana | 5.4e-79
CP1 | cysteine protease 1 | protein from Arabidopsis thaliana | 8.7e-79
AT4G23520 | | protein from Arabidopsis thaliana | 4.8e-78
Cp1 | Cysteine proteinase-1 | protein from Drosophila melanogaster | 6.1e-71
CTSL2 | Uncharacterized protein | protein from Gallus gallus | 1.4e-69
AT1G29110 | | protein from Arabidopsis thaliana | 1.8e-69
Ssc.54235 | Uncharacterized protein | protein from Sus scrofa | 9.2e-68
wu:fb37b09 | | gene_product from Danio rerio | 9.2e-68
ctsl.1 | cathepsin L.1 | gene_product from Danio rerio | 1.2e-67
zgc:174855 | | gene_product from Danio rerio | 3.1e-67
Ctsl | cathepsin L | protein from Mus musculus | 5.1e-67
zgc:174153 | | gene_product from Danio rerio | 8.2e-67
AT3G43960 | | protein from Arabidopsis thaliana | 1.1e-66
cprB | cysteine proteinase 2 | gene from Dictyostelium discoideum | 1.1e-66
Ctsl1 | cathepsin L1 | gene from Rattus norvegicus | 1.7e-66
CTSL1 | Cathepsin L1 | protein from Homo sapiens | 2.2e-66
CTSL1 | CTSL1 protein | protein from Bos taurus | 3.6e-66
ctsl1b | cathepsin L, 1 b | gene_product from Danio rerio | 3.6e-66
CTSL1 | Cathepsin L1 | protein from Bos taurus | 5.8e-66
CTSL1 | Cathepsin L1 | protein from Sus scrofa | 1.5e-65
Ctss | cathepsin S | protein from Mus musculus | 2.0e-65
ctsl1a | cathepsin L, 1 a | gene_product from Danio rerio | 2.0e-65
CTSL1 | Cathepsin L1 | protein from Canis lupus familiaris | 2.5e-65
ctsll | cathepsin L, like | gene_product from Danio rerio | 2.5e-65
cprF | cysteine proteinase 6 | gene from Dictyostelium discoideum | 4.2e-65
CTSL2 | Cathepsin L2 | protein from Bos taurus | 5.2e-65
cprC | cysteine proteinase 3 | gene from Dictyostelium discoideum | 2.0e-63
Cys | Crustapain | protein from Pandalus borealis | 4.2e-63
CTSS | Uncharacterized protein | protein from Sus scrofa | 1.1e-62
CTSS | Cathepsin S | protein from Bos taurus | 3.0e-62
cfaD | peptidase C1A family protein | gene from Dictyostelium discoideum | 4.8e-62
RGD1308751 | similar to Cathepsin L precursor (Major excreted protein) (MEP) | gene from Rattus norvegicus | 4.8e-62
cpl-1 | | gene from Caenorhabditis elegans | 1.3e-61
Ctsll3 | cathepsin L-like 3 | gene from Rattus norvegicus | 4.3e-61
CTSS | Cathepsin S | protein from Canis lupus familiaris | 5.5e-61
CTSS | Cathepsin S | protein from Canis lupus familiaris | 7.1e-61
CTSL2 | Cathepsin L2 | protein from Homo sapiens | 7.1e-61
CTSS | Cathepsin S | protein from Homo sapiens | 1.2e-60
CTSK | Cathepsin K | protein from Homo sapiens | 1.5e-60
ctssb.2 | cathepsin S, b.2 | gene_product from Danio rerio | 5.0e-60
CTSK | Cathepsin K | protein from Sus scrofa | 8.1e-60
CTSK | Cathepsin K | protein from Canis lupus familiaris | 1.0e-59
CTSK | Cathepsin K | protein from Canis lupus familiaris | 1.0e-59
CTSL2 | Uncharacterized protein | protein from Gallus gallus | 1.7e-59
DDB_G0272298 | | gene from Dictyostelium discoideum | 2.2e-59
Ctsk | cathepsin K | gene from Rattus norvegicus | 2.2e-59
CTSK | Cathepsin K | protein from Bos taurus | 2.7e-59
CTSL1 | Cathepsin L1 | protein from Canis lupus familiaris | 4.5e-59
CTSL1 | Cathepsin L1 | protein from Gallus gallus | 9.3e-59
P83654 | Ervatamin-C | protein from Tabernaemontana divaricata | 9.3e-59
CTSL | Cathepsin L1 | protein from Ovis aries | 9.3e-59
P83443 | Macrodontain-1 | protein from Pseudananas sagenarius | 1.2e-58
Cat-1 | Cathepsin L-like proteinase | protein from Fasciola hepatica | 1.5e-58
ctsk | cathepsin K | gene_product from Danio rerio | 1.9e-58
Ctsk | cathepsin K | protein from Mus musculus | 5.1e-58
cprH | cysteine proteinase 8 | gene from Dictyostelium discoideum | 3.6e-57
LOC420160 | Uncharacterized protein | protein from Gallus gallus | 3.6e-57
ctssb.1 | cathepsin S, b.1 | gene_product from Danio rerio | 5.9e-57
Ctss | cathepsin S | gene from Rattus norvegicus | 2.0e-56
AT3G45310 | | protein from Arabidopsis thaliana | 2.5e-56
D3ZZR3 | Uncharacterized protein | protein from Rattus norvegicus | 5.3e-56
Ctsj | cathepsin J | protein from Mus musculus | 1.4e-55
Ctsj | cathepsin J | gene from Rattus norvegicus | 1.4e-55
CTSK | Cathepsin K | protein from Gallus gallus | 2.3e-55
CG12163 | | protein from Drosophila melanogaster | 4.8e-55
ALP | aleurain-like protease | protein from Arabidopsis thaliana | 1.3e-54
4930486L24Rik | RIKEN cDNA 4930486L24 gene | protein from Mus musculus | 2.6e-54
ctssa | cathepsin S, a | gene_product from Danio rerio | 3.4e-54
ctskl | cathepsin K, like | gene_product from Danio rerio | 4.3e-54
Testin | testin gene | gene from Rattus norvegicus | 1.1e-53
cprE | cysteine proteinase 5 | gene from Dictyostelium discoideum | 1.9e-53
J9P7C5 | Uncharacterized protein | protein from Canis lupus familiaris | 1.9e-53
ctsh | cathepsin H | gene_product from Danio rerio | 1.9e-53
MGC114246 | similar to cathepsin R | gene from Rattus norvegicus | 2.4e-53
CTSH | Pro-cathepsin H | protein from Bos taurus | 4.9e-53
Ctsh | cathepsin H | gene from Rattus norvegicus | 8.0e-53
Ctsh | cathepsin H | protein from Mus musculus | 1.0e-52
cprG | cysteine proteinase 7 | gene from Dictyostelium discoideum | 1.7e-52
CTSS | Uncharacterized protein | protein from Gallus gallus | 1.7e-52
CTSH | Pro-cathepsin H | protein from Sus scrofa | 2.1e-52
Ctsr | cathepsin R | protein from Mus musculus | 2.1e-52
cprD | cysteine proteinase 4 | gene from Dictyostelium discoideum | 2.7e-52
DDB_G0291191 | cysteine protease | gene from Dictyostelium discoideum | 9.2e-52
cprA | cysteine proteinase 1 | gene from Dictyostelium discoideum | 1.2e-51

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  042468
        (346 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2152445 - symbol:SAG12 "senescence-associated ...  1016  1.6e-102  1
TAIR|locus:2157712 - symbol:CEP1 "cysteine endopeptidase ...   961  1.1e-96   1
TAIR|locus:2055440 - symbol:AT2G34080 species:3702 "Arabi...   937  3.8e-94   1
TAIR|locus:2167821 - symbol:RD21B "responsive to dehydrat...   919  3.0e-92   1
TAIR|locus:2038588 - symbol:AT2G27420 species:3702 "Arabi...   917  5.0e-92   1
TAIR|locus:2029924 - symbol:AT1G29090 species:3702 "Arabi...   902  1.9e-90   1
TAIR|locus:2082881 - symbol:AT3G49340 species:3702 "Arabi...   882  2.5e-88   1
TAIR|locus:2122113 - symbol:XCP1 "xylem cysteine peptidas...   879  5.3e-88   1
TAIR|locus:2090614 - symbol:AT3G19390 species:3702 "Arabi...   874  1.8e-87   1
TAIR|locus:505006391 - symbol:CEP3 "cysteine endopeptidas...   870  4.7e-87   1
TAIR|locus:2825832 - symbol:RD21A "responsive to dehydrat...   862  3.3e-86   1
TAIR|locus:2030427 - symbol:XCP2 "xylem cysteine peptidas...   856  1.4e-85   1
TAIR|locus:2090629 - symbol:AT3G19400 species:3702 "Arabi...   842  4.4e-84   1
TAIR|locus:2029934 - symbol:AT1G29080 species:3702 "Arabi...   821  7.4e-82   1
TAIR|locus:2024362 - symbol:XBCP3 "xylem bark cysteine pe...   803  6.0e-80   1
TAIR|locus:2038515 - symbol:AT1G06260 species:3702 "Arabi...   798  2.0e-79   1
TAIR|locus:2128253 - symbol:AT4G11320 species:3702 "Arabi...   794  5.4e-79   1
TAIR|locus:2128243 - symbol:AT4G11310 species:3702 "Arabi...   792  8.7e-79   1
TAIR|locus:2117979 - symbol:AT4G23520 species:3702 "Arabi...   785  4.8e-78   1
FB|FBgn0013770 - symbol:Cp1 "Cysteine proteinase-1" speci...   718  6.1e-71   1
UNIPROTKB|F1NYJ1 - symbol:CTSL2 "Uncharacterized protein"...   705  1.4e-69   1
TAIR|locus:2030027 - symbol:AT1G29110 species:3702 "Arabi...   704  1.8e-69   1
UNIPROTKB|F1S4J6 - symbol:Ssc.54235 "Cathepsin L1" specie...   688  9.2e-68   1
ZFIN|ZDB-GENE-030131-572 - symbol:wu:fb37b09 "wu:fb37b09"...   688  9.2e-68   1
ZFIN|ZDB-GENE-040718-61 - symbol:ctsl.1 "cathepsin L.1" s...   687  1.2e-67   1
ZFIN|ZDB-GENE-071004-74 - symbol:zgc:174855 "zgc:174855" ...   683  3.1e-67   1
MGI|MGI:88564 - symbol:Ctsl "cathepsin L" species:10090 "...   681  5.1e-67   1
ZFIN|ZDB-GENE-080215-7 - symbol:zgc:174153 "zgc:174153" s...   679  8.2e-67   1
TAIR|locus:2097104 - symbol:AT3G43960 species:3702 "Arabi...   678  1.1e-66   1
DICTYBASE|DDB_G0279799 - symbol:cprB "cysteine proteinase...   563  1.1e-66   2
RGD|2448 - symbol:Ctsl1 "cathepsin L1" species:10116 "Rat...   676  1.7e-66   1
UNIPROTKB|P07711 - symbol:CTSL1 "Cathepsin L1" species:96...   675  2.2e-66   1
UNIPROTKB|A4IFS7 - symbol:CTSL1 "CTSL1 protein" species:9...   673  3.6e-66   1
ZFIN|ZDB-GENE-980526-285 - symbol:ctsl1b "cathepsin L, 1 ...   673  3.6e-66   1
UNIPROTKB|P25975 - symbol:CTSL1 "Cathepsin L1" species:99...   671  5.8e-66   1
UNIPROTKB|Q28944 - symbol:CTSL1 "Cathepsin L1" species:98...   667  1.5e-65   1
MGI|MGI:107341 - symbol:Ctss "cathepsin S" species:10090 ...   666  2.0e-65   1
ZFIN|ZDB-GENE-030131-106 - symbol:ctsl1a "cathepsin L, 1 ...   666  2.0e-65   1
UNIPROTKB|Q9GL24 - symbol:CTSL1 "Cathepsin L1" species:96...   665  2.5e-65   1
ZFIN|ZDB-GENE-041010-76 - symbol:ctsll "cathepsin L, like...   665  2.5e-65   1
DICTYBASE|DDB_G0279185 - symbol:cprF "cysteine proteinase...   545  4.2e-65   2
UNIPROTKB|Q5E998 - symbol:CTSL2 "Cathepsin L2" species:99...   662  5.2e-65   1
DICTYBASE|DDB_G0283867 - symbol:cprC "cysteine proteinase...   647  2.0e-63   1
UNIPROTKB|Q86GF7 - symbol:Cys "Crustapain" species:6703 "...   644  4.2e-63   1
UNIPROTKB|F1SS93 - symbol:CTSS "Uncharacterized protein" ...   640  1.1e-62   1
UNIPROTKB|P25326 - symbol:CTSS "Cathepsin S" species:9913...   636  3.0e-62   1
DICTYBASE|DDB_G0281605 - symbol:cfaD "peptidase C1A famil...   634  4.8e-62   1
RGD|1308751 - symbol:RGD1308751 "similar to Cathepsin L p...   634  4.8e-62   1
WB|WBGene00000776 - symbol:cpl-1 species:6239 "Caenorhabd...   630  1.3e-61   1
RGD|1560071 - symbol:Ctsll3 "cathepsin L-like 3" species:...   625  4.3e-61   1
UNIPROTKB|Q8HY81 - symbol:CTSS "Cathepsin S" species:9615...   624  5.5e-61   1
UNIPROTKB|F1PAK0 - symbol:CTSS "Cathepsin S" species:9615...   623  7.1e-61   1
UNIPROTKB|O60911 - symbol:CTSL2 "Cathepsin L2" species:96...   623  7.1e-61   1
UNIPROTKB|P25774 - symbol:CTSS "Cathepsin S" species:9606...   621  1.2e-60   1
UNIPROTKB|P43235 - symbol:CTSK "Cathepsin K" species:9606...   620  1.5e-60   1
ZFIN|ZDB-GENE-050626-55 - symbol:ctssb.2 "cathepsin S, b....   615  5.0e-60   1
UNIPROTKB|Q9GLE3 - symbol:CTSK "Cathepsin K" species:9823...   613  8.1e-60   1
UNIPROTKB|G1K2A7 - symbol:CTSK "Cathepsin K" species:9615...   612  1.0e-59   1
UNIPROTKB|Q3ZKN1 - symbol:CTSK "Cathepsin K" species:9615...   612  1.0e-59   1
UNIPROTKB|F1NEC8 - symbol:CTSL2 "Uncharacterized protein"...   610  1.7e-59   1
DICTYBASE|DDB_G0272298 - symbol:DDB_G0272298 species:4468...   609  2.2e-59   1
RGD|61810 - symbol:Ctsk "cathepsin K" species:10116 "Ratt...   609  2.2e-59   1
UNIPROTKB|Q5E968 - symbol:CTSK "Cathepsin K" species:9913...   608  2.7e-59   1
UNIPROTKB|F1PMM9 - symbol:CTSL1 "Cathepsin L1" species:96...   606  4.5e-59   1
UNIPROTKB|P09648 - symbol:CTSL1 "Cathepsin L1" species:90...   603  9.3e-59   1
UNIPROTKB|P83654 - symbol:P83654 "Ervatamin-C" species:52...   603  9.3e-59   1
UNIPROTKB|Q10991 - symbol:CTSL "Cathepsin L1" species:994...   603  9.3e-59   1
UNIPROTKB|P83443 - symbol:P83443 "Macrodontain-1" species...   602  1.2e-58   1
UNIPROTKB|Q24940 - symbol:Cat-1 "Cathepsin L-like protein...   601  1.5e-58   1
ZFIN|ZDB-GENE-001205-4 - symbol:ctsk "cathepsin K" specie...   600  1.9e-58   1
MGI|MGI:107823 - symbol:Ctsk "cathepsin K" species:10090 ...   596  5.1e-58   1
DICTYBASE|DDB_G0278401 - symbol:cprH "cysteine proteinase...   588  3.6e-57   1
UNIPROTKB|F1NZ37 - symbol:LOC420160 "Uncharacterized prot...   588  3.6e-57   1
ZFIN|ZDB-GENE-050522-559 - symbol:ctssb.1 "cathepsin S, b...   586  5.9e-57   1
RGD|621513 - symbol:Ctss "cathepsin S" species:10116 "Rat...   581  2.0e-56   1
TAIR|locus:2078312 - symbol:AT3G45310 species:3702 "Arabi...   580  2.5e-56   1
UNIPROTKB|D3ZZR3 - symbol:D3ZZR3 "Uncharacterized protein...   577  5.3e-56   1
MGI|MGI:1349426 - symbol:Ctsj "cathepsin J" species:10090...   573  1.4e-55   1
RGD|69241 - symbol:Ctsj "cathepsin J" species:10116 "Ratt...   573  1.4e-55   1
UNIPROTKB|Q90686 - symbol:CTSK "Cathepsin K" species:9031...   571  2.3e-55   1
FB|FBgn0260462 - symbol:CG12163 species:7227 "Drosophila ...   568  4.8e-55   1
TAIR|locus:2175088 - symbol:ALP "aleurain-like protease" ...   564  1.3e-54   1
MGI|MGI:1922258 - symbol:4930486L24Rik "RIKEN cDNA 493048...   561  2.6e-54   1
ZFIN|ZDB-GENE-040426-1583 - symbol:ctssa "cathepsin S, a"...   560  3.4e-54   1
ZFIN|ZDB-GENE-050208-336 - symbol:ctskl "cathepsin K, lik...   559  4.3e-54   1
RGD|708447 - symbol:Testin "testin gene" species:10116 "R...   555  1.1e-53   1
DICTYBASE|DDB_G0272815 - symbol:cprE "cysteine proteinase...   553  1.9e-53   1
UNIPROTKB|J9P7C5 - symbol:J9P7C5 "Uncharacterized protein...   553  1.9e-53   1
ZFIN|ZDB-GENE-030131-3539 - symbol:ctsh "cathepsin H" spe...   553  1.9e-53   1
RGD|1562210 - symbol:MGC114246 "similar to cathepsin R" s...   552  2.4e-53   1
UNIPROTKB|Q3T0I2 - symbol:CTSH "Pro-cathepsin H" species:...   549  4.9e-53   1
RGD|2447 - symbol:Ctsh "cathepsin H" species:10116 "Rattu...   547  8.0e-53   1
MGI|MGI:107285 - symbol:Ctsh "cathepsin H" species:10090 ...   546  1.0e-52   1
DICTYBASE|DDB_G0279187 - symbol:cprG "cysteine proteinase...   544  1.7e-52   1
UNIPROTKB|H9KYW5 - symbol:CTSS "Uncharacterized protein" ...   544  1.7e-52   1
UNIPROTKB|O46427 - symbol:CTSH "Pro-cathepsin H" species:...   543  2.1e-52   1
MGI|MGI:1861723 - symbol:Ctsr "cathepsin R" species:10090...   543  2.1e-52   1
DICTYBASE|DDB_G0278721 - symbol:cprD "cysteine proteinase...   542  2.7e-52   1
DICTYBASE|DDB_G0291191 - symbol:DDB_G0291191 "cysteine pr...   537  9.2e-52   1
DICTYBASE|DDB_G0290957 - symbol:cprA "cysteine proteinase...   536  1.2e-51   1

WARNING:  Descriptions of 198 database sequences were not reported due to the
          limiting value of parameter V = 100.
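
The one-line hit descriptions above have a regular layout: a `DB|accession` identifier, a `symbol:` field, a truncated description, then three trailing columns (High Score, smallest sum probability P(N), and N). The following Python sketch pulls those fields out of such lines. The `Hit` type and `parse_hit_line` function are hypothetical names introduced here, and the pattern is inferred only from the lines in this report, not from a format specification.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    database: str
    accession: str
    symbol: str
    score: int
    p_value: float
    n: int


# Leading part of a hit line: "DB|accession - symbol:NAME ..."
_HEAD = re.compile(r"^(?P<db>[^|\s]+)\|(?P<acc>\S+) - symbol:(?P<symbol>\S+)")


def parse_hit_line(line: str) -> Optional[Hit]:
    """Parse one hit-summary line; return None if it does not look like one."""
    try:
        # The last three whitespace-separated tokens are Score, P(N), N.
        head, score, p_value, n = line.rstrip().rsplit(None, 3)
        match = _HEAD.match(head)
        if match is None:
            return None
        return Hit(match["db"], match["acc"], match["symbol"],
                   int(score), float(p_value), int(n))
    except ValueError:
        return None


# Two lines copied from the table above.
sample = [
    'TAIR|locus:2152445 - symbol:SAG12 "senescence-associated ...  1016  1.6e-102  1',
    'DICTYBASE|DDB_G0279799 - symbol:cprB "cysteine proteinase...   563  1.1e-66   2',
]
hits = [parse_hit_line(line) for line in sample]
for hit in hits:
    print(hit)
```

Rows with N = 2, such as cprB, carry a sum probability over two segment pairs, which is why a High Score of 563 can sit among single-segment hits scoring around 678.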


>TAIR|locus:2152445
            symbol:SAG12 "senescence-associated gene 12" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
            [GO:0007568 "aging" evidence=IEP;TAS] [GO:0010150 "leaf senescence"
            evidence=IEP;TAS] [GO:0010282 "senescence-associated vacuole"
            evidence=IDA] [GO:0009817 "defense response to fungus, incompatible
            interaction" evidence=IEP] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:CP002688 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0010150 GO:GO:0009817 EMBL:AB016870
            HSSP:O65039 OMA:NDEQALM EMBL:AF370131 EMBL:AY040073 IPI:IPI00544181
            RefSeq:NP_568651.1 UniGene:At.75256 UniGene:At.7710
            ProteinModelPortal:Q9FJ47 SMR:Q9FJ47 IntAct:Q9FJ47 STRING:Q9FJ47
            MEROPS:C01.117 PRIDE:Q9FJ47 ProMEX:Q9FJ47 EnsemblPlants:AT5G45890.1
            GeneID:834629 KEGG:ath:AT5G45890 TAIR:At5g45890 InParanoid:Q9FJ47
            PhylomeDB:Q9FJ47 ProtClustDB:CLSN2917735 ArrayExpress:Q9FJ47
            Genevestigator:Q9FJ47 GO:GO:0010282 Uniprot:Q9FJ47
        Length = 346

 Score = 1016 (362.7 bits), Expect = 1.6e-102, P = 1.6e-102
 Identities = 190/324 (58%), Positives = 240/324 (74%)

Query:    27 SRTL-NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLG 85
             SR L N+  M +RH  WM ++GRVY D  E+  R+ +FK NVE I   N+    + +KL 
Sbjct:    25 SRPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLA 84

Query:    86 INEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS-FRYENAS---VPASIDWRKKGAVT 141
             +N+FAD TN+EFR+   G+K  + ++ S   T +S FRY+N S   +P S+DWRKKGAVT
Sbjct:    85 VNQFADLTNDEFRSMYTGFKG-VSALSSQSQTKMSPFRYQNVSSGALPVSVDWRKKGAVT 143

Query:   142 GVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAF 201
              +K+QG CGCCWAFSAVAA+EG   I   KL SLSEQ+LVDCDT+  D GCEGGLMD AF
Sbjct:   144 PIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN--DFGCEGGLMDTAF 201

Query:   202 EFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVS 261
             E I +  GL TE+ YPYK  D +CN K+ NP A  I+GYEDVP N+E ALMKAVA+QPVS
Sbjct:   202 EHIKATGGLTTESNYPYKGEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAHQPVS 261

Query:   262 VAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGY 321
             V I+  G DFQFYSSGVFTG+C T LDH VTA+GYG + +G+KYW++KNSWGT WGE+GY
Sbjct:   262 VGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGY 321

Query:   322 IRMQRDIDAKEGLCGIAMQASYPT 345
             +R+Q+D+  K+GLCG+AM+ASYPT
Sbjct:   322 MRIQKDVKDKQGLCGLAMKASYPT 345
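
The Identities and Positives counts in a hit header can be recounted from the alignment itself. In the middle line of each block, a letter marks an identical pair, `+` marks a positively scoring substitution, and a blank marks any other pair or a gap. Below is a minimal Python sketch over the last block of the SAG12 alignment; the function name `tally_midline` is hypothetical.

```python
def tally_midline(query, midline, subject):
    """Return (identities, positives) for one aligned block."""
    if not (len(query) == len(midline) == len(subject)):
        raise ValueError("the three alignment lines must have equal length")
    # An identity is a column where the midline repeats the shared residue.
    identities = sum(
        1 for q, m, s in zip(query, midline, subject)
        if q == s == m and q != "-"
    )
    # Positives are identities plus columns marked '+'.
    positives = identities + midline.count("+")
    return identities, positives


# Last block of the SAG12 alignment (query positions 322-345).
query   = "IRMQRDIDAKEGLCGIAMQASYPT"
midline = "+R+Q+D+  K+GLCG+AM+ASYPT"
subject = "MRIQKDVKDKQGLCGLAMKASYPT"

identities, positives = tally_midline(query, midline, subject)
print(identities, positives, len(query))
```

Applying the same tally to every block of a hit and summing should reproduce the header totals (190 and 240 over 324 columns for this hit); that full sum is not checked here.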


>TAIR|locus:2157712
            symbol:CEP1 "cysteine endopeptidase 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005783 EMBL:CP002688
            GenomeReviews:BA000015_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 PROSITE:PS00014
            EMBL:AB024031 MEROPS:I29.003 EMBL:HM367092 EMBL:AY091087
            IPI:IPI00516991 RefSeq:NP_568722.1 UniGene:At.7918 HSSP:O65039
            ProteinModelPortal:Q9FGR9 SMR:Q9FGR9 PaxDb:Q9FGR9 PRIDE:Q9FGR9
            EnsemblPlants:AT5G50260.1 GeneID:835091 KEGG:ath:AT5G50260
            TAIR:At5g50260 HOGENOM:HOG000230773 InParanoid:Q9FGR9 KO:K16292
            OMA:WHSKKYH PhylomeDB:Q9FGR9 ProtClustDB:CLSN2689970
            Genevestigator:Q9FGR9 Uniprot:Q9FGR9
        Length = 361

 Score = 961 (343.3 bits), Expect = 1.1e-96, P = 1.1e-96
 Identities = 189/310 (60%), Positives = 221/310 (71%)

Query:    37 ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
             E +E W + +  V R   EK  RF +FK NV++I   N K  +K YKL +N+F D T+EE
Sbjct:    36 ELYERWRSHH-TVARSLEEKAKRFNVFKHNVKHIHETNKK--DKSYKLKLNKFGDMTSEE 92

Query:    97 FRAPRNGYK-RRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWA 154
             FR    G   +     +  +    SF Y N + +P S+DWRK GAVT VK+QGQCG CWA
Sbjct:    93 FRRTYAGSNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVKNQGQCGSCWA 152

Query:   155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
             FS V A+EGIN I T+KLTSLSEQELVDCDT+ ++QGC GGLMD AFEFI    GL +E 
Sbjct:   153 FSTVVAVEGINQIRTKKLTSLSEQELVDCDTN-QNQGCNGGLMDLAFEFIKEKGGLTSEL 211

Query:   215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
              YPYKASD +C+  + N     I G+EDVP N+E  LMKAVANQPVSVAIDA GSDFQFY
Sbjct:   212 VYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFY 271

Query:   275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
             S GVFTG+CGTEL+HGV  VGYGT  DGTKYW+VKNSWG  WGE GYIRMQR I  KEGL
Sbjct:   272 SEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGL 331

Query:   335 CGIAMQASYP 344
             CGIAM+ASYP
Sbjct:   332 CGIAMEASYP 341
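
The percentages printed next to Identities and Positives appear to be truncated rather than rounded: for the CEP1 hit, 189/310 is about 60.97%, yet the header shows 60%. The Python sketch below reproduces the printed values with integer division. The function name is illustrative, and the truncation rule is inferred from the numbers in this report, not taken from WU-BLAST documentation.

```python
def percent_truncated(matches, aligned_length):
    """Whole-number percentage, discarding the fractional part."""
    return (100 * matches) // aligned_length


# Counts from the CEP1 hit header: Identities = 189/310, Positives = 221/310.
identity_pct = percent_truncated(189, 310)
positive_pct = percent_truncated(221, 310)
print(identity_pct, positive_pct)

# Conventional rounding would give a different identity figure for the same counts.
print(round(100 * 189 / 310))
```

When comparing hits by percent identity, it is safer to recompute the ratio from the raw counts than to rely on the truncated figure.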


>TAIR|locus:2055440
            symbol:AT2G34080 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002685 GenomeReviews:CT485783_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 MEROPS:I29.003 EMBL:AC002341
            HOGENOM:HOG000230773 HSSP:P53634 IPI:IPI00530325 PIR:B84752
            RefSeq:NP_565780.1 UniGene:At.28613 UniGene:At.37859
            ProteinModelPortal:O22961 SMR:O22961 EnsemblPlants:AT2G34080.1
            GeneID:817969 KEGG:ath:AT2G34080 TAIR:At2g34080 InParanoid:O22961
            OMA:SENDYSY PhylomeDB:O22961 ProtClustDB:CLSN2688064
            ArrayExpress:O22961 Genevestigator:O22961 Uniprot:O22961
        Length = 345

 Score = 937 (334.9 bits), Expect = 3.8e-94, P = 3.8e-94
 Identities = 184/346 (53%), Positives = 240/346 (69%)

Query:    10 LVLAAILVL---GVWAPQSWSRTL--NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFK 64
             +VL  +L++   G    Q+ SRT+   + +M ++HE WMA++ R YRD  EK MR  +FK
Sbjct:     5 MVLVTVLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDVFK 64

Query:    65 ENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETT--DVSFR 122
             +N+++I +FN K  NK YKLG+NEFAD TNEEF A   G K  L  V  S+     +S +
Sbjct:    65 KNLKFIENFNKKG-NKSYKLGVNEFADWTNEEFLAIHTGLKG-LTEVSPSKVVAKTISSQ 122

Query:   123 YENAS--VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQEL 180
               N S  V  S DWR +GAVT VK QGQCGCCWAFSAVAA+EG+  I    L SLSEQ+L
Sbjct:   123 TWNVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQL 182

Query:   181 VDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGY 240
             +DCD    D+GC+GG+M DAF +++ N+G+A+E  Y Y+ SDG C +  A P AA+ISG+
Sbjct:   183 LDCDRE-YDRGCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGC-RSNARP-AARISGF 239

Query:   241 EDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTAD 300
             + VPSNNE AL++AV+ QPVSV++DA+G  F  YS GV+ G CGT  +H VT VGYGT+ 
Sbjct:   240 QTVPSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQ 299

Query:   301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
             DGTKYWL KNSWG TWGE GYIR++RD+   +G+CG+A  A YP A
Sbjct:   300 DGTKYWLAKNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345
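
Each hit header prints Expect and P as the same number. Under the usual Poisson model for chance hits, the two are related by P = 1 - exp(-E), so they coincide whenever E is very small. The Python sketch below illustrates this for the AT2G34080 hit; the function name is hypothetical, and `math.expm1` is used because the naive formula underflows to zero at this magnitude.

```python
import math


def p_from_expect(expect):
    """P = 1 - exp(-E), computed stably for very small E."""
    return -math.expm1(-expect)


expect = 3.8e-94                   # Expect value of the AT2G34080 hit
naive = 1.0 - math.exp(-expect)    # exp(-E) rounds to 1.0, so this is 0.0
stable = p_from_expect(expect)

print(naive, stable)
print(p_from_expect(1.0))          # for E = 1 the two values differ clearly
```

The distinction only matters for weak hits: near the reporting threshold of 0.001 used in this job, P and E still agree to about three significant figures.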


>TAIR|locus:2167821
            symbol:RD21B "responsive to dehydration 21B" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS] [GO:0005773
            "vacuole" evidence=IDA] [GO:0009651 "response to salt stress"
            evidence=IEP] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005829 "cytosol" evidence=IDA] [GO:0052541 "plant-type cell
            wall cellulose metabolic process" evidence=RCA] [GO:0052546 "cell
            wall pectin metabolic process" evidence=RCA] [GO:0005783
            "endoplasmic reticulum" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005783 GO:GO:0005829 EMBL:CP002688
            GO:GO:0005773 GO:GO:0009651 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AB008267 HSSP:O65039
            HOGENOM:HOG000230773 KO:K01376 InterPro:IPR000118 Pfam:PF00396
            SMART:SM00277 ProtClustDB:CLSN2688498 EMBL:AY062608 EMBL:AY114661
            IPI:IPI00520971 RefSeq:NP_568620.1 UniGene:At.24130 SMR:Q9FMH8
            IntAct:Q9FMH8 STRING:Q9FMH8 MEROPS:C01.A12
            EnsemblPlants:AT5G43060.1 GeneID:834321 KEGG:ath:AT5G43060
            TAIR:At5g43060 InParanoid:Q9FMH8 OMA:ENSEASL Genevestigator:Q9FMH8
            Uniprot:Q9FMH8
        Length = 463

 Score = 919 (328.6 bits), Expect = 3.0e-92, P = 3.0e-92
 Identities = 181/321 (56%), Positives = 227/321 (70%)

Query:    31 NDATMNERHEMWMAQYGRVYRDN----AEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
             +D+ +   +E WM ++G+   +     AEK+ RF+IFK+N+ +I   N K  N  YKLG+
Sbjct:    42 SDSEVERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTK--NLSYKLGL 99

Query:    87 NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---NASVPASIDWRKKGAVTGV 143
               FAD TNEE+R+   G K   P+ R  +T+D   RY+     ++P S+DWRK+GAV  V
Sbjct:   100 TRFADLTNEEYRSMYLGAK---PTKRVLKTSD---RYQARVGDALPDSVDWRKEGAVADV 153

Query:   144 KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEF 203
             KDQG CG CWAFS + A+EGIN I T  L SLSEQELVDCDTS  +QGC GGLMD AFEF
Sbjct:   154 KDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEF 212

Query:   204 IISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVA 263
             II N G+ TEA YPYKA+DG C++   N     I  YEDVP N+EA+L KA+A+QP+SVA
Sbjct:   213 IIKNGGIDTEADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVA 272

Query:   264 IDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
             I+A G  FQ YSSGVF G CGTELDHGV AVGYGT ++G  YW+V+NSWG  WGE+GYI+
Sbjct:   273 IEAGGRAFQLYSSGVFDGLCGTELDHGVVAVGYGT-ENGKDYWIVRNSWGNRWGESGYIK 331

Query:   324 MQRDIDAKEGLCGIAMQASYP 344
             M R+I+A  G CGIAM+ASYP
Sbjct:   332 MARNIEAPTGKCGIAMEASYP 352


>TAIR|locus:2038588
            symbol:AT2G27420 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002685
            GenomeReviews:CT485783_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC006232
            MEROPS:I29.003 OMA:EEFRATH HOGENOM:HOG000230773 HSSP:P53634
            ProtClustDB:CLSN2688476 EMBL:AY064033 EMBL:AY096388 IPI:IPI00539752
            PIR:F84672 RefSeq:NP_565649.1 UniGene:At.27094
            ProteinModelPortal:Q9ZQH7 SMR:Q9ZQH7 PRIDE:Q9ZQH7
            EnsemblPlants:AT2G27420.1 GeneID:817287 KEGG:ath:AT2G27420
            TAIR:At2g27420 InParanoid:Q9ZQH7 PhylomeDB:Q9ZQH7
            ArrayExpress:Q9ZQH7 Genevestigator:Q9ZQH7 Uniprot:Q9ZQH7
        Length = 348

 Score = 917 (327.9 bits), Expect = 5.0e-92, P = 5.0e-92
 Identities = 179/328 (54%), Positives = 234/328 (71%)

Query:    29 TLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK-PYKLGIN 87
             +L +A+  E+HE WMA++ RVY D  EK  RF IFK+N+E++ +FN    NK  YK+ IN
Sbjct:    25 SLFEASAIEKHEQWMARFNRVYSDETEKRNRFNIFKKNLEFVQNFN--MNNKITYKVDIN 82

Query:    88 EFADQTNEEFRAPRNGYK-----RRLPSVRSSETTDVSFRYENASVPA-SIDWRKKGAVT 141
             EF+D T+EEFRA   G        R+ ++ S + T V FRY N S    S+DWR++GAVT
Sbjct:    83 EFSDLTDEEFRATHTGLVVPEAITRISTLSSGKNT-VPFRYGNVSDNGESMDWRQEGAVT 141

Query:   142 GVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAF 201
              VK QG+CG CWAFSAVAA+EGI  IT  +L SLSEQ+L+DCD    +QGC GG+M  AF
Sbjct:   142 PVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQLLDCDRD-YNQGCRGGIMSKAF 200

Query:   202 EFIISNKGLATEAKYPYKASDGSCNKKEANPS---AAKISGYEDVPSNNEAALMKAVANQ 258
             E+II N+G+ TE  YPY+ S  +C+      S   AA ISGYE VP NNE AL++AV+ Q
Sbjct:   201 EYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATISGYETVPMNNEEALLQAVSQQ 260

Query:   259 PVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGE 318
             PVSV I+ +G+ F+ YS GVF G+CGT+L H VT VGYG +++GTKYW+VKNSWG TWGE
Sbjct:   261 PVSVGIEGTGAAFRHYSGGVFNGECGTDLHHAVTIVGYGMSEEGTKYWVVKNSWGETWGE 320

Query:   319 NGYIRMQRDIDAKEGLCGIAMQASYPTA 346
             NGY+R++RD+DA +G+CG+A+ A YP A
Sbjct:   321 NGYMRIKRDVDAPQGMCGLAILAFYPLA 348


>TAIR|locus:2029924
            symbol:AT1G29090 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002684 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            HOGENOM:HOG000230773 HSSP:P53634 ProtClustDB:CLSN2688064
            EMBL:BT004146 IPI:IPI00545702 RefSeq:NP_564321.2 UniGene:At.40814
            ProteinModelPortal:Q84W75 SMR:Q84W75 MEROPS:C01.A15
            EnsemblPlants:AT1G29090.1 GeneID:839784 KEGG:ath:AT1G29090
            TAIR:At1g29090 InParanoid:Q84W75 OMA:SIRGHED PhylomeDB:Q84W75
            ArrayExpress:Q84W75 Genevestigator:Q84W75 Uniprot:Q84W75
        Length = 355

 Score = 902 (322.6 bits), Expect = 1.9e-90, P = 1.9e-90
 Identities = 177/344 (51%), Positives = 234/344 (68%)

Query:    10 LVLAAILVLGVWAPQSWSR-TLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
             LV   IL + +   Q+ SR T ++  + E H+ WM ++ RVY D  EK+MRF +FK+N++
Sbjct:    17 LVSLTILSMNLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLK 76

Query:    69 YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDV---SFRYEN 125
             +I  FN K  ++ YKLG+NEFAD T EEF A   G K  +  + SSE  D    S+ +  
Sbjct:    77 FIEKFNKKG-DRTYKLGVNEFADWTREEFIATHTGLKG-VNGIPSSEFVDEMIPSWNWNV 134

Query:   126 ASVPA--SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
             + V    + DWR +GAVT VK QGQCGCCWAFS+VAA+EG+  I    L SLSEQ+L+DC
Sbjct:   135 SDVAGRETKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDC 194

Query:   184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
             D    D GC GG+M DAF +II N+G+A+EA YPY+A++G+C +    PSA  I G++ V
Sbjct:   195 DRE-RDNGCNGGIMSDAFSYIIKNRGIASEASYPYQAAEGTC-RYNGKPSAW-IRGFQTV 251

Query:   244 PSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGTADDG 302
             PSNNE AL++AV+ QPVSV+IDA G  F  YS GV+    CGT ++H VT VGYGT+ +G
Sbjct:   252 PSNNERALLEAVSKQPVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTSPEG 311

Query:   303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
              KYWL KNSWG TWGENGYIR++RD+   +G+CG+A  A YP A
Sbjct:   312 IKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 355


>TAIR|locus:2082881
            symbol:AT3G49340 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002686
            GenomeReviews:BA000014_GR EMBL:AC012329 EMBL:AL132956
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 MEROPS:I29.003 HOGENOM:HOG000230773 HSSP:P07711
            KO:K01376 IPI:IPI00520642 PIR:T45839 RefSeq:NP_566920.1
            UniGene:At.53854 ProteinModelPortal:Q9SG15 SMR:Q9SG15
            EnsemblPlants:AT3G49340.1 GeneID:824096 KEGG:ath:AT3G49340
            TAIR:At3g49340 InParanoid:Q9SG15 OMA:PQNDEEA PhylomeDB:Q9SG15
            ProtClustDB:CLSN2688476 Genevestigator:Q9SG15 Uniprot:Q9SG15
        Length = 341

 Score = 882 (315.5 bits), Expect = 2.5e-88, P = 2.5e-88
 Identities = 168/323 (52%), Positives = 225/323 (69%)

Query:    30 LNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEF 89
             L +A+  E+HE WM+++ RVY D++EK  RF+IF  N++++ S N    NK Y L +NEF
Sbjct:    26 LFEASAVEKHEQWMSRFNRVYSDDSEKTSRFEIFTNNLKFVESINMNT-NKTYTLDVNEF 84

Query:    90 ADQTNEEFRAPRNGY-----KRRLPSVRSSETTDVSFRYENASVPA-SIDWRKKGAVTGV 143
             +D T+EEF+A   G        R+ +  S ET  VSFRYEN      S+DW ++GAVT V
Sbjct:    85 SDLTDEEFKARYTGLVVPEGMTRISTTDSHET--VSFRYENVGETGESMDWIQEGAVTSV 142

Query:   144 KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEF 203
             K Q QCGCCWAFSAVAA+EG+  I   +L SLSEQ+L+DC T  E+ GC GG+M  AF++
Sbjct:   143 KHQQQCGCCWAFSAVAAVEGMTKIANGELVSLSEQQLLDCST--ENNGCGGGIMWKAFDY 200

Query:   204 IISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVA 263
             I  N+G+ TE  YPY+ +  +C       +AA ISGYE VP N+E AL+KAV+ QPVSVA
Sbjct:   201 IKENQGITTEDNYPYQGAQQTCESNHL--AAATISGYETVPQNDEEALLKAVSQQPVSVA 258

Query:   264 IDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
             I+ SG +F  YS G+F G+CGT+L H VT VGYG +++G KYWL+KNSWG +WGENGY+R
Sbjct:   259 IEGSGYEFIHYSGGIFNGECGTQLTHAVTIVGYGVSEEGIKYWLLKNSWGESWGENGYMR 318

Query:   324 MQRDIDAKEGLCGIAMQASYPTA 346
             + RD+D+ +G+CG+A  A YP A
Sbjct:   319 IMRDVDSPQGMCGLASLAYYPVA 341


>TAIR|locus:2122113
            symbol:XCP1 "xylem cysteine peptidase 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0000325 "plant-type vacuole" evidence=IDA] [GO:0005634
            "nucleus" evidence=IDA] [GO:0010623 "developmental programmed cell
            death" evidence=IMP] [GO:0010413 "glucuronoxylan metabolic process"
            evidence=RCA] [GO:0045492 "xylan biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005886
            GO:GO:0005634 EMBL:CP002687 GenomeReviews:CT486007_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0000325
            EMBL:AL022604 EMBL:AL161587 GO:GO:0010623 MEROPS:I29.003
            HOGENOM:HOG000230773 EMBL:AF191027 EMBL:AK117394 EMBL:BT005179
            IPI:IPI00532220 PIR:T06122 RefSeq:NP_567983.1 UniGene:At.2280
            UniGene:At.67622 ProteinModelPortal:O65493 SMR:O65493 STRING:O65493
            PaxDb:O65493 PRIDE:O65493 EnsemblPlants:AT4G35350.1 GeneID:829688
            KEGG:ath:AT4G35350 GeneFarm:5033 TAIR:At4g35350 InParanoid:O65493
            KO:K16290 OMA:FEVFREN PhylomeDB:O65493 ProtClustDB:CLSN2689772
            Genevestigator:O65493 Uniprot:O65493
        Length = 355

 Score = 879 (314.5 bits), Expect = 5.3e-88, P = 5.3e-88
 Identities = 170/316 (53%), Positives = 220/316 (69%)

Query:    31 NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
             N   + E  E WM+++ + Y+   EK  RF++F+EN+ +I   NN+  +  Y LG+NEFA
Sbjct:    43 NTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINS--YWLGLNEFA 100

Query:    91 DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQC 149
             D T+EEF+    G  +  P          +FRY + + +P S+DWRKKGAV  VKDQGQC
Sbjct:   101 DLTHEEFKGRYLGLAK--PQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQC 158

Query:   150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
             G CWAFS VAA+EGIN ITT  L+SLSEQEL+DCDT+  + GC GGLMD AF++IIS  G
Sbjct:   159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTF-NSGCNGGLMDYAFQYIISTGG 217

Query:   210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
             L  E  YPY   +G C +++ +     ISGYEDVP N++ +L+KA+A+QPVSVAI+ASG 
Sbjct:   218 LHKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGR 277

Query:   270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
             DFQFY  GVF G+CGT+LDHGV AVGYG++  G+ Y +VKNSWG  WGE G+IRM+R+  
Sbjct:   278 DFQFYKGGVFNGKCGTDLDHGVAAVGYGSSK-GSDYVIVKNSWGPRWGEKGFIRMKRNTG 336

Query:   330 AKEGLCGIAMQASYPT 345
               EGLCGI   ASYPT
Sbjct:   337 KPEGLCGINKMASYPT 352


>TAIR|locus:2090614
            symbol:AT3G19390 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA;ISS] [GO:0000041 "transition metal ion
            transport" evidence=RCA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:CP002686 GenomeReviews:BA000014_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AB025624
            MEROPS:I29.003 OMA:KAMDQKC HSSP:O65039 HOGENOM:HOG000230773
            InterPro:IPR000118 Pfam:PF00396 SMART:SM00277 EMBL:AY062725
            EMBL:AY093350 IPI:IPI00520189 RefSeq:NP_566633.1 UniGene:At.27473
            ProteinModelPortal:Q9LT78 SMR:Q9LT78 IntAct:Q9LT78 STRING:Q9LT78
            PaxDb:Q9LT78 PRIDE:Q9LT78 EnsemblPlants:AT3G19390.1 GeneID:821473
            KEGG:ath:AT3G19390 TAIR:At3g19390 InParanoid:Q9LT78
            PhylomeDB:Q9LT78 ProtClustDB:CLSN2917188 Genevestigator:Q9LT78
            Uniprot:Q9LT78
        Length = 452

 Score = 874 (312.7 bits), Expect = 1.8e-87, P = 1.8e-87
 Identities = 174/342 (50%), Positives = 233/342 (68%)

Query:    10 LVLAAILV-LGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
             L+ + +L+ L + +  +   T N+A     +E W+ +  + Y    EKE RF+IFK+N++
Sbjct:    13 LIFSVLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEKERRFEIFKDNLK 72

Query:    69 YIASFNNKARNKPYKLGINEFADQTNEEFRAP--RNGYKR-RLPSVRSSETTDVSFRYE- 124
             ++   ++   N+ Y++G+  FAD TN+EFRA   R+  +R R+P V+  +     + Y+ 
Sbjct:    73 FVEE-HSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKMERTRVP-VKGEK-----YLYKV 125

Query:   125 NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
               S+P +IDWR KGAV  VKDQG CG CWAFSA+ A+EGIN I T +L SLSEQELVDCD
Sbjct:   126 GDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCD 185

Query:   185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGS-CNKKEANPSAAKISGYEDV 243
             TS  D GC GGLMD AF+FII N G+ TE  YPY A+D + CN  + N     I GYEDV
Sbjct:   186 TSYND-GCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNSDKKNTRVVTIDGYEDV 244

Query:   244 PSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT 303
             P N+E +L KA+ANQP+SVAI+A G  FQ Y+SGVFTG CGT LDHGV AVGYG+ + G 
Sbjct:   245 PQNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSLDHGVVAVGYGS-EGGQ 303

Query:   304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
              YW+V+NSWG+ WGE+GY +++R+I    G CG+AM ASYPT
Sbjct:   304 DYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPT 345


>TAIR|locus:505006391
            symbol:CEP3 "cysteine endopeptidase 3" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005783 EMBL:CP002686
            GenomeReviews:BA000014_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 PROSITE:PS00014
            EMBL:AL049659 HSSP:O65039 HOGENOM:HOG000230773 KO:K16292
            EMBL:AK119026 IPI:IPI00525150 PIR:T06707 RefSeq:NP_566901.1
            UniGene:At.3162 ProteinModelPortal:Q9STL5 SMR:Q9STL5 MEROPS:C01.A02
            PRIDE:Q9STL5 EnsemblPlants:AT3G48350.1 GeneID:823993
            KEGG:ath:AT3G48350 TAIR:At3g48350 InParanoid:Q9STL5 OMA:DITHHEF
            PhylomeDB:Q9STL5 ProtClustDB:CLSN2917387 Genevestigator:Q9STL5
            Uniprot:Q9STL5
        Length = 364

 Score = 870 (311.3 bits), Expect = 4.7e-87, P = 4.7e-87
 Identities = 172/310 (55%), Positives = 209/310 (67%)

Query:    39 HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
             +E W   +  V R + E   RF +F+ NV ++   N K  NKPYKL IN FAD T+ EFR
Sbjct:    38 YERWRGHHS-VSRASHEAIKRFNVFRHNVLHVHRTNKK--NKPYKLKINRFADITHHEFR 94

Query:    99 APRNGYK-RRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
             +   G   +    +R  +     F YEN + VP+S+DWR+KGAVT VK+Q  CG CWAFS
Sbjct:    95 SSYAGSNVKHHRMLRGPKRGSGGFMYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFS 154

Query:   157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
              VAA+EGIN I T KL SLSEQELVDCDT  E+QGC GGLM+ AFEFI +N G+ TE  Y
Sbjct:   155 TVAAVEGINKIRTNKLVSLSEQELVDCDTE-ENQGCAGGLMEPAFEFIKNNGGIKTEETY 213

Query:   217 PYKASDGS-CNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
             PY +SD   C           I G+E VP N+E  L+KAVA+QPVSVAIDA  SDFQ YS
Sbjct:   214 PYDSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYS 273

Query:   276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
              GVF G+CGT+L+HGV  VGYG   +GTKYW+V+NSWG  WGE GY+R++R I   EG C
Sbjct:   274 EGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRC 333

Query:   336 GIAMQASYPT 345
             GIAM+ASYPT
Sbjct:   334 GIAMEASYPT 343


>TAIR|locus:2825832
            symbol:RD21A "responsive to dehydration 21A" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;IMP]
            [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISS;IDA;IMP] [GO:0009507 "chloroplast" evidence=IDA]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0008233 "peptidase
            activity" evidence=IDA] [GO:0048046 "apoplast" evidence=IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005829 "cytosol"
            evidence=RCA] [GO:0009506 "plasmodesma" evidence=IDA] [GO:0050832
            "defense response to fungus" evidence=IMP] [GO:0006096 "glycolysis"
            evidence=RCA] [GO:0006833 "water transport" evidence=RCA]
            [GO:0006972 "hyperosmotic response" evidence=RCA] [GO:0007030
            "Golgi organization" evidence=RCA] [GO:0009266 "response to
            temperature stimulus" evidence=RCA] [GO:0009651 "response to salt
            stress" evidence=RCA] [GO:0015996 "chlorophyll catabolic process"
            evidence=RCA] [GO:0042744 "hydrogen peroxide catabolic process"
            evidence=RCA] [GO:0046686 "response to cadmium ion" evidence=RCA]
            [GO:0009414 "response to water deprivation" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0009506 GO:GO:0009507 GO:GO:0005773
            GO:GO:0050832 GO:GO:0048046 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC083835
            HOGENOM:HOG000230773 KO:K01376 InterPro:IPR000118 Pfam:PF00396
            SMART:SM00277 UniGene:At.43549 EMBL:D13043 EMBL:AY072130
            EMBL:AY133781 IPI:IPI00530094 PIR:JN0719 RefSeq:NP_564497.1
            UniGene:At.47599 UniGene:At.71705 ProteinModelPortal:P43297
            SMR:P43297 IntAct:P43297 STRING:P43297 MEROPS:C01.064 PaxDb:P43297
            PRIDE:P43297 ProMEX:P43297 EnsemblPlants:AT1G47128.1 GeneID:841122
            KEGG:ath:AT1G47128 TAIR:At1g47128 InParanoid:P43297 OMA:EAWLVKH
            PhylomeDB:P43297 ProtClustDB:CLSN2688498 Genevestigator:P43297
            GermOnline:AT1G47128 Uniprot:P43297
        Length = 462

 Score = 862 (308.5 bits), Expect = 3.3e-86, P = 3.3e-86
 Identities = 165/319 (51%), Positives = 217/319 (68%)

Query:    31 NDATMNERHEMWMAQYGRVYRDNA--EKEMRFKIFKENVEYIASFNNKARNKPYKLGINE 88
             ++A +   +E W+ ++G+    N+  EK+ RF+IFK+N+ ++   N K  N  Y+LG+  
Sbjct:    42 SEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEK--NLSYRLGLTR 99

Query:    89 FADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---NASVPASIDWRKKGAVTGVKD 145
             FAD TN+E+R+   G K      R +     S RYE      +P SIDWRKKGAV  VKD
Sbjct:   100 FADLTNDEYRSKYLGAKMEKKGERRT-----SLRYEARVGDELPESIDWRKKGAVAEVKD 154

Query:   146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
             QG CG CWAFS + A+EGIN I T  L +LSEQELVDCDTS  ++GC GGLMD AFEFII
Sbjct:   155 QGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFII 213

Query:   206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
              N G+ T+  YPYK  DG+C++   N     I  YEDVP+ +E +L KAVA+QP+S+AI+
Sbjct:   214 KNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIE 273

Query:   266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
             A G  FQ Y SG+F G CGT+LDHGV AVGYGT ++G  YW+V+NSWG +WGE+GY+RM 
Sbjct:   274 AGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLRMA 332

Query:   326 RDIDAKEGLCGIAMQASYP 344
             R+I +  G CGIA++ SYP
Sbjct:   333 RNIASSSGKCGIAIEPSYP 351


>TAIR|locus:2030427
            symbol:XCP2 "xylem cysteine peptidase 2" species:3702
            "Arabidopsis thaliana" [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009507 "chloroplast" evidence=ISM] [GO:0008233 "peptidase
            activity" evidence=ISS] [GO:0005618 "cell wall" evidence=IDA]
            [GO:0010623 "developmental programmed cell death" evidence=IMP]
            [GO:0010075 "regulation of meristem growth" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0005886 GO:GO:0005618 GO:GO:0005773
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AC069251 EMBL:AC007369 GO:GO:0010623
            OMA:YKEIPEG HOGENOM:HOG000230773 KO:K16290 EMBL:AF191028
            EMBL:BT004822 IPI:IPI00526722 PIR:A86341 RefSeq:NP_564126.1
            UniGene:At.21316 ProteinModelPortal:Q9LM66 SMR:Q9LM66 IntAct:Q9LM66
            STRING:Q9LM66 MEROPS:C01.120 PaxDb:Q9LM66 PRIDE:Q9LM66
            ProMEX:Q9LM66 EnsemblPlants:AT1G20850.1 GeneID:838677
            KEGG:ath:AT1G20850 GeneFarm:5034 TAIR:At1g20850 InParanoid:Q9LM66
            PhylomeDB:Q9LM66 ProtClustDB:CLSN2917031 Genevestigator:Q9LM66
            GermOnline:AT1G20850 Uniprot:Q9LM66
        Length = 356

 Score = 856 (306.4 bits), Expect = 1.4e-85, P = 1.4e-85
 Identities = 163/310 (52%), Positives = 216/310 (69%)

Query:    37 ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
             E  E W++ + + Y    EK +RF++FK+N+++I   N K   K Y LG+NEFAD ++EE
Sbjct:    49 ELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG--KSYWLGLNEFADLSHEE 106

Query:    97 FRAPRNGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAF 155
             F+    G K  +   R  E +   F Y +  +VP S+DWRKKGAV  VK+QG CG CWAF
Sbjct:   107 FKKMYLGLKTDIVR-RDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCWAF 165

Query:   156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
             S VAA+EGIN I T  LT+LSEQEL+DCDT+  + GC GGLMD AFE+I+ N GL  E  
Sbjct:   166 STVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGGLRKEED 224

Query:   216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
             YPY   +G+C  ++       I+G++DVP+N+E +L+KA+A+QP+SVAIDASG +FQFYS
Sbjct:   225 YPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFYS 284

Query:   276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
              GVF G+CG +LDHGV AVGYG++  G+ Y +VKNSWG  WGE GYIR++R+    EGLC
Sbjct:   285 GGVFDGRCGVDLDHGVAAVGYGSSK-GSDYIIVKNSWGPKWGEKGYIRLKRNTGKPEGLC 343

Query:   336 GIAMQASYPT 345
             GI   AS+PT
Sbjct:   344 GINKMASFPT 353


>TAIR|locus:2090629
            symbol:AT3G19400 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0019344 "cysteine biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            EMBL:CP002686 GenomeReviews:BA000014_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AB025624
            MEROPS:I29.003 HOGENOM:HOG000230773 EMBL:AK118509 IPI:IPI00543468
            RefSeq:NP_566634.2 UniGene:At.38409 ProteinModelPortal:Q9LT77
            SMR:Q9LT77 PaxDb:Q9LT77 PRIDE:Q9LT77 EnsemblPlants:AT3G19400.1
            GeneID:821474 KEGG:ath:AT3G19400 TAIR:At3g19400 InParanoid:Q9LT77
            OMA:IGEHERR ProtClustDB:CLSN2679975 Genevestigator:Q9LT77
            Uniprot:Q9LT77
        Length = 362

 Score = 842 (301.5 bits), Expect = 4.4e-84, P = 4.4e-84
 Identities = 166/348 (47%), Positives = 226/348 (64%)

Query:     4 ILLENKLVLAAILV---LGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
             +++   ++L+ +L+   LGV       R  N+  +   +E W+ +  + Y    EKE RF
Sbjct:     8 VIVSALVILSVLLLSSSLGVATETEIER--NETEVRLMYEQWLVENRKNYNGLGEKERRF 65

Query:    61 KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
             KIFK+N++++   +N   ++ +++G+  FAD TNEEFRA     ++++   + S  T+  
Sbjct:    66 KIFKDNLKFVDE-HNSVPDRTFEVGLTRFADLTNEEFRAIY--LRKKMERTKDSVKTE-R 121

Query:   121 FRYENASV-PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
             + Y+   V P  +DWR  GAV  VKDQG CG CWAFSAV A+EGIN ITT +L SLSEQE
Sbjct:   122 YLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQITTGELISLSEQE 181

Query:   180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASD-GSCNK-KEANPSAAKI 237
             LVDCD    + GC+GG+M+ AFEFI+ N G+ T+  YPY A+D G CN  K  N     I
Sbjct:   182 LVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNANDLGLCNADKNNNTRVVTI 241

Query:   238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
              GYEDVP ++E +L KAVA+QPVSVAI+AS   FQ Y SGV TG CG  LDHGV  VGYG
Sbjct:   242 DGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSGVMTGTCGISLDHGVVVVGYG 301

Query:   298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
             +   G  YW+++NSWG  WG++GY+++QR+ID   G CGIAM  SYPT
Sbjct:   302 STS-GEDYWIIRNSWGLNWGDSGYVKLQRNIDDPFGKCGIAMMPSYPT 348


>TAIR|locus:2029934
            symbol:AT1G29080 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002684 GenomeReviews:CT485782_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AC021043 MEROPS:I29.003 HOGENOM:HOG000230773
            HSSP:P53634 ProtClustDB:CLSN2688064 EMBL:DQ056468 IPI:IPI00521747
            PIR:C86413 RefSeq:NP_564320.1 UniGene:At.51814
            ProteinModelPortal:Q9LP39 SMR:Q9LP39 EnsemblPlants:AT1G29080.1
            GeneID:839783 KEGG:ath:AT1G29080 TAIR:At1g29080 InParanoid:Q9LP39
            OMA:KTWGENG PhylomeDB:Q9LP39 Genevestigator:Q9LP39 Uniprot:Q9LP39
        Length = 346

 Score = 821 (294.1 bits), Expect = 7.4e-82, P = 7.4e-82
 Identities = 163/351 (46%), Positives = 232/351 (66%)

Query:     3 MILLENKLVLAAILVLGVWAPQSWSRTL--NDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
             M  +E   V+  I  + +   ++ SR      +++ + H+ WM Q+ RVY D  EK++R 
Sbjct:     1 MDFVEFVCVVLTIFFMDLKISEATSRVALYKPSSIVDYHQQWMIQFSRVYDDEFEKQLRL 60

Query:    61 KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKR---RLPSVRSSETT 117
             ++  EN+++I SFNN   N+ YKLG+NEF D T EEF A   G +      P    +ET 
Sbjct:    61 QVLTENLKFIESFNNMG-NQSYKLGVNEFTDWTKEEFLATYTGLRGVNVTSPFEVVNETK 119

Query:   118 DVSFRYENASV-PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLS 176
               ++ +  + V   + DWR +GAVT VK QG+CG CWAFSA+AA+EG+  I    L SLS
Sbjct:   120 P-AWNWTVSDVLGTNKDWRNEGAVTPVKSQGECGGCWAFSAIAAVEGLTKIARGNLISLS 178

Query:   177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAK 236
             EQ+L+DC T  ++ GC+GG   +AF +II ++G+++E +YPY+  +G C +  A P A  
Sbjct:   179 EQQLLDC-TREQNNGCKGGTFVNAFNYIIKHRGISSENEYPYQVKEGPC-RSNARP-AIL 235

Query:   237 ISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQ-CGTELDHGVTAVG 295
             I G+E+VPSNNE AL++AV+ QPV+VAIDAS + F  YS GV+  + CGT ++H VT VG
Sbjct:   236 IRGFENVPSNNERALLEAVSRQPVAVAIDASEAGFVHYSGGVYNARNCGTSVNHAVTLVG 295

Query:   296 YGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
             YGT+ +G KYWL KNSWG TWGENGYIR++RD++  +G+CG+A  ASYP A
Sbjct:   296 YGTSPEGMKYWLAKNSWGKTWGENGYIRIRRDVEWPQGMCGVAQYASYPVA 346


>TAIR|locus:2024362
            symbol:XBCP3 "xylem bark cysteine peptidase 3"
            species:3702 "Arabidopsis thaliana" [GO:0005576 "extracellular
            region" evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0005783 "endoplasmic
            reticulum" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005783 EMBL:CP002684 GO:GO:0005773 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 MEROPS:I29.003
            HOGENOM:HOG000230773 InterPro:IPR000118 Pfam:PF00396 SMART:SM00277
            UniGene:At.10233 OMA:CEIESAV EMBL:BT026490 EMBL:AK226753
            IPI:IPI00536687 RefSeq:NP_563855.1 ProteinModelPortal:Q0WVJ5
            SMR:Q0WVJ5 PRIDE:Q0WVJ5 EnsemblPlants:AT1G09850.1 GeneID:837517
            KEGG:ath:AT1G09850 TAIR:At1g09850 InParanoid:Q0WVJ5
            PhylomeDB:Q0WVJ5 ProtClustDB:CLSN2687747 Genevestigator:Q0WVJ5
            Uniprot:Q0WVJ5
        Length = 437

 Score = 803 (287.7 bits), Expect = 6.0e-80, P = 6.0e-80
 Identities = 156/311 (50%), Positives = 201/311 (64%)

Query:    35 MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
             ++E  + W  ++G+ Y    E++ R +IFK+N +++   +N   N  Y L +N FAD T+
Sbjct:    28 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQ-HNLITNATYSLSLNAFADLTH 86

Query:    95 EEFRAPRNGYKRRLPSV-RSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCW 153
              EF+A R G     PSV  +S+   +     +  VP S+DWRKKGAVT VKDQG CG CW
Sbjct:    87 HEFKASRLGLSVSAPSVIMASKGQSLG---GSVKVPDSVDWRKKGAVTNVKDQGSCGACW 143

Query:   154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
             +FSA  AMEGIN I T  L SLSEQEL+DCD S  + GC GGLMD AFEF+I N G+ TE
Sbjct:   144 SFSATGAMEGINQIVTGDLISLSEQELIDCDKS-YNAGCNGGLMDYAFEFVIKNHGIDTE 202

Query:   214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
               YPY+  DG+C K +       I  Y  V SN+E ALM+AVA QPVSV I  S   FQ 
Sbjct:   203 KDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQL 262

Query:   274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
             YSSG+F+G C T LDH V  VGYG+  +G  YW+VKNSWG +WG +G++ MQR+ +  +G
Sbjct:   263 YSSGIFSGPCSTSLDHAVLIVGYGS-QNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDG 321

Query:   334 LCGIAMQASYP 344
             +CGI M ASYP
Sbjct:   322 VCGINMLASYP 332


>TAIR|locus:2038515
            symbol:AT1G06260 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0048046 "apoplast"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0048046 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC025290
            MEROPS:I29.003 HSSP:O65039 HOGENOM:HOG000230773 OMA:METAFEF
            IPI:IPI00525965 PIR:D86198 RefSeq:NP_563764.1 UniGene:At.24617
            ProteinModelPortal:Q9LNC1 SMR:Q9LNC1 PaxDb:Q9LNC1 PRIDE:Q9LNC1
            EnsemblPlants:AT1G06260.1 GeneID:837137 KEGG:ath:AT1G06260
            TAIR:At1g06260 InParanoid:Q9LNC1 PhylomeDB:Q9LNC1
            ProtClustDB:CLSN2916975 Genevestigator:Q9LNC1 Uniprot:Q9LNC1
        Length = 343

 Score = 798 (286.0 bits), Expect = 2.0e-79, P = 2.0e-79
 Identities = 164/348 (47%), Positives = 217/348 (62%)

Query:     1 MAMILLENKLVLAAILVLGVWAPQ--SWSRTLNDA--TMNERHEMWMAQYGRVYRDNAEK 56
             M  +L  + L LA ++   + A +  S   ++ D   T+ +R E W+  + ++Y    E 
Sbjct:     1 MLNVLRNSNLTLAVLICFVLIASKLCSVDSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEW 60

Query:    57 EMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSET 116
              +RF I++ NV+ I   N+   + P+KL  N FAD TN EF+A   G       +   + 
Sbjct:    61 MLRFGIYQSNVQLIDYINSL--HLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHKKQR 118

Query:   117 TDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLS 176
                       +VP ++DWR +GAVT +++QG+CG CWAFSAVAA+EGIN I T  L SLS
Sbjct:   119 PVCD---PAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLS 175

Query:   177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAK 236
             EQ+L+DCD    ++GC GGLM+ AFEFI +N GLATE  YPY   +G+C+++++      
Sbjct:   176 EQQLIDCDVGTYNKGCSGGLMETAFEFIKTNGGLATETDYPYTGIEGTCDQEKSKNKVVT 235

Query:   237 ISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGY 296
             I GY+ V + NEA+L  A A QPVSV IDA G  FQ YSSGVFT  CGT L+HGVT VGY
Sbjct:   236 IQGYQKV-AQNEASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTNYCGTNLNHGVTVVGY 294

Query:   297 GTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             G   D  KYW+VKNSWGT WGE GYIRM+R +    G CGIAM ASYP
Sbjct:   295 GVEGD-QKYWIVKNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASYP 341


>TAIR|locus:2128253
            symbol:AT4G11320 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002687 GenomeReviews:CT486007_GR EMBL:AL096882 EMBL:AL161531
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 OMA:ICHGADP
            HOGENOM:HOG000230773 KO:K01376 ProtClustDB:CLSN2689395
            EMBL:AY035055 EMBL:AY051062 IPI:IPI00520480 PIR:T13023
            RefSeq:NP_567377.1 UniGene:At.25206 ProteinModelPortal:Q9SUS9
            SMR:Q9SUS9 STRING:Q9SUS9 MEROPS:C01.A21 PaxDb:Q9SUS9 PRIDE:Q9SUS9
            EnsemblPlants:AT4G11320.1 GeneID:826734 KEGG:ath:AT4G11320
            TAIR:At4g11320 InParanoid:Q9SUS9 PhylomeDB:Q9SUS9
            Genevestigator:Q9SUS9 GermOnline:AT4G11320 Uniprot:Q9SUS9
        Length = 371

 Score = 794 (284.6 bits), Expect = 5.4e-79, P = 5.4e-79
 Identities = 158/316 (50%), Positives = 210/316 (66%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
             DA      E WM ++G+VY   AEKE R  IF++N+ +I   N  A N  Y+LG+N FAD
Sbjct:    49 DAEATLMFESWMVKHGKVYDSVAEKERRLTIFEDNLRFIT--NRNAENLSYRLGLNRFAD 106

Query:    92 QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV-PASIDWRKKGAVTGVKDQGQCG 150
              +  E+    +G   R P      T+   ++  +  V P S+DWR +GAVT VKDQG C 
Sbjct:   107 LSLHEYGEICHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCR 166

Query:   151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
              CWAFS V A+EG+N I T +L +LSEQ+L++C+   E+ GC GG ++ A+EFI++N GL
Sbjct:   167 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKVETAYEFIMNNGGL 224

Query:   211 ATEAKYPYKASDGSCNK--KEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
              T+  YPYKA +G C    KE N +   I GYE++P+N+EAALMKAVA+QPV+  +D+S 
Sbjct:   225 GTDNDYPYKALNGVCEGRLKEDNKNVM-IDGYENLPANDEAALMKAVAHQPVTAVVDSSS 283

Query:   269 SDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
              +FQ Y SGVF G CGT L+HGV  VGYGT ++G  YW+VKNS G TWGE GY++M R+I
Sbjct:   284 REFQLYESGVFDGTCGTNLNHGVVVVGYGT-ENGRDYWIVKNSRGDTWGEAGYMKMARNI 342

Query:   329 DAKEGLCGIAMQASYP 344
                 GLCGIAM+ASYP
Sbjct:   343 ANPRGLCGIAMRASYP 358


>TAIR|locus:2128243
            symbol:AT4G11310 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005618 "cell wall"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 GO:GO:0005618 EMBL:CP002687
            GenomeReviews:CT486007_GR EMBL:AL096882 EMBL:AL161531
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            HOGENOM:HOG000230773 KO:K01376 EMBL:AY093066 EMBL:BT000099
            IPI:IPI00520496 PIR:T13022 RefSeq:NP_567376.1 UniGene:At.43189
            ProteinModelPortal:Q9SUT0 SMR:Q9SUT0 IntAct:Q9SUT0 STRING:Q9SUT0
            MEROPS:C01.A20 PaxDb:Q9SUT0 PRIDE:Q9SUT0 EnsemblPlants:AT4G11310.1
            GeneID:826733 KEGG:ath:AT4G11310 TAIR:At4g11310 InParanoid:Q9SUT0
            OMA:EVCHGAD PhylomeDB:Q9SUT0 ProtClustDB:CLSN2689395
            Genevestigator:Q9SUT0 GermOnline:AT4G11310 Uniprot:Q9SUT0
        Length = 364

 Score = 792 (283.9 bits), Expect = 8.7e-79, P = 8.7e-79
 Identities = 158/317 (49%), Positives = 213/317 (67%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
             DA  +   E WM ++G+VY   AEKE R  IF++N+ +I   N  A N  Y+LG+  FAD
Sbjct:    42 DAEASLIFESWMVKHGKVYGSVAEKERRLTIFEDNLRFIN--NRNAENLSYRLGLTGFAD 99

Query:    92 QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVKDQGQ 148
              +  E++   +G   R P  R+      S RY+ ++   +P S+DWR +GAVT VKDQG 
Sbjct:   100 LSLHEYKEVCHGADPRPP--RNHVFMTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGH 157

Query:   149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
             C  CWAFS V A+EG+N I T +L +LSEQ+L++C+   E+ GC GG ++ A+EFI+ N 
Sbjct:   158 CRSCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKLETAYEFIMKNG 215

Query:   209 GLATEAKYPYKASDGSCNKK-EANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDAS 267
             GL T+  YPYKA +G C+ + + N     I GYE++P+N+E+ALMKAVA+QPV+  ID+S
Sbjct:   216 GLGTDNDYPYKAVNGVCDGRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSS 275

Query:   268 GSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
               +FQ Y SGVF G CGT L+HGV  VGYGT ++G  YWLVKNS G TWGE GY++M R+
Sbjct:   276 SREFQLYESGVFDGSCGTNLNHGVVVVGYGT-ENGRDYWLVKNSRGITWGEAGYMKMARN 334

Query:   328 IDAKEGLCGIAMQASYP 344
             I    GLCGIAM+ASYP
Sbjct:   335 IANPRGLCGIAMRASYP 351


>TAIR|locus:2117979 [details] [associations]
            symbol:AT4G23520 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002687 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            KO:K01376 IPI:IPI00527171 RefSeq:NP_567686.2 UniGene:At.32421
            ProteinModelPortal:F4JNL3 SMR:F4JNL3 MEROPS:C01.A22 PRIDE:F4JNL3
            EnsemblPlants:AT4G23520.1 GeneID:828452 KEGG:ath:AT4G23520
            OMA:PANDEIS ArrayExpress:F4JNL3 Uniprot:F4JNL3
        Length = 356

 Score = 785 (281.4 bits), Expect = 4.8e-78, P = 4.8e-78
 Identities = 150/308 (48%), Positives = 210/308 (68%)

Query:    40 EMWMAQYGRVYRDN-AEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
             +MWM+++G+ Y +   EKE RF+ FK+N+ +I   N  A+N  Y+LG+  FAD T +E+R
Sbjct:    48 QMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHN--AKNLSYQLGLTRFADLTVQEYR 105

Query:    99 APRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAV 158
                 G  +  P  R+ +T+          +P S+DWR++GAV+ +KDQG C  CWAFS V
Sbjct:   106 DLFPGSPK--PKQRNLKTSRRYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTV 163

Query:   159 AAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEG-GLMDDAFEFIISNKGLATEAKYP 217
             AA+EG+N I T +L SLSEQELVDC+    + GC G GLMD AF+F+I+N GL +E  YP
Sbjct:   164 AAVEGLNKIVTGELISLSEQELVDCNLV--NNGCYGSGLMDTAFQFLINNNGLDSEKDYP 221

Query:   218 YKASDGSCNKKEANPSAA-KISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
             Y+ + GSCN+K++  +    I  YEDVP+N+E +L KAVA+QPVSV +D    +F  Y S
Sbjct:   222 YQGTQGSCNRKQSTSNKVITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRS 281

Query:   277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
              ++ G CGT LDH +  VGYG+ ++G  YW+V+NSWGTTWG+ GYI++ R+ +  +GLCG
Sbjct:   282 CIYNGPCGTNLDHALVIVGYGS-ENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCG 340

Query:   337 IAMQASYP 344
             IAM ASYP
Sbjct:   341 IAMLASYP 348


>FB|FBgn0013770 [details] [associations]
            symbol:Cp1 "Cysteine proteinase-1" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS;NAS] [GO:0030163 "protein catabolic process"
            evidence=NAS] [GO:0005764 "lysosome" evidence=NAS] [GO:0048102
            "autophagic cell death" evidence=IEP] [GO:0035071 "salivary gland
            cell autophagic cell death" evidence=IEP] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0045169 "fusome" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:AE013599 GO:GO:0007586 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0035071 GO:GO:0045169 GeneTree:ENSGT00660000095458 KO:K01365
            EMBL:U75652 EMBL:AF012089 EMBL:BT016071 EMBL:D31970
            RefSeq:NP_523735.2 RefSeq:NP_725347.1 RefSeq:NP_725348.1
            UniGene:Dm.7400 ProteinModelPortal:Q95029 SMR:Q95029 IntAct:Q95029
            MINT:MINT-814156 STRING:Q95029 MEROPS:C01.092 PaxDb:Q95029
            EnsemblMetazoa:FBtr0087593 GeneID:36546 KEGG:dme:Dmel_CG6692
            CTD:36546 FlyBase:FBgn0013770 InParanoid:Q95029 OMA:ICHGADP
            OrthoDB:EOG46M91C PhylomeDB:Q95029 GenomeRNAi:36546 NextBio:799136
            Bgee:Q95029 GermOnline:CG6692 Uniprot:Q95029
        Length = 371

 Score = 718 (257.8 bits), Expect = 6.1e-71, P = 6.1e-71
 Identities = 156/350 (44%), Positives = 211/350 (60%)

Query:     4 ILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIF 63
             I +   ++L  + +L V    S++    D  M E H   + ++ + Y+D  E+  R KIF
Sbjct:    29 ITMRTAVLLPLLALLAVAQAVSFA----DVVMEEWHTFKL-EHRKNYQDETEERFRLKIF 83

Query:    64 KENVEYIASFNNK-ARNK-PYKLGINEFADQTNEEFRAPRNGYKRRL-PSVRSSETT--D 118
              EN   IA  N + A  K  +KL +N++AD  + EFR   NG+   L   +R+++ +   
Sbjct:    84 NENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQLMNGFNYTLHKQLRAADESFKG 143

Query:   119 VSF-RYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
             V+F    + ++P S+DWR KGAVT VKDQG CG CWAFS+  A+EG +   +  L SLSE
Sbjct:   144 VTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSE 203

Query:   178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
             Q LVDC T   + GC GGLMD+AF +I  N G+ TE  YPY+A D SC+  +    A   
Sbjct:   204 QNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATD- 262

Query:   238 SGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTG-QCGTE-LDHGVTAV 294
              G+ D+P  +E  + +AVA   PVSVAIDAS   FQFYS GV+   QC  + LDHGV  V
Sbjct:   263 RGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVV 322

Query:   295 GYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             G+GT + G  YWLVKNSWGTTWG+ G+I+M R+   KE  CGIA  +SYP
Sbjct:   323 GFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRN---KENQCGIASASSYP 369


>UNIPROTKB|F1NYJ1 [details] [associations]
            symbol:CTSL2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 IPI:IPI00602255
            OMA:DITHHEF EMBL:AADN02067812 Ensembl:ENSGALT00000020588
            ArrayExpress:F1NYJ1 Uniprot:F1NYJ1
        Length = 339

 Score = 705 (253.2 bits), Expect = 1.4e-69, P = 1.4e-69
 Identities = 150/323 (46%), Positives = 198/323 (61%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN-NKARNK-PYKLGINEF 89
             D  ++   ++W + + + Y +  E+  R  ++++N++ I   N + +  K  YKLG+N+F
Sbjct:    23 DPDLDSHWQLWKSWHSKDYHER-EESWRRVVWEKNLKMIELHNLDHSLGKHSYKLGMNQF 81

Query:    90 ADQTNEEFRAPRNGYKRRLPS--VRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQG 147
              D T EEFR   NGYK +      R S+  + SF       P S+DWR+KG VT VKDQG
Sbjct:    82 GDMTAEEFRQLMNGYKHKKSERKYRGSQFLEPSF----LEAPRSVDWREKGYVTPVKDQG 137

Query:   148 QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISN 207
             QCG CWAFS   A+EG +   T KL SLSEQ LVDC     +QGC GGLMD AF+++  N
Sbjct:   138 QCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDN 197

Query:   208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDA 266
              G+ +E  YPY A D    + +A  +AA  +G+ D+P  +E ALMKAVA+  PVSVAIDA
Sbjct:   198 GGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAIDA 257

Query:   267 SGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYGTAD---DGTKYWLVKNSWGTTWGENGY 321
               S FQFY SG++    C +E LDHGV  VGYG      DG KYW+VKNSWG  WG+ GY
Sbjct:   258 GHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGY 317

Query:   322 IRMQRDIDAKEGLCGIAMQASYP 344
             I M +D   ++  CGIA  ASYP
Sbjct:   318 IYMAKD---RKNHCGIATAASYP 337


>TAIR|locus:2030027 [details] [associations]
            symbol:AT1G29110 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            EMBL:CP002684 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            IPI:IPI00544534 RefSeq:NP_564322.1 UniGene:At.51816
            ProteinModelPortal:F4HZW2 SMR:F4HZW2 EnsemblPlants:AT1G29110.1
            GeneID:839786 KEGG:ath:AT1G29110 OMA:SCRANAR Uniprot:F4HZW2
        Length = 334

 Score = 704 (252.9 bits), Expect = 1.8e-69, P = 1.8e-69
 Identities = 144/348 (41%), Positives = 210/348 (60%)

Query:     3 MILLENKLVLAAILVLGVWAPQSWSR-TLNDATMNERHEMWMAQYGRVYRDNAEKEMRFK 61
             M+ + +  V   IL + +   Q+    TLN+ ++ + H+ WM Q+ RVY+D +EKEMR K
Sbjct:     1 MVSVRSVFVALTILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLK 60

Query:    62 IFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSS-ETTDVS 120
             +FK+N+++I +FNN   N+ Y LG+NEF D   EEF A   G +  + S+      T  S
Sbjct:    61 VFKKNLKFIENFNNMG-NQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPS 119

Query:   121 FRYENASVPA---SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
               +  + +     S DWR +GAVT VK QG C              +  I+ + L +LSE
Sbjct:   120 RNWNMSDIDMEDESKDWRDEGAVTPVKYQGACR-------------LTKISGKNLLTLSE 166

Query:   178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
             Q+L+DCD   ++ GC GG  ++AF++II N G++ E +YPY+    SC          +I
Sbjct:   167 QQLIDCDIE-KNGGCNGGEFEEAFKYIIKNGGVSLETEYPYQVKKESCRANARRAPHTQI 225

Query:   238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTG-QCGTELDHGVTAVGY 296
              G++ VPS+NE AL++AV  QPVSV IDA    F  Y  GV+ G  CGT+++H VT VGY
Sbjct:   226 RGFQMVPSHNERALLEAVRRQPVSVLIDARADSFGHYKGGVYAGLDCGTDVNHAVTIVGY 285

Query:   297 GTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             GT   G  YW++KNSWG +WGENGY+R++RD++  +G+CGIA  A+YP
Sbjct:   286 GTMS-GLNYWVLKNSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAYP 332


>UNIPROTKB|F1S4J6 [details] [associations]
            symbol:Ssc.54235 "Cathepsin L1" species:9823 "Sus scrofa"
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197
            GO:GO:0002250 GeneTree:ENSGT00560000076577 GO:GO:0097067
            EMBL:CU571031 RefSeq:XP_003130681.1 Ensembl:ENSSSCT00000011983
            GeneID:100515919 KEGG:ssc:100515919 OMA:IAICATK Uniprot:F1S4J6
        Length = 332

 Score = 688 (247.2 bits), Expect = 9.2e-68, P = 9.2e-68
 Identities = 156/344 (45%), Positives = 202/344 (58%)

Query:     8 NKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
             N  +L A   LG+ +    +   +D +++     W A + ++Y  N E   R  I+++N+
Sbjct:     2 NPSLLLAAFCLGIAS----AAPRHDHSLDADWYKWKATHRKLYGLNEEGRRR-AIWEKNM 56

Query:    68 EYIASFNNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN 125
             + I   N + R     + + +N F D TNEEFR   NG++ +    +     D      +
Sbjct:    57 KMIERHNWEHRQGKHSFTMAMNAFGDMTNEEFRKTMNGFQNQKHK-KGKVFLDAG----S 111

Query:   126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
             A  P S+DWR+KG VT VK+QG CG CWAFSA  A+EG     T KL SLSEQ LVDC  
Sbjct:   112 ALTPHSVDWREKGYVTAVKNQGHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSW 171

Query:   186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
                ++GC GGLMD+AF++I  N GL +E  YPY   DGSC  K  + SAA  +GY D+P 
Sbjct:   172 PEGNEGCNGGLMDNAFQYIKDNGGLDSEESYPYFGKDGSCKYKPQS-SAANDTGYVDIPK 230

Query:   246 NNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYGT--AD 300
               E ALMKAVA   P+SV IDAS   FQFYS+G+ F  QC +E LDHGV  VGYG   A 
Sbjct:   231 Q-EKALMKAVATVGPISVGIDASHESFQFYSTGIYFEPQCSSEDLDHGVLVVGYGVEGAH 289

Query:   301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
                KYWLVKNSWG TWG +GYI+M +D   +   CGIA  ASYP
Sbjct:   290 SNNKYWLVKNSWGNTWGMDGYIKMTKD---QNNHCGIATMASYP 330


>ZFIN|ZDB-GENE-030131-572 [details] [associations]
            symbol:wu:fb37b09 "wu:fb37b09" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030131-572 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:BX465840 IPI:IPI00866294 RefSeq:XP_001923796.1
            UniGene:Dr.25683 PRIDE:E9QBE2 Ensembl:ENSDART00000133962
            GeneID:321853 KEGG:dre:321853 NextBio:20807556 Uniprot:E9QBE2
        Length = 335

 Score = 688 (247.2 bits), Expect = 9.2e-68, P = 9.2e-68
 Identities = 146/320 (45%), Positives = 195/320 (60%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA--SFNNKARNKPYKLGINEF 89
             D  +++    W +Q+G+ Y ++ E   R  I++EN+  I   +F     N  +K+G+N+F
Sbjct:    21 DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSLGNHTFKMGMNQF 79

Query:    90 ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
              D TNEEFR   NGYK   P+ R+S+   +    +  + P  +DWR++G VT VKDQ QC
Sbjct:    80 GDMTNEEFRQAMNGYKHD-PN-RTSQGP-LFMEPKFFAAPQQVDWRQRGYVTPVKDQKQC 136

Query:   150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
             G CW+FS+  A+EG     T KL S+SEQ LVDC     +QGC GGLMD AF+++  NKG
Sbjct:   137 GSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHGNQGCNGGLMDQAFQYVKENKG 196

Query:   210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASG 268
             L +E  YPY A D    + +   + AKI+G+ D+P  NE ALM AVA   PVSVAIDAS 
Sbjct:   197 LDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGNELALMNAVAAVGPVSVAIDASH 256

Query:   269 SDFQFYSSGVFTGQ-CGTELDHGVTAVGYGT--AD-DGTKYWLVKNSWGTTWGENGYIRM 324
                QFY SG++  + C ++LDH V  VGYG   AD  G +YW+VKNSW   WG+ GYI M
Sbjct:   257 QSLQFYQSGIYYERACTSQLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYM 316

Query:   325 QRDIDAKEGLCGIAMQASYP 344
              +D   K   CGIA  ASYP
Sbjct:   317 AKD---KNNHCGIATMASYP 333


>ZFIN|ZDB-GENE-040718-61 [details] [associations]
            symbol:ctsl.1 "cathepsin L.1" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-040718-61
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            GeneTree:ENSGT00660000095458 MEROPS:C01.092 EMBL:FP015965
            EMBL:BC075887 IPI:IPI00513499 RefSeq:NP_001002368.1
            UniGene:Dr.85174 SMR:Q6DHT0 Ensembl:ENSDART00000017756
            GeneID:436641 KEGG:dre:436641 CTD:436641 InParanoid:Q6DHT0
            OMA:GGQMENA OrthoDB:EOG41ZFB9 NextBio:20831086 Uniprot:Q6DHT0
        Length = 334

 Score = 687 (246.9 bits), Expect = 1.2e-67, P = 1.2e-67
 Identities = 163/345 (47%), Positives = 209/345 (60%)

Query:    10 LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
             LV+AA   L V +  S S  L D    E H  W  ++G+ YR   E+  R   +  N + 
Sbjct:     4 LVVAAAF-LAVASAASLS--LEDM---EFHA-WKLKFGKSYRSAEEESHRQLTWLTNRKL 56

Query:    70 IASFNNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS--FRYEN 125
             +   N  A    K Y+LG+  FAD +NEE+R  +  ++  L S+ +++    S  FR   
Sbjct:    57 VLVHNMMADQGLKSYRLGMTYFADMSNEEYR--QLVFRGCLGSMNNTKARGGSTFFRLRK 114

Query:   126 ASV-PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
             A+V P ++DWR KG VT +KDQ QCG CWAFSA  ++EG     T KL SLSEQ+LVDC 
Sbjct:   115 AAVVPDTVDWRDKGYVTDIKDQKQCGSCWAFSATGSLEGQTFRKTGKLVSLSEQQLVDCS 174

Query:   185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSA--AKISGYED 242
              S  + GC+GGLMD AF++I +NKGL TE  YPY+A DG C     NPS   A  +GY D
Sbjct:   175 GSYGNYGCDGGLMDQAFQYIEANKGLDTEDSYPYEAQDGECR---FNPSTVGASCTGYVD 231

Query:   243 VPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFTG-QCGT-ELDHGVTAVGYGTA 299
             + S +E+AL +AVA   P+SVAIDA  S FQ YSSGV+    C + ELDHGV AVGYG++
Sbjct:   232 IASGDESALQEAVATIGPISVAIDAGHSSFQLYSSGVYNEPDCSSSELDHGVLAVGYGSS 291

Query:   300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             + G  YW+VKNSWG  WG  GYI M R+   K   CGIA  ASYP
Sbjct:   292 N-GDDYWIVKNSWGLDWGVQGYILMSRN---KSNQCGIATAASYP 332


>ZFIN|ZDB-GENE-071004-74 [details] [associations]
            symbol:zgc:174855 "zgc:174855" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-071004-74
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 MEROPS:C01.032 EMBL:BX000534 EMBL:BC152282
            IPI:IPI00773140 RefSeq:NP_001096592.1 UniGene:Dr.104905 SMR:A7MCR6
            STRING:A7MCR6 Ensembl:ENSDART00000109968 GeneID:569326
            KEGG:dre:569326 NextBio:20889622 Uniprot:A7MCR6
        Length = 335

 Score = 683 (245.5 bits), Expect = 3.1e-67, P = 3.1e-67
 Identities = 145/320 (45%), Positives = 194/320 (60%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA--SFNNKARNKPYKLGINEF 89
             D  +++    W +Q+G+ Y ++ E   R  I++EN+  I   +F     N  +K+G+N+F
Sbjct:    21 DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSLGNHTFKMGMNQF 79

Query:    90 ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
              D TNEEFR   NGYK+  P+ R+S+   +       + P  +DWR++G VT VKDQ QC
Sbjct:    80 GDMTNEEFRQAMNGYKQD-PN-RTSKGA-LFMEPSFFAAPQQVDWRQRGYVTPVKDQKQC 136

Query:   150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
             G CW+FS+  A+EG     T KL S+SEQ LVDC     +QGC GG+MD AF+++  NKG
Sbjct:   137 GSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQAFQYVKENKG 196

Query:   210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASG 268
             L +E  YPY A D    + +   + AKI+G+ D+P  NE ALM AVA   PVSVAIDAS 
Sbjct:   197 LDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGNELALMNAVAAVGPVSVAIDASH 256

Query:   269 SDFQFYSSGVFTGQ-CGTELDHGVTAVGYGT--AD-DGTKYWLVKNSWGTTWGENGYIRM 324
                QFY SG++  + C + LDH V  VGYG   AD  G +YW+VKNSW   WG+ GYI M
Sbjct:   257 QSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYM 316

Query:   325 QRDIDAKEGLCGIAMQASYP 344
              +D   K   CGIA  ASYP
Sbjct:   317 AKD---KNNHCGIATMASYP 333


>MGI|MGI:88564 [details] [associations]
            symbol:Ctsl "cathepsin L" species:10090 "Mus musculus"
            [GO:0004177 "aminopeptidase activity" evidence=ISO] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISO] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005730 "nucleolus"
            evidence=NAS] [GO:0005737 "cytoplasm" evidence=ISO] [GO:0005764
            "lysosome" evidence=ISO] [GO:0005773 "vacuole" evidence=ISO]
            [GO:0005902 "microvillus" evidence=ISO] [GO:0006508 "proteolysis"
            evidence=ISO;IDA] [GO:0007154 "cell communication" evidence=ISO]
            [GO:0008152 "metabolic process" evidence=TAS] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISO;TAS] [GO:0009897 "external side of
            plasma membrane" evidence=ISO] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0030141 "secretory granule" evidence=ISO]
            [GO:0030984 "kininogen binding" evidence=ISO] [GO:0032403 "protein
            complex binding" evidence=ISO] [GO:0042277 "peptide binding"
            evidence=ISO] [GO:0042393 "histone binding" evidence=ISO;NAS]
            [GO:0043005 "neuron projection" evidence=ISO] [GO:0043204
            "perikaryon" evidence=ISO] [GO:0045177 "apical part of cell"
            evidence=ISO] [GO:0048863 "stem cell differentiation" evidence=NAS]
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IMP] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:88564 GO:GO:0005730 GO:GO:0009897 GO:GO:0034698
            GO:GO:0043204 GO:GO:0009749 GO:GO:0030141 GO:GO:0048863
            GO:GO:0051384 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045177 GO:GO:0043005
            GO:GO:0007283 GO:GO:0004177 GO:GO:0005764 GO:GO:0042277
            GO:GO:0009267 GO:GO:0021675 GO:GO:0042393 GO:GO:0005902
            GO:GO:0010259 GO:GO:0004197 GO:GO:0048102 GO:GO:0046697
            HOVERGEN:HBG011513 KO:K01365 OMA:EEFRATH OrthoDB:EOG48PMKF
            MEROPS:C01.032 BRENDA:3.4.22.15 ChiTaRS:CTSL1 EMBL:X06086
            EMBL:J02583 EMBL:M20495 EMBL:AF121837 EMBL:AF121838 EMBL:AF121839
            EMBL:BC068163 EMBL:X04392 IPI:IPI00128154 PIR:S01177
            RefSeq:NP_034114.1 UniGene:Mm.930 PDB:1MVV PDBsum:1MVV
            ProteinModelPortal:P06797 SMR:P06797 STRING:P06797
            PhosphoSite:P06797 PaxDb:P06797 PRIDE:P06797
            Ensembl:ENSMUST00000021933 GeneID:13039 KEGG:mmu:13039 CTD:13039
            InParanoid:P06797 BioCyc:MetaCyc:MONOMER-14812 BindingDB:P06797
            ChEMBL:CHEMBL5291 NextBio:282928 Bgee:P06797 CleanEx:MM_CTSL
            Genevestigator:P06797 GermOnline:ENSMUSG00000021477 GO:GO:0060008
            Uniprot:P06797
        Length = 334

 Score = 681 (244.8 bits), Expect = 5.1e-67, P = 5.1e-67
 Identities = 153/345 (44%), Positives = 204/345 (59%)

Query:     8 NKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
             N L+L A+L LG     + +    D T +     W + + R+Y  N E+E R  I+++N+
Sbjct:     2 NLLLLLAVLCLGT----ALATPKFDQTFSAEWHQWKSTHRRLYGTN-EEEWRRAIWEKNM 56

Query:    68 EYIASFNNKARNKP--YKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN 125
               I   N +  N    + + +N F D TNEEFR   NGY+ +           +  +   
Sbjct:    57 RMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK--- 113

Query:   126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
               +P S+DWR+KG VT VK+QGQCG CWAFSA   +EG   + T KL SLSEQ LVDC  
Sbjct:   114 --IPKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSH 171

Query:   186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
             +  +QGC GGLMD AF++I  N GL +E  YPY+A DGSC K  A  + A  +G+ D+P 
Sbjct:   172 AQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSC-KYRAEFAVANDTGFVDIPQ 230

Query:   246 NNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYG---TA 299
               E ALMKAVA   P+SVA+DAS    QFYSSG++    C ++ LDHGV  VGYG   T 
Sbjct:   231 Q-EKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTD 289

Query:   300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
              +  KYWLVKNSWG+ WG  GYI++ +D   ++  CG+A  ASYP
Sbjct:   290 SNKNKYWLVKNSWGSEWGMEGYIKIAKD---RDNHCGLATAASYP 331


>ZFIN|ZDB-GENE-080215-7 [details] [associations]
            symbol:zgc:174153 "zgc:174153" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-080215-7
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 EMBL:BX000534 EMBL:BX322603
            IPI:IPI00483644 Ensembl:ENSDART00000113654 OMA:ITLCISA Bgee:F1R8Y0
            Uniprot:F1R8Y0
        Length = 336

 Score = 679 (244.1 bits), Expect = 8.2e-67, P = 8.2e-67
 Identities = 146/321 (45%), Positives = 193/321 (60%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA--SFNNKARNKPYKLGINEF 89
             D  +++    W +Q+G+ Y ++ E   R  I++EN+  I   +F     N  +K+G+N+F
Sbjct:    21 DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSYGNHTFKMGMNQF 79

Query:    90 ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
              D TNEEFR   NGYK   P+  S     +   +  A  P  +DWR++G VT VKDQ QC
Sbjct:    80 GDMTNEEFRQAMNGYKHD-PNQTSQGPLFMEPSFFAA--PQQVDWRQRGYVTPVKDQKQC 136

Query:   150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
             G CW+FS+  A+EG     T KL S+SEQ LVDC     +QGC GGLMD AF+++  NKG
Sbjct:   137 GSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGLMDQAFQYVKENKG 196

Query:   210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASG 268
             L +E  YPY A D    + +   + AKI+G+ D+PS NE ALM AVA   PVSVAIDAS 
Sbjct:   197 LDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNEPALMNAVAAVGPVSVAIDASH 256

Query:   269 SDFQFYSSGVFTGQ-CGTE-LDHGVTAVGYGT--AD-DGTKYWLVKNSWGTTWGENGYIR 323
                QFY SG++  + C +  LDH V  VGYG   AD  G +YW+VKNSW   WG+ GYI 
Sbjct:   257 QSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIY 316

Query:   324 MQRDIDAKEGLCGIAMQASYP 344
             M +D   K   CG+A +ASYP
Sbjct:   317 MAKD---KNNHCGVATKASYP 334


>TAIR|locus:2097104 [details] [associations]
            symbol:AT3G43960 species:3702 "Arabidopsis thaliana"
            [GO:0005886 "plasma membrane" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0031225 "anchored to
            membrane" evidence=TAS] [GO:0048767 "root hair elongation"
            evidence=IMP] [GO:0016132 "brassinosteroid biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:CP002686
            GenomeReviews:BA000014_GR GO:GO:0031225 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0048767 MEROPS:I29.003 HOGENOM:HOG000230773
            EMBL:AL163975 EMBL:AK118634 IPI:IPI00526842 PIR:T48950
            RefSeq:NP_566867.1 UniGene:At.43352 ProteinModelPortal:Q9LXW3
            SMR:Q9LXW3 STRING:Q9LXW3 PaxDb:Q9LXW3 PRIDE:Q9LXW3
            EnsemblPlants:AT3G43960.1 GeneID:823513 KEGG:ath:AT3G43960
            TAIR:At3g43960 eggNOG:NOG286334 InParanoid:Q9LXW3 KO:K01376
            OMA:MAISFRT PhylomeDB:Q9LXW3 ProtClustDB:CLSN2917367
            Genevestigator:Q9LXW3 GermOnline:AT3G43960 Uniprot:Q9LXW3
        Length = 376

 Score = 678 (243.7 bits), Expect = 1.1e-66, P = 1.1e-66
 Identities = 148/344 (43%), Positives = 204/344 (59%)

Query:    10 LVLAAILV---LGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKEN 66
             L L+ +L+   LGV       R  N+  +   +E W+ + G+ Y    EKE RFKIFK+N
Sbjct:    11 LTLSVLLISISLGVVTATESQR--NEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDN 68

Query:    67 VEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA 126
             ++ I   N+   N+ Y+ G+N+F+D T +EF+A   G K    S+  S+  +  ++Y+  
Sbjct:    69 LKRIEEHNSDP-NRSYERGLNKFSDLTADEFQASYLGGKMEKKSL--SDVAE-RYQYKEG 124

Query:   127 SV-PASIDWRKKGAVTG-VKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
              V P  +DWR++GAV   VK QG+CG CWAF+A  A+EGIN ITT +L SLSEQEL+DCD
Sbjct:   125 DVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCD 184

Query:   185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASD-GSCNKKEANPS-AAKISGYED 242
                ++ GC GG    AFEFI  N G+ ++  Y Y   D  +C   E   +    I+G+E 
Sbjct:   185 RGNDNFGCAGGGAVWAFEFIKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEV 244

Query:   243 VPSNNEAALMKAVANQPVSVAIDASG-SDFQFYSSGVFTGQCGTEL-DHGVTAVGYGTAD 300
             VP N+E +L KAVA QP+SV I A+  SD   Y SGV+ G C     DH V  VGYGT+ 
Sbjct:   245 VPVNDEMSLKKAVAYQPISVMISAANMSD---YKSGVYKGACSNLWGDHNVLIVGYGTSS 301

Query:   301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             D   YWL++NSWG  WGE GY+R+QR+     G C +A+   YP
Sbjct:   302 DEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPTGKCAVAVAPVYP 345


>DICTYBASE|DDB_G0279799 [details] [associations]
            symbol:cprB "cysteine proteinase 2" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0279799 GenomeReviews:CM000152_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            MEROPS:I29.003 KO:K01365 EMBL:AAFI02000033 EMBL:M16039 EMBL:X03344
            PIR:A25439 RefSeq:XP_641494.1 ProteinModelPortal:P04989 SMR:P04989
            EnsemblProtists:DDB0214998 GeneID:8622234 KEGG:ddi:DDB_G0279799
            OMA:YVNITAG Uniprot:P04989
        Length = 376

 Score = 563 (203.2 bits), Expect = 1.1e-66, Sum P(2) = 1.1e-66
 Identities = 115/258 (44%), Positives = 161/258 (62%)

Query:    42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
             W  ++ R Y  ++E   R+ IFK N++Y+ ++N+K  ++   LG+N FAD TNEE+R   
Sbjct:    39 WTLKFNRQY-SSSEFSNRYSIFKSNMDYVDNWNSKGDSQTV-LGLNNFADITNEEYRKTY 96

Query:   102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
              G +    S    +  +V    +  + P SIDWR K AVT +KDQGQCG CW+FS   + 
Sbjct:    97 LGTRVNAHSYNGYDGREVLNVEDLQTNPKSIDWRTKNAVTPIKDQGQCGSCWSFSTTGST 156

Query:   162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
             EG + + T+KL SLSEQ LVDC    E+ GC+GGLM++AF++II NKG+ TE+ YPY A 
Sbjct:   157 EGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAFDYIIKNKGIDTESSYPYTAE 216

Query:   222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF-T 280
              GS      +   A I GY ++ + +E +L     + PVSVAIDAS + FQ Y+SG++  
Sbjct:   217 TGSTCLFNKSDIGATIKGYVNITAGSEISLENGAQHGPVSVAIDASHNSFQLYTSGIYYE 276

Query:   281 GQCG-TELDHGVTAVGYG 297
              +C  TELDHGV  VGYG
Sbjct:   277 PKCSPTELDHGVLVVGYG 294

 Score = 133 (51.9 bits), Expect = 1.1e-66, Sum P(2) = 1.1e-66
 Identities = 24/42 (57%), Positives = 30/42 (71%)

Query:   305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
             YW+VKNSWGT+WG  GYI M +D   ++  CGIA  +SYP A
Sbjct:   338 YWIVKNSWGTSWGIKGYILMSKD---RKNNCGIASVSSYPLA 376


>RGD|2448 [details] [associations]
            symbol:Ctsl1 "cathepsin L1" species:10116 "Rattus norvegicus"
            [GO:0002250 "adaptive immune response" evidence=ISO] [GO:0004177
            "aminopeptidase activity" evidence=IDA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISO;IDA] [GO:0005576 "extracellular
            region" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IDA]
            [GO:0005764 "lysosome" evidence=ISO;IDA] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0005902 "microvillus" evidence=IDA] [GO:0006508
            "proteolysis" evidence=IEP;ISO] [GO:0007154 "cell communication"
            evidence=IDA] [GO:0007283 "spermatogenesis" evidence=IEP] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0008584 "male
            gonad development" evidence=IEP] [GO:0009267 "cellular response to
            starvation" evidence=IEP] [GO:0009749 "response to glucose stimulus"
            evidence=IEP] [GO:0009897 "external side of plasma membrane"
            evidence=IDA] [GO:0010259 "multicellular organismal aging"
            evidence=IEP] [GO:0014070 "response to organic cyclic compound"
            evidence=IEP] [GO:0021675 "nerve development" evidence=IEP]
            [GO:0030984 "kininogen binding" evidence=IPI] [GO:0032403 "protein
            complex binding" evidence=IPI] [GO:0034698 "response to gonadotropin
            stimulus" evidence=IEP] [GO:0042277 "peptide binding" evidence=IDA]
            [GO:0042393 "histone binding" evidence=ISO] [GO:0043005 "neuron
            projection" evidence=IDA] [GO:0043204 "perikaryon" evidence=IDA]
            [GO:0046697 "decidualization" evidence=IEP] [GO:0048102 "autophagic
            cell death" evidence=IEP] [GO:0051384 "response to glucocorticoid
            stimulus" evidence=IEP] [GO:0060008 "Sertoli cell differentiation"
            evidence=IEP] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=ISO] [GO:0030141 "secretory granule" evidence=IDA]
            [GO:0045177 "apical part of cell" evidence=IDA] [GO:0060441
            "epithelial tube branching involved in lung morphogenesis"
            evidence=ISO] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:Y00697 RGD:2448
            GO:GO:0005576 GO:GO:0009897 GO:GO:0034698 GO:GO:0043204 GO:GO:0009749
            GO:GO:0051384 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0043005 GO:GO:0007283
            GO:GO:0004177 GO:GO:0005764 GO:GO:0042277 GO:GO:0009267 GO:GO:0021675
            GO:GO:0005902 GO:GO:0010259 GO:GO:0004197 GO:GO:0048102 GO:GO:0046697
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 KO:K01365
            OrthoDB:EOG48PMKF MEROPS:C01.032 OMA:FDQNLDT CTD:1514
            BRENDA:3.4.22.15 GO:GO:0060008 EMBL:AF025476 EMBL:BC063175
            EMBL:S85184 IPI:IPI00326070 PIR:S07098 RefSeq:NP_037288.1
            UniGene:Rn.1294 ProteinModelPortal:P07154 SMR:P07154 IntAct:P07154
            STRING:P07154 PhosphoSite:P07154 PRIDE:P07154
            Ensembl:ENSRNOT00000025462 GeneID:25697 KEGG:rno:25697 UCSC:RGD:2448
            InParanoid:P07154 SABIO-RK:P07154 BindingDB:P07154 ChEMBL:CHEMBL2305
            NextBio:607715 Genevestigator:P07154 GermOnline:ENSRNOG00000018566
            Uniprot:P07154
        Length = 334

 Score = 676 (243.0 bits), Expect = 1.7e-66, P = 1.7e-66
 Identities = 151/343 (44%), Positives = 202/343 (58%)

Query:    10 LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
             L+L A+L LG     + +    D T N +   W + + R+Y  N E+E R  ++++N+  
Sbjct:     4 LLLLAVLCLGT----ALATPKFDQTFNAQWHQWKSTHRRLYGTN-EEEWRRAVWEKNMRM 58

Query:    70 IASFNNKARNKP--YKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS 127
             I   N +  N    + + +N F D TNEEFR   NGY+ +    +           +   
Sbjct:    59 IQLHNGEYSNGKHGFTMEMNAFGDMTNEEFRQIVNGYRHQ--KHKKGRLFQEPLMLQ--- 113

Query:   128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
             +P ++DWR+KG VT VK+QGQCG CWAFSA   +EG   + T KL SLSEQ LVDC    
Sbjct:   114 IPKTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQ 173

Query:   188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
              +QGC GGLMD AF++I  N GL +E  YPY+A DGSC K  A  + A  +G+ D+P   
Sbjct:   174 GNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSC-KYRAEYAVANDTGFVDIPQQ- 231

Query:   248 EAALMKAVANQ-PVSVAIDASGSDFQFYSSGVF-TGQCGT-ELDHGVTAVGYG---TADD 301
             E ALMKAVA   P+SVA+DAS    QFYSSG++    C + +LDHGV  VGYG   T  +
Sbjct:   232 EKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSN 291

Query:   302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
               KYWLVKNSWG  WG +GYI++ +D   +   CG+A  ASYP
Sbjct:   292 KDKYWLVKNSWGKEWGMDGYIKIAKD---RNNHCGLATAASYP 331


>UNIPROTKB|P07711 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9606 "Homo sapiens"
            [GO:0005576 "extracellular region" evidence=NAS] [GO:0005764
            "lysosome" evidence=IDA;NAS] [GO:0006508 "proteolysis"
            evidence=IDA] [GO:0002250 "adaptive immune response" evidence=IEP]
            [GO:0019882 "antigen processing and presentation" evidence=TAS]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0036021 "endolysosome lumen" evidence=TAS] [GO:0043202
            "lysosomal lumen" evidence=TAS] [GO:0045087 "innate immune
            response" evidence=TAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042393 "histone binding" evidence=IDA] [GO:0005634 "nucleus"
            evidence=TAS] [GO:0071888 "macrophage apoptotic process"
            evidence=NAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            Reactome:REACT_118779 EMBL:X12451 GO:GO:0005634 Reactome:REACT_6900
            GO:GO:0005576 GO:GO:0019886 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087
            GO:GO:0042393 GO:GO:0004197 GO:GO:0002250 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0036021 KO:K01365 OrthoDB:EOG48PMKF EMBL:M20496
            EMBL:CR457053 EMBL:BX537395 EMBL:AL160279 EMBL:BC012612 EMBL:X05256
            IPI:IPI00012887 PIR:S01002 RefSeq:NP_001244900.1
            RefSeq:NP_001244901.1 RefSeq:NP_001903.1 RefSeq:NP_666023.1
            UniGene:Hs.731507 UniGene:Hs.731952 PDB:1CJL PDB:1CS8 PDB:1ICF
            PDB:1MHW PDB:2NQD PDB:2VHS PDB:2XU1 PDB:2XU3 PDB:2XU4 PDB:2XU5
            PDB:2YJ2 PDB:2YJ8 PDB:2YJ9 PDB:2YJB PDB:2YJC PDB:3BC3 PDB:3H89
            PDB:3H8B PDB:3H8C PDB:3HHA PDB:3HWN PDB:3IV2 PDB:3K24 PDB:3KSE
            PDB:3OF8 PDB:3OF9 PDBsum:1CJL PDBsum:1CS8 PDBsum:1ICF PDBsum:1MHW
            PDBsum:2NQD PDBsum:2VHS PDBsum:2XU1 PDBsum:2XU3 PDBsum:2XU4
            PDBsum:2XU5 PDBsum:2YJ2 PDBsum:2YJ8 PDBsum:2YJ9 PDBsum:2YJB
            PDBsum:2YJC PDBsum:3BC3 PDBsum:3H89 PDBsum:3H8B PDBsum:3H8C
            PDBsum:3HHA PDBsum:3HWN PDBsum:3IV2 PDBsum:3K24 PDBsum:3KSE
            PDBsum:3OF8 PDBsum:3OF9 ProteinModelPortal:P07711 SMR:P07711
            IntAct:P07711 STRING:P07711 MEROPS:I29.001 PhosphoSite:P07711
            DMDM:115741 PaxDb:P07711 PeptideAtlas:P07711 PRIDE:P07711
            DNASU:1514 Ensembl:ENST00000340342 Ensembl:ENST00000343150
            GeneID:1514 KEGG:hsa:1514 UCSC:uc004aph.3 CTD:1514
            GeneCards:GC09P090341 H-InvDB:HIX0058839 H-InvDB:HIX0170314
            HGNC:HGNC:2537 HPA:CAB000459 MIM:116880 neXtProt:NX_P07711
            PharmGKB:PA162382890 InParanoid:P07711 OMA:REPLFAQ PhylomeDB:P07711
            BRENDA:3.4.22.15 BindingDB:P07711 ChEMBL:CHEMBL3837 ChiTaRS:CTSL1
            DrugBank:DB00040 EvolutionaryTrace:P07711 GenomeRNAi:1514
            NextBio:6271 PMAP-CutDB:P07711 ArrayExpress:P07711 Bgee:P07711
            CleanEx:HS_CTSL1 Genevestigator:P07711 GermOnline:ENSG00000135047
            GO:GO:0071888 Uniprot:P07711
        Length = 333

 Score = 675 (242.7 bits), Expect = 2.2e-66, P = 2.2e-66
 Identities = 150/339 (44%), Positives = 197/339 (58%)

Query:    16 LVLGVWAPQSWSRTLN-DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN 74
             L+L  +     S TL  D ++  +   W A + R+Y  N E+  R  ++++N++ I   N
Sbjct:     5 LILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKMIELHN 63

Query:    75 NKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASI 132
              + R     + + +N F D T+EEFR   NG++ R P  R  +       YE    P S+
Sbjct:    64 QEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKP--RKGKVFQEPLFYE---APRSV 118

Query:   133 DWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGC 192
             DWR+KG VT VK+QGQCG CWAFSA  A+EG     T +L SLSEQ LVDC     ++GC
Sbjct:   119 DWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGC 178

Query:   193 EGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALM 252
              GGLMD AF+++  N GL +E  YPY+A++ SC K     S A  +G+ D+P   E ALM
Sbjct:   179 NGGLMDYAFQYVQDNGGLDSEESYPYEATEESC-KYNPKYSVANDTGFVDIPKQ-EKALM 236

Query:   253 KAVANQ-PVSVAIDASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYG---TADDGTKYW 306
             KAVA   P+SVAIDA    F FY  G+ F   C +E +DHGV  VGYG   T  D  KYW
Sbjct:   237 KAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYW 296

Query:   307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
             LVKNSWG  WG  GY++M +D   +   CGIA  ASYPT
Sbjct:   297 LVKNSWGEEWGMGGYVKMAKD---RRNHCGIASAASYPT 332


>UNIPROTKB|A4IFS7 [details] [associations]
            symbol:CTSL1 "CTSL1 protein" species:9913 "Bos taurus"
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197 GO:GO:0002250
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 GO:GO:0097067
            OrthoDB:EOG48PMKF MEROPS:C01.032 CTD:1514 EMBL:DAAA02023987
            EMBL:BC134741 IPI:IPI00708619 RefSeq:NP_001077155.1
            UniGene:Bt.23199 SMR:A4IFS7 Ensembl:ENSBTAT00000000962
            GeneID:515200 KEGG:bta:515200 InParanoid:A4IFS7 OMA:NDEQALM
            NextBio:20871707 Uniprot:A4IFS7
        Length = 333

 Score = 673 (242.0 bits), Expect = 3.6e-66, P = 3.6e-66
 Identities = 146/322 (45%), Positives = 200/322 (62%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGINEF 89
             D +++ + ++W A + + Y D  E+  R  ++K+N++ I   N + ++ K  + + +N F
Sbjct:    22 DHSLDTQWKLWKAAHRKPY-DLNEEGWRKAVWKKNMKMIELHNQEYSQGKHSFSMAMNAF 80

Query:    90 ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
              D TNEEFR   NG++R+  + +  E  +  F    AS+P S+DWR+KG VT VK+QG+C
Sbjct:    81 GDMTNEEFRHTMNGFQRQ-KNKKGKEFHETIF----ASIPPSVDWREKGYVTPVKNQGKC 135

Query:   150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
             G CWAFSA  A+EG     T KL SLSEQ LVDC     ++GC GG +D+AF++++   G
Sbjct:   136 GSCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQPEGNRGCHGGFIDNAFQYVLDVGG 195

Query:   210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASG 268
             L +E  YPY    G+C     N SAA  +G+ D+P   E ALMKAVAN  P+SVA+DA  
Sbjct:   196 LDSEESYPYTGLVGTC-LYNPNNSAANETGFVDLPKQ-EKALMKAVANLGPISVAVDAHN 253

Query:   269 SDFQFYSSGVF-TGQCGTE-LDHGVTAVGYGT--AD-DGTKYWLVKNSWGTTWGENGYIR 323
               FQFY SG++    C +E +DH V  VGYG   AD D  KYWLVKNSWG  WG NGYI+
Sbjct:   254 PSFQFYKSGIYYEPNCSSESVDHAVLVVGYGFEGADSDDNKYWLVKNSWGEHWGMNGYIK 313

Query:   324 MQRDIDAKEGLCGIAMQASYPT 345
             M +D   +   CGIA  ASYPT
Sbjct:   314 MAKD---RNNHCGIATMASYPT 332


>ZFIN|ZDB-GENE-980526-285 [details] [associations]
            symbol:ctsl1b "cathepsin L, 1 b" species:7955
            "Danio rerio" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005576
            "extracellular region" evidence=IDA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-980526-285 GO:GO:0005576 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:BX465840 IPI:IPI00498443 Ensembl:ENSDART00000145570
            Bgee:F1R7B3 Uniprot:F1R7B3
        Length = 352

 Score = 673 (242.0 bits), Expect = 3.6e-66, P = 3.6e-66
 Identities = 145/321 (45%), Positives = 192/321 (59%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA--SFNNKARNKPYKLGINEF 89
             D  +++    W +Q+G+ Y ++ E   R  I++EN+  I   +F     N  +K+G+N+F
Sbjct:    37 DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSYGNHTFKMGMNQF 95

Query:    90 ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
              D TNEEFR   NGY    P+  S     +   +  A  P  +DWR++G VT VKDQ QC
Sbjct:    96 GDMTNEEFRQAMNGYTHD-PNQTSQGPLFMEPSFFAA--PQQVDWRQRGYVTPVKDQKQC 152

Query:   150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
             G CW+FS+  A+EG     T KL S+SEQ LVDC     +QGC GGLMD AF+++  NKG
Sbjct:   153 GSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGLMDQAFQYVKENKG 212

Query:   210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASG 268
             L +E  YPY A D    + +   + AKI+G+ D+PS NE ALM AVA   PVSVAIDAS 
Sbjct:   213 LDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNELALMNAVAAVGPVSVAIDASH 272

Query:   269 SDFQFYSSGVFTGQ-CGTE-LDHGVTAVGYGT--AD-DGTKYWLVKNSWGTTWGENGYIR 323
                QFY SG++  + C +  LDH V  VGYG   AD  G +YW+VKNSW   WG+ GYI 
Sbjct:   273 QSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIY 332

Query:   324 MQRDIDAKEGLCGIAMQASYP 344
             M +D   K   CG+A +ASYP
Sbjct:   333 MAKD---KNNHCGVATKASYP 350


>UNIPROTKB|P25975 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 EMBL:X91755 EMBL:BC102312 EMBL:AB017648
            IPI:IPI00687440 PIR:S15845 RefSeq:NP_776457.1 UniGene:Bt.3987
            ProteinModelPortal:P25975 SMR:P25975 STRING:P25975
            Ensembl:ENSBTAT00000022710 Ensembl:ENSBTAT00000036427 GeneID:281108
            KEGG:bta:281108 CTD:1515 InParanoid:P25975 KO:K01365 OMA:EEFRATH
            OrthoDB:EOG48PMKF BindingDB:P25975 ChEMBL:CHEMBL2113
            NextBio:20805179 ArrayExpress:P25975 Uniprot:P25975
        Length = 334

 Score = 671 (241.3 bits), Expect = 5.8e-66, P = 5.8e-66
 Identities = 155/347 (44%), Positives = 204/347 (58%)

Query:     8 NKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
             N      +L LGV    S +  L D  ++     W A + R+Y  N E+E R  ++++N 
Sbjct:     2 NPSFFLTVLCLGV---ASAAPKL-DPNLDAHWHQWKATHRRLYGMN-EEEWRRAVWEKNK 56

Query:    68 EYIASFNNK-ARNKP-YKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN 125
             + I   N + +  K  +++ +N F D TNEEFR   NG++ +       +   +      
Sbjct:    57 KIIDLHNQEYSEGKHGFRMAMNAFGDMTNEEFRQVMNGFQNQ-----KHKKGKLFHEPLL 111

Query:   126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
               VP S+DW KKG VT VK+QGQCG CWAFSA  A+EG     T KL SLSEQ LVDC  
Sbjct:   112 VDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSR 171

Query:   186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASD-GSCNKKEANPSAAKISGYEDVP 244
             +  +QGC GGLMD+AF++I  N GL +E  YPY A+D  SCN K    SAA  +G+ D+P
Sbjct:   172 AQGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKP-ECSAANDTGFVDIP 230

Query:   245 SNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTG-QCGT-ELDHGVTAVGYG---T 298
                E ALMKAVA   P+SVAIDA  + FQFY SG++    C + +LDHGV  VGYG   T
Sbjct:   231 QR-EKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGT 289

Query:   299 ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
               +  K+W+VKNSWG  WG NGY++M +D   +   CGIA  ASYPT
Sbjct:   290 DSNNNKFWIVKNSWGPEWGWNGYVKMAKD---QNNHCGIATAASYPT 333


>UNIPROTKB|Q28944 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9823 "Sus scrofa"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 KO:K01365 OrthoDB:EOG48PMKF MEROPS:C01.032
            CTD:1514 EMBL:D37917 EMBL:AJ315771 PIR:A58195 RefSeq:NP_999057.1
            UniGene:Ssc.54036 ProteinModelPortal:Q28944 SMR:Q28944
            STRING:Q28944 Ensembl:ENSSSCT00000012233 GeneID:396926
            KEGG:ssc:396926 OMA:DASETGK ArrayExpress:Q28944 Uniprot:Q28944
        Length = 334

 Score = 667 (239.9 bits), Expect = 1.5e-65, P = 1.5e-65
 Identities = 146/313 (46%), Positives = 193/313 (61%)

Query:    42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNKP-YKLGINEFADQTNEEFRA 99
             W A +GR+Y  N E+  R  ++++N++ I   N + ++ K  + + +N F D TNEEFR 
Sbjct:    32 WKATHGRLYGMN-EEGWRRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAFGDMTNEEFRQ 90

Query:   100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
               NG++ +    +  +    S   E   VP S+DWR+KG VT VK+QGQCG CWAFSA  
Sbjct:    91 VMNGFQNQ--KHKKGKVFHESLVLE---VPKSVDWREKGYVTAVKNQGQCGSCWAFSATG 145

Query:   160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
             A+EG     T KL SLSEQ LVDC     +QGC GGLMD+AF+++  N GL TE  YPY 
Sbjct:   146 ALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEESYPYL 205

Query:   220 ASD-GSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSG 277
               +  SC  K    SAA  +G+ D+P   E ALMKAVA   P+SVAIDA  S FQFY SG
Sbjct:   206 GRETNSCTYKP-ECSAANDTGFVDIPQR-EKALMKAVATVGPISVAIDAGHSSFQFYKSG 263

Query:   278 VFTG-QCGT-ELDHGVTAVGYG---TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKE 332
             ++    C + +LDHGV  VGYG   T  + +K+W+VKNSWG  WG NGY++M +D   + 
Sbjct:   264 IYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGPEWGWNGYVKMAKD---QN 320

Query:   333 GLCGIAMQASYPT 345
               CGI+  ASYPT
Sbjct:   321 NHCGISTAASYPT 333


>MGI|MGI:107341 [details] [associations]
            symbol:Ctss "cathepsin S" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=ISO;IDA] [GO:0006508 "proteolysis"
            evidence=IDA] [GO:0008152 "metabolic process" evidence=ISO]
            [GO:0008233 "peptidase activity" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0009986 "cell
            surface" evidence=ISO] [GO:0016020 "membrane" evidence=IDA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0045453 "bone
            resorption" evidence=ISO] [GO:0051930 "regulation of sensory
            perception of pain" evidence=ISO] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:107341 GO:GO:0016020 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0008233 GO:GO:0031905 Reactome:REACT_102124
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 BRENDA:3.4.22.27
            ChiTaRS:CTSS EMBL:AF051732 EMBL:AF051727 EMBL:AF051728
            EMBL:AF051729 EMBL:AF051726 EMBL:AF051730 EMBL:AF051731
            EMBL:AF038546 EMBL:AJ002386 EMBL:AC092203 EMBL:Y18466 EMBL:AJ223208
            IPI:IPI00309520 UniGene:Mm.3619 PDB:1M0H PDBsum:1M0H
            ProteinModelPortal:O70370 SMR:O70370 STRING:O70370
            PhosphoSite:O70370 PaxDb:O70370 PRIDE:O70370
            Ensembl:ENSMUST00000116304 BindingDB:O70370 ChEMBL:CHEMBL4098
            NextBio:282932 Bgee:O70370 CleanEx:MM_CTSS Genevestigator:O70370
            GermOnline:ENSMUSG00000038642 Uniprot:O70370
        Length = 340

 Score = 666 (239.5 bits), Expect = 2.0e-65, P = 2.0e-65
 Identities = 147/320 (45%), Positives = 197/320 (61%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEF 89
             D T++   ++W   + + Y+D  E+E+R  I+++N+++I   N +       Y++G+N+ 
Sbjct:    29 DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 88

Query:    90 ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR-YENASVPASIDWRKKGAVTGVKDQGQ 148
              D TNEE    R G   R+P  R S  T V+FR Y N ++P ++DWR+KG VT VK QG 
Sbjct:    89 GDMTNEEILC-RMG-ALRIP--RQSPKT-VTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 143

Query:   149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE--DQGCEGGLMDDAFEFIIS 206
             CG CWAFSAV A+EG   + T KL SLS Q LVDC    +  ++GC GG M +AF++II 
Sbjct:   144 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 203

Query:   207 NKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAID 265
             N G+  +A YPYKA+D  C+    N  AA  S Y  +P  +E AL +AVA + PVSV ID
Sbjct:   204 NGGIEADASYPYKATDEKCHYNSKN-RAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 262

Query:   266 ASGSDFQFYSSGVFTG-QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
             AS S F FY SGV+    C   ++HGV  VGYGT D G  YWLVKNSWG  +G+ GYIRM
Sbjct:   263 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLD-GKDYWLVKNSWGLNFGDQGYIRM 321

Query:   325 QRDIDAKEGLCGIAMQASYP 344
              R+    +  CGIA   SYP
Sbjct:   322 ARN---NKNHCGIASYCSYP 338


>ZFIN|ZDB-GENE-030131-106 [details] [associations]
            symbol:ctsl1a "cathepsin L, 1 a" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030131-106 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 HSSP:P43235
            KO:K01365 EMBL:BC066490 IPI:IPI00495935 RefSeq:NP_997749.1
            UniGene:Dr.104499 ProteinModelPortal:Q6NYR5 SMR:Q6NYR5
            MEROPS:C01.074 PRIDE:Q6NYR5 GeneID:321453 KEGG:dre:321453
            CTD:321453 InParanoid:Q6NYR5 NextBio:20807387 ArrayExpress:Q6NYR5
            Bgee:Q6NYR5 Uniprot:Q6NYR5
        Length = 337

 Score = 666 (239.5 bits), Expect = 2.0e-65, P = 2.0e-65
 Identities = 146/323 (45%), Positives = 189/323 (58%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN--NKARNKPYKLGINEF 89
             D  +N+  + W   + + Y    E+  R  I+++N++ I   N  +      Y+LG+N F
Sbjct:    22 DQQLNDHWDQWKKWHSKKYHAT-EEGWRRVIWEKNLKKIEMHNLEHSMGIHTYRLGMNHF 80

Query:    90 ADQTNEEFRAPRNGYKRRLPS-VRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQ 148
              D T+EEFR   NG+K +     R S   + +F      VP  +DWR+KG VT VKDQG+
Sbjct:    81 GDMTHEEFRQVMNGFKHKKDRRFRGSLFMEPNF----IEVPNKLDWREKGYVTPVKDQGE 136

Query:   149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
             CG CWAFS   A+EG     T KL SLSEQ LVDC     ++GC GGLMD AF+++    
Sbjct:   137 CGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDQN 196

Query:   209 GLATEAKYPYKASDGS-CNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDA 266
             GL +E  YPY  +D   C+    N SAA  +G+ D+PS  E ALMKA+A   PVSVAIDA
Sbjct:   197 GLDSEESYPYLGTDDQPCHFDPKN-SAANDTGFVDIPSGKERALMKAIAAVGPVSVAIDA 255

Query:   267 SGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYGTAD---DGTKYWLVKNSWGTTWGENGY 321
                 FQFY SG++   +C +E LDHGV AVGYG      DG KYW+VKNSW   WG+ GY
Sbjct:   256 GHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVDGKKYWIVKNSWSENWGDKGY 315

Query:   322 IRMQRDIDAKEGLCGIAMQASYP 344
             I M +D   +   CGIA  ASYP
Sbjct:   316 IYMAKD---RHNHCGIATAASYP 335


>UNIPROTKB|Q9GL24 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 CTD:1515 KO:K01365
            OrthoDB:EOG48PMKF EMBL:AJ279008 RefSeq:NP_001239115.1
            UniGene:Cfa.3571 ProteinModelPortal:Q9GL24 SMR:Q9GL24
            MEROPS:C01.032 Ensembl:ENSCAFT00000001770
            Ensembl:ENSCAFT00000023837 GeneID:100684364 KEGG:cfa:100684364
            InParanoid:Q9GL24 OMA:FDQNLDT NextBio:20817211 Uniprot:Q9GL24
        Length = 333

 Score = 665 (239.2 bits), Expect = 2.5e-65, P = 2.5e-65
 Identities = 146/322 (45%), Positives = 195/322 (60%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNKP-YKLGINEF 89
             D ++N +   W A + R+Y  N E+  R  ++++N++ I   N + ++ K  + + +N F
Sbjct:    22 DQSLNAQWYQWKATHRRLYGMN-EEGWRRAVWEKNMKMIELHNREYSQGKHGFTMAMNAF 80

Query:    90 ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
              D TNEEFR   NG++ +    +     +  F    A +P S+DWR+KG VT VK+QGQC
Sbjct:    81 GDMTNEEFRQVMNGFQNQKHK-KGKMFQEPLF----AEIPKSVDWREKGYVTPVKNQGQC 135

Query:   150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
             G CWAFSA  A+EG     T KL SLSEQ LVDC  +  ++GC GGLMD+AF ++  N G
Sbjct:   136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAFRYVKDNGG 195

Query:   210 LATEAKYPYKASDG-SCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDAS 267
             L +E  YPY   D  +CN K    SAA  +G+ D+P   E ALMKAVA   P+SVAIDA 
Sbjct:   196 LDSEESYPYLGRDTETCNYKP-ECSAANDTGFVDLPQR-EKALMKAVATLGPISVAIDAG 253

Query:   268 GSDFQFYSSGV-FTGQCGT-ELDHGVTAVGYGT--ADDGTKYWLVKNSWGTTWGENGYIR 323
                FQFY SG+ F   C + +LDHGV  VGYG    D   K+W+VKNSWG  WG NGY++
Sbjct:   254 HQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKFWIVKNSWGPEWGWNGYVK 313

Query:   324 MQRDIDAKEGLCGIAMQASYPT 345
             M +D   +   CGIA  ASYPT
Sbjct:   314 MAKD---QNNHCGIATAASYPT 332


>ZFIN|ZDB-GENE-041010-76 [details] [associations]
            symbol:ctsll "cathepsin L, like" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-041010-76
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            OrthoDB:EOG48PMKF MEROPS:C01.032 EMBL:BX119902 IPI:IPI00616622
            UniGene:Dr.79994 SMR:A2BEM8 Ensembl:ENSDART00000144226
            InParanoid:A2BEM8 OMA:PRYSAAN Uniprot:A2BEM8
        Length = 337

 Score = 665 (239.2 bits), Expect = 2.5e-65, P = 2.5e-65
 Identities = 147/343 (42%), Positives = 206/343 (60%)

Query:    10 LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
             L+ A+++ L + A  + + TL D  +++   +W   + + Y +  E+  R  ++++N++ 
Sbjct:     2 LLFASLVTLCISAVFA-APTL-DQKLDDHWHLWKRWHEKSYHEK-EEGWRRMVWEKNLKK 58

Query:    70 IASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS 127
             I   N  +      ++LG+N+F D TNEEFR   NGY R  P+ +S  +  +   +  A 
Sbjct:    59 IELHNLEHSVGKHTFRLGMNQFGDMTNEEFRQAMNGYNRD-PNRKSKGSLFIEPSFFTA- 116

Query:   128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
              P  IDWR+KG VT +KDQ +CG CWAFS+  A+EG     T KL SLSEQ L+DC    
Sbjct:   117 -PQQIDWRQKGYVTPIKDQKRCGSCWAFSSTGALEGQVFRKTGKLVSLSEQNLMDCSRPQ 175

Query:   188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
              + GC+GGLMD AF+++  N GL +E  YPY A+D      +   SAA ++G+ D+PS  
Sbjct:   176 GNNGCDGGLMDQAFQYVQDNNGLDSEESYPYLATDDQPCHYDPRYSAANVTGFVDIPSGK 235

Query:   248 EAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTGQ-CGTE-LDHGVTAVGYGTAD-D-- 301
             E ALMKAVA   PV+VAIDA    FQFY SG++  + C TE LDHGV  VGYG    D  
Sbjct:   236 EHALMKAVAAVGPVAVAIDAGHESFQFYQSGIYYEKACSTEELDHGVLVVGYGYEGVDVA 295

Query:   302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             G +YW+VKNSW   WG+ GYI M +D+   +  CGIA  ASYP
Sbjct:   296 GRRYWIVKNSWTDRWGDKGYIYMAKDL---KNHCGIATSASYP 335


>DICTYBASE|DDB_G0279185
            symbol:cprF "cysteine proteinase 6" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0279185 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000030 HSSP:P07711 ProtClustDB:CLSZ2846820 EMBL:U72745
            RefSeq:XP_641725.1 ProteinModelPortal:Q94503 SMR:Q94503
            MEROPS:C01.081 PRIDE:Q94503 EnsemblProtists:DDB0215002
            GeneID:8621921 KEGG:ddi:DDB_G0279185 Uniprot:Q94503
        Length = 434

 Score = 545 (196.9 bits), Expect = 4.2e-65, Sum P(2) = 4.2e-65
 Identities = 128/298 (42%), Positives = 174/298 (58%)

Query:    11 VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
             VL+A+ VL V    +  + L++         WM  + R Y  + E   RF IFK N++YI
Sbjct:     3 VLSALCVLLVSVATA-KQQLSELQYRNAFTNWMIAHQRHY-SSEEFNGRFNIFKANMDYI 60

Query:    71 ASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPA 130
               +N K       LG+N FAD TNEE+RA   G      S+  + +  V F    A+   
Sbjct:    61 NEWNTKGSETV--LGLNVFADITNEEYRATYLGTPFDASSLEMTPSEKV-FGGVQAN--- 114

Query:   131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITT--RKLTSLSEQELVDCDTSGE 188
             S+DWR KGAVT +K+QG+CG CW+FSA  A EG  +I      LTS+SEQ+L+DC  S  
Sbjct:   115 SVDWRAKGAVTPIKNQGECGGCWSFSATGATEGAQYIANGDSDLTSVSEQQLIDCSGSYG 174

Query:   189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
             + GCEGGLM  AFE+II+N G+ TE+ YP+ A+   C    +N   A++S Y +V S +E
Sbjct:   175 NNGCEGGLMTLAFEYIINNGGIDTESSYPFTANTEKCKYNPSN-IGAELSSYVNVTSGSE 233

Query:   249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQ-CG-TELDHGVTAVGYGTADDGTK 304
             + L   V   P SVAIDAS   FQFYSSG++    C  T+LDHGV AVG+G+   G++
Sbjct:   234 SDLAAKVTQGPTSVAIDASQPSFQFYSSGIYNEPACSSTQLDHGVLAVGFGSGSSGSQ 291

 Score = 136 (52.9 bits), Expect = 4.2e-65, Sum P(2) = 4.2e-65
 Identities = 27/46 (58%), Positives = 30/46 (65%)

Query:   301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
             DG  YW+VKNSWG  WG NGYI M +D   K+  CGIA  AS P A
Sbjct:   386 DGN-YWIVKNSWGLDWGINGYILMSKD---KDNQCGIATMASIPQA 427


>UNIPROTKB|Q5E998
            symbol:CTSL2 "Cathepsin L2" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            HOVERGEN:HBG011513 UniGene:Bt.3987 MEROPS:C01.032 EMBL:BT021022
            IPI:IPI00711962 ProteinModelPortal:Q5E998 SMR:Q5E998 STRING:Q5E998
            InParanoid:Q5E998 Uniprot:Q5E998
        Length = 334

 Score = 662 (238.1 bits), Expect = 5.2e-65, P = 5.2e-65
 Identities = 154/347 (44%), Positives = 203/347 (58%)

Query:     8 NKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
             N      +L LGV    S +  L D  ++     W A + R+Y  N E+E R  ++++N 
Sbjct:     2 NPSFFLTVLCLGV---ASAAPKL-DPNLDAHWHQWKATHRRLYGMN-EEEWRRAVWEKNK 56

Query:    68 EYIASFNNK-ARNKP-YKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN 125
             + I   N + +  K  +++ +N F D TNEEFR   NG++ +       +   +      
Sbjct:    57 KIIDLHNQEYSEGKHGFRMAMNAFGDMTNEEFRQVMNGFQNQ-----KHKKGKLFHEPLL 111

Query:   126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
               VP S+DW KKG VT VK+QGQCG CWAFSA  A+EG     T KL SLSEQ LVDC  
Sbjct:   112 VDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSR 171

Query:   186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASD-GSCNKKEANPSAAKISGYEDVP 244
             +  +QGC GGLMD+AF++I  N  L +E  YPY A+D  SCN K    SAA  +G+ D+P
Sbjct:   172 AQGNQGCNGGLMDNAFQYIKDNGCLDSEESYPYLATDTNSCNYKP-ECSAANDTGFVDIP 230

Query:   245 SNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTG-QCGT-ELDHGVTAVGYG---T 298
                E ALMKAVA   P+SVAIDA  + FQFY SG++    C + +LDHGV  VGYG   T
Sbjct:   231 QR-EKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGT 289

Query:   299 ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
               +  K+W+VKNSWG  WG NGY++M +D   +   CGIA  ASYPT
Sbjct:   290 DSNNNKFWIVKNSWGPEWGWNGYVKMAKD---QNNHCGIATAASYPT 333


>DICTYBASE|DDB_G0283867
            symbol:cprC "cysteine proteinase 3" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0283867 GenomeReviews:CM000153_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:AAFI02000057
            KO:K01365 EMBL:X03930 RefSeq:XP_638859.1 ProteinModelPortal:Q23894
            SMR:Q23894 MEROPS:C01.114 EnsemblProtists:DDB0220784 GeneID:8624257
            KEGG:ddi:DDB_G0283867 OMA:NNVEHIN Uniprot:Q23894
        Length = 337

 Score = 647 (232.8 bits), Expect = 2.0e-63, P = 2.0e-63
 Identities = 138/311 (44%), Positives = 198/311 (63%)

Query:    42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
             WM    + Y  + E   R++ FK+N++Y+ ++N+K  +K   LG+N+ AD +NEE+R   
Sbjct:    37 WMRSNNKAYT-HKEFMPRYEEFKKNMDYVHNWNSKG-SKTV-LGLNQHADLSNEEYRLNY 93

Query:   102 NGYKRRLPSVRSSETTDVSFRYENASV--PASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
              G +  +  +      ++  R        P ++DWR+K AVT VKDQGQCG C++FS   
Sbjct:    94 LGTRAHI-KLNGYHKRNLGLRLNRPQFKQPLNVDWREKDAVTPVKDQGQCGSCYSFSTTG 152

Query:   160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPY- 218
             ++EG+  I T KL SLSEQ ++DC +S  ++GC GGLM +AFE+II N GL +E +YPY 
Sbjct:   153 SVEGVTAIKTGKLVSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYE 212

Query:   219 -KASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSG 277
              K +D  C  +E +  AAKI+ Y+++ + +E  L  A+   PVSVAIDAS + FQ Y++G
Sbjct:   213 MKVND-ECKFQEGSV-AAKITSYKEIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAG 270

Query:   278 VF-TGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
             V+    C +E LDHGV AVG GT D+G  Y++VKNSWG +WG NGYI M R+   K+  C
Sbjct:   271 VYYEPACSSEDLDHGVLAVGMGT-DNGEDYYIVKNSWGPSWGLNGYIHMARN---KDNNC 326

Query:   336 GIAMQASYPTA 346
             GI+  ASYP A
Sbjct:   327 GISTMASYPIA 337


>UNIPROTKB|Q86GF7
            symbol:Cys "Crustapain" species:6703 "Pandalus borealis"
            [GO:0005576 "extracellular region" evidence=IC] [GO:0007586
            "digestion" evidence=NAS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IDA] [GO:0030163 "protein catabolic process"
            evidence=NAS] [GO:0030574 "collagen catabolic process"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005576
            GO:GO:0007586 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0030163 GO:GO:0030574 EMBL:AB091669
            ProteinModelPortal:Q86GF7 SMR:Q86GF7 MEROPS:C01.030 Uniprot:Q86GF7
        Length = 323

 Score = 644 (231.8 bits), Expect = 4.2e-63, P = 4.2e-63
 Identities = 142/314 (45%), Positives = 193/314 (61%)

Query:    40 EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK--ARNKPYKLGINEFADQTNEEF 97
             E +  ++G+ Y ++ E+  R  +F + +++I   N +       Y L IN F+D T+EE 
Sbjct:    21 ENFKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHEEV 80

Query:    98 RAPRNGY-KRRLP-SV--RSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCW 153
              A + G  +RR P SV  +S+ TT ++         A +DWR KGAVT VKDQGQCG CW
Sbjct:    81 LATKTGMTRRRHPLSVLPKSAPTTPMA---------ADVDWRNKGAVTPVKDQGQCGSCW 131

Query:   154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
             AFSAVAA+EG + + T  L SLSEQ LVDC +S  +QGC GG    A+++II+N+G+ TE
Sbjct:   132 AFSAVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRGIDTE 191

Query:   214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQ 272
             + YPYKA D +C + +A    A +S Y +  S +E+AL  AV N+ PVSV IDA  S F 
Sbjct:   192 SSYPYKAIDDNC-RYDAGNIGATVSSYVEPASGDESALQHAVQNEGPVSVCIDAGQSSFG 250

Query:   273 FYSSGVF-TGQCGT-ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDA 330
              Y  GV+    C +   +H VTAVGYGT  +G  YW+VKNSWG  WGE+GYI+M R+ D 
Sbjct:   251 SYGGGVYYEPNCDSWYANHAVTAVGYGTDANGGDYWIVKNSWGAWWGESGYIKMARNRDN 310

Query:   331 KEGLCGIAMQASYP 344
                 C IA  + YP
Sbjct:   311 N---CAIATYSVYP 321


>UNIPROTKB|F1SS93
            symbol:CTSS "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0002250 "adaptive immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0016020 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0002250
            GeneTree:ENSGT00560000076577 GO:GO:0097067 EMBL:CU463875
            Ensembl:ENSSSCT00000007284 OMA:CEIESAV Uniprot:F1SS93
        Length = 342

 Score = 640 (230.4 bits), Expect = 1.1e-62, P = 1.1e-62
 Identities = 142/318 (44%), Positives = 189/318 (59%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN--NKARNKPYKLGINEF 89
             D T++   ++W   YG+ Y++  E+  R  I+++N++ +   N  +      Y LG+N  
Sbjct:    32 DPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSYDLGMNHL 91

Query:    90 ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
              D T+EE  +  +    R+PS      T  S    N  +P S+DWR+KG VT VK QG C
Sbjct:    92 GDMTSEEVISLMSCV--RVPSQWPRNVTYKS--NPNQKLPDSMDWREKGCVTEVKYQGSC 147

Query:   150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG-EDQGCEGGLMDDAFEFIISNK 208
             G CWAFSAV A+E    + T +L SLS Q LVDC T    ++GC GG M +AF++II N 
Sbjct:   148 GSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNN 207

Query:   209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDAS 267
             G+ +EA YPYKA DG C K ++   AA  S Y ++P  +E AL +AVAN+ PVSVAIDA 
Sbjct:   208 GIDSEASYPYKAVDGKC-KYDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAK 266

Query:   268 GSDFQFYSSGVFTG-QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
              S F FY SGV+    C   ++HGV  VGYG  + G  YWLVKNSWG  +G+ GYIRM R
Sbjct:   267 HSSFFFYRSGVYYDPSCTQNVNHGVLVVGYGNLN-GKDYWLVKNSWGLNFGDGGYIRMAR 325

Query:   327 DIDAKEGLCGIAMQASYP 344
             +    E  CGIA   SYP
Sbjct:   326 N---SENHCGIANYPSYP 340


>UNIPROTKB|P25326
            symbol:CTSS "Cathepsin S" species:9913 "Bos taurus"
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0016020 "membrane" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0002250 "adaptive
            immune response" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0016020 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0002250 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            GO:GO:0097067 EMBL:BC102245 EMBL:M95211 EMBL:X62001 IPI:IPI00702008
            PIR:S15844 RefSeq:NP_001028787.1 UniGene:Bt.7938
            ProteinModelPortal:P25326 SMR:P25326 STRING:P25326 PRIDE:P25326
            Ensembl:ENSBTAT00000022774 GeneID:327711 KEGG:bta:327711 CTD:1520
            InParanoid:P25326 KO:K01368 OMA:KAMDQKC OrthoDB:EOG4JM7Q2
            NextBio:20810175 Uniprot:P25326
        Length = 331

 Score = 636 (228.9 bits), Expect = 3.0e-62, P = 3.0e-62
 Identities = 143/335 (42%), Positives = 196/335 (58%)

Query:    20 VWA----PQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN- 74
             VWA      + +    D T++   ++W   YG+ Y++  E+  R  I+++N++ +   N 
Sbjct:     5 VWALLLCSSAMAHVHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVTLHNL 64

Query:    75 -NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE-NASVPASI 132
              +      Y+LG+N   D T+EE  +  +    R+PS       +V+++ + N  +P S+
Sbjct:    65 EHSMGMHSYELGMNHLGDMTSEEVISLMSSL--RVPS---QWPRNVTYKSDPNQKLPDSM 119

Query:   133 DWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE-DQG 191
             DWR+KG VT VK QG CG CWAFSAV A+E    + T KL SLS Q LVDC T+   ++G
Sbjct:   120 DWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKG 179

Query:   192 CEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAAL 251
             C GG M +AF++II N G+ +EA YPYKA DG C     N  AA  S Y ++P  +E AL
Sbjct:   180 CNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDVKN-RAATCSRYIELPFGSEEAL 238

Query:   252 MKAVANQ-PVSVAIDASGSDFQFYSSGVFTG-QCGTELDHGVTAVGYGTADDGTKYWLVK 309
              +AVAN+ PVSV IDAS S F  Y +GV+    C   ++HGV  VGYG  D G  YWLVK
Sbjct:   239 KEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNLD-GKDYWLVK 297

Query:   310 NSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             NSWG  +G+ GYIRM R+       CGIA   SYP
Sbjct:   298 NSWGLHFGDQGYIRMARN---SGNHCGIANYPSYP 329


>DICTYBASE|DDB_G0281605
            symbol:cfaD "peptidase C1A family protein"
            species:44689 "Dictyostelium discoideum" [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IDA] [GO:0006508 "proteolysis" evidence=IDA] [GO:0031410
            "cytoplasmic vesicle" evidence=IDA] [GO:0031288 "sorocarp
            morphogenesis" evidence=IMP] [GO:0008285 "negative regulation of
            cell proliferation" evidence=IGI;IDA] [GO:0005576 "extracellular
            region" evidence=IEA;IDA] [GO:0005515 "protein binding"
            evidence=IPI] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0281605
            GO:GO:0008285 GO:GO:0005615 GenomeReviews:CM000152_GR
            eggNOG:COG4870 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0031410 EMBL:AAFI02000042
            GO:GO:0031288 RefSeq:XP_640530.1 HSSP:P07711
            ProteinModelPortal:Q54TR1 STRING:Q54TR1 PRIDE:Q54TR1
            EnsemblProtists:DDB0229857 GeneID:8623140 KEGG:ddi:DDB_G0281605
            InParanoid:Q54TR1 OMA:PSAHEHE ProtClustDB:CLSZ2430523
            Uniprot:Q54TR1
        Length = 531

 Score = 634 (228.2 bits), Expect = 4.8e-62, P = 4.8e-62
 Identities = 135/310 (43%), Positives = 185/310 (59%)

Query:    42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
             + AQY + Y    E + RF  FK   + IA+ N  A+   YKLG+N +AD +N+EF    
Sbjct:   228 YKAQYNKEYSSQDEHDERFINFKAARKIIATHN--AKESSYKLGMNHYADLSNKEFNTLV 285

Query:   102 NGYKRRLPSVRSSETT--DVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
                  R PSV  +++   D S R    S+P+++DWR +  VT VKDQG CG CW F +  
Sbjct:   286 KPKVAR-PSVTGADSVHDDESLR----SIPSTVDWRNQNCVTPVKDQGICGSCWTFGSTG 340

Query:   160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
             ++EG N +T  +L SLSEQ+LVDC      QGC GG    AF++++    LATE+ YPY 
Sbjct:   341 SLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSLATESNYPYL 400

Query:   220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGV 278
               +G C  +   PS   I+GY +V S +E+AL  A+A   PV++AIDAS  DF++Y SGV
Sbjct:   401 MQNGLCRDRTVTPSGVSITGYVNVTSGSESALQNAIATTGPVAIAIDASVDDFRYYMSGV 460

Query:   279 FTGQ-CGT---ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
             +    C     +LDH V A+GYGT   G  Y+LVKNSW T WG +GY+ M R+ D    L
Sbjct:   461 YNNPACKNGLDDLDHEVLAIGYGTYQ-GQDYFLVKNSWSTNWGMDGYVYMARN-D--NNL 516

Query:   335 CGIAMQASYP 344
             CG++ QA+YP
Sbjct:   517 CGVSSQATYP 526


>RGD|1308751
            symbol:RGD1308751 "similar to Cathepsin L precursor (Major
            excreted protein) (MEP)" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1308751 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:CH474032 IPI:IPI00365697 RefSeq:XP_001065885.2
            RefSeq:XP_225137.5 MEROPS:C01.069 Ensembl:ENSRNOT00000061391
            GeneID:290981 KEGG:rno:290981 UCSC:RGD:1308751 CTD:290981
            OMA:ESYAYEA OrthoDB:EOG42823G NextBio:631921 Uniprot:D3ZKC3
        Length = 330

 Score = 634 (228.2 bits), Expect = 4.8e-62, P = 4.8e-62
 Identities = 150/344 (43%), Positives = 199/344 (57%)

Query:    10 LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
             + L A L LG+ +    +   +D + +   E W  ++G+ Y  N E + R  +++ N++ 
Sbjct:     4 IFLLATLCLGMIS----AAPTHDPSFDTVWEEWKTKHGKTYNTNEEGQKR-AVWENNMKM 58

Query:    70 IASFNNK-ARNKP-YKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN-A 126
             I   N    + K  + L +N F D TN EFR    G++    S+   ETT   FR     
Sbjct:    59 INLHNEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQ----SMGPKETT--IFREPFLG 112

Query:   127 SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
              +P S+DWR+ G VT VK+QGQCG CWAFSAV ++EG     T KL SLSEQ LVDC  S
Sbjct:   113 DIPKSLDWREHGYVTPVKNQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDCSWS 172

Query:   187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANP--SAAKISGYEDVP 244
               + GC GGLM+ AF+++  N+GL T   Y Y+A DG C     NP  SAA ++G+  VP
Sbjct:   173 YGNLGCNGGLMEFAFQYVKENRGLDTGESYAYEAQDGLCRY---NPKYSAANVTGFVKVP 229

Query:   245 SNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVF-TGQCG-TELDHGVTAVGYGTADD 301
              + E  LM AVA+  PVSV ID+    F+FYS G++    C  TE+DH V  VGYG   D
Sbjct:   230 LS-EDDLMSAVASVGPVSVGIDSHHQSFRFYSGGMYYEPDCSSTEMDHAVLVVGYGEESD 288

Query:   302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
             G KYWLVKNSWG  WG +GYI+M +D   +   CGIA  A YPT
Sbjct:   289 GGKYWLVKNSWGEDWGMDGYIKMAKD---QNNNCGIATYAIYPT 329


>WB|WBGene00000776
            symbol:cpl-1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0040010 "positive regulation
            of growth rate" evidence=IMP] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] [GO:0040011
            "locomotion" evidence=IMP] [GO:0070265 "necrotic cell death"
            evidence=IMP] [GO:0031983 "vesicle lumen" evidence=IDA] [GO:0042718
            "yolk granule" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0009792 GO:GO:0040010 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0040011
            GO:GO:0031983 GO:GO:0070265 GeneTree:ENSGT00660000095458 KO:K01365
            GO:GO:0042718 MEROPS:I29.009 EMBL:Z92812 GeneID:180111
            KEGG:cel:CELE_T03E6.7 CTD:180111 PIR:T24387 RefSeq:NP_001256718.1
            HSSP:P80067 ProteinModelPortal:O45734 SMR:O45734 DIP:DIP-26616N
            IntAct:O45734 MINT:MINT-211563 STRING:O45734 PaxDb:O45734
            EnsemblMetazoa:T03E6.7.1 EnsemblMetazoa:T03E6.7.2 UCSC:T03E6.7.1
            WormBase:T03E6.7a InParanoid:O45734 OMA:HIENHNR NextBio:908128
            Uniprot:O45734
        Length = 337

 Score = 630 (226.8 bits), Expect = 1.3e-61, P = 1.3e-61
 Identities = 145/346 (41%), Positives = 193/346 (55%)

Query:     8 NKLVLAAILVLGVWAPQS--WSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKE 65
             N+ +L A LV  V A  S   SR +  A   E+ + +   + + Y ++ E++   + F +
Sbjct:     2 NRFILLA-LVAAVVAVNSAKLSRQIESAI--EKWDDYKEDFDKEYSES-EEQTYMEAFVK 57

Query:    66 NVEYIASFNNKAR--NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY 123
             N+ +I + N   R   K +++G+N  AD    ++R   NGY+R     R   ++     +
Sbjct:    58 NMIHIENHNRDHRLGRKTFEMGLNHIADLPFSQYRK-LNGYRRLFGDSRIKNSSSFLAPF 116

Query:   124 ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
              N  VP  +DWR    VT VK+QG CG CWAFSA  A+EG +     +L SLSEQ LVDC
Sbjct:   117 -NVQVPDEVDWRDTHLVTDVKNQGMCGSCWAFSATGALEGQHARKLGQLVSLSEQNLVDC 175

Query:   184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSC--NKKEANPSAAKISGYE 241
              T   + GC GGLMD AFE+I  N G+ TE  YPYK  D  C  NKK      A   GY 
Sbjct:   176 STKYGNHGCNGGLMDQAFEYIRDNHGVDTEESYPYKGRDMKCHFNKKTVG---ADDKGYV 232

Query:   242 DVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTGQ-CGTE-LDHGVTAVGYGT 298
             D P  +E  L  AVA Q P+S+AIDA    FQ Y  GV+  + C +E LDHGV  VGYGT
Sbjct:   233 DTPEGDEEQLKIAVATQGPISIAIDAGHRSFQLYKKGVYYDEECSSEELDHGVLLVGYGT 292

Query:   299 ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
               +   YW+VKNSWG  WGE GYIR+ R+   +   CG+A +ASYP
Sbjct:   293 DPEHGDYWIVKNSWGAGWGEKGYIRIARN---RNNHCGVATKASYP 335


>RGD|1560071
            symbol:Ctsll3 "cathepsin L-like 3" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1560071 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:CH474032 IPI:IPI00560469 RefSeq:XP_001065834.2
            RefSeq:XP_573976.3 UniGene:Rn.104851 MEROPS:C01.107
            Ensembl:ENSRNOT00000061398 GeneID:498691 KEGG:rno:498691
            UCSC:RGD:1560071 CTD:70202 OMA:NCGIASD OrthoDB:EOG4HDSTZ
            NextBio:700548 Uniprot:D3ZJV2
        Length = 330

 Score = 625 (225.1 bits), Expect = 4.3e-61, P = 4.3e-61
 Identities = 144/342 (42%), Positives = 194/342 (56%)

Query:    10 LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
             + L A L LG+ +    +   +D + +   E W  ++G+ Y  N E + R  +++ N++ 
Sbjct:     4 IFLLATLCLGMIS----AAPTHDPSFDTVWEEWKTKHGKTYNTNEEGQKR-AVWENNMKM 58

Query:    70 IASFNNK-ARNKP-YKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS 127
             I   N    + K  + L +N F D TN EFR    G++ +    +  +     F      
Sbjct:    59 INLHNEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQGQ--KTKMMKVFPEPFL---GD 113

Query:   128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
             VP ++DWRK G VT VK+QG CG CWAFSAV ++EG     T KL  LSEQ LVDC  S 
Sbjct:   114 VPKTVDWRKHGYVTPVKNQGPCGSCWAFSAVGSLEGQVFRKTGKLVPLSEQNLVDCSWSH 173

Query:   188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANP--SAAKISGYEDVPS 245
              ++GC+GGL D AF+++  N GL T   YPY+A +G+C     NP  SAAK+ G+  +P 
Sbjct:   174 GNKGCDGGLPDFAFQYVKDNGGLDTSVSYPYEALNGTCRY---NPKYSAAKVVGFMSIPP 230

Query:   246 NNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVF-TGQCG-TELDHGVTAVGYGTADDG 302
             + E ALMKAVA   P+SV ID     FQFY  G++    C  T L+H V  VGYG   DG
Sbjct:   231 S-ENALMKAVATVGPISVGIDIKHKSFQFYKGGMYYEPDCSSTNLNHAVLVVGYGEESDG 289

Query:   303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
              KYWLVKNSWG  WG +GYI+M +D +     CGIA  ASYP
Sbjct:   290 RKYWLVKNSWGRDWGMDGYIKMAKDWNNN---CGIASDASYP 328


>UNIPROTKB|Q8HY81
            symbol:CTSS "Cathepsin S" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            CTD:1520 KO:K01368 OrthoDB:EOG4JM7Q2 EMBL:AY156692
            RefSeq:NP_001002938.2 UniGene:Cfa.1661 ProteinModelPortal:Q8HY81
            SMR:Q8HY81 STRING:Q8HY81 MEROPS:C01.034 GeneID:403400
            KEGG:cfa:403400 InParanoid:Q8HY81 NextBio:20816922 Uniprot:Q8HY81
        Length = 331

 Score = 624 (224.7 bits), Expect = 5.5e-61, P = 5.5e-61
 Identities = 138/319 (43%), Positives = 186/319 (58%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN--NKARNKPYKLGINEF 89
             D T++    +W   Y + Y++  E+  R  I+++N++++   N  +      Y LG+N  
Sbjct:    21 DPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHL 80

Query:    90 ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE-NASVPASIDWRKKGAVTGVKDQGQ 148
              D T EE  +       R+PS       +V++R   N  +P S+DWR+KG VT VK QG 
Sbjct:    81 GDMTGEEVISLMGSL--RVPS---QWQRNVTYRSNSNQKLPDSVDWREKGCVTEVKYQGS 135

Query:   149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE-DQGCEGGLMDDAFEFIISN 207
             CG CWAFSAV A+E    + T KL SLS Q LVDC T    ++GC GG M  AF++II N
Sbjct:   136 CGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDN 195

Query:   208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDA 266
              G+ +EA YPYKA +G C + ++   AA  S Y ++P  +E AL +AVAN+ PVSVAIDA
Sbjct:   196 NGIDSEASYPYKAMNGKC-RYDSKKRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDA 254

Query:   267 SGSDFQFYSSGVF-TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
             S   F  Y SGV+    C   ++HGV  VGYG  + G  YWLVKNSWG  +G+ GYIRM 
Sbjct:   255 SHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNLN-GKDYWLVKNSWGLNFGDQGYIRMA 313

Query:   326 RDIDAKEGLCGIAMQASYP 344
             R+       CGIA   SYP
Sbjct:   314 RN---SGNHCGIASYPSYP 329


>UNIPROTKB|F1PAK0
            symbol:CTSS "Cathepsin S" species:9615 "Canis lupus
            familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:AAEX03011051 Ensembl:ENSCAFT00000019176 OMA:YEPACTQ
            Uniprot:F1PAK0
        Length = 339

 Score = 623 (224.4 bits), Expect = 7.1e-61, P = 7.1e-61
 Identities = 138/319 (43%), Positives = 186/319 (58%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN--NKARNKPYKLGINEF 89
             D T++    +W   Y + Y++  E+  R  I+++N++++   N  +      Y LG+N  
Sbjct:    29 DPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHL 88

Query:    90 ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE-NASVPASIDWRKKGAVTGVKDQGQ 148
              D T EE  +       R+PS       +V++R   N  +P S+DWR+KG VT VK QG 
Sbjct:    89 GDMTGEEVISLMGSL--RVPS---QWQRNVTYRSNSNQKLPDSVDWREKGCVTEVKYQGS 143

Query:   149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE-DQGCEGGLMDDAFEFIISN 207
             CG CWAFSAV A+E    + T KL SLS Q LVDC T    ++GC GG M  AF++II N
Sbjct:   144 CGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDN 203

Query:   208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDA 266
              G+ +EA YPYKA +G C + ++   AA  S Y ++P  +E AL +AVAN+ PVSVAIDA
Sbjct:   204 NGIDSEASYPYKAVNGKC-RYDSKKRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDA 262

Query:   267 SGSDFQFYSSGVF-TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
             S   F  Y SGV+    C   ++HGV  VGYG  + G  YWLVKNSWG  +G+ GYIRM 
Sbjct:   263 SHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNLN-GKDYWLVKNSWGLNFGDQGYIRMA 321

Query:   326 RDIDAKEGLCGIAMQASYP 344
             R+       CGIA   SYP
Sbjct:   322 RN---SGNHCGIASYPSYP 337


>UNIPROTKB|O60911
            symbol:CTSL2 "Cathepsin L2" species:9606 "Homo sapiens"
            [GO:0004177 "aminopeptidase activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA] [GO:0005902
            "microvillus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0007283 "spermatogenesis" evidence=IEA] [GO:0009267 "cellular
            response to starvation" evidence=IEA] [GO:0009749 "response to
            glucose stimulus" evidence=IEA] [GO:0009897 "external side of
            plasma membrane" evidence=IEA] [GO:0010259 "multicellular
            organismal aging" evidence=IEA] [GO:0021675 "nerve development"
            evidence=IEA] [GO:0030984 "kininogen binding" evidence=IEA]
            [GO:0032403 "protein complex binding" evidence=IEA] [GO:0034698
            "response to gonadotropin stimulus" evidence=IEA] [GO:0042277
            "peptide binding" evidence=IEA] [GO:0043005 "neuron projection"
            evidence=IEA] [GO:0043204 "perikaryon" evidence=IEA] [GO:0046697
            "decidualization" evidence=IEA] [GO:0048102 "autophagic cell death"
            evidence=IEA] [GO:0051384 "response to glucocorticoid stimulus"
            evidence=IEA] [GO:0060008 "Sertoli cell differentiation"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=TAS] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_118779 Reactome:REACT_6900
            GO:GO:0009897 GO:GO:0019886 GO:GO:0034698 GO:GO:0043204
            GO:GO:0009749 GO:GO:0030141 GO:GO:0051384 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0045177 GO:GO:0043005 GO:GO:0007283
            GO:GO:0004177 GO:GO:0042277 GO:GO:0009267 GO:GO:0021675
            GO:GO:0043202 GO:GO:0005902 GO:GO:0010259 GO:GO:0004197
            GO:GO:0048102 GO:GO:0046697 HOVERGEN:HBG011513 CTD:1515
            OrthoDB:EOG48PMKF OMA:FDQNLDT GO:GO:0060008 EMBL:Y14734
            EMBL:AB001928 EMBL:AF070448 EMBL:AB019534 EMBL:AY358641
            EMBL:AL445670 EMBL:BC023504 EMBL:BC110512 IPI:IPI00000013
            RefSeq:NP_001188504.1 RefSeq:NP_001324.2 UniGene:Hs.610096 PDB:1FH0
            PDB:3H6S PDB:3KFQ PDBsum:1FH0 PDBsum:3H6S PDBsum:3KFQ
            ProteinModelPortal:O60911 SMR:O60911 IntAct:O60911 STRING:O60911
            MEROPS:I29.010 PhosphoSite:O60911 PaxDb:O60911 PeptideAtlas:O60911
            PRIDE:O60911 Ensembl:ENST00000259470 Ensembl:ENST00000538255
            GeneID:1515 KEGG:hsa:1515 UCSC:uc004awt.3 GeneCards:GC09M099794
            HGNC:HGNC:2538 HPA:CAB017112 MIM:603308 neXtProt:NX_O60911
            PharmGKB:PA27036 InParanoid:O60911 KO:K01375 PhylomeDB:O60911
            BRENDA:3.4.22.43 SABIO-RK:O60911 BindingDB:O60911 ChEMBL:CHEMBL3272
            ChiTaRS:CTSL2 EvolutionaryTrace:O60911 GenomeRNAi:1515 NextBio:6277
            Bgee:O60911 CleanEx:HS_CTSL2 Genevestigator:O60911
            GermOnline:ENSG00000136943 Uniprot:O60911
        Length = 334

 Score = 623 (224.4 bits), Expect = 7.1e-61, P = 7.1e-61
 Identities = 141/322 (43%), Positives = 186/322 (57%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNKP-YKLGINEF 89
             D  ++ +   W A + R+Y  N E+  R  ++++N++ I   N + ++ K  + + +N F
Sbjct:    22 DQNLDTKWYQWKATHRRLYGAN-EEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAF 80

Query:    90 ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN-ASVPASIDWRKKGAVTGVKDQGQ 148
              D TNEEFR     ++ +    R  +     FR      +P S+DWRKKG VT VK+Q Q
Sbjct:    81 GDMTNEEFRQMMGCFRNQ--KFRKGKV----FREPLFLDLPKSVDWRKKGYVTPVKNQKQ 134

Query:   149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
             CG CWAFSA  A+EG     T KL SLSEQ LVDC     +QGC GG M  AF+++  N 
Sbjct:   135 CGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENG 194

Query:   209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDAS 267
             GL +E  YPY A D  C  +  N S A  +G+  V    E ALMKAVA   P+SVA+DA 
Sbjct:   195 GLDSEESYPYVAVDEICKYRPEN-SVANDTGFTVVAPGKEKALMKAVATVGPISVAMDAG 253

Query:   268 GSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYG---TADDGTKYWLVKNSWGTTWGENGYI 322
              S FQFY SG+ F   C ++ LDHGV  VGYG      + +KYWLVKNSWG  WG NGY+
Sbjct:   254 HSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYV 313

Query:   323 RMQRDIDAKEGLCGIAMQASYP 344
             ++ +D   K   CGIA  ASYP
Sbjct:   314 KIAKD---KNNHCGIATAASYP 332


>UNIPROTKB|P25774 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0016020 "membrane"
            evidence=IEA] [GO:0005576 "extracellular region" evidence=NAS]
            [GO:0005764 "lysosome" evidence=IDA;NAS] [GO:0097067 "cellular
            response to thyroid hormone stimulus" evidence=IEP] [GO:0002250
            "adaptive immune response" evidence=IEP] [GO:0019882 "antigen
            processing and presentation" evidence=TAS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=TAS] [GO:0006955
            "immune response" evidence=TAS] [GO:0002474 "antigen processing and
            presentation of peptide antigen via MHC class I" evidence=TAS]
            [GO:0002480 "antigen processing and presentation of exogenous
            peptide antigen via MHC class I, TAP-independent" evidence=TAS]
            [GO:0019886 "antigen processing and presentation of exogenous
            peptide antigen via MHC class II" evidence=TAS] [GO:0036021
            "endolysosome lumen" evidence=TAS] [GO:0042590 "antigen processing
            and presentation of exogenous peptide antigen via MHC class I"
            evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
            [GO:0045087 "innate immune response" evidence=TAS] [GO:0043231
            "intracellular membrane-bounded organelle" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 Reactome:REACT_118779
            Reactome:REACT_6900 GO:GO:0005576 GO:GO:0002480 GO:GO:0016020
            GO:GO:0019886 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087 EMBL:CH471121
            GO:GO:0004197 GO:GO:0002250 HOVERGEN:HBG011513 GO:GO:0097067
            GO:GO:0036021 EMBL:AL356292 CTD:1520 KO:K01368 OMA:KAMDQKC
            OrthoDB:EOG4JM7Q2 EMBL:S93414 EMBL:M86553 EMBL:M90696 EMBL:U07374
            EMBL:U07370 EMBL:U07371 EMBL:U07372 EMBL:U07373 EMBL:CR541676
            EMBL:AK301472 EMBL:AK314482 EMBL:BC002642 IPI:IPI00299150
            IPI:IPI00910216 PIR:A42482 RefSeq:NP_001186668.1 RefSeq:NP_004070.3
            UniGene:Hs.181301 PDB:1BXF PDB:1GLO PDB:1MS6 PDB:1NPZ PDB:1NQC
            PDB:2C0Y PDB:2F1G PDB:2FQ9 PDB:2FRA PDB:2FRQ PDB:2FT2 PDB:2FUD
            PDB:2FYE PDB:2G6D PDB:2G7Y PDB:2H7J PDB:2HH5 PDB:2HHN PDB:2HXZ
            PDB:2OP3 PDB:2R9M PDB:2R9N PDB:2R9O PDB:3IEJ PDB:3KWN PDB:3MPE
            PDB:3MPF PDB:3N3G PDB:3N4C PDB:3OVX PDBsum:1BXF PDBsum:1GLO
            PDBsum:1MS6 PDBsum:1NPZ PDBsum:1NQC PDBsum:2C0Y PDBsum:2F1G
            PDBsum:2FQ9 PDBsum:2FRA PDBsum:2FRQ PDBsum:2FT2 PDBsum:2FUD
            PDBsum:2FYE PDBsum:2G6D PDBsum:2G7Y PDBsum:2H7J PDBsum:2HH5
            PDBsum:2HHN PDBsum:2HXZ PDBsum:2OP3 PDBsum:2R9M PDBsum:2R9N
            PDBsum:2R9O PDBsum:3IEJ PDBsum:3KWN PDBsum:3MPE PDBsum:3MPF
            PDBsum:3N3G PDBsum:3N4C PDBsum:3OVX ProteinModelPortal:P25774
            SMR:P25774 IntAct:P25774 STRING:P25774 MEROPS:I29.004
            PhosphoSite:P25774 DMDM:88984046 PaxDb:P25774 PeptideAtlas:P25774
            PRIDE:P25774 DNASU:1520 Ensembl:ENST00000368985
            Ensembl:ENST00000448301 GeneID:1520 KEGG:hsa:1520 UCSC:uc001evn.3
            GeneCards:GC01M150702 HGNC:HGNC:2545 HPA:CAB000460 HPA:HPA002988
            MIM:116845 neXtProt:NX_P25774 PharmGKB:PA27041 InParanoid:P25774
            PhylomeDB:P25774 BRENDA:3.4.22.27 BindingDB:P25774
            ChEMBL:CHEMBL2954 ChiTaRS:CTSS EvolutionaryTrace:P25774
            GenomeRNAi:1520 NextBio:6291 PMAP-CutDB:P25774 ArrayExpress:P25774
            Bgee:P25774 CleanEx:HS_CTSS Genevestigator:P25774
            GermOnline:ENSG00000163131 Uniprot:P25774
        Length = 331

 Score = 621 (223.7 bits), Expect = 1.2e-60, P = 1.2e-60
 Identities = 137/318 (43%), Positives = 185/318 (58%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN--NKARNKPYKLGINEF 89
             D T++    +W   YG+ Y++  E+ +R  I+++N++++   N  +      Y LG+N  
Sbjct:    21 DPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHL 80

Query:    90 ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
              D T+EE  +  +    R+PS      T  S    N  +P S+DWR+KG VT VK QG C
Sbjct:    81 GDMTSEEVMSLMSSL--RVPSQWQRNITYKS--NPNRILPDSVDWREKGCVTEVKYQGSC 136

Query:   150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE-DQGCEGGLMDDAFEFIISNK 208
             G CWAFSAV A+E    + T KL SLS Q LVDC T    ++GC GG M  AF++II NK
Sbjct:   137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196

Query:   209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDAS 267
             G+ ++A YPYKA D  C + ++   AA  S Y ++P   E  L +AVAN+ PVSV +DA 
Sbjct:   197 GIDSDASYPYKAMDQKC-QYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDAR 255

Query:   268 GSDFQFYSSGVF-TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
                F  Y SGV+    C   ++HGV  VGYG  + G +YWLVKNSWG  +GE GYIRM R
Sbjct:   256 HPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLN-GKEYWLVKNSWGHNFGEEGYIRMAR 314

Query:   327 DIDAKEGLCGIAMQASYP 344
             +   K   CGIA   SYP
Sbjct:   315 N---KGNHCGIASFPSYP 329


>UNIPROTKB|P43235 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0001957
            "intramembranous ossification" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0045453 "bone resorption"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=TAS] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=TAS] [GO:0036021 "endolysosome lumen" evidence=TAS]
            [GO:0045087 "innate immune response" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            Reactome:REACT_118779 Reactome:REACT_6900 GO:GO:0005615
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087 GO:GO:0045453
            EMBL:CH471121 EMBL:AL355860 GO:GO:0004197 GO:GO:0001957
            HOVERGEN:HBG011513 GO:GO:0036021 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:U13665 EMBL:X82153
            EMBL:U20280 EMBL:S79895 EMBL:CR541675 EMBL:AL356292 EMBL:BC016058
            IPI:IPI00300599 PIR:JC2476 RefSeq:NP_000387.1 UniGene:Hs.632466
            PDB:1ATK PDB:1AU0 PDB:1AU2 PDB:1AU3 PDB:1AU4 PDB:1AYU PDB:1AYV
            PDB:1AYW PDB:1BGO PDB:1BY8 PDB:1MEM PDB:1NL6 PDB:1NLJ PDB:1Q6K
            PDB:1SNK PDB:1TU6 PDB:1U9V PDB:1U9W PDB:1U9X PDB:1VSN PDB:1YK7
            PDB:1YK8 PDB:1YT7 PDB:2ATO PDB:2AUX PDB:2AUZ PDB:2BDL PDB:2R6N
            PDB:3C9E PDB:3H7D PDB:3KW9 PDB:3KWB PDB:3KWZ PDB:3KX1 PDB:3O0U
            PDB:3O1G PDB:3OVZ PDB:4DMX PDB:4DMY PDB:7PCK PDBsum:1ATK
            PDBsum:1AU0 PDBsum:1AU2 PDBsum:1AU3 PDBsum:1AU4 PDBsum:1AYU
            PDBsum:1AYV PDBsum:1AYW PDBsum:1BGO PDBsum:1BY8 PDBsum:1MEM
            PDBsum:1NL6 PDBsum:1NLJ PDBsum:1Q6K PDBsum:1SNK PDBsum:1TU6
            PDBsum:1U9V PDBsum:1U9W PDBsum:1U9X PDBsum:1VSN PDBsum:1YK7
            PDBsum:1YK8 PDBsum:1YT7 PDBsum:2ATO PDBsum:2AUX PDBsum:2AUZ
            PDBsum:2BDL PDBsum:2R6N PDBsum:3C9E PDBsum:3H7D PDBsum:3KW9
            PDBsum:3KWB PDBsum:3KWZ PDBsum:3KX1 PDBsum:3O0U PDBsum:3O1G
            PDBsum:3OVZ PDBsum:4DMX PDBsum:4DMY PDBsum:7PCK
            ProteinModelPortal:P43235 SMR:P43235 DIP:DIP-39993N IntAct:P43235
            STRING:P43235 PhosphoSite:P43235 DMDM:1168793 PaxDb:P43235
            PRIDE:P43235 DNASU:1513 Ensembl:ENST00000271651 GeneID:1513
            KEGG:hsa:1513 UCSC:uc001evp.2 GeneCards:GC01M150768 HGNC:HGNC:2536
            MIM:265800 MIM:601105 neXtProt:NX_P43235 Orphanet:763
            PharmGKB:PA27034 InParanoid:P43235 OMA:LKVPPSH PhylomeDB:P43235
            BindingDB:P43235 ChEMBL:CHEMBL268 EvolutionaryTrace:P43235
            GenomeRNAi:1513 NextBio:6267 ArrayExpress:P43235 Bgee:P43235
            CleanEx:HS_CTSK CleanEx:HS_CTSO Genevestigator:P43235
            GermOnline:ENSG00000143387 Uniprot:P43235
        Length = 329

 Score = 620 (223.3 bits), Expect = 1.5e-60, P = 1.5e-60
 Identities = 136/310 (43%), Positives = 187/310 (60%)

Query:    40 EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEF 97
             E+W   + + Y +  ++  R  I+++N++YI+  N +A      Y+L +N   D T+EE 
Sbjct:    27 ELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEV 86

Query:    98 RAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
                  G K  L   RS++T  +   +E  + P S+D+RKKG VT VK+QGQCG CWAFS+
Sbjct:    87 VQKMTGLKVPLSHSRSNDTLYIP-EWEGRA-PDSVDYRKKGYVTPVKNQGQCGSCWAFSS 144

Query:   158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
             V A+EG     T KL +LS Q LVDC +  E+ GC GG M +AF+++  N+G+ +E  YP
Sbjct:   145 VGALEGQLKKKTGKLLNLSPQNLVDCVS--ENDGCGGGYMTNAFQYVQKNRGIDSEDAYP 202

Query:   218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSS 276
             Y   + SC        AAK  GY ++P  NE AL +AVA   PVSVAIDAS + FQFYS 
Sbjct:   203 YVGQEESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSK 261

Query:   277 GVFTGQ-CGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
             GV+  + C ++ L+H V AVGYG    G K+W++KNSWG  WG  GYI M R+   K   
Sbjct:   262 GVYYDESCNSDNLNHAVLAVGYGI-QKGNKHWIIKNSWGENWGNKGYILMARN---KNNA 317

Query:   335 CGIAMQASYP 344
             CGIA  AS+P
Sbjct:   318 CGIANLASFP 327


>ZFIN|ZDB-GENE-050626-55 [details] [associations]
            symbol:ctssb.2 "cathepsin S, b.2" species:7955
            "Danio rerio" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-050626-55
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            KO:K01368 EMBL:BC093339 IPI:IPI00507098 RefSeq:NP_001017661.1
            UniGene:Dr.132688 ProteinModelPortal:Q566T8 SMR:Q566T8
            GeneID:337572 KEGG:dre:337572 CTD:337572 InParanoid:Q566T8
            NextBio:20812306 ArrayExpress:Q566T8 Uniprot:Q566T8
        Length = 330

 Score = 615 (221.5 bits), Expect = 5.0e-60, P = 5.0e-60
 Identities = 135/314 (42%), Positives = 186/314 (59%)

Query:    35 MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQ 92
             +++  E+W  ++ ++Y    E+  R ++++ N+E IA  N +A      Y L IN  AD 
Sbjct:    23 LDQHWELWKKKHVKLYSCEDEEVGRRELWERNLELIAIHNLEASMGMHSYDLAINHMADM 82

Query:    93 TNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCC 152
             T EE         R  P  +      VS  +  A VP ++DWR KG VT VK+QG CG C
Sbjct:    83 TTEEILQTL-AVTRVPPGFKRPTAEYVSSSF--AVVPDTLDWRDKGYVTSVKNQGACGSC 139

Query:   153 WAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLAT 212
             WAFS+V A+EG    TT KL  LS Q LVDC +   + GC GG M  AF+++I N G+ +
Sbjct:   140 WAFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGYMSQAFQYVIDNGGIDS 199

Query:   213 EAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDF 271
             E+ YPY+ + GSC + + +  AA  + Y+ V   +E AL +A+AN  PVSVAIDA+   F
Sbjct:   200 ESSYPYQGTQGSC-RYDPSQRAANCTSYKFVSQGDEQALKEALANIGPVSVAIDATRPQF 258

Query:   272 QFYSSGVFTG-QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDA 330
              FY SGV+    C  +++HGV AVGYGT   G  YWLVKNSWG  +G+ GYIR+ R+   
Sbjct:   259 IFYRSGVYDDPSCTQKVNHGVLAVGYGTLS-GQDYWLVKNSWGAGFGDGGYIRIARN--- 314

Query:   331 KEGLCGIAMQASYP 344
             K  +CGIA +A YP
Sbjct:   315 KNNMCGIASEACYP 328


>UNIPROTKB|Q9GLE3 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9823 "Sus scrofa"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0005576 "extracellular
            region" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005576 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 MEROPS:I29.007
            CTD:1513 KO:K01371 OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55
            OMA:LKVPPSH EMBL:AF292030 RefSeq:NP_999467.1 UniGene:Ssc.1020
            ProteinModelPortal:Q9GLE3 SMR:Q9GLE3 STRING:Q9GLE3
            Ensembl:ENSSSCT00000007283 GeneID:397569 KEGG:ssc:397569
            ArrayExpress:Q9GLE3 Uniprot:Q9GLE3
        Length = 330

 Score = 613 (220.8 bits), Expect = 8.1e-60, P = 8.1e-60
 Identities = 141/338 (41%), Positives = 196/338 (57%)

Query:    12 LAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA 71
             L  +L+L V +   +   + D     + E+W   Y + Y    ++  R  I+++N+++I+
Sbjct:     4 LKVVLLLPVMSSALYPEEILDT----QWELWKKTYRKQYNSKVDEISRRLIWEKNLKHIS 59

Query:    72 SFNNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVP 129
               N +A      Y+L +N   D T+EE      G K      RS++T  +   +E  + P
Sbjct:    60 IHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPPSHSRSNDTLYIP-DWEGRT-P 117

Query:   130 ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
              SID+RKKG VT VK+QGQCG CWAFS+V A+EG     T KL +LS Q LVDC +  E+
Sbjct:   118 DSIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--EN 175

Query:   190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
              GC GG M +AF+++  N+G+ +E  YPY   D +C        AAK  GY ++P  NE 
Sbjct:   176 DGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDENCMYNPTG-KAAKCRGYREIPEGNEK 234

Query:   250 ALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTGQ-CGTE-LDHGVTAVGYGTADDGTKYW 306
             AL +AVA   PVSVAIDAS + FQFYS GV+  + C ++ L+H V AVGYG    G K+W
Sbjct:   235 ALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGI-QKGKKHW 293

Query:   307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             ++KNSWG  WG  GYI M R+   K   CGIA  AS+P
Sbjct:   294 IIKNSWGENWGNKGYILMARN---KNNACGIANLASFP 328


>UNIPROTKB|G1K2A7 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9615 "Canis lupus
            familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 PANTHER:PTHR12411:SF55 OMA:LKVPPSH
            EMBL:AAEX03011051 Ensembl:ENSCAFT00000019202 Uniprot:G1K2A7
        Length = 333

 Score = 612 (220.5 bits), Expect = 1.0e-59, P = 1.0e-59
 Identities = 134/315 (42%), Positives = 189/315 (60%)

Query:    35 MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQ 92
             ++ + ++W   Y + Y    ++  R  I+++N+++I+  N +A      Y+L +N   D 
Sbjct:    26 LDTQWDLWKKTYRKQYNSKVDELSRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDM 85

Query:    93 TNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCC 152
             T+EE      G K      RS++T  +   +E+ + P S+D+RKKG VT VK+QGQCG C
Sbjct:    86 TSEEVVQKMTGLKVPPSHSRSNDTLYIP-DWESRA-PDSVDYRKKGYVTPVKNQGQCGSC 143

Query:   153 WAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLAT 212
             WAFS+V A+EG     T KL +LS Q LVDC +  E+ GC GG M +AF+++  N+G+ +
Sbjct:   144 WAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--ENDGCGGGYMTNAFQYVQKNRGIDS 201

Query:   213 EAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDF 271
             E  YPY   D SC        AAK  GY ++P  NE AL +AVA   P+SVAIDAS + F
Sbjct:   202 EDAYPYVGQDESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSF 260

Query:   272 QFYSSGVFTGQ-CGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
             QFYS GV+  + C ++ L+H V AVGYG    G K+W++KNSWG  WG  GYI M R+  
Sbjct:   261 QFYSKGVYYDENCNSDNLNHAVLAVGYGI-QKGNKHWIIKNSWGENWGNKGYILMARN-- 317

Query:   330 AKEGLCGIAMQASYP 344
              K   CGIA  AS+P
Sbjct:   318 -KNNACGIANLASFP 331


>UNIPROTKB|Q3ZKN1 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:AY738221
            RefSeq:NP_001029168.1 UniGene:Cfa.588 HSSP:P43235
            ProteinModelPortal:Q3ZKN1 SMR:Q3ZKN1 STRING:Q3ZKN1 GeneID:608843
            KEGG:cfa:608843 InParanoid:Q3ZKN1 NextBio:20894470 Uniprot:Q3ZKN1
        Length = 330

 Score = 612 (220.5 bits), Expect = 1.0e-59, P = 1.0e-59
 Identities = 134/315 (42%), Positives = 189/315 (60%)

Query:    35 MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQ 92
             ++ + ++W   Y + Y    ++  R  I+++N+++I+  N +A      Y+L +N   D 
Sbjct:    23 LDTQWDLWKKTYRKQYNSKVDELSRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDM 82

Query:    93 TNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCC 152
             T+EE      G K      RS++T  +   +E+ + P S+D+RKKG VT VK+QGQCG C
Sbjct:    83 TSEEVVQKMTGLKVPPSHSRSNDTLYIP-DWESRA-PDSVDYRKKGYVTPVKNQGQCGSC 140

Query:   153 WAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLAT 212
             WAFS+V A+EG     T KL +LS Q LVDC +  E+ GC GG M +AF+++  N+G+ +
Sbjct:   141 WAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--ENDGCGGGYMTNAFQYVQKNRGIDS 198

Query:   213 EAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDF 271
             E  YPY   D SC        AAK  GY ++P  NE AL +AVA   P+SVAIDAS + F
Sbjct:   199 EDAYPYVGQDESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSF 257

Query:   272 QFYSSGVFTGQ-CGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
             QFYS GV+  + C ++ L+H V AVGYG    G K+W++KNSWG  WG  GYI M R+  
Sbjct:   258 QFYSKGVYYDENCNSDNLNHAVLAVGYGI-QKGNKHWIIKNSWGENWGNKGYILMARN-- 314

Query:   330 AKEGLCGIAMQASYP 344
              K   CGIA  AS+P
Sbjct:   315 -KNNACGIANLASFP 328


>UNIPROTKB|F1NEC8 [details] [associations]
            symbol:CTSL2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:AADN02067812 IPI:IPI00820956 Ensembl:ENSGALT00000037988
            ArrayExpress:F1NEC8 Uniprot:F1NEC8
        Length = 218

 Score = 610 (219.8 bits), Expect = 1.7e-59, P = 1.7e-59
 Identities = 121/219 (55%), Positives = 148/219 (67%)

Query:   129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
             P S+DWR+KG VT VKDQGQCG CWAFS   A+EG +   T KL SLSEQ LVDC     
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEG 61

Query:   189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
             +QGC GGLMD AF+++  N G+ +E  YPY A D    + +A  +AA  +G+ D+P  +E
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHE 121

Query:   249 AALMKAVANQ-PVSVAIDASGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYGTADDGTKY 305
              ALMKAVA+  PVSVAIDA  S FQFY SG++    C +E LDHGV  VGYG  +DG KY
Sbjct:   122 RALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGF-EDGKKY 180

Query:   306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             W+VKNSWG  WG+ GYI M +D   ++  CGIA  ASYP
Sbjct:   181 WIVKNSWGEKWGDKGYIYMAKD---RKNHCGIATAASYP 216


>DICTYBASE|DDB_G0272298 [details] [associations]
            symbol:DDB_G0272298 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0272298 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246
            SMART:SM00848 EMBL:AAFI02000008 KO:K01365 RefSeq:XP_645281.1
            ProteinModelPortal:Q559Q3 MEROPS:C01.A53 EnsemblProtists:DDB0203746
            GeneID:8618447 KEGG:ddi:DDB_G0272298 InParanoid:Q559Q3 OMA:PANINWR
            Uniprot:Q559Q3
        Length = 305

 Score = 609 (219.4 bits), Expect = 2.2e-59, P = 2.2e-59
 Identities = 127/309 (41%), Positives = 191/309 (61%)

Query:    43 MAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRN 102
             M +Y + Y++N E   RF IF++N  +I +  NK   +  ++ +NE++D T +EF A + 
Sbjct:     1 MVKYNKHYKNNKEYLKRFDIFQDNYNFILNHRNK-NGENIEMDLNEYSDLTQKEF-ADKF 58

Query:   103 GYKRRLPSVRSSETTDVS---FRYE-NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAV 158
              +++ +P  RS    D+    F++  NA++P S DWR  GAV  VK+QG C  CW+FSA+
Sbjct:    59 -FEKLVPEPRSGPINDIKATPFKHNVNATIPKSFDWRDHGAVGKVKNQGSCASCWSFSAL 117

Query:   159 AAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPY 218
              A+EG  +I   +L  LSEQ LVDC T    +GC+ G M DAF++IIS+ G+  E++YPY
Sbjct:   118 GALEGHYYIKYGELLDLSEQNLVDCATPFGPKGCKTGWMHDAFKYIISSGGVNLESQYPY 177

Query:   219 KASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSG 277
                D  C K   +   AK+SG+  +P  +E+ALM+A+A   PV+V ID S  +FQ  S G
Sbjct:   178 TGKDEVC-KFNQSEKEAKVSGFVMIPKFDESALMEAIALYGPVAVPIDTSTKEFQHLSGG 236

Query:   278 VF-TGQCGT-ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
             ++ +  C      H V A+GYGT ++G  Y+L+KNSWG +WG NG+ +++R +   +G C
Sbjct:   237 IYYSDSCDPWNTIHAVLAIGYGTDENGVDYFLMKNSWGKSWGTNGFFKVKRGV---KGKC 293

Query:   336 GIAMQASYP 344
             GI   ASYP
Sbjct:   294 GIVTAASYP 302


>RGD|61810 [details] [associations]
            symbol:Ctsk "cathepsin K" species:10116 "Rattus norvegicus"
            [GO:0001957 "intramembranous ossification" evidence=IEP] [GO:0005615
            "extracellular space" evidence=IDA] [GO:0005737 "cytoplasm"
            evidence=IDA] [GO:0005764 "lysosome" evidence=IDA] [GO:0006508
            "proteolysis" evidence=TAS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=TAS] [GO:0045453 "bone resorption" evidence=IMP]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            RGD:61810 GO:GO:0005615 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0045453 GO:GO:0001957 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:AF010306 EMBL:BC078793
            IPI:IPI00206378 RefSeq:NP_113748.1 UniGene:Rn.5598
            ProteinModelPortal:O35186 SMR:O35186 STRING:O35186
            PhosphoSite:O35186 PRIDE:O35186 Ensembl:ENSRNOT00000028730
            GeneID:29175 KEGG:rno:29175 UCSC:RGD:61810 InParanoid:O35186
            OMA:YKEIPEG BindingDB:O35186 ChEMBL:CHEMBL3034 NextBio:608248
            Genevestigator:O35186 GermOnline:ENSRNOG00000021155 Uniprot:O35186
        Length = 329

 Score = 609 (219.4 bits), Expect = 2.2e-59, P = 2.2e-59
 Identities = 136/319 (42%), Positives = 187/319 (58%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKAR--NKPYKLGINEF 89
             + T++ + E+W   +G+ Y    ++  R  I+++N++ I+  N +A      Y+L +N  
Sbjct:    19 EETLDTQWELWKKTHGKQYNSKVDEISRRLIWEKNLKKISVHNLEASLGAHTYELAMNHL 78

Query:    90 ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE-NASVPASIDWRKKGAVTGVKDQGQ 148
              D T+EE      G   R+P  RS  + D  +  E    VP SID+RKKG VT VK+QGQ
Sbjct:    79 GDMTSEEVVQKMTGL--RVPPSRSF-SNDTLYTPEWEGRVPDSIDYRKKGYVTPVKNQGQ 135

Query:   149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
             CG CWAFS+  A+EG     T KL +LS Q LVDC +  E+ GC GG M  AF+++  N 
Sbjct:   136 CGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVS--ENYGCGGGYMTTAFQYVQQNG 193

Query:   209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDAS 267
             G+ +E  YPY   D SC    A   AAK  GY ++P  NE AL +AVA   PVSV+IDAS
Sbjct:   194 GIDSEDAYPYVGQDESC-MYNATAKAAKCRGYREIPVGNEKALKRAVARVGPVSVSIDAS 252

Query:   268 GSDFQFYSSGVFTGQ-CGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
              + FQFYS GV+  + C  + ++H V  VGYGT   G KYW++KNSWG +WG  GY+ + 
Sbjct:   253 LTSFQFYSRGVYYDENCDRDNVNHAVLVVGYGT-QKGNKYWIIKNSWGESWGNKGYVLLA 311

Query:   326 RDIDAKEGLCGIAMQASYP 344
             R+   K   CGI   AS+P
Sbjct:   312 RN---KNNACGITNLASFP 327


>UNIPROTKB|Q5E968 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:BT021052
            EMBL:BC109853 IPI:IPI00709374 RefSeq:NP_001029607.1
            UniGene:Bt.23218 ProteinModelPortal:Q5E968 SMR:Q5E968 STRING:Q5E968
            MEROPS:I29.007 PRIDE:Q5E968 Ensembl:ENSBTAT00000028016
            GeneID:513038 KEGG:bta:513038 CTD:1513 InParanoid:Q5E968 KO:K01371
            OrthoDB:EOG4SJ5FC NextBio:20870669 PANTHER:PTHR12411:SF55
            Uniprot:Q5E968
        Length = 329

 Score = 608 (219.1 bits), Expect = 2.7e-59, P = 2.7e-59
 Identities = 133/315 (42%), Positives = 187/315 (59%)

Query:    35 MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQ 92
             ++ + E+W   Y + Y    ++  R  I+++N+++I+  N +A      Y+L +N   D 
Sbjct:    22 LDTQWELWKKTYRKQYNSKGDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDM 81

Query:    93 TNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCC 152
             T+EE      G K      RS++T  +   +E  + P S+D+RKKG VT VK+QGQCG C
Sbjct:    82 TSEEVVQKMTGLKVPASRSRSNDTLYIP-DWEGRA-PDSVDYRKKGYVTPVKNQGQCGSC 139

Query:   153 WAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLAT 212
             WAFS+V A+EG     T KL +LS Q LVDC +  E+ GC GG M +AF+++  N+G+ +
Sbjct:   140 WAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS--ENDGCGGGYMTNAFQYVQKNRGIDS 197

Query:   213 EAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDF 271
             E  YPY   D +C        AAK  GY ++P  NE AL +AVA   P+SVAIDAS + F
Sbjct:   198 EDAYPYVGQDENCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSF 256

Query:   272 QFYSSGVFTGQ-CGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
             QFY  GV+  + C ++ L+H V AVGYG    G K+W++KNSWG  WG  GYI M R+  
Sbjct:   257 QFYRKGVYYDENCNSDNLNHAVLAVGYGI-QKGNKHWIIKNSWGENWGNKGYILMARN-- 313

Query:   330 AKEGLCGIAMQASYP 344
              K   CGIA  AS+P
Sbjct:   314 -KNNACGIANLASFP 327


>UNIPROTKB|F1PMM9
            symbol:CTSL1 "Cathepsin L1" species:9615 "Canis lupus
            familiaris" [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00639 GO:GO:0004197 GO:GO:0002250
            GeneTree:ENSGT00560000076577 GO:GO:0097067 EMBL:AAEX03000499
            Ensembl:ENSCAFT00000002029 OMA:EFKQVLN Uniprot:F1PMM9
        Length = 341

 Score = 606 (218.4 bits), Expect = 4.5e-59, P = 4.5e-59
 Identities = 143/345 (41%), Positives = 194/345 (56%)

Query:    10 LVLAAILVLGVW--APQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
             L LAA L LG+   APQ       D +++     W   +G++Y D  E+  R  +++ N+
Sbjct:    13 LFLAA-LCLGIASAAPQQ------DHSLDAHWSQWKEAHGKLY-DKDEEGWRRTVWERNM 64

Query:    68 EYIASFNNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN 125
             E I   N +       + L +N F D TNEEF+   N +K     ++  +   V      
Sbjct:    65 EMIEQHNQEYSQGEHSFTLAMNAFGDMTNEEFKQVLNDFK-----IQKHKKGKVFPAPLF 119

Query:   126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
             A VP+S+DWR++G VT VKDQGQC  CWAFSA  A+EG     T KL SLSEQ LVDC  
Sbjct:   120 AEVPSSVDWREQGYVTPVKDQGQCLGCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSW 179

Query:   186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
             S  ++GC GGLM+ AF+++  N GL +E  YPY A +  C K     SAA ++ +  +  
Sbjct:   180 SQGNRGCNGGLMEYAFQYVKDNGGLDSEESYPYLARNEPC-KYRPEKSAANVTAFWPI-L 237

Query:   246 NNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTG-QCGTEL-DHGVTAVGYG---TA 299
             N E  LM  VA   PVS A+D+S   FQFY  G++   +C  +L +HGV  VGYG     
Sbjct:   238 NEEDGLMTTVATVGPVSAAVDSSPQSFQFYKKGIYYDPKCSNKLLNHGVLVVGYGFEGAE 297

Query:   300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
              D  KYW+VKNSWGT WG  GY+ + +D   ++  CGIA +ASYP
Sbjct:   298 SDNKKYWIVKNSWGTNWGMQGYMLLAKD---RDNHCGIATRASYP 339


>UNIPROTKB|P09648
            symbol:CTSL1 "Cathepsin L1" species:9031 "Gallus gallus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            OrthoDB:EOG48PMKF MEROPS:C01.032 IPI:IPI00602255 PIR:S00081
            UniGene:Gga.523 ProteinModelPortal:P09648 SMR:P09648 Uniprot:P09648
        Length = 218

 Score = 603 (217.3 bits), Expect = 9.3e-59, P = 9.3e-59
 Identities = 120/219 (54%), Positives = 147/219 (67%)

Query:   129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
             P S+DWR+KG VT VKDQGQCG CWAFS   A+EG +  T  KL SLSEQ LVDC     
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRPEG 61

Query:   189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
             +QGC GGLMD AF+++  N G+ +E  YPY A D    + +A  +AA  +G+ D+P  +E
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHE 121

Query:   249 AALMKAVANQ-PVSVAIDASGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYGTADDGTKY 305
              ALMKAVA+  PVSVAIDA  S FQFY SG++    C +E LDHGV  VGYG  + G KY
Sbjct:   122 RALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGF-EGGKKY 180

Query:   306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             W+VKNSWG  WG+ GYI M +D   ++  CGIA  ASYP
Sbjct:   181 WIVKNSWGEKWGDKGYIYMAKD---RKNHCGIATAASYP 216


>UNIPROTKB|P83654
            symbol:P83654 "Ervatamin-C" species:52861 "Tabernaemontana
            divaricata" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0006508 "proteolysis" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005576 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197 PDB:1O0E PDB:2PNS
            PDBsum:1O0E PDBsum:2PNS MEROPS:C01.116 EvolutionaryTrace:P83654
            Uniprot:P83654
        Length = 208

 Score = 603 (217.3 bits), Expect = 9.3e-59, P = 9.3e-59
 Identities = 123/218 (56%), Positives = 145/218 (66%)

Query:   128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
             +P  IDWRKKGAVT VK+QG CG CWAFS V+ +E IN I T  L SLSEQELVDCD   
Sbjct:     1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDK-- 58

Query:   188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
             ++ GC GG    A+++II+N G+ T+A YPYKA  G C   +A      I GY  VP  N
Sbjct:    59 KNHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPC---QAASKVVSIDGYNGVPFCN 115

Query:   248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWL 307
             E AL +AVA QP +VAIDAS + FQ YSSG+F+G CGT+L+HGVT VGY        YW+
Sbjct:   116 EXALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQA-----NYWI 170

Query:   308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
             V+NSWG  WGE GYIRM R      GLCGIA    YPT
Sbjct:   171 VRNSWGRYWGEKGYIRMLRVGGC--GLCGIARLPYYPT 206


>UNIPROTKB|Q10991
            symbol:CTSL "Cathepsin L1" species:9940 "Ovis aries"
            [GO:0005515 "protein binding" evidence=IPI] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            MEROPS:C01.032 ProteinModelPortal:Q10991 SMR:Q10991 Uniprot:Q10991
        Length = 217

 Score = 603 (217.3 bits), Expect = 9.3e-59, P = 9.3e-59
 Identities = 123/221 (55%), Positives = 147/221 (66%)

Query:   128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
             VP S+DW KKG VT VK+QGQCG CWAFSA  A+EG     T KL SLSEQ LVD     
Sbjct:     1 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQ 60

Query:   188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
              +QGC GGLMD+AF++I  N GL +E  YPY+A+D SCN K    SAAK +G+ D+P   
Sbjct:    61 GNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSCNYKPEY-SAAKDTGFVDIPQR- 118

Query:   248 EAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTG-QCGT-ELDHGVTAVGYGTADDGTK 304
             E ALMKAVA   P+SVAIDA  S FQFY SG++    C + +LDHGV  VGYG      K
Sbjct:   119 EKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTNNK 178

Query:   305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
             +W+VKNSWG  WG  GY++M +D   +   CGIA  ASYPT
Sbjct:   179 FWIVKNSWGPEWGNKGYVKMAKD---QNNHCGIATAASYPT 216


>UNIPROTKB|P83443
            symbol:P83443 "Macrodontain-1" species:203992 "Pseudananas
            sagenarius" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0006508 "proteolysis" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197
            ProteinModelPortal:P83443 SMR:P83443 MEROPS:C01.028 Uniprot:P83443
        Length = 213

 Score = 602 (217.0 bits), Expect = 1.2e-58, P = 1.2e-58
 Identities = 110/219 (50%), Positives = 153/219 (69%)

Query:   127 SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
             +VP SIDWR  GAV  VK+QG CG CWAF+A+A +EGI  I    L  LSEQE++DC  S
Sbjct:     1 AVPQSIDWRDYGAVNEVKNQGPCGGCWAFAAIATVEGIYKIRKGNLVYLSEQEVLDCAVS 60

Query:   187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
                 GC+GG ++ A++FIISN G+ T+  YPY+A  G+CN     P++A I+GY  V  N
Sbjct:    61 ---YGCKGGWVNRAYDFIISNNGVTTDENYPYRAYQGTCNANYF-PNSAYITGYSYVRRN 116

Query:   247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
             +E+ +M AV+NQP++  IDASG +FQ+Y  GV++G CG  L+H +T +GYG  D    YW
Sbjct:   117 DESHMMYAVSNQPIAALIDASGDNFQYYKGGVYSGPCGFSLNHAITIIGYGR-DS---YW 172

Query:   307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
             +V+NSWG++WG+ GY+R++RD+    G+CGIAM   +PT
Sbjct:   173 IVRNSWGSSWGQGGYVRIRRDVSHSGGVCGIAMSPLFPT 211


>UNIPROTKB|Q24940
            symbol:Cat-1 "Cathepsin L-like proteinase" species:6192
            "Fasciola hepatica" [GO:0004175 "endopeptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005576 "extracellular region" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005576 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0004197 EMBL:L33771 PIR:S43991 PDB:2O6X
            PDBsum:2O6X ProteinModelPortal:Q24940 SMR:Q24940 MEROPS:C01.033
            EvolutionaryTrace:Q24940 Uniprot:Q24940
        Length = 326

 Score = 601 (216.6 bits), Expect = 1.5e-58, P = 1.5e-58
 Identities = 132/311 (42%), Positives = 176/311 (56%)

Query:    39 HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN--NKARNKPYKLGINEFADQTNEE 96
             H+ W   Y + Y + A+ + R  I+++NV++I   N  +      Y LG+N+F D T EE
Sbjct:    22 HQ-WKRMYNKEY-NGADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEE 79

Query:    97 FRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
             F+A    Y   +       +  V +   N +VP  IDWR+ G VT VKDQG CG CWAFS
Sbjct:    80 FKAK---YLTEMSRASDILSHGVPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFS 136

Query:   157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
                 MEG      R   S SEQ+LVDC     + GC GGLM++A++++    GL TE+ Y
Sbjct:   137 TTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYL-KQFGLETESSY 195

Query:   217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAV-ANQPVSVAIDASGSDFQFYS 275
             PY A +G C   +     AK++GY  V S +E  L   V A +P +VA+D   SDF  Y 
Sbjct:   196 PYTAVEGQCRYNK-QLGVAKVTGYYTVHSGSEVELKNLVGARRPAAVAVDVE-SDFMMYR 253

Query:   276 SGVFTGQ-CGT-ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
             SG++  Q C    ++H V AVGYGT   GT YW+VKNSWGT WGE GYIRM R+   +  
Sbjct:   254 SGIYQSQTCSPLRVNHAVLAVGYGT-QGGTDYWIVKNSWGTYWGERGYIRMARN---RGN 309

Query:   334 LCGIAMQASYP 344
             +CGIA  AS P
Sbjct:   310 MCGIASLASLP 320


>ZFIN|ZDB-GENE-001205-4
            symbol:ctsk "cathepsin K" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-001205-4 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            CTD:1513 KO:K01371 OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55
            EMBL:BC092901 IPI:IPI00512751 RefSeq:NP_001017778.1
            UniGene:Dr.76224 ProteinModelPortal:Q568D6 SMR:Q568D6 GeneID:550475
            KEGG:dre:550475 InParanoid:Q568D6 NextBio:20879718
            ArrayExpress:Q568D6 Uniprot:Q568D6
        Length = 333

 Score = 600 (216.3 bits), Expect = 1.9e-58, P = 1.9e-58
 Identities = 138/335 (41%), Positives = 189/335 (56%)

Query:    15 ILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN 74
             +LVL +W     + +L++ +++E  E W   + R Y    E+ +R  I+++N+ +I + N
Sbjct:     9 LLVL-LWC--GLAHSLDNLSLDEAWESWKITHKREYNGLNEESIRRTIWEKNMLFIEAHN 65

Query:    75 NKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASI 132
              +       Y LG+N F D T EE      G +  +P  R    T V        +P SI
Sbjct:    66 KEYELGIHTYDLGMNHFGDMTLEEVAEKVMGLQ--MPMYRDPANTFVPDD-RVGKLPKSI 122

Query:   133 DWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGC 192
             D+RK G VT VK+QG CG CWAFS+V A+EG    T  +L  LS Q LVDC T  E+ GC
Sbjct:   123 DYRKLGYVTSVKNQGSCGSCWAFSSVGALEGQLMKTKGQLVDLSPQNLVDCVT--ENDGC 180

Query:   193 EGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALM 252
              GG M +AF ++ +N+G+ +E  YPY  +D  C    +   AA   GY+++P  NE AL 
Sbjct:   181 GGGYMTNAFRYVSNNQGIDSEESYPYVGTDQQCAYNTSGV-AASCRGYKEIPQGNERALT 239

Query:   253 KAVANQ-PVSVAIDASGSDFQFYSSGVFTG-QCGTE-LDHGVTAVGYGTADDGTKYWLVK 309
              AVAN  PVSV IDA  S F +Y SGV+    C  E ++H V AVGYG    G KYW+VK
Sbjct:   240 AAVANVGPVSVGIDAMQSTFLYYKSGVYYDPNCNKEDVNHAVLAVGYGATPRGKKYWIVK 299

Query:   310 NSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             NSWG  WG+ GY+ M R+   +   CGIA  AS+P
Sbjct:   300 NSWGEEWGKKGYVLMARN---RNNACGIANLASFP 331


>MGI|MGI:107823
            symbol:Ctsk "cathepsin K" species:10090 "Mus musculus"
            [GO:0005615 "extracellular space" evidence=ISO] [GO:0005737
            "cytoplasm" evidence=ISO] [GO:0005764 "lysosome" evidence=ISO]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0045453 "bone resorption" evidence=ISO]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:107823 GO:GO:0005615 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0045453
            GO:GO:0001957 HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 OMA:LKVPPSH EMBL:X94444
            EMBL:AJ006033 EMBL:BC046320 IPI:IPI00316575 PIR:S74227
            RefSeq:NP_031828.2 UniGene:Mm.272085 ProteinModelPortal:P55097
            SMR:P55097 MINT:MINT-3089515 STRING:P55097 PhosphoSite:P55097
            PRIDE:P55097 Ensembl:ENSMUST00000015664 GeneID:13038 KEGG:mmu:13038
            InParanoid:P55097 BioCyc:MetaCyc:MONOMER-14811 ChEMBL:CHEMBL1075277
            NextBio:282924 Bgee:P55097 CleanEx:MM_CTSK Genevestigator:P55097
            GermOnline:ENSMUSG00000028111 Uniprot:P55097
        Length = 329

 Score = 596 (214.9 bits), Expect = 5.1e-58, P = 5.1e-58
 Identities = 133/316 (42%), Positives = 185/316 (58%)

Query:    35 MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQ 92
             ++ + E+W   + + Y    ++  R  I+++N++ I++ N +A      Y+L +N   D 
Sbjct:    22 LDTQWELWKKTHQKQYNSKVDEISRRLIWEKNLKQISAHNLEASLGVHTYELAMNHLGDM 81

Query:    93 TNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE-NASVPASIDWRKKGAVTGVKDQGQCGC 151
             T+EE      G   R+P  RS  + D  +  E    VP SID+RKKG VT VK+QGQCG 
Sbjct:    82 TSEEVVQKMTGL--RIPPSRSY-SNDTLYTPEWEGRVPDSIDYRKKGYVTPVKNQGQCGS 138

Query:   152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
             CWAFS+  A+EG     T KL +LS Q LVDC T  E+ GC GG M  AF+++  N G+ 
Sbjct:   139 CWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVT--ENYGCGGGYMTTAFQYVQQNGGID 196

Query:   212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSD 270
             +E  YPY   D SC    A   AAK  GY ++P  NE AL +AVA   P+SV+IDAS + 
Sbjct:   197 SEDAYPYVGQDESC-MYNATAKAAKCRGYREIPVGNEKALKRAVARVGPISVSIDASLAS 255

Query:   271 FQFYSSGVFTGQ-CGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
             FQFYS GV+  + C  + ++H V  VGYGT   G+K+W++KNSWG +WG  GY  + R+ 
Sbjct:   256 FQFYSRGVYYDENCDRDNVNHAVLVVGYGT-QKGSKHWIIKNSWGESWGNKGYALLARN- 313

Query:   329 DAKEGLCGIAMQASYP 344
               K   CGI   AS+P
Sbjct:   314 --KNNACGITNMASFP 327


>DICTYBASE|DDB_G0278401
            symbol:cprH "cysteine proteinase 8" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0278401 EMBL:AAFI02000023
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 ProtClustDB:CLSZ2430780 RefSeq:XP_642342.1
            ProteinModelPortal:Q54Y60 MEROPS:C01.A62 EnsemblProtists:DDB0205428
            GeneID:8621547 KEGG:ddi:DDB_G0278401 InParanoid:Q54Y60 OMA:FANMENE
            Uniprot:Q54Y60
        Length = 337

 Score = 588 (212.0 bits), Expect = 3.6e-57, P = 3.6e-57
 Identities = 142/349 (40%), Positives = 199/349 (57%)

Query:    11 VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
             VL A+L+    A Q     L+++   +    WM    + Y  ++E   R+ IFK N +YI
Sbjct:     6 VLCALLITVATAKQE----LSESQYRDAFTDWMISNQKSY-SSSEFITRYNIFKTNFDYI 60

Query:    71 ASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPA 130
               +N+K       LG+N+ AD TNEE+R+   G      S+  ++  ++ F  + +S   
Sbjct:    61 EEWNSKGSETV--LGLNKMADITNEEYRSLYLGKPFDASSLIGTKE-EILFSNKFSS--- 114

Query:   131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHIT---TRKLTSLSEQELVDCDTSG 187
             ++DWRKKGAVT VK+Q  C  CW+FSA  A EG + +    T +L SLSEQ L+DC T  
Sbjct:   115 TVDWRKKGAVTHVKNQQSCSGCWSFSATGATEGAHKLANNGTNELVSLSEQNLIDCSTPF 174

Query:   188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
              + GC GG++  AFE+IISN G+ TE  YP++ +DG+C  K  N S A IS Y +V   +
Sbjct:   175 GNTGCNGGVITYAFEYIISNGGIDTEKSYPFEGTDGTCRYKSEN-SGATISSYVNVTFGS 233

Query:   248 EAALMKAVANQPVSVAIDASGSDFQFYSSGV-FTGQCG-TELDHGVTAVGYGT----ADD 301
             E++L  AV   PV+ +IDAS S F FY SG+ F   C  T LDHGV  VGYGT    + D
Sbjct:   234 ESSLESAVNVNPVACSIDASHSSFLFYKSGIYFEPACSRTNLDHGVLVVGYGTENSQSQD 293

Query:   302 GTK------YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
              +       YW+ KNSWG     NGYI M +D   ++ +CGI+  AS+P
Sbjct:   294 SSSEPNHSNYWIAKNSWGI----NGYILMSKD---RDNMCGISTLASFP 335


>UNIPROTKB|F1NZ37
            symbol:LOC420160 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 EMBL:AADN02062018
            IPI:IPI00587784 Ensembl:ENSGALT00000006765 OMA:CGVANQA
            Uniprot:F1NZ37
        Length = 340

 Score = 588 (212.0 bits), Expect = 3.6e-57, P = 3.6e-57
 Identities = 132/325 (40%), Positives = 189/325 (58%)

Query:    29 TLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN-NKARNK-PYKLGI 86
             T  D  + E  E W + Y + Y   AE  +R ++++ N+  I   N  +++ +  ++LG+
Sbjct:    24 TALDPVLEEAWERWKSLYAKEYPGEAEL-IRREVWENNLRRIEQHNWEESQGQHTFRLGM 82

Query:    87 NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKD 145
             N + D  +EEF    NG+    P V+  E   ++F+   A   PA +DWR +G VT VK+
Sbjct:    83 NHYGDLMDEEFNQLLNGFA---P-VQHEEPA-LTFQASAAQKTPAEVDWRMRGYVTPVKN 137

Query:   146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
             QG CG CWAFSA  A+EG+    T KL  LSEQ L+DC     + GC+GG M  AF+++ 
Sbjct:   138 QGHCGSCWAFSATGALEGLVFNWTGKLAVLSEQNLIDCSWKLGNNGCQGGYMTRAFQYVH 197

Query:   206 SNKGLATEAKYPYKASD-GSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVA 263
              N G+ +E  YPY+A+D  SC    A+  AA  S    V   +EAAL +AVA   PVSVA
Sbjct:   198 DNGGMNSEHIYPYQATDTSSCRYNPAD-RAANCSTVWLVAQGSEAALEQAVATVGPVSVA 256

Query:   264 IDASGSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGTADDGTK---YWLVKNSWGTTWGEN 319
             +DAS   F FY SG+F    C  +++HG+ AVGYG + +  K   YW++KNSW   WGE 
Sbjct:   257 VDASSFFFHFYKSGIFNSMFCSQKVNHGMLAVGYGISQEARKNVSYWILKNSWSEVWGEK 316

Query:   320 GYIRMQRDIDAKEGLCGIAMQASYP 344
             GYIR+ + ++     CG+A QAS+P
Sbjct:   317 GYIRLLKGVNNH---CGVANQASFP 338


>ZFIN|ZDB-GENE-050522-559
            symbol:ctssb.1 "cathepsin S, b.1" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-050522-559 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 MEROPS:C01.034
            EMBL:BC095694 IPI:IPI00607338 UniGene:Dr.75553
            ProteinModelPortal:Q502H6 SMR:Q502H6 InParanoid:Q502H6
            ArrayExpress:Q502H6 Uniprot:Q502H6
        Length = 330

 Score = 586 (211.3 bits), Expect = 5.9e-57, P = 5.9e-57
 Identities = 129/324 (39%), Positives = 184/324 (56%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEF 89
             +  +++  E+W   YG++Y    E+  R ++++ N++ I   N +A      Y L +N  
Sbjct:    20 NTNLDQHWELWKKTYGKIYTTEVEEFGRRQLWERNLQLITVHNLEASMGMHSYDLSMNHM 79

Query:    90 ADQTNEEF-------RAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTG 142
              D T EE          P +G+KR++ ++  S            +VP S+DWR+KG V+ 
Sbjct:    80 GDLTTEEILQTLALTHVP-SGFKRQIANIVGSS---------GDAVPDSLDWREKGYVSS 129

Query:   143 VKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFE 202
             VK QG CG CWAFS+V A+EG    TT KL  LS Q LVDC +   ++GC GG M DAF+
Sbjct:   130 VKMQGACGSCWAFSSVGALEGQLKKTTGKLVDLSPQNLVDCSSKYGNKGCNGGFMSDAFQ 189

Query:   203 FIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVS 261
             ++I N G+A+++ YPY+     C+   +   AA  + Y  V   +E AL +AVA+  P+S
Sbjct:   190 YVIDNGGIASDSAYPYRGVQQQCSYSSSQ-RAANCTKYYFVRQGDENALKQAVASVGPIS 248

Query:   262 VAIDASGSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENG 320
             VAIDA+   F  Y SGV+    C   ++H V  VGYGT   G  +WLVKNSWGT +G+ G
Sbjct:   249 VAIDATRPQFVLYHSGVYNDPTCSKRVNHAVLVVGYGTLS-GQDHWLVKNSWGTRFGDGG 307

Query:   321 YIRMQRDIDAKEGLCGIAMQASYP 344
             YIRM R+   K  +CGIA  A YP
Sbjct:   308 YIRMARN---KNNMCGIASYACYP 328


>RGD|621513
            symbol:Ctss "cathepsin S" species:10116 "Rattus norvegicus"
            [GO:0001656 "metanephros development" evidence=IEP] [GO:0002250
            "adaptive immune response" evidence=ISO] [GO:0005764 "lysosome"
            evidence=IEA;ISO] [GO:0006508 "proteolysis" evidence=IEA;ISO]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0009986 "cell
            surface" evidence=IDA] [GO:0016020 "membrane" evidence=ISO]
            [GO:0043231 "intracellular membrane-bounded organelle"
            evidence=ISO] [GO:0045453 "bone resorption" evidence=IMP]
            [GO:0051930 "regulation of sensory perception of pain"
            evidence=IMP] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            RGD:621513 GO:GO:0009986 GO:GO:0051930 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0045453
            GO:GO:0001656 HOVERGEN:HBG011513 CTD:1520 KO:K01368 MEROPS:I29.004
            BRENDA:3.4.22.27 EMBL:L03201 IPI:IPI00210228 PIR:A45087
            RefSeq:NP_059016.1 UniGene:Rn.11347 ProteinModelPortal:Q02765
            PhosphoSite:Q02765 PRIDE:Q02765 GeneID:50654 KEGG:rno:50654
            UCSC:RGD:621513 ChEMBL:CHEMBL1075217 NextBio:610462
            Genevestigator:Q02765 Uniprot:Q02765
        Length = 330

 Score = 581 (209.6 bits), Expect = 2.0e-56, P = 2.0e-56
 Identities = 137/323 (42%), Positives = 180/323 (55%)

Query:    29 TLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN--NKARNKPYKLGI 86
             T    T++   ++W     R   D  E+++R  I+++N+++I   N  +      Y +G+
Sbjct:    16 TAERPTLDHHWDLWKKTRMRRNTDQNEEDVRRLIWEKNLKFIMLHNLEHSMGMHSYSVGM 75

Query:    87 NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQ 146
             N   D T EE        +   P  RS      S    N ++P S+DWR+KG VT VK Q
Sbjct:    76 NHMGDMTPEEVIGYMGSLRIPRPWNRSGTLKSSS----NQTLPDSVDWREKGCVTNVKYQ 131

Query:   147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE--DQGCEGGLMDDAFEFI 204
             G CG CWAFSA  A+EG   + T KL SLS Q LVDC T  +  ++GC GG M +AF++I
Sbjct:   132 GSCGSCWAFSAEGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEAFQYI 191

Query:   205 ISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVA 263
             I    + +EA YPYKA D  C     N  AA  S Y ++P  +E AL +AVA + PVSV 
Sbjct:   192 IDTS-IDSEASYPYKAMDEKCLYDPKN-RAATCSRYIELPFGDEEALKEAVATKGPVSVG 249

Query:   264 ID-ASGSDFQFYSSGVFTG-QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGY 321
             ID AS S F  Y SGV+    C   ++HGV  VGYGT D G  YWLVKNSWG  +G+ GY
Sbjct:   250 IDDASHSSFFLYQSGVYDDPSCTENMNHGVLVVGYGTLD-GKDYWLVKNSWGLHFGDQGY 308

Query:   322 IRMQRDIDAKEGLCGIAMQASYP 344
             IRM R+    +  CGIA   SYP
Sbjct:   309 IRMARN---NKNHCGIASYCSYP 328


>TAIR|locus:2078312
            symbol:AT3G45310 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005773 EMBL:CP002686
            GenomeReviews:BA000014_GR eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AL132953
            EMBL:AY091771 IPI:IPI00540369 PIR:T47471 RefSeq:NP_566880.1
            UniGene:At.25239 ProteinModelPortal:Q8RWQ9 SMR:Q8RWQ9
            MEROPS:C01.162 PaxDb:Q8RWQ9 PRIDE:Q8RWQ9 EnsemblPlants:AT3G45310.1
            GeneID:823669 KEGG:ath:AT3G45310 GeneFarm:5032 TAIR:At3g45310
            InParanoid:Q8RWQ9 KO:K01366 OMA:AFEVVHE PhylomeDB:Q8RWQ9
            ProtClustDB:CLSN2689015 Genevestigator:Q8RWQ9 Uniprot:Q8RWQ9
        Length = 358

 Score = 580 (209.2 bits), Expect = 2.5e-56, P = 2.5e-56
 Identities = 130/305 (42%), Positives = 173/305 (56%)

Query:    45 QYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGY 104
             +YG+ Y+   E ++RF +FKEN++ I S N K  +  YKL +N+FAD T +EF+  + G 
Sbjct:    65 RYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLS--YKLSLNQFADLTWQEFQRYKLGA 122

Query:   105 KRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGI 164
              +       S T   S +   A+VP + DWR+ G V+ VK+QG CG CW FS   A+E  
Sbjct:   123 AQNC-----SATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAA 177

Query:   165 NHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGS 224
              H    K  SLSEQ+LVDC  +  + GC GGL   AFE+I  N GL TE  YPY   DG 
Sbjct:   178 YHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGG 237

Query:   225 CNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFTGQ- 282
             C K  A     ++    ++    E  L  AV   +PVSVA +    +F+FY  GVFT   
Sbjct:   238 C-KFSAKNIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVV-HEFRFYKKGVFTSNT 295

Query:   283 CG-TELD--HGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
             CG T +D  H V AVGYG  DD   YWL+KNSWG  WG+NGY +M+      + +CG+A 
Sbjct:   296 CGNTPMDVNHAVLAVGYGVEDD-VPYWLIKNSWGGEWGDNGYFKMEMG----KNMCGVAT 350

Query:   340 QASYP 344
              +SYP
Sbjct:   351 CSSYP 355


>UNIPROTKB|D3ZZR3
            symbol:D3ZZR3 "Uncharacterized protein" species:10116
            "Rattus norvegicus" [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0016020 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0002250 GeneTree:ENSGT00560000076577 GO:GO:0097067
            OrthoDB:EOG4JM7Q2 IPI:IPI00210228 PRIDE:D3ZZR3
            Ensembl:ENSRNOT00000028732 Uniprot:D3ZZR3
        Length = 331

 Score = 577 (208.2 bits), Expect = 5.3e-56, P = 5.3e-56
 Identities = 132/317 (41%), Positives = 179/317 (56%)

Query:    37 ERH-EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN--NKARNKPYKLGINEFADQT 93
             + H ++W   + + Y+D  E+++R  I+++N+++I   N  +      Y +G+N   D  
Sbjct:    22 DHHWDLWKKTHEKEYKDQNEEDVRRLIWEKNLKFIMLHNLEHSMGMHSYSVGMNHMGDMV 81

Query:    94 NEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDW--RKKGAVTGVKDQGQCGC 151
              E           RLP  R  +   +     N ++PA + W  R KG    +  QG CG 
Sbjct:    82 AETIIGEMGS--ERLP--RKRKALGLIPSSVNQNLPAGVKWKERTKGCWKNLVFQGSCGS 137

Query:   152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE--DQGCEGGLMDDAFEFIISNKG 209
             CWAFSAV A+EG   + T KL SLS Q LVDC T  +  ++GC GG M +AF++II N G
Sbjct:   138 CWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEAFQYIIDNGG 197

Query:   210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASG 268
             + +EA YPYKA D  C+    N  AA  S Y ++P  +E AL +AVA + PVSV IDAS 
Sbjct:   198 IDSEASYPYKAMDEKCHYDPKN-RAATCSRYIELPFGDEEALKEAVATKGPVSVGIDASH 256

Query:   269 SDFQFYSSGVFTG-QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
             S F  Y SGV+    C   ++HGV  VGYGT D G  YWLVKNSWG  +G+ GYIRM R+
Sbjct:   257 SSFFLYQSGVYDDPSCTENVNHGVLVVGYGTLD-GKDYWLVKNSWGLHFGDQGYIRMARN 315

Query:   328 IDAKEGLCGIAMQASYP 344
                 +  CGIA   SYP
Sbjct:   316 ---NKNHCGIASYCSYP 329


>MGI|MGI:1349426
            symbol:Ctsj "cathepsin J" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=ISO]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0048471 "perinuclear region
            of cytoplasm" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:1349426 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF136272
            EMBL:AF158182 EMBL:AY034579 EMBL:AK005526 EMBL:AK131661
            EMBL:BC103769 IPI:IPI00126770 RefSeq:NP_036137.1 UniGene:Mm.31948
            ProteinModelPortal:Q9R014 SMR:Q9R014 MEROPS:C01.038 PRIDE:Q9R014
            Ensembl:ENSMUST00000071526 GeneID:26898 KEGG:mmu:26898
            UCSC:uc007qwa.1 CTD:26898 InParanoid:Q9R014 KO:K09599
            NextBio:304745 Bgee:Q9R014 CleanEx:MM_CTSJ Genevestigator:Q9R014
            GermOnline:ENSMUSG00000055298 Uniprot:Q9R014
        Length = 334

 Score = 573 (206.8 bits), Expect = 1.4e-55, P = 1.4e-55
 Identities = 132/342 (38%), Positives = 194/342 (56%)

Query:    11 VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
             VL  IL  GV    S ++  +D  ++   + W  +Y + Y    E+ +R  +++EN+  I
Sbjct:     5 VLLLILCFGV---ASGAQA-HDPKLDAEWKDWKTKYAKSYSPK-EEALRRAVWEENMRMI 59

Query:    71 ASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
                N  N      + + +N+F DQT+EEFR   +     +P + ++ T   +  + +  +
Sbjct:    60 KLHNKENSLGKNNFTMKMNKFGDQTSEEFRKSIDN----IP-IPAAMTDPHAQNHVSIGL 114

Query:   129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
             P   DWR++G VT V++QG+CG CWAF+A  A+EG     T  LT LS Q L+DC  +  
Sbjct:   115 PDYKDWREEGYVTPVRNQGKCGSCWAFAAAGAIEGQMFWKTGNLTPLSVQNLLDCSKTVG 174

Query:   189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
             ++GC+ G    AFE+++ NKGL  EA YPY+  DG C  +  N SA  I+ Y ++P N E
Sbjct:   175 NKGCQSGTAHQAFEYVLKNKGLEAEATYPYEGKDGPCRYRSENASA-NITDYVNLPPN-E 232

Query:   249 AALMKAVAN-QPVSVAIDASGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYGT-AD--DG 302
               L  AVA+  PVS AIDAS   F+FY+ G++    C +  ++H V  VGYG+  D  DG
Sbjct:   233 LYLWVAVASIGPVSAAIDASHDSFRFYNGGIYYEPNCSSYFVNHAVLVVGYGSEGDVKDG 292

Query:   303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
               YWL+KNSWG  WG NGY+++ +D       CGIA  ASYP
Sbjct:   293 NNYWLIKNSWGEEWGMNGYMQIAKD---HNNHCGIASLASYP 331


>RGD|69241 [details] [associations]
            symbol:Ctsj "cathepsin J" species:10116 "Rattus norvegicus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0048471 "perinuclear region of cytoplasm"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:L14776
            RGD:69241 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246
            SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 HOVERGEN:HBG011513 MEROPS:C01.038 CTD:26898 KO:K09599
            EMBL:AF310623 EMBL:BC097263 IPI:IPI00205027 PIR:I58002
            RefSeq:NP_058817.1 UniGene:Rn.34875 ProteinModelPortal:Q63088
            SMR:Q63088 PRIDE:Q63088 GeneID:29174 KEGG:rno:29174 NextBio:608244
            Genevestigator:Q63088 Uniprot:Q63088
        Length = 334

 Score = 573 (206.8 bits), Expect = 1.4e-55, P = 1.4e-55
 Identities = 133/342 (38%), Positives = 190/342 (55%)

Query:    11 VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
             V   IL  GV A  + +R   D  ++   + W  +Y + Y    E+E++  +++EN++ I
Sbjct:     5 VFLVILCFGV-ASGAPAR---DPNLDAEWQDWKTKYAKSYSP-VEEELKRAVWEENLKMI 59

Query:    71 ASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
                N  N      + + +N FAD T EEFR   +     +P+  ++ +     +  +  +
Sbjct:    60 QLHNKENGLGKNGFTMEMNAFADTTGEEFRKSLSDIL--IPAAVTNPSAQ---KQVSIGL 114

Query:   129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
             P   DWRK+G VT V++QG+CG CWAF+AV A+EG     T  LT LS Q L+DC  S  
Sbjct:   115 PNFKDWRKEGYVTPVRNQGKCGSCWAFAAVGAIEGQMFSKTGNLTPLSVQNLLDCSKSEG 174

Query:   189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
             + GC  G    AF +++ NKGL  EA YPY+  DG C     N SA  I+G+ ++P N E
Sbjct:   175 NNGCRWGTAHQAFNYVLKNKGLEAEATYPYEGKDGPCRYHSENASA-NITGFVNLPPN-E 232

Query:   249 AALMKAVAN-QPVSVAIDASGSDFQFYSSGVF-TGQCGTEL-DHGVTAVGYG---TADDG 302
               L  AVA+  PVS AIDAS   F+FYS GV+    C + + +H V  VGYG      DG
Sbjct:   233 LYLWVAVASIGPVSAAIDASHDSFRFYSGGVYHEPNCSSYVVNHAVLVVGYGFEGNETDG 292

Query:   303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
               YWL+KNSWG  WG NG++++ +D   +   CGIA QAS+P
Sbjct:   293 NNYWLIKNSWGEEWGINGFMKIAKD---RNNHCGIASQASFP 331


>UNIPROTKB|Q90686 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9031 "Gallus gallus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            CTD:1513 KO:K01371 PANTHER:PTHR12411:SF55 EMBL:U37691
            IPI:IPI00575213 RefSeq:NP_990302.1 UniGene:Gga.51509
            ProteinModelPortal:Q90686 SMR:Q90686 MEROPS:C01.036 GeneID:395818
            KEGG:gga:395818 NextBio:20815886 Uniprot:Q90686
        Length = 334

 Score = 571 (206.1 bits), Expect = 2.3e-55, P = 2.3e-55
 Identities = 126/267 (47%), Positives = 164/267 (61%)

Query:    82 YKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVT 141
             ++L +N   D T+EE      G   R+P  R      +     ++  PA++DWR+KG VT
Sbjct:    76 FQLAMNYLGDMTSEEVVRTMTGL--RVPRSRPRPNGTLYVPDWSSRAPAAVDWRRKGYVT 133

Query:   142 GVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAF 201
              VKDQGQCG CWAFS+V A+EG     T KL SLS Q LV C ++  + GC GG M +AF
Sbjct:   134 PVKDQGQCGSCWAFSSVGALEGQLKRRTGKLLSLSPQNLVYCVSN--NNGCGGGYMTNAF 191

Query:   202 EFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPV 260
             E++  N+G+ +E  YPY   D SC        AAK  GY ++P +NE AL +AVA   PV
Sbjct:   192 EYVRLNRGIDSEDAYPYIGQDESCMYSPTG-KAAKCRGYREIPEDNEKALKRAVARIGPV 250

Query:   261 SVAIDASGSDFQFYSSGVF--TGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWG 317
             SV IDAS   FQFYS GV+  TG C  E ++H V AVGYG A  GTK+W++KNSWGT WG
Sbjct:   251 SVGIDASLPSFQFYSRGVYYDTG-CNPENINHAVLAVGYG-AQKGTKHWIIKNSWGTEWG 308

Query:   318 ENGYIRMQRDIDAKEGLCGIAMQASYP 344
               GY+ + R++  K+  CGIA  AS+P
Sbjct:   309 NKGYVLLARNM--KQ-TCGIANLASFP 332


>FB|FBgn0260462 [details] [associations]
            symbol:CG12163 species:7227 "Drosophila melanogaster"
            [GO:0035071 "salivary gland cell autophagic cell death"
            evidence=IEP] [GO:0048102 "autophagic cell death" evidence=IEP]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0004869 "cysteine-type
            endopeptidase inhibitor activity" evidence=IEA] [GO:0045169
            "fusome" evidence=IDA] [GO:0035220 "wing disc development"
            evidence=IGI] [GO:0022416 "chaeta development" evidence=IGI]
            InterPro:IPR000010 InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00043 SMART:SM00645
            InterPro:IPR000169 EMBL:AE014297 GO:GO:0004869 eggNOG:COG4870
            KO:K01373 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0022416 GO:GO:0035220 GO:GO:0035071
            GO:GO:0045169 GeneTree:ENSGT00660000095458 EMBL:AY121614
            EMBL:BT003231 RefSeq:NP_649521.1 RefSeq:NP_730901.1
            RefSeq:NP_730902.2 UniGene:Dm.7315 ProteinModelPortal:Q9VN93
            SMR:Q9VN93 DIP:DIP-17491N IntAct:Q9VN93 MINT:MINT-763966
            STRING:Q9VN93 MEROPS:C01.A27 PaxDb:Q9VN93
            EnsemblMetazoa:FBtr0078823 GeneID:40628 KEGG:dme:Dmel_CG12163
            UCSC:CG12163-RA FlyBase:FBgn0260462 InParanoid:Q9VN93 OMA:GPRWGEQ
            OrthoDB:EOG4CC2G9 PhylomeDB:Q9VN93 GenomeRNAi:40628 NextBio:819744
            Bgee:Q9VN93 GermOnline:CG12163 Uniprot:Q9VN93
        Length = 614

 Score = 568 (205.0 bits), Expect = 4.8e-55, P = 4.8e-55
 Identities = 124/311 (39%), Positives = 177/311 (56%)

Query:    42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
             +  ++GR Y   AE++MR +IF++N++ I   N        K GI EFAD T+ E++  R
Sbjct:   311 FQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSA-KYGITEFADMTSSEYKE-R 368

Query:   102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
              G  +R  +  +  +  V   Y +  +P   DWR+K AVT VK+QG CG CWAFS    +
Sbjct:   369 TGLWQRDEAKATGGSAAVVPAY-HGELPKEFDWRQKDAVTQVKNQGSCGSCWAFSVTGNI 427

Query:   162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
             EG+  + T +L   SEQEL+DCDT+  D  C GGLMD+A++ I    GL  EA+YPYKA 
Sbjct:   428 EGLYAVKTGELKEFSEQELLDCDTT--DSACNGGLMDNAYKAIKDIGGLEYEAEYPYKAK 485

Query:   222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMK-AVANQPVSVAIDASGSDFQFYSSGV-- 278
                C+      S  +++G+ D+P  NE A+ +  +AN P+S+ I+A+    QFY  GV  
Sbjct:   486 KNQCHFNRTL-SHVQVAGFVDLPKGNETAMQEWLLANGPISIGINANA--MQFYRGGVSH 542

Query:   279 -FTGQCGTE-LDHGVTAVGYGTAD-----DGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
              +   C  + LDHGV  VGYG +D         YW+VKNSWG  WGE GY R+ R     
Sbjct:   543 PWKALCSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYRVYRG---- 598

Query:   332 EGLCGIAMQAS 342
             +  CG++  A+
Sbjct:   599 DNTCGVSEMAT 609


>TAIR|locus:2175088 [details] [associations]
            symbol:ALP "aleurain-like protease" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0005829 "cytosol"
            evidence=RCA] [GO:0006096 "glycolysis" evidence=RCA] [GO:0006816
            "calcium ion transport" evidence=RCA] [GO:0006833 "water transport"
            evidence=RCA] [GO:0006972 "hyperosmotic response" evidence=RCA]
            [GO:0007030 "Golgi organization" evidence=RCA] [GO:0009266
            "response to temperature stimulus" evidence=RCA] [GO:0009651
            "response to salt stress" evidence=RCA] [GO:0009750 "response to
            fructose stimulus" evidence=RCA] [GO:0042744 "hydrogen peroxide
            catabolic process" evidence=RCA] [GO:0046686 "response to cadmium
            ion" evidence=RCA] [GO:0007568 "aging" evidence=IEP]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002688 GO:GO:0005773
            GO:GO:0007568 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AB011483 KO:K01366
            ProtClustDB:CLSN2689015 UniGene:At.25414 IPI:IPI00846287
            RefSeq:NP_001078774.1 ProteinModelPortal:A8MQZ1 SMR:A8MQZ1
            STRING:A8MQZ1 PRIDE:A8MQZ1 EnsemblPlants:AT5G60360.3 GeneID:836158
            KEGG:ath:AT5G60360 OMA:CGSTPMD Genevestigator:A8MQZ1 Uniprot:A8MQZ1
        Length = 361

 Score = 564 (203.6 bits), Expect = 1.3e-54, P = 1.3e-54
 Identities = 124/297 (41%), Positives = 171/297 (57%)

Query:    45 QYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGY 104
             +YG+ Y++  E ++RF IFKEN++ I S N K  +  YKLG+N+FAD T +EF+  + G 
Sbjct:    65 RYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLS--YKLGVNQFADLTWQEFQRTKLGA 122

Query:   105 KRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGI 164
              +       S T   S +   A++P + DWR+ G V+ VKDQG CG CW FS   A+E  
Sbjct:   123 AQNC-----SATLKGSHKVTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAA 177

Query:   165 NHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGS 224
              H    K  SLSEQ+LVDC  +  + GC GGL   AFE+I SN GL TE  YPY   D +
Sbjct:   178 YHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKDET 237

Query:   225 CNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFT-GQ 282
             C K  A     ++    ++    E  L  AV   +PVS+A +   S F+ Y SGV+T   
Sbjct:   238 C-KFSAENVGVQVLNSVNITLGAEDELKHAVGLVRPVSIAFEVIHS-FRLYKSGVYTDSH 295

Query:   283 CGT---ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
             CG+   +++H V AVGYG  +DG  YWL+KNSWG  WG+ GY +M+      + +CG
Sbjct:   296 CGSTPMDVNHAVLAVGYGV-EDGVPYWLIKNSWGADWGDKGYFKMEMG----KNMCG 347


>MGI|MGI:1922258 [details] [associations]
            symbol:4930486L24Rik "RIKEN cDNA 4930486L24 gene"
            species:10090 "Mus musculus" [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030054 "cell
            junction" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 MGI:MGI:1922258
            GO:GO:0005576 GO:GO:0030054 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 HSSP:P07711
            EMBL:AY146988 EMBL:AK145933 EMBL:BC061218 IPI:IPI00280732
            RefSeq:NP_835199.1 UniGene:Mm.19839 ProteinModelPortal:Q80UB0
            SMR:Q80UB0 MEROPS:C01.972 PRIDE:Q80UB0 Ensembl:ENSMUST00000091569
            GeneID:214639 KEGG:mmu:214639 UCSC:uc007qvs.1 InParanoid:Q80UB0
            OMA:RYHAENS OrthoDB:EOG4XWG0N NextBio:374408 Bgee:Q80UB0
            CleanEx:MM_4930486L24RIK Genevestigator:Q80UB0 Uniprot:Q80UB0
        Length = 333

 Score = 561 (202.5 bits), Expect = 2.6e-54, P = 2.6e-54
 Identities = 135/343 (39%), Positives = 184/343 (53%)

Query:    12 LAAILVLGVWAPQ--SWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
             + A+L L +   +  S + TL D +++ +   W  ++G+ Y  N E+ +R  ++++N + 
Sbjct:     1 MIAVLFLAILCLEIDSTAPTL-DPSLDVQWNEWRTKHGKAYNVNEER-LRRAVWEKNFKM 58

Query:    70 IASFN-NKARNK-PYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS 127
             I   N      K  + + +N F D TN EF     G++R+    R     D  F Y    
Sbjct:    59 IELHNWEYLEGKHDFTMTMNAFGDLTNTEFVKMMTGFRRQKIK-RMHVFQDHQFLY---- 113

Query:   128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
             VP  +DWR  G VT VK+QG C   WAFSA  ++EG     T +L  LSEQ L+DC  S 
Sbjct:   114 VPKYVDWRMLGYVTPVKNQGYCASSWAFSATGSLEGQMFKKTGRLVPLSEQNLLDCMGSN 173

Query:   188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
                 C GG M +AF+++  N GLATE  YPY      C +  A  SAA +  +  +P   
Sbjct:   174 VTHDCSGGFMQNAFQYVKDNGGLATEESYPYIGPGRKC-RYHAENSAANVRDFVQIPGRE 232

Query:   248 EAALMKAVANQ-PVSVAIDASGSDFQFYSSGVF-TGQCG-TELDHGVTAVGYG---TADD 301
             EA LMKAVA   P+SVA+DAS   FQFY SG++   QC    L+H V  VGYG      D
Sbjct:   233 EA-LMKAVAKVGPISVAVDASHDSFQFYDSGIYYEPQCKRVHLNHAVLVVGYGFEGEESD 291

Query:   302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             G  YWLVKNSWG  WG  GYI++ +D +     CGIA  A+YP
Sbjct:   292 GNSYWLVKNSWGEEWGMKGYIKIAKDWNNH---CGIATLATYP 331


>ZFIN|ZDB-GENE-040426-1583 [details] [associations]
            symbol:ctssa "cathepsin S, a" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-040426-1583
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            GeneTree:ENSGT00660000095458 EMBL:CR548627 IPI:IPI00491948
            UniGene:Dr.81560 SMR:Q1L8W8 Ensembl:ENSDART00000053638 OMA:RNTREER
            OrthoDB:EOG480HX9 Uniprot:Q1L8W8
        Length = 328

 Score = 560 (202.2 bits), Expect = 3.4e-54, P = 3.4e-54
 Identities = 119/309 (38%), Positives = 179/309 (57%)

Query:    42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
             W +Q+ + YR+  E+ +R  ++K+N++ I   N  A      Y LG+N+ +D T +E   
Sbjct:    30 WKSQHNKTYRNTREERLRRSVWKQNLQDILLHNEAAAVGLHSYTLGLNQLSDMTADEVND 89

Query:   100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
                  +   P V ++ +   S +    ++P  ++W + G V+ V++QG CG CWAFSAV 
Sbjct:    90 MNGLLEEDFPDVNATFSPP-SLQ----TLPQRVNWTEHGMVSPVQNQGPCGSCWAFSAVG 144

Query:   160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
             ++E      T  L  LS Q L+DC  S  ++GC+GG +  AF ++I N+G+ +   YPY+
Sbjct:   145 SLEAQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRAFLYVIQNRGIDSSTFYPYE 204

Query:   220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV 278
               +G C +   +  A   +G+  VP +NEAAL  AVAN  PVSV I+A    F  Y SG+
Sbjct:   205 HKEGVC-RYSVSGRAGYCTGFRIVPRHNEAALQSAVANIGPVSVGINAKLLSFHRYRSGI 263

Query:   279 FTG-QCGTEL-DHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
             +   +C + L +H V  VGYG+ ++G  YWLVKNSWGT WGENGYIRM R+    + +CG
Sbjct:   264 YNDPKCSSALINHAVLVVGYGS-ENGQDYWLVKNSWGTAWGENGYIRMARN----KNMCG 318

Query:   337 IAMQASYPT 345
             I+    YPT
Sbjct:   319 ISSFGIYPT 327


>ZFIN|ZDB-GENE-050208-336 [details] [associations]
            symbol:ctskl "cathepsin K, like" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-050208-336 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:BX465190
            GeneTree:ENSGT00660000095458 IPI:IPI00491185 RefSeq:XP_695425.1
            UniGene:Dr.110795 Ensembl:ENSDART00000062749 GeneID:567046
            KEGG:dre:567046 CTD:567046 NextBio:20888499 Bgee:F1QCP8
            Uniprot:F1QCP8
        Length = 349

 Score = 559 (201.8 bits), Expect = 4.3e-54, P = 4.3e-54
 Identities = 134/338 (39%), Positives = 185/338 (54%)

Query:    14 AILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASF 73
             A+  L VWAP   +    +    E + +W  ++   Y + +E   R  I++ N++ I   
Sbjct:    17 AVFALLVWAPVQVASESEEEAPTEWN-LWKKKHEISYDEESEDVHRKTIWETNMQKIWKN 75

Query:    74 NNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV--P 129
             NN        +K+ +N++ D T+ E++    G K +    R  + T       NA     
Sbjct:    76 NNDFSFGLSMFKMAMNKYGDLTSVEYKRLL-GSKIKGTGNRKGKITSAQMLRLNAKRLGV 134

Query:   130 ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
              +ID+R KG VT VKDQG CG CW+FS   A+EG  +  T +L SLSEQ+LVDC  S   
Sbjct:   135 TNIDYRAKGYVTEVKDQGYCGSCWSFSTTGAIEGQMYKHTGRLVSLSEQQLVDCSRSYGT 194

Query:   190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
              GC G  M +A++++I+N  L +   YPY + D      E N + A IS Y  VP+ NE 
Sbjct:   195 YGCSGAWMANAYDYVINN-ALESSDTYPYTSVDTQPCFYEKNLAMAGISDYRFVPAGNEQ 253

Query:   250 ALMKAVANQ-PVSVAIDASGSDFQFYSSGVFT-GQCG-TELDHGVTAVGYGTADDGTKYW 306
             AL  AVA   PVSVAIDA    F FYSSG++    C    L+H V  VGYG+ ++GT YW
Sbjct:   254 ALADAVATVGPVSVAIDADNPSFLFYSSGIYKESNCNPNNLNHAVLVVGYGS-EEGTDYW 312

Query:   307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             ++KNSWGT WGE GY+RM R+    +  CGIA  A YP
Sbjct:   313 IIKNSWGTGWGEGGYMRMIRN---GKNTCGIASYALYP 347


>RGD|708447 [details] [associations]
            symbol:Testin "testin gene" species:10116 "Rattus norvegicus"
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0030054 "cell junction" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 RGD:708447 GO:GO:0005576 GO:GO:0030054 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            MEROPS:C01.972 OMA:RYHAENS OrthoDB:EOG4XWG0N EMBL:U16858
            IPI:IPI00207173 PIR:I52525 PIR:PC1251 RefSeq:NP_775155.1
            UniGene:Rn.10029 ProteinModelPortal:P15242 SMR:P15242
            Ensembl:ENSRNOT00000024467 GeneID:286916 KEGG:rno:286916
            UCSC:RGD:708447 CTD:286916 InParanoid:P15242 NextBio:625036
            Genevestigator:P15242 GermOnline:ENSRNOG00000018028 Uniprot:P15242
        Length = 333

 Score = 555 (200.4 bits), Expect = 1.1e-53, P = 1.1e-53
 Identities = 125/322 (38%), Positives = 175/322 (54%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK--ARNKPYKLGINEF 89
             D +++     W  ++G+ Y  N E+  R  ++++N + I   N +       + + +N F
Sbjct:    22 DPSLDVEWNEWRTKHGKTYNMNEERLKR-AVWEKNFKMIELHNWEYLEGRHDFTMAMNAF 80

Query:    90 ADQTNEEFRAPRNGYKRRLPSVRSSET-TDVSFRYENASVPASIDWRKKGAVTGVKDQGQ 148
              D TN EF     G++R+   ++ +    D  F Y    VP  +DWR+ G VT VK+QG 
Sbjct:    81 GDLTNIEFVKMMTGFQRQ--KIKKTHIFQDHQFLY----VPKRVDWRQLGYVTPVKNQGH 134

Query:   149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
             C   WAFSA  ++EG     T +L  LSEQ L+DC  S    GC GG M  AF+++  N 
Sbjct:   135 CASSWAFSATGSLEGQMFRKTERLIPLSEQNLLDCMGSNVTHGCSGGFMQYAFQYVKDNG 194

Query:   209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDAS 267
             GLATE  YPY+     C +  A  SAA +  +  +P + EA LMKAVA   P+SVA+DAS
Sbjct:   195 GLATEESYPYRGQGREC-RYHAENSAANVRDFVQIPGSEEA-LMKAVAKVGPISVAVDAS 252

Query:   268 GSDFQFYSSGVF-TGQCG-TELDHGVTAVGYG---TADDGTKYWLVKNSWGTTWGENGYI 322
                FQFY SG++   QC    L+H V  VGYG      DG  +WLVKNSWG  WG  GY+
Sbjct:   253 HGSFQFYGSGIYYEPQCKRVHLNHAVLVVGYGFEGEESDGNSFWLVKNSWGEEWGMKGYM 312

Query:   323 RMQRDIDAKEGLCGIAMQASYP 344
             ++ +D       CGIA  ++YP
Sbjct:   313 KLAKDWSNH---CGIATYSTYP 331


>DICTYBASE|DDB_G0272815 [details] [associations]
            symbol:cprE "cysteine proteinase 5" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0272815 GO:GO:0005615
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GenomeReviews:CM000151_GR GO:GO:0005764
            EMBL:AAFI02000008 MEROPS:I29.003 KO:K01376 EMBL:L36205
            RefSeq:XP_644977.1 ProteinModelPortal:P54640 SMR:P54640
            PRIDE:P54640 EnsemblProtists:DDB0185092 GeneID:8618654
            KEGG:ddi:DDB_G0272815 OMA:METAFEF ProtClustDB:CLSZ2430780
            Uniprot:P54640
        Length = 344

 Score = 553 (199.7 bits), Expect = 1.9e-53, P = 1.9e-53
 Identities = 121/259 (46%), Positives = 162/259 (62%)

Query:    42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
             WM  + + Y    E   R+ IFK N++Y+  +N+K       LG+N FAD TNEE+R   
Sbjct:    33 WMITHQKSYTSE-EFGARYNIFKANMDYVQQWNSKGSETV--LGLNNFADITNEEYRNTY 89

Query:   102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
              G K    S+  ++   V F   +A   AS DWR +GAVT VK+QGQCG CW+FS   + 
Sbjct:    90 LGTKFDASSLIGTQEEKV-FTTSSA---ASKDWRSEGAVTPVKNQGQCGGCWSFSTTGST 145

Query:   162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
             EG +  +  +L SLSEQ L+DC T  E+ GC+GGLM  AFE+II+N G+ TE+ YPYKA 
Sbjct:   146 EGAHFQSKGELVSLSEQNLIDCST--ENSGCDGGLMTYAFEYIINNNGIDTESSYPYKAE 203

Query:   222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF-T 280
             +G C  K  N S A +S Y+ V + +E++L  AV   PVSVAIDAS   FQ Y+SG++  
Sbjct:   204 NGKCEYKSEN-SGATLSSYKTVTAGSESSLESAVNVNPVSVAIDASHQSFQLYTSGIYYE 262

Query:   281 GQCGTE-LDHGVTAVGYGT 298
              +C +E LDHGV AVGYG+
Sbjct:   263 PECSSENLDHGVLAVGYGS 281


>UNIPROTKB|J9P7C5 [details] [associations]
            symbol:J9P7C5 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 EMBL:AAEX03010953
            Ensembl:ENSCAFT00000012925 Uniprot:J9P7C5
        Length = 321

 Score = 553 (199.7 bits), Expect = 1.9e-53, P = 1.9e-53
 Identities = 132/318 (41%), Positives = 183/318 (57%)

Query:    35 MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNKP-YKLGINEFADQ 92
             +++R++ W A + R+Y  N E+  R  ++++N++ I   N + ++ K  + + +N F D 
Sbjct:    21 LDQRYQ-WKAMHRRLYGMN-EEGWRRAVWEKNMKMIELHNREYSQGKHGFTMAMNAFGDM 78

Query:    93 TNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCC 152
             TNEEFR   NG++ +    +     +  F    A +P S+DWR+KG VT VK+QGQCG C
Sbjct:    79 TNEEFRQVINGFQNQKHK-KGKVFQEPLF----AEIPKSVDWREKGYVTPVKNQGQCGSC 133

Query:   153 WAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLAT 212
             WAFSA  A EG     T  L  LSEQ L      G ++GC GGLMD+AF+++  N+ L +
Sbjct:   134 WAFSATGAFEGQMFWKTGNLVPLSEQNLAQ----G-NEGCNGGLMDNAFQYVKDNRCLDS 188

Query:   213 EAKYPYKASD-GSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSD 270
             E  YPY   D  +CN K    SAA  SG+ D+P   E ALMKA+A    ++VAIDA    
Sbjct:   189 EESYPYLGRDTDTCNYKP-ECSAAHDSGFVDLPQR-EKALMKAMATLGSITVAIDAGHQY 246

Query:   271 FQFYSSGV-FTGQCGT-ELDHGVTAVGYG-TADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
             FQFY S + F   C + +LDHGV  VGYG    D    W+VKNSW   WG N Y++M + 
Sbjct:   247 FQFYKSSIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKWIVKNSWSPEWGWNSYVKMAK- 305

Query:   328 IDAKEGLCGIAMQASYPT 345
                +   CGI   ASYPT
Sbjct:   306 --GQNNHCGITA-ASYPT 320


>ZFIN|ZDB-GENE-030131-3539 [details] [associations]
            symbol:ctsh "cathepsin H" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-030131-3539
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 KO:K01366 HOVERGEN:HBG011513
            CTD:1512 OrthoDB:EOG4W9J43 MEROPS:I29.003 HSSP:P43235 EMBL:BC067615
            IPI:IPI00506892 RefSeq:NP_997853.1 UniGene:Dr.14176
            ProteinModelPortal:Q6NWF2 SMR:Q6NWF2 PRIDE:Q6NWF2 GeneID:324818
            KEGG:dre:324818 InParanoid:Q6NWF2 NextBio:20808976 Bgee:Q6NWF2
            Uniprot:Q6NWF2
        Length = 330

 Score = 553 (199.7 bits), Expect = 1.9e-53, P = 1.9e-53
 Identities = 125/316 (39%), Positives = 174/316 (55%)

Query:    36 NERH-EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
             +E H + WM+QY + Y  N E   R +IF EN + I   N    N  + +G+N+F+D T 
Sbjct:    26 DEYHFKSWMSQYNKKYEIN-EFYQRLQIFLENKKRIDQHNEG--NHKFSMGLNQFSDMTF 82

Query:    95 EEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGA-VTGVKDQGQCGCCW 153
              EF+     Y    P  ++   T  +    N   P +IDWR KG  +T VK+QG CG CW
Sbjct:    83 AEFKKT---YLLTEP--QNCSATRGNHVSSNGLYPDAIDWRTKGHYITDVKNQGPCGSCW 137

Query:   154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
              FS    +E +  I T KL  L+EQ+L+DC    ++ GC GGL   AFE+I+ NKGL TE
Sbjct:   138 TFSTTGCLESVTAIATGKLLQLAEQQLIDCAGDFDNHGCNGGLPSHAFEYIMYNKGLMTE 197

Query:   214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQ 272
               YPY+A  G C  K    +AA +    ++   +E  ++ AVA   PVS A + + SDF 
Sbjct:   198 DDYPYQAKGGQCRFKP-QLAAAFVKEVVNITKYDEMGMVDAVARLNPVSFAYEVT-SDFM 255

Query:   273 FYSSGVFTG-QCGTELD---HGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
              Y  G++T  +C    D   H V AVGY   ++GT YW+VKNSWGT WG  GY  ++R  
Sbjct:   256 HYKDGIYTSTECHNTTDMVNHAVLAVGYAE-ENGTPYWIVKNSWGTNWGIKGYFYIERG- 313

Query:   329 DAKEGLCGIAMQASYP 344
                + +CG+A  +SYP
Sbjct:   314 ---KNMCGLAACSSYP 326


>RGD|1562210 [details] [associations]
            symbol:MGC114246 "similar to cathepsin R" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1562210 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 EMBL:CH474032 MEROPS:C01.042 KO:K09601
            OMA:ASHESFK OrthoDB:EOG4ZCT6D EMBL:BC091563 IPI:IPI00555186
            RefSeq:NP_001017509.1 UniGene:Rn.198321 SMR:Q5BJA0
            Ensembl:ENSRNOT00000061470 GeneID:498688 KEGG:rno:498688
            UCSC:RGD:1562210 InParanoid:Q5BJA0 NextBio:700535
            Genevestigator:Q5BJA0 Uniprot:Q5BJA0
        Length = 334

 Score = 552 (199.4 bits), Expect = 2.4e-53, P = 2.4e-53
 Identities = 133/343 (38%), Positives = 182/343 (53%)

Query:    11 VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
             V  AIL LGV    S +  L D +++   + W  +Y + Y    E+E+R  +++EN++ I
Sbjct:     5 VFIAILCLGV---ASGAPIL-DPSLDAEWQEWKKKYDKSY-SLEEEELRRAVWEENLKMI 59

Query:    71 ASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
                N  N      + + INEF D T EEFR     +    P     E   +  R   +  
Sbjct:    60 KLHNGENGLGKNGFTMEINEFGDTTGEEFRKMMVEF----PVQTHREGKSIMKRAAGSIF 115

Query:   129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
             P  +DWRKKG VT V+ QG C  CWAFS   A+E      + KL  LS Q LVDC     
Sbjct:   116 PKFVDWRKKGYVTPVRRQGNCNACWAFSVTGAIEAQTIWQSGKLIPLSVQNLVDCSKPQG 175

Query:   189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
             + GC GG   +AF++++ N GL +EA YPY+  DG C     N S+A+I+G+  +P + E
Sbjct:   176 NNGCLGGDTYNAFQYVLHNGGLQSEATYPYEGKDGPCRYNPKN-SSAEITGFVSLPES-E 233

Query:   249 AALMKAVAN-QPVSVAIDASGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYG-TADD--G 302
               LM AVA   P+S  IDAS   F+FY  G++    C +  + HGV  VGYG   +D  G
Sbjct:   234 DILMVAVATIGPISAGIDASHESFKFYKKGIYHEPNCSSNSVTHGVLVVGYGFKGNDTGG 293

Query:   303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
               YWL+KNSWG  WG  GY+++ +D   K   C IA  A YPT
Sbjct:   294 DHYWLIKNSWGKQWGIRGYMKITKD---KNNHCAIASYAHYPT 333


>UNIPROTKB|Q3T0I2
            symbol:CTSH "Pro-cathepsin H" species:9913 "Bos taurus"
            [GO:0031638 "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISS] [GO:0010815 "bradykinin
            catabolic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0010813 "neuropeptide catabolic
            process" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0045766 "positive regulation of angiogenesis" evidence=ISS]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=ISS] [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0033619 "membrane protein proteolysis" evidence=ISS]
            [GO:0043066 "negative regulation of apoptotic process"
            evidence=ISS] [GO:0004252 "serine-type endopeptidase activity"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0016505 "apoptotic protease activator activity"
            evidence=ISS] [GO:0010952 "positive regulation of peptidase
            activity" evidence=ISS] [GO:0030108 "HLA-A specific activating MHC
            class I receptor activity" evidence=ISS] [GO:0002764 "immune
            response-regulating signaling pathway" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0070324 "thyroid
            hormone binding" evidence=ISS] [GO:0006508 "proteolysis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0097208
            "alveolar lamellar body" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004175
            "endopeptidase activity" evidence=ISS] [GO:0032526 "response to
            retinoic acid" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 EMBL:BC102386 IPI:IPI00693034
            RefSeq:NP_001029557.1 UniGene:Bt.52393 ProteinModelPortal:Q3T0I2
            SMR:Q3T0I2 STRING:Q3T0I2 MEROPS:C01.040 PRIDE:Q3T0I2
            Ensembl:ENSBTAT00000014593 GeneID:510524 KEGG:bta:510524 CTD:1512
            InParanoid:Q3T0I2 OMA:STSCHKT OrthoDB:EOG4W9J43 NextBio:20869490
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 Uniprot:Q3T0I2
        Length = 335

 Score = 549 (198.3 bits), Expect = 4.9e-53, P = 4.9e-53
 Identities = 125/340 (36%), Positives = 188/340 (55%)

Query:    15 ILVLGVW---APQSWSRTLNDATMNERH-EMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
             +L  G W   AP   +  L   ++ + H + WM Q+ + Y  + E   R + F  N+  I
Sbjct:     7 LLCAGAWLLGAPACGAAELAANSLEKFHFQSWMVQHQKKY-SSEEYYHRLQAFASNLREI 65

Query:    71 ASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPA 130
              + N  ARN  +K+G+N+F+D + +E +     Y    P  ++   T  ++       P 
Sbjct:    66 NAHN--ARNHTFKMGLNQFSDMSFDELKRK---YLWSEP--QNCSATKSNYLRGTGPYPP 118

Query:   131 SIDWRKKGA-VTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
             S+DWRKKG  VT VK+QG CG CW FS   A+E    I T KL  L+EQ+LVDC  +  +
Sbjct:   119 SMDWRKKGNFVTPVKNQGSCGSCWTFSTTGALESAVAIATGKLPFLAEQQLVDCAQNFNN 178

Query:   190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
              GC+GGL   AFE+I  NKG+  E  YPY+  DG C K + + + A +    ++  N+E 
Sbjct:   179 HGCQGGLPSQAFEYIRYNKGIMGEDTYPYRGQDGDC-KYQPSKAIAFVKDVANITLNDEE 237

Query:   250 ALMKAVA-NQPVSVAIDASGSDFQFYSSGVFTG-QCGT---ELDHGVTAVGYGTADDGTK 304
             A+++AVA + PVS A + + +DF  Y  G+++   C     +++H V AVGYG  + G  
Sbjct:   238 AMVEAVALHNPVSFAFEVT-ADFMMYRKGIYSSTSCHKTPDKVNHAVLAVGYGE-EKGIP 295

Query:   305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             YW+VKNSWG  WG  GY  ++R     + +CG+A  AS+P
Sbjct:   296 YWIVKNSWGPNWGMKGYFLIERG----KNMCGLAACASFP 331


>RGD|2447
            symbol:Ctsh "cathepsin H" species:10116 "Rattus norvegicus"
          [GO:0001520 "outer dense fiber" evidence=IDA] [GO:0001656
          "metanephros development" evidence=IEP] [GO:0001669 "acrosomal
          vesicle" evidence=IDA] [GO:0001913 "T cell mediated cytotoxicity"
          evidence=ISO;ISS] [GO:0002250 "adaptive immune response"
          evidence=ISO] [GO:0002764 "immune response-regulating signaling
          pathway" evidence=ISO;ISS] [GO:0004175 "endopeptidase activity"
          evidence=ISO] [GO:0004177 "aminopeptidase activity" evidence=ISO;IDA]
          [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISO;ISS]
          [GO:0004252 "serine-type endopeptidase activity" evidence=ISO;ISS]
          [GO:0005615 "extracellular space" evidence=ISO;ISS;IDA] [GO:0005764
          "lysosome" evidence=ISO;ISS;IDA] [GO:0005829 "cytosol"
          evidence=ISO;ISS] [GO:0006508 "proteolysis" evidence=IEP;ISO]
          [GO:0007283 "spermatogenesis" evidence=IEP] [GO:0008233 "peptidase
          activity" evidence=ISO] [GO:0008234 "cysteine-type peptidase
          activity" evidence=ISO] [GO:0008284 "positive regulation of cell
          proliferation" evidence=ISO;ISS] [GO:0010628 "positive regulation of
          gene expression" evidence=ISO;ISS] [GO:0010634 "positive regulation
          of epithelial cell migration" evidence=ISO;ISS] [GO:0010813
          "neuropeptide catabolic process" evidence=ISO;ISS] [GO:0010815
          "bradykinin catabolic process" evidence=ISO;ISS] [GO:0010952
          "positive regulation of peptidase activity" evidence=ISO;ISS]
          [GO:0016505 "apoptotic protease activator activity" evidence=ISO;ISS]
          [GO:0030108 "HLA-A specific activating MHC class I receptor activity"
          evidence=ISO;ISS] [GO:0030335 "positive regulation of cell migration"
          evidence=ISO;ISS] [GO:0030984 "kininogen binding" evidence=IPI]
          [GO:0031638 "zymogen activation" evidence=ISO;ISS] [GO:0031648
          "protein destabilization" evidence=ISO;ISS] [GO:0032403 "protein
          complex binding" evidence=IPI] [GO:0032526 "response to retinoic
          acid" evidence=ISO;ISS] [GO:0033619 "membrane protein proteolysis"
          evidence=ISO;ISS] [GO:0035085 "cilium axoneme" evidence=IDA]
          [GO:0043066 "negative regulation of apoptotic process"
          evidence=ISO;ISS] [GO:0043129 "surfactant homeostasis"
          evidence=ISO;ISS] [GO:0043621 "protein self-association"
          evidence=IDA] [GO:0045766 "positive regulation of angiogenesis"
          evidence=ISO;ISS] [GO:0060448 "dichotomous subdivision of terminal
          units involved in lung branching" evidence=ISO;ISS] [GO:0070324
          "thyroid hormone binding" evidence=ISO;ISS] [GO:0070371 "ERK1 and
          ERK2 cascade" evidence=ISO;ISS] [GO:0097067 "cellular response to
          thyroid hormone stimulus" evidence=ISO;IEP] [GO:0097208 "alveolar
          lamellar body" evidence=ISO;ISS;IDA] InterPro:IPR000668
          InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
          InterPro:IPR000169 RGD:2447 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
          GO:GO:0008284 GO:GO:0070371 GO:GO:0001669 eggNOG:COG4870
          HOGENOM:HOG000230774 InterPro:IPR025661 InterPro:IPR025660
          InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
          PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0007283
          GO:GO:0045766 GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
          GO:GO:0043621 GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 KO:K01366
          GO:GO:0016505 GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
          HOVERGEN:HBG011513 GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
          GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT OrthoDB:EOG4W9J43
          GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
          GO:GO:0010813 GO:GO:0043129 MEROPS:I29.003 EMBL:Y00708 EMBL:BC085352
          EMBL:M38135 IPI:IPI00212809 PIR:S00211 RefSeq:NP_037071.1
          UniGene:Rn.1997 ProteinModelPortal:P00786 SMR:P00786 STRING:P00786
          PRIDE:P00786 Ensembl:ENSRNOT00000019285 GeneID:25425 KEGG:rno:25425
          UCSC:RGD:2447 InParanoid:P00786 BindingDB:P00786 NextBio:606599
          Genevestigator:P00786 GermOnline:ENSRNOG00000014064 GO:GO:0035086
          GO:GO:0001520 Uniprot:P00786
        Length = 333

 Score = 547 (197.6 bits), Expect = 8.0e-53, P = 8.0e-53
 Identities = 124/339 (36%), Positives = 185/339 (54%)

Query:    15 ILVLGVW---APQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA 71
             +L  G W   A  +   T+N A        WM Q+ + Y  + E   R ++F  N   I 
Sbjct:     7 LLCAGAWLLSAGATAELTVN-AIEKFHFTSWMKQHQKTY-SSREYSHRLQVFANNWRKIQ 64

Query:    72 SFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPAS 131
             + N   RN  +K+G+N+F+D +  E +     +K      ++   T  ++       P+S
Sbjct:    65 AHNQ--RNHTFKMGLNQFSDMSFAEIK-----HKYLWSEPQNCSATKSNYLRGTGPYPSS 117

Query:   132 IDWRKKG-AVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQ 190
             +DWRKKG  V+ VK+QG CG CW FS   A+E    I + K+ +L+EQ+LVDC  +  + 
Sbjct:   118 MDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMMTLAEQQLVDCAQNFNNH 177

Query:   191 GCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAA 250
             GC+GGL   AFE+I+ NKG+  E  YPY   +G C K     + A +    ++  N+EAA
Sbjct:   178 GCQGGLPSQAFEYILYNKGIMGEDSYPYIGKNGQC-KFNPEKAVAFVKNVVNITLNDEAA 236

Query:   251 LMKAVA-NQPVSVAIDASGSDFQFYSSGVFTGQ-CGT---ELDHGVTAVGYGTADDGTKY 305
             +++AVA   PVS A + +  DF  Y SGV++   C     +++H V AVGYG   +G  Y
Sbjct:   237 MVEAVALYNPVSFAFEVT-EDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGE-QNGLLY 294

Query:   306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             W+VKNSWG+ WG NGY  ++R     + +CG+A  ASYP
Sbjct:   295 WIVKNSWGSNWGNNGYFLIERG----KNMCGLAACASYP 329


>MGI|MGI:107285
            symbol:Ctsh "cathepsin H" species:10090 "Mus musculus"
            [GO:0001520 "outer dense fiber" evidence=ISO] [GO:0001669
            "acrosomal vesicle" evidence=ISO] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=IGI] [GO:0002764 "immune response-regulating
            signaling pathway" evidence=ISO] [GO:0004175 "endopeptidase
            activity" evidence=ISO;IMP] [GO:0004177 "aminopeptidase activity"
            evidence=ISO] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISO;IDA] [GO:0004252 "serine-type endopeptidase activity"
            evidence=IMP] [GO:0005615 "extracellular space" evidence=ISO]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0005829 "cytosol"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=ISO;IMP]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0008284
            "positive regulation of cell proliferation" evidence=IMP]
            [GO:0010628 "positive regulation of gene expression" evidence=ISO]
            [GO:0010634 "positive regulation of epithelial cell migration"
            evidence=IMP] [GO:0010813 "neuropeptide catabolic process"
            evidence=ISO] [GO:0010815 "bradykinin catabolic process"
            evidence=ISO] [GO:0010952 "positive regulation of peptidase
            activity" evidence=IGI;ISO] [GO:0016505 "apoptotic protease
            activator activity" evidence=IGI;ISO] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0030108 "HLA-A specific activating MHC
            class I receptor activity" evidence=ISO] [GO:0030335 "positive
            regulation of cell migration" evidence=ISO] [GO:0030984 "kininogen
            binding" evidence=ISO] [GO:0031638 "zymogen activation"
            evidence=ISO;IMP] [GO:0031648 "protein destabilization"
            evidence=ISO;IMP] [GO:0032403 "protein complex binding"
            evidence=ISO] [GO:0032526 "response to retinoic acid" evidence=IDA]
            [GO:0033619 "membrane protein proteolysis" evidence=ISO;IMP]
            [GO:0035085 "cilium axoneme" evidence=ISO] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IMP] [GO:0043129
            "surfactant homeostasis" evidence=ISO] [GO:0043621 "protein
            self-association" evidence=ISO] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IMP] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=IMP]
            [GO:0070324 "thyroid hormone binding" evidence=ISO] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISO] [GO:0097208 "alveolar
            lamellar body" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:107285 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 EMBL:CH466560 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            HOVERGEN:HBG011513 GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT OrthoDB:EOG4W9J43
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 BRENDA:3.4.22.16
            EMBL:U06119 EMBL:AK149949 EMBL:AK150583 EMBL:AK157376 EMBL:AK160026
            EMBL:Y18464 IPI:IPI00118987 RefSeq:NP_031827.2 UniGene:Mm.2277
            ProteinModelPortal:P49935 SMR:P49935 STRING:P49935 MEROPS:I29.003
            PhosphoSite:P49935 PaxDb:P49935 PRIDE:P49935
            Ensembl:ENSMUST00000034915 GeneID:13036 KEGG:mmu:13036
            InParanoid:Q3UCD6 ChEMBL:CHEMBL1949491 NextBio:282920 Bgee:P49935
            CleanEx:MM_CTSH Genevestigator:P49935 GermOnline:ENSMUSG00000032359
            Uniprot:P49935
        Length = 333

 Score = 546 (197.3 bits), Expect = 1.0e-52, P = 1.0e-52
 Identities = 125/340 (36%), Positives = 190/340 (55%)

Query:    15 ILVLGVWAPQSWSRTLNDATMN--ER-H-EMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
             +L  G W   + +    + T+N  E+ H + WM Q+ + Y  + E   R ++F  N   I
Sbjct:     7 LLCAGAWLLSTGATA--ELTVNAIEKFHFKSWMKQHQKTY-SSVEYNHRLQMFANNWRKI 63

Query:    71 ASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPA 130
              + N   RN  +K+ +N+F+D +  E +     +K      ++   T  ++       P+
Sbjct:    64 QAHNQ--RNHTFKMALNQFSDMSFAEIK-----HKFLWSEPQNCSATKSNYLRGTGPYPS 116

Query:   131 SIDWRKKG-AVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
             S+DWRKKG  V+ VK+QG CG CW FS   A+E    I + K+ SL+EQ+LVDC  +  +
Sbjct:   117 SMDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQAFNN 176

Query:   190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
              GC+GGL   AFE+I+ NKG+  E  YPY   D SC +     + A +    ++  N+EA
Sbjct:   177 HGCKGGLPSQAFEYILYNKGIMEEDSYPYIGKDSSC-RFNPQKAVAFVKNVVNITLNDEA 235

Query:   250 ALMKAVA-NQPVSVAIDASGSDFQFYSSGVFTGQ-CGT---ELDHGVTAVGYGTADDGTK 304
             A+++AVA   PVS A + +  DF  Y SGV++ + C     +++H V AVGYG   +G  
Sbjct:   236 AMVEAVALYNPVSFAFEVT-EDFLMYKSGVYSSKSCHKTPDKVNHAVLAVGYGE-QNGLL 293

Query:   305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             YW+VKNSWG+ WGENGY  ++R     + +CG+A  ASYP
Sbjct:   294 YWIVKNSWGSQWGENGYFLIERG----KNMCGLAACASYP 329


>DICTYBASE|DDB_G0279187
            symbol:cprG "cysteine proteinase 7" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0279187 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000030 ProtClustDB:CLSZ2846820 MEROPS:C01.081
            EMBL:U72746 RefSeq:XP_641720.2 ProteinModelPortal:Q94504 SMR:Q94504
            PRIDE:Q94504 EnsemblProtists:DDB0215005 GeneID:8621915
            KEGG:ddi:DDB_G0279187 OMA:INTETEK Uniprot:Q94504
        Length = 460

 Score = 544 (196.6 bits), Expect = 1.7e-52, P = 1.7e-52
 Identities = 130/294 (44%), Positives = 173/294 (58%)

Query:    11 VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
             VL+A+ VL V    +  + L++         WM  + R Y  + E   R+ IFK N++Y+
Sbjct:     3 VLSALCVLLVSVATA-KQQLSEVEYRNAFTNWMIAHQRHY-SSEEFNGRYNIFKANMDYV 60

Query:    71 ASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPA 130
               +N K       LG+N FAD +NEE+RA   G      S+  +E+ D  F   +AS  A
Sbjct:    61 NEWNTKGSETV--LGLNVFADISNEEYRATYLGTPFDASSLEMTES-DKIF---DAS--A 112

Query:   131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRK--LTSLSEQELVDCDTSGE 188
              +DWR +GAVT +K+QGQCG CW+FS   A EG  ++   K  L SLSEQ L+DC  S  
Sbjct:   113 QVDWRTQGAVTPIKNQGQCGGCWSFSTTGATEGAQYLANGKKNLVSLSEQNLIDCSGSYG 172

Query:   189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPS--AAKISGYEDVPSN 246
             + GCEGGLM  AFE+II+NKG+ TE+ YPY A DG   K + NP   AA++S Y +V S 
Sbjct:   173 NNGCEGGLMTLAFEYIINNKGIDTESSYPYTAEDGK--KCKFNPKNVAAQLSSYVNVTSG 230

Query:   247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQ-CG-TELDHGVTAVGYGT 298
             +E+ L   V   P SVAIDAS   FQ Y SG++    C  T+LDHGV AVG+GT
Sbjct:   231 SESDLAAKVTQGPTSVAIDASNQSFQLYVSGIYNEPACSSTQLDHGVLAVGFGT 284

 Score = 143 (55.4 bits), Expect = 6.3e-07, P = 6.3e-07
 Identities = 40/87 (45%), Positives = 49/87 (56%)

Query:   261 SVAIDASGSDFQFYSSGVFTGQC-GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGEN 319
             SV+  ASGS     +SG  +G   G+  + GV    Y TA D   YW+VKNSWGT+WG +
Sbjct:   385 SVSGSASGS-----ASGSASGSSSGSNSNGGV----YPTAGD---YWIVKNSWGTSWGMD 432

Query:   320 GYIRMQRDIDAKEGLCGIAMQASYPTA 346
             GYI M +        CGIA  AS PTA
Sbjct:   433 GYILMTK---GNNNQCGIATMASRPTA 456


>UNIPROTKB|H9KYW5
            symbol:CTSS "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0002250 "adaptive immune response" evidence=IEA]
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 OMA:YEPACTQ EMBL:AADN02010496
            Ensembl:ENSGALT00000001122 Uniprot:H9KYW5
        Length = 245

 Score = 544 (196.6 bits), Expect = 1.7e-52, P = 1.7e-52
 Identities = 121/254 (47%), Positives = 157/254 (61%)

Query:    93 TNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCC 152
             T+E+  A   G   R+PS  +  +T   +R      P ++DWR+KG VT VK+QG CG C
Sbjct:     1 TSEDVAALLTGL--RVPSGHNQTST---YR-RRGGAPDAMDWREKGCVTEVKNQGACGAC 54

Query:   153 WAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLAT 212
             WAFSAV A+E    + T KL SLS Q LVDC     ++GC GG M  AF++II N G+ +
Sbjct:    55 WAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSMMYGNKGCGGGFMTRAFQYIIDNNGIDS 114

Query:   213 EAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDF 271
             E  YPY A +G+C +   +  AA  S Y ++P  +EAAL  AVAN  PVSVAIDA+   F
Sbjct:   115 EESYPYMAQNGTC-QYNVSTRAATCSKYVELPYADEAALKDAVANVGPVSVAIDATQPTF 173

Query:   272 QFYSSGVFTG-QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDA 330
               Y SGV+   +C  E++HGV  VGYGT ++   +WLVKNSWG  +G+ GYIRM R+  A
Sbjct:   174 FLYRSGVYDDPRCTQEVNHGVLVVGYGTLNE-KDFWLVKNSWGERFGDGGYIRMSRN-HA 231

Query:   331 KEGLCGIAMQASYP 344
                 CGIA  ASYP
Sbjct:   232 NH--CGIASYASYP 243


>UNIPROTKB|O46427
            symbol:CTSH "Pro-cathepsin H" species:9823 "Sus scrofa"
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0032526 "response to retinoic acid" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0043129
            "surfactant homeostasis" evidence=ISS] [GO:0010815 "bradykinin
            catabolic process" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0005829 "cytosol"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0030335 "positive regulation of cell
            migration" evidence=ISS] [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0016505 "apoptotic protease activator
            activity" evidence=ISS] [GO:0004252 "serine-type endopeptidase
            activity" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=ISS] [GO:0031638 "zymogen activation"
            evidence=ISS] [GO:0030108 "HLA-A specific activating MHC class I
            receptor activity" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0010628 "positive regulation of gene
            expression" evidence=ISS] [GO:0070324 "thyroid hormone binding"
            evidence=ISS] [GO:0006508 "proteolysis" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0060448
            "dichotomous subdivision of terminal units involved in lung
            branching" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] [GO:0005764 "lysosome" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0004177 "aminopeptidase
            activity" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=ISS] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 MEROPS:C01.040 CTD:1512 OMA:STSCHKT
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 EMBL:AF001169
            RefSeq:NP_999094.1 UniGene:Ssc.3593 PDB:1NB3 PDB:1NB5 PDB:8PCH
            PDBsum:1NB3 PDBsum:1NB5 PDBsum:8PCH ProteinModelPortal:O46427
            SMR:O46427 Ensembl:ENSSSCT00000001983 GeneID:396969 KEGG:ssc:396969
            EvolutionaryTrace:O46427 ArrayExpress:O46427 Uniprot:O46427
        Length = 335

 Score = 543 (196.2 bits), Expect = 2.1e-52, P = 2.1e-52
 Identities = 125/341 (36%), Positives = 187/341 (54%)

Query:    14 AILVLGVWA---PQSWSRTLNDATMNERH-EMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
             ++L  G W    P   +  L  ++  + H + WM Q+ + Y    E   R ++F  N   
Sbjct:     6 SLLCAGAWLLGPPACGASNLAVSSFEKLHFKSWMVQHQKKY-SLEEYHHRLQVFVSNWRK 64

Query:    70 IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVP 129
             I + N  A N  +KLG+N+F+D + +E R     +K      ++   T  ++       P
Sbjct:    65 INAHN--AGNHTFKLGLNQFSDMSFDEIR-----HKYLWSEPQNCSATKGNYLRGTGPYP 117

Query:   130 ASIDWRKKGA-VTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
              S+DWRKKG  V+ VK+QG CG CW FS   A+E    I T K+ SL+EQ+LVDC  +  
Sbjct:   118 PSMDWRKKGNFVSPVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNFN 177

Query:   189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
             + GC+GGL   AFE+I  NKG+  E  YPYK  D  C K + + + A +    ++  N+E
Sbjct:   178 NHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKGQDDHC-KFQPDKAIAFVKDVANITMNDE 236

Query:   249 AALMKAVA-NQPVSVAIDASGSDFQFYSSGVFTG-QCGT---ELDHGVTAVGYGTADDGT 303
              A+++AVA   PVS A + + +DF  Y  G+++   C     +++H V AVGYG  ++G 
Sbjct:   237 EAMVEAVALYNPVSFAFEVT-NDFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGE-ENGI 294

Query:   304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
              YW+VKNSWG  WG NGY  ++R     + +CG+A  ASYP
Sbjct:   295 PYWIVKNSWGPQWGMNGYFLIERG----KNMCGLAACASYP 331


>MGI|MGI:1861723
            symbol:Ctsr "cathepsin R" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=ISA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=ISA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0030163 "protein
            catabolic process" evidence=ISA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1861723 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0030163
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF245399
            EMBL:AY014778 EMBL:AK014432 EMBL:AK005429 IPI:IPI00120321
            RefSeq:NP_064680.1 UniGene:Mm.315715 ProteinModelPortal:Q9JIA9
            SMR:Q9JIA9 MEROPS:C01.042 PRIDE:Q9JIA9 Ensembl:ENSMUST00000021889
            GeneID:56835 KEGG:mmu:56835 CTD:56835 InParanoid:Q9JIA9 KO:K09601
            OMA:ASHESFK OrthoDB:EOG4ZCT6D NextBio:313379 Bgee:Q9JIA9
            CleanEx:MM_CTSR Genevestigator:Q9JIA9 GermOnline:ENSMUSG00000055679
            Uniprot:Q9JIA9
        Length = 334

 Score = 543 (196.2 bits), Expect = 2.1e-52, P = 2.1e-52
 Identities = 130/344 (37%), Positives = 181/344 (52%)

Query:    10 LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
             +V  A L LGV    S    L D++++   + W  +Y + Y    EK  R  +++E ++ 
Sbjct:     4 VVFIAFLYLGV---ASGVPVL-DSSLDAEWQDWKIKYNKSYSLKEEKLKRV-VWEEKLKM 58

Query:    70 IASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS 127
             I   N  N      + + +NEF DQT+EEFR         +      E   +  R   + 
Sbjct:    59 IKLHNRENSLGKNGFTMKMNEFGDQTDEEFRK----MMIEISVWTHREGKSIMKREAGSI 114

Query:   128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
             +P  +DWRKKG VT V+ QG C  CWAF+   A+E      T KLT LS Q LVDC    
Sbjct:   115 LPKFVDWRKKGYVTPVRRQGDCDACWAFAVTGAIEAQAIWQTGKLTPLSVQNLVDCSKPQ 174

Query:   188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
              + GC GG   +AF++++ N GL +EA YPY+  DG C     N S A+I+G+  +P + 
Sbjct:   175 GNNGCLGGDTYNAFQYVLHNGGLESEATYPYEGKDGPCRYNPKN-SKAEITGFVSLPQS- 232

Query:   248 EAALMKAVAN-QPVSVAIDASGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYG---TADD 301
             E  LM AVA   P++  IDAS   F+ Y  G++    C ++ + HGV  VGYG      D
Sbjct:   233 EDILMAAVATIGPITAGIDASHESFKNYKGGIYHEPNCSSDTVTHGVLVVGYGFKGIETD 292

Query:   302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
             G  YWL+KNSWG  WG  GY+++ +D   K   CGIA  A YPT
Sbjct:   293 GNHYWLIKNSWGKRWGIRGYMKLAKD---KNNHCGIASYAHYPT 333


>DICTYBASE|DDB_G0278721
            symbol:cprD "cysteine proteinase 4" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0278721 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000024 EMBL:L36204 RefSeq:XP_641963.1
            ProteinModelPortal:P54639 SMR:P54639 MEROPS:C01.A57 PRIDE:P54639
            EnsemblProtists:DDB0214999 GeneID:8621695 KEGG:ddi:DDB_G0278721
            OMA:NAFADIT ProtClustDB:CLSZ2846820 Uniprot:P54639
        Length = 442

 Score = 542 (195.9 bits), Expect = 2.7e-52, P = 2.7e-52
 Identities = 122/269 (45%), Positives = 164/269 (60%)

Query:    42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
             WM  + R Y  + E   R++IFK N++Y+  +N+K       LG+N FAD TN+E+R   
Sbjct:    33 WMQAHQRTY-SSEEFNARYQIFKSNMDYVHQWNSKGGETV--LGLNVFADITNQEYRTTY 89

Query:   102 NGYKRRLPSVRSSETTDVSFRYENASVPA-SIDWRKKGAVTGVKDQGQCGCCWAFSAVAA 160
              G      ++  +E   + F     S PA ++DWR +GAVT +K+QGQCG CW+FS   +
Sbjct:    90 LGTPFDGSALIGTEEEKI-F-----STPAPTVDWRAQGAVTPIKNQGQCGGCWSFSTTGS 143

Query:   161 MEGINHIT--TRK-LTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
              EG + I   T+K L SLSEQ L+DC  S  + GCEGGLM  AFE+II+NKG+ TE+ YP
Sbjct:   144 TEGAHFIASGTKKDLVSLSEQNLIDCSKSYGNNGCEGGLMTLAFEYIINNKGIDTESSYP 203

Query:   218 YKASDGS-CNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
             Y A DG  C  K +N   A+I  Y++V S +EA+L  A  N PVSVAIDAS   FQ Y S
Sbjct:   204 YTAEDGKECKFKTSN-IGAQIVSYQNVTSGSEASLQSASNNAPVSVAIDASNESFQLYES 262

Query:   277 GVF-TGQCG-TELDHGVTAVGYGTADDGT 303
             G++    C  T+LDHGV  VGYG+    +
Sbjct:   263 GIYYEPACSPTQLDHGVLVVGYGSGSSSS 291

 Score = 166 (63.5 bits), Expect = 1.5e-09, P = 1.5e-09
 Identities = 46/142 (32%), Positives = 68/142 (47%)

Query:   206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMK-AVANQPVSVAI 264
             S  G  + +    KAS  S  K  ++ S+ K S      S +++     + + Q      
Sbjct:   304 STGGKTSSSSSSGKASSSSSGKASSSSSSGKTSSAASSTSGSQSGSQSGSQSGQSTGSQS 363

Query:   265 DASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
               + +  Q  +SG  +G  G+    G +  G   A  G  YW+VKNSWGT+WG +GYI M
Sbjct:   364 GQTSASGQASASGSGSGS-GSGSGSG-SGSGAVEASSGN-YWIVKNSWGTSWGMDGYIFM 420

Query:   325 QRDIDAKEGLCGIAMQASYPTA 346
              +D   +   CGIA  AS+PTA
Sbjct:   421 SKD---RNNNCGIATMASFPTA 439


>DICTYBASE|DDB_G0291191
            symbol:DDB_G0291191 "cysteine protease" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0291191
            GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AAFI02000175 MEROPS:C01.022
            ProtClustDB:CLSZ2429603 RefSeq:XP_635374.1
            ProteinModelPortal:Q54F16 PRIDE:Q54F16 EnsemblProtists:DDB0252831
            GeneID:8628022 KEGG:ddi:DDB_G0291191 OMA:NETQIAS Uniprot:Q54F16
        Length = 352

 Score = 537 (194.1 bits), Expect = 9.2e-52, P = 9.2e-52
 Identities = 128/324 (39%), Positives = 176/324 (54%)

Query:    45 QYGRVYRDNAEKEM-RFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRAPR 101
             +Y ++Y  +AE+ + +F+ FK N+  I + N +A       K G+N+FAD + EEF+   
Sbjct:    33 KYNKIY--SAEEYLVKFETFKSNLLNIDALNKQATTIGSDTKFGVNKFADLSKEEFK--- 87

Query:   102 NGYKRRLPSVRSSETTDVSFRYENAS------VPASIDWRKKGA---------VTGVKDQ 146
                K  L S  +  T D+     N S       PA+ DWR  G          VT VK+Q
Sbjct:    88 ---KYYLSSKEARLTDDLPM-LPNLSDDIISATPAAFDWRNTGGSTKFPQGTPVTAVKNQ 143

Query:   147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD---TSGEDQ-----GCEGGLMD 198
             GQCG CW+FS    +EG ++++T  L  LSEQ LVDCD    + E++     GC+GGL  
Sbjct:   144 GQCGSCWSFSTTGNVEGQHYLSTGTLVGLSEQNLVDCDHTCMTYENENVCNAGCDGGLQP 203

Query:   199 DAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ 258
             +A+ +II N G+ TEA YPY A DG C    A   A KIS +  VP N          N 
Sbjct:   204 NAYNYIIKNGGIQTEATYPYTAVDGECKFNSAQVGA-KISSFTMVPQNETQIASYLFNNG 262

Query:   259 PVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADD----GTKYWLVKNSWGT 314
             P+++A DA   ++QFY  GVF   CG  LDHG+  VGYG  D      T YW++KNSWG 
Sbjct:   263 PLAIAADAE--EWQFYMGGVFDFPCGQTLDHGILIVGYGAQDTIVGKNTPYWIIKNSWGA 320

Query:   315 TWGENGYIRMQRDIDAKEGLCGIA 338
              WGE GY++++R+ D     CG+A
Sbjct:   321 DWGEAGYLKVERNTDK----CGVA 340


>DICTYBASE|DDB_G0290957
            symbol:cprA "cysteine proteinase 1" species:44689
            "Dictyostelium discoideum" [GO:0006972 "hyperosmotic response"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0290957
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GenomeReviews:CM000154_GR GO:GO:0005764
            GO:GO:0006972 EMBL:AAFI02000174 KO:K01376 EMBL:X02407 PIR:A22827
            RefSeq:XP_635417.1 ProteinModelPortal:P04988 MEROPS:C01.022
            GlycoSuiteDB:P04988 SWISS-2DPAGE:P04988 EnsemblProtists:DDB0201647
            GeneID:8627918 KEGG:ddi:DDB_G0290957 OMA:KISNFTM
            ProtClustDB:CLSZ2429603 Uniprot:P04988
        Length = 343

 Score = 536 (193.7 bits), Expect = 1.2e-51, P = 1.2e-51
 Identities = 121/298 (40%), Positives = 166/298 (55%)

Query:    59 RFKIFKENVEYIASFNNKARNKPY--KLGINEFADQTNEEFRAPRNGYKRRLPSVRSSET 116
             RF+IFK N+  I   N  A N     K G+N+FAD +++EF+   N Y     ++ + + 
Sbjct:    48 RFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFK---NYYLNNKEAIFTDDL 104

Query:   117 TDVSFRYENA--SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTS 174
                 +  +    S+P + DWR +GAVT VK+QGQCG CW+FS    +EG + I+  KL S
Sbjct:   105 PVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVS 164

Query:   175 LSEQELVDCDTS-----GE---DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGS-C 225
             LSEQ LVDCD       GE   D+GC GGL  +A+ +II N G+ TE+ YPY A  G+ C
Sbjct:   165 LSEQNLVDCDHECMEYEGEQACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQC 224

Query:   226 NKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCG- 284
             N   AN   AKIS +  +P N        V+  P+++A DA   ++QFY  GVF   C  
Sbjct:   225 NFNSAN-IGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAV--EWQFYIGGVFDIPCNP 281

Query:   285 TELDHGVTAVGYGTADD----GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
               LDHG+  VGY   +        YW+VKNSWG  WGE GYI ++R     +  CG++
Sbjct:   282 NSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG----KNTCGVS 335


>UNIPROTKB|G1SQF0
            symbol:CTSH "Uncharacterized protein" species:9986
            "Oryctolagus cuniculus" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 RefSeq:XP_002721635.1 UniGene:Ocu.7137
            Ensembl:ENSOCUT00000006138 GeneID:100101597 Uniprot:G1SQF0
        Length = 333

 Score = 536 (193.7 bits), Expect = 1.2e-51, P = 1.2e-51
 Identities = 123/341 (36%), Positives = 191/341 (56%)

Query:    14 AILVLGVW---APQSWSRTLNDATMNERH-EMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
             ++L  G W   AP + + + N+  + + H + WM+Q+ + Y    E   R + F  N   
Sbjct:     6 SLLCAGAWLLGAPGADAFSANN--LEKFHFKSWMSQHHKKYSAE-EYPRRLQTFVRNWRK 62

Query:    70 IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVP 129
             I + NN   N  +++G+N+F+D +  E +     +K      ++   T  ++       P
Sbjct:    63 INAHNNG--NHTFQMGLNQFSDMSFAEIK-----HKYLWTEPQNCSATKSNYLRGTGPYP 115

Query:   130 ASIDWRKKGA-VTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
             +S+DWRKKG  V+ VK+QG CG CW FS   A+E    I   K+ SL+EQ+LVDC  +  
Sbjct:   116 SSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAVAIAGGKMLSLAEQQLVDCAQNFN 175

Query:   189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
             + GCEGGL   AFE+I+ NKG+  E  YPY+A +G C K +   + A +    ++  N+E
Sbjct:   176 NHGCEGGLPSQAFEYILYNKGIMGEDSYPYRAMEGRC-KFQPQKAIAFVKDVANITLNDE 234

Query:   249 AALMKAVA-NQPVSVAIDASGSDFQFYSSGVFTG-QCGT---ELDHGVTAVGYGTADDGT 303
              A+++AVA   PVS A + +  DF  Y  G+++   C     +++H V AVGYG  ++G 
Sbjct:   235 EAMVEAVALYNPVSFAFEVT-EDFMQYRKGIYSSTSCHKTPDKVNHAVLAVGYGE-ENGV 292

Query:   304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
              YW+VKNSWG+ WG NGY  ++R     + +CG+A  ASYP
Sbjct:   293 PYWIVKNSWGSHWGMNGYFYIERG----KNMCGLAACASYP 329


>UNIPROTKB|F1NHB8
            symbol:F1NHB8 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 EMBL:AADN02044011
            IPI:IPI00586027 Ensembl:ENSGALT00000021873 OMA:SELDHAV
            Uniprot:F1NHB8
        Length = 329

 Score = 534 (193.0 bits), Expect = 1.9e-51, P = 1.9e-51
 Identities = 122/306 (39%), Positives = 169/306 (55%)

Query:    45 QYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGY 104
             ++G+ Y    E E R + F  N+ ++ S N  A +  Y L +N  AD+T +E  A R   
Sbjct:    32 RFGKRYSSEEEHEHRKRTFIHNMRFVHSKNRAALS--YSLALNHLADRTPQEMAALRG-- 87

Query:   105 KRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGI 164
             +RR    +S +   +   Y +  +P S+DWR  GAVT VKDQ  CG CW+F+   AMEG 
Sbjct:    88 RRRSGDPKSGQPFSMQL-YASLVLPESLDWRLYGAVTPVKDQAVCGSCWSFATTGAMEGA 146

Query:   165 NHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY-PYKASDG 223
               + T  LT LS+Q L+DC     +  C+GG    A+E+I  + G+A+   Y PY   +G
Sbjct:   147 LFLKTGVLTPLSQQVLIDCSWGFGNYACDGGEEWRAYEWIKKHGGIASTESYGPYLGQNG 206

Query:   224 SCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTG- 281
              C+  ++    A ++GY  V S N  AL  A+    PV+V IDAS   F FY++GV+   
Sbjct:   207 YCHYNQSE-LVAPLAGYVTVESGNAEALKAALFKHGPVAVNIDASHKSFTFYANGVYEEP 265

Query:   282 QCG---TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
              CG   +ELDH V AVGYG    G  YWL+KNSW T WG +GYI M      K+  CG+A
Sbjct:   266 HCGNETSELDHAVLAVGYGVLH-GKSYWLIKNSWSTYWGNDGYILMAM----KDNNCGVA 320

Query:   339 MQASYP 344
               AS+P
Sbjct:   321 TAASFP 326


>UNIPROTKB|Q4QRC2
            symbol:Ctsql2 "Protein Ctsql2" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 HOVERGEN:HBG011513 EMBL:CH474032
            RGD:1303225 EMBL:BC097257 IPI:IPI00421946 RefSeq:NP_001002813.2
            UniGene:Rn.128678 SMR:Q4QRC2 MEROPS:C01.111
            Ensembl:ENSRNOT00000038758 GeneID:408201 KEGG:rno:408201 CTD:408201
            InParanoid:Q4QRC2 OMA:NDEGALM NextBio:696394 Genevestigator:Q4QRC2
            Uniprot:Q4QRC2
        Length = 343

 Score = 532 (192.3 bits), Expect = 3.1e-51, P = 3.1e-51
 Identities = 128/344 (37%), Positives = 183/344 (53%)

Query:    15 ILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN 74
             IL LGV    S +   N  +++ + + W  +Y ++Y    E   R  +++ENV+ I   N
Sbjct:     9 ILCLGV---VSGASAFN-LSLDVQWQEWKMKYEKLYSPEEELLKRV-VWEENVKKIELHN 63

Query:    75 --NKARNKPYKLGINEFADQTNEEFR-------APRNGYKRRLPSVRSSETTDVSFRYEN 125
               N      Y + IN FAD T+EEF+        P N   + L           S+ + +
Sbjct:    64 RENSLGKNTYIMEINNFADLTDEEFKDMITGITLPINNTMKSLWKRALGSPFPNSWYWRD 123

Query:   126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
             A +P SIDWRK+G VT V++QG+C  CWAF    A+EG     T KLT LS Q LVDC  
Sbjct:   124 A-LPKSIDWRKEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCSK 182

Query:   186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
                ++GC GG   +AF++++ N GL +EA YPYK  +G C     N + AKI+ +  +P 
Sbjct:   183 PQGNKGCRGGTTYNAFQYVLQNGGLESEATYPYKGKEGLCKYNPKN-AYAKITRFVALPE 241

Query:   246 NNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVF-TGQCGTELDHGVTAVGYG---TAD 300
             + E  LM A+A + PV+  I    S  +FY  G++   +C   ++H V  VGYG      
Sbjct:   242 D-EDVLMDALATKGPVAAGIHVVYSSLRFYKKGIYHEPKCNNRVNHAVLVVGYGFEGNET 300

Query:   301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             DG  YWL+KNSWG  WG  GY+++ +D   +   CGIA  A YP
Sbjct:   301 DGNNYWLIKNSWGKQWGLKGYMKIAKD---RNNHCGIATFAQYP 341


>UNIPROTKB|F7B939
            symbol:CTSH "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458 CTD:1512
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 EMBL:ACFV01158341
            EMBL:ACFV01158342 EMBL:ACFV01158343 RefSeq:XP_002753411.1
            Ensembl:ENSCJAT00000004397 GeneID:100413104 Uniprot:F7B939
        Length = 336

 Score = 531 (192.0 bits), Expect = 4.0e-51, P = 4.0e-51
 Identities = 121/342 (35%), Positives = 184/342 (53%)

Query:    10 LVLAAILVLGVWAPQSWSRTLNDATMNERH-EMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
             L+ A + +LG  AP   +  L+  ++ + H + WMA++ + Y    E   R + F  N  
Sbjct:     7 LLCAGVCLLG--APARGAAELSVNSLEKFHFKSWMAKHHKTYSREEEYHQRLQTFASNWR 64

Query:    69 YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
              I + NN   N  +K+ +N+F+D +  E +     Y    P  ++   T  ++       
Sbjct:    65 KINAHNNG--NHTFKMAVNQFSDMSFAEIKRK---YLWSEP--QNCSATKSNYLRGTGPY 117

Query:   129 PASIDWRKKGA-VTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
             P S+DWRKKG  V+ VK+QG CG CW FS   A+E    I T K+ SL+EQ+LVDC    
Sbjct:   118 PPSVDWRKKGHFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDF 177

Query:   188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
              + GC+GGL   AFE+I+ N G+  E  YPY+  D  C K +   +   +    ++   +
Sbjct:   178 NNHGCQGGLPSQAFEYILYNNGIMGEDTYPYQGKDSDC-KFQPGKAIGFVKDVANITIYD 236

Query:   248 EAALMKAVA-NQPVSVAIDASGSDFQFYSSGVFTG-QCGT---ELDHGVTAVGYGTADDG 302
             E A+++AVA   PVS A + +  DF  Y  G+++   C     +++H V AVGYG  ++G
Sbjct:   237 EDAMVEAVALYNPVSFAFEVT-QDFMMYKRGIYSSTSCHKTPDKVNHAVLAVGYGE-ENG 294

Query:   303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
               YW+VKNSWG  WG NGY  ++R     + +CG+A  ASYP
Sbjct:   295 IPYWIVKNSWGPQWGMNGYFLIERG----KNMCGLAACASYP 332


>UNIPROTKB|G3SSC1
            symbol:CTSH "Uncharacterized protein" species:9785
            "Loxodonta africana" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 RefSeq:XP_003413898.1
            Ensembl:ENSLAFT00000003415 GeneID:100662496 Uniprot:G3SSC1
        Length = 335

 Score = 530 (191.6 bits), Expect = 5.1e-51, P = 5.1e-51
 Identities = 126/340 (37%), Positives = 183/340 (53%)

Query:    15 ILVLGVW--APQSWSRTLNDATMNER-H-EMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
             +L  G W   P++   T    +  E+ H + WMAQ+ + Y  + E   R + F  N   I
Sbjct:     7 LLCAGAWFLGPRTCDATALSVSSYEKFHFQSWMAQHQKKY-SSEEYHQRQQTFVSNWRKI 65

Query:    71 ASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPA 130
              + N  ARN  +K+ +N+F+D T  E +     Y    P  ++   T  ++       P 
Sbjct:    66 NAHN--ARNHTFKMALNQFSDMTFAEIKQK---YLWSEP--QNCSATKGNYLRGTGPYPP 118

Query:   131 SIDWRKKGA-VTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
              +DWRKKG  V+ VK+QG CG CW FS   A+E    I   KL SL+EQ+LVDC     +
Sbjct:   119 FVDWRKKGHFVSPVKNQGACGSCWTFSTTGALESAIAIAGGKLLSLAEQQLVDCAKDFNN 178

Query:   190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
              GC+GGL   AFE+I+ NKG+  E  YPYK  D  C K +   + A +    ++  N+E 
Sbjct:   179 HGCQGGLPSQAFEYILYNKGIMGEDTYPYKGQDDVC-KFQPKKAIAFVKDVANITLNDEE 237

Query:   250 ALMKAVA-NQPVSVAIDASGSDFQFYSSGVFTG-QCGT---ELDHGVTAVGYGTADDGTK 304
             A+++AVA   PVS A + +  DF  YS G+++   C     +++H V AVGYG  + G  
Sbjct:   238 AMVEAVALYNPVSFAFEVT-DDFMKYSKGIYSSTSCHKTPDKVNHAVLAVGYGE-EKGIP 295

Query:   305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             YW+VKNSWG  WG +GY  ++R     + +CG+A  ASYP
Sbjct:   296 YWIVKNSWGPYWGMDGYFLIERG----KNMCGLAACASYP 331


>MGI|MGI:1927229
            symbol:Ctsm "cathepsin M" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008152 "metabolic process" evidence=ISS] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015645 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1927229 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF202528
            EMBL:AY014777 EMBL:AY057446 EMBL:AK005550 EMBL:AK005428
            IPI:IPI00131133 RefSeq:NP_071721.2 UniGene:Mm.279933
            ProteinModelPortal:Q9JL96 SMR:Q9JL96 STRING:Q9JL96 MEROPS:C01.023
            PRIDE:Q9JL96 DNASU:64139 Ensembl:ENSMUST00000099451 GeneID:64139
            KEGG:mmu:64139 UCSC:uc007qwj.1 CTD:64139 InParanoid:Q9JL96
            KO:K09600 OrthoDB:EOG4TTGKR NextBio:319931 Bgee:Q9JL96
            CleanEx:MM_CTSM Genevestigator:Q9JL96 GermOnline:ENSMUSG00000074484
            GermOnline:ENSMUSG00000074871 PANTHER:PTHR12411:SF58 Uniprot:Q9JL96
        Length = 333

 Score = 530 (191.6 bits), Expect = 5.1e-51, P = 5.1e-51
 Identities = 131/342 (38%), Positives = 184/342 (53%)

Query:    11 VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
             +  A+L LG+  P        D  ++   + W  +YG+ Y    E + R  ++++N++ I
Sbjct:     5 IFLAMLCLGMALPSP----APDPILDVEWQKWKIKYGKAYSLEEEGQKR-AVWEDNMKKI 59

Query:    71 ASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
                N  N      + + +N F D T EEFR      +  +P+V+  ++  V  R  + ++
Sbjct:    60 KLHNGENGLGKHGFTMEMNAFGDMTLEEFRKVM--IEIPVPTVKKGKS--VQKRL-SVNL 114

Query:   129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
             P  I+W+K+G VT V+ QG+C  CWAFS   A+EG     T +L  LS Q LVDC     
Sbjct:   115 PKFINWKKRGYVTPVQTQGRCNSCWAFSVTGAIEGQMFRKTGQLIPLSVQNLVDCSRPQG 174

Query:   189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
             + GC  G    A  +++ N GL +EA YPY+  DGSC     N S A I+G+E VP N E
Sbjct:   175 NWGCYLGNTYLALHYVMENGGLESEATYPYEEKDGSCRYSPEN-STANITGFEFVPKN-E 232

Query:   249 AALMKAVAN-QPVSVAIDASGSDFQFYSSGVF-TGQCGT-ELDHGVTAVGYGTA---DDG 302
              ALM AVA+  P+SVAIDA  + F FY  G++    C +  + H +  VGYG      DG
Sbjct:   233 DALMNAVASIGPISVAIDARHASFLFYKRGIYYEPNCSSCVVTHSMLLVGYGFTGRESDG 292

Query:   303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
              KYWLVKNS GT WG  GY+++ RD   K   CGIA  A YP
Sbjct:   293 RKYWLVKNSMGTQWGNKGYMKISRD---KGNHCGIATYALYP 331


>UNIPROTKB|F6R7P5
            symbol:CTSH "Uncharacterized protein" species:9544 "Macaca
            mulatta" [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=ISS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0004252
            "serine-type endopeptidase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005829 "cytosol" evidence=ISS] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0010815 "bradykinin catabolic
            process" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0016505 "apoptotic protease
            activator activity" evidence=ISS] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=ISS] [GO:0030335
            "positive regulation of cell migration" evidence=ISS] [GO:0031638
            "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0032526 "response to retinoic
            acid" evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            GO:GO:0043066 GO:GO:0005615 GO:GO:0008284 GO:GO:0070371
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458 CTD:1512
            OMA:STSCHKT GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 RefSeq:XP_001108862.1
            UniGene:Mmu.3000 Ensembl:ENSMMUT00000014095 GeneID:711437
            KEGG:mcc:711437 NextBio:19969972 Uniprot:F6R7P5
        Length = 335

 Score = 529 (191.3 bits), Expect = 6.5e-51, P = 6.5e-51
 Identities = 121/344 (35%), Positives = 184/344 (53%)

Query:    11 VLAAILVLGVW---APQSWSRTLNDATMNERH-EMWMAQYGRVYRDNAEKEMRFKIFKEN 66
             V   +L  G W   AP   +  L+  ++ + H + WM+++ + Y    E   R + F  N
Sbjct:     3 VTLPLLCAGAWLLGAPVCGAAELSVNSLEKFHFKSWMSKHHKTY-STEEYHHRMQTFASN 61

Query:    67 VEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA 126
                I + NN   N  +K+ +N+F+D +  E +     +K      ++   T  ++     
Sbjct:    62 WRKINAHNNG--NHTFKMALNQFSDMSFAEIK-----HKYLWSEPQNCSATKSNYLRGTG 114

Query:   127 SVPASIDWRKKGA-VTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
               P S+DWRKKG  V+ VK+QG CG CW FS   A+E    I T K+ SL+EQ+LVDC  
Sbjct:   115 PYPPSMDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQ 174

Query:   186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
                + GC+GGL   AFE+I+ NKG+  E  YPY+  DG C K     +   +    ++  
Sbjct:   175 DFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGDC-KFRPGKAIGFVKDVANITI 233

Query:   246 NNEAALMKAVA-NQPVSVAIDASGSDFQFYSSGVFTG-QCGT---ELDHGVTAVGYGTAD 300
              +E A+++AVA   PVS A + +  DF  Y +G+++   C     +++H V AVGYG  +
Sbjct:   234 YDEEAMVEAVALYNPVSFAFEVT-QDFMIYKTGIYSSTSCHKTPDKVNHAVLAVGYGE-E 291

Query:   301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             +G  YW+VKNSWG  WG NGY  ++R     + +CG+A  ASYP
Sbjct:   292 NGIPYWIVKNSWGPQWGMNGYFLIERG----KNMCGLAACASYP 331


>MGI|MGI:1860262
            symbol:Cts7 "cathepsin 7" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005576
            "extracellular region" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005768 "endosome" evidence=IEA]
            [GO:0005794 "Golgi apparatus" evidence=IEA] [GO:0006508
            "proteolysis" evidence=ISA] [GO:0007049 "cell cycle" evidence=IEA]
            [GO:0007067 "mitosis" evidence=IEA] [GO:0008152 "metabolic process"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=ISA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0051301 "cell
            division" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:1860262 GO:GO:0005634 GO:GO:0005794 GO:GO:0048471
            GO:GO:0005615 GO:GO:0051301 GO:GO:0007067 GO:GO:0005768
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GO:GO:0008233 EMBL:CH466546
            EMBL:AY014779 EMBL:CT030645 EMBL:BC064740 EMBL:AF250837
            IPI:IPI00131132 RefSeq:NP_062412.1 UniGene:Mm.3692 HSSP:O60911
            ProteinModelPortal:Q91ZF2 SMR:Q91ZF2 STRING:Q91ZF2 MEROPS:C01.016
            PRIDE:Q91ZF2 Ensembl:ENSMUST00000021892 GeneID:56092 KEGG:mmu:56092
            UCSC:uc007qwi.1 CTD:56092 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 InParanoid:Q91ZF2 OMA:ERRVIWE OrthoDB:EOG44QT2S
            NextBio:311908 Bgee:Q91ZF2 Genevestigator:Q91ZF2 Uniprot:Q91ZF2
        Length = 331

 Score = 529 (191.3 bits), Expect = 6.5e-51, P = 6.5e-51
 Identities = 125/341 (36%), Positives = 176/341 (51%)

Query:    11 VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
             V  +IL LGV    + +    D  ++   E W     R Y    EK+ R  +++ NV++I
Sbjct:     5 VFLSILCLGV----ALAAPAPDYNLDAEWEEWKRSNDRTYSPEEEKQRR-AVWEGNVKWI 59

Query:    71 AS--FNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
                   N      + + +NEF D T EE +         L   R+ +      +  N  +
Sbjct:    60 KQHIMENGLWMNNFTIEMNEFGDMTGEEMKMLTESSSYPL---RNGK----HIQKRNPKI 112

Query:   129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
             P ++DWRK+G VT V+ QG CG CWAFS  A +EG     T KL  LS Q L+DC  S  
Sbjct:   113 PPTLDWRKEGYVTPVRRQGSCGACWAFSVTACIEGQLFKKTGKLIPLSVQNLMDCSVSYG 172

Query:   189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
              +GC+GG   DAF+++ +N GL  EA YPY+A    C  +    S  K++ +  VP N E
Sbjct:   173 TKGCDGGRPYDAFQYVKNNGGLEAEATYPYEAKAKHCRYRPER-SVVKVNRFFVVPRNEE 231

Query:   249 AALMKAVANQPVSVAIDASGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYGTA---DDGT 303
             A L   V + P++VAID S + F  Y  G++   +C  + LDHG+  VGYG      +  
Sbjct:   232 ALLQALVTHGPIAVAIDGSHASFHSYRGGIYHEPKCRKDTLDHGLLLVGYGYEGHESENR 291

Query:   304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             KYWL+KNS G  WGENGY+++ R    +   CGIA  A YP
Sbjct:   292 KYWLLKNSHGERWGENGYMKLPR---GQNNYCGIASYAMYP 329


>RGD|631421
            symbol:Ctsq "cathepsin Q" species:10116 "Rattus norvegicus"
            [GO:0005764 "lysosome" evidence=NAS] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 RGD:631421 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 UniGene:Rn.34875 EMBL:AF187323 IPI:IPI00214897
            PIR:JC7183 RefSeq:NP_640355.1 UniGene:Rn.35820
            ProteinModelPortal:Q9QZE3 SMR:Q9QZE3 STRING:Q9QZE3 MEROPS:C01.039
            PRIDE:Q9QZE3 Ensembl:ENSRNOT00000024208 GeneID:246147
            KEGG:rno:246147 UCSC:RGD:631421 CTD:104002 InParanoid:Q9QZE3
            OMA:ESEDVLM OrthoDB:EOG4HHP48 NextBio:623425 Genevestigator:Q9QZE3
            GermOnline:ENSRNOG00000017946 Uniprot:Q9QZE3
        Length = 343

 Score = 529 (191.3 bits), Expect = 6.5e-51, P = 6.5e-51
 Identities = 124/349 (35%), Positives = 187/349 (53%)

Query:    11 VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
             V   IL LGV  P +   +  D +++ + + W  +Y ++Y    E+ ++  +++ENV+ I
Sbjct:     5 VFLVILCLGV-VPGA---SALDLSLDVQWQEWKIKYEKLYSPE-EEVLKRVVWEENVKKI 59

Query:    71 ASFN--NKARNKPYKLGINEFADQTNEEFR-------APRNGYKRRLPSVRSSETTDVSF 121
                N  N      Y + IN+FAD T+EEF+        P +  ++RL           S+
Sbjct:    60 ELHNRENSLGKNTYTMEINDFADMTDEEFKDMIIGFQLPVHNTEKRLWKRALGSFFPNSW 119

Query:   122 RYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELV 181
              + +A +P  +DWR +G VT V+ QG C  CWAF    A+EG     T KL  LS Q L+
Sbjct:   120 NWRDA-LPKFVDWRNEGYVTRVRKQGGCSSCWAFPVTGAIEGQMFKKTGKLIPLSVQNLI 178

Query:   182 DCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYE 241
             DC     ++GC  G   +AF++++ N GL  EA YPY+  +G C     N S+AKI+G+ 
Sbjct:   179 DCSKPQGNRGCLWGNTYNAFQYVLHNGGLEAEATYPYERKEGVCRYNPKN-SSAKITGFV 237

Query:   242 DVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVF-TGQCGTELDHGVTAVGYG-- 297
              +P + E  LM AVA + P++  +    S F+FY  GV+   +C + ++H V  VGYG  
Sbjct:   238 VLPES-EDVLMDAVATKGPIATGVHVISSSFRFYQKGVYHEPKCSSYVNHAVLVVGYGFE 296

Query:   298 -TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
                 DG  YWL+KNSWG  WG  GY+++ +D   +   C IA  A YPT
Sbjct:   297 GNETDGNNYWLIKNSWGKRWGLRGYMKIAKD---RNNHCAIASLAQYPT 342


>RGD|1588248
            symbol:Cts8 "cathepsin 8" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1588248 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 IPI:IPI00765053
            RefSeq:NP_001121688.1 UniGene:Rn.220599 Ensembl:ENSRNOT00000061486
            GeneID:680718 KEGG:rno:680718 UCSC:RGD:1588248 CTD:56094
            OMA:DSEWQEW OrthoDB:EOG4JT07C NextBio:719350 Uniprot:D3ZP54
        Length = 333

 Score = 528 (190.9 bits), Expect = 8.3e-51, P = 8.3e-51
 Identities = 127/344 (36%), Positives = 186/344 (54%)

Query:    10 LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
             +VL AIL LGV    + +   +D +++   + W  +Y + Y    E + R  +++EN++ 
Sbjct:     4 VVLLAILCLGV----ARATQPSDPSLDSEWQEWKTKYEKNYSLEEEGQKR-AVWEENMKV 58

Query:    70 IASFNNK--ARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS-FRYENA 126
             +   N +     K + + +N FAD T EEFR         + ++R  ++     FRY   
Sbjct:    59 VKQHNIEYDQEKKNFTMELNAFADMTGEEFRKMMTNIP--VQNLRKKKSIHQPIFRY--- 113

Query:   127 SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
              +P  +DWR++G VT VK+QG C  CWAFS   A+EG     T +L SLS Q LVDC   
Sbjct:   114 -LPKFVDWRRRGYVTSVKNQGTCNSCWAFSVAGAIEGQMFRKTGRLVSLSPQNLVDCSRP 172

Query:   187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
               + GC  G    A +++ SN GL  E+ YPY+  +G C       SAA+++G+  V + 
Sbjct:   173 EGNHGCHMGSTLYALKYVWSNGGLEAESTYPYEGKEGPCRYLPRR-SAARVTGFSTV-AR 230

Query:   247 NEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYG---TAD 300
             +E ALM AVA   P+SV IDAS   F+FY  G++   +C +  ++H V  VGYG      
Sbjct:   231 SEEALMHAVATIGPISVGIDASHVSFRFYRRGIYYEPRCSSNRINHSVLVVGYGYEGRES 290

Query:   301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             DG KYWL+KNS G  WG NGY+++ R  +     CGIA    YP
Sbjct:   291 DGRKYWLIKNSHGVGWGMNGYMKLARGWNNH---CGIATYGFYP 331


>UNIPROTKB|F7BRD4
            symbol:CTSH "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001656 "metanephros development"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0001656
            GeneTree:ENSGT00660000095458 EMBL:ACFV01158341 EMBL:ACFV01158342
            EMBL:ACFV01158343 Ensembl:ENSCJAT00000004396 Uniprot:F7BRD4
        Length = 336

 Score = 525 (189.9 bits), Expect = 1.7e-50, P = 1.7e-50
 Identities = 120/342 (35%), Positives = 181/342 (52%)

Query:    10 LVLAAILVLGVWAPQSWSRTLNDATMNERH-EMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
             L+ A + +LG   P S  +      + + H + WMA++ + Y    E   R + F  N  
Sbjct:     7 LLCAGVCLLGT--PVSKKKKKKMLALEKFHFKSWMAKHHKTYSREEEYHQRLQTFASNWR 64

Query:    69 YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
              I + NN   N  +K+ +N+F+D +  E +     Y    P  ++   T  ++       
Sbjct:    65 KINAHNNG--NHTFKMAVNQFSDMSFAEIKRK---YLWSEP--QNCSATKSNYLRGTGPY 117

Query:   129 PASIDWRKKGA-VTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
             P S+DWRKKG  V+ VK+QG CG CW FS   A+E    I T K+ SL+EQ+LVDC    
Sbjct:   118 PPSVDWRKKGHFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDF 177

Query:   188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
              + GC+GGL   AFE+I+ N G+  E  YPY+  D  C K +   +   +    ++   +
Sbjct:   178 NNHGCQGGLPSQAFEYILYNNGIMGEDTYPYQGKDSDC-KFQPGKAIGFVKDVANITIYD 236

Query:   248 EAALMKAVA-NQPVSVAIDASGSDFQFYSSGVFTG-QCGT---ELDHGVTAVGYGTADDG 302
             E A+++AVA   PVS A + +  DF  Y  G+++   C     +++H V AVGYG  ++G
Sbjct:   237 EDAMVEAVALYNPVSFAFEVT-QDFMMYKRGIYSSTSCHKTPDKVNHAVLAVGYGE-ENG 294

Query:   303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
               YW+VKNSWG  WG NGY  ++R     + +CG+A  ASYP
Sbjct:   295 IPYWIVKNSWGPQWGMNGYFLIERG----KNMCGLAACASYP 332


>UNIPROTKB|G1M0X4
            symbol:CTSH "Uncharacterized protein" species:9646
            "Ailuropoda melanoleuca" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 EMBL:ACTA01057330 EMBL:ACTA01065330
            Ensembl:ENSAMET00000013529 Uniprot:G1M0X4
        Length = 337

 Score = 525 (189.9 bits), Expect = 1.7e-50, P = 1.7e-50
 Identities = 119/309 (38%), Positives = 172/309 (55%)

Query:    42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
             WM Q+ + Y  + E + R + F  N   I + N  A N  +K+G+N+F+D +  E +   
Sbjct:    40 WMVQHQKKY-SSEEYQHRLRTFVGNWRKINAHN--AGNHTFKMGLNQFSDMSFAEIKRK- 95

Query:   102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGA-VTGVKDQGQCGCCWAFSAVAA 160
               Y    P  ++   T  ++       P  +DWRKKG  V+ VK+QG CG CW FS   A
Sbjct:    96 --YLWSEP--QNCSATKGNYLRGTGPYPPFVDWRKKGKFVSPVKNQGGCGSCWTFSTTGA 151

Query:   161 MEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKA 220
             +E    I T KL SL+EQ+LVDC     + GC+GGL   AFE+I  N+G+  E  YPYK 
Sbjct:   152 LESAIAIKTGKLLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYIRYNRGIMGEDSYPYKG 211

Query:   221 SDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVF 279
              DG C K + + + A +    ++  N+E A+++AVA   PVS A + +G DF  Y  GV+
Sbjct:   212 QDGDC-KFQPSKAIAFVKDVANITINDEQAMVEAVALFNPVSFAFEVTG-DFMMYRKGVY 269

Query:   280 TG-QCGT---ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
             +   C     +++H V AVGYG   +G  YW+VKNSWG  WG +GY  ++R     + +C
Sbjct:   270 SSTSCHKTPDKVNHAVLAVGYGE-QNGVPYWIVKNSWGPQWGMHGYFLIERG----KNMC 324

Query:   336 GIAMQASYP 344
             G+A  ASYP
Sbjct:   325 GLAACASYP 333


>DICTYBASE|DDB_G0272742
            symbol:DDB_G0272742 species:44689 "Dictyostelium
            discoideum" [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0272742 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639 EMBL:AAFI02000008
            eggNOG:NOG331187 RefSeq:XP_644986.1 ProteinModelPortal:Q7KWP5
            PRIDE:Q7KWP5 EnsemblProtists:DDB0168242 GeneID:8618663
            KEGG:ddi:DDB_G0272742 InParanoid:Q7KWP5 OMA:ATESAHF Uniprot:Q7KWP5
        Length = 345

 Score = 522 (188.8 bits), Expect = 3.6e-50, P = 3.6e-50
 Identities = 134/349 (38%), Positives = 195/349 (55%)

Query:    14 AILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASF 73
             ++L+L ++   S+S+ L +         WM    R Y  ++E   R+  FK N+++I  +
Sbjct:     5 SLLILILFINCSFSK-LTEIQYRNEFTAWMTSNQRTYA-SSEFTNRYNTFKSNLDFINQW 62

Query:    74 NNKARNKPYKLGINEFADQTNEEFRAP--RNGYK-RRLPSVRSSETTDVSFRYENASVPA 130
             N+K  +K   L +NEFAD +NEE+R    RN     +L S+  ++  D   +  ++S   
Sbjct:    63 NSKG-SKTV-LALNEFADISNEEYRKNYLRNDNNINKLSSLLINDKEDKEIKSSSSSGSG 120

Query:   131 S--IDWRKKGAVTGVKDQ-GQCGCCWAFSAVAAMEGINHITTRK--LTSLSEQELVDCDT 185
             S  IDWRKKGAV  VK Q G CG  W  +AV A E  + +   K    SLS Q L+DC  
Sbjct:   121 SSGIDWRKKGAVPSVKSQIGGCGS-WPITAVGATESAHFLANPKDPFISLSMQNLIDC-- 177

Query:   186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASD-GSCNKKEANPSAAKISGYEDVP 244
             S  ++ C  G +++AF++II N G+ +E  Y +   + G C    +N S AKI+ YE V 
Sbjct:   178 SNLNKQCYQGTVNEAFQYIIENGGIDSEESYKFSGGEPGKCKYNSSN-SVAKITSYEKVK 236

Query:   245 SNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF-TGQCG-TELDHGVTAVGYG----T 298
             S +E++L  AV+ +PV+  IDAS S FQFYSSG++    C  T+L+H +  VG+     T
Sbjct:   237 SGSESSLESAVSLKPVAAYIDASLSSFQFYSSGIYYEPSCNSTDLNHSILIVGFSDFSTT 296

Query:   299 ADDGTK----YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASY 343
               D  K    YW+V+NS+G  WGENGYI M +D D     CGI+  ASY
Sbjct:   297 PTDSLKHSSNYWIVQNSFGKNWGENGYIFMSKDRDDN---CGISKMASY 342


>UNIPROTKB|G1RBY1
            symbol:CTSH "Uncharacterized protein" species:61853
            "Nomascus leucogenys" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 EMBL:ADFV01087552 RefSeq:XP_003275518.1
            Ensembl:ENSNLET00000011249 GeneID:100584322 Uniprot:G1RBY1
        Length = 335

 Score = 522 (188.8 bits), Expect = 3.6e-50, P = 3.6e-50
 Identities = 120/340 (35%), Positives = 182/340 (53%)

Query:    15 ILVLGVW---APQSWSRTLNDATMNERH-EMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
             +L  G W   AP   +  L+  ++ + H + WM+++ + Y    E   R ++F  N   I
Sbjct:     7 LLCAGAWLLGAPVCGAAELSVNSLEKFHFKSWMSKHHKTY-STEEYHHRLQMFASNWRKI 65

Query:    71 ASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPA 130
              + NN   N  +K+ +N+F+D +  E +     +K      ++   T  ++       P 
Sbjct:    66 NAHNNG--NHTFKMALNQFSDMSFAEIK-----HKYLWSEPQNCSATKSNYLRGTGPYPP 118

Query:   131 SIDWRKKGA-VTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
             S+DWRKKG  V+ VK+QG CG CW FS   A+E    I T K+ SL+EQ+LVDC     +
Sbjct:   119 SMDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNN 178

Query:   190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
              GC+GGL   AFE+I+ NKG+  E  YPY+  DG C K     +   +    ++   +E 
Sbjct:   179 HGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGYC-KFRPGKAIGFVKDVANITIYDEE 237

Query:   250 ALMKAVA-NQPVSVAIDASGSDFQFYSSGVFTG-QCGT---ELDHGVTAVGYGTADDGTK 304
             A+++AVA   PVS A + +  DF  Y  G+++   C     +++H V AVGYG  + G  
Sbjct:   238 AMVEAVALYNPVSFAFEVT-QDFMMYRRGIYSSTSCHKTPDKVNHAVLAVGYGEKN-GIP 295

Query:   305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             YW+VKNSWG  WG NGY  ++R     + +CG+A  ASYP
Sbjct:   296 YWIVKNSWGPQWGMNGYFLIERG----KNMCGLAACASYP 331


>UNIPROTKB|E9PSK9
            symbol:Ctsql2 "Protein Ctsql2" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            IPI:IPI00562656 Ensembl:ENSRNOT00000045847 RGD:1303225
            ArrayExpress:E9PSK9 Uniprot:E9PSK9
        Length = 342

 Score = 521 (188.5 bits), Expect = 4.6e-50, P = 4.6e-50
 Identities = 128/343 (37%), Positives = 180/343 (52%)

Query:    15 ILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN 74
             IL LGV    S +   N  +++ + + W  +Y ++Y    E   R  +++ENV+ I   N
Sbjct:     9 ILCLGV---VSGASAFN-LSLDVQWQEWKMKYEKLYSPEEELLKRV-VWEENVKKIELHN 63

Query:    75 --NKARNKPYKLGINEFADQTNEEFR-------APRNGYKRRLPSVRSSETTDVSFRYEN 125
               N      Y + IN FAD T+EEF+        P N   + L           S+ + +
Sbjct:    64 RENSLGKNTYIMEINNFADLTDEEFKDMITGITLPINNTMKSLWKRALGSPFPNSWYWRD 123

Query:   126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
             A +P SIDWRK+G VT V++QG+C  CWAF    A+EG     T KLT LS Q LVDC  
Sbjct:   124 A-LPKSIDWRKEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCSK 182

Query:   186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
                ++GC GG   +AF++++ N GL +EA YPYK  +G C     N + AKI+ +  +P 
Sbjct:   183 PQGNKGCRGGTTYNAFQYVLQNGGLESEATYPYKGKEGLCKYNPKN-AYAKITRFVALPE 241

Query:   246 NNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG---TADD 301
             + E  LM A+A + PV+  I    S F F S      +C   ++H V  VGYG      D
Sbjct:   242 D-EDVLMDALATKGPVAAGIHVVYSYFHFVSGIYHEPKCNNRVNHAVLVVGYGFEGNETD 300

Query:   302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             G  YWL+KNSWG  WG  GY+++ +D   +   CGIA  A YP
Sbjct:   301 GNNYWLIKNSWGKQWGLKGYMKIAKD---RNNHCGIATFAQYP 340


>UNIPROTKB|P09668
            symbol:CTSH "Pro-cathepsin H" species:9606 "Homo sapiens"
            [GO:0001520 "outer dense fiber" evidence=IEA] [GO:0001669
            "acrosomal vesicle" evidence=IEA] [GO:0007283 "spermatogenesis"
            evidence=IEA] [GO:0030984 "kininogen binding" evidence=IEA]
            [GO:0032403 "protein complex binding" evidence=IEA] [GO:0043621
            "protein self-association" evidence=IEA] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0031648 "protein destabilization"
            evidence=IMP] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=ISS] [GO:0032526 "response to retinoic acid"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0030108 "HLA-A
            specific activating MHC class I receptor activity" evidence=IDA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEP] [GO:0010813 "neuropeptide catabolic process"
            evidence=IDA] [GO:0010815 "bradykinin catabolic process"
            evidence=IDA] [GO:0030335 "positive regulation of cell migration"
            evidence=IDA] [GO:0070371 "ERK1 and ERK2 cascade" evidence=IDA]
            [GO:0010628 "positive regulation of gene expression" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IDA;TAS] [GO:0031638 "zymogen
            activation" evidence=IDA] [GO:0016505 "apoptotic protease activator
            activity" evidence=IDA] [GO:0010952 "positive regulation of
            peptidase activity" evidence=IDA] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=ISS] [GO:0004252 "serine-type endopeptidase
            activity" evidence=ISS] [GO:0043066 "negative regulation of
            apoptotic process" evidence=ISS] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0033619 "membrane protein proteolysis"
            evidence=IDA] [GO:0004175 "endopeptidase activity" evidence=IDA]
            [GO:0004177 "aminopeptidase activity" evidence=IDA] [GO:0005764
            "lysosome" evidence=IDA] [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0002250 "adaptive immune response" evidence=IEP]
            [GO:0019882 "antigen processing and presentation" evidence=TAS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0070324 "thyroid hormone binding" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0008284
            "positive regulation of cell proliferation" evidence=ISS]
            [GO:0045766 "positive regulation of angiogenesis" evidence=ISS]
            [GO:0008233 "peptidase activity" evidence=IDA] [GO:0097208
            "alveolar lamellar body" evidence=IDA] [GO:0043129 "surfactant
            homeostasis" evidence=IDA] [GO:0004197 "cysteine-type endopeptidase
            activity" evidence=IDA;TAS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 Reactome:REACT_6900 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913 MEROPS:C01.040 CTD:1512
            OMA:STSCHKT OrthoDB:EOG4W9J43 GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 EMBL:X16832 EMBL:AF426247 EMBL:AK314698 EMBL:AC011944
            EMBL:BC002479 EMBL:X07549 IPI:IPI00297487 PIR:S12486
            RefSeq:NP_004381.2 UniGene:Hs.148641 PDB:1BZN PDBsum:1BZN
            ProteinModelPortal:P09668 SMR:P09668 IntAct:P09668 STRING:P09668
            PhosphoSite:P09668 DMDM:288558851 PaxDb:P09668 PRIDE:P09668
            DNASU:1512 Ensembl:ENST00000220166 GeneID:1512 KEGG:hsa:1512
            UCSC:uc021srk.1 GeneCards:GC15M079213 H-InvDB:HIX0012481
            HGNC:HGNC:2535 HPA:CAB000458 HPA:HPA003524 MIM:116820
            neXtProt:NX_P09668 PharmGKB:PA27033 InParanoid:P09668
            PhylomeDB:P09668 BRENDA:3.4.22.16 ChEMBL:CHEMBL2225 GenomeRNAi:1512
            NextBio:6261 ArrayExpress:P09668 Bgee:P09668 CleanEx:HS_CTSH
            Genevestigator:P09668 GermOnline:ENSG00000103811 GO:GO:0019882
            Uniprot:P09668
        Length = 335

 Score = 519 (187.8 bits), Expect = 7.4e-50, P = 7.4e-50
 Identities = 121/342 (35%), Positives = 184/342 (53%)

Query:    10 LVLAAILVLGVWAPQSWSRTLNDATMNERH-EMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
             L+ A   +LGV  P   +  L   ++ + H + WM+++ + Y    E   R + F  N  
Sbjct:     7 LLCAGAWLLGV--PVCGAAELCVNSLEKFHFKSWMSKHRKTY-STEEYHHRLQTFASNWR 63

Query:    69 YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
              I + NN   N  +K+ +N+F+D +  E +     +K      ++   T  ++       
Sbjct:    64 KINAHNNG--NHTFKMALNQFSDMSFAEIK-----HKYLWSEPQNCSATKSNYLRGTGPY 116

Query:   129 PASIDWRKKGA-VTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
             P S+DWRKKG  V+ VK+QG CG CW FS   A+E    I T K+ SL+EQ+LVDC    
Sbjct:   117 PPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDF 176

Query:   188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
              + GC+GGL   AFE+I+ NKG+  E  YPY+  DG C K +   +   +    ++   +
Sbjct:   177 NNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGYC-KFQPGKAIGFVKDVANITIYD 235

Query:   248 EAALMKAVA-NQPVSVAIDASGSDFQFYSSGVFTG-QCGT---ELDHGVTAVGYGTADDG 302
             E A+++AVA   PVS A + +  DF  Y +G+++   C     +++H V AVGYG  + G
Sbjct:   236 EEAMVEAVALYNPVSFAFEVT-QDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKN-G 293

Query:   303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
               YW+VKNSWG  WG NGY  ++R     + +CG+A  ASYP
Sbjct:   294 IPYWIVKNSWGPQWGMNGYFLIERG----KNMCGLAACASYP 331


>TAIR|locus:2120222 [details] [associations]
            symbol:RD19 "RESPONSIVE TO DEHYDRATION 19" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009269 "response to desiccation" evidence=IEP] [GO:0006970
            "response to osmotic stress" evidence=IGI] [GO:0005634 "nucleus"
            evidence=IDA] [GO:0005773 "vacuole" evidence=IDA] [GO:0042742
            "defense response to bacterium" evidence=IMP] [GO:0006096
            "glycolysis" evidence=RCA] [GO:0006833 "water transport"
            evidence=RCA] [GO:0006972 "hyperosmotic response" evidence=RCA]
            [GO:0007030 "Golgi organization" evidence=RCA] [GO:0009266
            "response to temperature stimulus" evidence=RCA] [GO:0009651
            "response to salt stress" evidence=IEP;RCA] [GO:0046686 "response
            to cadmium ion" evidence=RCA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISS] [GO:0009414 "response to
            water deprivation" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005634 GO:GO:0005773 EMBL:CP002687
            GenomeReviews:CT486007_GR GO:GO:0009651 GO:GO:0042742
            eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            ProtClustDB:CLSN2688311 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AL035679 EMBL:AL161594 GO:GO:0004197
            MEROPS:C01.022 EMBL:D13042 EMBL:AY080598 EMBL:AY133844
            IPI:IPI00544363 PIR:JN0718 RefSeq:NP_568052.1 UniGene:At.2850
            UniGene:At.74924 ProteinModelPortal:P43296 SMR:P43296 STRING:P43296
            PaxDb:P43296 PRIDE:P43296 EnsemblPlants:AT4G39090.1 GeneID:830064
            KEGG:ath:AT4G39090 TAIR:At4g39090 InParanoid:P43296 OMA:EDFDWRD
            PhylomeDB:P43296 Genevestigator:P43296 GermOnline:AT4G39090
            Uniprot:P43296
        Length = 368

 Score = 519 (187.8 bits), Expect = 7.4e-50, P = 7.4e-50
 Identities = 116/323 (35%), Positives = 162/323 (50%)

Query:    34 TMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQT 93
             T  +   ++  ++G+VY  N E + RF +FK N+           +  +  G+ +F+D T
Sbjct:    46 TSEDHFSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATH--GVTQFSDLT 103

Query:    94 NEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCW 153
               EFR    G +      + +    +    EN  +P   DWR  GAVT VK+QG CG CW
Sbjct:   104 RSEFRKKHLGVRSGFKLPKDANKAPI-LPTEN--LPEDFDWRDHGAVTPVKNQGSCGSCW 160

Query:   154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE-------DQGCEGGLMDDAFEFIIS 206
             +FSA  A+EG N + T KL SLSEQ+LVDCD   +       D GC GGLM+ AFE+ + 
Sbjct:   161 SFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLK 220

Query:   207 NKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDA 266
               GL  E  YPY   DG   K + +   A +S +  +  + E      V N P++VAI+A
Sbjct:   221 TGGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINA 280

Query:   267 SGSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGTADDGTK------YWLVKNSWGTTWGEN 319
                  Q Y  GV     C   L+HGV  VGYG A           YW++KNSWG TWGEN
Sbjct:   281 GY--MQTYIGGVSCPYICTRRLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGETWGEN 338

Query:   320 GYIRMQRDIDAKEGLCGIAMQAS 342
             G+ ++ +       +CG+    S
Sbjct:   339 GFYKICKG----RNICGVDSMVS 357


>FB|FBgn0250848 [details] [associations]
            symbol:26-29-p "26-29kD-proteinase" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005811
            "lipid particle" evidence=IDA] [GO:0005875 "microtubule associated
            complex" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005875 EMBL:AE014296 GO:GO:0005811 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 MEROPS:I29.003 HSSP:O65039
            EMBL:AY122222 EMBL:AB011376 RefSeq:NP_620470.1 UniGene:Dm.3049
            SMR:Q9V3U6 MINT:MINT-890485 STRING:Q9V3U6
            EnsemblMetazoa:FBtr0075766 GeneID:39547 KEGG:dme:Dmel_CG8947
            UCSC:CG8947-RA CTD:39547 FlyBase:FBgn0250848 InParanoid:Q9V3U6
            OMA:IHSKNRA OrthoDB:EOG4BVQ8T GenomeRNAi:39547 NextBio:814210
            Uniprot:Q9V3U6
        Length = 549

 Score = 518 (187.4 bits), Expect = 9.5e-50, P = 9.5e-50
 Identities = 122/320 (38%), Positives = 174/320 (54%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
             D  +++    +  ++G  Y  + E E R  IF++N+ YI S  N+A+   Y L +N  AD
Sbjct:   238 DEHVDKAFHHFKRKHGVAYHSDTEHEHRKNIFRQNLRYIHS-KNRAK-LTYTLAVNHLAD 295

Query:    92 QTNEEFRAPRNGYKRR-LPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
             +T EE +A R GYK   + +       DV  +Y++  +P   DWR  GAVT VKDQ  CG
Sbjct:   296 KTEEELKA-RRGYKSSGIYNTGKPFPYDVP-KYKD-EIPDQYDWRLYGAVTPVKDQSVCG 352

Query:   151 CCWAFSAVAAMEGINHITTR-KLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
              CW+F  +  +EG   +     L  LS+Q L+DC  +  + GC+GG     +++++ + G
Sbjct:   353 SCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDCSWAYGNNGCDGGEDFRVYQWMLQSGG 412

Query:   210 LATEAKY-PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDAS 267
             + TE +Y PY   DG C+        A I G+ +V SN+  A   A+    P+SVAIDAS
Sbjct:   413 VPTEEEYGPYLGQDGYCHVNNVT-LVAPIKGFVNVTSNDPNAFKLALLKHGPLSVAIDAS 471

Query:   268 GSDFQFYSSGVF-TGQCGTE---LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
                F FYS GV+    C  +   LDH V AVGYG+ + G  YWLVKNSW T WG +GYI 
Sbjct:   472 PKTFSFYSHGVYYEPTCKNDVDGLDHAVLAVGYGSIN-GEDYWLVKNSWSTYWGNDGYIL 530

Query:   324 MQRDIDAKEGLCGIAMQASY 343
             M     AK+  CG+    +Y
Sbjct:   531 MS----AKKNNCGVMTMPTY 546


>UNIPROTKB|F7BJD8 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9796 "Equus
            caballus" [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=ISS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0004252
            "serine-type endopeptidase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005829 "cytosol" evidence=ISS] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0010815 "bradykinin catabolic
            process" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0016505 "apoptotic protease
            activator activity" evidence=ISS] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=ISS] [GO:0030335
            "positive regulation of cell migration" evidence=ISS] [GO:0031638
            "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0032526 "response to retinoic
            acid" evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            GO:GO:0043066 GO:GO:0005615 GO:GO:0008284 GO:GO:0070371
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458
            OMA:STSCHKT GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129
            Ensembl:ENSECAT00000013967 Uniprot:F7BJD8
        Length = 305

 Score = 517 (187.1 bits), Expect = 1.2e-49, P = 1.2e-49
 Identities = 116/309 (37%), Positives = 170/309 (55%)

Query:    42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
             WM Q+ + Y  + E   R + F  N   I + N    N  +++G+N+F+     E +   
Sbjct:     8 WMVQHQKKY-SSEEYHHRLQTFVSNWRKINAHNTG--NHTFRMGLNQFSAMNFAELK--- 61

Query:   102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGA-VTGVKDQGQCGCCWAFSAVAA 160
               +K      ++   T  ++       P S+DWRKKG  V+ VK+QG CG CW FS   A
Sbjct:    62 --HKYLWSEPQNCSATKGNYLRGAGPYPPSVDWRKKGNFVSPVKNQGGCGSCWTFSTTGA 119

Query:   161 MEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKA 220
             +E    I + KL SL+EQ+LVDC  +  + GC+GGL   AFE+I  NKG+  E  YPYK 
Sbjct:   120 LESAVAIASGKLLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKG 179

Query:   221 SDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVA-NQPVSVAIDASGSDFQFYSSGVF 279
              DG C K + N + A +    ++  N+E A+++AVA   PVS A + +  DF  Y  G++
Sbjct:   180 QDGDC-KFQPNKAIAFVKDVANITLNDEKAMVEAVALYNPVSFAFEVT-EDFMMYRKGIY 237

Query:   280 TG-QCGT---ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
             +   C     +++H V AVGYG  ++G  YW+VKNSWG  WG NGY  ++R     + +C
Sbjct:   238 SSTSCHKTPDKVNHAVLAVGYGE-ENGIPYWIVKNSWGPHWGMNGYFLIERG----KNMC 292

Query:   336 GIAMQASYP 344
             G+A  ASYP
Sbjct:   293 GLAACASYP 301


>UNIPROTKB|F6X9C1 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00660000095458
            OMA:STSCHKT Ensembl:ENSCAFT00000036196 EMBL:AAEX03002388
            Uniprot:F6X9C1
        Length = 305

 Score = 516 (186.7 bits), Expect = 1.5e-49, P = 1.5e-49
 Identities = 117/309 (37%), Positives = 169/309 (54%)

Query:    42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
             W  Q+ + Y  + E   R + F  N   I + N  A N  +K+G+N+F+D    E +   
Sbjct:     8 WAVQHQKKY-SSEEYLQRLQTFVGNWRKINAHN--AGNHTFKMGLNQFSDMNFAEIK--- 61

Query:   102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGA-VTGVKDQGQCGCCWAFSAVAA 160
               +K      ++   T  ++       P  +DWRKKG  V+ VK+QG CG CW FS   A
Sbjct:    62 --HKYLWSEPQNCSATKGNYLRGTGPYPPFVDWRKKGKFVSPVKNQGSCGSCWTFSTTGA 119

Query:   161 MEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKA 220
             +E    I + KL SL+EQ+LVDC  +  + GC+GG    AFE+I  NKG+  E  YPYK 
Sbjct:   120 LESAIAIKSGKLLSLAEQQLVDCAQNFNNHGCQGGAPLQAFEYIRYNKGIMGEDSYPYKG 179

Query:   221 SDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVA-NQPVSVAIDASGSDFQFYSSGVF 279
              DG C K + + + A +    ++  N+E A+++AVA   PVS A + + SDF  Y  G++
Sbjct:   180 QDGDC-KYQPSKAIAFVKDVANITINDEQAMVEAVALYNPVSFAFEVT-SDFMMYRKGIY 237

Query:   280 TG-QCGT---ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
             +   C     +++H V AVGYG   +G  YW+VKNSWG  WG NGY  M+R     + +C
Sbjct:   238 SSTSCHKTPDKVNHAVLAVGYGE-QNGIPYWIVKNSWGPQWGMNGYFLMERG----KNMC 292

Query:   336 GIAMQASYP 344
             G+A  ASYP
Sbjct:   293 GLAACASYP 301


>UNIPROTKB|G3R9A7 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9595 "Gorilla
            gorilla gorilla" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 OMA:STSCHKT GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 RefSeq:XP_004056662.1 Ensembl:ENSGGOT00000012331
            GeneID:101144312 Uniprot:G3R9A7
        Length = 335

 Score = 516 (186.7 bits), Expect = 1.5e-49, P = 1.5e-49
 Identities = 120/342 (35%), Positives = 184/342 (53%)

Query:    10 LVLAAILVLGVWAPQSWSRTLNDATMNERH-EMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
             L+ A   +LGV  P   +  L+  ++ + +   WM+++ + Y    E   R + F  N  
Sbjct:     7 LLCAGAWLLGV--PVCGAAELSVNSLEKFYFRSWMSKHRKTY-STEEYHHRLQTFASNWR 63

Query:    69 YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
              I + NN   N  +K+ +N+F+D +  E +     +K      ++   T  ++       
Sbjct:    64 KINAHNNG--NHTFKMALNQFSDMSFAEIK-----HKYLWSEPQNCSATKSNYLRGTGPY 116

Query:   129 PASIDWRKKGA-VTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
             P S+DWRKKG  V+ VK+QG CG CW FS   A+E    I T K+ SL+EQ+LVDC    
Sbjct:   117 PPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDF 176

Query:   188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
              + GC+GGL   AFE+I+ NKG+  E  YPY+  DG C K +   +   +    ++   +
Sbjct:   177 NNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGYC-KFQPGKAIGFVKDVANITIYD 235

Query:   248 EAALMKAVA-NQPVSVAIDASGSDFQFYSSGVFTG-QCGT---ELDHGVTAVGYGTADDG 302
             E A+++AVA   PVS A + +  DF  Y +G+++   C     +++H V AVGYG  + G
Sbjct:   236 EEAMVEAVALYNPVSFAFEVT-QDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKN-G 293

Query:   303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
               YW+VKNSWG  WG NGY  ++R     + +CG+A  ASYP
Sbjct:   294 IPYWIVKNSWGPKWGMNGYFLIERG----KNMCGLAACASYP 331


>UNIPROTKB|E9PTT3 [details] [associations]
            symbol:Ctsr "Protein Ctsr" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            IPI:IPI00627092 Ensembl:ENSRNOT00000024115 RGD:631422
            Uniprot:E9PTT3
        Length = 334

 Score = 515 (186.3 bits), Expect = 2.0e-49, P = 2.0e-49
 Identities = 130/342 (38%), Positives = 185/342 (54%)

Query:    14 AILVLGVWAPQ-SWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIAS 72
             AIL LGV     ++  +L DA   E H+    +Y + Y    E   R  +++EN++ I  
Sbjct:     8 AILCLGVGXGALAFDPSL-DA---EWHDX-KTEYEKSYTMEEEGHRR-AVWEENMKMIKL 61

Query:    73 FN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV-P 129
              N  N      + + +NEF D T EEFR         +P +RS     +  + +  +V P
Sbjct:    62 HNRENSLGKNGFIMEMNEFGDLTAEEFRK----MMVNIP-IRSHRKGKIIRKRDVGNVLP 116

Query:   130 ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
               +DWRKKG VT V++Q  C  CWAF+   A+EG     T +LT LS Q LVDC  S  +
Sbjct:   117 KFVDWRKKGYVTRVQNQKFCNSCWAFAVTGAIEGQMFNKTGQLTPLSVQNLVDCTKSQGN 176

Query:   190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
             +GC+ G    A+E++++N GL  EA YPYK  +G C     + S A+I+G+  +P + E 
Sbjct:   177 EGCQWGDPHIAYEYVLNNGGLEAEATYPYKGKEGVCRYNPKH-SKAEITGFVSLPES-ED 234

Query:   250 ALMKAVAN-QPVSVAIDASGSDFQFYSSGVFTG-QCGTE-LDHGVTAVGYG---TADDGT 303
              LM+AVA   P+SVA+DAS + F FY  G++    C    ++H V  VGYG      DG 
Sbjct:   235 ILMEAVATIGPISVAVDASFNSFGFYKKGLYDEPNCSNNTVNHSVLVVGYGFEGNETDGN 294

Query:   304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
              YWL+KNSWG  WG  GY+++ +D   +   C IA  A YPT
Sbjct:   295 SYWLIKNSWGRKWGLRGYMKIPKD---QNNFCAIASYAHYPT 333


>TAIR|locus:2050145 [details] [associations]
            symbol:AT2G21430 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002685 GenomeReviews:CT485783_GR
            EMBL:AC006841 EMBL:X74359 IPI:IPI00519637 PIR:B84601
            RefSeq:NP_565512.1 UniGene:At.14069 ProteinModelPortal:P43295
            SMR:P43295 MEROPS:C01.A04 PRIDE:P43295 EnsemblPlants:AT2G21430.1
            GeneID:816682 KEGG:ath:AT2G21430 TAIR:At2g21430 eggNOG:COG4870
            HOGENOM:HOG000230774 InParanoid:P43295 KO:K01373 OMA:GSIEEHY
            PhylomeDB:P43295 ProtClustDB:CLSN2688311 Genevestigator:P43295
            GermOnline:AT2G21430 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 Uniprot:P43295
        Length = 361

 Score = 508 (183.9 bits), Expect = 1.1e-48, P = 1.1e-48
 Identities = 112/317 (35%), Positives = 166/317 (52%)

Query:    36 NERH-EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
             +E H  ++  ++G+VY    E   RF +FK N+  + +  ++  +   + G+ +F+D T 
Sbjct:    44 SEDHFTLFKKKFGKVYGSIEEHYYRFSVFKANL--LRAMRHQKMDPSARHGVTQFSDLTR 101

Query:    95 EEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWA 154
              EFR    G K      + +    +       ++P   DWR +GAVT VK+QG CG CW+
Sbjct:   102 SEFRRKHLGVKGGFKLPKDANQAPI---LPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWS 158

Query:   155 FSAVAAMEGINHITTRKLTSLSEQELVDCD------TSGE-DQGCEGGLMDDAFEFIISN 207
             FS   A+EG + + T KL SLSEQ+LVDCD        G  D GC GGLM+ AFE+ +  
Sbjct:   159 FSTTGALEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKT 218

Query:   208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDAS 267
              GL  E  YPY  +DG   K + +   A +S +  V  N +      + N P++VAI+A+
Sbjct:   219 GGLMREKDYPYTGTDGGSCKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAA 278

Query:   268 GSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGTADDGTK------YWLVKNSWGTTWGENG 320
                 Q Y  GV     C   L+HGV  VGYG+A           YW++KNSWG +WGENG
Sbjct:   279 Y--MQTYIGGVSCPYICSRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENG 336

Query:   321 YIRMQRDIDAKEGLCGI 337
             + ++ +       +CG+
Sbjct:   337 FYKICKG----RNICGV 349


>TAIR|locus:2130180 [details] [associations]
            symbol:AT4G16190 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0042744 "hydrogen peroxide catabolic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005773
            EMBL:CP002687 HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 EMBL:Z97340 EMBL:AL161543 UniGene:At.25555
            EMBL:AY039556 EMBL:AY129473 EMBL:AY136316 EMBL:BT000733
            EMBL:AK226366 IPI:IPI00543588 PIR:D71428 RefSeq:NP_567489.1
            HSSP:P25779 ProteinModelPortal:Q9SUL1 SMR:Q9SUL1 STRING:Q9SUL1
            MEROPS:C01.A06 PRIDE:Q9SUL1 EnsemblPlants:AT4G16190.1 GeneID:827311
            KEGG:ath:AT4G16190 TAIR:At4g16190 InParanoid:Q9SUL1 OMA:NACGINK
            PhylomeDB:Q9SUL1 ProtClustDB:CLSN2917559 Genevestigator:Q9SUL1
            Uniprot:Q9SUL1
        Length = 373

 Score = 508 (183.9 bits), Expect = 1.1e-48, P = 1.1e-48
 Identities = 113/328 (34%), Positives = 173/328 (52%)

Query:    31 NDATMNERHE--MWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINE 88
             ++  +N  H   ++ ++Y + Y    E + RF++FK N+    +  N+  +     G+ +
Sbjct:    45 DEQLLNAEHHFTLFKSKYEKTYATQVEHDHRFRVFKANLR--RARRNQLLDPSAVHGVTQ 102

Query:    89 FADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQ 148
             F+D T +EFR    G KRR    R    T  +     + +P   DWR++GAVT VK+QG 
Sbjct:   103 FSDLTPKEFRRKFLGLKRR--GFRLPTDTQTAPILPTSDLPTEFDWREQGAVTPVKNQGM 160

Query:   149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE-------DQGCEGGLMDDAF 201
             CG CW+FSA+ A+EG + + T++L SLSEQ+LVDCD   +       D GC GGLM++AF
Sbjct:   161 CGSCWSFSAIGALEGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAF 220

Query:   202 EFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVS 261
             E+ +   GL  E  YPY   D +  K + +   A +S +  V S+ +      V + P++
Sbjct:   221 EYALKAGGLMKEEDYPYTGRDHTACKFDKSKIVASVSNFSVVSSDEDQIAANLVQHGPLA 280

Query:   262 VAIDASGSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGTADDGT------KYWLVKNSWGT 314
             +AI+A     Q Y  GV     C    DHGV  VG+G++           YW++KNSWG 
Sbjct:   281 IAINAMW--MQTYIGGVSCPYVCSKSQDHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGA 338

Query:   315 TWGENGYIRMQRDIDAKEGLCGIAMQAS 342
              WGE+GY ++ R       +CG+    S
Sbjct:   339 MWGEHGYYKICR---GPHNMCGMDTMVS 363


>ZFIN|ZDB-GENE-050417-107 [details] [associations]
            symbol:zgc:110239 "zgc:110239" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-050417-107
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 MEROPS:I29.003 OrthoDB:EOG412M56 EMBL:BC092817
            IPI:IPI00503987 RefSeq:NP_001017633.1 UniGene:Dr.39081
            ProteinModelPortal:Q568K7 GeneID:550326 KEGG:dre:550326
            HOGENOM:HOG000007373 HOVERGEN:HBG105018 InParanoid:Q568K7
            NextBio:20879584 ArrayExpress:Q568K7 Uniprot:Q568K7
        Length = 546

 Score = 504 (182.5 bits), Expect = 2.9e-48, P = 2.9e-48
 Identities = 119/320 (37%), Positives = 177/320 (55%)

Query:    35 MNERHEM---WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
             ++  H M   +  ++ R Y +  E E R   F  N+ Y+ S N    +  + L +N  AD
Sbjct:   236 VSHAHRMFGHYKEKFNRQYDNEMEHEEREHNFVHNIRYVHSMNRAGLS--FSLSVNHLAD 293

Query:    92 QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGC 151
             ++ +E    R G +R     R ++      R  + + P S+DWR  GAVT VKDQ  CG 
Sbjct:   294 RSQKELSMMR-GCQRTHKVHRKAQPFPSEIR--SIATPNSVDWRLYGAVTPVKDQAVCGS 350

Query:   152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
             CW+F+    +EG   + T +LTSLS+Q LVDC     + GC+GG    AFE+I+ + G++
Sbjct:   351 CWSFATTGTLEGALFLKTGQLTSLSQQMLVDCTWGFGNNGCDGGEEWRAFEWIMKHGGIS 410

Query:   212 TEAKY-PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGS 269
             T   Y  Y   +G C+  +++   A+++GY +V S +  AL  A+    PV+V+IDA+  
Sbjct:   411 TAESYGAYMGMNGLCHYDKSS-MVAQLTGYTNVTSGDILALKAAIFKFGPVAVSIDAAHR 469

Query:   270 DFQFYSSGVF-TGQC--G-TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
              F FYS+GV+   +C  G  +LDH V AVGYG  ++ + YWLVKNSW + WG +GYI M 
Sbjct:   470 SFAFYSNGVYYEPECKNGINDLDHAVLAVGYGIMNNES-YWLVKNSWSSYWGNDGYILMS 528

Query:   326 RDIDAKEGLCGIAMQASYPT 345
                  K+  CG+A  A Y T
Sbjct:   529 M----KDNNCGVATDAIYAT 544


>UNIPROTKB|F1NT07 [details] [associations]
            symbol:LOC100857883 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 EMBL:AADN02044012
            EMBL:AADN02044013 EMBL:AADN02044014 IPI:IPI00577314
            Ensembl:ENSGALT00000000192 OMA:IYKHGPV Uniprot:F1NT07
        Length = 317

 Score = 503 (182.1 bits), Expect = 3.7e-48, P = 3.7e-48
 Identities = 120/306 (39%), Positives = 166/306 (54%)

Query:    47 GRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKR 106
             GR Y    E E R +IF  ++ ++ S N  A +  Y L +N  AD+T +E  A R    R
Sbjct:    20 GRPYGSAREMEHRQRIFAHHMRFVHSKNRAALS--YSLALNHLADRTPQEMAALRG---R 74

Query:   107 RLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINH 166
             R     +      +  Y    +P S+DWR  GAVT VKDQ  CG CW+F+   AMEG   
Sbjct:    75 RRSGDPNHGLPFPAEHYTGIILPESLDWRMYGAVTPVKDQAVCGSCWSFATTGAMEGALF 134

Query:   167 ITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA-TEA--KYPYKASDG 223
             + T  LT LS+Q L+DC     +  C+GG    A  +I  + G+A TE+   +P    +G
Sbjct:   135 LKTGVLTPLSQQVLIDCSWGKGNYACDGGEEWRAKGWIKKHGGIASTESPPSFPLVLQNG 194

Query:   224 SCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVF-TG 281
              C+  ++    AKI+GY +V S N  A+  A+    PV+V+IDAS   F FYS+G++   
Sbjct:   195 LCHYNQSE-MLAKITGYVNVTSGNITAVKTAIYKHGPVAVSIDASHKTFSFYSNGIYYEP 253

Query:   282 QCGT---ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
             +C     +LDH V AVGYG    G  YWL+KNSW T WG +GYI M      K+  CG+A
Sbjct:   254 KCANKPGQLDHAVLAVGYGVLQ-GETYWLIKNSWSTYWGNDGYILMAM----KDNNCGVA 308

Query:   339 MQASYP 344
              +A+YP
Sbjct:   309 TEATYP 314


>UNIPROTKB|G3V9F8 [details] [associations]
            symbol:Ctsm "RCG24133" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015645 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 EMBL:CH474032
            PANTHER:PTHR12411:SF58 Ensembl:ENSRNOT00000045830 RGD:631420
            Uniprot:G3V9F8
        Length = 333

 Score = 496 (179.7 bits), Expect = 2.0e-47, P = 2.0e-47
 Identities = 119/321 (37%), Positives = 171/321 (53%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN--NKARNKPYKLGINEF 89
             D  ++   + W  +Y + Y    E + R  +++EN++ I   N  N      + + +N F
Sbjct:    22 DPVLDAEWQKWKIKYEKTYSLEEEGQKR-AVWEENMKKIKLHNGENGLGKHGFTMEMNAF 80

Query:    90 ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
              D T EEFR  +   +  +P+V+   +     + +  +VP  I+WRK+G VT V+ QG+C
Sbjct:    81 GDMTIEEFR--KLMIEIPIPTVKKENSVQ---KRQAVNVPNFINWRKRGYVTPVRRQGRC 135

Query:   150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
               CWAFS   A+EG     T +L  LS Q LVDC     + GC  G    A +++  N G
Sbjct:   136 NVCWAFSVAGAIEGQMFQKTGQLIPLSVQNLVDCSRPQGNLGCYLGNTYLALQYVKENGG 195

Query:   210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASG 268
             L +EA YPY+  +GSC     N S A I+ +E VP N E ALM AVA   P+SVAIDA  
Sbjct:   196 LESEATYPYEEKEGSCRYHPDN-STASITDFEFVPKN-EDALMNAVATLGPISVAIDARH 253

Query:   269 SDFQFYSSGVF-TGQCGTEL-DHGVTAVGYGTA---DDGTKYWLVKNSWGTTWGENGYIR 323
               F FY +G++    C + +  H +  VGYG      DG KYW++KNS G  WG  GY++
Sbjct:   254 ESFLFYRNGIYHEPNCSSSVVTHAMLLVGYGFVGEESDGRKYWILKNSMGNKWGNRGYMK 313

Query:   324 MQRDIDAKEGLCGIAMQASYP 344
             + +D   +   CGIA  A YP
Sbjct:   314 IAKD---QGNHCGIATYALYP 331


>RGD|1309226
            symbol:Cts7 "cathepsin 7" species:10116 "Rattus norvegicus"
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005768 "endosome" evidence=IEA] [GO:0005794 "Golgi apparatus"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0007067
            "mitosis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IEA] [GO:0051301 "cell division" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 RGD:1309226 GO:GO:0005634
            GO:GO:0005794 GO:GO:0048471 GO:GO:0005615 GO:GO:0051301
            GO:GO:0007067 GO:GO:0005768 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 MEROPS:C01.016 CTD:56092
            GeneTree:ENSGT00560000076577 OrthoDB:EOG44QT2S EMBL:CH474032
            IPI:IPI00870531 RefSeq:NP_001099569.1 UniGene:Rn.218615
            Ensembl:ENSRNOT00000043686 GeneID:290970 KEGG:rno:290970
            UCSC:RGD:1309226 OMA:VESFNAN Uniprot:D3ZZ07
        Length = 331

 Score = 495 (179.3 bits), Expect = 2.6e-47, P = 2.6e-47
 Identities = 114/320 (35%), Positives = 164/320 (51%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA--SFNNKARNKPYKLGINEF 89
             D +++   E W     + Y    EK+ R  +++ENV+ I   +  N      + + +NEF
Sbjct:    22 DYSLDAEWEEWKRNNAKTYSPEEEKQRR-AVWEENVKMIKWHTMQNGLWMNNFTIEMNEF 80

Query:    90 ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
              D T EE R   +     L   R+ +      +  N  +P ++DWR  G V  V+ QG C
Sbjct:    81 GDMTGEEMRMMTDSSALTL---RNGK----HIQKRNVKIPKTLDWRDTGCVAPVRSQGGC 133

Query:   150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
             G CWAFS  A++E      T KL  LS Q L+DC  +  +  C GG    AF+++ +N G
Sbjct:   134 GACWAFSVAASIESQLFKKTGKLIPLSVQNLIDCTVTYGNNDCSGGKPYTAFQYVKNNGG 193

Query:   210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
             L  EA YPY+A    C  +    S  KI+ +  VP N EA +   V   P++VAID S +
Sbjct:   194 LEAEATYPYEAKLRHCRYRPER-SVVKIARFFVVPRNEEALMQALVTYGPIAVAIDGSHA 252

Query:   270 DFQFYSSGVF-TGQCGTE-LDHGVTAVGYGTA---DDGTKYWLVKNSWGTTWGENGYIRM 324
              F+ Y  G++   +C  + LDHG+  VGYG      +  KYWL+KNS G  WGE GY+++
Sbjct:   253 SFKRYRGGIYHEPKCRRDTLDHGLLLVGYGYEGHESENRKYWLLKNSHGEQWGERGYMKL 312

Query:   325 QRDIDAKEGLCGIAMQASYP 344
              RD   +   CGIA  A YP
Sbjct:   313 PRD---QNNYCGIASYAMYP 329


>UNIPROTKB|F1P3U9
            symbol:CTSH "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005829 "cytosol" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=IEA] [GO:0010628 "positive regulation of gene expression"
            evidence=IEA] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=IEA] [GO:0010813 "neuropeptide catabolic
            process" evidence=IEA] [GO:0010815 "bradykinin catabolic process"
            evidence=IEA] [GO:0016505 "apoptotic protease activator activity"
            evidence=IEA] [GO:0030108 "HLA-A specific activating MHC class I
            receptor activity" evidence=IEA] [GO:0031638 "zymogen activation"
            evidence=IEA] [GO:0031648 "protein destabilization" evidence=IEA]
            [GO:0032526 "response to retinoic acid" evidence=IEA] [GO:0033619
            "membrane protein proteolysis" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0043129
            "surfactant homeostasis" evidence=IEA] [GO:0045766 "positive
            regulation of angiogenesis" evidence=IEA] [GO:0060448 "dichotomous
            subdivision of terminal units involved in lung branching"
            evidence=IEA] [GO:0070324 "thyroid hormone binding" evidence=IEA]
            [GO:0070371 "ERK1 and ERK2 cascade" evidence=IEA] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA]
            [GO:0097208 "alveolar lamellar body" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066
            GO:GO:0005615 GO:GO:0008284 GO:GO:0070371 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0032526 GO:GO:0010628
            GO:GO:0070324 GO:GO:0016505 GO:GO:0010634 GO:GO:0004197
            GO:GO:0042599 GO:GO:0031648 GO:GO:0097067 GO:GO:0031638
            GO:GO:0001913 GeneTree:ENSGT00660000095458 OMA:STSCHKT
            GO:GO:0030108 GO:GO:0010815 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 EMBL:AADN02038832 EMBL:AADN02038831 IPI:IPI00594147
            Ensembl:ENSGALT00000013440 Uniprot:F1P3U9
        Length = 261

 Score = 491 (177.9 bits), Expect = 6.9e-47, P = 6.9e-47
 Identities = 108/269 (40%), Positives = 157/269 (58%)

Query:    82 YKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGA-V 140
             + + +N+F+D T  EF+     Y    P  ++   T  +F   +   P ++DWRKKG  V
Sbjct:     1 FLVALNQFSDMTFAEFKKL---YLWSEP--QNCSATRGNFLRSDGPCPEAVDWRKKGNFV 55

Query:   141 TGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDA 200
             T VK+QG CG CW FS    +E    I T KL SL+EQ LVDC  +  + GC GGL   A
Sbjct:    56 TPVKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQLLVDCAQAFNNHGCSGGLPSQA 115

Query:   201 FEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVA-NQP 259
             FE+I+ NKGL  E  YPY+A +G+C K + + + A +    ++   +EA +++AV  + P
Sbjct:   116 FEYILYNKGLMGEDAYPYRAQNGTC-KFQPDKAIAFVKDVINITQYDEAGMVEAVGKHNP 174

Query:   260 VSVAIDASGSDFQFYSSGVFTG-QCG---TELDHGVTAVGYGTADDGTKYWLVKNSWGTT 315
             VS A + + SDF  Y  GV++  +C     +++H V AVGYG  +DG  YW+VKNSWG  
Sbjct:   175 VSFAFEVT-SDFMHYRKGVYSNPRCEHTPDKVNHAVLAVGYGE-EDGRPYWIVKNSWGPL 232

Query:   316 WGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             WG +GY  ++R     + +CG+A  ASYP
Sbjct:   233 WGMDGYFLIERG----KNMCGLAACASYP 257


>WB|WBGene00007055
            symbol:tag-196 species:6239 "Caenorhabditis elegans"
            [GO:0004869 "cysteine-type endopeptidase inhibitor activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000010
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00031 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00043 SMART:SM00645 InterPro:IPR000169
            GO:GO:0004869 eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 EMBL:FO080488 PIR:T31871
            RefSeq:NP_505215.2 HSSP:Q9UBX1 ProteinModelPortal:O16454 SMR:O16454
            DIP:DIP-27400N IntAct:O16454 MINT:MINT-1044990 MEROPS:C01.A50
            PaxDb:O16454 EnsemblMetazoa:F41E6.6.1 EnsemblMetazoa:F41E6.6.2
            EnsemblMetazoa:F41E6.6.3 GeneID:179240 KEGG:cel:CELE_F41E6.6
            UCSC:F41E6.6.1 CTD:179240 WormBase:F41E6.6 InParanoid:O16454
            OMA:GGGLMTN NextBio:904514 Uniprot:O16454
        Length = 477

 Score = 486 (176.1 bits), Expect = 2.3e-46, P = 2.3e-46
 Identities = 121/309 (39%), Positives = 167/309 (54%)

Query:    45 QYGRVYRDNAEKEMRFKIFKENVEYIASFN-NKARNKPYKLGINEFADQTNEEFRAPRNG 103
             ++ + Y +  E   RF++FK+N + I     N+     Y  G  +F+D T  EF+     
Sbjct:   180 RHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVY--GFTKFSDMTTMEFKKIMLP 237

Query:   104 YKRRLP----SVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
             Y+   P       + E  DV+   E+  +P S DWR+KGAVT VK+QG CG CWAFS   
Sbjct:   238 YQWEQPVYPMEQANFEKHDVTINEED--LPESFDWREKGAVTQVKNQGNCGSCWAFSTTG 295

Query:   160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
              +EG   I   KL SLSEQELVDCD+   DQGC GGL  +A++ II   GL  E  YPY 
Sbjct:   296 NVEGAWFIAKNKLVSLSEQELVDCDSM--DQGCNGGLPSNAYKEIIRMGGLEPEDAYPYD 353

Query:   220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMK-AVANQPVSVAIDASGSDFQFYSSGV 278
                 +C+    +  A  I+G  ++P ++E  + K  V   P+S+ ++A+    QFY  GV
Sbjct:   354 GRGETCHLVRKD-IAVYINGSVELP-HDEVEMQKWLVTKGPISIGLNAN--TLQFYRHGV 409

Query:   279 ---FTGQCGT-ELDHGVTAVGYGTADDGTK-YWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
                F   C    L+HGV  VGYG   DG K YW+VKNSWG  WGE GY ++ R     + 
Sbjct:   410 VHPFKIFCEPFMLNHGVLIVGYGK--DGRKPYWIVKNSWGPNWGEAGYFKLYRG----KN 463

Query:   334 LCGIAMQAS 342
             +CG+   A+
Sbjct:   464 VCGVQEMAT 472


>GENEDB_PFALCIPARUM|PF11_0162
            symbol:PF11_0162 "falcipain-3" species:5833
            "Plasmodium falciparum" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 HSSP:P43235 EMBL:AE014186 GO:GO:0020020
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347833.1
            ProteinModelPortal:Q8IIL0 SMR:Q8IIL0 MEROPS:C01.063
            EnsemblProtists:PF11_0162:mRNA GeneID:810709 KEGG:pfa:PF11_0162
            EuPathDB:PlasmoDB:PF3D7_1115400 OMA:ENDEDYW ChEMBL:CHEMBL1250373
            Uniprot:Q8IIL0
        Length = 492

 Score = 484 (175.4 bits), Expect = 3.8e-46, P = 3.8e-46
 Identities = 117/324 (36%), Positives = 171/324 (52%)

Query:    41 MWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAP 100
             +++ +  + Y  + E + RF IF EN   I   +NK  N  YK G+N+F D + EEFR+ 
Sbjct:   173 IFLKENNKKYETSEEMQKRFIIFSENYRKI-ELHNKKTNSLYKRGMNKFGDLSPEEFRSK 231

Query:   101 -----RNG-YKRRLPSVR-SSETTDVSFRYENASVPA---SIDWRKKGAVTGVKDQGQCG 150
                   +G +K   P V   +   DV  +Y+ A       + DWR  G VT VKDQ  CG
Sbjct:   232 YLNLKTHGPFKTLSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGGVTPVKDQALCG 291

Query:   151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
              CWAFS+V ++E    I  + L   SEQELVDC  S ++ GC GG + +AF+ +I   GL
Sbjct:   292 SCWAFSSVGSVESQYAIRKKALFLFSEQELVDC--SVKNNGCYGGYITNAFDDMIDLGGL 349

Query:   211 ATEAKYPYKAS-DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
              ++  YPY ++   +CN K  N     I  Y  +P +     ++ +   P+S++I AS  
Sbjct:   350 CSQDDYPYVSNLPETCNLKRCNERYT-IKSYVSIPDDKFKEALRYLG--PISISIAAS-D 405

Query:   270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTAD---DGTK------YWLVKNSWGTTWGENG 320
             DF FY  G + G+CG   +H V  VGYG  D   + T       Y+++KNSWG+ WGE G
Sbjct:   406 DFAFYRGGFYDGECGAAPNHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWGEGG 465

Query:   321 YIRMQRDIDAKEGLCGIAMQASYP 344
             YI ++ D +  +  C I  +A  P
Sbjct:   466 YINLETDENGYKKTCSIGTEAYVP 489


>UNIPROTKB|Q8IIL0
            symbol:PF11_0162 "Falcipain-3" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 HSSP:P43235 EMBL:AE014186 GO:GO:0020020
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347833.1
            ProteinModelPortal:Q8IIL0 SMR:Q8IIL0 MEROPS:C01.063
            EnsemblProtists:PF11_0162:mRNA GeneID:810709 KEGG:pfa:PF11_0162
            EuPathDB:PlasmoDB:PF3D7_1115400 OMA:ENDEDYW ChEMBL:CHEMBL1250373
            Uniprot:Q8IIL0
        Length = 492

 Score = 484 (175.4 bits), Expect = 3.8e-46, P = 3.8e-46
 Identities = 117/324 (36%), Positives = 171/324 (52%)

Query:    41 MWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAP 100
             +++ +  + Y  + E + RF IF EN   I   +NK  N  YK G+N+F D + EEFR+ 
Sbjct:   173 IFLKENNKKYETSEEMQKRFIIFSENYRKI-ELHNKKTNSLYKRGMNKFGDLSPEEFRSK 231

Query:   101 -----RNG-YKRRLPSVR-SSETTDVSFRYENASVPA---SIDWRKKGAVTGVKDQGQCG 150
                   +G +K   P V   +   DV  +Y+ A       + DWR  G VT VKDQ  CG
Sbjct:   232 YLNLKTHGPFKTLSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGGVTPVKDQALCG 291

Query:   151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
              CWAFS+V ++E    I  + L   SEQELVDC  S ++ GC GG + +AF+ +I   GL
Sbjct:   292 SCWAFSSVGSVESQYAIRKKALFLFSEQELVDC--SVKNNGCYGGYITNAFDDMIDLGGL 349

Query:   211 ATEAKYPYKAS-DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
              ++  YPY ++   +CN K  N     I  Y  +P +     ++ +   P+S++I AS  
Sbjct:   350 CSQDDYPYVSNLPETCNLKRCNERYT-IKSYVSIPDDKFKEALRYLG--PISISIAAS-D 405

Query:   270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTAD---DGTK------YWLVKNSWGTTWGENG 320
             DF FY  G + G+CG   +H V  VGYG  D   + T       Y+++KNSWG+ WGE G
Sbjct:   406 DFAFYRGGFYDGECGAAPNHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWGEGG 465

Query:   321 YIRMQRDIDAKEGLCGIAMQASYP 344
             YI ++ D +  +  C I  +A  P
Sbjct:   466 YINLETDENGYKKTCSIGTEAYVP 489


>GENEDB_PFALCIPARUM|PF11_0165
            symbol:PF11_0165 "falcipain 2 precursor"
            species:5833 "Plasmodium falciparum" [GO:0020020 "food vacuole"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014186 HSSP:O65039 GO:GO:0020020
            RefSeq:XP_001347836.1 ProteinModelPortal:Q8I6U4 SMR:Q8I6U4
            IntAct:Q8I6U4 MINT:MINT-1559493 MEROPS:C01.046
            EnsemblProtists:PF11_0165:mRNA GeneID:810712 KEGG:pfa:PF11_0165
            EuPathDB:PlasmoDB:PF3D7_1115700 HOGENOM:HOG000065857 OMA:NESLHAN
            ProtClustDB:PTZ00021 BindingDB:Q8I6U4 ChEMBL:CHEMBL3470
            Uniprot:Q8I6U4
        Length = 484

 Score = 479 (173.7 bits), Expect = 1.3e-45, P = 1.3e-45
 Identities = 117/334 (35%), Positives = 169/334 (50%)

Query:    30 LNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEF 89
             +N+A    +  M++    + Y    E + RF++F +N   +   NN  +N  YK  +N F
Sbjct:   156 MNNAEHINQFYMFIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNN-KNSLYKKELNRF 214

Query:    90 ADQTNEEFRAPRNGYKRRLPSVRSSETTD------VSFRYE-NASVP-ASIDWRKKGAVT 141
             AD T  EF+      +   P   S    D      V  +Y+ N +   A+ DWR    VT
Sbjct:   215 ADLTYHEFKNKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYKGNENFDHAAYDWRLHSGVT 274

Query:   142 GVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAF 201
              VKDQ  CG CWAFS++ ++E    I   KL +LSEQELVDC  S ++ GC GGL+++AF
Sbjct:   275 PVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDC--SFKNYGCNGGLINNAF 332

Query:   202 EFIISNKGLATEAKYPYKASDGS--CNKKEANPSAAKISGYEDVPSNNEAALMKAVANQP 259
             E +I   G+ T+  YPY  SD    CN          I  Y  VP N     ++ +   P
Sbjct:   333 EDMIELGGICTDDDYPY-VSDAPNLCNIDRCTEKYG-IKNYLSVPDNKLKEALRFLG--P 388

Query:   260 VSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTAD-------DGTK--YWLVKN 310
             +S+++  S  DF FY  G+F G+CG +L+H V  VG+G  +        G K  Y+++KN
Sbjct:   389 ISISVAVS-DDFAFYKEGIFDGECGDQLNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKN 447

Query:   311 SWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             SWG  WGE G+I ++ D       CG+   A  P
Sbjct:   448 SWGQQWGERGFINIETDESGLMRKCGLGTDAFIP 481


>UNIPROTKB|Q8I6U4
            symbol:PF11_0165 "Falcipain-2A" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            EMBL:AE014186 HSSP:O65039 GO:GO:0020020 RefSeq:XP_001347836.1
            ProteinModelPortal:Q8I6U4 SMR:Q8I6U4 IntAct:Q8I6U4
            MINT:MINT-1559493 MEROPS:C01.046 EnsemblProtists:PF11_0165:mRNA
            GeneID:810712 KEGG:pfa:PF11_0165 EuPathDB:PlasmoDB:PF3D7_1115700
            HOGENOM:HOG000065857 OMA:NESLHAN ProtClustDB:PTZ00021
            BindingDB:Q8I6U4 ChEMBL:CHEMBL3470 Uniprot:Q8I6U4
        Length = 484

 Score = 479 (173.7 bits), Expect = 1.3e-45, P = 1.3e-45
 Identities = 117/334 (35%), Positives = 169/334 (50%)

Query:    30 LNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEF 89
             +N+A    +  M++    + Y    E + RF++F +N   +   NN  +N  YK  +N F
Sbjct:   156 MNNAEHINQFYMFIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNN-KNSLYKKELNRF 214

Query:    90 ADQTNEEFRAPRNGYKRRLPSVRSSETTD------VSFRYE-NASVP-ASIDWRKKGAVT 141
             AD T  EF+      +   P   S    D      V  +Y+ N +   A+ DWR    VT
Sbjct:   215 ADLTYHEFKNKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYKGNENFDHAAYDWRLHSGVT 274

Query:   142 GVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAF 201
              VKDQ  CG CWAFS++ ++E    I   KL +LSEQELVDC  S ++ GC GGL+++AF
Sbjct:   275 PVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDC--SFKNYGCNGGLINNAF 332

Query:   202 EFIISNKGLATEAKYPYKASDGS--CNKKEANPSAAKISGYEDVPSNNEAALMKAVANQP 259
             E +I   G+ T+  YPY  SD    CN          I  Y  VP N     ++ +   P
Sbjct:   333 EDMIELGGICTDDDYPY-VSDAPNLCNIDRCTEKYG-IKNYLSVPDNKLKEALRFLG--P 388

Query:   260 VSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTAD-------DGTK--YWLVKN 310
             +S+++  S  DF FY  G+F G+CG +L+H V  VG+G  +        G K  Y+++KN
Sbjct:   389 ISISVAVS-DDFAFYKEGIFDGECGDQLNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKN 447

Query:   311 SWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             SWG  WGE G+I ++ D       CG+   A  P
Sbjct:   448 SWGQQWGERGFINIETDESGLMRKCGLGTDAFIP 481


>WB|WBGene00019986
            symbol:R09F10.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            EMBL:FO081137 HSSP:P53634 PIR:D89588 RefSeq:NP_509408.1
            ProteinModelPortal:Q23030 SMR:Q23030 STRING:Q23030 MEROPS:C01.A44
            PaxDb:Q23030 EnsemblMetazoa:R09F10.1 GeneID:181087
            KEGG:cel:CELE_R09F10.1 UCSC:R09F10.1 CTD:181087 WormBase:R09F10.1
            InParanoid:Q23030 OMA:EYPYSAL NextBio:912346 Uniprot:Q23030
        Length = 383

 Score = 477 (173.0 bits), Expect = 2.1e-45, P = 2.1e-45
 Identities = 123/347 (35%), Positives = 183/347 (52%)

Query:     5 LLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFK 64
             L    ++L  +++L  +  Q  +  + +    +    ++ ++ R Y    E E R++IF 
Sbjct:    48 LFSGLVLLTMLILLSFFVFQRLNHKMENLKHEQMFNDFILKFDRKYTSVEEFEYRYQIFL 107

Query:    65 ENV-EYIASFNNKARNKPYKLGINEFADQTNEEFR--APRNGYKRRLPSVRSSETTDVSF 121
              NV E+ A    + RN    L +NEF D T+EE +     N Y +        +T     
Sbjct:   108 RNVIEFEAE---EERNLGLDLDVNEFTDWTDEELQKMVQENKYTKY-----DFDTPKFEG 159

Query:   122 RYENASV--PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
              Y    V  PASIDWR++G +T +K+QGQCG CWAF+ VA++E  N I   KL SLSEQE
Sbjct:   160 SYLETGVIRPASIDWREQGKLTPIKNQGQCGSCWAFATVASVEAQNAIKKGKLVSLSEQE 219

Query:   180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKA-SDGSCNKKEANPSAAKIS 238
             +VDCD  G + GC GG    A +F+  N GL +E +YPY A     C  KE N +   I 
Sbjct:   220 MVDCD--GRNNGCSGGYRPYAMKFVKEN-GLESEKEYPYSALKHDQCFLKE-NDTRVFID 275

Query:   239 GYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTGQ---CGTELD---HGV 291
              +  + SNNE  +   V  + PV+  ++   + +  Y SG+F      C TE     H +
Sbjct:   276 DFRML-SNNEEDIANWVGTKGPVTFGMNVVKAMYS-YRSGIFNPSVEDC-TEKSMGAHAL 332

Query:   292 TAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
             T +GYG  +  + YW+VKNSWGT+WG +GY R+ R +++    CG+A
Sbjct:   333 TIIGYG-GEGESAYWIVKNSWGTSWGASGYFRLARGVNS----CGLA 374


>GENEDB_PFALCIPARUM|PF11_0161
            symbol:PF11_0161 "falcipain-2 precursor,
            putative" species:5833 "Plasmodium falciparum" [GO:0020020 "food
            vacuole" evidence=TAS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014186 HSSP:O65039 OMA:NNVEHIN GO:GO:0020020
            MEROPS:C01.046 HOGENOM:HOG000065857 ProtClustDB:PTZ00021
            RefSeq:XP_001347832.1 ProteinModelPortal:Q8I6U5 SMR:Q8I6U5
            IntAct:Q8I6U5 MINT:MINT-1546851 EnsemblProtists:PF11_0161:mRNA
            GeneID:810708 KEGG:pfa:PF11_0161 EuPathDB:PlasmoDB:PF3D7_1115300
            Uniprot:Q8I6U5
        Length = 482

 Score = 473 (171.6 bits), Expect = 5.6e-45, P = 5.6e-45
 Identities = 114/314 (36%), Positives = 163/314 (51%)

Query:    50 YRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLP 109
             Y    E + RF++F +N   +   NN  ++  YK  +N FAD T  EF++     +   P
Sbjct:   174 YNSPNEMKERFQVFLQNAHKVKMHNNNKKSL-YKKELNRFADLTYHEFKSKYLTLRSSKP 232

Query:   110 SVRSSETTD-VSF-----RYE-NASVP-ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
                S    D +++     +Y+ N +   A+ DWR    VT VKDQ  CG CWAFS++ ++
Sbjct:   233 LKNSKYLLDQINYDAVIKKYKGNENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGSV 292

Query:   162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
             E    I   KL +LSEQELVDC  S ++ GC GGL+++AFE +I   G+ T+  YPY  S
Sbjct:   293 ESQYAIRKNKLITLSEQELVDC--SFKNYGCNGGLINNAFEDMIELGGICTDDDYPY-VS 349

Query:   222 DGS--CNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF 279
             D    CN          I  Y  VP N     ++ +   P+S++I  S  DF FY  G+F
Sbjct:   350 DAPNLCNIDRCTEKYG-IKNYLSVPDNKLKEALRFLG--PISISIAVS-DDFPFYKEGIF 405

Query:   280 TGQCGTELDHGVTAVGYGTAD-------DGTK--YWLVKNSWGTTWGENGYIRMQRDIDA 330
              G+CG EL+H V  VG+G  +        G K  Y+++KNSWG  WGE G+I ++ D   
Sbjct:   406 DGECGDELNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDESG 465

Query:   331 KEGLCGIAMQASYP 344
                 CG+   A  P
Sbjct:   466 LMRKCGLGTDAFIP 479


>UNIPROTKB|Q8I6U5
            symbol:PF11_0161 "Falcipain-2B" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            EMBL:AE014186 HSSP:O65039 OMA:NNVEHIN GO:GO:0020020 MEROPS:C01.046
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347832.1
            ProteinModelPortal:Q8I6U5 SMR:Q8I6U5 IntAct:Q8I6U5
            MINT:MINT-1546851 EnsemblProtists:PF11_0161:mRNA GeneID:810708
            KEGG:pfa:PF11_0161 EuPathDB:PlasmoDB:PF3D7_1115300 Uniprot:Q8I6U5
        Length = 482

 Score = 473 (171.6 bits), Expect = 5.6e-45, P = 5.6e-45
 Identities = 114/314 (36%), Positives = 163/314 (51%)

Query:    50 YRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLP 109
             Y    E + RF++F +N   +   NN  ++  YK  +N FAD T  EF++     +   P
Sbjct:   174 YNSPNEMKERFQVFLQNAHKVKMHNNNKKSL-YKKELNRFADLTYHEFKSKYLTLRSSKP 232

Query:   110 SVRSSETTD-VSF-----RYE-NASVP-ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
                S    D +++     +Y+ N +   A+ DWR    VT VKDQ  CG CWAFS++ ++
Sbjct:   233 LKNSKYLLDQINYDAVIKKYKGNENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGSV 292

Query:   162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
             E    I   KL +LSEQELVDC  S ++ GC GGL+++AFE +I   G+ T+  YPY  S
Sbjct:   293 ESQYAIRKNKLITLSEQELVDC--SFKNYGCNGGLINNAFEDMIELGGICTDDDYPY-VS 349

Query:   222 DGS--CNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF 279
             D    CN          I  Y  VP N     ++ +   P+S++I  S  DF FY  G+F
Sbjct:   350 DAPNLCNIDRCTEKYG-IKNYLSVPDNKLKEALRFLG--PISISIAVS-DDFPFYKEGIF 405

Query:   280 TGQCGTELDHGVTAVGYGTAD-------DGTK--YWLVKNSWGTTWGENGYIRMQRDIDA 330
              G+CG EL+H V  VG+G  +        G K  Y+++KNSWG  WGE G+I ++ D   
Sbjct:   406 DGECGDELNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDESG 465

Query:   331 KEGLCGIAMQASYP 344
                 CG+   A  P
Sbjct:   466 LMRKCGLGTDAFIP 479


>FB|FBgn0034229
            symbol:CG4847 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0032504
            "multicellular organism reproduction" evidence=IEP] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0005615 "extracellular space"
            evidence=ISM;IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:AE013599 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 GO:GO:0032504 GeneTree:ENSGT00560000076599
            KO:K01371 EMBL:BT099507 RefSeq:NP_725686.1 UniGene:Dm.4677
            SMR:A1ZAU4 IntAct:A1ZAU4 MEROPS:C01.A28 EnsemblMetazoa:FBtr0086935
            GeneID:36973 KEGG:dme:Dmel_CG4847 UCSC:CG4847-RB
            FlyBase:FBgn0034229 InParanoid:A1ZAU4 OMA:GGFQEYA OrthoDB:EOG4J9KFC
            ChiTaRS:CG4847 GenomeRNAi:36973 NextBio:801302 Uniprot:A1ZAU4
        Length = 420

 Score = 468 (169.8 bits), Expect = 1.9e-44, P = 1.9e-44
 Identities = 111/313 (35%), Positives = 164/313 (52%)

Query:    42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARN-KPYKLGINEFADQTNEEFRA 99
             +++Q G+ Y   A++ +    F      + + N   A+    +K  +N FAD T+ EF +
Sbjct:   115 FLSQSGKTYLSAADRALHEGAFASTKNLVEAGNAAFAQGVHTFKQAVNAFADLTHSEFLS 174

Query:   100 PRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAV 158
                G KR  P  ++     +      A  +P + DWR+ G VT VK QG CG CWAF+  
Sbjct:   175 QLTGLKRS-PEAKARAAASLKLVNLPAKPIPDAFDWREHGGVTPVKFQGTCGSCWAFATT 233

Query:   159 AAMEGINHITTRKLTSLSEQELVDCDTSGED---QGCEGGLMDDAFEFIIS-NKGLATEA 214
              A+EG     T  L +LSEQ LVDC    ED    GC+GG  + AF FI    KG++ E 
Sbjct:   234 GAIEGHTFRKTGSLPNLSEQNLVDCGPV-EDFGLNGCDGGFQEAAFCFIDEVQKGVSQEG 292

Query:   215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQF 273
              YPY  + G+C K + + S A + G+  +P  +E  L K VA   PV+ +++   +  + 
Sbjct:   293 AYPYIDNKGTC-KYDGSKSGATLQGFAAIPPKDEEQLKKVVATLGPVACSVNGLET-LKN 350

Query:   274 YSSGVFTG-QCGT-ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
             Y+ G++   +C   E +H +  VGYG+ + G  YW+VKNSW  TWGE GY R+ R     
Sbjct:   351 YAGGIYNDDECNKGEPNHSILVVGYGS-EKGQDYWIVKNSWDDTWGEKGYFRLPRG---- 405

Query:   332 EGLCGIAMQASYP 344
             +  C IA + SYP
Sbjct:   406 KNYCFIAEECSYP 418


>DICTYBASE|DDB_G0282991
            symbol:DDB_G0282991 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0282991 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            EMBL:AAFI02000049 eggNOG:NOG331187 RefSeq:XP_639299.1
            ProteinModelPortal:Q54RQ2 EnsemblProtists:DDB0185304 GeneID:8623870
            KEGG:ddi:DDB_G0282991 InParanoid:Q54RQ2 OMA:PENGNEY Uniprot:Q54RQ2
        Length = 339

 Score = 462 (167.7 bits), Expect = 8.1e-44, P = 8.1e-44
 Identities = 114/313 (36%), Positives = 169/313 (53%)

Query:    42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
             W  +Y ++Y  N E  MRF  FK+N EY+  +N K      +L  N FAD +  E+    
Sbjct:    30 WTNKYNKIY-SNKEFYMRFNNFKKNKEYVDQWNEKQLETILEL--NFFADLSRNEYI--- 83

Query:   102 NGYKRRLPSVRSSETTDVSFRYE-----NASVPASIDWRKKGAVTGVKDQGQC-GCCWAF 155
             N Y      + + E  +  +        N S+  SIDWR   AVT VK+QG C G  ++F
Sbjct:    84 NNYLASFIDISNIEQKNTKYEGNLKNNFNNSIK-SIDWRNFDAVTPVKNQGLCSGAGYSF 142

Query:   156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
             SA+  +E  + I  ++L +LSEQ ++DC T   + GC GGL   AF++II  KG+ +E  
Sbjct:   143 SAIGVIESSHFIKNKELITLSEQNIIDCTTDMGNNGCMGGLALIAFDYIIKQKGIDSEFN 202

Query:   216 YPYKA-------SDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
             YPY+          G C +  +  S A IS Y ++   NE  L +++   PVSV IDAS 
Sbjct:   203 YPYEGYLIEPYEGRGRC-RYNSFYSKASISSYIEIERFNENELTQSLIKSPVSVMIDASQ 261

Query:   269 SDFQFYSSGVFTG-QCG-TELDHGVTAVGYG-TADDGTKYWLVKNSWGTTWGENGYIRMQ 325
               F  Y SGV+    C  T L+HG+  +G+G T ++G +Y+++KNS+G+ WG  GYI + 
Sbjct:   262 LSFMLYKSGVYKDPSCSSTILNHGILNIGFGVTPENGNEYYILKNSFGSKWGMKGYIYLS 321

Query:   326 RDIDAKEGLCGIA 338
             R+ +     CGI+
Sbjct:   322 RNFNNH---CGIS 331


>FB|FBgn0033874
            symbol:CG6347 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE013599 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 HSSP:P53634 EMBL:AY069609
            RefSeq:NP_610906.1 UniGene:Dm.608 SMR:Q7K0S6 MEROPS:C01.A29
            EnsemblMetazoa:FBtr0087637 GeneID:36531 KEGG:dme:Dmel_CG6347
            UCSC:CG6347-RA FlyBase:FBgn0033874 InParanoid:Q7K0S6 OMA:FEYIRDH
            OrthoDB:EOG4FQZ74 GenomeRNAi:36531 NextBio:799046 Uniprot:Q7K0S6
        Length = 352

 Score = 462 (167.7 bits), Expect = 8.1e-44, P = 8.1e-44
 Identities = 121/353 (34%), Positives = 182/353 (51%)

Query:     6 LENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKE 65
             L+  L LA +  + +   QS+ + L D    +  + ++ Q G+VY D  E+  R  IF  
Sbjct:     9 LQMTLGLALLGAVSLQQLQSFPK-LCDV---QNFDDFLRQTGKVYSDE-ERVYRESIFAA 63

Query:    66 NVEYIASFNNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVR--SSETTDVSF 121
              +  I   N  A N    ++LG+N  AD T +E  A   G K      R  +     V+ 
Sbjct:    64 KMSLITLSNKNADNGVSGFRLGVNTLADMTRKEI-ATLLGSKISEFGERYTNGHINFVTA 122

Query:   122 RYE-NASVPASIDWRKKGAVTGVKDQGQ-CGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
             R   +A++P   DWR+KG VT    QG  CG CW+F+   A+EG     T  L SLS+Q 
Sbjct:   123 RNPASANLPEMFDWREKGGVTPPGFQGVGCGACWSFATTGALEGHLFRRTGVLASLSQQN 182

Query:   180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEA--NP---SA 234
             LVDC     + GC+GG  +  FE+I  + G+    KYPY  ++  C + E    P   S 
Sbjct:   183 LVDCADDYGNMGCDGGFQEYGFEYI-RDHGVTLANKYPYTQTEMQCRQNETAGRPPRESL 241

Query:   235 AKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTGQ-CGT-ELDHGV 291
              KI  Y  +   +E  + + +A   P++ +++A    F+ YS G++  + C   EL+H V
Sbjct:   242 VKIRDYATITPGDEEKMKEVIATLGPLACSMNADTISFEQYSGGIYEDEECNQGELNHSV 301

Query:   292 TAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             T VGYGT ++G  YW++KNS+   WGE G++R+ R+     G CGIA + SYP
Sbjct:   302 TVVGYGT-ENGRDYWIIKNSYSQNWGEGGFMRILRNAG---GFCGIASECSYP 350


>TAIR|locus:2082687
            symbol:AT3G54940 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002686 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HSSP:P53634
            OMA:GGGLMTN EMBL:AY070063 IPI:IPI00528988 RefSeq:NP_567010.5
            UniGene:At.28412 ProteinModelPortal:Q8VYS0 SMR:Q8VYS0 PRIDE:Q8VYS0
            EnsemblPlants:AT3G54940.2 GeneID:824659 KEGG:ath:AT3G54940
            TAIR:At3g54940 PhylomeDB:Q8VYS0 ProtClustDB:CLSN2718801
            ArrayExpress:Q8VYS0 Genevestigator:Q8VYS0 Uniprot:Q8VYS0
        Length = 367

 Score = 461 (167.3 bits), Expect = 1.0e-43, P = 1.0e-43
 Identities = 110/320 (34%), Positives = 160/320 (50%)

Query:    38 RHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEF 97
             +  ++M+ YG+ Y    E   R  IF +NV  + +  ++  +     G+ +F+D T EEF
Sbjct:    50 KFRLFMSDYGKNYSTREEYIHRLGIFAKNV--LKAAEHQMMDPSAVHGVTQFSDLTEEEF 107

Query:    98 RAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
             +    G    +   R       +   E   +P   DWR+KG VT VK+QG CG CWAFS 
Sbjct:   108 KRMYTGVAD-VGGSRGGTVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFST 166

Query:   158 VAAMEGINHITTRKLTSLSEQELVDCDTSGE-------DQGCEGGLMDDAFEFIISNKGL 210
               A EG + ++T KL SLSEQ+LVDCD + +       D GC GGLM +A+E+++   GL
Sbjct:   167 TGAAEGAHFVSTGKLLSLSEQQLVDCDQACDPKDKKACDNGCGGGLMTNAYEYLMEAGGL 226

Query:   211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
               E  YPY    G C K +    A ++  +  +P +        V + P++V ++A    
Sbjct:   227 EEERSYPYTGKRGHC-KFDPEKVAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVF-- 283

Query:   271 FQFYSSGVFTGQ-CGTE-LDHGVTAVGYGTAD------DGTKYWLVKNSWGTTWGENGYI 322
              Q Y  GV     C    ++HGV  VGYG+            YW++KNSWG  WGENGY 
Sbjct:   284 MQTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYY 343

Query:   323 RMQRDIDAKEGLCGIAMQAS 342
             ++ R  D    +CGI    S
Sbjct:   344 KLCRGHD----ICGINSMVS 359


>RGD|1308181
            symbol:Ctsf "cathepsin F" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1308181 eggNOG:COG4870 HOGENOM:HOG000230774
            KO:K01373 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 MEROPS:C01.018
            CTD:8722 OMA:LAPPEWD OrthoDB:EOG4CC41T GeneTree:ENSGT00660000095458
            EMBL:CH473953 EMBL:BC099780 EMBL:EU253481 IPI:IPI00201100
            RefSeq:NP_001029282.1 UniGene:Rn.25087 SMR:Q499S6
            Ensembl:ENSRNOT00000026718 GeneID:361704 KEGG:rno:361704
            UCSC:RGD:1308181 InParanoid:Q499S6 NextBio:677325
            Genevestigator:Q499S6 Uniprot:Q499S6
        Length = 462

 Score = 435 (158.2 bits), Expect = 1.2e-43, Sum P(2) = 1.2e-43
 Identities = 115/307 (37%), Positives = 149/307 (48%)

Query:    42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
             +M  Y R Y    E + R  +F  N+          R    + GI +F+D T EEF    
Sbjct:   168 FMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTA-QYGITKFSDLTEEEFHTI- 225

Query:   102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
               Y   L    S     ++ +  N   P   DWRKKGAVT VKDQG CG CWAFS    +
Sbjct:   226 --YLNPLLQKESGGKMSLA-KSINDLAPPEWDWRKKGAVTEVKDQGMCGSCWAFSVTGNV 282

Query:   162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
             EG   +    L SLSEQEL+DCD    D+ C GGL  +A+  I +  GL TE  Y Y+  
Sbjct:   283 EGQWFLNRGTLLSLSEQELLDCDKM--DKACMGGLPSNAYTAIKNLGGLETEDDYGYQGH 340

Query:   222 DGSCNKKEANPSAAKISGYEDVP-SNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGV- 278
               +CN    +   AK+   + V  S +E  +   +A + P+SVAI+A G   QFY  G+ 
Sbjct:   341 VQACN---FSTQMAKVYINDSVELSRDENKIAAWLAQKGPISVAINAFG--MQFYRHGIA 395

Query:   279 --FTGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
               F   C    +DH V  VGYG   +   YW +KNSWG  WGE GY  + R      G C
Sbjct:   396 HPFRPLCSPWFIDHAVLLVGYGNRSN-IPYWAIKNSWGRDWGEEGYYYLYRG----SGAC 450

Query:   336 GIAMQAS 342
             G+   AS
Sbjct:   451 GVNTMAS 457

 Score = 42 (19.8 bits), Expect = 1.2e-43, Sum P(2) = 1.2e-43
 Identities = 19/56 (33%), Positives = 26/56 (46%)

Query:     1 MAMIL-LENKLVLAAILVLGV-----WAP--QSWSRTLNDATMNERHEMWMAQYGR 48
             MA++L L   L L A +VL       WA   Q+WS +  +     R  + M  YGR
Sbjct:     1 MALLLQLLWLLTLIATVVLSPVPAKPWADDEQAWSLSSQELLAPARFALDMYNYGR 56


>DICTYBASE|DDB_G0274385
            symbol:DDB_G0274385 "Cysteine proteinase 1,
            mitochondrial" species:44689 "Dictyostelium discoideum" [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0274385 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 EMBL:AAFI02000012 RefSeq:XP_644301.1
            ProteinModelPortal:Q86KD4 EnsemblProtists:DDB0167535 GeneID:8619729
            KEGG:ddi:DDB_G0274385 InParanoid:Q86KD4 OMA:SICVDAS Uniprot:Q86KD4
        Length = 358

 Score = 458 (166.3 bits), Expect = 2.2e-43, P = 2.2e-43
 Identities = 119/331 (35%), Positives = 170/331 (51%)

Query:    31 NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
             +D++M +    W  ++ ++Y+D+ E E RF  FKEN++     N+    K  K   N F+
Sbjct:    36 SDSSMRDTFNHWAKKHSKIYKDSIEMENRFSNFKENMKKNIELNSMHAGKA-KFESNGFS 94

Query:    91 DQTNEEFRA--PRNGYKRRLPSVRSS----ETTDVSF-----RYENASVPA--SIDWRKK 137
             D + EEF        +K +   +R+S     T   S        EN  +    SIDWRKK
Sbjct:    95 DLSEEEFSNFHLNKAFKGKPSHLRNSIKPQPTPHHSLINGYKEMENGDLNELYSIDWRKK 154

Query:   138 GAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLM 197
             G VT VKDQGQCG C+ FSAV  +E        K   LSEQ+ VDCD    D  C GG  
Sbjct:   155 GLVTPVKDQGQCGSCYIFSAVEQIETAWIKAGNKPILLSEQQAVDCDPY--DGQCGGGDP 212

Query:   198 DDAFEFIISNKGLATEAKYPYKASDGSC-NKKEANPSAAKISGYEDVPSNNEAALMKAVA 256
                +E+     G++T A+YPY A+DG+C N   A P    +S +      +E  L+K + 
Sbjct:   213 YTVYEYFSQVGGVSTNAQYPYTATDGTCVNMSRAVPV---VSYHYVTQGGDENTLIKTIV 269

Query:   257 NQ-PVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGT----ADDGTKYWLVKNS 311
             N  PVS+ +DAS   +Q YS G+ T  CG  +DH V  VG         +  +Y++++NS
Sbjct:   270 NDGPVSICVDAS--TWQSYSGGIITTGCGKNIDHCVQVVGLEVDKTDPSNPVQYYIIRNS 327

Query:   312 WGTTWGENGYIRMQRDIDAKEGLCGIAMQAS 342
             WGT WG +GYI +    D    LCGI  +++
Sbjct:   328 WGTDWGIDGYIYVATGSD----LCGITYEST 354


>UNIPROTKB|Q9UBX1
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=TAS] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_6900 GO:GO:0019886 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0043202
            GO:GO:0004197 HOVERGEN:HBG011513 EMBL:AJ007331 EMBL:AF088886
            EMBL:AF132894 EMBL:AF136279 EMBL:AF071748 EMBL:AF071749
            EMBL:AK313657 EMBL:BC011682 EMBL:BC036451 EMBL:AL137742
            IPI:IPI00002816 RefSeq:NP_003784.2 UniGene:Hs.11590 PDB:1D5U
            PDB:1M6D PDBsum:1D5U PDBsum:1M6D ProteinModelPortal:Q9UBX1
            SMR:Q9UBX1 STRING:Q9UBX1 MEROPS:C01.018 PhosphoSite:Q9UBX1
            DMDM:12643325 PaxDb:Q9UBX1 PeptideAtlas:Q9UBX1 PRIDE:Q9UBX1
            DNASU:8722 Ensembl:ENST00000310325 GeneID:8722 KEGG:hsa:8722
            UCSC:uc001oip.3 CTD:8722 GeneCards:GC11M066332 HGNC:HGNC:2531
            HPA:CAB002141 MIM:603539 neXtProt:NX_Q9UBX1 PharmGKB:PA27031
            InParanoid:Q9UBX1 OMA:LAPPEWD OrthoDB:EOG4CC41T PhylomeDB:Q9UBX1
            BindingDB:Q9UBX1 ChEMBL:CHEMBL2517 ChiTaRS:CTSF
            EvolutionaryTrace:Q9UBX1 GenomeRNAi:8722 NextBio:32715
            ArrayExpress:Q9UBX1 Bgee:Q9UBX1 CleanEx:HS_CTSF
            Genevestigator:Q9UBX1 GermOnline:ENSG00000174080 Uniprot:Q9UBX1
        Length = 484

 Score = 448 (162.8 bits), Expect = 2.5e-42, P = 2.5e-42
 Identities = 117/304 (38%), Positives = 146/304 (48%)

Query:    46 YGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR-NGY 104
             Y R Y    E   R  +F  N+          R    + G+ +F+D T EEFR    N  
Sbjct:   194 YNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTA-QYGVTKFSDLTEEEFRTIYLNTL 252

Query:   105 KRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGI 164
              R+ P  +  +   V         P   DWR KGAVT VKDQG CG CWAFS    +EG 
Sbjct:   253 LRKEPGNKMKQAKSVG-----DLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQ 307

Query:   165 NHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGS 224
               +    L SLSEQEL+DCD    D+ C GGL  +A+  I +  GL TE  Y Y+    S
Sbjct:   308 WFLNQGTLLSLSEQELLDCDKM--DKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQS 365

Query:   225 CNKKEANPSAAKISGYEDVP-SNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGV---F 279
             CN    +   AK+   + V  S NE  L   +A + P+SVAI+A G   QFY  G+    
Sbjct:   366 CN---FSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFG--MQFYRHGISRPL 420

Query:   280 TGQCGTEL-DHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
                C   L DH V  VGYG   D   +W +KNSWGT WGE GY  + R      G CG+ 
Sbjct:   421 RPLCSPWLIDHAVLLVGYGNRSD-VPFWAIKNSWGTDWGEKGYYYLHRG----SGACGVN 475

Query:   339 MQAS 342
               AS
Sbjct:   476 TMAS 479


>UNIPROTKB|F1RU48
            symbol:CTSF "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            OMA:LAPPEWD GeneTree:ENSGT00660000095458 EMBL:CU928034
            EMBL:FP565364 Ensembl:ENSSSCT00000014140 Ensembl:ENSSSCT00000014154
            Uniprot:F1RU48
        Length = 460

 Score = 443 (161.0 bits), Expect = 8.4e-42, P = 8.4e-42
 Identities = 118/309 (38%), Positives = 155/309 (50%)

Query:    42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK-PYKLGINEFADQTNEEFRAP 100
             ++  Y R Y    E   R  +F  N+  + +   +A +    + G+ +F+D T EEFR  
Sbjct:   166 FVTTYNRTYDTKEEARWRMSVFANNM--VRAQKIQALDTGTARYGVTKFSDLTEEEFRTI 223

Query:   101 R-NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
               N   +  P  +      VS     +  P   DWRKKGAVT VKDQG CG CWAFS   
Sbjct:   224 YLNPLLQEEPGRKMRLAKSVS-----SLPPPEWDWRKKGAVTKVKDQGMCGSCWAFSVTG 278

Query:   160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
              +EG   +    L SLSEQEL+DCD    D+GC GGL  +A+  I +  GL TE  Y Y+
Sbjct:   279 NVEGQWFLKQGTLLSLSEQELLDCDKV--DKGCMGGLPSNAYSAIKTLGGLETEEDYSYR 336

Query:   220 ASDGSCNKKEANPSAAKISGYEDVP-SNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSG 277
                 +C+    N   AK+   + V  S NE  L   +A + P+SVAI+A G   QFY  G
Sbjct:   337 GHLQTCS---FNAEKAKVYINDSVELSQNEQKLAAWLAEKGPISVAINAFG--MQFYRHG 391

Query:   278 V---FTGQCGTEL-DHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
             +       C   L DH V  VGYG     T +W +KNSWGT WGE GY  + R      G
Sbjct:   392 ISHPLRPLCSPWLIDHAVLLVGYGNRS-ATPFWAIKNSWGTDWGEEGYYYLYRG----SG 446

Query:   334 LCGIAMQAS 342
              CG+ + AS
Sbjct:   447 ACGVNIMAS 455


>UNIPROTKB|E2RR02
            symbol:CTSF "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            OMA:LAPPEWD GeneTree:ENSGT00660000095458 EMBL:AAEX03011628
            Ensembl:ENSCAFT00000019742 Uniprot:E2RR02
        Length = 460

 Score = 442 (160.7 bits), Expect = 1.1e-41, P = 1.1e-41
 Identities = 114/306 (37%), Positives = 150/306 (49%)

Query:    42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
             ++  Y R Y    E E R  +F  N+          R    + GI +F+D T EEFR   
Sbjct:   165 FVTTYNRTYETKEEAEWRMSVFSNNMVRAQKIQALDRGTA-QYGITKFSDLTEEEFRTI- 222

Query:   102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
               Y   L      +   ++    + + P   DWR KGAVT VKDQG CG CWAFS    +
Sbjct:   223 --YLNPLLRENRGKKMRLAKSISDHAPPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNV 280

Query:   162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
             EG   +    L SLSEQEL+DCD    D+ C GGL  +A+  I++  GL TE  Y Y+  
Sbjct:   281 EGQWFLKEGTLLSLSEQELLDCDKV--DKACLGGLPSNAYSAIMTLGGLETEDDYSYQGH 338

Query:   222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGV-- 278
               +C+   A  +   I+   ++ S NE  L   +A + P+SVAI+A G   QFY  G+  
Sbjct:   339 LQACSFS-AKKARVYINDSMEL-SQNEQKLAAWLAKKGPISVAINAFG--MQFYRHGISH 394

Query:   279 -FTGQCGTEL-DHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
                  C   L DH V  VGYG    G  +W +KNSWGT WGE GY  + R      G CG
Sbjct:   395 PLRPLCSPWLIDHAVLLVGYGNRS-GIPFWAIKNSWGTDWGEEGYYYLHRG----SGACG 449

Query:   337 IAMQAS 342
             +   AS
Sbjct:   450 VNTMAS 455


>UNIPROTKB|Q0VCU3
            symbol:CTSF "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            HOVERGEN:HBG011513 MEROPS:C01.018 CTD:8722 OMA:LAPPEWD
            OrthoDB:EOG4CC41T GeneTree:ENSGT00660000095458 EMBL:DAAA02063594
            EMBL:BC120003 IPI:IPI00717812 RefSeq:NP_001068884.1 UniGene:Bt.7264
            SMR:Q0VCU3 Ensembl:ENSBTAT00000014587 GeneID:509715 KEGG:bta:509715
            InParanoid:Q0VCU3 NextBio:20869091 Uniprot:Q0VCU3
        Length = 460

 Score = 434 (157.8 bits), Expect = 7.5e-41, P = 7.5e-41
 Identities = 117/311 (37%), Positives = 153/311 (49%)

Query:    42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
             ++  Y R Y    E   R  +F  N+          R    + G+ +F+D T EEFR   
Sbjct:   166 FVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTA-RYGVTKFSDLTEEEFRTIY 224

Query:   102 -NGYKRRLP--SVRSSE-TTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
              N   +  P  ++R ++  TDV         P   DWR KGAVT VKDQG CG CWAFS 
Sbjct:   225 LNPLLKDAPGRNMRPAQPVTDVP--------PPQWDWRNKGAVTNVKDQGMCGSCWAFSV 276

Query:   158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
                +EG   +    L SLSEQEL+DCD +  D+ C GGL  +A+  I +  GL TE  Y 
Sbjct:   277 TGNVEGQWFLKRGTLLSLSEQELLDCDKT--DKACLGGLPSNAYSAIRTLGGLETEDDYS 334

Query:   218 YKASDGSCNKKEANPSAAKISGYEDVP-SNNEAALMKAVA-NQPVSVAIDASGSDFQFYS 275
             Y+    +C+    +   AK+   + V  S NE  L   +A N PVS+AI+A G   QFY 
Sbjct:   335 YRGRLQTCS---FSAEKAKVYINDSVELSKNEQKLAAWLAKNGPVSIAINAFG--MQFYR 389

Query:   276 SGV---FTGQCGTEL-DHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
              G+       C   L DH V  VGYG       +W +KNSWGT WGE GY  + R     
Sbjct:   390 HGISHPLRPLCSPWLIDHAVLLVGYGNRS-AIPFWAIKNSWGTDWGEEGYYYLHRG---- 444

Query:   332 EGLCGIAMQAS 342
              G CG+ + AS
Sbjct:   445 SGACGVNIMAS 455


>MGI|MGI:1861434
            symbol:Ctsf "cathepsin F" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=ISS]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISS] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:1861434 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513 MEROPS:C01.018
            CTD:8722 OMA:LAPPEWD OrthoDB:EOG4CC41T EMBL:AF136280 EMBL:AF217224
            EMBL:AJ131851 EMBL:AK075862 EMBL:BC058758 IPI:IPI00126769
            RefSeq:NP_063914.1 UniGene:Mm.29561 ProteinModelPortal:Q9R013
            SMR:Q9R013 STRING:Q9R013 PhosphoSite:Q9R013 PaxDb:Q9R013
            PRIDE:Q9R013 Ensembl:ENSMUST00000119694 GeneID:56464 KEGG:mmu:56464
            UCSC:uc008gbc.1 GeneTree:ENSGT00660000095458 InParanoid:Q9R013
            NextBio:312722 Bgee:Q9R013 CleanEx:MM_CTSF Genevestigator:Q9R013
            GermOnline:ENSMUSG00000006458 Uniprot:Q9R013
        Length = 462

 Score = 434 (157.8 bits), Expect = 7.5e-41, P = 7.5e-41
 Identities = 115/307 (37%), Positives = 149/307 (48%)

Query:    42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
             +M  Y R Y    E + R  +F  N+          R    + GI +F+D T EEF    
Sbjct:   168 FMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTA-QYGITKFSDLTEEEFHTI- 225

Query:   102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
               Y   L    S      + +  N   P   DWRKKGAVT VK+QG CG CWAFS    +
Sbjct:   226 --YLNPLLQKESGRKMSPA-KSINDLAPPEWDWRKKGAVTEVKNQGMCGSCWAFSVTGNV 282

Query:   162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
             EG   +    L SLSEQEL+DCD    D+ C GGL  +A+  I +  GL TE  Y Y+  
Sbjct:   283 EGQWFLNRGTLLSLSEQELLDCDKV--DKACLGGLPSNAYAAIKNLGGLETEDDYGYQGH 340

Query:   222 DGSCNKKEANPSAAKISGYEDVP-SNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGV- 278
               +CN    +   AK+   + V  S NE  +   +A + P+SVAI+A G   QFY  G+ 
Sbjct:   341 VQTCN---FSAQMAKVYINDSVELSRNENKIAAWLAQKGPISVAINAFG--MQFYRHGIA 395

Query:   279 --FTGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
               F   C    +DH V  VGYG   +   YW +KNSWG+ WGE GY  + R      G C
Sbjct:   396 HPFRPLCSPWFIDHAVLLVGYGNRSN-IPYWAIKNSWGSDWGEEGYYYLYRG----SGAC 450

Query:   336 GIAMQAS 342
             G+   AS
Sbjct:   451 GVNTMAS 457


>FB|FBgn0032228
            symbol:CG5367 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE014134 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 HSSP:P80067
            RefSeq:NP_609387.1 UniGene:Dm.26782 ProteinModelPortal:Q9VKY4
            SMR:Q9VKY4 MEROPS:C01.A30 EnsemblMetazoa:FBtr0080055 GeneID:34401
            KEGG:dme:Dmel_CG5367 UCSC:CG5367-RA FlyBase:FBgn0032228
            InParanoid:Q9VKY4 OMA:QIVDCSV OrthoDB:EOG4THT8X PhylomeDB:Q9VKY4
            GenomeRNAi:34401 NextBio:788324 ArrayExpress:Q9VKY4 Bgee:Q9VKY4
            Uniprot:Q9VKY4
        Length = 338

 Score = 431 (156.8 bits), Expect = 1.6e-40, P = 1.6e-40
 Identities = 101/296 (34%), Positives = 161/296 (54%)

Query:    57 EMR-FKIFKENVEYIASFNN--KARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPS-VR 112
             EMR +K F+EN + I   N   K     ++L  N FAD + + +     G+ R L S + 
Sbjct:    53 EMRSYKAFEENFKVIEEHNQNYKEGQTSFRLKPNIFADMSTDGYL---KGFLRLLKSNIE 109

Query:   113 SS--ETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTR 170
              S     ++      A+VP S+DWR KG +T   +Q  CG C+AFS   ++ G     T 
Sbjct:   110 DSADNMAEIVGSPLMANVPESLDWRSKGFITPPYNQLSCGSCYAFSIAESIMGQVFKRTG 169

Query:   171 KLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEA 230
             K+ SLS+Q++VDC  S  +QGC GG + +   ++ S  G+  +  YPY A  G C +   
Sbjct:   170 KILSLSKQQIVDCSVSHGNQGCVGGSLRNTLSYLQSTGGIMRDQDYPYVARKGKC-QFVP 228

Query:   231 NPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFTGQ-CGT-EL 287
             + S   ++ +  +P  +E A+  AV +  PV+++I+AS   FQ YS G++    C +  +
Sbjct:   229 DLSVVNVTSWAILPVRDEQAIQAAVTHIGPVAISINASPKTFQLYSDGIYDDPLCSSASV 288

Query:   288 DHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASY 343
             +H +  +G+G       YW++KN WG  WGENGYIR+++ ++    +CGIA  A+Y
Sbjct:   289 NHAMVVIGFGK-----DYWILKNWWGQNWGENGYIRIRKGVN----MCGIANYAAY 335


>ZFIN|ZDB-GENE-030131-9831
            symbol:ctsf "cathepsin F" species:7955 "Danio
            rerio" [GO:0004869 "cysteine-type endopeptidase inhibitor activity"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000010 InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00031 Pfam:PF00112 PRINTS:PR00705 SMART:SM00043
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-030131-9831
            GO:GO:0004869 eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 HOVERGEN:HBG011513 CTD:8722 OrthoDB:EOG4CC41T
            MEROPS:I25.006 EMBL:BC124243 IPI:IPI00503226 RefSeq:NP_001071036.1
            UniGene:Dr.81265 ProteinModelPortal:Q08CH0 SMR:Q08CH0 GeneID:565588
            KEGG:dre:565588 InParanoid:Q08CH0 NextBio:20885952
            ArrayExpress:Q08CH0 Uniprot:Q08CH0
        Length = 473

 Score = 431 (156.8 bits), Expect = 1.6e-40, P = 1.6e-40
 Identities = 112/305 (36%), Positives = 151/305 (49%)

Query:    42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
             +M  Y R Y    E E R +IF++N++   +  +  +    + GI +F+D T +EFR   
Sbjct:   178 FMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSA-EYGITKFSDLTEDEFRMM- 235

Query:   102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
               Y   + S  S +         +A  P + DWR  GAV+ VK+QG CG CWAFS    +
Sbjct:   236 --YLNPMLSQWSLKKEMKPAIPASAPAPDTWDWRDHGAVSPVKNQGMCGSCWAFSVTGNI 293

Query:   162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
             EG     T +L SLSEQELVDCD    DQ C GGL  +A+E I +  GL TE  Y Y   
Sbjct:   294 EGQWFKKTGQLLSLSEQELVDCDKL--DQACGGGLPSNAYEAIENLGGLETETDYSYTGH 351

Query:   222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTG 281
               SC+       AA I+   ++P + +        N PVS A++A     QFY  GV   
Sbjct:   352 KQSCDFSTGKV-AAYINSSVELPKDEKEIAAFLAENGPVSAALNAFA--MQFYRKGVSHP 408

Query:   282 Q---CGT-ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
                 C    +DH V  VG+G  + G  +W +KNSWG  +GE GY  + R      GLCGI
Sbjct:   409 LKIFCNPWMIDHAVLLVGFGQRN-GVPFWAIKNSWGEDYGEQGYYYLYRG----SGLCGI 463

Query:   338 AMQAS 342
                 S
Sbjct:   464 HKMCS 468


>RGD|1564827
            symbol:RGD1564827 "similar to cathepsin M" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 IPI:IPI00192321
            Ensembl:ENSRNOT00000023990 ArrayExpress:D3ZY04 Uniprot:D3ZY04
        Length = 338

 Score = 430 (156.4 bits), Expect = 2.0e-40, P = 2.0e-40
 Identities = 88/204 (43%), Positives = 117/204 (57%)

Query:   146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
             QG+C  CWAF  V A+EG     T KLT LS Q LVDC     ++GC GG   +AF++++
Sbjct:   139 QGRCNSCWAFPVVGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVL 198

Query:   206 SNKGLATEAKYPYKASDGSCNKKEANP-SAAKISGYEDVPSNNEAALMKAVANQPVSVAI 264
              N GL +EA YPY+  +G C     NP S+AKI+     P  NE  LM AVA +PV+  I
Sbjct:   199 QNGGLESEATYPYEGKEGLCRY---NPNSSAKITXICAPPQKNEDVLMDAVATKPVAAGI 255

Query:   265 DASGSDFQFYSSGVF-TGQCGTELDHGVTAVGYG---TADDGTKYWLVKNSWGTTWGENG 320
                 S  +FY  G++   +C   ++H V  VGYG      DG  YWL++NSWG  WG NG
Sbjct:   256 HVVHSSLRFYKKGIYHEPKCNNYVNHAVLVVGYGFEGNETDGNNYWLIQNSWGERWGLNG 315

Query:   321 YIRMQRDIDAKEGLCGIAMQASYP 344
             Y+++ +D   +   CGIA  A YP
Sbjct:   316 YMKIAKD---RNNHCGIATFAQYP 336


>DICTYBASE|DDB_G0281077
            symbol:DDB_G0281077 species:44689 "Dictyostelium
            discoideum" [GO:0030246 "carbohydrate binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR008965 InterPro:IPR013128 InterPro:IPR019028
            Pfam:PF00112 Pfam:PF09478 PRINTS:PR00705 SMART:SM00645
            SMART:SM01063 InterPro:IPR000169 dictyBase:DDB_G0281077
            GO:GO:0030246 EMBL:AAFI02000040 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 SUPFAM:SSF49384 ProtClustDB:CLSZ2430562
            RefSeq:XP_640803.1 ProteinModelPortal:Q54UH3
            EnsemblProtists:DDB0203998 GeneID:8622857 KEGG:ddi:DDB_G0281077
            InParanoid:Q54UH3 OMA:LINDFNF Uniprot:Q54UH3
        Length = 662

 Score = 353 (129.3 bits), Expect = 2.8e-40, Sum P(2) = 2.8e-40
 Identities = 80/205 (39%), Positives = 115/205 (56%)

Query:   110 SVRSSETTDV--SFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHI 167
             S  S+ TTD     R    S P SIDWR  G V+ VK+QG CG C+AFS V A+E   + 
Sbjct:   451 SSSSNITTDEPSKSRLLKWSRPISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALEAHYYR 510

Query:   168 TTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNK 227
                ++ +LSEQ LVDC  +  +  C GG M + F +I  N G+  ++ YPY+   G C +
Sbjct:   511 KNNRMLNLSEQNLVDCTRNYGNGECSGGWMHNCFRYIKENGGINLQSTYPYEGRVGLC-R 569

Query:   228 KEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTGQ-CGT 285
               +  + ++IS Y  +  ++E  L  AVA+  PVSVA DAS  +F +YSSG++    C  
Sbjct:   570 YNSGDAQSRISNYVMIKQHDEEDLANAVASVGPVSVAYDASTREFMYYSSGIYNSDSCDK 629

Query:   286 -ELDHGVTAVGYGTADDGTKYWLVK 309
                 H V  VGYG  ++G  +W++K
Sbjct:   630 YRTTHAVVVVGYGI-ENGVDFWIIK 653

 Score = 106 (42.4 bits), Expect = 2.8e-40, Sum P(2) = 2.8e-40
 Identities = 22/76 (28%), Positives = 43/76 (56%)

Query:    42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
             W  Q+ R YR + +  ++++ FK++  +I  +  + +N   +LG+ +F+D T++EF    
Sbjct:   165 WSNQFNRTYRAD-QFLLKYEAFKDSSRFIEQYKRENQNSTMELGLTQFSDMTHDEFL--- 220

Query:   102 NGYKRRLPSVRSSETT 117
             N Y  +L     +ETT
Sbjct:   221 NIYTSKLYEFNLNETT 236


>FB|FBgn0037396
            symbol:CG11459 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE014297 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 KO:K01365 HSSP:P07711 EMBL:AY060710
            RefSeq:NP_649608.1 UniGene:Dm.3894 SMR:Q9VNK6 MEROPS:C01.A31
            EnsemblMetazoa:FBtr0078623 GeneID:40741 KEGG:dme:Dmel_CG11459
            UCSC:CG11459-RA FlyBase:FBgn0037396 InParanoid:Q9VNK6 OMA:NYDEREL
            OrthoDB:EOG4MGQPX ChiTaRS:CG11459 GenomeRNAi:40741 NextBio:820359
            Uniprot:Q9VNK6
        Length = 336

 Score = 426 (155.0 bits), Expect = 5.3e-40, P = 5.3e-40
 Identities = 108/324 (33%), Positives = 173/324 (53%)

Query:    33 ATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGINEFA 90
             A  +   + + A+Y + YR N +K  R  ++++ V  + S N    + K  +K+G+N+F+
Sbjct:    24 AVSDTEWDQYKAKYNKQYR-NRDKYHR-ALYEQRVLAVESHNQLYLQGKVAFKMGLNKFS 81

Query:    91 DQTNEEFRAPRNGYKRRLPSVRSSETTDVSF-RYENASVPASIDWRKKGAVTGVKDQG-Q 148
             D         R+     L +  ++ T  V++ RY+   +   IDWR+ G ++ V DQG +
Sbjct:    82 DTDQRILFNYRSSIPAPLETSTNALTETVNYKRYDQ--ITEGIDWRQYGYISPVGDQGTE 139

Query:   149 CGCCWAFSAVAAMEGINHITTR--KLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
             C  CWAFS    +E   H+  +   L  LS + LVDC     + GC GG +  AF +   
Sbjct:   140 CLSCWAFSTSGVLEA--HMAKKYGNLVPLSPKHLVDC-VPYPNNGCSGGWVSVAFNYT-R 195

Query:   207 NKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAID 265
             + G+AT+  YPY+   G C  K ++ SA  +SGY  + + +E  L + V N  PV+V+ID
Sbjct:   196 DHGIATKESYPYEPVSGECLWK-SDRSAGTLSGYVTLGNYDERELAEVVYNIGPVAVSID 254

Query:   266 ASGSDFQFYSSGVFT-GQCGT---ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGY 321
                 +F  YS GV +   C +   +L H V  VG+GT      YW++KNS+GT WGE+GY
Sbjct:   255 HLHEEFDQYSGGVLSIPACRSKRQDLTHSVLLVGFGTHRKWGDYWIIKNSYGTDWGESGY 314

Query:   322 IRMQRDIDAKEGLCGIAMQASYPT 345
             +++ R+ +    +CG+A    YPT
Sbjct:   315 LKLARNAN---NMCGVASLPQYPT 335


>DICTYBASE|DDB_G0281079
            symbol:DDB_G0281079 species:44689 "Dictyostelium
            discoideum" [GO:0030246 "carbohydrate binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR008965 InterPro:IPR013128 InterPro:IPR019028
            Pfam:PF00112 Pfam:PF09478 PRINTS:PR00705 SMART:SM00645
            SMART:SM01063 InterPro:IPR000169 dictyBase:DDB_G0281079
            GO:GO:0030246 EMBL:AAFI02000040 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 SUPFAM:SSF49384 RefSeq:XP_640804.1
            ProteinModelPortal:Q54UH2 EnsemblProtists:DDB0204000 GeneID:8622858
            KEGG:ddi:DDB_G0281079 InParanoid:Q54UH2 OMA:ALESHYY
            ProtClustDB:CLSZ2430562 Uniprot:Q54UH2
        Length = 664

 Score = 345 (126.5 bits), Expect = 1.7e-39, Sum P(2) = 1.7e-39
 Identities = 79/207 (38%), Positives = 114/207 (55%)

Query:   110 SVRSSETTDV--SFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHI 167
             S  S+ TTD     R    S P SIDWR  G V+ VK+QG CG C+AFS V A+E   + 
Sbjct:   450 SSSSNITTDEPSKSRLLKWSRPISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALESHYYR 509

Query:   168 TTRKLTSLSEQELVDCDTSGE--DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSC 225
                ++  LSEQ LVDC  S +  + GC GG M + + +I  N G+  E+ YPY+   G C
Sbjct:   510 KNNRMLDLSEQNLVDCTASNKYRNGGCSGGWMHNCYSYIQENGGINQESTYPYEGKFGQC 569

Query:   226 NKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVF-TGQC 283
              +  +  + ++IS +  +  ++E  L   VA+  PVSVA DAS  +F +YS G++ +  C
Sbjct:   570 -RYNSGDAQSRISKFVMIKQHDEEDLADTVASVGPVSVAYDASTREFMYYSRGIYYSDNC 628

Query:   284 GT-ELDHGVTAVGYGTADDGTKYWLVK 309
                   H V  VGY   ++G  YW++K
Sbjct:   629 NKYRTTHAVVVVGYDN-ENGVDYWIIK 654

 Score = 107 (42.7 bits), Expect = 1.7e-39, Sum P(2) = 1.7e-39
 Identities = 22/76 (28%), Positives = 43/76 (56%)

Query:    42 WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
             W  Q+ R YR + +  ++++ FK++  +I  +  + +N   +LG+ +F+D T++EF    
Sbjct:   164 WSNQFNRTYRAD-QFLLKYEAFKDSSRFIEQYKRENQNSTMELGLTQFSDMTHDEFL--- 219

Query:   102 NGYKRRLPSVRSSETT 117
             N Y  +L     +ETT
Sbjct:   220 NVYTSKLYEFNLNETT 235


>UNIPROTKB|P56202
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 GO:GO:0006955 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AF013611
            EMBL:AF015954 EMBL:AF055903 EMBL:AP001201 EMBL:BC048255
            IPI:IPI00328978 RefSeq:NP_001326.2 UniGene:Hs.416848
            ProteinModelPortal:P56202 SMR:P56202 STRING:P56202 MEROPS:C01.037
            PhosphoSite:P56202 DMDM:259016196 PaxDb:P56202 PRIDE:P56202
            Ensembl:ENST00000307886 GeneID:1521 KEGG:hsa:1521 UCSC:uc001ogc.1
            CTD:1521 GeneCards:GC11P065647 HGNC:HGNC:2546 HPA:CAB016345
            MIM:602364 neXtProt:NX_P56202 PharmGKB:PA27042 eggNOG:NOG288820
            HOVERGEN:HBG100117 InParanoid:P56202 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 PhylomeDB:P56202 GenomeRNAi:1521 NextBio:6295
            ArrayExpress:P56202 Bgee:P56202 CleanEx:HS_CTSW
            Genevestigator:P56202 GermOnline:ENSG00000172543 Uniprot:P56202
        Length = 376

 Score = 324 (119.1 bits), Expect = 1.3e-38, Sum P(2) = 1.3e-38
 Identities = 91/306 (29%), Positives = 139/306 (45%)

Query:    11 VLAAILVLGVWAP-QSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
             +L A L  G+  P ++         + E  +++  Q+ R Y    E   R  IF  N+  
Sbjct:    13 LLVAGLAQGIRGPLRAQDLGPQPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQ 72

Query:    70 IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVP 129
                   +      + G+  F+D T EEF     GY+R    V S    ++       SVP
Sbjct:    73 AQRLQEEDLGTA-EFGVTPFSDLTEEEF-GQLYGYRRAAGGVPSMGR-EIRSEEPEESVP 129

Query:   130 ASIDWRK-KGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
              S DWRK   A++ +KDQ  C CCWA +A   +E +  I+      +S QEL+DC   G+
Sbjct:   130 FSCDWRKVASAISPIKDQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVQELLDCGRCGD 189

Query:   189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSA-AKISGYED--VPS 245
               GC GG + DAF  +++N GLA+E  YP++   G       +P    K++  +D  +  
Sbjct:   190 --GCHGGFVWDAFITVLNNSGLASEKDYPFQ---GKVRAHRCHPKKYQKVAWIQDFIMLQ 244

Query:   246 NNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTGQ---CGTEL-DHGVTAVGYGTAD 300
             NNE  + + +A   P++V I+      Q Y  GV       C  +L DH V  VG+G+  
Sbjct:   245 NNEHRIAQYLATYGPITVTINMK--PLQLYRKGVIKATPTTCDPQLVDHSVLLVGFGSVK 302

Query:   301 DGTKYW 306
                  W
Sbjct:   303 SEEGIW 308

 Score = 105 (42.0 bits), Expect = 1.3e-38, Sum P(2) = 1.3e-38
 Identities = 18/35 (51%), Positives = 22/35 (62%)

Query:   303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
             T YW++KNSWG  WGE GY R+ R  +     CGI
Sbjct:   324 TPYWILKNSWGAQWGEKGYFRLHRGSNT----CGI 354


>WB|WBGene00012747
            symbol:Y40H7A.10 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000230773 EMBL:AL033510
            HSSP:P80067 MEROPS:C01.A48 PIR:T26792 RefSeq:NP_502836.1
            ProteinModelPortal:Q9XWA4 SMR:Q9XWA4 STRING:Q9XWA4
            EnsemblMetazoa:Y40H7A.10 GeneID:189809 KEGG:cel:CELE_Y40H7A.10
            UCSC:Y40H7A.10 CTD:189809 WormBase:Y40H7A.10 eggNOG:NOG286423
            InParanoid:Q9XWA4 OMA:NGPMIVC NextBio:943702 Uniprot:Q9XWA4
        Length = 343

 Score = 406 (148.0 bits), Expect = 7.0e-38, P = 7.0e-38
 Identities = 104/311 (33%), Positives = 161/311 (51%)

Query:    32 DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK-PYKLGINEFA 90
             D       + ++ +Y R Y +  E   RF IF  N++ +  +N +   K  Y+L  N+F+
Sbjct:    44 DVKYTNAFQNFLVKYLREYPNEYEIVKRFTIFSRNLDLVERYNKEDAGKVTYEL--NDFS 101

Query:    91 DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRK-KGA--VTGVKDQG 147
             D T EE++     Y        S ++       +  ++P S+DWR   G   VTG+K QG
Sbjct:   102 DLTEEEWKK----YLMTPKPDHSEKSLKPKTLIDKKNLPNSVDWRNVNGTNHVTGIKYQG 157

Query:   148 QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISN 207
              CG CWAF+  AA+E    I+   L SLS Q+L+DC T   D+ C GG   +A ++  S+
Sbjct:   158 PCGSCWAFATAAAIESAVSISGGGLQSLSSQQLLDC-TVVSDK-CGGGEPVEALKYAQSH 215

Query:   208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDAS 267
              G+ T   YPY      C  +E  P+ A+IS +    S +E A + A+ N P+ V  + +
Sbjct:   216 -GITTAHNYPYYFWTTKC--RETVPTVARISSWMKAESEDEMAQIVAL-NGPMIVCANFA 271

Query:   268 GSDFQFYSSGVFTG-QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
              +  +FY SG+     CGTE  H +  +GYG       YW++KN++   WGE GY+R++R
Sbjct:   272 TNKNRFYHSGIAEDPDCGTEPTHALIVIGYGP-----DYWILKNTYSKVWGEKGYMRVKR 326

Query:   327 DIDAKEGLCGI 337
             D++     CGI
Sbjct:   327 DVN----WCGI 333


>MGI|MGI:1338045
            symbol:Ctsw "cathepsin W" species:10090 "Mus musculus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 MGI:MGI:1338045 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 MEROPS:C01.037
            CTD:1521 eggNOG:NOG288820 HOVERGEN:HBG100117 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 EMBL:AF014941 EMBL:AC122861 IPI:IPI00111727
            RefSeq:NP_034115.2 UniGene:Mm.113590 ProteinModelPortal:P56203
            SMR:P56203 PhosphoSite:P56203 PRIDE:P56203 DNASU:13041
            Ensembl:ENSMUST00000025844 GeneID:13041 KEGG:mmu:13041
            InParanoid:P56203 NextBio:282936 Bgee:P56203 CleanEx:MM_CTSW
            Genevestigator:P56203 GermOnline:ENSMUSG00000024910 Uniprot:P56203
        Length = 371

 Score = 310 (114.2 bits), Expect = 6.4e-37, Sum P(2) = 6.4e-37
 Identities = 85/276 (30%), Positives = 134/276 (48%)

Query:    35 MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
             + E  +++  ++ R Y + AE   R  IF  N+        +      + G   F+D T 
Sbjct:    36 LKEVFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLGTA-EFGETPFSDLTE 94

Query:    95 EEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRK-KGAVTGVKDQGQCGCCW 153
             EEF     G +R  P    + T  V       SVP + DWRK K  ++ VK+QG C CCW
Sbjct:    95 EEF-GQLYGQERS-PERTPNMTKKVESNTWGESVPRTCDWRKAKNIISSVKNQGSCKCCW 152

Query:   154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
             A +A   ++ +  I  ++   +S QEL+DC+  G   GC GG + DA+  +++N GLA+E
Sbjct:   153 AMAAADNIQALWRIKHQQFVDVSVQELLDCERCGN--GCNGGFVWDAYLTVLNNSGLASE 210

Query:   214 AKYPYKASDGSCNKKEANPSAAKISGYEDVP--SNNEAALMKAVA-NQPVSVAIDASGSD 270
               YP++  D   ++  A     K++  +D    SNNE A+   +A + P++V I+     
Sbjct:   211 KDYPFQG-DRKPHRCLAK-KYKKVAWIQDFTMLSNNEQAIAHYLAVHGPITVTINMKL-- 266

Query:   271 FQFYSSGVFTG---QCGT-ELDHGVTAVGYGTADDG 302
              Q Y  GV       C   ++DH V  VG+G   +G
Sbjct:   267 LQHYQKGVIKATPSSCDPRQVDHSVLLVGFGKEKEG 302

 Score = 103 (41.3 bits), Expect = 6.4e-37, Sum P(2) = 6.4e-37
 Identities = 25/75 (33%), Positives = 34/75 (45%)

Query:   286 ELDHGVTAVGYGTADDGTK----------------YWLVKNSWGTTWGENGYIRMQRDID 329
             ++DH V  VG+G   +G +                YW++KNSWG  WGE GY R+ R   
Sbjct:   286 QVDHSVLLVGFGKEKEGMQTGTVLSHSRKRRHSSPYWILKNSWGAHWGEKGYFRLYRG-- 343

Query:   330 AKEGLCGIAMQASYP 344
                  CG+     YP
Sbjct:   344 --NNTCGVT---KYP 353


>RGD|1309354
            symbol:Ctsw "cathepsin W" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1309354 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 MEROPS:C01.037
            CTD:1521 eggNOG:NOG288820 HOVERGEN:HBG100117 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 EMBL:CH473953 EMBL:BC093401 IPI:IPI00371471
            RefSeq:NP_001019413.1 UniGene:Rn.34406 Ensembl:ENSRNOT00000037404
            GeneID:293676 KEGG:rno:293676 UCSC:RGD:1309354 InParanoid:Q561Q9
            NextBio:636716 Genevestigator:Q561Q9 Uniprot:Q561Q9
        Length = 371

 Score = 302 (111.4 bits), Expect = 1.0e-36, Sum P(2) = 1.0e-36
 Identities = 83/276 (30%), Positives = 135/276 (48%)

Query:    35 MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
             + E  +++  Q+ R Y + AE   R  IF  N+        +      + G   F+D T 
Sbjct:    36 LKEVFKLFQIQFNRSYSNPAEYTRRLGIFAHNLAQAQRLQEEDLGTA-EFGQTPFSDLTE 94

Query:    95 EEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRK-KGAVTGVKDQGQCGCCW 153
             EEF     G++R    + +      S R+   SVP + DWRK K  ++ +K+QG C CCW
Sbjct:    95 EEF-GQLYGHQRAPERILNMAKKVKSERW-GESVPPTCDWRKVKNIISSIKNQGNCRCCW 152

Query:   154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
             A +A   ++ +  I T++   +S QEL+DCD  G   GC GG + DA+  +++N GLA+E
Sbjct:   153 AIAAADNIQTLWRIKTQQFVDVSVQELLDCDRCGN--GCNGGFVWDAYITVLNNSGLASE 210

Query:   214 AKYPYKASDGSCNKKEANPSAAKISGYEDVP--SNNEAALMKAVA-NQPVSVAIDASGSD 270
               YP++      ++  A+    K++  +D    S+NE  +   +A + P++V I+     
Sbjct:   211 EDYPFQGHQKP-HRCLAD-KYRKVAWIQDFTMLSSNEQVIAGYLAIHGPITVTINMKL-- 266

Query:   271 FQFYSSGVFTGQ---CGTEL-DHGVTAVGYGTADDG 302
              Q+Y  GV       C   L +H V  VG+G    G
Sbjct:   267 LQYYQKGVIKATPSTCDPHLVNHSVLLVGFGKEKGG 302

 Score = 109 (43.4 bits), Expect = 1.0e-36, Sum P(2) = 1.0e-36
 Identities = 21/42 (50%), Positives = 24/42 (57%)

Query:   303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             T YW++KNSWG  WGE GY R+ R        CGIA    YP
Sbjct:   319 TPYWILKNSWGAEWGEKGYFRLYRG----NNTCGIA---KYP 353


>WB|WBGene00011102
            symbol:R07E3.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            EMBL:Z49207 HSSP:P53634 PIR:T24030 RefSeq:NP_001041280.1
            ProteinModelPortal:Q21810 SMR:Q21810 STRING:Q21810 MEROPS:C01.A43
            PaxDb:Q21810 EnsemblMetazoa:R07E3.1a GeneID:181242
            KEGG:cel:CELE_R07E3.1 UCSC:R07E3.1a CTD:181242 WormBase:R07E3.1a
            HOGENOM:HOG000021028 InParanoid:Q21810 OMA:ACKNEVI NextBio:913066
            ArrayExpress:Q21810 Uniprot:Q21810
        Length = 402

 Score = 394 (143.8 bits), Expect = 1.3e-36, P = 1.3e-36
 Identities = 108/309 (34%), Positives = 155/309 (50%)

Query:    45 QYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA---PR 101
             ++ + Y  + E   R   +    E IA++N +  +   + G N+ +D T+EEF     P+
Sbjct:    96 KFDKSYATSQESLKRLNAYYNTDENIANWNIQNEHGSAEYGHNDMSDWTDEEFEKTLLPK 155

Query:   102 NGYKRRLPSVRSSETTDVSF---RYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
             + YKR        E    S    + E++S  P   DWR K  +T VK QGQCG CWAF++
Sbjct:   156 SFYKRLHKEAEFIEPIPESLTAKKGESSSPFPDFFDWRDKNVITPVKAQGQCGSCWAFAS 215

Query:   158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
              A +E    I   +  +LSEQ L+DCD    D  C+GG  D AF +I  N GLA     P
Sbjct:   216 TATVEAAWAIAHGEKRNLSEQTLLDCDLV--DNACDGGDEDKAFRYIHRN-GLANAVDLP 272

Query:   218 YKA--SDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFY 274
             Y A   +G       N +  K + +     ++E +++  + N  PV++ + A     + Y
Sbjct:   273 YVAHRQNGCAVNDHWNTTRIKAAYFLH---HDEDSIINWLVNFGPVNIGM-AVIQPMRAY 328

Query:   275 SSGVFTGQ---CGTELD--HGVTAVGYGTADDGTKYWLVKNSWGTTWG-ENGYIRMQRDI 328
               GVFT     C  E+   H +   GYGT+  G KYW+VKNSWG TWG E+GYI   R I
Sbjct:   329 KGGVFTPSEYACKNEVIGLHALLITGYGTSKTGEKYWIVKNSWGNTWGVEHGYIYFARGI 388

Query:   329 DAKEGLCGI 337
             +A    CGI
Sbjct:   389 NA----CGI 393


>UNIPROTKB|F1MHV4
            symbol:CTSW "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 OMA:GRCGDGC EMBL:DAAA02063574
            IPI:IPI00716321 Ensembl:ENSBTAT00000027681 Uniprot:F1MHV4
        Length = 375

 Score = 303 (111.7 bits), Expect = 7.2e-36, Sum P(2) = 7.2e-36
 Identities = 83/271 (30%), Positives = 130/271 (47%)

Query:    35 MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
             + E   ++  QY R Y + AE   R  IF +N+        +      + G+ +F+D T 
Sbjct:    38 LKEVFRLFQMQYNRSYPNPAEYARRLDIFAQNLAKAQRLQEEDLGTA-EFGVTQFSDLTE 96

Query:    95 EEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWA 154
             EEF      Y  ++       +  V       S P + DWRK G ++ V+DQ  C CCWA
Sbjct:    97 EEFVQL---YGSQVAGEALGVSRKVGSEEWGESEPQTCDWRKVGTISPVRDQRNCNCCWA 153

Query:   155 FSAVAAMEGINHITTRKLTSLSEQ-ELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
              +A   +E +  I  R    +S Q EL+DCD  G   GC GG + DAF  +++N GLA+E
Sbjct:   154 MAAAGNIEALWAIKFRHFVEVSVQPELLDCDRCGN--GCRGGFVWDAFLTVLNNSGLASE 211

Query:   214 AKYPYKASDGSCNKKEANPSAAKISGYED--VPSNNEAALMKAVANQ-PVSVAIDASGSD 270
               YP+  S G  ++  A     K++  +D  +    E ++ + +A + P++V I+ +   
Sbjct:   212 KDYPFNGS-GKTHRCLAK-KYKKVAWIQDFIILQACEQSMARHLATEGPITVTINMTL-- 267

Query:   271 FQFYSSGVFTGQ---CG-TELDHGVTAVGYG 297
              Q Y  GV       C  T++DH V  VG+G
Sbjct:   268 LQQYQKGVIKATPTTCDPTQVDHSVLLVGFG 298

 Score = 100 (40.3 bits), Expect = 7.2e-36, Sum P(2) = 7.2e-36
 Identities = 17/33 (51%), Positives = 21/33 (63%)

Query:   305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
             YW++KNSWG  WGE GY R+ R  +     CGI
Sbjct:   325 YWILKNSWGPQWGEEGYFRLHRGSNT----CGI 353


>UNIPROTKB|E9PI30
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            EMBL:AP001201 HGNC:HGNC:2546 IPI:IPI00984532
            ProteinModelPortal:E9PI30 SMR:E9PI30 Ensembl:ENST00000528419
            ArrayExpress:E9PI30 Bgee:E9PI30 Uniprot:E9PI30
        Length = 364

 Score = 324 (119.1 bits), Expect = 9.2e-36, Sum P(2) = 9.2e-36
 Identities = 91/306 (29%), Positives = 139/306 (45%)

Query:    11 VLAAILVLGVWAP-QSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
             +L A L  G+  P ++         + E  +++  Q+ R Y    E   R  IF  N+  
Sbjct:    13 LLVAGLAQGIRGPLRAQDLGPQPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQ 72

Query:    70 IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVP 129
                   +      + G+  F+D T EEF     GY+R    V S    ++       SVP
Sbjct:    73 AQRLQEEDLGTA-EFGVTPFSDLTEEEF-GQLYGYRRAAGGVPSMGR-EIRSEEPEESVP 129

Query:   130 ASIDWRK-KGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
              S DWRK   A++ +KDQ  C CCWA +A   +E +  I+      +S QEL+DC   G+
Sbjct:   130 FSCDWRKVASAISPIKDQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVQELLDCGRCGD 189

Query:   189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSA-AKISGYED--VPS 245
               GC GG + DAF  +++N GLA+E  YP++   G       +P    K++  +D  +  
Sbjct:   190 --GCHGGFVWDAFITVLNNSGLASEKDYPFQ---GKVRAHRCHPKKYQKVAWIQDFIMLQ 244

Query:   246 NNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTGQ---CGTEL-DHGVTAVGYGTAD 300
             NNE  + + +A   P++V I+      Q Y  GV       C  +L DH V  VG+G+  
Sbjct:   245 NNEHRIAQYLATYGPITVTINMK--PLQLYRKGVIKATPTTCDPQLVDHSVLLVGFGSVK 302

Query:   301 DGTKYW 306
                  W
Sbjct:   303 SEEGIW 308

 Score = 78 (32.5 bits), Expect = 9.2e-36, Sum P(2) = 9.2e-36
 Identities = 11/16 (68%), Positives = 13/16 (81%)

Query:   303 TKYWLVKNSWGTTWGE 318
             T YW++KNSWG  WGE
Sbjct:   324 TPYWILKNSWGAQWGE 339


>UNIPROTKB|Q5T8F0
            symbol:CTSL1 "Cathepsin L1 light chain" species:9606 "Homo
            sapiens" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            EMBL:AL160279 UniGene:Hs.731507 UniGene:Hs.731952 HGNC:HGNC:2537
            ChiTaRS:CTSL1 IPI:IPI00640540 SMR:Q5T8F0 Ensembl:ENST00000342020
            ChEMBL:CHEMBL1293261 Uniprot:Q5T8F0
        Length = 225

 Score = 384 (140.2 bits), Expect = 1.5e-35, P = 1.5e-35
 Identities = 85/209 (40%), Positives = 120/209 (57%)

Query:    16 LVLGVWAPQSWSRTLN-DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN 74
             L+L  +     S TL  D ++  +   W A + R+Y  N E+  R  ++++N++ I   N
Sbjct:     5 LILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKMIELHN 63

Query:    75 NKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASI 132
              + R     + + +N F D T+EEFR   NG++ R P  R  +       YE    P S+
Sbjct:    64 QEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKP--RKGKVFQEPLFYE---APRSV 118

Query:   133 DWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGC 192
             DWR+KG VT VK+QGQCG CWAFSA  A+EG     T +L SLSEQ LVDC     ++GC
Sbjct:   119 DWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGC 178

Query:   193 EGGLMDDAFEFIISNKGLATEAKYPYKAS 221
              GGLMD AF+++  N GL +E  YPY+A+
Sbjct:   179 NGGLMDYAFQYVQDNGGLDSEESYPYEAT 207

 Score = 260 (96.6 bits), Expect = 2.1e-22, P = 2.1e-22
 Identities = 55/111 (49%), Positives = 69/111 (62%)

Query:   129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
             P S+DWR+KG VT VK+QGQCG CWAFSA  A+EG     T +L SLSEQ LVDC     
Sbjct:   115 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 174

Query:   189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
             ++GC GGLMD AF+++  N GL +E  YPY+A+               +SG
Sbjct:   175 NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEAT---------------VSG 210


>UNIPROTKB|P43234
            symbol:CTSO "Cathepsin O" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 Reactome:REACT_6900
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0004197
            CleanEx:HS_CTSO EMBL:X77383 EMBL:BC049206 IPI:IPI00017257
            PIR:A55090 RefSeq:NP_001325.1 UniGene:Hs.75262
            ProteinModelPortal:P43234 SMR:P43234 IntAct:P43234 STRING:P43234
            MEROPS:C01.035 PhosphoSite:P43234 DMDM:1168795 PRIDE:P43234
            DNASU:1519 Ensembl:ENST00000433477 GeneID:1519 KEGG:hsa:1519
            UCSC:uc003ipg.3 CTD:1519 GeneCards:GC04M156845 HGNC:HGNC:2542
            HPA:HPA002041 MIM:600550 neXtProt:NX_P43234 PharmGKB:PA27040
            HOVERGEN:HBG105050 InParanoid:P43234 KO:K01374 OMA:SNVCGIA
            OrthoDB:EOG4V6ZH1 PhylomeDB:P43234 BindingDB:P43234
            ChEMBL:CHEMBL3035 GenomeRNAi:1519 NextBio:6287 Bgee:P43234
            Genevestigator:P43234 GermOnline:ENSG00000151792 Uniprot:P43234
        Length = 321

 Score = 377 (137.8 bits), Expect = 8.3e-35, P = 8.3e-35
 Identities = 96/293 (32%), Positives = 145/293 (49%)

Query:    56 KEMRFKIFKENVEYIASFNN--KARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRS 113
             +E     F+E++      N+   + N     GIN+F+    EEF+A    Y R  PS   
Sbjct:    37 REREAAAFRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAI---YLRSKPSKFP 93

Query:   114 SETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLT 173
               + +V     N S+P   DWR K  VT V++Q  CG CWAFS V A+E    I  + L 
Sbjct:    94 RYSAEVHMSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLE 153

Query:   174 SLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK-GLATEAKYPYKASDGSCNKKEANP 232
              LS Q+++DC  S  + GC GG   +A  ++   +  L  +++YP+KA +G C+    + 
Sbjct:   154 DLSVQQVIDC--SYNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSGSH 211

Query:   233 SAAKISGYEDVP-SNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFTGQCGT-ELDH 289
             S   I GY     S+ E  + KA+    P+ V +DA    +Q Y  G+    C + E +H
Sbjct:   212 SGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVS--WQDYLGGIIQHHCSSGEANH 269

Query:   290 GVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQAS 342
              V   G+      T YW+V+NSWG++WG +GY  ++        +CGIA   S
Sbjct:   270 AVLITGFDKTGS-TPYWIVRNSWGSSWGVDGYAHVKMG----SNVCGIADSVS 317


>WB|WBGene00013076
            symbol:Y51A2D.8 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            EMBL:AL021497 HSSP:P53634 HOGENOM:HOG000019851 PIR:T27079
            RefSeq:NP_507627.1 ProteinModelPortal:Q9XXQ7 SMR:Q9XXQ7
            MEROPS:C01.A49 EnsemblMetazoa:Y51A2D.8 GeneID:180208
            KEGG:cel:CELE_Y51A2D.8 UCSC:Y51A2D.8 CTD:180208 WormBase:Y51A2D.8
            eggNOG:NOG307864 InParanoid:Q9XXQ7 OMA:VAVYFKV NextBio:908434
            Uniprot:Q9XXQ7
        Length = 386

 Score = 301 (111.0 bits), Expect = 1.3e-34, Sum P(2) = 1.3e-34
 Identities = 75/248 (30%), Positives = 118/248 (47%)

Query:   100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKK---GA-VTG-VKDQGQCGCCWA 154
             P   + ++ P  R+++      +  +   P   D R +   G  + G +KDQGQC CCW 
Sbjct:   119 PMLNFDKKKPDFRAADMNKTRHKRRSTRYPDYFDLRNEKINGRYIVGPIKDQGQCACCWG 178

Query:   155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
             F+  A +E +    + K  SLS+QE+ DC T G   GC+GG +    +++    GL+ + 
Sbjct:   179 FAVTALVETVYAAHSGKFKSLSDQEVCDCGTEGTP-GCKGGSLTLGVQYV-KKYGLSGDE 236

Query:   215 KYPY---KASDGS-CNKKEANPSA-AKISGYEDV-PSNNEAALMKAVANQPVSVAIDAS- 267
              YPY   +A+ G  C  +E +    A+   +  + P   E  +++ +    V VA+    
Sbjct:   237 DYPYDQNRANQGRRCRLRETDRIVPARAFNFAVINPRRAEEQIIQVLTEWKVPVAVYFKV 296

Query:   268 GSDFQFYSSGVFT-GQCGTELD-HGVTAVGYGTADDGT----KYWLVKNSWGTTWGENGY 321
             G  F+ Y  GV     C      H    VGY T +D       YW++KNSWG  W E+GY
Sbjct:   297 GDQFKEYKEGVIIEDDCRRATQWHAGAIVGYDTVEDSRGRSHDYWIIKNSWGGDWAESGY 356

Query:   322 IRMQRDID 329
             +R+ R  D
Sbjct:   357 VRVVRGRD 364

 Score = 90 (36.7 bits), Expect = 1.3e-34, Sum P(2) = 1.3e-34
 Identities = 19/60 (31%), Positives = 33/60 (55%)

Query:    40 EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPY--KLGINEFADQTNEEF 97
             E +  +Y R Y+D +E + RF  F ++   +   N K++   Y  + GIN+F+D +  EF
Sbjct:    44 EDFKKKYNRKYKDESENQQRFNNFVKSYNNVDKLNAKSKAAGYDTQFGINKFSDLSTAEF 103


>ZFIN|ZDB-GENE-080724-8
            symbol:ctso "cathepsin O" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            ZFIN:ZDB-GENE-080724-8 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 CTD:1519 KO:K01374 EMBL:CR931784
            IPI:IPI00513613 RefSeq:XP_695717.3 UniGene:Dr.88386
            Ensembl:ENSDART00000074786 GeneID:567333 KEGG:dre:567333
            NextBio:20888622 Uniprot:E7FA09
        Length = 334

 Score = 373 (136.4 bits), Expect = 2.2e-34, P = 2.2e-34
 Identities = 93/304 (30%), Positives = 158/304 (51%)

Query:    50 YRDNAEKEMRFKIFKENVEYIAS-----FNNKA---RNKPYKLGINEFADQTNEEFRAPR 101
             + D  ++++  ++++  + Y +S     F N A    N+  + G+N+F+  + ++F+   
Sbjct:    38 HSDTFQQDVNNELYQRWINYQSSLQRQAFLNSALGKSNQSAQYGVNQFSYLSQKQFKEQY 97

Query:   102 -NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAA 160
                     P    S++ ++  +  N   P   DWR  G V  V +QG CG CWAFS V A
Sbjct:    98 LTARAEAAPKFDQSKS-EIKVKANN---PPRFDWRDHGVVGPVHNQGSCGGCWAFSIVEA 153

Query:   161 MEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK-GLATEAKYPYK 219
             +E ++     KL  LS Q+++DC  S ++QGC GG   +A  ++  +K  L +EA+YP+K
Sbjct:   154 IESVSAKGGEKLQQLSVQQVIDC--SYQNQGCNGGSPVEALYWLTQSKLKLVSEAEYPFK 211

Query:   220 ASDGSCN---KKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYS 275
              +DG C    +  A  +    S Y+   S  E  +M A+ +  P+ V +DA    +Q Y 
Sbjct:   212 GADGVCQFFPQAHAGVAVRNYSAYDF--SGQEEVMMSALVDFGPLVVIVDAIS--WQDYL 267

Query:   276 SGVFTGQCGT-ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
              G+    C + + +H V   GY T  +   YW+V+NSWGT+WG++GY  ++   D    +
Sbjct:   268 GGIIQHHCSSHKANHAVLITGYDTTGE-VPYWIVRNSWGTSWGDDGYAYIKIGND----V 322

Query:   335 CGIA 338
             CG+A
Sbjct:   323 CGVA 326


>GENEDB_PFALCIPARUM|PF14_0553
            symbol:PF14_0553 "cysteine proteinase
            falcipain-1" species:5833 "Plasmodium falciparum" [GO:0042540
            "hemoglobin catabolic process" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014187 KO:K01376 HSSP:P07688 GO:GO:0042540
            RefSeq:XP_001348727.1 ProteinModelPortal:Q8I6V0 PRIDE:Q8I6V0
            EnsemblProtists:PF14_0553:mRNA GeneID:812135 KEGG:pfa:PF14_0553
            EuPathDB:PlasmoDB:PF3D7_1458000 HOGENOM:HOG000065906
            ProtClustDB:CLSZ2457715 ChEMBL:CHEMBL1250371 Uniprot:Q8I6V0
        Length = 569

 Score = 275 (101.9 bits), Expect = 8.0e-34, Sum P(2) = 8.0e-34
 Identities = 75/252 (29%), Positives = 124/252 (49%)

Query:    53 NAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVR 112
             N  K  +  ++K+ V   + ++ +   + +K  ++   +   E++  P   + +   ++ 
Sbjct:   258 NHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLH-VPNHMIEKYSKPFENHLK--DNIL 314

Query:   113 SSETTDVSFRYEN---ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITT 169
              SE      R E    + VP  +D+R+KG V   KDQG CG CWAF++V  +E +     
Sbjct:   315 ISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKN 374

Query:   170 RKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGS-CNKK 228
             + + S SEQE+VDC  S ++ GC+GG    +F +++ N+ L    +Y YKA D   C   
Sbjct:   375 KNILSFSEQEVVDC--SKDNFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKDDMFCLNY 431

Query:   229 EANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD 288
                   + +S    V  N     +  V   P+SV +  + +DF  YS GV+ G C  EL+
Sbjct:   432 RCKRKVS-LSSIGAVKENQLILALNEVG--PLSVNVGVN-NDFVAYSEGVYNGTCSEELN 487

Query:   289 HGVTAVGYGTAD 300
             H V  VGYG  +
Sbjct:   488 HSVLLVGYGQVE 499

 Score = 123 (48.4 bits), Expect = 8.0e-34, Sum P(2) = 8.0e-34
 Identities = 18/40 (45%), Positives = 26/40 (65%)

Query:   305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             YW++KNSW   WGENG++R+ R+ +     CGI  +  YP
Sbjct:   528 YWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYP 567

 Score = 100 (40.3 bits), Expect = 1.6e-10, Sum P(2) = 1.6e-10
 Identities = 22/70 (31%), Positives = 41/70 (58%)

Query:    30 LNDATMNERHEMWMAQYGRVYRDNAEKEMR-FKIFKENVEYIASFNNKARNKPYKLGINE 88
             +N+     +   +M ++ +VY+ N +++MR F+IFK N   I + N   +N  YK  +N+
Sbjct:   216 INNIKYASKFFKFMKEHNKVYK-NIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQ 274

Query:    89 FADQTNEEFR 98
             F+D + EE +
Sbjct:   275 FSDYSEEELK 284


>UNIPROTKB|Q8I6V0
            symbol:PF14_0553 "Cysteine proteinase falcipain-1"
            species:36329 "Plasmodium falciparum 3D7" [GO:0042540 "hemoglobin
            catabolic process" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014187 KO:K01376 HSSP:P07688 GO:GO:0042540
            RefSeq:XP_001348727.1 ProteinModelPortal:Q8I6V0 PRIDE:Q8I6V0
            EnsemblProtists:PF14_0553:mRNA GeneID:812135 KEGG:pfa:PF14_0553
            EuPathDB:PlasmoDB:PF3D7_1458000 HOGENOM:HOG000065906
            ProtClustDB:CLSZ2457715 ChEMBL:CHEMBL1250371 Uniprot:Q8I6V0
        Length = 569

 Score = 275 (101.9 bits), Expect = 8.0e-34, Sum P(2) = 8.0e-34
 Identities = 75/252 (29%), Positives = 124/252 (49%)

Query:    53 NAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVR 112
             N  K  +  ++K+ V   + ++ +   + +K  ++   +   E++  P   + +   ++ 
Sbjct:   258 NHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLH-VPNHMIEKYSKPFENHLK--DNIL 314

Query:   113 SSETTDVSFRYEN---ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITT 169
              SE      R E    + VP  +D+R+KG V   KDQG CG CWAF++V  +E +     
Sbjct:   315 ISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKN 374

Query:   170 RKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGS-CNKK 228
             + + S SEQE+VDC  S ++ GC+GG    +F +++ N+ L    +Y YKA D   C   
Sbjct:   375 KNILSFSEQEVVDC--SKDNFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKDDMFCLNY 431

Query:   229 EANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD 288
                   + +S    V  N     +  V   P+SV +  + +DF  YS GV+ G C  EL+
Sbjct:   432 RCKRKVS-LSSIGAVKENQLILALNEVG--PLSVNVGVN-NDFVAYSEGVYNGTCSEELN 487

Query:   289 HGVTAVGYGTAD 300
             H V  VGYG  +
Sbjct:   488 HSVLLVGYGQVE 499

 Score = 123 (48.4 bits), Expect = 8.0e-34, Sum P(2) = 8.0e-34
 Identities = 18/40 (45%), Positives = 26/40 (65%)

Query:   305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             YW++KNSW   WGENG++R+ R+ +     CGI  +  YP
Sbjct:   528 YWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYP 567

 Score = 100 (40.3 bits), Expect = 1.6e-10, Sum P(2) = 1.6e-10
 Identities = 22/70 (31%), Positives = 41/70 (58%)

Query:    30 LNDATMNERHEMWMAQYGRVYRDNAEKEMR-FKIFKENVEYIASFNNKARNKPYKLGINE 88
             +N+     +   +M ++ +VY+ N +++MR F+IFK N   I + N   +N  YK  +N+
Sbjct:   216 INNIKYASKFFKFMKEHNKVYK-NIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQ 274

Query:    89 FADQTNEEFR 98
             F+D + EE +
Sbjct:   275 FSDYSEEELK 284


>UNIPROTKB|E2RPX3
            symbol:CTSW "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 CTD:1521 KO:K08569 OMA:GRCGDGC
            EMBL:AAEX03011632 RefSeq:XP_540846.2 Ensembl:ENSCAFT00000020910
            GeneID:483725 KEGG:cfa:483725 Uniprot:E2RPX3
        Length = 374

 Score = 281 (104.0 bits), Expect = 9.0e-34, Sum P(2) = 9.0e-34
 Identities = 78/264 (29%), Positives = 117/264 (44%)

Query:    41 MWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAP 100
             ++  QY R Y +  E   R  IF  N+       ++      + G+  F+D T EEF   
Sbjct:    44 LFQIQYNRSYSNPEEYARRLDIFAHNLAQAQQLEDEDLGTA-EFGVTPFSDLTEEEF-GQ 101

Query:   101 RNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRK-KGAVTGVKDQGQCGCCWAFSAVA 159
               G++R +     S    V        VP + DWRK  G ++ +K QG C CCWA +A  
Sbjct:   102 FYGHQR-MAGEAPSVGRKVESEEWGEPVPPTCDWRKLPGIISPIKQQGNCRCCWAMAAAG 160

Query:   160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPY- 218
              +E +  I   +   +S QEL+DC   G+  GC+GG   DAF  +++N GLA+   YP+ 
Sbjct:   161 NIEALWGIRYHQPVEVSVQELLDCGRCGD--GCKGGFTWDAFITVLNNSGLASAKDYPFL 218

Query:   219 -KASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSG 277
                    C  K+     A I  +  +  N +A         P++V I+      Q Y  G
Sbjct:   219 GNTKPHRCLAKKYK-KVAWIQDFIMLQGNEQAIAWYLATKGPITVTINMKL--LQHYQKG 275

Query:   278 VFTGQ---CGTE-LDHGVTAVGYG 297
             V       C  + +DH V  VG+G
Sbjct:   276 VIQATHTTCDPQRVDHSVLLVGFG 299

 Score = 102 (41.0 bits), Expect = 9.0e-34, Sum P(2) = 9.0e-34
 Identities = 19/40 (47%), Positives = 22/40 (55%)

Query:   305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             YW++KNSWG  WGE GY R+ R        CGI     YP
Sbjct:   324 YWILKNSWGAEWGEEGYFRLHRG----NNTCGIT---KYP 356


>UNIPROTKB|F1PGK4
            symbol:CTSO "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 OMA:SNVCGIA
            EMBL:AAEX03010073 Ensembl:ENSCAFT00000013638 Uniprot:F1PGK4
        Length = 316

 Score = 366 (133.9 bits), Expect = 1.2e-33, P = 1.2e-33
 Identities = 93/286 (32%), Positives = 141/286 (49%)

Query:    63 FKENVEYIASFNN--KARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
             F+E++      N+     N     GIN+F+  + EEF+A    Y R  PS       +V 
Sbjct:    39 FRESLNRHRYLNSVFPRENSSAVYGINQFSYLSPEEFKAI---YLRSKPSRSPRYPAEVR 95

Query:   121 FRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQEL 180
                 N S+P   DWR K  VT V++Q  CG CWAFS V A+E    I  + L  +S Q++
Sbjct:    96 TSIRNVSLPLRFDWRDKRVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGKPLADISVQQV 155

Query:   181 VDCDTSGEDQGCEGGLMDDAFEFIISNK-GLATEAKYPYKASDGSCNKKEANPSAAKISG 239
             +DC  S  + GC GG   +A  ++   +  L  +++YP+KA +G C+    + S   I G
Sbjct:   156 IDC--SYNNYGCSGGSTLNALNWLNKTQVKLVRDSEYPFKAQNGLCHYFSDSYSGFSIRG 213

Query:   240 YEDVP-SNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFTGQCGT-ELDHGVTAVGY 296
             Y     S+ E  + K +    P+ V +DA    +Q Y  G+    C + E +H V   G+
Sbjct:   214 YSAYDFSDQEDEMAKVLLTFGPLVVVVDAVS--WQDYLGGIIQHHCSSGEANHAVLITGF 271

Query:   297 GTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQAS 342
                   T YW+V+NSWG++WG +GY  ++        +CGIA   S
Sbjct:   272 DKIGS-TPYWIVRNSWGSSWGVDGYAHVKMG----GNICGIADSVS 312


>UNIPROTKB|F1RU23
            symbol:CTSW "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 KO:K08569 EMBL:CU928325
            RefSeq:XP_003122571.1 UniGene:Ssc.28940 Ensembl:ENSSSCT00000014177
            GeneID:100525853 KEGG:ssc:100525853 OMA:CWAMAAV Uniprot:F1RU23
        Length = 367

 Score = 364 (133.2 bits), Expect = 2.0e-33, P = 2.0e-33
 Identities = 105/330 (31%), Positives = 155/330 (46%)

Query:    35 MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
             + E   ++  QY R Y + AE   R  IF +N+        +      + G+  F+D T 
Sbjct:    38 LKEVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTA-EFGVTPFSDLTE 96

Query:    95 EEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKK-GAVTGVKDQGQCGCCW 153
             EEF    +G+        S     V       +VP S DWRKK G ++ +K Q  C CCW
Sbjct:    97 EEF-GQLHGHHWGAGKAPSMGIK-VGSEESGETVPQSCDWRKKPGVISAIKHQKDCNCCW 154

Query:   154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
             A +AV  +E    I   +   LS Q+++DCD  G   GC GG + DAF  +++  GLA+E
Sbjct:   155 AMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRCGN--GCNGGFVWDAFLTVLNTSGLASE 212

Query:   214 AKYPYKASDGS--CNKKEANPSAAKISGYEDVPSNN--EAALMKAVANQ-PVSVAIDASG 268
               YPYK +  +  C  K+      K++  +D       E ++ + +A + P++V I+A G
Sbjct:   213 QDYPYKGTVKTHRCLAKQHR----KVAWIQDFLMLQFCEQSIARYLATEGPITVTINA-G 267

Query:   269 SDFQFYSSGVFTGQ---CGTEL-DHGVTAVGYGTAD--DGTK--------YWLVKNSWGT 314
                Q Y  GV       C   L +H V  VG+G +   +G +        YW++KNSWG 
Sbjct:   268 L-LQQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGRRPRPGHSIPYWILKNSWGP 326

Query:   315 TWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
              WGE GY R+ R  +     CGI     YP
Sbjct:   327 DWGEEGYFRLHRGSNT----CGIT---KYP 349


>UNIPROTKB|E1BPI9
            symbol:CTSO "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 CTD:1519 KO:K01374 OMA:SNVCGIA
            EMBL:DAAA02044933 IPI:IPI01004081 RefSeq:XP_002694471.2
            RefSeq:XP_874012.4 Ensembl:ENSBTAT00000014691 GeneID:616804
            KEGG:bta:616804 Uniprot:E1BPI9
        Length = 313

 Score = 363 (132.8 bits), Expect = 2.5e-33, P = 2.5e-33
 Identities = 93/265 (35%), Positives = 136/265 (51%)

Query:    85 GINEFADQTNEEFRAPR-NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGV 143
             GIN+F+    EEF+A        R P   + E T +S    N S+P   DWR K  VT V
Sbjct:    60 GINQFSYLFPEEFKAIYLRSSPSRFPRFPAEEYTSIS----NLSLPLRFDWRDKHVVTQV 115

Query:   144 KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEF 203
             ++Q  CG CWAFS V A+E +  I  + L  LS Q+++DC  S  + GC GG    A  +
Sbjct:   116 RNQKTCGGCWAFSVVGAVESVCAIKGQPLEVLSVQQVIDCSYS--NYGCNGGSPLSALYW 173

Query:   204 IISNK---GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP-SNNEAALMKAV-ANQ 258
             +  NK    L  +++YP++A +G C     + S + I GY     S  E  + +A+ A  
Sbjct:   174 L--NKLQVKLVRDSEYPFQAQNGLCRYFSDSHSGSSIKGYSAYDFSGQEDKMAEALLALG 231

Query:   259 PVSVAIDASGSDFQFYSSGVFTGQCGT-ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWG 317
             P+ V +DA    +Q Y  G+    C + E +H V   G+        YW+V+NSWGT+WG
Sbjct:   232 PLIVVVDAMS--WQDYLGGIIQHHCSSGEANHAVLVTGFDKTGS-IPYWIVRNSWGTSWG 288

Query:   318 ENGYIRMQRDIDAKEGLCGIAMQAS 342
              +GY+R++        +CGIA   S
Sbjct:   289 IDGYVRVKMG----GNVCGIADSVS 309


>WB|WBGene00013764
            symbol:Y113G7B.15 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 GeneTree:ENSGT00560000076599
            EMBL:AL110477 HOGENOM:HOG000019851 RefSeq:NP_507904.2
            ProteinModelPortal:Q9U2X1 SMR:Q9U2X1 DIP:DIP-25339N IntAct:Q9U2X1
            MINT:MINT-1058673 STRING:Q9U2X1 MEROPS:C01.A47
            EnsemblMetazoa:Y113G7B.15 GeneID:190976 KEGG:cel:CELE_Y113G7B.15
            UCSC:Y113G7B.15 CTD:190976 WormBase:Y113G7B.15 eggNOG:NOG302449
            OMA:AEEDIME Uniprot:Q9U2X1
        Length = 362

 Score = 358 (131.1 bits), Expect = 8.5e-33, P = 8.5e-33
 Identities = 95/289 (32%), Positives = 149/289 (51%)

Query:    68 EYIASFNNKARNKPYKLGINEFADQTNEEFRA------PRNG-----YKRRLPSVRSSET 116
             E  A    + RN  +  G N+FAD+  +E  A      P+N      YK R P    +  
Sbjct:    63 ELNAKARREGRNVTF--GWNKFADKNRQELSARNSKIHPKNHTDLPIYKPRHPRGSRNHH 120

Query:   117 TDVSFRYENASVPASIDWRK---KGA-VTG-VKDQGQCGCCWAFSAVAAMEGINHITTRK 171
                S R ++  +P   D R     G+ V G VKDQ QCGCCWAF+  A  E  N + ++ 
Sbjct:   121 NKRSKR-QSGDIPDYFDLRDIYVDGSPVVGPVKDQEQCGCCWAFATTAITEAANTLYSKS 179

Query:   172 LTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKA----SDGSC-- 225
              TSLS+QE+ DC  SG+  GC GG   +  + ++  +G +++  YPY+     + G+C  
Sbjct:   180 FTSLSDQEICDCADSGDTPGCVGGDPRNGLK-MVHLRGQSSDGDYPYEEYRANTTGNCVG 238

Query:   226 NKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDAS-GSDFQFYSSGVFTGQ-C 283
             ++K        ++ Y       E  +M+ +    +  A+    G +F++Y+SGV   + C
Sbjct:   239 DEKSTVIQPETLNVYRFDQDYAEEDIMENLYLNHIPTAVYFRVGENFEWYTSGVLQSEDC 298

Query:   284 G--TELD-HGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
                T  + H V  VGYGT+DDG  YWLV+NSW + WG +GY++++R ++
Sbjct:   299 YQMTPAEWHSVAIVGYGTSDDGVPYWLVRNSWNSDWGLHGYVKIRRGVN 347


>DICTYBASE|DDB_G0276111
            symbol:DDB_G0276111 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0276111 Pfam:PF00188
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            PROSITE:PS00139 EMBL:AAFI02000014 Gene3D:3.40.33.10
            InterPro:IPR001283 InterPro:IPR014044 PRINTS:PR00837 SMART:SM00198
            SUPFAM:SSF55797 ProtClustDB:CLSZ2429919 RefSeq:XP_643261.1
            ProteinModelPortal:Q75JH0 EnsemblProtists:DDB0169514 GeneID:8620304
            KEGG:ddi:DDB_G0276111 InParanoid:Q75JH0 OMA:GFVTSIK Uniprot:Q75JH0
        Length = 415

 Score = 351 (128.6 bits), Expect = 4.7e-32, P = 4.7e-32
 Identities = 85/234 (36%), Positives = 127/234 (54%)

Query:   109 PSVRSSETTDVSFRYE-NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGI--- 164
             PSV+   + +VS R+    S    +DW+  G VT +K+QGQCG C++F+  AA+E     
Sbjct:   190 PSVKPI-SINVSSRFILPTSSTGDVDWKSLGFVTSIKNQGQCGGCYSFATCAALESAYLI 248

Query:   165 -NHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDG 223
              N++    +  LSEQ  V C     + GC GG      + + S  G+  E  YPYKA  G
Sbjct:   249 KNNLPNTDI-DLSEQNFVSC----VNYGCGGGNGQSCLDKLKST-GIMYETSYPYKAVTG 302

Query:   224 SCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQC 283
             SC     +P   K +GY ++  N EA L  A+ + P+  ++    S FQ Y SG+++   
Sbjct:   303 SCPNVIQSPQPFKWTGYSNIQGNKEAFL-NALKSGPIYASLYVD-SGFQLYKSGIYSCSQ 360

Query:   284 GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
              +  +H +T VGY +AD+    +L+KNSWGT +GE+GYIR+      KEG C +
Sbjct:   361 SSTPNHAITIVGYSSADNS---YLIKNSWGTIYGESGYIRL------KEGSCNL 405


>UNIPROTKB|F1STR1
            symbol:CTSC "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0004252
            "serine-type endopeptidase activity" evidence=IEA] [GO:0001913 "T
            cell mediated cytotoxicity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 KO:K01275 InterPro:IPR014882
            Pfam:PF08773 OMA:YDDFLHY GO:GO:0001913 EMBL:CU855751
            RefSeq:XP_003129789.1 UniGene:Ssc.6155 Ensembl:ENSSSCT00000016280
            GeneID:100522387 KEGG:ssc:100522387 Uniprot:F1STR1
        Length = 463

 Score = 345 (126.5 bits), Expect = 2.0e-31, P = 2.0e-31
 Identities = 99/299 (33%), Positives = 152/299 (50%)

Query:    55 EKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSS 114
             +K+   +++K N +++ + N   ++        E+   T +E      GY +RLP  + +
Sbjct:   160 QKKYSNRLYKYNHDFVKAINGIQKSWT-ATAYMEYETLTLKEMTQRGGGYNQRLPRPKPA 218

Query:   115 ETTDVSFRYENASVPASIDWRK-KGA--VTGVKDQGQCGCCWAFSAVAAMEG-INHITTR 170
               T    + ++  +PAS DWR  +G   VT V++Q  CG C++F+++  ME  I  +T  
Sbjct:   219 PIT-AEIQEKSLHLPASWDWRNVRGTNFVTPVRNQASCGSCYSFASMGMMEARIRILTNN 277

Query:   171 KLTS-LSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK-----GLATEAKYPYKASDGS 224
               T  LS QE+V C  S   QGC GG     F ++I+ K     GL  EA +PY  +D  
Sbjct:   278 TQTPILSPQEVVSC--SQYAQGCAGG-----FPYLIAGKYAQDFGLVEEACFPYTGTDSP 330

Query:   225 CNKKEA-----NPSAAKISGYEDVPSNNEAAL-MKAVANQPVSVAIDASGSDFQFYSSGV 278
             C  KE      +     + G+      NEA + ++ V + P++VA +    DF  Y  G+
Sbjct:   331 CTVKEGCFRYYSSEYHYVGGFYG--GCNEALMKLELVHHGPMAVAFEVY-DDFLHYRKGI 387

Query:   279 F--TGQCGT----EL-DHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
             +  TG        EL +H V  VGYGT    G  YW+VKNSWGT+WGE+GY R++R  D
Sbjct:   388 YHHTGLRDPFNPFELTNHAVLLVGYGTDLASGMDYWIVKNSWGTSWGEDGYFRIRRGTD 446


>UNIPROTKB|F1N455
            symbol:CTSC "Dipeptidyl peptidase 1 exclusion domain chain"
            species:9913 "Bos taurus" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 IPI:IPI00697314 UniGene:Bt.49573
            InterPro:IPR014882 Pfam:PF08773 OMA:YDDFLHY GO:GO:0001913
            EMBL:DAAA02062487 EMBL:DAAA02062488 Ensembl:ENSBTAT00000014735
            Uniprot:F1N455
        Length = 463

 Score = 340 (124.7 bits), Expect = 6.9e-31, P = 6.9e-31
 Identities = 101/294 (34%), Positives = 148/294 (50%)

Query:    61 KIFKENVEYIASFNNKARNKPYKLG-INEFADQTNEEFRAPRNGYKRRLPSVRSSETTDV 119
             ++++ N +++ + N  A  K +      E+   T +E      G+ RR+P  + +  T  
Sbjct:   166 RLYRYNHDFVKAIN--AIQKSWTAAPYMEYETLTLKEMIRRGGGHSRRIPRPKPAPIT-A 222

Query:   120 SFRYENASVPASIDWRK-KGA--VTGVKDQGQCGCCWAFSAVAAMEG-INHITTRKLTS- 174
               + +   +P S DWR   G   VT V++QG CG C++F+++  ME  I  +T    T  
Sbjct:   223 EIQKKILHLPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPI 282

Query:   175 LSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK-----GLATEAKYPYKASDGSCNKKE 229
             LS QE+V C  S   QGCEGG     F ++I+ K     GL  E  +PY  +D  C  KE
Sbjct:   283 LSPQEVVSC--SQYAQGCEGG-----FPYLIAGKYAQDFGLVEEDCFPYTGTDSPCRLKE 335

Query:   230 A-----NPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVF--TG 281
                   +     + G+      NEA +   + +Q P++VA +    DF  Y  GV+  TG
Sbjct:   336 GCFRYYSSEYHYVGGFYG--GCNEALMKLELVHQGPMAVAFEVY-DDFLHYRKGVYHHTG 392

Query:   282 QCGT----EL-DHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
                     EL +H V  VGYGT A  G  YW+VKNSWGT+WGENGY R++R  D
Sbjct:   393 LRDPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTD 446


>UNIPROTKB|Q3ZCJ8
            symbol:CTSC "Dipeptidyl peptidase 1" species:9913 "Bos
            taurus" [GO:0031638 "zymogen activation" evidence=IDA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 EMBL:BC102115 IPI:IPI00697314 RefSeq:NP_001028789.1
            UniGene:Bt.49573 ProteinModelPortal:Q3ZCJ8 SMR:Q3ZCJ8 STRING:Q3ZCJ8
            PRIDE:Q3ZCJ8 GeneID:352958 KEGG:bta:352958 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 InParanoid:Q3ZCJ8 KO:K01275
            OrthoDB:EOG4H19VZ BindingDB:Q3ZCJ8 ChEMBL:CHEMBL1075050
            NextBio:20812686 GO:GO:0031638 InterPro:IPR014882 Pfam:PF08773
            Uniprot:Q3ZCJ8
        Length = 463

 Score = 340 (124.7 bits), Expect = 6.9e-31, P = 6.9e-31
 Identities = 101/294 (34%), Positives = 148/294 (50%)

Query:    61 KIFKENVEYIASFNNKARNKPYKLG-INEFADQTNEEFRAPRNGYKRRLPSVRSSETTDV 119
             ++++ N +++ + N  A  K +      E+   T +E      G+ RR+P  + +  T  
Sbjct:   166 RLYRYNHDFVKAIN--AIQKSWTAAPYMEYETLTLKEMIRRGGGHSRRIPRPKPAPIT-A 222

Query:   120 SFRYENASVPASIDWRK-KGA--VTGVKDQGQCGCCWAFSAVAAMEG-INHITTRKLTS- 174
               + +   +P S DWR   G   VT V++QG CG C++F+++  ME  I  +T    T  
Sbjct:   223 EIQKKILHLPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPI 282

Query:   175 LSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK-----GLATEAKYPYKASDGSCNKKE 229
             LS QE+V C  S   QGCEGG     F ++I+ K     GL  E  +PY  +D  C  KE
Sbjct:   283 LSPQEVVSC--SQYAQGCEGG-----FPYLIAGKYAQDFGLVEEDCFPYTGTDSPCRLKE 335

Query:   230 A-----NPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVF--TG 281
                   +     + G+      NEA +   + +Q P++VA +    DF  Y  GV+  TG
Sbjct:   336 GCFRYYSSEYHYVGGFYG--GCNEALMKLELVHQGPMAVAFEVY-DDFLHYRKGVYHHTG 392

Query:   282 QCGT----EL-DHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
                     EL +H V  VGYGT A  G  YW+VKNSWGT+WGENGY R++R  D
Sbjct:   393 LRDPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTD 446


>WB|WBGene00008231
            symbol:tag-329 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            eggNOG:NOG288820 EMBL:Z70750 HSSP:P53634 HOGENOM:HOG000019851
            PIR:T20110 RefSeq:NP_505458.1 ProteinModelPortal:Q18740 SMR:Q18740
            MEROPS:C01.A36 EnsemblMetazoa:C50F4.3 GeneID:183677
            KEGG:cel:CELE_C50F4.3 UCSC:C50F4.3 CTD:183677 WormBase:C50F4.3
            InParanoid:Q18740 OMA:WIFRNSW NextBio:921986 Uniprot:Q18740
        Length = 374

 Score = 336 (123.3 bits), Expect = 1.8e-30, P = 1.8e-30
 Identities = 96/326 (29%), Positives = 146/326 (44%)

Query:    31 NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPY--KLGINE 88
             N   + +  E ++ +Y R Y+D  EK+ RF+ F      +   N  A+   +  K GIN+
Sbjct:    39 NPEKLYKEFEDFIVKYKRNYKDEIEKKFRFQQFVATHNRVGKMNKAAKKAGHDTKYGINK 98

Query:    89 FADQTNEEFRA--PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKK--GA--VTG 142
             F+D + +E      + G  +   +V      ++  + +   +P + D R K  G   + G
Sbjct:    99 FSDLSKKEIHGMYSKFGPPKNNTNVPKFNLKNLRVKRQMEGLPKTFDLRNKKVGGHYIIG 158

Query:   143 -VKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAF 201
              +K Q  C CCW F+A A  E    +  +K  +LSEQE+ DC       GC GG   D  
Sbjct:   159 PIKTQDSCACCWGFAATAVAEAALTVHLKKAMNLSEQEVCDCAPK-HGPGCNGGDPVDGL 217

Query:   202 EFIISNKGLATEAKYPYKASD----GSCNK----KEANPSAAKISGYEDVPSNNEAALMK 253
             E+I    GL    +YP+  +     G C      +E NP   ++  Y   P N E  +  
Sbjct:   218 EYI-KEMGLTGGKEYPFNVNRSTQLGRCESEKYDRELNP--LELDYYAIDPFNAEYQMTH 274

Query:   254 AV--ANQPVSVAIDASGSDFQFYSSGVFT-GQCGTELD---HGVTAVGYGTADDGT---- 303
              +   N P+SVA   +G+    Y SG+     C  E     H    VGYGT  +      
Sbjct:   275 HLYLLNLPISVAF-RTGASLSSYLSGILELADCDDEKGGHWHSGAIVGYGTTKNSAGRTV 333

Query:   304 KYWLVKNSWGTTWGENGYIRMQRDID 329
              YW+ +NSW T WG++GY R+ R  D
Sbjct:   334 DYWIFRNSWWTDWGDDGYARIVRGED 359


>UNIPROTKB|F1P0K2
            symbol:CTSO "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            OMA:SNVCGIA EMBL:AADN02016534 IPI:IPI00651180
            Ensembl:ENSGALT00000015270 Uniprot:F1P0K2
        Length = 320

 Score = 335 (123.0 bits), Expect = 2.3e-30, P = 2.3e-30
 Identities = 90/297 (30%), Positives = 141/297 (47%)

Query:    47 GRVYRDNAEKEMRFKIFKENVEYIASFNNKAR-NKPYKLGINEFADQTNEEFRAPRNGYK 105
             GR   D   +E      +E+ + I   N+ +  N     G N+F+    EEF+A    Y 
Sbjct:    30 GRPPWDGGGREEEAAALRESAKRIRLLNSPSNDNGSAFYGKNQFSHLFPEEFKAI---YL 86

Query:   106 RRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGIN 165
             R +P  +      V  + E   +P   DWR K  +  V++Q  CG CWAFS V  +E   
Sbjct:    87 RSIP-YKLPRYIKVP-KGEEKPLPKKFDWRDKKVIAEVRNQQTCGGCWAFSVVGGIESAY 144

Query:   166 HITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK-GLATEAKYPYKASDGS 224
              I    L  LS Q+++DC  S  + GC GG    A  ++   K  L  +++Y +KA  G 
Sbjct:   145 AIKGHNLEELSVQQVIDCSYS--NYGCSGGSTITALSWLNQTKVKLVRDSEYTFKAQTGL 202

Query:   225 CNKKEANPSAAKISGYEDVP-SNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTGQ 282
             C+    +     I+G+     S  E  +M+ + +  P++V +DA    +Q Y  G+    
Sbjct:   203 CHYFPHSDFGVSITGFAAYDFSGQEEEMMRVLVDWGPLAVTVDAVS--WQDYLGGIIQYH 260

Query:   283 CGT-ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
             C + + +H V   G+ T      YW+V+NSWG TWG +GY+R++        +CGIA
Sbjct:   261 CSSGKANHAVLITGFDTTGI-IPYWIVQNSWGRTWGIDGYVRVK----IGSNVCGIA 312


>MGI|MGI:2139628
            symbol:Ctso "cathepsin O" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:2139628 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 GeneTree:ENSGT00560000076599 MEROPS:C01.035 CTD:1519
            HOVERGEN:HBG105050 KO:K01374 OMA:SNVCGIA OrthoDB:EOG4V6ZH1
            EMBL:AK034490 EMBL:AK049470 EMBL:AK165930 EMBL:AK166103
            EMBL:BC044664 IPI:IPI00453524 RefSeq:NP_808330.1 UniGene:Mm.254642
            ProteinModelPortal:Q8BM88 SMR:Q8BM88 STRING:Q8BM88
            PhosphoSite:Q8BM88 PRIDE:Q8BM88 Ensembl:ENSMUST00000029649
            GeneID:229445 KEGG:mmu:229445 UCSC:uc008pon.1 InParanoid:Q8BM88
            NextBio:379433 Bgee:Q8BM88 CleanEx:MM_CTSO Genevestigator:Q8BM88
            GermOnline:ENSMUSG00000028015 Uniprot:Q8BM88
        Length = 312

 Score = 330 (121.2 bits), Expect = 7.9e-30, P = 7.9e-30
 Identities = 89/277 (32%), Positives = 137/277 (49%)

Query:    69 YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRR-LPSVRSSETTDVSFRYENAS 127
             Y+ SF ++     Y  G+N+F+    EEF+A   G K    P   +     +     N S
Sbjct:    45 YLNSFPHENSTAFY--GVNQFSYLFPEEFKALYLGSKYAWAPRYPAEGQRPIP----NVS 98

Query:   128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
             +P   DWR K  V  V++Q  CG CWAFS V+A+E    I  + L  LS Q+++DC  S 
Sbjct:    99 LPLRFDWRDKHVVNPVRNQEMCGGCWAFSVVSAIESARAIQGKSLDYLSVQQVIDC--SF 156

Query:   188 EDQGCEGGLMDDAFEFIISNK-GLATEAKYPYKASDGSCN---KKEANPSAAKISGYEDV 243
              + GC GG    A  ++   +  L  +++YP+KA +G C    + +A  S    S Y   
Sbjct:   157 NNSGCLGGSPLCALRWLNETQLKLVADSQYPFKAVNGQCRHFPQSQAGVSVKDFSAYNFR 216

Query:   244 PSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFTGQCGT-ELDHGVTAVGYGTADD 301
                +E A  +A+ +  P+ V +DA    +Q Y  G+    C + E +H V   G+    +
Sbjct:   217 GQEDEMA--RALLSFGPLVVIVDAMS--WQDYLGGIIQHHCSSGEANHAVLITGFDRTGN 272

Query:   302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
              T YW+V+NSWG++WG  GY  ++        +CGIA
Sbjct:   273 -TPYWMVRNSWGSSWGVEGYAHVKMG----GNVCGIA 304


>UNIPROTKB|O97578
            symbol:CTSC "Dipeptidyl peptidase 1" species:9615 "Canis
            lupus familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 EMBL:AF060171 RefSeq:NP_001182763.1
            UniGene:Cfa.28653 ProteinModelPortal:O97578 SMR:O97578
            MEROPS:C01.070 PRIDE:O97578 GeneID:403458 KEGG:cfa:403458
            InParanoid:O97578 NextBio:20816976 Uniprot:O97578
        Length = 435

 Score = 324 (119.1 bits), Expect = 3.4e-29, P = 3.4e-29
 Identities = 96/292 (32%), Positives = 147/292 (50%)

Query:    61 KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
             +++K N E++ + N   ++      I E+   T  +      G  R++P  + +  T   
Sbjct:   141 RLYKYNYEFVKAINTIQKSWTATRYI-EYETLTLRDMMTRVGG--RKIPRPKPTPLT-AE 196

Query:   121 FRYENASVPASIDWRK-KGA--VTGVKDQGQCGCCWAFSAVAAMEG-INHITTRKLTS-L 175
                E + +P S DWR  +G   V+ V++Q  CG C+AF++ A +E  I  +T    T  L
Sbjct:   197 IHEEISRLPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPIL 256

Query:   176 SEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK-----GLATEAKYPYKASDGSCNKKEA 230
             S QE+V C  S   QGCEGG     F ++I+ K     GL  EA +PY  SD  C   + 
Sbjct:   257 SPQEIVSC--SQYAQGCEGG-----FPYLIAGKYAQDFGLVEEACFPYAGSDSPCKPNDC 309

Query:   231 ----NPSAAKISGYEDVPSNNEAAL-MKAVANQPVSVAIDASGSDFQFYSSGVF--TGQC 283
                 +     + G+    + NEA + ++ V + P++VA +    DF  Y  G++  TG  
Sbjct:   310 FRYYSSEYYYVGGFYG--ACNEALMKLELVRHGPMAVAFEVY-DDFFHYQKGIYYHTGLR 366

Query:   284 GT----EL-DHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
                   EL +H V  VGYGT +  G  YW+VKNSWG+ WGE+GY R++R  D
Sbjct:   367 DPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTD 418


>UNIPROTKB|H0YD65
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 EMBL:AP002748
            HGNC:HGNC:2531 ChiTaRS:CTSF Ensembl:ENST00000524994 Uniprot:H0YD65
        Length = 283

 Score = 323 (118.8 bits), Expect = 4.4e-29, P = 4.4e-29
 Identities = 89/236 (37%), Positives = 116/236 (49%)

Query:    46 YGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR-NGY 104
             Y R Y ++ E   R  +F  N+          R    + G+ +F+D T EEFR    N  
Sbjct:    43 YNRTY-ESKEARWRLSVFVNNMVRAQKIQALDRGTA-QYGVTKFSDLTEEEFRTIYLNTL 100

Query:   105 KRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGI 164
              R+ P  +  +   V         P   DWR KGAVT VKDQG CG CWAFS    +EG 
Sbjct:   101 LRKEPGNKMKQAKSVG-----DLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQ 155

Query:   165 NHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGS 224
               +    L SLSEQEL+DCD    D+ C GGL  +A+  I +  GL TE  Y Y+    S
Sbjct:   156 WFLNQGTLLSLSEQELLDCDKM--DKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQS 213

Query:   225 CNKKEANPSAAKISGYEDVP-SNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGV 278
             CN    +   AK+   + V  S NE  L   +A + P+SVAI+A G   QFY  G+
Sbjct:   214 CN---FSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFG--MQFYRHGI 264


>UNIPROTKB|P53634
            symbol:CTSC "Dipeptidyl peptidase 1" species:9606 "Homo
            sapiens" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0005794
            "Golgi apparatus" evidence=IEA] [GO:0007568 "aging" evidence=IEA]
            [GO:0010033 "response to organic substance" evidence=IEA]
            [GO:0031404 "chloride ion binding" evidence=IEA] [GO:0042802
            "identical protein binding" evidence=IEA] [GO:0043621 "protein
            self-association" evidence=IEA] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0006508 "proteolysis" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0006955
            "immune response" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005783 GO:GO:0005794 Reactome:REACT_6900
            GO:GO:0006955 GO:GO:0007568 GO:GO:0010033 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004252 GO:GO:0005764 GO:GO:0031404 CTD:1075
            HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ InterPro:IPR014882
            Pfam:PF08773 MEROPS:C01.070 EMBL:X87212 EMBL:U79415 EMBL:AF234263
            EMBL:AF234264 EMBL:AF254757 EMBL:AF525032 EMBL:AF525033
            EMBL:AK292117 EMBL:AK311923 EMBL:AK223038 EMBL:BX537913
            EMBL:AC011088 EMBL:CH471185 EMBL:BC054028 EMBL:BC100891
            EMBL:BC100892 EMBL:BC100893 EMBL:BC100894 EMBL:BC109386
            EMBL:BC110071 EMBL:BC113850 EMBL:BC113897 IPI:IPI00022810
            IPI:IPI00171323 IPI:IPI00872258 PIR:S23941 PIR:S66504
            RefSeq:NP_001107645.1 RefSeq:NP_001805.3 RefSeq:NP_680475.1
            UniGene:Hs.128065 PDB:1K3B PDB:2DJF PDB:2DJG PDB:3PDF PDBsum:1K3B
            PDBsum:2DJF PDBsum:2DJG PDBsum:3PDF ProteinModelPortal:P53634
            SMR:P53634 IntAct:P53634 MINT:MINT-4655964 STRING:P53634
            PhosphoSite:P53634 DMDM:1705632 PaxDb:P53634 PRIDE:P53634
            DNASU:1075 Ensembl:ENST00000227266 Ensembl:ENST00000524463
            Ensembl:ENST00000529974 GeneID:1075 KEGG:hsa:1075 UCSC:uc001pck.4
            UCSC:uc001pcm.4 GeneCards:GC11M088026 HGNC:HGNC:2528 HPA:CAB025364
            MIM:170650 MIM:245000 MIM:245010 MIM:602365 neXtProt:NX_P53634
            Orphanet:2342 Orphanet:678 PharmGKB:PA27028 HOGENOM:HOG000127503
            InParanoid:P53634 OMA:YDDFLHY PhylomeDB:P53634
            BioCyc:MetaCyc:HS03265-MONOMER SABIO-RK:P53634 BindingDB:P53634
            ChEMBL:CHEMBL2252 EvolutionaryTrace:P53634 GenomeRNAi:1075
            NextBio:4488 PMAP-CutDB:P53634 ArrayExpress:P53634 Bgee:P53634
            Genevestigator:P53634 GermOnline:ENSG00000109861 GO:GO:0001913
            Uniprot:P53634
        Length = 463

 Score = 320 (117.7 bits), Expect = 9.1e-29, P = 9.1e-29
 Identities = 96/302 (31%), Positives = 151/302 (50%)

Query:    53 NAEKEMRFKIFKENVEYIASFNNKARNKPYKLGIN-EFADQTNEEFRAPRNGYKRRLPSV 111
             N++++   +++K +  ++ + N  A  K +      E+   T  +      G+ R++P  
Sbjct:   158 NSQEKYSNRLYKYDHNFVKAIN--AIQKSWTATTYMEYETLTLGDMIRRSGGHSRKIPRP 215

Query:   112 RSSETTDVSFRYENASVPASIDWRK-KGA--VTGVKDQGQCGCCWAFSAVAAMEG-INHI 167
             + +  T    + +   +P S DWR   G   V+ V++Q  CG C++F+++  +E  I  +
Sbjct:   216 KPAPLT-AEIQQKILHLPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRIL 274

Query:   168 TTRKLTS-LSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK-----GLATEAKYPYKAS 221
             T    T  LS QE+V C  S   QGCEGG     F ++I+ K     GL  EA +PY  +
Sbjct:   275 TNNSQTPILSPQEVVSC--SQYAQGCEGG-----FPYLIAGKYAQDFGLVEEACFPYTGT 327

Query:   222 DGSCNKKEA-----NPSAAKISGYEDVPSNNEAAL-MKAVANQPVSVAIDASGSDFQFYS 275
             D  C  KE      +     + G+      NEA + ++ V + P++VA +    DF  Y 
Sbjct:   328 DSPCKMKEDCFRYYSSEYHYVGGFYG--GCNEALMKLELVHHGPMAVAFEVY-DDFLHYK 384

Query:   276 SGVF--TGQCGT----EL-DHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
              G++  TG        EL +H V  VGYGT +  G  YW+VKNSWGT WGENGY R++R 
Sbjct:   385 KGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRG 444

Query:   328 ID 329
              D
Sbjct:   445 TD 446


>UNIPROTKB|F1PSK8
            symbol:F1PSK8 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 InterPro:IPR014882
            Pfam:PF08773 EMBL:AAEX03012741 Ensembl:ENSCAFT00000007054
            Uniprot:F1PSK8
        Length = 405

 Score = 316 (116.3 bits), Expect = 2.4e-28, P = 2.4e-28
 Identities = 96/293 (32%), Positives = 147/293 (50%)

Query:    61 KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
             +++K N E++ + N   ++      I E+   T  +      G  R++P  + +  T   
Sbjct:   110 RLYKYNYEFVKAINTIQKSWTATRYI-EYETLTLRDMMTRGGG--RKIPRPKPTPLT-AE 165

Query:   121 FRYENASVPASIDWRK-KGA--VTGVKDQG-QCGCCWAFSAVAAMEG-INHITTRKLTS- 174
                E + +P S DWR  +G   V+ V++Q   CG C+AF++ A +E  I  +T    T  
Sbjct:   166 IHEEISRLPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNNTQTPI 225

Query:   175 LSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK-----GLATEAKYPYKASDGSCNKKE 229
             LS QE+V C  S   QGCEGG     F ++I+ K     GL  EA +PY  SD  C   +
Sbjct:   226 LSPQEIVSC--SQYAQGCEGG-----FPYLIAGKYAQDFGLVEEACFPYAGSDSPCKPND 278

Query:   230 A----NPSAAKISGYEDVPSNNEAAL-MKAVANQPVSVAIDASGSDFQFYSSGVF--TGQ 282
                  +     + G+    + NEA + ++ V + P++VA +    DF  Y  G++  TG 
Sbjct:   279 CFRYYSSEYYYVGGFYG--ACNEALMKLELVRHGPMAVAFEVY-DDFFHYQKGIYYHTGL 335

Query:   283 CGT----EL-DHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
                    EL +H V  VGYGT +  G  YW+VKNSWG+ WGE+GY R++R  D
Sbjct:   336 RDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTD 388


>MGI|MGI:109553
            symbol:Ctsc "cathepsin C" species:10090 "Mus musculus"
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IGI]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IMP]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0005783 "endoplasmic
            reticulum" evidence=ISO] [GO:0005794 "Golgi apparatus"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=ISO;IMP]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0010033
            "response to organic substance" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0031404 "chloride ion
            binding" evidence=ISO] [GO:0042802 "identical protein binding"
            evidence=ISO] [GO:0043621 "protein self-association" evidence=ISO]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 MGI:MGI:109553 GO:GO:0005783
            GO:GO:0005794 GO:GO:0007568 GO:GO:0010033 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004252 GO:GO:0005764 GO:GO:0031404 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 MEROPS:C01.070 OMA:YDDFLHY
            GO:GO:0001913 EMBL:U89269 EMBL:U74683 EMBL:BC067063 IPI:IPI00130015
            RefSeq:NP_034112.3 UniGene:Mm.322945 ProteinModelPortal:P97821
            SMR:P97821 STRING:P97821 PhosphoSite:P97821 PaxDb:P97821
            PRIDE:P97821 Ensembl:ENSMUST00000032779 GeneID:13032 KEGG:mmu:13032
            InParanoid:P97821 BindingDB:P97821 ChEMBL:CHEMBL3454 ChiTaRS:CTSC
            NextBio:282904 Bgee:P97821 CleanEx:MM_CTSC Genevestigator:P97821
            Uniprot:P97821
        Length = 462

 Score = 316 (116.3 bits), Expect = 2.4e-28, P = 2.4e-28
 Identities = 88/252 (34%), Positives = 135/252 (53%)

Query:   101 RNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRK-KGA--VTGVKDQGQCGCCWAFSA 157
             R+G+ +R+P  + +  TD   + +  ++P S DWR  +G   V+ V++Q  CG C++F++
Sbjct:   204 RSGHSQRIPRPKPAPMTD-EIQQQILNLPESWDWRNVQGVNYVSPVRNQESCGSCYSFAS 262

Query:   158 VAAMEG-INHITTRKLTS-LSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK-----GL 210
             +  +E  I  +T    T  LS QE+V C  S   QGC+GG     F ++I+ K     G+
Sbjct:   263 MGMLEARIRILTNNSQTPILSPQEVVSC--SPYAQGCDGG-----FPYLIAGKYAQDFGV 315

Query:   211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN----NEAAL-MKAVANQPVSVAID 265
               E+ +PY A D  C  +E N      S Y  V       NEA + ++ V + P++VA +
Sbjct:   316 VEESCFPYTAKDSPCKPRE-NCLRYYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFE 374

Query:   266 ASGSDFQFYSSGVF--TGQCGT----EL-DHGVTAVGYGTAD-DGTKYWLVKNSWGTTWG 317
                 DF  Y SG++  TG        EL +H V  VGYG     G +YW++KNSWG+ WG
Sbjct:   375 VH-DDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWG 433

Query:   318 ENGYIRMQRDID 329
             E+GY R++R  D
Sbjct:   434 ESGYFRIRRGTD 445


>UNIPROTKB|Q5QP40
            symbol:CTSK "Cathepsin K" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015644 Pfam:PF00112
            InterPro:IPR000169 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 EMBL:AL355860 HOVERGEN:HBG011513
            PANTHER:PTHR12411:SF55 EMBL:AL356292 UniGene:Hs.632466
            HGNC:HGNC:2536 IPI:IPI00514633 SMR:Q5QP40 STRING:Q5QP40
            Ensembl:ENST00000443913 Uniprot:Q5QP40
        Length = 258

 Score = 315 (115.9 bits), Expect = 3.1e-28, P = 3.1e-28
 Identities = 70/176 (39%), Positives = 106/176 (60%)

Query:    40 EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEF 97
             E+W   + + Y +  ++  R  I+++N++YI+  N +A      Y+L +N   D T+EE 
Sbjct:    86 ELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEV 145

Query:    98 RAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
                  G K  L   RS++T  +   +E  + P S+D+RKKG VT VK+QGQCG CWAFS+
Sbjct:   146 VQKMTGLKVPLSHSRSNDTLYIP-EWEGRA-PDSVDYRKKGYVTPVKNQGQCGSCWAFSS 203

Query:   158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
             V A+EG     T KL +LS Q LVDC +  E+ GC GG M +AF+++  N+G+ +E
Sbjct:   204 VGALEGQLKKKTGKLLNLSPQNLVDCVS--ENDGCGGGYMTNAFQYVQKNRGIDSE 257


>UNIPROTKB|J9P219
            symbol:J9P219 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 InterPro:IPR014882
            Pfam:PF08773 OMA:YDDFLHY EMBL:AAEX03012741
            Ensembl:ENSCAFT00000050015 Uniprot:J9P219
        Length = 406

 Score = 313 (115.2 bits), Expect = 5.0e-28, P = 5.0e-28
 Identities = 95/293 (32%), Positives = 144/293 (49%)

Query:    61 KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
             +++K N E++ + N   ++      I E+   T  +      G  R++P           
Sbjct:   110 RLYKYNYEFVKAINTIQKSWTATRYI-EYETLTLRDMMTRGGG--RKIPRKPKPTPLTAE 166

Query:   121 FRYENASVPASIDWRK-KGA--VTGVKDQG-QCGCCWAFSAVAAMEG-INHITTRKLTS- 174
                E + +P S DWR  +G   V+ V++Q   CG C+AF++ A +E  I  +T    T  
Sbjct:   167 IHEEISRLPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNNTQTPI 226

Query:   175 LSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK-----GLATEAKYPYKASDGSCNKKE 229
             LS QE+V C  S   QGCEGG     F ++I+ K     GL  EA +PY  SD  C   +
Sbjct:   227 LSPQEIVSC--SQYAQGCEGG-----FPYLIAGKYAQDFGLVEEACFPYAGSDSPCKPND 279

Query:   230 A----NPSAAKISGYEDVPSNNEAAL-MKAVANQPVSVAIDASGSDFQFYSSGVF--TGQ 282
                  +     + G+    + NEA + ++ V + P++VA +    DF  Y  G++  TG 
Sbjct:   280 CFRYYSSEYYYVGGFYG--ACNEALMKLELVRHGPMAVAFEVY-DDFFHYQKGIYYHTGL 336

Query:   283 CGT----EL-DHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
                    EL +H V  VGYGT +  G  YW+VKNSWG+ WGE+GY R++R  D
Sbjct:   337 RDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTD 389


>WB|WBGene00019314
            symbol:K02E7.10 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 SMART:SM00645 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 HSSP:P07711
            EMBL:FO080411 PIR:T32392 RefSeq:NP_493904.1 UniGene:Cel.14828
            ProteinModelPortal:O17255 SMR:O17255 EnsemblMetazoa:K02E7.10
            GeneID:186889 KEGG:cel:CELE_K02E7.10 UCSC:K02E7.10 CTD:186889
            WormBase:K02E7.10 eggNOG:NOG331187 HOGENOM:HOG000114005
            InParanoid:O17255 OMA:GNANEAR NextBio:933344 Uniprot:O17255
        Length = 299

 Score = 306 (112.8 bits), Expect = 2.8e-27, P = 2.8e-27
 Identities = 76/219 (34%), Positives = 114/219 (52%)

Query:   132 IDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGIN-HITTRKLTSLSEQELVDCDTSGEDQ 190
             +DWR+KG V  VKDQG+C   +AF+A+AA+E +       KL S SEQ+++DC  +    
Sbjct:    84 LDWREKGIVGPVKDQGKCNASYAFAAIAAIESMYAKANNGKLLSFSEQQIIDC--ANFTN 141

Query:   191 GCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG-YEDVPSNNEA 249
              C+  L +      +   G+ TEA YPY   + +  K E + S  K+   Y DV  N E 
Sbjct:   142 PCQENLENVLSNRFLKENGVGTEADYPYVGKE-NVGKCEYDSSKMKLRPTYIDVYPNEEW 200

Query:   250 ALMKAVANQPVSVAIDASGSDFQFYSSGVFTG---QCGTELD-HGVTAVGYGTADDGTKY 305
             A    +           S   F  Y +G++     +CG   +   +  VGYG  D   KY
Sbjct:   201 ARAH-ITTFGTGYFRMRSPPSFFHYKTGIYNPTKEECGNANEARSLAIVGYGK-DGAEKY 258

Query:   306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             W+VK S+GT+WGE+GY+++ R+++A    CG+A   S P
Sbjct:   259 WIVKGSFGTSWGEHGYMKLARNVNA----CGMAESISIP 293


>RGD|2445
            symbol:Ctsc "cathepsin C" species:10116 "Rattus norvegicus"
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA;ISO]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA;ISO]
            [GO:0005764 "lysosome" evidence=IDA;TAS] [GO:0005783 "endoplasmic
            reticulum" evidence=IDA] [GO:0005794 "Golgi apparatus" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IEP;ISO;TAS] [GO:0007568 "aging"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISO] [GO:0010033 "response to organic substance"
            evidence=IDA] [GO:0031404 "chloride ion binding" evidence=IDA]
            [GO:0042802 "identical protein binding" evidence=IDA] [GO:0043621
            "protein self-association" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:2445 GO:GO:0005783 GO:GO:0005794 GO:GO:0007568
            GO:GO:0010033 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0004252 GO:GO:0005764 GO:GO:0043621
            GO:GO:0042802 GO:GO:0031404 GO:GO:0004197
            GeneTree:ENSGT00560000076599 CTD:1075 HOGENOM:HOG000068022
            HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ InterPro:IPR014882
            Pfam:PF08773 MEROPS:C01.070 OMA:YDDFLHY GO:GO:0001913 EMBL:D90404
            IPI:IPI00193765 PIR:A41158 RefSeq:NP_058793.1 UniGene:Rn.203177
            PDB:1JQP PDBsum:1JQP ProteinModelPortal:P80067 SMR:P80067
            STRING:P80067 PhosphoSite:P80067 PRIDE:P80067
            Ensembl:ENSRNOT00000022342 GeneID:25423 KEGG:rno:25423
            InParanoid:P80067 SABIO-RK:P80067 EvolutionaryTrace:P80067
            NextBio:606591 ArrayExpress:P80067 Genevestigator:P80067
            GermOnline:ENSRNOG00000016496 Uniprot:P80067
        Length = 462

 Score = 307 (113.1 bits), Expect = 2.9e-27, P = 2.9e-27
 Identities = 98/304 (32%), Positives = 153/304 (50%)

Query:    61 KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
             +++  N  ++ + N+    K +     E  ++ +      R+G+  R+   + +  TD  
Sbjct:   166 RLYSHNHNFVKAINSV--QKSWTATTYEEYEKLSIRDLIRRSGHSGRILRPKPAPITD-E 222

Query:   121 FRYENASVPASIDWRK-KGA--VTGVKDQGQCGCCWAFSAVAAMEG-INHITTRKLTS-L 175
              + +  S+P S DWR  +G   V+ V++Q  CG C++F+++  +E  I  +T    T  L
Sbjct:   223 IQQQILSLPESWDWRNVRGINFVSPVRNQESCGSCYSFASLGMLEARIRILTNNSQTPIL 282

Query:   176 SEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK-----GLATEAKYPYKASDGSCNKKEA 230
             S QE+V C  S   QGC+GG     F ++I+ K     G+  E  +PY A+D  C  KE 
Sbjct:   283 SPQEVVSC--SPYAQGCDGG-----FPYLIAGKYAQDFGVVEENCFPYTATDAPCKPKE- 334

Query:   231 NPSAAKISGYEDVPSN----NEAAL-MKAVANQPVSVAIDASGSDFQFYSSGVF--TGQC 283
             N      S Y  V       NEA + ++ V + P++VA +    DF  Y SG++  TG  
Sbjct:   335 NCLRYYSSEYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVH-DDFLHYHSGIYHHTGLS 393

Query:   284 GT----EL-DHGVTAVGYGTAD-DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
                   EL +H V  VGYG     G  YW+VKNSWG+ WGE+GY R++R  D +  +  I
Sbjct:   394 DPFNPFELTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRIRRGTD-ECAIESI 452

Query:   338 AMQA 341
             AM A
Sbjct:   453 AMAA 456


>UNIPROTKB|J9NSE7
            symbol:CTSC "Dipeptidyl peptidase 1" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 InterPro:IPR014882 Pfam:PF08773
            EMBL:AAEX03017125 Ensembl:ENSCAFT00000014269 OMA:INGQICH
            Uniprot:J9NSE7
        Length = 458

 Score = 305 (112.4 bits), Expect = 4.7e-27, P = 4.7e-27
 Identities = 93/292 (31%), Positives = 144/292 (49%)

Query:    61 KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
             +++K N E++ + N   ++      I E+   T  +      G  R++P  + +  T   
Sbjct:   164 RLYKYNYEFVKAINTIQKSWTATRYI-EYETLTLRDMMRRAGG--RKIPRPKPTPLT-AE 219

Query:   121 FRYENASVPASIDWRK-KGA--VTGVKDQGQCGCCWAFSAVAAMEG-INHITTRKLTS-L 175
                E + +P S DWR  +G   V+ V++Q  CG C+AF++   +E  I  +T    T  L
Sbjct:   220 IHEEISRLPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTVMLEARIRILTNNTQTPIL 279

Query:   176 SEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK-----GLATEAKYPYKASDGSCNKKEA 230
             S QE+V C  S   QGCEGG     F ++I+ K     GL  EA + Y  SD  C   + 
Sbjct:   280 SPQEIVSC--SQYAQGCEGG-----FPYLIAGKYAQDFGLVDEACFSYAGSDSPCKPNDC 332

Query:   231 ----NPSAAKISGYEDVPSNNEAAL-MKAVANQPVSVAIDASGSDFQFYSSGVF--TGQC 283
                 +     + G+    + NEA + ++ V + P++VA +    DF  Y  G++  TG  
Sbjct:   333 FHYYSSEYHYVGGFYG--ACNEALMKLELVRHGPMAVAFEVY-DDFFHYQKGIYYHTGLR 389

Query:   284 GT----EL-DHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
                   EL +H V  VGYGT +  G  YW+VKNSWG+ WGE+GY ++ R  D
Sbjct:   390 DPINPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFQICRGTD 441


>DICTYBASE|DDB_G0286015
            symbol:gmsA species:44689 "Dictyostelium discoideum"
            [GO:0005615 "extracellular space" evidence=IDA] [GO:0019953 "sexual
            reproduction" evidence=IEP] [GO:0009897 "external side of plasma
            membrane" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA;ISS] [GO:0000747 "conjugation with cellular
            fusion" evidence=IMP] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005576 "extracellular
            region" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0286015 Pfam:PF00188 GO:GO:0005615
            GenomeReviews:CM000153_GR GO:GO:0009897 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AAFI02000085 GO:GO:0000747
            Gene3D:3.40.33.10 InterPro:IPR001283 InterPro:IPR014044
            SMART:SM00198 SUPFAM:SSF55797 HSSP:P07688 RefSeq:XP_637893.1
            ProteinModelPortal:Q54ME1 MEROPS:C01.A52 EnsemblProtists:DDB0191145
            GeneID:8625403 KEGG:ddi:DDB_G0286015 InParanoid:Q54ME1 OMA:PGIAYEK
            ProtClustDB:CLSZ2429919 Uniprot:Q54ME1
        Length = 448

 Score = 303 (111.7 bits), Expect = 6.4e-27, P = 6.4e-27
 Identities = 80/220 (36%), Positives = 113/220 (51%)

Query:   131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEG---INHITTRKLT-SLSEQELVDCDTS 186
             ++DW      T ++DQGQCG CWAF++ AA+E    I + T +K T  LS Q  V+C  S
Sbjct:   243 TVDWTSYQ--TPIRDQGQCGSCWAFASSAALESRYLIKYGTAQKSTLQLSNQNAVNCIAS 300

Query:   187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
             G    C GG   + F F     G+A E   PYKA  G+     ++ +  K + Y      
Sbjct:   301 G----CNGGWSGNYFNFF-KTPGIAYEKDDPYKAVTGTSCITTSSVARFKYTNY-GYTEK 354

Query:   247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCG-TELDHGVTAVGYGTADDGTKY 305
              +AAL+  +   PV++A+    S FQ Y SG++      T ++H V  VGY  A D  K 
Sbjct:   355 TKAALLAELKKGPVTIAVYVD-SAFQNYKSGIYNSATKYTGINHLVLLVGYDQATDAYK- 412

Query:   306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
               +KNSWG+ WGE+GY+R+    D    L   A  + YPT
Sbjct:   413 --IKNSWGSWWGESGYMRITASND---NLAIFAYNSYYPT 447


>WB|WBGene00044760
            symbol:Y71H2AM.25 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0004181
            "metallocarboxypeptidase activity" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            GeneTree:ENSGT00560000076599 EMBL:FO081822 eggNOG:NOG331187
            HOGENOM:HOG000114005 RefSeq:NP_001040887.1
            ProteinModelPortal:Q2AAB9 SMR:Q2AAB9 EnsemblMetazoa:Y71H2AM.25
            GeneID:4363054 KEGG:cel:CELE_Y71H2AM.25 UCSC:Y71H2AM.25 CTD:4363054
            WormBase:Y71H2AM.25 InParanoid:Q2AAB9 NextBio:959635 Uniprot:Q2AAB9
        Length = 299

 Score = 300 (110.7 bits), Expect = 1.2e-26, P = 1.2e-26
 Identities = 85/247 (34%), Positives = 127/247 (51%)

Query:   112 RSSETTDVSF-----RYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGIN- 165
             R+SE+   +F     +Y   +    +DWR KG V  VKDQG+C    AF+  +++E +  
Sbjct:    61 RTSESLPTTFQWKTPKYTIQTTEEFLDWRDKGIVGPVKDQGKCNASHAFAISSSIESMYA 120

Query:   166 HITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASD-GS 224
               T   L S SEQ+L+DCD  G  +GCE     +A  + I + G+ TEA YPY   + G 
Sbjct:   121 KATNGSLLSFSEQQLIDCDDHGF-KGCEEQPAINAVSYFIFH-GIETEADYPYAGKENGK 178

Query:   225 CNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTG-- 281
             C   ++  S  ++   E V SN E    + V N  P    + A  S +  Y  G++    
Sbjct:   179 CTF-DSTKSKIQLKDAEFVVSN-ETQGKELVTNYGPAFFTMRAPPSLYD-YKIGIYNPSI 235

Query:   282 -QC-GTELDHGVTAVGYGTADDGT-KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
              +C  T     +  VGYG   +G  KYW+VK S+GT+WGE GY+++ RD++A    C +A
Sbjct:   236 EECTSTHEIRSMVIVGYGI--EGVQKYWIVKGSFGTSWGEQGYMKLARDVNA----CAMA 289

Query:   339 MQASYPT 345
                + PT
Sbjct:   290 DFITVPT 296


>WB|WBGene00008861
            symbol:F15D4.4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 SMART:SM00848 PROSITE:PS00640
            GeneTree:ENSGT00560000076599 EMBL:Z80344 HSSP:P53634
            eggNOG:NOG310593 PIR:T20981 ProteinModelPortal:Q93512 SMR:Q93512
            MEROPS:C01.A45 EnsemblMetazoa:F15D4.4 KEGG:cel:CELE_F15D4.4
            UCSC:F15D4.4 CTD:184530 WormBase:F15D4.4 InParanoid:Q93512
            OMA:ITMEQNI NextBio:925068 Uniprot:Q93512
        Length = 608

 Score = 306 (112.8 bits), Expect = 1.6e-26, P = 1.6e-26
 Identities = 87/293 (29%), Positives = 139/293 (47%)

Query:    51 RDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPS 110
             R N   +++ ++ + N+ Y    ++      YK+  N+F+   + E  AP       L  
Sbjct:   154 RFNVYSKVKKEVDEHNIMYELGMSS------YKMSTNQFSVALDGEV-APLTLNLDALTP 206

Query:   111 VRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTR 170
               +     +S R +  + P ++DWR    +  + DQ  CG CWAFS ++ +E    I   
Sbjct:   207 TATVIPATISSRKKRDTEP-TVDWRP--FLKPILDQSTCGGCWAFSMISMIESFFAIQGY 263

Query:   171 KLTSLSEQELVDCDTSGEDQ------GCEGGLMDDAFEFIISNKGLATEAKY-PYKASDG 223
               +SLS Q+L+ CDT  +        GC+GG    A  ++      A +A   P+   D 
Sbjct:   264 NTSSLSVQQLLTCDTKVDSTYGLANVGCKGGYFQIAGSYL--EVSAARDASLIPFDLEDT 321

Query:   224 SCNKKEANPSAAKISGYED--VPSNNEAALM--------KAVANQPVSVAIDASGSDFQF 273
             SC+     P    I  ++D  +  N  AA +          V   P++V + A+G D   
Sbjct:   322 SCDSSFFPPVVPTILLFDDGYISGNFTAAQLITMEQNIEDKVRKGPIAVGM-AAGPDIYK 380

Query:   274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
             YS GV+ G CGT ++H V  VG+   DD   YW+++NSWG +WGE GY R++R
Sbjct:   381 YSEGVYDGDCGTIINHAVVIVGF--TDD---YWIIRNSWGASWGEAGYFRVKR 428


>UNIPROTKB|F1NWG2
            symbol:CTSC "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 InterPro:IPR014882 Pfam:PF08773
            OMA:YDDFLHY GO:GO:0001913 EMBL:AADN02004805 IPI:IPI00577371
            Ensembl:ENSGALT00000027869 Uniprot:F1NWG2
        Length = 463

 Score = 299 (110.3 bits), Expect = 2.6e-26, P = 2.6e-26
 Identities = 90/292 (30%), Positives = 143/292 (48%)

Query:    63 FKENVEYIASFNNKARNKPYKLG-INEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSF 121
             F  N +++ + N  A  K ++     E+ + + EE      G   R    + +  T    
Sbjct:   168 FVHNFDFVNAIN--AHQKSWRATRYEEYENFSLEELTRRAGGLYSRTSRPKPAPLTPELL 225

Query:   122 RYENASVPASIDWRK-KGA--VTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTS--LS 176
             + + + +P S DWR   G   V+ V++Q  CG C+AF+++  +E    I T        S
Sbjct:   226 K-KVSGLPESWDWRNVNGVNYVSPVRNQASCGSCYAFASMGMLEARIRILTNNTQKPVFS 284

Query:   177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNK-----GLATEAKYPYKASDGSCNKKEA- 230
              Q++V C  S   QGC+GG     F ++I+ K     G+  E  +PY A D  C  K + 
Sbjct:   285 PQQVVSC--SQYSQGCDGG-----FPYLIAGKYVQDFGVVEEDCFPYTAKDTPCLFKRSC 337

Query:   231 ----NPSAAKISGYEDVPSNNEAAL-MKAVANQPVSVAIDASGSDFQFYSSGVF--TGQC 283
                       + G+    + NEA + ++ V + P++VA +   +DF FY  G++  TG  
Sbjct:   338 YHYYTSEYHYVGGFYG--ACNEALMKLELVLSGPMAVAFEVY-NDFMFYKEGIYHHTGLK 394

Query:   284 GT----EL-DHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
                   EL +H V  VGYG   + G K+W+VKNSWGT+WGE+GY R++R  D
Sbjct:   395 DEFNPFELTNHAVLLVGYGKDPESGEKFWIVKNSWGTSWGEDGYFRIRRGTD 446


>DICTYBASE|DDB_G0288221
            symbol:DDB_G0288221 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0288221 Pfam:PF00188 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 PANTHER:PTHR12411 EMBL:AAFI02000109 Gene3D:3.40.33.10
            InterPro:IPR001283 InterPro:IPR014044 SMART:SM00198 SUPFAM:SSF55797
            MEROPS:C01.A52 ProtClustDB:CLSZ2429919 RefSeq:XP_636852.1
            ProteinModelPortal:Q54J84 EnsemblProtists:DDB0187839 GeneID:8626520
            KEGG:ddi:DDB_G0288221 InParanoid:Q54J84 Uniprot:Q54J84
        Length = 395

 Score = 293 (108.2 bits), Expect = 6.6e-26, P = 6.6e-26
 Identities = 83/242 (34%), Positives = 127/242 (52%)

Query:   100 PRNGYKRRLPSVRSSETTDVSF--RYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
             P  G+K  LP   +S     S   +  N S   S+DW      T V+DQG+C  CW F +
Sbjct:   159 PAGGFKGVLPYKPTSINPSASTTPKMPNFS-SGSVDW--SDYQTPVRDQGECKSCWVFGS 215

Query:   158 VAAMEGI----NHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
             +AA+E      N ++ +    LS Q  ++C TSG    CE G   + F++  S+ G+A E
Sbjct:   216 LAALESRYLIKNGVSEKSTLHLSAQNAMNCITSG----CESGWPANVFDYFESS-GIAFE 270

Query:   214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
               YPY A  GS N   ++ +  + SGY+ V  N + +L++ + N P+++A+  S + FQ 
Sbjct:   271 KDYPYDAI-GSDNCTSSS-NKFEYSGYDSV-ENTKDSLIQELKNGPITIAL-YSDTAFQS 326

Query:   274 YSSGVFTG-QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKE 332
             Y+ G++   +   +++H V  VGY   D  T  W +KNS GT WGE GY R+    D K 
Sbjct:   327 YAGGIYDSVEEYKDVNHIVLLVGY---DKPTDSWKIKNSLGTKWGELGYARITASND-KL 382

Query:   333 GL 334
             G+
Sbjct:   383 GI 384


>WB|WBGene00022189
            symbol:Y71H2AR.2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0004181
            "metallocarboxypeptidase activity" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] [GO:0008340 "determination of adult lifespan"
            evidence=IMP] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0008340 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 HSSP:P07711
            eggNOG:NOG331187 HOGENOM:HOG000114005 EMBL:FO081570
            RefSeq:NP_497627.1 UniGene:Cel.28419 ProteinModelPortal:Q9BL26
            SMR:Q9BL26 EnsemblMetazoa:Y71H2AR.2 GeneID:190615
            KEGG:cel:CELE_Y71H2AR.2 UCSC:Y71H2AR.2 CTD:190615
            WormBase:Y71H2AR.2 InParanoid:Q9BL26 OMA:CAMATTI NextBio:946382
            Uniprot:Q9BL26
        Length = 345

 Score = 289 (106.8 bits), Expect = 1.8e-25, P = 1.8e-25
 Identities = 77/231 (33%), Positives = 122/231 (52%)

Query:   112 RSSETTDVSFRYE-----NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGIN- 165
             R+SE+    F++E     + +    +DWR+KG V  VKDQG+C    AF+  +++E +  
Sbjct:    61 RTSESLPTRFQWETPIHMDRTTEEFLDWREKGIVGPVKDQGKCNASHAFAITSSIESMYA 120

Query:   166 HITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSC 225
               T   L S SEQ+L+DC+  G  +GCE     +A  ++ ++ G+ TEA YPY   D + 
Sbjct:   121 KATNGTLLSFSEQQLIDCNDQGY-KGCEEQFAMNAIGYLATH-GIETEADYPYV--DKTN 176

Query:   226 NKKEANPSAAKISGYEDVPSNNEAALMKA-VANQ-PVSVAIDASGSDFQFYSSGVFTG-- 281
              K   + + +KI   + V +     L K  V N  P    + A  S +  Y  G++    
Sbjct:   177 EKCTFDSTKSKIHLKKGVVAEGNEVLGKVYVTNYGPAFFTMRAPPSLYD-YKIGIYNPSI 235

Query:   282 -QC-GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDA 330
              +C  T     +  VGYG   +  KYW+VK S+GT+WGE GY+++ RD++A
Sbjct:   236 EECTSTHEIRSMVIVGYGIEGE-QKYWIVKGSFGTSWGEQGYMKLARDVNA 285


>ZFIN|ZDB-GENE-030619-9 [details] [associations]
            symbol:ctsc "cathepsin C" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030619-9 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 MEROPS:C01.070 HSSP:P43235
            EMBL:BC064286 IPI:IPI00486570 RefSeq:NP_999887.1 UniGene:Dr.32463
            ProteinModelPortal:Q6P2V1 SMR:Q6P2V1 PRIDE:Q6P2V1 GeneID:368704
            KEGG:dre:368704 InParanoid:Q6P2V1 NextBio:20813127
            ArrayExpress:Q6P2V1 Bgee:Q6P2V1 Uniprot:Q6P2V1
        Length = 455

 Score = 286 (105.7 bits), Expect = 7.3e-25, P = 7.3e-25
 Identities = 91/295 (30%), Positives = 137/295 (46%)

Query:    55 EKEMRFKI-FKENVEYIASFNNKARNKPYKLGINEFADQTN-EEFRAPRNGYKRRLPSVR 112
             E  +  K+ +  N+ ++   N+    K +      F +  +  E      G   R+P  R
Sbjct:   152 EHRLLMKLPYTNNMMFVDEINSV--QKSWTATAYSFHETLSIHEMLRRSGGPASRIPR-R 208

Query:   113 SSETTDVSFRYENASVPASIDWRK-KGA--VTGVKDQGQCGCCWAFSAVAAMEGINHITT 169
                 T  +     + +P   DWR   G   V+ V++Q QCG C++F+ +  +E    I T
Sbjct:   209 VRPVTVAADSKAASGLPQHWDWRNVNGVNFVSPVRNQAQCGSCYSFATMGMLEARVRIQT 268

Query:   170 RKLTS--LSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNK 227
                     S Q++V C  S   QGC+GG      ++I  + G+  E  +PY  SD  CN 
Sbjct:   269 NNTQQPVFSPQQVVSC--SQYSQGCDGGFPYLIGKYI-QDFGIVEEDCFPYTGSDSPCNL 325

Query:   228 KEANPSAAKISGYEDVPSN----NEAALM-KAVANQPVSVAIDASGSDFQFYSSGVF--T 280
               A  +    S Y  V       +E+A+M + V N P+ VA++    DF  Y  G++  T
Sbjct:   326 P-AKCTKYYASDYHYVGGFYGGCSESAMMLELVKNGPMGVALEVY-PDFMNYKEGIYHHT 383

Query:   281 GQCGT----EL-DHGVTAVGYGTADD-GTKYWLVKNSWGTTWGENGYIRMQRDID 329
             G        EL +H V  VGYG     G KYW+VKNSWG+ WGENG+ R++R  D
Sbjct:   384 GLRDANNPFELTNHAVLLVGYGQCHKTGEKYWIVKNSWGSGWGENGFFRIRRGTD 438


>FB|FBgn0033873 [details] [associations]
            symbol:CG6337 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 EMBL:AE013599
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 HSSP:P80067 EMBL:AY084123
            RefSeq:NP_610905.1 UniGene:Dm.5230 SMR:Q7JYA0 IntAct:Q7JYA0
            EnsemblMetazoa:FBtr0087646 GeneID:36530 KEGG:dme:Dmel_CG6337
            UCSC:CG6337-RA FlyBase:FBgn0033873 eggNOG:NOG310593
            InParanoid:Q7JYA0 OMA:NRTTYRE OrthoDB:EOG4MCVFZ GenomeRNAi:36530
            NextBio:799041 Uniprot:Q7JYA0
        Length = 340

 Score = 278 (102.9 bits), Expect = 2.6e-24, P = 2.6e-24
 Identities = 92/317 (29%), Positives = 147/317 (46%)

Query:    46 YGRVYRDNAEKEMRFKIFKENVEYIASFNNKA-RNKP-YKLGINEFADQTNEEFRAPRNG 103
             + + Y   + +      F  N   +A  N +A RN+  Y+  +N+F+D    +F A    
Sbjct:    35 FNKTYASTSARNFANYYFIYNRNQVAQHNAQADRNRTTYREAVNQFSDIRLIQFAAL--- 91

Query:   104 YKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQG-QCGCCWAFSAVAAME 162
               + + +V S+ +   + +  +AS     D+   G    V+DQG  C   WA++   A+E
Sbjct:    92 LPKAVNTVTSAASDPPASQAASASFDIITDF---GLTVAVEDQGVNCSSSWAYATAKAVE 148

Query:   163 GINHI-TTRKL-TSLSEQELVDCDTSGEDQGCEGGLMDDAFEFI--ISNKGLATEAKYPY 218
              +N + T   L +SLS Q+L+DC  +G   GC       A  ++  +++  L  E  YP 
Sbjct:   149 IMNAVQTANPLPSSLSAQQLLDC--AGMGTGCSTQTPLAALNYLTQLTDAYLYPEVDYPN 206

Query:   219 KAS---DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFY 274
               S    G C    +     K++GY  V  N++AA+M+ V+N  PV V  + +   F  Y
Sbjct:   207 NNSLKTPGMCQPPSSVSVGVKLAGYSTVADNDDAAVMRYVSNGFPVIVEYNPATFGFMQY 266

Query:   275 SSGVFTGQC----GTELDHGVTAVGYG-TADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
             SSGV+  +       +    +  VGY    D    YW   NS+G TWGE GYIR+ R  +
Sbjct:   267 SSGVYVQETRALTNPKSSQFLVVVGYDHDVDSNLDYWRCLNSFGDTWGEEGYIRIVRRSN 326

Query:   330 AKEGLCGIAMQASYPTA 346
                    IA  A +P+A
Sbjct:   327 QP-----IAKNAVFPSA 338


>UNIPROTKB|F1RWA9 [details] [associations]
            symbol:CTSO "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 EMBL:CU855637
            Ensembl:ENSSSCT00000009707 OMA:WAFSIVG Uniprot:F1RWA9
        Length = 194

 Score = 270 (100.1 bits), Expect = 1.8e-23, P = 1.8e-23
 Identities = 65/199 (32%), Positives = 101/199 (50%)

Query:   148 QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISN 207
             QCG CWAFS V+A+E    I  + L  LS Q+++DC  S  + GC GG   +A  ++   
Sbjct:     1 QCGGCWAFSVVSAVESAYAIKGQPLEVLSVQQVIDC--SYNNYGCNGGSTLNALYWLNKT 58

Query:   208 K-GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP-SNNEAALMKAVANQ-PVSVAI 264
             +  + ++++YP+KA +G C+    + S   I  Y     S  E  + K +    P+ V +
Sbjct:    59 QVKVVSDSEYPFKAQNGLCHYFSCSHSGVSIKDYSAYDFSGQEDEMAKTLLTLGPLIVIV 118

Query:   265 DASGSDFQFYSSGVFTGQCGT-ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
             DA    +Q Y  G+    C + E +H V   G+      T YW+V+NSWG+ WG +GY  
Sbjct:   119 DAVS--WQDYLGGIIQHHCSSGEANHAVLVTGFDKTGS-TPYWIVRNSWGSAWGIDGYAL 175

Query:   324 MQRDIDAKEGLCGIAMQAS 342
             ++        +CGIA   S
Sbjct:   176 VKMG----GNICGIADSVS 190


>DICTYBASE|DDB_G0292462 [details] [associations]
            symbol:DDB_G0292462 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0292462 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            EMBL:AAFI02000190 RefSeq:XP_629634.1 MEROPS:C01.A56
            EnsemblProtists:DDB0184413 GeneID:8628698 KEGG:ddi:DDB_G0292462
            InParanoid:Q54D62 OMA:NTQVESH Uniprot:Q54D62
        Length = 323

 Score = 267 (99.0 bits), Expect = 3.8e-23, P = 3.8e-23
 Identities = 78/243 (32%), Positives = 119/243 (48%)

Query:   127 SVPASIDWRKK--GAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTS--LSEQELVD 182
             ++PAS D R      ++ V++Q  CG CWA      +     I + K     LS Q L+D
Sbjct:    45 TIPASFDVRTNWGDCMSPVREQQSCGSCWAQVTSGILADRMCIESDKNIKMLLSPQYLMD 104

Query:   183 CD-------TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS-DGSC------NKK 228
             CD        SG + GC+GG +  A   +I N+G+ ++    Y+AS D SC         
Sbjct:   105 CDGSCVSDGVSGCNNGCKGGFVGLALTRLI-NEGIVSDECLSYQASKDSSCPTTCDDGSP 163

Query:   229 EANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD 288
              +N +  K +     P+  +A   + + N PV +A     SDF+ +   V+     T+++
Sbjct:   164 ISNTTIYKATSCRAFPTVQDAQY-EIMTNGPV-IATFMLYSDFKPHKWDVYIKSSNTQVE 221

Query:   289 -HGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDA---KEGLCGI-AMQASY 343
              H V  VG+GT  DG  YW+  NSWGT WG+ GY +++R  D    +EG   + A  AS 
Sbjct:   222 SHAVRVVGWGTTSDGVDYWIAANSWGTGWGDKGYFKIRRGSDEAAFEEGFITVTADTASV 281

Query:   344 PTA 346
             PT+
Sbjct:   282 PTS 284


>UNIPROTKB|E9PKT6 [details] [associations]
            symbol:CTSH "Cathepsin H" species:9606 "Homo sapiens"
            [GO:0001520 "outer dense fiber" evidence=IEA] [GO:0001656
            "metanephros development" evidence=IEA] [GO:0001669 "acrosomal
            vesicle" evidence=IEA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0007283 "spermatogenesis" evidence=IEA] [GO:0008284 "positive
            regulation of cell proliferation" evidence=IEA] [GO:0010634
            "positive regulation of epithelial cell migration" evidence=IEA]
            [GO:0016505 "apoptotic protease activator activity" evidence=IEA]
            [GO:0030984 "kininogen binding" evidence=IEA] [GO:0031638 "zymogen
            activation" evidence=IEA] [GO:0031648 "protein destabilization"
            evidence=IEA] [GO:0032403 "protein complex binding" evidence=IEA]
            [GO:0032526 "response to retinoic acid" evidence=IEA] [GO:0033619
            "membrane protein proteolysis" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0043621 "protein
            self-association" evidence=IEA] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IEA] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            InterPro:IPR000169 GO:GO:0043066 GO:GO:0008284 PANTHER:PTHR12411
            PROSITE:PS00139 GO:GO:0045766 GO:GO:0004252 GO:GO:0032526
            GO:GO:0016505 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 GO:GO:0060448 GO:GO:0033619
            EMBL:AC011944 HGNC:HGNC:2535 IPI:IPI00375426
            ProteinModelPortal:E9PKT6 SMR:E9PKT6 PRIDE:E9PKT6
            Ensembl:ENST00000528741 ArrayExpress:E9PKT6 Bgee:E9PKT6
            Uniprot:E9PKT6
        Length = 134

 Score = 266 (98.7 bits), Expect = 4.8e-23, P = 4.8e-23
 Identities = 54/137 (39%), Positives = 78/137 (56%)

Query:    84 LGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGA-VTG 142
             + +N+F+D +  E +     +K      ++   T  ++       P S+DWRKKG  V+ 
Sbjct:     1 MALNQFSDMSFAEIK-----HKYLWSEPQNCSATKSNYLRGTGPYPPSVDWRKKGNFVSP 55

Query:   143 VKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFE 202
             VK+QG CG CW FS   A+E    I T K+ SL+EQ+LVDC     + GC+GGL   AFE
Sbjct:    56 VKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFE 115

Query:   203 FIISNKGLATEAKYPYK 219
             +I+ NKG+  E  YPY+
Sbjct:   116 YILYNKGIMGEDTYPYQ 132


>UNIPROTKB|E2QV47 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0097208 "alveolar lamellar body"
            evidence=IEA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0070371 "ERK1 and ERK2 cascade"
            evidence=IEA] [GO:0070324 "thyroid hormone binding" evidence=IEA]
            [GO:0060448 "dichotomous subdivision of terminal units involved in
            lung branching" evidence=IEA] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IEA] [GO:0043129 "surfactant homeostasis"
            evidence=IEA] [GO:0043066 "negative regulation of apoptotic
            process" evidence=IEA] [GO:0033619 "membrane protein proteolysis"
            evidence=IEA] [GO:0032526 "response to retinoic acid" evidence=IEA]
            [GO:0031648 "protein destabilization" evidence=IEA] [GO:0031638
            "zymogen activation" evidence=IEA] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=IEA] [GO:0016505
            "apoptotic protease activator activity" evidence=IEA] [GO:0010815
            "bradykinin catabolic process" evidence=IEA] [GO:0010813
            "neuropeptide catabolic process" evidence=IEA] [GO:0010634
            "positive regulation of epithelial cell migration" evidence=IEA]
            [GO:0010628 "positive regulation of gene expression" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=IEA] [GO:0005829 "cytosol" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0004252 "serine-type endopeptidase activity"
            evidence=IEA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IEA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 GO:GO:0016505 GO:GO:0010634
            GO:GO:0004197 GO:GO:0042599 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 Ensembl:ENSCAFT00000036196 Uniprot:E2QV47
        Length = 136

 Score = 258 (95.9 bits), Expect = 3.4e-22, P = 3.4e-22
 Identities = 56/137 (40%), Positives = 80/137 (58%)

Query:   213 EAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVA-NQPVSVAIDASGSDF 271
             E  YPYK  DG C K + + + A +    ++  N+E A+++AVA   PVS A + + SDF
Sbjct:     3 EDSYPYKGQDGDC-KYQPSKAIAFVKDVANITINDEQAMVEAVALYNPVSFAFEVT-SDF 60

Query:   272 QFYSSGVFTG-QCGT---ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
               Y  G+++   C     +++H V AVGYG   +G  YW+VKNSWG  WG NGY  M+R 
Sbjct:    61 MMYRKGIYSSTSCHKTPDKVNHAVLAVGYGE-QNGIPYWIVKNSWGPQWGMNGYFLMERG 119

Query:   328 IDAKEGLCGIAMQASYP 344
                 + +CG+A  ASYP
Sbjct:   120 ----KNMCGLAACASYP 132


>DICTYBASE|DDB_G0280187 [details] [associations]
            symbol:DDB_G0280187 "cathepsin Z-like protein"
            species:44689 "Dictyostelium discoideum" [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            dictyBase:DDB_G0280187 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            EMBL:AAFI02000035 KO:K08568 RefSeq:XP_641294.1
            ProteinModelPortal:Q54VR1 MEROPS:C01.A61 PRIDE:Q54VR1
            EnsemblProtists:DDB0233838 GeneID:8622427 KEGG:ddi:DDB_G0280187
            InParanoid:Q54VR1 OMA:VWKVGDY Uniprot:Q54VR1
        Length = 291

 Score = 183 (69.5 bits), Expect = 5.2e-22, Sum P(2) = 5.2e-22
 Identities = 37/99 (37%), Positives = 64/99 (64%)

Query:   246 NNEAALMKAV-ANQPVSVAIDASGSDFQFYSSGVFTGQCGT--ELDHGVTAVGYGTADDG 302
             N   A+M+ + A  P++  ++ + + F+ Y+SGVFT   G+  E++H ++ +G+GT ++G
Sbjct:   190 NGSVAMMQEIFARGPIACGMEVTDA-FESYTSGVFTSSVGSTGEINHEISIIGWGT-ENG 247

Query:   303 TKYWLVKNSWGTTWGENGYIRMQRDID--AKEGLCGIAM 339
               YW+ +NSWGT +GE G+ R+QR ID  + E  C  A+
Sbjct:   248 VDYWIGRNSWGTYFGELGFFRIQRGIDLLSIESACDWAV 286

 Score = 127 (49.8 bits), Expect = 5.2e-22, Sum P(2) = 5.2e-22
 Identities = 42/137 (30%), Positives = 63/137 (45%)

Query:    99 APRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRK-KGA--VTGVKDQG---QCGCC 152
             AP +  K +LPS    E T          +P   DWR   G+  +T  ++Q     CG C
Sbjct:    30 APTSIIKSQLPSEYIDEDT----------LPTQYDWRNISGSSYITITRNQHLPQYCGSC 79

Query:   153 WAFSAVAAMEG---INHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
             WA    +A+     I    T     L+ Q L++C  +G D  C+GG   +A+ ++ + KG
Sbjct:    80 WAHGTTSALGDRIKIGRKGTFPEVVLAPQVLLNC--AGPDNTCDGGDPTEAYAYMAA-KG 136

Query:   210 LATEAKYPYKASDGSCN 226
             +  E   PY+A D  CN
Sbjct:   137 ITDETCAPYEAIDNECN 153


>TAIR|locus:505006093 [details] [associations]
            symbol:AT1G02305 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0005829 "cytosol" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684 GO:GO:0005773
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 HOGENOM:HOG000241341
            KO:K01363 PANTHER:PTHR12411:SF16 OMA:CCGFLCG UniGene:At.23486
            UniGene:At.42610 UniGene:At.43952 EMBL:AY039887 EMBL:AF428337
            EMBL:BT002227 IPI:IPI00524601 RefSeq:NP_563648.1 HSSP:P07858
            ProteinModelPortal:Q93VC9 SMR:Q93VC9 IntAct:Q93VC9 STRING:Q93VC9
            MEROPS:C01.049 PRIDE:Q93VC9 ProMEX:Q93VC9 EnsemblPlants:AT1G02305.1
            GeneID:839538 KEGG:ath:AT1G02305 TAIR:At1g02305 InParanoid:Q93VC9
            PhylomeDB:Q93VC9 ProtClustDB:CLSN2687619 Genevestigator:Q93VC9
            Uniprot:Q93VC9
        Length = 362

 Score = 183 (69.5 bits), Expect = 5.3e-22, Sum P(2) = 5.3e-22
 Identities = 38/102 (37%), Positives = 57/102 (55%)

Query:   237 ISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD-HGVTAVG 295
             +S Y+ V S+ +  + +   N PV VA      DF  Y SGV+    GT +  H V  +G
Sbjct:   238 VSAYK-VRSHPDDIMAEVYKNGPVEVAFTVY-EDFAHYKSGVYKHITGTNIGGHAVKLIG 295

Query:   296 YGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
             +GT+DDG  YWL+ N W  +WG++GY +++R  +     CGI
Sbjct:   296 WGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNE----CGI 333

 Score = 135 (52.6 bits), Expect = 5.3e-22, Sum P(2) = 5.3e-22
 Identities = 52/187 (27%), Positives = 77/187 (41%)

Query:    52 DNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINE-FADQTNEEFRAPRNGYKRRLPS 110
             +N  K+       +N E +   N    N  +K   N+ FA+ T  EF+    G K   P 
Sbjct:    34 ENLSKQKLTSWILQN-EIVKEVNENP-NAGWKASFNDRFANATVAEFKRLL-GVKPT-PK 89

Query:   111 VRSSETTDVSFRYENASVPASID----WRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINH 166
                     VS    +  +P   D    W +  ++  + DQG CG CWAF AV ++     
Sbjct:    90 TEFLGVPIVSHDI-SLKLPKEFDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFC 148

Query:   167 ITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCN 226
             I      SLS  +L+ C      QGC GG    A+ +   + G+ TE   PY  + G C+
Sbjct:   149 IKYNMNVSLSVNDLLACCGFLCGQGCNGGYPIAAWRYF-KHHGVVTEECDPYFDNTG-CS 206

Query:   227 KKEANPS 233
                  P+
Sbjct:   207 HPGCEPA 213


>DICTYBASE|DDB_G0288563 [details] [associations]
            symbol:DDB_G0288563 species:44689 "Dictyostelium
            discoideum" [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0288563
            GO:GO:0005615 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197
            EMBL:AAFI02000117 PANTHER:PTHR12411:SF16 RefSeq:XP_636643.1
            MEROPS:C01.A58 PRIDE:Q54IS1 EnsemblProtists:DDB0187993
            GeneID:8626689 KEGG:ddi:DDB_G0288563 InParanoid:Q54IS1 OMA:AWEYMEL
            Uniprot:Q54IS1
        Length = 314

 Score = 250 (93.1 bits), Expect = 2.4e-21, P = 2.4e-21
 Identities = 69/210 (32%), Positives = 103/210 (49%)

Query:   127 SVPASIDWRKK--GAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTS---LSEQELV 181
             S+P S D R +    +  + +Q QCG CWAFS+   +     I +   T+   LS Q LV
Sbjct:    87 SIPTSFDSRVQWPDCIHPILNQEQCGSCWAFSSSEVLSDRLCIASNNKTNPGALSPQTLV 146

Query:   182 DCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAK-ISGY 240
              CD  G D GC GG+   A+E++   KGL T++  PY A +G+    + + S ++  S Y
Sbjct:   147 ACDVYGND-GCSGGIPQLAWEYM-ELKGLPTDSCVPYTAGNGTVYSCQRSCSDSEDYSLY 204

Query:   241 EDVP------SNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTEL--DHGVT 292
                P      S+ +      +A  P+   ++    DF  YSSGV+    G+ L   H + 
Sbjct:   205 RAKPFTLKTCSSVQCIQENILAYGPIVGTMEVY-EDFMSYSSGVYVMTPGSSLLGGHAIK 263

Query:   293 AVGYGTADDGT-KYWLVKNSWGTTWGENGY 321
              VG+G        YW+V NSWG  WG+ G+
Sbjct:   264 IVGWGFDQTSQLNYWIVANSWGADWGQQGF 293


>ZFIN|ZDB-GENE-070323-1 [details] [associations]
            symbol:ctsbb "cathepsin B, b" species:7955 "Danio
            rerio" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-070323-1 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197
            GeneTree:ENSGT00560000076599 PANTHER:PTHR12411:SF16 OMA:CCGFLCG
            EMBL:CU207296 EMBL:CABZ01037785 IPI:IPI00877452
            Ensembl:ENSDART00000097263 Bgee:F1QZT5 Uniprot:F1QZT5
        Length = 326

 Score = 173 (66.0 bits), Expect = 1.3e-20, Sum P(2) = 1.3e-20
 Identities = 40/106 (37%), Positives = 56/106 (52%)

Query:   242 DVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD-HGVTAVGYGTAD 300
             +VPS+ +  + +   N PV  A      DF  Y SGV+    G+ L  H V  +G+G  +
Sbjct:   225 NVPSDQQQIMTELYTNGPVEAAFTVY-EDFPLYKSGVYQHLTGSALGGHAVKILGWGE-E 282

Query:   301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA--MQASYP 344
             +GT +WLV NSW + WG+NGY ++ R  D     CGI   M A  P
Sbjct:   283 NGTPFWLVANSWNSDWGDNGYFKILRGHDE----CGIESEMVAGLP 324

 Score = 131 (51.2 bits), Expect = 1.3e-20, Sum P(2) = 1.3e-20
 Identities = 50/175 (28%), Positives = 75/175 (42%)

Query:    72 SFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPAS 131
             SF N AR+  +  G+N F D   +++     G   + P  R   T   S    N  +P S
Sbjct:    27 SFINAARST-WTAGVN-F-DNVPKKYLKSLCGTVLKGP--RLPHTVKHS---TNVKLPDS 78

Query:   132 ID----WRKKGAVTGVKDQGQCGCCWAFSAVAAMEG--INHITTRKLTSLSEQELVDC-D 184
              D    W     +  ++DQG CG CWAF AV ++      H   ++   +S ++L+ C D
Sbjct:    79 FDLRDQWPNCKTLNQIRDQGSCGSCWAFGAVESISDRICIHSKGKQSPEISAEDLLSCCD 138

Query:   185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
               G   GC GG   +A+++     GL T   Y    SD  C      P    ++G
Sbjct:   139 QCGF--GCSGGFPAEAWDYW-RRSGLVTGGLYN---SDVGCRPYSIAPCEHHVNG 187


>TAIR|locus:2133402 [details] [associations]
            symbol:AT4G01610 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0005829 "cytosol" evidence=IDA] [GO:0000902 "cell
            morphogenesis" evidence=RCA] [GO:0006635 "fatty acid
            beta-oxidation" evidence=RCA] [GO:0010162 "seed dormancy process"
            evidence=RCA] [GO:0016049 "cell growth" evidence=RCA] [GO:0048193
            "Golgi vesicle transport" evidence=RCA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0005773 EMBL:CP002687
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 eggNOG:NOG315657
            HOGENOM:HOG000241341 KO:K01363 PANTHER:PTHR12411:SF16 OMA:DAIPDHF
            HSSP:P07858 ProtClustDB:CLSN2687619 EMBL:AF370193 EMBL:AY065167
            EMBL:AY114015 EMBL:AY086034 EMBL:AF083797 EMBL:BT001190
            EMBL:AK175280 EMBL:AK175481 EMBL:AK175539 EMBL:AK176165
            EMBL:AK176244 EMBL:AK176281 EMBL:AK176330 EMBL:AK176416
            EMBL:AK176433 EMBL:AK176487 EMBL:AK221398 EMBL:AK230235
            IPI:IPI00530811 RefSeq:NP_567215.1 UniGene:At.24471
            ProteinModelPortal:Q94K85 SMR:Q94K85 STRING:Q94K85 MEROPS:C01.144
            PaxDb:Q94K85 PRIDE:Q94K85 EnsemblPlants:AT4G01610.1 GeneID:826792
            KEGG:ath:AT4G01610 TAIR:At4g01610 InParanoid:Q94K85
            PhylomeDB:Q94K85 Genevestigator:Q94K85 Uniprot:Q94K85
        Length = 359

 Score = 170 (64.9 bits), Expect = 1.3e-20, Sum P(2) = 1.3e-20
 Identities = 41/129 (31%), Positives = 60/129 (46%)

Query:   213 EAKYPYKASDGSC---NKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
             E  YP       C   NK  +      +S Y  V SN +  + +   N PV V+      
Sbjct:   208 EPAYPTPKCSRKCVSDNKLWSESKHYSVSTYT-VKSNPQDIMAEVYKNGPVEVSFTVY-E 265

Query:   270 DFQFYSSGVFTGQCGTELD-HGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
             DF  Y SGV+    G+ +  H V  +G+GT+ +G  YWL+ N W   WG++GY  ++R  
Sbjct:   266 DFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRRGT 325

Query:   329 DAKEGLCGI 337
             +     CGI
Sbjct:   326 NE----CGI 330

 Score = 137 (53.3 bits), Expect = 1.3e-20, Sum P(2) = 1.3e-20
 Identities = 47/163 (28%), Positives = 73/163 (44%)

Query:    75 NKARNKPYKLGINE-FADQTNEEFR---APRNGYKRRLPSVRSSETTDVSFRYENASVPA 130
             N+  N  +K  IN+ F++ T  EF+     +   K+    V    + D S +   A   A
Sbjct:    52 NENPNAGWKAAINDRFSNATVAEFKRLLGVKPTPKKHFLGV-PIVSHDPSLKLPKA-FDA 109

Query:   131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQ 190
                W +  ++  + DQG CG CWAF AV ++     I      SLS  +L+ C       
Sbjct:   110 RTAWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLACCGFRCGD 169

Query:   191 GCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPS 233
             GC+GG    A+++  S  G+ TE   PY  + G C+     P+
Sbjct:   170 GCDGGYPIAAWQYF-SYSGVVTEECDPYFDNTG-CSHPGCEPA 210


>UNIPROTKB|P07858 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9606 "Homo sapiens"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0005739 "mitochondrion" evidence=IEA] [GO:0042470 "melanosome"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042981 "regulation of apoptotic process" evidence=TAS]
            [GO:0006508 "proteolysis" evidence=IDA] [GO:0005764 "lysosome"
            evidence=IDA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEP] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IDA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IDA] [GO:0005622 "intracellular" evidence=TAS]
            [GO:0036021 "endolysosome lumen" evidence=TAS] [GO:0045087 "innate
            immune response" evidence=TAS] [GO:0008233 "peptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0005730 "nucleolus" evidence=IDA] [GO:0043231 "intracellular
            membrane-bounded organelle" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_118779 GO:GO:0005739
            GO:GO:0042470 GO:GO:0048471 Reactome:REACT_6900 GO:GO:0005615
            GO:GO:0009612 GO:GO:0009611 GO:GO:0005730 GO:GO:0042981
            GO:GO:0009897 GO:GO:0045471 GO:GO:0016324 GO:GO:0009749
            GO:GO:0006914 GO:GO:0043434 eggNOG:COG4870 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087
            GO:GO:0050790 GO:GO:0042383 GO:GO:0014070 GO:GO:0042277
            GO:GO:0060548 GO:GO:0005901 GO:GO:0014075 GO:GO:0004197
            GO:GO:0070670 EMBL:CH471157 GO:GO:0007519 MEROPS:C01.060 CTD:1508
            HOGENOM:HOG000241341 HOVERGEN:HBG003480 KO:K01363 OMA:GYPSGAW
            OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 GO:GO:0097067
            PANTHER:PTHR12411:SF16 EMBL:M14221 EMBL:L16510 EMBL:AK092070
            EMBL:AK075393 EMBL:BC010240 EMBL:BC095408 EMBL:M13230
            IPI:IPI00295741 PIR:A26498 RefSeq:NP_001899.1 RefSeq:NP_680090.1
            RefSeq:NP_680091.1 RefSeq:NP_680092.1 RefSeq:NP_680093.1
            UniGene:Hs.520898 PDB:1CSB PDB:1GMY PDB:1HUC PDB:1PBH PDB:2IPP
            PDB:2PBH PDB:3AI8 PDB:3CBJ PDB:3CBK PDB:3K9M PDB:3PBH PDBsum:1CSB
            PDBsum:1GMY PDBsum:1HUC PDBsum:1PBH PDBsum:2IPP PDBsum:2PBH
            PDBsum:3AI8 PDBsum:3CBJ PDBsum:3CBK PDBsum:3K9M PDBsum:3PBH
            ProteinModelPortal:P07858 SMR:P07858 DIP:DIP-42785N IntAct:P07858
            MINT:MINT-1397666 STRING:P07858 PhosphoSite:P07858 DMDM:68067549
            SWISS-2DPAGE:P07858 UCD-2DPAGE:P07858 PaxDb:P07858
            PeptideAtlas:P07858 PRIDE:P07858 DNASU:1508 Ensembl:ENST00000345125
            Ensembl:ENST00000353047 Ensembl:ENST00000434271
            Ensembl:ENST00000453527 Ensembl:ENST00000530640
            Ensembl:ENST00000531089 Ensembl:ENST00000533455
            Ensembl:ENST00000534510 GeneID:1508 KEGG:hsa:1508 UCSC:uc003wum.3
            GeneCards:GC08M011700 H-InvDB:HIX0007320 HGNC:HGNC:2527
            HPA:CAB000457 HPA:HPA018156 MIM:116810 neXtProt:NX_P07858
            PharmGKB:PA27027 InParanoid:P07858 PhylomeDB:P07858
            BindingDB:P07858 ChEMBL:CHEMBL4072 ChiTaRS:CTSB
            EvolutionaryTrace:P07858 GenomeRNAi:1508 NextBio:6235
            PMAP-CutDB:P07858 ArrayExpress:P07858 Bgee:P07858 CleanEx:HS_CTSB
            Genevestigator:P07858 GermOnline:ENSG00000164733 GO:GO:0036021
            Uniprot:P07858
        Length = 339

 Score = 167 (63.8 bits), Expect = 1.8e-20, Sum P(2) = 1.8e-20
 Identities = 40/102 (39%), Positives = 54/102 (52%)

Query:   239 GYEDVP-SNNEAALMKAV-ANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD-HGVTAVG 295
             GY     SN+E  +M  +  N PV  A     SDF  Y SGV+    G  +  H +  +G
Sbjct:   226 GYNSYSVSNSEKDIMAEIYKNGPVEGAFSVY-SDFLLYKSGVYQHVTGEMMGGHAIRILG 284

Query:   296 YGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
             +G  ++GT YWLV NSW T WG+NG+ ++ R  D     CGI
Sbjct:   285 WGV-ENGTPYWLVANSWNTDWGDNGFFKILRGQDH----CGI 321

 Score = 138 (53.6 bits), Expect = 1.8e-20, Sum P(2) = 1.8e-20
 Identities = 37/122 (30%), Positives = 54/122 (44%)

Query:   124 ENASVPASID----WRKKGAVTGVKDQGQCGCCWAFSAVAAMEG--INHITTRKLTSLSE 177
             E+  +PAS D    W +   +  ++DQG CG CWAF AV A+      H        +S 
Sbjct:    76 EDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSA 135

Query:   178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
             ++L+ C  S    GC GG   +A+ F  + KGL +   Y    S   C      P    +
Sbjct:   136 EDLLTCCGSMCGDGCNGGYPAEAWNFW-TRKGLVSGGLYE---SHVGCRPYSIPPCEHHV 191

Query:   238 SG 239
             +G
Sbjct:   192 NG 193


>UNIPROTKB|Q9UBR2
            symbol:CTSZ "Cathepsin Z" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0060441 "epithelial tube
            branching involved in lung morphogenesis" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            Reactome:REACT_11123 Reactome:REACT_17015 InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 EMBL:CH471077 GO:GO:0005615 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AL109840 GO:GO:0060441 MEROPS:C01.013 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN
            BRENDA:3.4.18.1 EMBL:AF073890 EMBL:AF032906 EMBL:AF136273
            EMBL:AF136276 EMBL:AF136274 EMBL:AF136275 EMBL:AK314931
            EMBL:BC042168 EMBL:AF009923 IPI:IPI00002745 RefSeq:NP_001327.2
            UniGene:Hs.252549 PDB:1DEU PDB:1EF7 PDBsum:1DEU PDBsum:1EF7
            ProteinModelPortal:Q9UBR2 SMR:Q9UBR2 STRING:Q9UBR2 DMDM:12643324
            PaxDb:Q9UBR2 PeptideAtlas:Q9UBR2 PRIDE:Q9UBR2 DNASU:1522
            Ensembl:ENST00000217131 GeneID:1522 KEGG:hsa:1522 UCSC:uc002yai.2
            GeneCards:GC20M057570 HGNC:HGNC:2547 HPA:CAB025114 MIM:603169
            neXtProt:NX_Q9UBR2 PharmGKB:PA27043 InParanoid:Q9UBR2 OMA:QCGTCTE
            PhylomeDB:Q9UBR2 BindingDB:Q9UBR2 ChEMBL:CHEMBL4160 ChiTaRS:CTSZ
            EvolutionaryTrace:Q9UBR2 GenomeRNAi:1522 NextBio:6299 Bgee:Q9UBR2
            CleanEx:HS_CTSZ Genevestigator:Q9UBR2 GermOnline:ENSG00000101160
            Uniprot:Q9UBR2
        Length = 303

 Score = 240 (89.5 bits), Expect = 2.7e-20, P = 2.7e-20
 Identities = 70/223 (31%), Positives = 110/223 (49%)

Query:   126 ASVPASIDWRKKGAVTGVK-DQGQ-----CGCCWAFSAVAAMEGINHITTRKL---TSLS 176
             A +P S DWR    V      + Q     CG CWA ++ +AM    +I  +     T LS
Sbjct:    60 ADLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTLLS 119

Query:   177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASD---------GSCNK 227
              Q ++DC  +G    CEGG     +++   + G+  E    Y+A D         G+CN+
Sbjct:   120 VQNVIDCGNAGS---CEGGNDLSVWDYAHQH-GIPDETCNNYQAKDQECDKFNQCGTCNE 175

Query:   228 -KEA----NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTG- 281
              KE     N +  ++  Y  + S  E  + +  AN P+S  I A+      Y+ G++   
Sbjct:   176 FKECHAIRNYTLWRVGDYGSL-SGREKMMAEIYANGPISCGIMAT-ERLANYTGGIYAEY 233

Query:   282 QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
             Q  T ++H V+  G+G +D GT+YW+V+NSWG  WGE G++R+
Sbjct:   234 QDTTYINHVVSVAGWGISD-GTEYWIVRNSWGEPWGERGWLRI 275


>ZFIN|ZDB-GENE-040426-2650
            symbol:ctsba "cathepsin B, a" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0031101 "fin regeneration"
            evidence=IEP] [GO:0008233 "peptidase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 ZFIN:ZDB-GENE-040426-2650 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790
            GO:GO:0004197 GO:GO:0031101 MEROPS:C01.060 HOVERGEN:HBG003480
            PANTHER:PTHR12411:SF16 HSSP:P07688 EMBL:BC044517 IPI:IPI00485996
            UniGene:Dr.3374 ProteinModelPortal:Q803E4 SMR:Q803E4 STRING:Q803E4
            PRIDE:Q803E4 InParanoid:Q803E4 ArrayExpress:Q803E4 Bgee:Q803E4
            Uniprot:Q803E4
        Length = 330

 Score = 172 (65.6 bits), Expect = 6.3e-20, Sum P(2) = 6.3e-20
 Identities = 37/96 (38%), Positives = 49/96 (51%)

Query:   243 VPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD-HGVTAVGYGTADD 301
             VPSN    + +   N PV  A      DF  Y SGV+    G+ L  H +  +G+G  ++
Sbjct:   231 VPSNQNGIMAELFKNGPVEAAFTVY-EDFLLYKSGVYQHMSGSALGGHAIKILGWGE-EN 288

Query:   302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
             G  YWL  NSW T WG+NGY ++ R  D     CGI
Sbjct:   289 GVPYWLAANSWNTDWGDNGYFKILRGEDH----CGI 320

 Score = 126 (49.4 bits), Expect = 6.3e-20, Sum P(2) = 6.3e-20
 Identities = 40/129 (31%), Positives = 58/129 (44%)

Query:    97 FRAPRNGYKRRL-PSVRSSETTDVSFRY-ENASVPASID----WRKKGAVTGVKDQGQCG 150
             FR     Y +RL  +        V  +Y E   +P + D    W     +  ++DQG CG
Sbjct:    46 FRDVDYSYVKRLCGTFLKGPKLPVMVQYTEGLKLPKNFDAREQWPNCPTLKEIRDQGSCG 105

Query:   151 CCWAFSAVAAMEGINHITTRKLTS--LSEQELVDC-DTSGEDQGCEGGLMDDAFEFIISN 207
              CWAF A  A+     I +    S  +S Q+L+ C D+ G   GC GG    A++F  ++
Sbjct:   106 SCWAFGAAEAISDRVCIQSNAKVSVEISSQDLLTCCDSCG--MGCNGGYPSAAWDFWTTD 163

Query:   208 KGLATEAKY 216
              GL T   Y
Sbjct:   164 -GLVTGGLY 171


>WB|WBGene00010204
            symbol:F57F5.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0006898
            "receptor-mediated endocytosis" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0009792 GO:GO:0006898
            GO:GO:0040007 GO:GO:0002119 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0040011
            EMBL:Z75953 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            PANTHER:PTHR12411:SF16 RefSeq:NP_506011.2 ProteinModelPortal:Q20950
            SMR:Q20950 DIP:DIP-24447N IntAct:Q20950 MINT:MINT-211137
            STRING:Q20950 MEROPS:C01.A42 EnsemblMetazoa:F57F5.1 GeneID:179645
            KEGG:cel:CELE_F57F5.1 UCSC:F57F5.1 CTD:179645 WormBase:F57F5.1
            OMA:ADDINAC Uniprot:Q20950
        Length = 351

 Score = 173 (66.0 bits), Expect = 7.0e-20, Sum P(2) = 7.0e-20
 Identities = 39/95 (41%), Positives = 49/95 (51%)

Query:   245 SNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTGQCGTELD-HGVTAVGYGTADDG 302
             S   A + K +    PV VA      DF+ YS GV+    G  L  H V  +G+G  D+G
Sbjct:   252 SKKAAEIQKEIMTHGPVEVAFTVY-EDFEHYSGGVYVHTAGASLGGHAVKMLGWGV-DNG 309

Query:   303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
             T YWL  NSW   WGENGY R+ R ++     CGI
Sbjct:   310 TPYWLCANSWNEDWGENGYFRIIRGVNE----CGI 340

 Score = 126 (49.4 bits), Expect = 7.0e-20, Sum P(2) = 7.0e-20
 Identities = 50/193 (25%), Positives = 78/193 (40%)

Query:    64 KENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKR-RLPS-VRSSETTDVSF 121
             +E V+Y+    NK +   +K  +  +     +  +    G K   +P   R  E T    
Sbjct:    38 QELVDYV----NKVQTS-FKAELGSYFSSYPDTIKKQLMGAKMVEIPEEYRVFEMTHPEV 92

Query:   122 RYENASVPASID----WRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLT--SL 175
               E+A+VP S D    W    +++ ++DQ  CG CWA SA   +     I +   T  S+
Sbjct:    93 --EDAAVPDSFDSRTAWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNAKTILSI 150

Query:   176 SEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAA 235
             S  ++  C       GC GG   +A+   +  KG  T   Y  K     C      P   
Sbjct:   151 SADDINACCGMVCGNGCNGGYPIEAWRHYVK-KGYVTGGSYQDKTG---CKPYPYPPCEH 206

Query:   236 KISG--YEDVPSN 246
              ++G  Y+  PSN
Sbjct:   207 HVNGTHYKPCPSN 219


>RGD|708479
            symbol:Ctsz "cathepsin Z" species:10116 "Rattus norvegicus"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=TAS]
            [GO:0005615 "extracellular space" evidence=IEA;ISO] [GO:0005783
            "endoplasmic reticulum" evidence=IEA;ISO] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0060441 "epithelial tube branching involved in
            lung morphogenesis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:708479 GO:GO:0005576 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004197 MEROPS:C01.013 CTD:1522 HOVERGEN:HBG004456 KO:K08568
            EMBL:AB023781 EMBL:BC091110 IPI:IPI00207663 RefSeq:NP_899159.1
            UniGene:Rn.1475 ProteinModelPortal:Q9R1T3 SMR:Q9R1T3 PRIDE:Q9R1T3
            GeneID:252929 KEGG:rno:252929 BindingDB:Q9R1T3 NextBio:624097
            Genevestigator:Q9R1T3 Uniprot:Q9R1T3
        Length = 306

 Score = 236 (88.1 bits), Expect = 7.2e-20, P = 7.2e-20
 Identities = 66/223 (29%), Positives = 109/223 (48%)

Query:   126 ASVPASIDWRKKGAV---TGVKDQG---QCGCCWAFSAVAAMEGINHITTRKL---TSLS 176
             A +P + DWR    V   +  ++Q     CG CWA  + +A+    +I  +     T LS
Sbjct:    62 ADLPKNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSALADRINIKRKGAWPSTLLS 121

Query:   177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNK--------- 227
              Q ++DC  +G    CEGG     +E+   + G+  E    Y+A D  C+K         
Sbjct:   122 VQNVIDCGNAGS---CEGGNDLPVWEYAHKH-GIPDETCNNYQAKDQECDKFNQCGTCTE 177

Query:   228 -KEA----NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTG- 281
              KE     N +  ++  Y  + S  E  + +  AN P+S  I A+      Y+ G++T  
Sbjct:   178 FKECHTIQNYTLWRVGDYGSL-SGREKMMAEIYANGPISCGIMAT-ERMSNYTGGIYTEY 235

Query:   282 QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
             Q    ++H ++  G+G ++DG +YW+V+NSWG  WGE G++R+
Sbjct:   236 QNQAIINHIISVAGWGVSNDGIEYWIVRNSWGEPWGERGWMRI 278


>RGD|621509
            symbol:Ctsb "cathepsin B" species:10116 "Rattus norvegicus"
            [GO:0004175 "endopeptidase activity" evidence=IMP;IDA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA;ISO;IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005576
            "extracellular region" evidence=IDA] [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0005730 "nucleolus" evidence=IEA;ISO]
            [GO:0005737 "cytoplasm" evidence=IDA] [GO:0005739 "mitochondrion"
            evidence=IEA;ISO;IDA] [GO:0005764 "lysosome" evidence=IEA;ISO;IDA]
            [GO:0006508 "proteolysis" evidence=IEA;IEP;ISO;IMP;IDA;TAS]
            [GO:0006914 "autophagy" evidence=IEP] [GO:0006950 "response to
            stress" evidence=IEP] [GO:0007283 "spermatogenesis" evidence=IEP]
            [GO:0007519 "skeletal muscle tissue development" evidence=IEP]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0009611
            "response to wounding" evidence=IEP] [GO:0009612 "response to
            mechanical stimulus" evidence=IEP] [GO:0009749 "response to glucose
            stimulus" evidence=IEP] [GO:0009897 "external side of plasma
            membrane" evidence=IDA] [GO:0009986 "cell surface" evidence=IDA]
            [GO:0014070 "response to organic cyclic compound" evidence=IEP]
            [GO:0014075 "response to amine stimulus" evidence=IEP] [GO:0016324
            "apical plasma membrane" evidence=IDA] [GO:0030984 "kininogen
            binding" evidence=IPI] [GO:0032403 "protein complex binding"
            evidence=IPI] [GO:0034097 "response to cytokine stimulus"
            evidence=IEP] [GO:0042277 "peptide binding" evidence=IDA]
            [GO:0042383 "sarcolemma" evidence=IDA] [GO:0042470 "melanosome"
            evidence=IEA] [GO:0043231 "intracellular membrane-bounded
            organelle" evidence=ISO] [GO:0043434 "response to peptide hormone
            stimulus" evidence=IEP] [GO:0043621 "protein self-association"
            evidence=IDA] [GO:0045471 "response to ethanol" evidence=IEP]
            [GO:0048471 "perinuclear region of cytoplasm" evidence=ISO;IDA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0060548 "negative regulation of cell death" evidence=IMP]
            [GO:0070670 "response to interleukin-4" evidence=IEP] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA;ISO]
            [GO:0005901 "caveola" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:621509 GO:GO:0005739 GO:GO:0042470
            GO:GO:0005615 GO:GO:0009612 GO:GO:0009611 GO:GO:0009897
            GO:GO:0045471 GO:GO:0016324 GO:GO:0009749 GO:GO:0006914
            GO:GO:0043434 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0007283
            GO:GO:0005764 GO:GO:0042383 GO:GO:0043621 GO:GO:0014070
            GO:GO:0042277 GO:GO:0060548 GO:GO:0014075 GO:GO:0004197
            GO:GO:0070670 GO:GO:0007519 MEROPS:C01.060 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 GO:GO:0097067
            PANTHER:PTHR12411:SF16 EMBL:X82396 EMBL:M11305 IPI:IPI00212811
            PIR:S51041 UniGene:Rn.100909 PDB:1CPJ PDB:1CTE PDB:1MIR PDB:1THE
            PDBsum:1CPJ PDBsum:1CTE PDBsum:1MIR PDBsum:1THE
            ProteinModelPortal:P00787 SMR:P00787 STRING:P00787 PRIDE:P00787
            UCSC:RGD:621509 InParanoid:P00787 SABIO-RK:P00787 BindingDB:P00787
            ChEMBL:CHEMBL2602 EvolutionaryTrace:P00787 ArrayExpress:P00787
            Genevestigator:P00787 GermOnline:ENSRNOG00000010331 Uniprot:P00787
        Length = 339

 Score = 160 (61.4 bits), Expect = 8.2e-20, Sum P(2) = 8.2e-20
 Identities = 44/121 (36%), Positives = 61/121 (50%)

Query:   225 CNKK-EANPSAA----KISGYEDVP-SNNEAALMKAV-ANQPVSVAIDASGSDFQFYSSG 277
             CNK  EA  S +    K  GY     S++E  +M  +  N PV  A     SDF  Y SG
Sbjct:   207 CNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVF-SDFLTYKSG 265

Query:   278 VFTGQCGTELD-HGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
             V+  + G  +  H +  +G+G  ++G  YWLV NSW   WG+NG+ ++ R     E  CG
Sbjct:   266 VYKHEAGDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNGFFKILRG----ENHCG 320

Query:   337 I 337
             I
Sbjct:   321 I 321

 Score = 140 (54.3 bits), Expect = 8.2e-20, Sum P(2) = 8.2e-20
 Identities = 54/201 (26%), Positives = 82/201 (40%)

Query:    70 IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGY-------KRRLPSVRSSETTD--VS 120
             + S ++K  + P    +  + ++ N  ++A RN Y       K+   +V         V 
Sbjct:    14 LTSAHDKPSSHPLSDDMINYINKQNTTWQAGRNFYNVDISYLKKLCGTVLGGPNLPERVG 73

Query:   121 FRYENASVPASID----WRKKGAVTGVKDQGQCGCCWAFSAVAAMEG--INHITTRKLTS 174
             F  E+ ++P S D    W     +  ++DQG CG CWAF AV AM      H   R    
Sbjct:    74 FS-EDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVE 132

Query:   175 LSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY-------PYKASDGSCNK 227
             +S ++L+ C       GC GG    A+ F  + KGL +   Y       PY      C +
Sbjct:   133 VSAEDLLTCCGIQCGDGCNGGYPSGAWNFW-TRKGLVSGGVYNSHIGCLPYTIPP--C-E 188

Query:   228 KEANPSAAKISGYEDVPSNNE 248
                N S    +G  D P  N+
Sbjct:   189 HHVNGSRPPCTGEGDTPKCNK 209


>UNIPROTKB|F1PIF2
            symbol:CTSZ "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0060441 "epithelial tube branching involved
            in lung morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0005783 GO:GO:0005615 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GO:GO:0060441 GeneTree:ENSGT00560000076599 OMA:QCGTCTE
            EMBL:AAEX03014054 Ensembl:ENSCAFT00000019357 Uniprot:F1PIF2
        Length = 261

 Score = 233 (87.1 bits), Expect = 1.5e-19, P = 1.5e-19
 Identities = 73/237 (30%), Positives = 112/237 (47%)

Query:   113 SSETTDVSFRYENAS-VPASIDWRKKGAV---TGVKDQG---QCGCCWAFSAVAAMEGIN 165
             SS T      Y + S +P S DWR    V   +  ++Q     CG CWA  + +AM    
Sbjct:     4 SSRTYPRPHEYLSPSDLPKSWDWRNVNGVNYASATRNQHIPQYCGSCWAHGSTSAMADRI 63

Query:   166 HITTRKL---TSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASD 222
             +I  +     T LS Q ++DC  +G    CEGG     + +   + G+  E    Y+A D
Sbjct:    64 NIKRKGAWPSTLLSVQHVLDCANAGS---CEGGNDLPVWSYAHEH-GIPDETCNNYQAKD 119

Query:   223 GSCNK----------KEA----NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
               CNK          KE     N +  ++  Y  + S  E  + +  AN P+S  I A+ 
Sbjct:   120 QECNKFNQCGTCTEFKECHAIQNYTLWRVGDYGSL-SGREKMMAEIYANGPISCGIMATE 178

Query:   269 SDFQFYSSGVFTG-QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
                  Y+ G+    Q    ++H ++ VG+G +D GT+YW+V+NSWG  WGE G++R+
Sbjct:   179 KMVN-YTGGIHAEYQEQAYINHVISVVGWGVSD-GTEYWIVRNSWGEPWGERGWMRI 233


>UNIPROTKB|E2R6Q7
            symbol:CTSB "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730 "nucleolus"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790
            GO:GO:0005764 GO:GO:0004197 CTD:1508 GeneTree:ENSGT00560000076599
            KO:K01363 OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16
            EMBL:AAEX03014318 RefSeq:XP_543203.3 Ensembl:ENSCAFT00000012692
            GeneID:486077 KEGG:cfa:486077 NextBio:20859923 Uniprot:E2R6Q7
        Length = 339

 Score = 173 (66.0 bits), Expect = 1.9e-19, Sum P(2) = 1.9e-19
 Identities = 40/95 (42%), Positives = 52/95 (54%)

Query:   245 SNNEAALMKAV-ANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD-HGVTAVGYGTADDG 302
             S+NE  +M  +  N PV  A     SDF  Y SGV+    G  +  H V  +G+G  +DG
Sbjct:   233 SDNEKEIMAEIYKNGPVEAAFTVY-SDFLLYKSGVYQHVTGEMMGGHAVRILGWGV-EDG 290

Query:   303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
             T YWLV NSW T WG+NG+ ++ R  D     CGI
Sbjct:   291 TPYWLVGNSWNTDWGDNGFFKILRGRDH----CGI 321

 Score = 121 (47.7 bits), Expect = 1.9e-19, Sum P(2) = 1.9e-19
 Identities = 36/124 (29%), Positives = 56/124 (45%)

Query:   124 ENASVPASID----WRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSL--SE 177
             +N  +P S D    W     +  ++DQG CG CWAF AV A+     I T    ++  S 
Sbjct:    76 KNLILPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEVSA 135

Query:   178 QELVDC--DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAA 235
             ++++ C  D  G+  GC GG   +A+ F  + +GL +   Y    S   C      P   
Sbjct:   136 EDMLTCCGDQCGD--GCNGGFPAEAWNFW-TKQGLVSGGLYD---SHVGCRPYSIPPCEH 189

Query:   236 KISG 239
              ++G
Sbjct:   190 HVNG 193


>MGI|MGI:1891190
            symbol:Ctsz "cathepsin Z" species:10090 "Mus musculus"
            [GO:0005615 "extracellular space" evidence=ISO] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1891190 GO:GO:0005783 GO:GO:0005615 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0060441 GeneTree:ENSGT00560000076599 MEROPS:C01.013 CTD:1522
            HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN OMA:QCGTCTE
            ChiTaRS:CTSZ EMBL:AJ242663 EMBL:AF136277 EMBL:AF136278
            EMBL:BC008619 IPI:IPI00986833 RefSeq:NP_071720.1 UniGene:Mm.156919
            ProteinModelPortal:Q9WUU7 SMR:Q9WUU7 IntAct:Q9WUU7 STRING:Q9WUU7
            PaxDb:Q9WUU7 PRIDE:Q9WUU7 Ensembl:ENSMUST00000016400 GeneID:64138
            KEGG:mmu:64138 InParanoid:Q9WUU7 NextBio:319927 Bgee:Q9WUU7
            CleanEx:MM_CTSZ Genevestigator:Q9WUU7 GermOnline:ENSMUSG00000016256
            Uniprot:Q9WUU7
        Length = 306

 Score = 230 (86.0 bits), Expect = 3.2e-19, P = 3.2e-19
 Identities = 67/223 (30%), Positives = 108/223 (48%)

Query:   126 ASVPASIDWRKKGAV---TGVKDQG---QCGCCWAFSAVAAM-EGINHITTRKLTS--LS 176
             A +P + DWR    V   +  ++Q     CG CWA  + +AM + IN        S  LS
Sbjct:    62 ADLPKNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSILLS 121

Query:   177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNK--------- 227
              Q ++DC  +G    CEGG     +E+   + G+  E    Y+A D  C+K         
Sbjct:   122 VQNVIDCGNAGS---CEGGNDLPVWEYAHKH-GIPDETCNNYQAKDQDCDKFNQCGTCTE 177

Query:   228 -KEA----NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTG- 281
              KE     N +  ++  Y  + S  E  + +  AN P+S  I A+      Y+ G++   
Sbjct:   178 FKECHTIQNYTLWRVGDYGSL-SGREKMMAEIYANGPISCGIMATEM-MSNYTGGIYAEH 235

Query:   282 QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
             Q    ++H ++  G+G ++DG +YW+V+NSWG  WGE G++R+
Sbjct:   236 QDQAVINHIISVAGWGVSNDGIEYWIVRNSWGEPWGEKGWMRI 278


>UNIPROTKB|P07688
            symbol:CTSB "Cathepsin B" species:9913 "Bos taurus"
            [GO:0042470 "melanosome" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730
            "nucleolus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0042470
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 EMBL:L06075 EMBL:M64620
            EMBL:U16336 EMBL:U16337 EMBL:U16338 EMBL:U16339 EMBL:U16341
            EMBL:U16342 EMBL:U16343 EMBL:BC102997 IPI:IPI00692061 PIR:S38328
            RefSeq:NP_776456.1 UniGene:Bt.393 PDB:1ITO PDB:1QDQ PDB:1SP4
            PDB:2DC6 PDB:2DC7 PDB:2DC8 PDB:2DC9 PDB:2DCA PDB:2DCB PDB:2DCC
            PDB:2DCD PDBsum:1ITO PDBsum:1QDQ PDBsum:1SP4 PDBsum:2DC6
            PDBsum:2DC7 PDBsum:2DC8 PDBsum:2DC9 PDBsum:2DCA PDBsum:2DCB
            PDBsum:2DCC PDBsum:2DCD ProteinModelPortal:P07688 SMR:P07688
            STRING:P07688 MEROPS:C01.060 PRIDE:P07688
            Ensembl:ENSBTAT00000036795 GeneID:281105 KEGG:bta:281105 CTD:1508
            eggNOG:NOG315657 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 InParanoid:P07688 KO:K01363 OMA:GYPSGAW
            OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 BindingDB:P07688
            ChEMBL:CHEMBL2323 EvolutionaryTrace:P07688 NextBio:20805177
            ArrayExpress:P07688 GO:GO:0097067 PANTHER:PTHR12411:SF16
            Uniprot:P07688
        Length = 335

 Score = 166 (63.5 bits), Expect = 4.9e-19, Sum P(2) = 4.9e-19
 Identities = 38/95 (40%), Positives = 52/95 (54%)

Query:   245 SNNEAALMKAV-ANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD-HGVTAVGYGTADDG 302
             +NNE  +M  +  N PV  A     SDF  Y SGV+    G  +  H +  +G+G  ++G
Sbjct:   233 ANNEKEIMAEIYKNGPVEGAFSVY-SDFLLYKSGVYQHVSGEIMGGHAIRILGWGV-ENG 290

Query:   303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
             T YWLV NSW T WG+NG+ ++ R  D     CGI
Sbjct:   291 TPYWLVGNSWNTDWGDNGFFKILRGQDH----CGI 321

 Score = 125 (49.1 bits), Expect = 4.9e-19, Sum P(2) = 4.9e-19
 Identities = 32/96 (33%), Positives = 45/96 (46%)

Query:   128 VPASID----WRKKGAVTGVKDQGQCGCCWAFSAVAAMEG--INHITTRKLTSLSEQELV 181
             +P S D    W     +  ++DQG CG CWAF AV A+      H   R    +S ++++
Sbjct:    80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 139

Query:   182 DCDTSGE-DQGCEGGLMDDAFEFIISNKGLATEAKY 216
              C   GE   GC GG    A+ F  + KGL +   Y
Sbjct:   140 TC-CGGECGDGCNGGFPSGAWNFW-TKKGLVSGGLY 173

 Score = 42 (19.8 bits), Expect = 2.0e-10, Sum P(2) = 2.0e-10
 Identities = 8/18 (44%), Positives = 9/18 (50%)

Query:   134 WRKKGAVTGVKDQGQCGC 151
             W KKG V+G       GC
Sbjct:   162 WTKKGLVSGGLYNSHVGC 179


>UNIPROTKB|A5GFX7
            symbol:CTSZ "Cathepsin Z" species:9823 "Sus scrofa"
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GO:GO:0060441 GeneTree:ENSGT00560000076599 MEROPS:C01.013 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN
            OMA:QCGTCTE EMBL:CR956646 RefSeq:NP_001116576.1 UniGene:Ssc.16769
            ProteinModelPortal:A5GFX7 SMR:A5GFX7 STRING:A5GFX7
            Ensembl:ENSSSCT00000008249 GeneID:100141405 KEGG:ssc:100141405
            ArrayExpress:A5GFX7 Uniprot:A5GFX7
        Length = 304

 Score = 228 (85.3 bits), Expect = 5.5e-19, P = 5.5e-19
 Identities = 69/224 (30%), Positives = 109/224 (48%)

Query:   126 ASVPASIDWRKKGAV---TGVKDQG---QCGCCWAFSAVAAMEGINHITTRKL---TSLS 176
             + +P S DWR    V   +  ++Q     CG CWA  + +AM    +I  +     T LS
Sbjct:    61 SDLPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLS 120

Query:   177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNK-GLATEAKYPYKASDGSCNK-------- 227
              Q ++DC  +G    CEGG  DD   +  +++ G+  E    Y+A D  C+K        
Sbjct:   121 VQHVIDCGNAGS---CEGG--DDLPVWAYAHRHGIPDETCNNYQAKDQVCDKFNQCGTCT 175

Query:   228 --KEA----NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTG 281
               KE     N +  K+  Y  V S  E  + +  AN P+S  I A+      Y+ G++  
Sbjct:   176 EFKECHVIQNYTLWKVGDYGSV-SGREKMMAEIYANGPISCGIMAT-EKMSNYTGGIYAE 233

Query:   282 -QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
              +    ++H V+  G+G +  GT+YW+V+NSWG  WGE G++R+
Sbjct:   234 YKDQAYINHIVSVAGWGVSG-GTEYWIVRNSWGEPWGERGWMRI 276


>WB|WBGene00000785
            symbol:cpr-5 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39896 EMBL:L39927 EMBL:FO081739
            PIR:T37277 RefSeq:NP_503383.1 UniGene:Cel.19730
            ProteinModelPortal:P43509 SMR:P43509 DIP:DIP-25329N IntAct:P43509
            MINT:MINT-1051285 STRING:P43509 MEROPS:C01.A35 PaxDb:P43509
            EnsemblMetazoa:W07B8.5 GeneID:178612 KEGG:cel:CELE_W07B8.5
            UCSC:W07B8.5.1 CTD:178612 WormBase:W07B8.5 InParanoid:P43509
            OMA:DAIPDHF NextBio:901840 Uniprot:P43509
        Length = 344

 Score = 166 (63.5 bits), Expect = 1.5e-18, Sum P(2) = 1.5e-18
 Identities = 36/88 (40%), Positives = 47/88 (53%)

Query:   255 VANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD-HGVTAVGYGTADDGTKYWLVKNSWG 313
             + N P+ VA      DF  Y++GV+    G  L  H V  +G+G  D+GT YWLV NSW 
Sbjct:   252 LTNGPIEVAFTVY-EDFYQYTTGVYVHTAGASLGGHAVKILGWGV-DNGTPYWLVANSWN 309

Query:   314 TTWGENGYIRMQRDIDAKEGLCGIAMQA 341
               WGE GY R+ R ++     CGI   A
Sbjct:   310 VAWGEKGYFRIIRGLNE----CGIEHSA 333

 Score = 121 (47.7 bits), Expect = 1.5e-18, Sum P(2) = 1.5e-18
 Identities = 31/110 (28%), Positives = 49/110 (44%)

Query:   134 WRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKL--TSLSEQELVDCDTS--GED 189
             W    ++  ++DQ  CG CWAF+A  A+     I +     T LS ++L+ C T      
Sbjct:    92 WPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLLSCCTGMFSCG 151

Query:   190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
              GCEGG    A+++ + + GL T   Y    +   C      P    ++G
Sbjct:   152 NGCEGGYPIQAWKWWVKH-GLVTGGSYE---TQFGCKPYSIAPCGETVNG 197


>UNIPROTKB|P05689
            symbol:CTSZ "Cathepsin Z" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:BC122603
            EMBL:X01809 IPI:IPI00708474 PIR:A29172 RefSeq:NP_001071303.1
            UniGene:Bt.4902 ProteinModelPortal:P05689 SMR:P05689 MEROPS:C01.013
            PRIDE:P05689 GeneID:404187 KEGG:bta:404187 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 InParanoid:P05689 KO:K08568
            OrthoDB:EOG42Z4QN BRENDA:3.4.18.1 NextBio:20817615 Uniprot:P05689
        Length = 304

 Score = 222 (83.2 bits), Expect = 8.9e-18, P = 8.9e-18
 Identities = 66/223 (29%), Positives = 106/223 (47%)

Query:   126 ASVPASIDWRKKGAV---TGVKDQG---QCGCCWAFSAVAAMEGINHITTRKL---TSLS 176
             + +P S DWR    V   +  ++Q     CG CWA  + +AM    +I  +     T LS
Sbjct:    61 SDLPKSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLS 120

Query:   177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNK--------- 227
              Q ++DC  +G    CEGG     +E+     G+  E    Y+A D  C+K         
Sbjct:   121 VQHVIDCGDAGS---CEGGNDLPVWEYA-HRHGIPDETCNNYQAKDQECDKFNQCGTCTE 176

Query:   228 -KEA----NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQ 282
              KE     N +  K+  Y  + S  E  + +   N P+S  I A+      Y+ G+++  
Sbjct:   177 FKECHVIKNYTLWKVGDYGSL-SGREKMMAEIYTNGPISCGIMAT-EKMSNYTGGIYSEY 234

Query:   283 CGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
                  ++H V+  G+G +D G +YW+V+NSWG  WGE+G++R+
Sbjct:   235 NDQAFINHIVSVAGWGVSD-GMEYWIVRNSWGEPWGEHGWMRI 276


>DICTYBASE|DDB_G0286055
            symbol:DDB_G0286055 "peptidase C1A family protein"
            species:44689 "Dictyostelium discoideum" [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 dictyBase:DDB_G0286055 Pfam:PF00188 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411 EMBL:AAFI02000085
            Gene3D:3.40.33.10 InterPro:IPR001283 InterPro:IPR014044
            PRINTS:PR00837 SMART:SM00198 SUPFAM:SSF55797
            ProtClustDB:CLSZ2429919 RefSeq:XP_637918.1
            ProteinModelPortal:Q54MB6 EnsemblProtists:DDB0186794 GeneID:8625429
            KEGG:ddi:DDB_G0286055 InParanoid:Q54MB6 OMA:GENGFAR Uniprot:Q54MB6
        Length = 435

 Score = 234 (87.4 bits), Expect = 1.2e-17, P = 1.2e-17
 Identities = 73/239 (30%), Positives = 112/239 (46%)

Query:   126 ASVPA--SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
             ASVP   S DWR  G V   KD   C   WAF+A    E  + + TR     S Q+L+DC
Sbjct:   204 ASVPTDGSFDWRDNGVVGFPKDSSNCASGWAFTAAGIFESRSAMRTRHRYDYSAQQLIDC 263

Query:   184 D----------TSGEDQGCE--GGLMDDAFEFIISNKGLATEAKYPYK-ASDGSCNKKEA 230
                        + G    C    G ++ A  +  +  GL   + YPY  AS   C+  ++
Sbjct:   264 INVCIIIFSNFSIGNYTKCSRFSGELNKALMYAQAY-GLQATSTYPYVGASSIGCSYNQS 322

Query:   231 NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTEL--- 287
             +  A +    E      ++ + K     PV V I  + ++F +Y+ G+F  +C   L   
Sbjct:   323 S-IAVEGGDVEYSQVGRDSIVEKCRKQGPVGVGIYVT-NEFLYYAGGIF--ECNNTLIDN 378

Query:   288 ---DHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASY 343
                +H V  VGY   D+   Y+++KN++G TWGENG+ R+  D++ K+  C IA   +Y
Sbjct:   379 ANINHNVLLVGYNEKDN---YYIIKNNFGRTWGENGFARITADVN-KD--CLIAKNPAY 431


>UNIPROTKB|A1E295
            symbol:CTSB "Cathepsin B" species:9823 "Sus scrofa"
            [GO:0005615 "extracellular space" evidence=IDA] [GO:0042470
            "melanosome" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730
            "nucleolus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0042470
            GO:GO:0005615 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 MEROPS:C01.060 CTD:1508
            GeneTree:ENSGT00560000076599 HOVERGEN:HBG003480 KO:K01363
            OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16 EMBL:EF095956
            RefSeq:NP_001090927.1 UniGene:Ssc.53773 ProteinModelPortal:A1E295
            SMR:A1E295 PRIDE:A1E295 Ensembl:ENSSSCT00000026923 GeneID:100037961
            KEGG:ssc:100037961 Uniprot:A1E295
        Length = 335

 Score = 159 (61.0 bits), Expect = 1.2e-17, Sum P(2) = 1.2e-17
 Identities = 38/95 (40%), Positives = 51/95 (53%)

Query:   245 SNNEAALMKAV-ANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD-HGVTAVGYGTADDG 302
             S NE  +M  +  N PV  A     SDF  Y SGV+    G  +  H +  +G+G  ++G
Sbjct:   233 SRNEKEIMAEIYKNGPVEGAFTVY-SDFLQYKSGVYQHVTGDLMGGHAIRILGWGV-ENG 290

Query:   303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
             T YWLV NSW T WG+NG+ ++ R  D     CGI
Sbjct:   291 TPYWLVGNSWNTDWGDNGFFKILRGQDH----CGI 321

 Score = 120 (47.3 bits), Expect = 1.2e-17, Sum P(2) = 1.2e-17
 Identities = 36/120 (30%), Positives = 53/120 (44%)

Query:   128 VPASID----WRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITT--RKLTSLSEQELV 181
             +P S D    W     +  ++DQG CG CWAF AV A+     I +  R    +S ++++
Sbjct:    80 LPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDML 139

Query:   182 DC--DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
              C  D  G+  GC GG    A+ F  + KGL +   Y    S   C      P    ++G
Sbjct:   140 TCCGDECGD--GCNGGFPSGAWNFW-TKKGLVSGGLYD---SHVGCRPYSIPPCEHHVNG 193


>UNIPROTKB|F1MW68
            symbol:CTSZ "Cathepsin Z" species:9913 "Bos taurus"
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0060441
            GeneTree:ENSGT00560000076599 IPI:IPI00708474 UniGene:Bt.4902
            OMA:QCGTCTE EMBL:DAAA02036315 PRIDE:F1MW68
            Ensembl:ENSBTAT00000025007 Uniprot:F1MW68
        Length = 304

 Score = 220 (82.5 bits), Expect = 2.5e-17, P = 2.5e-17
 Identities = 66/223 (29%), Positives = 106/223 (47%)

Query:   126 ASVPASIDWRKKGAV---TGVKDQG---QCGCCWAFSAVAAMEGINHITTRKL---TSLS 176
             + +P S DWR    V   +  ++Q     CG CWA  + +AM    +I  +     T LS
Sbjct:    61 SDLPKSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLS 120

Query:   177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNK--------- 227
              Q ++DC  +G    CEGG     +E+     G+  E    Y+A D  C+K         
Sbjct:   121 VQHVLDCGDAGS---CEGGNDLPVWEYA-HRHGIPDETCNNYQAKDQECDKFNQCGTCTE 176

Query:   228 -KEA----NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQ 282
              KE     N +  K+  Y  + S  E  + +   N P+S  I A+      Y+ G+++  
Sbjct:   177 FKECHVIKNYTLWKVGDYGSL-SGREKMMAEIYTNGPISCGIMAT-EKMSNYTGGIYSEY 234

Query:   283 CGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
                  ++H V+  G+G +D G +YW+V+NSWG  WGE+G++R+
Sbjct:   235 NDQAFINHIVSVAGWGVSD-GMEYWIVRNSWGEPWGEHGWMRI 276


>UNIPROTKB|F1N9D7
            symbol:CTSB "Cathepsin B" species:9031 "Gallus gallus"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005730 "nucleolus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005739 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0005764
            GO:GO:0004197 GeneTree:ENSGT00560000076599 OMA:GYPSGAW
            GO:GO:0097067 PANTHER:PTHR12411:SF16 IPI:IPI00573387
            EMBL:AADN02018292 Ensembl:ENSGALT00000026896
            Ensembl:ENSGALT00000036723 Uniprot:F1N9D7
        Length = 340

 Score = 157 (60.3 bits), Expect = 4.8e-17, Sum P(2) = 4.8e-17
 Identities = 36/102 (35%), Positives = 53/102 (51%)

Query:   237 ISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD-HGVTAVG 295
             I+ Y  VP + +  + +   N PV  A      DF  Y SGV+    G ++  H +  +G
Sbjct:   228 ITSY-GVPRSEKEIMAEIYKNGPVEGAFIVY-EDFLMYKSGVYQHVSGEQVGGHAIRILG 285

Query:   296 YGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
             +G  ++GT YWL  NSW T WG+NG+ ++ R  D     CGI
Sbjct:   286 WGV-ENGTPYWLAANSWNTDWGDNGFFKILRGEDH----CGI 322

 Score = 117 (46.2 bits), Expect = 4.8e-17, Sum P(2) = 4.8e-17
 Identities = 25/85 (29%), Positives = 40/85 (47%)

Query:   134 WRKKGAVTGVKDQGQCGCCWAFSAVAAMEG--INHITTRKLTSLSEQELVDCDTSGEDQG 191
             W     ++ ++DQG CG CWAF AV A+      H   +    +S ++L+ C       G
Sbjct:    90 WPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLLSCCGFECGMG 149

Query:   192 CEGGLMDDAFEFIISNKGLATEAKY 216
             C GG    A+ +  + +GL +   Y
Sbjct:   150 CNGGYPSGAWRYW-TERGLVSGGLY 173


>UNIPROTKB|E1C4M3
            symbol:CTSZ "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0060441 "epithelial tube branching
            involved in lung morphogenesis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GeneTree:ENSGT00560000076599 CTD:1522 KO:K08568 OMA:QCGTCTE
            EMBL:AADN02019004 IPI:IPI00596430 RefSeq:XP_417483.3
            Ensembl:ENSGALT00000012067 GeneID:419311 KEGG:gga:419311
            Uniprot:E1C4M3
        Length = 305

 Score = 219 (82.2 bits), Expect = 5.4e-17, P = 5.4e-17
 Identities = 72/244 (29%), Positives = 113/244 (46%)

Query:   106 RRLPSVRSSETTDVSFRY-ENASVPASIDWRKKGAV---TGVKDQG---QCGCCWAFSAV 158
             RR P +R   T      Y + A +P S DWR    V   +  ++Q     CG CWA  + 
Sbjct:    43 RRAPGLR---TYPRPHEYLDMAELPQSWDWRNVNGVNYASTTRNQHIPQYCGSCWAHGST 99

Query:   159 AAM-EGINHITTRKLTS--LSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
             +A+ + IN        S  LS Q ++DC  +G    CEGG     + +   + G+  E  
Sbjct:   100 SALADRINIKRKGAWPSAYLSVQNVIDCANAGS---CEGGDHTGVWMYA-HDHGIPDETC 155

Query:   216 YPYKASDGSCNKKEA--------------NPSAAKISGYEDVPSNNEAALMKAVANQPVS 261
               Y+A +  C K                 N +  K++ Y  V S  E  + +  AN P+S
Sbjct:   156 NNYQAKNQKCKKFNQCGTCVTFGECHVIKNYTLWKVADYGAV-SGREKMMAEIYANGPIS 214

Query:   262 VAIDASGSDFQFYSSGVFTG-QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENG 320
               I A+      Y+ G++T       ++H V+  G+G  ++GT+YW+V+NSWG  WGE G
Sbjct:   215 CGIMAT-EKLDAYTGGLYTEYNPSPTVNHIVSVAGWGV-ENGTEYWIVRNSWGEPWGERG 272

Query:   321 YIRM 324
             ++R+
Sbjct:   273 WLRI 276


>DICTYBASE|DDB_G0283401
            symbol:ctsZ "cathepsin Z precursor" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            dictyBase:DDB_G0283401 GO:GO:0005615 GenomeReviews:CM000153_GR
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 EMBL:AAFI02000055 KO:K08568 OMA:QCGTCTE
            eggNOG:NOG275763 RefSeq:XP_639036.1 ProteinModelPortal:Q54R55
            IntAct:Q54R55 MEROPS:C01.A60 PRIDE:Q54R55
            EnsemblProtists:DDB0233836 GeneID:8624061 KEGG:ddi:DDB_G0283401
            InParanoid:Q54R55 Uniprot:Q54R55
        Length = 296

 Score = 215 (80.7 bits), Expect = 7.9e-17, P = 7.9e-17
 Identities = 58/194 (29%), Positives = 97/194 (50%)

Query:   149 CGCCWAFSAVAAMEGINHITTRKL---TSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
             CG CWAF++ +++     I  +      +++ Q L+DC+  G    C+GG   DAF FI 
Sbjct:    85 CGGCWAFASTSSISDRIKIQRKAAFPDVNVAPQHLIDCNGGGT---CDGGDPGDAFAFIN 141

Query:   206 SNKGLATEAKYPYKASD--GSCNK--KEANPSAA----------KISGYEDVPSNNEAAL 251
              N G+  E   PY+A +    C+   K  NP              ++ Y  V    +  +
Sbjct:   142 EN-GIVDETCKPYQAKNLPDECSPACKTCNPDGTCQAIPVHTNITVTEYGSVRGAKDM-M 199

Query:   252 MKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTEL-DHGVTAVGYGTADDGTKYWLVKN 310
              +  A  P++ +IDA+ S  + Y+SG+F       L +H ++ +G+G   D T YW+V+N
Sbjct:   200 AEIYARGPIACSIDAT-SKLEAYTSGIFKEFKLDPLPNHIISVIGWGV-QDSTPYWIVRN 257

Query:   311 SWGTTWGENGYIRM 324
             SWG+ +GE G+  +
Sbjct:   258 SWGSYYGEGGFFNI 271

 Score = 131 (51.2 bits), Expect = 5.8e-06, P = 5.8e-06
 Identities = 39/121 (32%), Positives = 58/121 (47%)

Query:   125 NASVPASIDWRKKGAVTGVK-DQGQ-----CGCCWAFSAVAAMEGINHITTRKL---TSL 175
             N  VP S DWR    V  +  ++ Q     CG CWAF++ +++     I  +      ++
Sbjct:    55 NLEVPQSWDWRNVSGVNYLTMNRNQHIPQYCGGCWAFASTSSISDRIKIQRKAAFPDVNV 114

Query:   176 SEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASD--GSCNK--KEAN 231
             + Q L+DC+  G    C+GG   DAF FI  N G+  E   PY+A +    C+   K  N
Sbjct:   115 APQHLIDCNGGGT---CDGGDPGDAFAFINEN-GIVDETCKPYQAKNLPDECSPACKTCN 170

Query:   232 P 232
             P
Sbjct:   171 P 171


>WB|WBGene00000789
            symbol:cpz-2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GeneTree:ENSGT00560000076599 KO:K08568 EMBL:Z81103
            HSSP:P80067 PIR:T23720 RefSeq:NP_506318.1 ProteinModelPortal:P92005
            SMR:P92005 STRING:P92005 MEROPS:C01.A41 PaxDb:P92005
            EnsemblMetazoa:M04G12.2 GeneID:179818 KEGG:cel:CELE_M04G12.2
            UCSC:M04G12.2 CTD:179818 WormBase:M04G12.2 eggNOG:NOG275763
            InParanoid:P92005 OMA:VEYWIAR NextBio:906990 Uniprot:P92005
        Length = 467

 Score = 226 (84.6 bits), Expect = 1.6e-16, P = 1.6e-16
 Identities = 53/193 (27%), Positives = 93/193 (48%)

Query:   149 CGCCWAFSAVAAMEGINHITTR---KLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
             CG CW F    A+    ++  +    +T LS QE++DC+  G    C+GG + +  E   
Sbjct:   248 CGSCWVFGTTGALNDRFNVARKGRWPMTQLSPQEIIDCNGKGN---CQGGEIGNVLEHA- 303

Query:   206 SNKGLATEAKYPYKASDGSCNKKE-------------ANPSAAKISGYEDVPSNNEAALM 252
               +GL  E    Y+A++G CN                 N +   +  Y  V   ++  + 
Sbjct:   304 KIQGLVEEGCNVYRATNGECNPYHRCGSCWPNECFSLTNYTRYYVKDYGQVQGRDKI-MS 362

Query:   253 KAVANQPVSVAIDASGSDFQF-YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNS 311
             +     P++ AI A+   F++ Y  GV++ +   E +H ++  G+G  ++G +YW+ +NS
Sbjct:   363 EIKKGGPIACAIGAT-KKFEYEYVKGVYSEKSDLESNHIISLTGWGVDENGVEYWIARNS 421

Query:   312 WGTTWGENGYIRM 324
             WG  WGE G+ R+
Sbjct:   422 WGEAWGELGWFRV 434


>ZFIN|ZDB-GENE-060503-240
            symbol:tinagl1 "tubulointerstitial nephritis
            antigen-like 1" species:7955 "Danio rerio" [GO:0005044 "scavenger
            receptor activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0030414 "peptidase inhibitor activity" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0002040 "sprouting
            angiogenesis" evidence=IMP] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR008037 InterPro:IPR013128 Pfam:PF00112 Pfam:PF05375
            PROSITE:PS00524 PROSITE:PS50958 SMART:SM00201 SMART:SM00645
            ZFIN:ZDB-GENE-060503-240 GO:GO:0006955 GO:GO:0030247 GO:GO:0030414
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 GO:GO:0002040
            GO:GO:0005044 GeneTree:ENSGT00560000076599 GO:GO:0010466
            SUPFAM:SSF57283 HOVERGEN:HBG053961 MEROPS:C01.975 OMA:DNCNRCT
            EMBL:BX950864 IPI:IPI00609339 UniGene:Dr.103937
            Ensembl:ENSDART00000087096 Ensembl:ENSDART00000126228
            InParanoid:Q1LUC6 Uniprot:Q1LUC6
        Length = 471

 Score = 142 (55.0 bits), Expect = 1.8e-16, Sum P(2) = 1.8e-16
 Identities = 44/144 (30%), Positives = 71/144 (49%)

Query:    87 NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPA---SID-WRKKGAVTG 142
             ++F   T +E    R G KR   ++ +     ++    N  +P+   ++D W   G +  
Sbjct:   160 SQFWGMTLDEGLRFRLGTKRPTRTIMNMNEMQMNMN-GNDHLPSYFNAVDKW--PGKIHE 216

Query:   143 VKDQGQCGCCWAFSAVA-AMEGINHITTRKLT-SLSEQELVDCDTSGEDQGCEGGLMDDA 200
               DQG C   WAFS  A A + I+  +   +T  LS Q L+ CDT  +D GC GG +D A
Sbjct:   217 PLDQGNCNASWAFSTAAVASDRISIQSMGHMTPQLSPQNLISCDTRHQD-GCAGGRIDGA 275

Query:   201 FEFIISNKGLATEAKYPYKASDGS 224
             + F+   +G+ T+  YP+   + S
Sbjct:   276 WWFM-RRRGVVTQDCYPFSPPEQS 298

 Score = 134 (52.2 bits), Expect = 1.8e-16, Sum P(2) = 1.8e-16
 Identities = 34/99 (34%), Positives = 46/99 (46%)

Query:   245 SNNEAALMKAVA-NQPVSVAIDASGSDFQFYSSGVFT---------GQCGTELDHGVTAV 294
             S NE  +MK +  N PV   ++    DF  Y SG+F           Q      H V   
Sbjct:   343 STNENEIMKEIMDNGPVQAIMEVH-EDFFVYKSGIFRHTDVNYHKPSQYRKHATHSVRIT 401

Query:   295 GYGTADDGT----KYWLVKNSWGTTWGENGYIRMQRDID 329
             G+G   D +    KYW+  NSWG  WGE+GY R+ R ++
Sbjct:   402 GWGEERDYSGRTRKYWIGANSWGKNWGEDGYFRIARGVN 440


>UNIPROTKB|P43233
            symbol:CTSB "Cathepsin B" species:9031 "Gallus gallus"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 MEROPS:C01.060 CTD:1508
            HOGENOM:HOG000241341 HOVERGEN:HBG003480 KO:K01363 OrthoDB:EOG4K6G4C
            PANTHER:PTHR12411:SF16 EMBL:U18083 IPI:IPI00573387 PIR:S58770
            RefSeq:NP_990702.1 UniGene:Gga.3854 ProteinModelPortal:P43233
            SMR:P43233 STRING:P43233 PRIDE:P43233 GeneID:396329 KEGG:gga:396329
            InParanoid:P43233 NextBio:20816377 Uniprot:P43233
        Length = 340

 Score = 146 (56.5 bits), Expect = 2.4e-16, Sum P(2) = 2.4e-16
 Identities = 35/102 (34%), Positives = 51/102 (50%)

Query:   237 ISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD-HGVTAVG 295
             I+ Y  VP + +  + +   N PV  A      DF  Y SGV+    G ++  H +  +G
Sbjct:   228 ITSY-GVPRSEKEIMAEIYKNGPVEGAFIVY-EDFLMYKSGVYQHVSGEQVGGHAIRILG 285

Query:   296 YGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
             +G  ++GT YWL  NSW T WG  G+ ++ R  D     CGI
Sbjct:   286 WGV-ENGTPYWLAANSWNTDWGITGFFKILRGEDH----CGI 322

 Score = 123 (48.4 bits), Expect = 2.4e-16, Sum P(2) = 2.4e-16
 Identities = 29/99 (29%), Positives = 48/99 (48%)

Query:   124 ENASVPASIDWRKKG----AVTGVKDQGQCGCCWAFSAVAAMEG--INHITTRKLTSLSE 177
             E+  +P + D RK+      ++ ++DQG CG CWAF AV A+      H   +    +S 
Sbjct:    76 EDMDLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSA 135

Query:   178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
             ++L+ C       GC GG    A+ +  + +GL +   Y
Sbjct:   136 EDLLSCCGFECGMGCNGGYPSGAWRYW-TERGLVSGGLY 173


>WB|WBGene00000788
            symbol:cpz-1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] [GO:0009792 "embryo development ending in
            birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0010171 "body morphogenesis" evidence=IMP]
            [GO:0018996 "molting cycle, collagen and cuticulin-based cuticle"
            evidence=IMP] [GO:0031012 "extracellular matrix" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0009792 GO:GO:0040007 GO:GO:0002119
            GO:GO:0018996 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0010171 GO:GO:0031012
            GeneTree:ENSGT00560000076599 KO:K08568 OMA:QCGTCTE EMBL:FO081275
            EMBL:BK001409 PIR:T29872 RefSeq:NP_491023.2 HSSP:Q9UBR2
            ProteinModelPortal:G5EGP8 SMR:G5EGP8 IntAct:G5EGP8 MEROPS:C01.A38
            EnsemblMetazoa:F32B5.8 GeneID:171829 KEGG:cel:CELE_F32B5.8
            CTD:171829 WormBase:F32B5.8 NextBio:872879 Uniprot:G5EGP8
        Length = 306

 Score = 215 (80.7 bits), Expect = 2.8e-16, P = 2.8e-16
 Identities = 71/236 (30%), Positives = 114/236 (48%)

Query:   112 RSSETTDVSFRYENASVPASIDWRKKGAVTGVK-DQGQ-----CGCCWAFSAVAAMEGIN 165
             R  ET D    +++  +P + DWR    +     D+ Q     CG CWAF A +A+    
Sbjct:    53 RIYETED----FDSEDLPKTWDWRDANGINYASADRNQHIPQYCGSCWAFGATSALADRI 108

Query:   166 HITTRKL---TSLSEQELVDCDTSGED-QGCE-GGLMDDAFEFIIS----NKGLATEAKY 216
             +I  +       LS QE++DC  +G    G E GG+   A E  I     N   A + K 
Sbjct:   109 NIKRKNAWPQAYLSVQEVIDCSGAGTCVMGGEPGGVYKYAHEHGIPHETCNNYQARDGKC 168

Query:   217 -PYKASDGSCNKKEA----NPSAAKISGYEDVPSNNEAALMKA-VANQ-PVSVAIDASGS 269
              PY    GSC   E     N +  K+S Y  V    +   MKA + ++ P++  I A+ +
Sbjct:   169 DPYNRC-GSCWPGECFSIKNYTLYKVSEYGTVHGYEK---MKAEIYHKGPIACGIAATKA 224

Query:   270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTA-DDGTKYWLVKNSWGTTWGENGYIRM 324
              F+ Y+ G++      ++DH ++  G+G   + G +YW+ +NSWG  WGE+G+ ++
Sbjct:   225 -FETYAGGIYKEVTDEDIDHIISVHGWGVDHESGVEYWIGRNSWGEPWGEHGWFKI 279


>FB|FBgn0030521
            symbol:CtsB1 "Cathepsin B1" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS] [GO:0035071 "salivary gland cell autophagic cell
            death" evidence=IEP] [GO:0048102 "autophagic cell death"
            evidence=IEP] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:AE014298 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0035071
            GO:GO:0004197 MEROPS:C01.060 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 KO:K01363 PANTHER:PTHR12411:SF16
            HSSP:P07688 EMBL:AY060640 RefSeq:NP_572920.1 UniGene:Dm.3926
            SMR:Q9VY87 IntAct:Q9VY87 MINT:MINT-932864 STRING:Q9VY87
            EnsemblMetazoa:FBtr0073838 GeneID:32341 KEGG:dme:Dmel_CG10992
            UCSC:CG10992-RA FlyBase:FBgn0030521 InParanoid:Q9VY87 OMA:TEGHIRR
            OrthoDB:EOG48W9HM ChiTaRS:CG10992 GenomeRNAi:32341 NextBio:778020
            Uniprot:Q9VY87
        Length = 340

 Score = 134 (52.2 bits), Expect = 6.0e-16, Sum P(2) = 6.0e-16
 Identities = 31/90 (34%), Positives = 44/90 (48%)

Query:   255 VANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD-HGVTAVGYGT-ADDGTKYWLVKNSW 312
             + N PV  A      D   Y  GV+  + G EL  H +  +G+G   ++   YWL+ NSW
Sbjct:   250 MTNGPVEGAFTVY-EDLILYKDGVYQHEHGKELGGHAIRILGWGVWGEEKIPYWLIGNSW 308

Query:   313 GTTWGENGYIRMQRDIDAKEGLCGIAMQAS 342
              T WG++G+ R+ R  D     CGI    S
Sbjct:   309 NTDWGDHGFFRILRGQDH----CGIESSIS 334

 Score = 133 (51.9 bits), Expect = 6.0e-16, Sum P(2) = 6.0e-16
 Identities = 55/185 (29%), Positives = 80/185 (43%)

Query:    68 EYIASFNNKARNKPYKLGINEFADQTNEEFR----APRNGYKRRLPSVRSSETTDVSFRY 123
             E+I    +KA  K + +G N  A  T    R       + +K  LP  R     D+   Y
Sbjct:    27 EFIEVVRSKA--KTWTVGRNFDASVTEGHIRRLMGVHPDAHKFALPDKREV-LGDL---Y 80

Query:   124 ENA--SVPASIDWRKKG----AVTGVKDQGQCGCCWAFSAVAAMEG--INHITTRKLTSL 175
              N+   +P   D RK+      +  ++DQG CG CWAF AV AM      H   +     
Sbjct:    81 VNSVDELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHF 140

Query:   176 SEQELVDC-DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSA 234
             S  +LV C  T G   GC GG    A+ +  + KG+ +    PY ++ G C   E +P  
Sbjct:   141 SADDLVSCCHTCGF--GCNGGFPGAAWSYW-TRKGIVSGG--PYGSNQG-CRPYEISPCE 194

Query:   235 AKISG 239
               ++G
Sbjct:   195 HHVNG 199


>WB|WBGene00022026
            symbol:Y65B4A.2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 SMART:SM00645 GO:GO:0008234
            GO:GO:0006508 PANTHER:PTHR12411 GeneTree:ENSGT00560000076599
            PANTHER:PTHR12411:SF16 HSSP:P07688 EMBL:FO081482 RefSeq:NP_490763.1
            ProteinModelPortal:Q9BL59 MEROPS:C01.A46 PaxDb:Q9BL59
            EnsemblMetazoa:Y65B4A.2.1 EnsemblMetazoa:Y65B4A.2.2 GeneID:171655
            KEGG:cel:CELE_Y65B4A.2 UCSC:Y65B4A.2 CTD:171655 WormBase:Y65B4A.2
            eggNOG:NOG311760 HOGENOM:HOG000017674 InParanoid:Q9BL59 OMA:DRIVYWH
            NextBio:872169 Uniprot:Q9BL59
        Length = 421

 Score = 138 (53.6 bits), Expect = 1.2e-15, Sum P(2) = 1.2e-15
 Identities = 32/82 (39%), Positives = 42/82 (51%)

Query:   259 PVSVAIDASGSDFQFYSSGVFTGQCGTELD------HGVTAVGYGTADDGTKYWLVKNSW 312
             P ++A      +F  YSSGVF        D      H V  +G+G +DDGT YWL  NS+
Sbjct:   334 PTTMAFPVP-EEFLHYSSGVFRPYPTDGFDDRIVYWHVVRLIGWGESDDGTHYWLAVNSF 392

Query:   313 GTTWGENGYIRMQRDIDAKEGL 334
             G  WG+NG  ++  D   K GL
Sbjct:   393 GNHWGDNGLFKINTDDMEKYGL 414

 Score = 129 (50.5 bits), Expect = 1.2e-15, Sum P(2) = 1.2e-15
 Identities = 57/223 (25%), Positives = 101/223 (45%)

Query:    10 LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
             L+L A+L L   +   + R + DA     ++ ++ +  R   D+ E   + K  K  V+ 
Sbjct:    34 LLLLAVLGLVYGSFYLYRRYVTDANDKRDNDEYLRKLVRQVNDSPETTWKAKFNKFGVKN 93

Query:    70 IASFNNK-ARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
               S+  K  RN+     + E+ +Q  + F +  +  KR L  + +  ++DV   ++    
Sbjct:    94 -RSYGFKYTRNQT---AVEEYVEQIRKFFES--DAMKRHLDELENFNSSDVPKNFD---- 143

Query:   129 PASIDWRKKGAVTGVKDQGQCGCCWAFSA--VAAMEGINHITTRKLTSLSEQELVDC-DT 185
              A   W    +++ V +QG CG C+A +A  VA+     H      + LSE++++ C   
Sbjct:   144 -ARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGTFKSLLSEEDIIGCCSV 202

Query:   186 SGEDQGCEGGLMDDAFEFIISNKGLATEAK---YPYKASDGSC 225
              G    C GG    A  + + N+GL T  +    PY + D SC
Sbjct:   203 CGN---CYGGDPLKALTYWV-NQGLVTGGRDGCRPY-SFDLSC 240


>UNIPROTKB|E1BTI7
            symbol:TINAG "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0005044 "scavenger receptor activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0006955 "immune
            response" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0030247 "polysaccharide binding"
            evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
            [GO:0007155 "cell adhesion" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GO:GO:0007155 GO:GO:0005604 GO:GO:0005044
            GeneTree:ENSGT00560000076599 CTD:27283 OMA:WGQLTSS
            EMBL:AADN02002720 EMBL:AADN02002721 IPI:IPI00581566
            RefSeq:XP_419905.3 UniGene:Gga.11215 Ensembl:ENSGALT00000026295
            GeneID:421888 KEGG:gga:421888 Uniprot:E1BTI7
        Length = 467

 Score = 132 (51.5 bits), Expect = 6.6e-15, Sum P(2) = 6.6e-15
 Identities = 36/121 (29%), Positives = 59/121 (48%)

Query:   214 AKYPYKASDGSC-NKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQ 272
             ++Y    ++G C N  E +    +   +  V S     + + +A  PV  AI     DF 
Sbjct:   332 SEYGKNHTNGPCPNALEDSNRLYRCGSHYRVSSKETDIMEEIMAKGPVQ-AIMKVYEDFF 390

Query:   273 FYSSGVF--TGQCGTELD-HGVTAVGYGT--ADDGTK--YWLVKNSWGTTWGENGYIRMQ 325
              Y  G++  + + G++   H V  +G+G+    +G K  +W+  NSWG  WGENGY R+ 
Sbjct:   391 LYKEGIYRHSYKAGSKWKTHSVKLLGWGSLPGKNGQKQKFWIAANSWGKYWGENGYFRIL 450

Query:   326 R 326
             R
Sbjct:   451 R 451

 Score = 130 (50.8 bits), Expect = 6.6e-15, Sum P(2) = 6.6e-15
 Identities = 30/75 (40%), Positives = 45/75 (60%)

Query:   145 DQGQCGCCWAFS-AVAAMEGINHITTRKLT-SLSEQELVDCDTSGEDQGCEGGLMDDAFE 202
             DQ  CG  WAFS A  A + I   +  ++T +LS Q L+ CDT G  +GC GG +D A+ 
Sbjct:   241 DQRNCGASWAFSTASVAADRITIHSDGQITDNLSVQNLISCDT-GNQRGCNGGSIDGAWR 299

Query:   203 FIISNKGLATEAKYP 217
             ++ ++ G+ + A YP
Sbjct:   300 YLTTH-GVVSYACYP 313


>WB|WBGene00000782
            symbol:cpr-2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 eggNOG:NOG315657 GeneTree:ENSGT00560000076599
            HOGENOM:HOG000241341 PANTHER:PTHR12411:SF16 EMBL:Z81531
            RefSeq:NP_507186.3 ProteinModelPortal:O45466 SMR:O45466
            MEROPS:C01.A40 PaxDb:O45466 EnsemblMetazoa:F36D3.9 GeneID:185355
            KEGG:cel:CELE_F36D3.9 CTD:185355 WormBase:F36D3.9 OMA:FDARLRW
            Uniprot:O45466
        Length = 326

 Score = 155 (59.6 bits), Expect = 7.9e-15, Sum P(2) = 7.9e-15
 Identities = 40/101 (39%), Positives = 52/101 (51%)

Query:   238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCG-TELDHGVTAVGY 296
             S Y  VP    A       N PV VA      DF+ Y SG++    G ++  H V  +G+
Sbjct:   223 SAYP-VPRTVAAIQADIYYNGPV-VAAFIVYEDFEKYKSGIYRHIAGRSKGGHAVKLIGW 280

Query:   297 GTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
             GT + GT YWL  NSWG+ WGE+G  R+ R +D     CGI
Sbjct:   281 GT-ERGTPYWLAVNSWGSQWGESGTFRILRGVDE----CGI 316

 Score = 97 (39.2 bits), Expect = 7.9e-15, Sum P(2) = 7.9e-15
 Identities = 22/85 (25%), Positives = 40/85 (47%)

Query:   134 WRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTS--LSEQELVDCDTSGEDQG 191
             W +  ++  +++Q  CG CWAFS    +     I +       +S  +L+ C      +G
Sbjct:    93 WPQCKSMKLIREQSNCGSCWAFSTAEVISDRTCIASNGTQQPIISPTDLLTCCGMSCGEG 152

Query:   192 CEGGLMDDAFEFIISNKGLATEAKY 216
             C+GG    AF++  + +G+ T   Y
Sbjct:   153 CDGGFPYRAFQWW-ARRGVVTGGDY 176


>RGD|1359482 [details] [associations]
            symbol:Tinag "tubulointerstitial nephritis antigen"
            species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0005604 "basement membrane"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=IEA] [GO:0006955
            "immune response" evidence=IEA] [GO:0007155 "cell adhesion"
            evidence=ISO] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR001212 InterPro:IPR013128
            Pfam:PF00112 Pfam:PF01033 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 RGD:1359482 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GO:GO:0007155 EMBL:CH473954 GO:GO:0005604
            GO:GO:0005044 MEROPS:C01.973 CTD:27283 eggNOG:NOG310046
            HOGENOM:HOG000241342 HOVERGEN:HBG053961 OMA:WGQLTSS
            OrthoDB:EOG47PX5P EMBL:BC081887 IPI:IPI00370427
            RefSeq:NP_001005549.1 UniGene:Rn.43851 STRING:Q66HF6
            Ensembl:ENSRNOT00000041567 GeneID:300846 KEGG:rno:300846
            UCSC:RGD:1359482 InParanoid:Q66HF6 NextBio:647630
            Genevestigator:Q66HF6 Uniprot:Q66HF6
        Length = 475

 Score = 138 (53.6 bits), Expect = 9.5e-15, Sum P(2) = 9.5e-15
 Identities = 34/102 (33%), Positives = 51/102 (50%)

Query:   245 SNNEAALMKAVA-NQPVSVAIDASGSDFQFYSSGVFTGQCGTELD---------HGVTAV 294
             S+NE  +M+ +  N PV  AI     DF +Y +G++     T  +         H V   
Sbjct:   357 SSNETEIMREIIQNGPVQ-AIMQVHEDFFYYKTGIYRHVVSTNEEPEKYRKLRTHAVKLT 415

Query:   295 GYGT--ADDGTK--YWLVKNSWGTTWGENGYIRMQRDIDAKE 332
             G+GT     G K  +W+  NSWG +WGENGY R+ R ++  +
Sbjct:   416 GWGTLRGAQGKKEKFWIAANSWGKSWGENGYFRILRGVNESD 457

 Score = 122 (48.0 bits), Expect = 9.5e-15, Sum P(2) = 9.5e-15
 Identities = 47/160 (29%), Positives = 66/160 (41%)

Query:   103 GYKRRL----PSVRSSETTDVSFRYENASVP----ASIDWRKKGAVTGVKDQGQCGCCWA 154
             G+K RL    PS       +++  Y  A +P    AS  W   G   G  DQ  C   WA
Sbjct:   187 GFKFRLGTLPPSPMLLSMNEMTASYPRADLPEVFIASYKW--PGWTHGPLDQKNCAASWA 244

Query:   155 FS--AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLAT 212
             FS  +VAA         R   +LS Q L+ C       GC  G +D A+ F+   +GL +
Sbjct:   245 FSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNR-HGCNSGSIDRAWWFL-RKRGLVS 302

Query:   213 EAKYP-YK---ASDGSCNKKEANPSAAKISGYEDVPSNNE 248
              A YP +K    ++ SC     +    K       P++ E
Sbjct:   303 HACYPLFKEQSTNNNSCAMASRSDGRGKRHATRPCPNSFE 342


>UNIPROTKB|Q9UJW2 [details] [associations]
            symbol:TINAG "Tubulointerstitial nephritis antigen"
            species:9606 "Homo sapiens" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0007155 "cell adhesion"
            evidence=IDA] [GO:0005604 "basement membrane" evidence=IDA]
            [GO:0000166 "nucleotide binding" evidence=TAS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR001212 InterPro:IPR013128
            Pfam:PF00112 Pfam:PF01033 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0006955 EMBL:CH471081
            GO:GO:0000166 GO:GO:0030247 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0007155 GO:GO:0005604
            GO:GO:0004197 GO:GO:0005044 EMBL:AL359380 MEROPS:C01.973 CTD:27283
            eggNOG:NOG310046 HOGENOM:HOG000241342 HOVERGEN:HBG053961
            OMA:WGQLTSS EMBL:AB022277 EMBL:AF195116 EMBL:AF195117 EMBL:AK312918
            EMBL:AL589946 IPI:IPI00099386 IPI:IPI00478705 PIR:JC7189
            RefSeq:NP_055279.3 UniGene:Hs.127011 ProteinModelPortal:Q9UJW2
            SMR:Q9UJW2 IntAct:Q9UJW2 STRING:Q9UJW2 PhosphoSite:Q9UJW2
            DMDM:212276468 PRIDE:Q9UJW2 DNASU:27283 Ensembl:ENST00000259782
            GeneID:27283 KEGG:hsa:27283 UCSC:uc003pcj.2 GeneCards:GC06P054220
            H-InvDB:HIX0025004 HGNC:HGNC:14599 HPA:HPA035427 MIM:606749
            neXtProt:NX_Q9UJW2 PharmGKB:PA37905 InParanoid:Q9UJW2
            PhylomeDB:Q9UJW2 GenomeRNAi:27283 NextBio:50212 ArrayExpress:Q9UJW2
            Bgee:Q9UJW2 CleanEx:HS_TINAG Genevestigator:Q9UJW2
            GermOnline:ENSG00000137251 Uniprot:Q9UJW2
        Length = 476

 Score = 138 (53.6 bits), Expect = 9.6e-15, Sum P(2) = 9.6e-15
 Identities = 35/102 (34%), Positives = 50/102 (49%)

Query:   245 SNNEAALMKAVA-NQPVSVAIDASGSDFQFYSSGVFTGQCGTELD---------HGVTAV 294
             S+NE  +MK +  N PV  AI     DF  Y +G++     T  +         H V   
Sbjct:   358 SSNETEIMKEIMQNGPVQ-AIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVKLT 416

Query:   295 GYGT--ADDGTK--YWLVKNSWGTTWGENGYIRMQRDIDAKE 332
             G+GT     G K  +W+  NSWG +WGENGY R+ R ++  +
Sbjct:   417 GWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESD 458

 Score = 122 (48.0 bits), Expect = 9.6e-15, Sum P(2) = 9.6e-15
 Identities = 46/161 (28%), Positives = 70/161 (43%)

Query:   102 NGYKRRLPSVRSSETTDVSFRYENASVPASID--------WRKKGAVTGVKDQGQCGCCW 153
             +G+K RL ++  S    +S     AS+PA+ D        ++  G   G  DQ  C   W
Sbjct:   186 DGFKFRLGTLPPSPML-LSMNEMTASLPATTDLPEFFVASYKWPGWTHGPLDQKNCAASW 244

Query:   154 AFS--AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
             AFS  +VAA         R   +LS Q L+ C       GC  G +D A+ ++   +GL 
Sbjct:   245 AFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNR-HGCNSGSIDRAWWYL-RKRGLV 302

Query:   212 TEAKYP-YK---ASDGSCNKKEANPSAAKISGYEDVPSNNE 248
             + A YP +K   A++  C     +    K    +  P+N E
Sbjct:   303 SHACYPLFKDQNATNNGCAMASRSDGRGKRHATKPCPNNVE 343


>WB|WBGene00013072 [details] [associations]
            symbol:Y51A2D.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            EMBL:AL021497 RefSeq:NP_001256811.1 ProteinModelPortal:O62484
            SMR:O62484 MEROPS:C01.A37 EnsemblMetazoa:Y51A2D.1 GeneID:180204
            KEGG:cel:CELE_Y51A2D.1 UCSC:Y51A2D.1 CTD:180204 WormBase:Y51A2D.1a
            HOGENOM:HOG000019851 NextBio:908416 Uniprot:O62484
        Length = 314

 Score = 134 (52.2 bits), Expect = 1.5e-14, Sum P(2) = 1.5e-14
 Identities = 37/113 (32%), Positives = 58/113 (51%)

Query:   244 PSNNEAALMKAVANQPVSVAID-ASGSDFQFYSSGVF-TGQC---GTELDHGVTAVGYGT 298
             P N E+ +++ +      VA+  A+G+ F  Y SGV  T  C   GT + H    VGYG 
Sbjct:   201 PENAESEIIEILNTWKTPVAVYFAAGTAFLQYKSGVLVTEDCDLAGT-VWHAGAIVGYGE 259

Query:   299 ADD----GTKYWLVKNSWGTT-WGENGYIRMQRDID---AKEGLCGIAMQASY 343
              +D      ++W++KNSWG + WG  GY+++ R  +    + G  G  M+  Y
Sbjct:   260 ENDLRGRSQRFWIMKNSWGVSGWGTGGYVKLIRGKNWCGIERGAIGANMEEHY 312

 Score = 118 (46.6 bits), Expect = 1.5e-14, Sum P(2) = 1.5e-14
 Identities = 32/133 (24%), Positives = 59/133 (44%)

Query:    45 QYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRAPRN 102
             ++ R Y+  AE ++R + F ++   +   N  A+   +     +N+F+D T  E     +
Sbjct:    50 KFSRTYKSEAENQLRLQNFVKSRNNVVRLNKNAQKAGRNSNFAVNQFSDLTTSELHQRLS 109

Query:   103 GYKRRLP--SVRSSETTDV----SFRYENASVPASIDWRKKGA----VTG-VKDQGQCGC 151
              +   L   SV       +      + +N+    + D R +      + G +K+QGQC C
Sbjct:   110 RFPPNLTENSVFHKNFKKLLGKTRTKRQNSEFARNFDLRSQKVNGRYIVGPIKNQGQCAC 169

Query:   152 CWAFSAVAAMEGI 164
             CW F+  A +E I
Sbjct:   170 CWGFAVTAMLETI 182

 Score = 48 (22.0 bits), Expect = 2.6e-07, Sum P(2) = 2.6e-07
 Identities = 12/54 (22%), Positives = 26/54 (48%)

Query:    53 NAEKEMRFKIFKENVEYIASFNNKARNKPY-KLGINEFADQTNEEFRAPRNGYK 105
             N +++   K+++E VE+   F+   +++   +L +  F    N   R  +N  K
Sbjct:    31 NIDRDHPEKVYQEFVEFKKKFSRTYKSEAENQLRLQNFVKSRNNVVRLNKNAQK 84


>UNIPROTKB|E2RNP9 [details] [associations]
            symbol:TINAG "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0007155 "cell adhesion" evidence=IEA]
            [GO:0005604 "basement membrane" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044
            "scavenger receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 Pfam:PF01033
            PROSITE:PS00524 PROSITE:PS50958 SMART:SM00201 SMART:SM00645
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0007155
            GO:GO:0005604 GO:GO:0005044 GeneTree:ENSGT00560000076599 CTD:27283
            OMA:WGQLTSS EMBL:AAEX03008403 RefSeq:XP_538969.2
            ProteinModelPortal:E2RNP9 Ensembl:ENSCAFT00000003638 GeneID:481848
            KEGG:cfa:481848 NextBio:20856579 Uniprot:E2RNP9
        Length = 476

 Score = 134 (52.2 bits), Expect = 2.2e-14, Sum P(2) = 2.2e-14
 Identities = 34/102 (33%), Positives = 49/102 (48%)

Query:   245 SNNEAALMKAVA-NQPVSVAIDASGSDFQFYSSGVFTGQCGTELD---------HGVTAV 294
             S+NE  +MK +  N PV  AI     DF  Y +G++     T  +         H V   
Sbjct:   358 SSNETEIMKEIMQNGPVQ-AIMQVHEDFFHYKTGIYRHITRTNEESRKYQKLQTHAVKLT 416

Query:   295 GYGTADDGT----KYWLVKNSWGTTWGENGYIRMQRDIDAKE 332
             G+GT         K+W+  NSWG +WGENGY R+ R ++  +
Sbjct:   417 GWGTLKGAQGQKEKFWIAANSWGISWGENGYFRILRGVNESD 458

 Score = 123 (48.4 bits), Expect = 2.2e-14, Sum P(2) = 2.2e-14
 Identities = 47/160 (29%), Positives = 69/160 (43%)

Query:   103 GYKRRLPSVRSSETTDVSFRYENASVPASID--------WRKKGAVTGVKDQGQCGCCWA 154
             G+K RL ++  S    +S     AS+PA+ D        ++  G   G  DQ  C   WA
Sbjct:   187 GFKYRLGTLPPSPML-LSMNEMTASLPATTDLPEFFIASYKWPGWTHGPLDQKNCAASWA 245

Query:   155 FS--AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLAT 212
             FS  +VAA         R   +LS Q L+ C       GC  G +D A+ F+   +GL +
Sbjct:   246 FSTASVAADRIAIQSNGRYTANLSPQNLISCCAKNR-HGCNSGSIDRAWWFL-RKRGLVS 303

Query:   213 EAKYP-YK---ASDGSCNKKEANPSAAKISGYEDVPSNNE 248
              A YP +K   A++  C     +    K    +  P+N E
Sbjct:   304 HACYPLFKDQNATNYGCAMASRSDGRGKRHATKPCPNNIE 343


>ZFIN|ZDB-GENE-041010-139 [details] [associations]
            symbol:ctsz "cathepsin Z" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0001525 "angiogenesis"
            evidence=IMP] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 ZFIN:ZDB-GENE-041010-139 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0001525
            CTD:1522 HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568
            OrthoDB:EOG42Z4QN UniGene:Dr.935 eggNOG:NOG275763 EMBL:BC083369
            IPI:IPI00483065 RefSeq:NP_001006043.1 ProteinModelPortal:Q5XJD4
            SMR:Q5XJD4 STRING:Q5XJD4 GeneID:450022 KEGG:dre:450022
            InParanoid:Q5XJD4 NextBio:20833005 ArrayExpress:Q5XJD4
            Uniprot:Q5XJD4
        Length = 301

 Score = 201 (75.8 bits), Expect = 2.3e-14, P = 2.3e-14
 Identities = 69/245 (28%), Positives = 113/245 (46%)

Query:   105 KRRLPSVRSSETTDVSFRYENASVPASIDWRK-KGA--VTGVKDQG---QCGCCWAFSAV 158
             +R L  V++      S   +   +P   DWR  KG   V+  ++Q     CG CWA  + 
Sbjct:    33 RRNLQGVKTGPRPYESMNLKE--LPKEWDWRNIKGVNYVSTTRNQHIPQYCGSCWAHGST 90

Query:   159 AAM-EGINHITTRKLTS--LSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
             +A+ + IN        S  LS Q ++DC  +G    C GG     +E+   NKG+  E  
Sbjct:    91 SALADRINIKRKAAWPSAYLSVQNVIDCGDAGS---CSGGDHSGVWEYA-HNKGIPDETC 146

Query:   216 YPYKASDGSCNKKEANPSAAKIS-GYEDVPSN----------NEAAL--MKA--VANQPV 260
               Y+A D  C  K  N      + G  ++  N          + + L  MKA   +  P+
Sbjct:   147 NNYQAKDQDC--KPFNQCGTCTTFGVCNIVKNFTLWKVGDYGSASGLDKMKAEIYSGGPI 204

Query:   261 SVAIDASGSDFQFYSSGVFTGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGEN 319
             S  I A+      Y+ G+++       ++H V+  G+G  ++G ++W+V+NSWG  WGE 
Sbjct:   205 SCGIMATDK-LDAYTGGLYSEYVQEPYINHIVSVAGWGVDENGVEFWVVRNSWGEPWGEK 263

Query:   320 GYIRM 324
             G++R+
Sbjct:   264 GWLRI 268


>UNIPROTKB|F1SVA2 [details] [associations]
            symbol:TINAGL1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005615 "extracellular space" evidence=IDA] [GO:0043236
            "laminin binding" evidence=IEA] [GO:0031012 "extracellular matrix"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044
            "scavenger receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005737
            GO:GO:0005615 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0031012 GO:GO:0005044 GeneTree:ENSGT00560000076599
            OMA:DNCNRCT EMBL:CU856262 Ensembl:ENSSSCT00000003995 Uniprot:F1SVA2
        Length = 467

 Score = 135 (52.6 bits), Expect = 2.6e-14, Sum P(2) = 2.6e-14
 Identities = 32/91 (35%), Positives = 49/91 (53%)

Query:   145 DQGQCGCCWAFSAVA-AMEGIN-HITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFE 202
             DQG C   WAFS  A A + ++ H        LS Q L+ CDT  + QGC+GG +D A+ 
Sbjct:   222 DQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHNQ-QGCQGGRLDGAWW 280

Query:   203 FIISNKGLATEAKYPYKASDGSCNKKEANPS 233
             F+   +G+ ++  YP+   +    + EA P+
Sbjct:   281 FL-RRRGVVSDHCYPFSGHE----RNEAGPA 306

 Score = 121 (47.7 bits), Expect = 2.6e-14, Sum P(2) = 2.6e-14
 Identities = 34/95 (35%), Positives = 47/95 (49%)

Query:   246 NNEAALMKAVA-NQPVSVAIDASGSDFQFYSSGVFT------GQCGTELDHGVTAV---G 295
             +NE  +MK +  N PV   ++    DF  Y SG+++      G+      HG  +V   G
Sbjct:   348 SNEKDIMKELMENGPVQALMEVH-EDFFLYQSGIYSHTPVSHGRPERYRRHGTHSVKITG 406

Query:   296 YG--TADDGT--KYWLVKNSWGTTWGENGYIRMQR 326
             +G  T  DG   KYW   NSWG  WGE G+ R+ R
Sbjct:   407 WGEETLPDGRMLKYWTAANSWGPGWGERGHFRIVR 441


>UNIPROTKB|H0YDT2 [details] [associations]
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            EMBL:AP001201 HGNC:HGNC:2546 Ensembl:ENST00000526034 Bgee:H0YDT2
            Uniprot:H0YDT2
        Length = 211

 Score = 168 (64.2 bits), Expect = 3.3e-14, Sum P(2) = 3.3e-14
 Identities = 48/170 (28%), Positives = 73/170 (42%)

Query:    11 VLAAILVLGVWAP-QSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
             +L A L  G+  P ++         + E  +++  Q+ R Y    E   R  IF  N+  
Sbjct:    12 LLVAGLAQGIRGPLRAQDLGPQPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQ 71

Query:    70 IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVP 129
                   +      + G+  F+D T EEF     GY+R    V S    ++       SVP
Sbjct:    72 AQRLQEEDLGTA-EFGVTPFSDLTEEEF-GQLYGYRRAAGGVPSMGR-EIRSEEPEESVP 128

Query:   130 ASIDWRK-KGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQ 178
              S DWRK   A++ +KDQ  C CCWA +A   +E +  I+      +S Q
Sbjct:   129 FSCDWRKVASAISPIKDQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVQ 178

 Score = 38 (18.4 bits), Expect = 3.3e-14, Sum P(2) = 3.3e-14
 Identities = 9/23 (39%), Positives = 13/23 (56%)

Query:   209 GLATEAKYPY--KASDGSCNKKE 229
             GLA+E  YP+  K     C+ K+
Sbjct:   180 GLASEKDYPFQGKVRAHRCHPKK 202


>WB|WBGene00009158 [details] [associations]
            symbol:F26E4.3 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005576
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005044
            GeneTree:ENSGT00560000076599 HSSP:P07711 EMBL:Z81070
            eggNOG:NOG310046 HOGENOM:HOG000241342 OMA:DNCNRCT PIR:T21421
            RefSeq:NP_492593.2 ProteinModelPortal:P90850 SMR:P90850
            PaxDb:P90850 EnsemblMetazoa:F26E4.3.1 EnsemblMetazoa:F26E4.3.2
            GeneID:172827 KEGG:cel:CELE_F26E4.3 UCSC:F26E4.3.1 CTD:172827
            WormBase:F26E4.3 InParanoid:P90850 NextBio:877161 Uniprot:P90850
        Length = 452

 Score = 130 (50.8 bits), Expect = 4.2e-14, Sum P(2) = 4.2e-14
 Identities = 41/126 (32%), Positives = 54/126 (42%)

Query:   215 KYPYKASDG-SCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
             K  Y    G  C     + +A K++    V S  E    + + N PV         DF  
Sbjct:   291 KRDYTNRQGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVH-EDFFM 349

Query:   274 YSSGVF-----TGQCGT----ELDHGVTAVGYGTADDGT----KYWLVKNSWGTTWGENG 320
             Y+ GV+       Q G     E  H V  +G+G  D  T    KYWL  NSWGT WGE+G
Sbjct:   350 YAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGV-DHSTGKPIKYWLCANSWGTQWGEDG 408

Query:   321 YIRMQR 326
             Y ++ R
Sbjct:   409 YFKVLR 414

 Score = 124 (48.7 bits), Expect = 4.2e-14, Sum P(2) = 4.2e-14
 Identities = 39/128 (30%), Positives = 65/128 (50%)

Query:    98 RAPRNGYKRRLPSV---RSSETTDVSFRYENASVPASIDWRKKGA--VTGVKDQGQCGCC 152
             R+  +G K RL ++   RS +  +     +   +P   D R K    +  V DQG CG  
Sbjct:   152 RSLSDGIKYRLGTLFPERSVQNMN-EILIKPRELPEHFDARDKWGPLIHPVADQGDCGSS 210

Query:   153 WAFSAVA-AMEGINHITTRKLTS-LSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
             W+ S  A + + +  I+  ++ S LS Q+L+ C+   + +GCEGG +D A+ +I    G+
Sbjct:   211 WSVSTTAISSDRLAIISEGRINSTLSSQQLLSCNQHRQ-KGCEGGYLDRAWWYI-RKLGV 268

Query:   211 ATEAKYPY 218
               +  YPY
Sbjct:   269 VGDHCYPY 276


>UNIPROTKB|Q9GZM7 [details] [associations]
            symbol:TINAGL1 "Tubulointerstitial nephritis antigen-like"
            species:9606 "Homo sapiens" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=IEA] [GO:0043236 "laminin binding" evidence=IEA]
            [GO:0016197 "endosomal transport" evidence=TAS] [GO:0005201
            "extracellular matrix structural constituent" evidence=NAS]
            [GO:0005576 "extracellular region" evidence=NAS] [GO:0031012
            "extracellular matrix" evidence=IDA] [GO:0005615 "extracellular
            space" evidence=ISS] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0005737 GO:GO:0005615
            GO:GO:0006955 GO:GO:0030247 EMBL:CH471059 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0016197 EMBL:AC114488 GO:GO:0005044 GO:GO:0005201
            eggNOG:NOG310046 HOGENOM:HOG000241342 HOVERGEN:HBG053961
            EMBL:AF236155 EMBL:AF236151 EMBL:AF236152 EMBL:AF236153
            EMBL:AF236154 EMBL:AF236150 EMBL:AF205436 EMBL:AB050716
            EMBL:AB050719 EMBL:AK074124 EMBL:AY358421 EMBL:AF289569
            EMBL:AK027839 EMBL:AK292770 EMBL:AK298382 EMBL:AK075398
            EMBL:BC009048 EMBL:BC064633 IPI:IPI00005563 IPI:IPI00439435
            IPI:IPI00910801 RefSeq:NP_001191343.1 RefSeq:NP_001191344.1
            RefSeq:NP_071447.1 UniGene:Hs.199368 ProteinModelPortal:Q9GZM7
            SMR:Q9GZM7 IntAct:Q9GZM7 MINT:MINT-253718 STRING:Q9GZM7
            MEROPS:C01.975 PhosphoSite:Q9GZM7 DMDM:61213628 PaxDb:Q9GZM7
            PRIDE:Q9GZM7 Ensembl:ENST00000271064 Ensembl:ENST00000457433
            GeneID:64129 KEGG:hsa:64129 UCSC:uc001bta.3 CTD:64129
            GeneCards:GC01P032042 HGNC:HGNC:19168 HPA:HPA048695
            neXtProt:NX_Q9GZM7 PharmGKB:PA38810 InParanoid:Q9GZM7 OMA:DNCNRCT
            OrthoDB:EOG4BG8W0 PhylomeDB:Q9GZM7 ChiTaRS:TINAGL1 GenomeRNAi:64129
            NextBio:66016 ArrayExpress:Q9GZM7 Bgee:Q9GZM7 CleanEx:HS_TINAGL1
            Genevestigator:Q9GZM7 GermOnline:ENSG00000142910 Uniprot:Q9GZM7
        Length = 467

 Score = 132 (51.5 bits), Expect = 4.5e-14, Sum P(2) = 4.5e-14
 Identities = 32/91 (35%), Positives = 48/91 (52%)

Query:   145 DQGQCGCCWAFSAVA-AMEGIN-HITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFE 202
             DQG C   WAFS  A A + ++ H        LS Q L+ CDT  + QGC GG +D A+ 
Sbjct:   222 DQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTH-QQQGCRGGRLDGAWW 280

Query:   203 FIISNKGLATEAKYPYKASDGSCNKKEANPS 233
             F+   +G+ ++  YP+   +    + EA P+
Sbjct:   281 FL-RRRGVVSDHCYPFSGRE----RDEAGPA 306

 Score = 122 (48.0 bits), Expect = 4.5e-14, Sum P(2) = 4.5e-14
 Identities = 31/98 (31%), Positives = 48/98 (48%)

Query:   245 SNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFT------GQCGTELDHGVTAV---G 295
             SN++  + + + N PV   ++    DF  Y  G+++      G+      HG  +V   G
Sbjct:   348 SNDKEIMKELMENGPVQALMEVH-EDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITG 406

Query:   296 YG--TADDGT--KYWLVKNSWGTTWGENGYIRMQRDID 329
             +G  T  DG   KYW   NSWG  WGE G+ R+ R ++
Sbjct:   407 WGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVN 444


>UNIPROTKB|E1B9H1 [details] [associations]
            symbol:TINAGL1 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0043236 "laminin binding" evidence=IEA] [GO:0031012
            "extracellular matrix" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0005044 "scavenger receptor
            activity" evidence=IEA] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0005737 GO:GO:0006955
            GO:GO:0030247 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00639 GO:GO:0031012 GO:GO:0005044
            GeneTree:ENSGT00560000076599 OMA:DNCNRCT EMBL:DAAA02006255
            IPI:IPI00732137 Ensembl:ENSBTAT00000038022 Uniprot:E1B9H1
        Length = 469

 Score = 127 (49.8 bits), Expect = 1.0e-13, Sum P(2) = 1.0e-13
 Identities = 29/76 (38%), Positives = 42/76 (55%)

Query:   145 DQGQCGCCWAFSAVA-AMEGIN-HITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFE 202
             DQG C   WAFS  A A + ++ H        LS Q L+ CDT  + QGC GG +D A+ 
Sbjct:   224 DQGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQNLLSCDTHNQ-QGCRGGRLDGAWW 282

Query:   203 FIISNKGLATEAKYPY 218
             F+   +G+ ++  YP+
Sbjct:   283 FL-RRRGVVSDHCYPF 297

 Score = 124 (48.7 bits), Expect = 1.0e-13, Sum P(2) = 1.0e-13
 Identities = 34/95 (35%), Positives = 47/95 (49%)

Query:   246 NNEAALMKAVA-NQPVSVAIDASGSDFQFYSSGVFT------GQCGTELDHGVTAV---G 295
             +NE  +MK +  N PV   ++    DF  Y SG+++      G+      HG  +V   G
Sbjct:   350 SNEKEIMKELMENGPVQALMEVH-EDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITG 408

Query:   296 YG--TADDGT--KYWLVKNSWGTTWGENGYIRMQR 326
             +G  T  DG   KYW   NSWG  WGE G+ R+ R
Sbjct:   409 WGEETLPDGRTIKYWTAANSWGPAWGERGHFRIVR 443


>UNIPROTKB|E2QXH3 [details] [associations]
            symbol:TINAGL1 "Uncharacterized protein" species:9615
            "Canis lupus familiaris" [GO:0043236 "laminin binding"
            evidence=IEA] [GO:0031012 "extracellular matrix" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006955 "immune response" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044 "scavenger
            receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005737
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639 GO:GO:0031012
            GO:GO:0005044 GeneTree:ENSGT00560000076599 CTD:64129 OMA:DNCNRCT
            EMBL:AAEX03001668 RefSeq:XP_535330.3 Ensembl:ENSCAFT00000035659
            GeneID:478155 KEGG:cfa:478155 NextBio:20853523 Uniprot:E2QXH3
        Length = 467

 Score = 130 (50.8 bits), Expect = 1.5e-13, Sum P(2) = 1.5e-13
 Identities = 41/140 (29%), Positives = 69/140 (49%)

Query:   103 GYKRRLPSVR-SSETTDVSFRYE----NASVPASIDWRKK--GAVTGVKDQGQCGCCWAF 155
             G + RL ++R SS  T+++  +        +P + +  +K    +    DQG C   WAF
Sbjct:   173 GIRYRLGTIRPSSSVTNMNEIHTVLRPGEVLPTAFEAAEKWPNLIHEPLDQGNCAGSWAF 232

Query:   156 SAVA-AMEGIN-HITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
             S  A A + ++ H        LS Q L+ CDT  + QGC GG +D A+ F+   +G+ ++
Sbjct:   233 STAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHNQ-QGCRGGRLDGAWWFL-RRRGVVSD 290

Query:   214 AKYPYKASDGSCNKKEANPS 233
               YP+   +    + EA P+
Sbjct:   291 HCYPFVGRE----QDEAGPA 306

 Score = 119 (46.9 bits), Expect = 1.5e-13, Sum P(2) = 1.5e-13
 Identities = 33/94 (35%), Positives = 45/94 (47%)

Query:   247 NEAALMKAVA-NQPVSVAIDASGSDFQFYSSGVFT------GQCGTELDHGVTAV---GY 296
             NE  +MK +  N PV   ++    DF  Y  G+++      G+      HG  +V   G+
Sbjct:   349 NEKEIMKELMENGPVQALMEVH-EDFFLYQGGIYSHTPVSLGRPERYRRHGTHSVKITGW 407

Query:   297 G--TADDGT--KYWLVKNSWGTTWGENGYIRMQR 326
             G  T  DG   KYW   NSWG  WGE G+ R+ R
Sbjct:   408 GEETLPDGRTLKYWTAANSWGPAWGERGHFRIVR 441


>MGI|MGI:2137617 [details] [associations]
            symbol:Tinagl1 "tubulointerstitial nephritis antigen-like 1"
            species:10090 "Mus musculus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            [GO:0043236 "laminin binding" evidence=IPI] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 MGI:MGI:2137617
            GO:GO:0005737 GO:GO:0005576 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GO:GO:0031012 CleanEx:MM_ARG1 GO:GO:0005044
            GeneTree:ENSGT00560000076599 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OMA:DNCNRCT
            OrthoDB:EOG4BG8W0 EMBL:AB047402 EMBL:AB050626 EMBL:BC005738
            EMBL:BC018539 IPI:IPI00115458 RefSeq:NP_001161805.1
            RefSeq:NP_075965.2 UniGene:Mm.15801 ProteinModelPortal:Q99JR5
            SMR:Q99JR5 STRING:Q99JR5 PhosphoSite:Q99JR5 PaxDb:Q99JR5
            PRIDE:Q99JR5 Ensembl:ENSMUST00000030560 Ensembl:ENSMUST00000105998
            Ensembl:ENSMUST00000105999 GeneID:94242 KEGG:mmu:94242
            InParanoid:Q99JR5 NextBio:352247 Bgee:Q99JR5 Genevestigator:Q99JR5
            GermOnline:ENSMUSG00000028776 Uniprot:Q99JR5
        Length = 466

 Score = 135 (52.6 bits), Expect = 2.2e-13, Sum P(2) = 2.2e-13
 Identities = 41/140 (29%), Positives = 70/140 (50%)

Query:   103 GYKRRLPSVRSSETT-DVSFRY----ENASVPASIDWRKK--GAVTGVKDQGQCGCCWAF 155
             G + RL ++R S T  +++  Y    +   +P + +  +K    +    DQG C   WAF
Sbjct:   172 GIRYRLGTIRPSSTVMNMNEIYTVLGQGEVLPTAFEASEKWPNLIHEPLDQGNCAGSWAF 231

Query:   156 SAVA-AMEGIN-HITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
             S  A A + ++ H        LS Q L+ CDT  + QGC GG +D A+ F+   +G+ ++
Sbjct:   232 STAAVASDRVSIHSLGHMTPILSPQNLLSCDTHHQ-QGCRGGRLDGAWWFL-RRRGVVSD 289

Query:   214 AKYPYKASDGSCNKKEANPS 233
               YP+   +    + EA+P+
Sbjct:   290 NCYPFSGRE----QNEASPT 305

 Score = 112 (44.5 bits), Expect = 2.2e-13, Sum P(2) = 2.2e-13
 Identities = 32/95 (33%), Positives = 46/95 (48%)

Query:   246 NNEAALMKAVA-NQPVSVAIDASGSDFQFYSSGVFT------GQCGTELDHGVTAV---G 295
             ++E  +MK +  N PV   ++    DF  Y  G+++      G+      HG  +V   G
Sbjct:   347 SDEKEIMKELMENGPVQALMEVH-EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITG 405

Query:   296 YG--TADDGT--KYWLVKNSWGTTWGENGYIRMQR 326
             +G  T  DG   KYW   NSWG  WGE G+ R+ R
Sbjct:   406 WGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVR 440


>RGD|70956 [details] [associations]
            symbol:Tinagl1 "tubulointerstitial nephritis antigen-like 1"
            species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA;ISO] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            [GO:0031012 "extracellular matrix" evidence=IEA;ISO] [GO:0043236
            "laminin binding" evidence=IEA;ISO] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 RGD:70956 GO:GO:0005737
            GO:GO:0005576 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0031012 GO:GO:0005044 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OrthoDB:EOG4BG8W0
            EMBL:AB050717 IPI:IPI00190428 RefSeq:NP_446034.1 UniGene:Rn.1256
            ProteinModelPortal:Q9EQT5 PRIDE:Q9EQT5 GeneID:94174 KEGG:rno:94174
            UCSC:RGD:70956 InParanoid:Q9EQT5 NextBio:617830 ArrayExpress:Q9EQT5
            Genevestigator:Q9EQT5 GermOnline:ENSRNOG00000013179 Uniprot:Q9EQT5
        Length = 467

 Score = 130 (50.8 bits), Expect = 2.5e-13, Sum P(2) = 2.5e-13
 Identities = 40/140 (28%), Positives = 69/140 (49%)

Query:   103 GYKRRLPSVR-SSETTDVSFRY----ENASVPASIDWRKK--GAVTGVKDQGQCGCCWAF 155
             G + RL ++R SS   +++  Y    +   +P + +  +K    +    DQG C   WAF
Sbjct:   172 GIRYRLGTIRPSSSVMNMNEIYTVLGQGEVLPTAFEASEKWPNLIHEPLDQGNCAGSWAF 231

Query:   156 SAVA-AMEGIN-HITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
             S  A A + ++ H        LS Q L+ CDT  + +GC GG +D A+ F+   +G+ ++
Sbjct:   232 STAAVASDRVSIHSLGHMTPILSPQNLLSCDTHHQ-KGCRGGRLDGAWWFL-RRRGVVSD 289

Query:   214 AKYPYKASDGSCNKKEANPS 233
               YP+   +      EA+P+
Sbjct:   290 NCYPFSGRE---QNDEASPT 306

 Score = 117 (46.2 bits), Expect = 2.5e-13, Sum P(2) = 2.5e-13
 Identities = 33/99 (33%), Positives = 49/99 (49%)

Query:   245 SNNEAALMKAVA-NQPVSVAIDASGSDFQFYSSGVFT------GQCGTELDHGVTAV--- 294
             +++E  +MK +  N PV   ++    DF  Y  G+++      G+      HG  +V   
Sbjct:   347 ASDEKEIMKELMENGPVQALMEVH-EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKIT 405

Query:   295 GYG--TADDGT--KYWLVKNSWGTTWGENGYIRMQRDID 329
             G+G  T  DG   KYW   NSWG  WGE G+ R+ R I+
Sbjct:   406 GWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGIN 444


>UNIPROTKB|Q9EQT5
            symbol:Tinagl1 "Tubulointerstitial nephritis antigen-like"
            species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 RGD:70956 GO:GO:0005737
            GO:GO:0005576 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0031012 GO:GO:0005044 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OrthoDB:EOG4BG8W0
            EMBL:AB050717 IPI:IPI00190428 RefSeq:NP_446034.1 UniGene:Rn.1256
            ProteinModelPortal:Q9EQT5 PRIDE:Q9EQT5 GeneID:94174 KEGG:rno:94174
            UCSC:RGD:70956 InParanoid:Q9EQT5 NextBio:617830 ArrayExpress:Q9EQT5
            Genevestigator:Q9EQT5 GermOnline:ENSRNOG00000013179 Uniprot:Q9EQT5
        Length = 467

 Score = 130 (50.8 bits), Expect = 2.5e-13, Sum P(2) = 2.5e-13
 Identities = 40/140 (28%), Positives = 69/140 (49%)

Query:   103 GYKRRLPSVR-SSETTDVSFRY----ENASVPASIDWRKK--GAVTGVKDQGQCGCCWAF 155
             G + RL ++R SS   +++  Y    +   +P + +  +K    +    DQG C   WAF
Sbjct:   172 GIRYRLGTIRPSSSVMNMNEIYTVLGQGEVLPTAFEASEKWPNLIHEPLDQGNCAGSWAF 231

Query:   156 SAVA-AMEGIN-HITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
             S  A A + ++ H        LS Q L+ CDT  + +GC GG +D A+ F+   +G+ ++
Sbjct:   232 STAAVASDRVSIHSLGHMTPILSPQNLLSCDTHHQ-KGCRGGRLDGAWWFL-RRRGVVSD 289

Query:   214 AKYPYKASDGSCNKKEANPS 233
               YP+   +      EA+P+
Sbjct:   290 NCYPFSGRE---QNDEASPT 306

 Score = 117 (46.2 bits), Expect = 2.5e-13, Sum P(2) = 2.5e-13
 Identities = 33/99 (33%), Positives = 49/99 (49%)

Query:   245 SNNEAALMKAVA-NQPVSVAIDASGSDFQFYSSGVFT------GQCGTELDHGVTAV--- 294
             +++E  +MK +  N PV   ++    DF  Y  G+++      G+      HG  +V   
Sbjct:   347 ASDEKEIMKELMENGPVQALMEVH-EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKIT 405

Query:   295 GYG--TADDGT--KYWLVKNSWGTTWGENGYIRMQRDID 329
             G+G  T  DG   KYW   NSWG  WGE G+ R+ R I+
Sbjct:   406 GWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGIN 444


>UNIPROTKB|Q3SZI1
            symbol:TINAG "Tubulointerstitial nephritis antigen"
            species:9913 "Bos taurus" [GO:0005604 "basement membrane"
            evidence=IEA] [GO:0007155 "cell adhesion" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044
            "scavenger receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 Pfam:PF01033
            PROSITE:PS00524 PROSITE:PS50958 SMART:SM00201 SMART:SM00645
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0007155
            GO:GO:0005604 GO:GO:0005044 GeneTree:ENSGT00560000076599
            EMBL:BC102843 IPI:IPI00689615 RefSeq:NP_001030279.1
            UniGene:Bt.29080 ProteinModelPortal:Q3SZI1 MEROPS:C01.973
            PRIDE:Q3SZI1 Ensembl:ENSBTAT00000016790 GeneID:512517
            KEGG:bta:512517 CTD:27283 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 InParanoid:Q3SZI1 OMA:WGQLTSS OrthoDB:EOG47PX5P
            NextBio:20870427 Uniprot:Q3SZI1
        Length = 476

 Score = 135 (52.6 bits), Expect = 9.9e-13, Sum P(2) = 9.9e-13
 Identities = 35/102 (34%), Positives = 50/102 (49%)

Query:   245 SNNEAALMKAVA-NQPVSVAIDASGSDFQFYSSGVFTGQCGTELD---------HGVTAV 294
             S+NE  +M+ +  N PV  AI     DF  Y +G++     T  D         H V   
Sbjct:   358 SSNETEIMREIMQNGPVQ-AIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVKLT 416

Query:   295 GYGT--ADDGTK--YWLVKNSWGTTWGENGYIRMQRDIDAKE 332
             G+GT     G K  +W+  NSWG +WGENGY R+ R ++  +
Sbjct:   417 GWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESD 458

 Score = 106 (42.4 bits), Expect = 9.9e-13, Sum P(2) = 9.9e-13
 Identities = 33/100 (33%), Positives = 47/100 (47%)

Query:   130 ASIDWRKKGAVTGVKDQGQCGCCWAFS--AVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
             AS  W   G   G  DQ  C   WAFS  +VAA         R   +LS Q L+ C  + 
Sbjct:   223 ASYKW--PGWTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNLISC-CAK 279

Query:   188 EDQGCEGGLMDDAFEFIISNKGLATEAKYP-YKASDGSCN 226
             +  GC  G +D A+ ++   +GL + A YP +K  + + N
Sbjct:   280 KRHGCNSGSVDRAWWYL-RKRGLVSHACYPLFKDQNATNN 318


>DICTYBASE|DDB_G0283921
            symbol:ctsB "cathepsin B precursor" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0283921 GO:GO:0005615
            GenomeReviews:CM000153_GR GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:AAFI02000058
            eggNOG:NOG315657 PANTHER:PTHR12411:SF16 OMA:CSLSCQS
            RefSeq:XP_638805.1 HSSP:P07688 MEROPS:C01.A59
            EnsemblProtists:DDB0233997 GeneID:8624329 KEGG:ddi:DDB_G0283921
            Uniprot:Q54QD9
        Length = 311

 Score = 188 (71.2 bits), Expect = 1.3e-12, P = 1.3e-12
 Identities = 64/235 (27%), Positives = 104/235 (44%)

Query:   127 SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
             S  A  +W     ++ +++Q +CG CWAF A  +      I   +   LS  ++V CD +
Sbjct:    82 SFNAQTNWPNCTTISQIQNQARCGSCWAFGATESATDRLCIHNNENVQLSFMDMVTCDET 141

Query:   187 --GEDQGC----------EGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKK-EANPS 233
               G + G           +G + ++   + I     A +    +  +  SC K+ ++N S
Sbjct:   142 DNGCEGGDAFSAWNWLRKQGAVSEECLPYTIPTCPPAQQPCLNF-VNTPSCTKECQSNSS 200

Query:   234 A---------AKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCG 284
                       AKI  ++    ++EA + + V N PV         DF  Y SGV+    G
Sbjct:   201 LIYSQDKHKMAKIYSFD----SDEAIMQEIVTNGPVEACFTVF-EDFLAYKSGVYVHTTG 255

Query:   285 TELD-HGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
              +L  H V  VG+GT + G  Y+   N W T+WG+NG   ++R      G CGI+
Sbjct:   256 KDLGGHCVKLVGFGTLN-GVDYYAANNQWTTSWGDNGTFLIKR------GDCGIS 303


>TAIR|locus:2060420
            symbol:AT2G22160 "AT2G22160" species:3702 "Arabidopsis
            thaliana" [GO:0005575 "cellular_component" evidence=ND] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] EMBL:CP002685
            GenomeReviews:CT485783_GR InterPro:IPR013201 Pfam:PF08246
            SMART:SM00848 EMBL:AC007168 IPI:IPI00544896 PIR:F84609
            RefSeq:NP_179806.1 UniGene:At.66231 HSSP:P25774
            ProteinModelPortal:Q9SIE8 SMR:Q9SIE8 EnsemblPlants:AT2G22160.1
            GeneID:816750 KEGG:ath:AT2G22160 TAIR:At2g22160 eggNOG:NOG297278
            InParanoid:Q9SIE8 OMA:HRCITLA PhylomeDB:Q9SIE8 ArrayExpress:Q9SIE8
            Genevestigator:Q9SIE8 Uniprot:Q9SIE8
        Length = 105

 Score = 170 (64.9 bits), Expect = 2.5e-12, P = 2.5e-12
 Identities = 42/94 (44%), Positives = 53/94 (56%)

Query:    55 EKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSS 114
             + E  F +FK+N EYI    NK R KPYKL +N+FA+ T+ EF      +     S    
Sbjct:    10 QTESSFDVFKKNAEYIVK-TNKER-KPYKLKLNKFANLTDVEFVNAHTCFDM---SDHKK 64

Query:   115 ETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQG 147
                   F YEN +  P S+DWR+KGAVT VKDQG
Sbjct:    65 ILDSKPFFYENMTQAPDSLDWREKGAVTNVKDQG 98
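
Note on Expect vs. P: the Expect and P values reported for each hit are nearly identical. Under the usual Poisson model for high-scoring segment pairs, P = 1 - exp(-E), which is approximately E when E is small. The Python sketch below illustrates that conversion. It assumes this model is what the report uses, and the function name `p_from_expect` is hypothetical.

```python
import math

def p_from_expect(expect):
    """Convert an Expect value to a P value, assuming the Poisson model
    P = 1 - exp(-E). This model is an assumption about the report."""
    # -expm1(-E) equals 1 - exp(-E) but keeps precision for very small E.
    return -math.expm1(-expect)

# For the small Expect values in this report, P and E agree to the
# precision printed.
for e in (2.5e-13, 1.3e-12, 2.5e-12):
    print(f"E = {e:.1e}  ->  P = {p_from_expect(e):.1e}")
```

The two values only diverge noticeably once E approaches 1, far above the E=0.001 threshold used for this search.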

WARNING:  HSPs involving 48 database sequences were not reported due to the
          limiting value of parameter B = 250.


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.315   0.130   0.392    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      346       346   0.00098  116 3  11 23  0.44    34
                                                     33  0.45    37
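
Note on bit scores: the bit scores shown in parentheses in the alignments appear to follow from the raw scores and the gapped Lambda and K values listed above (the Q=9,R=2 row) through the standard Karlin-Altschul normalization, bits = (Lambda * S - ln K) / ln 2. The Python sketch below is a check under that assumption, not a description of the program's internals. The function name `bit_score` is hypothetical.

```python
import math

# Gapped statistical parameters taken from the "Q=9,R=2" row above.
GAPPED_LAMBDA = 0.244
GAPPED_K = 0.0300

def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Normalize a raw alignment score to bits:
    bits = (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)

# Raw scores copied from the alignments in this report.
for raw in (188, 170, 130):
    print(f"Score = {raw} ({bit_score(raw):.1f} bits)")
```

With these parameters a raw score of 188 normalizes to about 71.2 bits and 130 to about 50.8 bits, which agrees with the values printed for the ctsB and Tinagl1 hits.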


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  298
  No. of states in DFA:  622 (66 KB)
  Total size of DFA:  252 KB (2134 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  28.07u 0.23s 28.30t   Elapsed:  00:00:01
  Total cpu time:  28.12u 0.23s 28.35t   Elapsed:  00:00:01
  Start:  Fri May 10 09:30:14 2013   End:  Fri May 10 09:30:15 2013
WARNINGS ISSUED:  2
