BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>043883
MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD
NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS
SQVPPSVNWIEKGAVTPVKYQGQCAVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC
YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL
LKAVANQPVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWG
QDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSADKSSAC

High Scoring Gene Products

Symbol, full name Information P value
AT3G49340 protein from Arabidopsis thaliana 1.8e-78
AT2G34080 protein from Arabidopsis thaliana 7.9e-78
SAG12
senescence-associated gene 12
protein from Arabidopsis thaliana 7.9e-78
AT2G27420 protein from Arabidopsis thaliana 7.1e-77
AT3G19390 protein from Arabidopsis thaliana 1.1e-76
AT1G29090 protein from Arabidopsis thaliana 2.7e-75
CEP1
cysteine endopeptidase 1
protein from Arabidopsis thaliana 5.1e-74
AT1G29080 protein from Arabidopsis thaliana 2.9e-71
CEP3
cysteine endopeptidase 3
protein from Arabidopsis thaliana 2.9e-71
XCP1
xylem cysteine peptidase 1
protein from Arabidopsis thaliana 1.8e-69
XCP2
AT1G20850
protein from Arabidopsis thaliana 1.3e-68
AT3G19400 protein from Arabidopsis thaliana 2.7e-68
AT3G43960 protein from Arabidopsis thaliana 1.5e-67
AT1G29110 protein from Arabidopsis thaliana 5.1e-67
RD21A
responsive to dehydration 21A
protein from Arabidopsis thaliana 1.3e-66
RD21B
esponsive to dehydration 21B
protein from Arabidopsis thaliana 1.7e-66
XBCP3
xylem bark cysteine peptidase 3
protein from Arabidopsis thaliana 3.2e-65
AT4G23520 protein from Arabidopsis thaliana 1.6e-63
AT1G06260 protein from Arabidopsis thaliana 3.3e-63
CP1
cysteine protease 1
protein from Arabidopsis thaliana 1.3e-61
CP2
cysteine protease 2
protein from Arabidopsis thaliana 2.4e-60
CTSL2
Uncharacterized protein
protein from Gallus gallus 3.3e-56
Ssc.54235
Uncharacterized protein
protein from Sus scrofa 6.1e-55
Cys
Crustapain
protein from Pandalus borealis 6.1e-55
Ctsll3
cathepsin L-like 3
gene from Rattus norvegicus 6.1e-55
RGD1308751
similar to Cathepsin L precursor (Major excreted protein) (MEP)
gene from Rattus norvegicus 1.6e-54
Ctsl1
cathepsin L1
gene from Rattus norvegicus 3.4e-54
Ctsl
cathepsin L
protein from Mus musculus 5.5e-54
CTSL1
Cathepsin L1
protein from Canis lupus familiaris 8.9e-54
Cp1
Cysteine proteinase-1
protein from Drosophila melanogaster 1.5e-53
CTSS
Uncharacterized protein
protein from Sus scrofa 2.4e-53
CTSL1
CTSL1 protein
protein from Bos taurus 6.3e-53
cprB
cysteine proteinase 2
gene from Dictyostelium discoideum 8.7e-53
CTSS
Cathepsin S
protein from Homo sapiens 1.3e-52
wu:fb37b09 gene_product from Danio rerio 1.7e-52
CTSS
Cathepsin S
protein from Bos taurus 4.4e-52
zgc:174855 gene_product from Danio rerio 4.4e-52
CTSL1
Cathepsin L1
protein from Sus scrofa 7.2e-52
ctsl1a
cathepsin L, 1 a
gene_product from Danio rerio 7.2e-52
CTSS
Cathepsin S
protein from Canis lupus familiaris 1.2e-51
CTSL1
Cathepsin L1
protein from Homo sapiens 1.9e-51
CTSS
Cathepsin S
protein from Canis lupus familiaris 3.1e-51
zgc:174153 gene_product from Danio rerio 3.1e-51
CTSL1
Cathepsin L1
protein from Canis lupus familiaris 4.0e-51
cpl-1 gene from Caenorhabditis elegans 4.0e-51
CTSL1
Cathepsin L1
protein from Bos taurus 5.1e-51
CTSK
Cathepsin K
protein from Canis lupus familiaris 5.1e-51
CTSK
Cathepsin K
protein from Canis lupus familiaris 5.1e-51
Testin
testin gene
gene from Rattus norvegicus 5.1e-51
ctsl.1
cathepsin L.1
gene_product from Danio rerio 1.1e-50
ctsl1b
cathepsin L, 1 b
gene_product from Danio rerio 1.7e-50
CTSK
Cathepsin K
protein from Sus scrofa 2.2e-50
cprD
cysteine proteinase 4
gene from Dictyostelium discoideum 2.9e-50
ctssa
cathepsin S, a
gene_product from Danio rerio 3.6e-50
ctssb.1
cathepsin S, b.1
gene_product from Danio rerio 3.6e-50
CTSK
Cathepsin K
protein from Bos taurus 4.6e-50
CTSL2
Cathepsin L2
protein from Bos taurus 4.6e-50
CTSK
Cathepsin K
protein from Homo sapiens 4.6e-50
LOC420160
Uncharacterized protein
protein from Gallus gallus 5.8e-50
CTSL2
Cathepsin L2
protein from Homo sapiens 1.2e-49
Ctsk
cathepsin K
gene from Rattus norvegicus 1.5e-49
4930486L24Rik
RIKEN cDNA 4930486L24 gene
protein from Mus musculus 2.0e-49
cprC
cysteine proteinase 3
gene from Dictyostelium discoideum 2.5e-49
Ctsj
cathepsin J
protein from Mus musculus 2.5e-49
Ctsj
cathepsin J
gene from Rattus norvegicus 8.5e-49
ctsll
cathepsin L, like
gene_product from Danio rerio 1.1e-48
Ctsk
cathepsin K
protein from Mus musculus 2.3e-48
Ctss
cathepsin S
protein from Mus musculus 2.3e-48
cprF
cysteine proteinase 6
gene from Dictyostelium discoideum 2.3e-48
DDB_G0272298 gene from Dictyostelium discoideum 4.7e-48
ctssb.2
cathepsin S, b.2
gene_product from Danio rerio 6.0e-48
CTSK
Cathepsin K
protein from Gallus gallus 1.2e-47
Ctss
cathepsin S
gene from Rattus norvegicus 2.3e-46
ctsk
cathepsin K
gene_product from Danio rerio 3.0e-46
CTSH
Uncharacterized protein
protein from Callithrix jacchus 1.0e-45
AT3G45310 protein from Arabidopsis thaliana 1.3e-45
CTSH
Pro-cathepsin H
protein from Bos taurus 1.6e-45
Ctsh
cathepsin H
gene from Rattus norvegicus 2.7e-45
D3ZZR3
Uncharacterized protein
protein from Rattus norvegicus 2.7e-45
CTSL2
Uncharacterized protein
protein from Gallus gallus 3.4e-45
LOC100662496
Uncharacterized protein
protein from Loxodonta africana 4.4e-45
CTSH
Uncharacterized protein
protein from Callithrix jacchus 9.1e-45
CTSH
Uncharacterized protein
protein from Macaca mulatta 1.5e-44
CTSH
Pro-cathepsin H
protein from Homo sapiens 1.9e-44
CTSH
Uncharacterized protein
protein from Nomascus leucogenys 1.9e-44
CTSH
Pro-cathepsin H
protein from Sus scrofa 2.4e-44
CTSH
Uncharacterized protein
protein from Oryctolagus cuniculus 2.4e-44
CTSH
Uncharacterized protein
protein from Gorilla gorilla gorilla 3.1e-44
CG12163 protein from Drosophila melanogaster 3.9e-44
MGC114246
similar to cathepsin R
gene from Rattus norvegicus 3.9e-44
Ctsh
cathepsin H
protein from Mus musculus 5.0e-44
CTSL1
Cathepsin L1
protein from Gallus gallus 6.4e-44
CTSH
Uncharacterized protein
protein from Canis lupus familiaris 6.4e-44
P83443
Macrodontain-1
protein from Pseudananas sagenarius 8.1e-44
Cat-1
Cathepsin L-like proteinase
protein from Fasciola hepatica 8.1e-44
Ctsr
cathepsin R
protein from Mus musculus 1.0e-43
CTSH
Uncharacterized protein
protein from Equus caballus 1.7e-43
CTSH
Uncharacterized protein
protein from Ailuropoda melanoleuca 3.5e-43

The BLAST search returned 2 gene products which did not match your query constraints. Please see the full BLAST report below for the details.

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  043883
        (348 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2082881 - symbol:AT3G49340 species:3702 "Arabi...   789  1.8e-78   1
TAIR|locus:2055440 - symbol:AT2G34080 species:3702 "Arabi...   783  7.9e-78   1
TAIR|locus:2152445 - symbol:SAG12 "senescence-associated ...   783  7.9e-78   1
TAIR|locus:2038588 - symbol:AT2G27420 species:3702 "Arabi...   774  7.1e-77   1
TAIR|locus:2090614 - symbol:AT3G19390 species:3702 "Arabi...   772  1.1e-76   1
TAIR|locus:2029924 - symbol:AT1G29090 species:3702 "Arabi...   759  2.7e-75   1
TAIR|locus:2157712 - symbol:CEP1 "cysteine endopeptidase ...   747  5.1e-74   1
TAIR|locus:2029934 - symbol:AT1G29080 species:3702 "Arabi...   721  2.9e-71   1
TAIR|locus:505006391 - symbol:CEP3 "cysteine endopeptidas...   721  2.9e-71   1
TAIR|locus:2122113 - symbol:XCP1 "xylem cysteine peptidas...   704  1.8e-69   1
TAIR|locus:2030427 - symbol:XCP2 "xylem cysteine peptidas...   696  1.3e-68   1
TAIR|locus:2090629 - symbol:AT3G19400 species:3702 "Arabi...   693  2.7e-68   1
TAIR|locus:2097104 - symbol:AT3G43960 species:3702 "Arabi...   686  1.5e-67   1
TAIR|locus:2030027 - symbol:AT1G29110 species:3702 "Arabi...   681  5.1e-67   1
TAIR|locus:2825832 - symbol:RD21A "responsive to dehydrat...   677  1.3e-66   1
TAIR|locus:2167821 - symbol:RD21B "esponsive to dehydrati...   676  1.7e-66   1
TAIR|locus:2024362 - symbol:XBCP3 "xylem bark cysteine pe...   664  3.2e-65   1
TAIR|locus:2117979 - symbol:AT4G23520 species:3702 "Arabi...   648  1.6e-63   1
TAIR|locus:2038515 - symbol:AT1G06260 species:3702 "Arabi...   645  3.3e-63   1
TAIR|locus:2128243 - symbol:AT4G11310 species:3702 "Arabi...   630  1.3e-61   1
TAIR|locus:2128253 - symbol:AT4G11320 species:3702 "Arabi...   618  2.4e-60   1
UNIPROTKB|F1NYJ1 - symbol:CTSL2 "Uncharacterized protein"...   579  3.3e-56   1
UNIPROTKB|F1S4J6 - symbol:Ssc.54235 "Cathepsin L1" specie...   567  6.1e-55   1
UNIPROTKB|Q86GF7 - symbol:Cys "Crustapain" species:6703 "...   567  6.1e-55   1
RGD|1560071 - symbol:Ctsll3 "cathepsin L-like 3" species:...   567  6.1e-55   1
RGD|1308751 - symbol:RGD1308751 "similar to Cathepsin L p...   563  1.6e-54   1
RGD|2448 - symbol:Ctsl1 "cathepsin L1" species:10116 "Rat...   560  3.4e-54   1
MGI|MGI:88564 - symbol:Ctsl "cathepsin L" species:10090 "...   558  5.5e-54   1
UNIPROTKB|Q9GL24 - symbol:CTSL1 "Cathepsin L1" species:96...   556  8.9e-54   1
FB|FBgn0013770 - symbol:Cp1 "Cysteine proteinase-1" speci...   554  1.5e-53   1
UNIPROTKB|F1SS93 - symbol:CTSS "Uncharacterized protein" ...   552  2.4e-53   1
UNIPROTKB|A4IFS7 - symbol:CTSL1 "CTSL1 protein" species:9...   548  6.3e-53   1
DICTYBASE|DDB_G0279799 - symbol:cprB "cysteine proteinase...   452  8.7e-53   2
UNIPROTKB|P25774 - symbol:CTSS "Cathepsin S" species:9606...   545  1.3e-52   1
ZFIN|ZDB-GENE-030131-572 - symbol:wu:fb37b09 "wu:fb37b09"...   544  1.7e-52   1
UNIPROTKB|P25326 - symbol:CTSS "Cathepsin S" species:9913...   540  4.4e-52   1
ZFIN|ZDB-GENE-071004-74 - symbol:zgc:174855 "zgc:174855" ...   540  4.4e-52   1
UNIPROTKB|Q28944 - symbol:CTSL1 "Cathepsin L1" species:98...   538  7.2e-52   1
ZFIN|ZDB-GENE-030131-106 - symbol:ctsl1a "cathepsin L, 1 ...   538  7.2e-52   1
UNIPROTKB|Q8HY81 - symbol:CTSS "Cathepsin S" species:9615...   536  1.2e-51   1
UNIPROTKB|P07711 - symbol:CTSL1 "Cathepsin L1" species:96...   534  1.9e-51   1
UNIPROTKB|F1PAK0 - symbol:CTSS "Cathepsin S" species:9615...   532  3.1e-51   1
ZFIN|ZDB-GENE-080215-7 - symbol:zgc:174153 "zgc:174153" s...   532  3.1e-51   1
UNIPROTKB|F1PMM9 - symbol:CTSL1 "Cathepsin L1" species:96...   531  4.0e-51   1
WB|WBGene00000776 - symbol:cpl-1 species:6239 "Caenorhabd...   531  4.0e-51   1
UNIPROTKB|P25975 - symbol:CTSL1 "Cathepsin L1" species:99...   530  5.1e-51   1
UNIPROTKB|G1K2A7 - symbol:CTSK "Cathepsin K" species:9615...   530  5.1e-51   1
UNIPROTKB|Q3ZKN1 - symbol:CTSK "Cathepsin K" species:9615...   530  5.1e-51   1
RGD|708447 - symbol:Testin "testin gene" species:10116 "R...   530  5.1e-51   1
ZFIN|ZDB-GENE-040718-61 - symbol:ctsl.1 "cathepsin L.1" s...   527  1.1e-50   1
ZFIN|ZDB-GENE-980526-285 - symbol:ctsl1b "cathepsin L, 1 ...   525  1.7e-50   1
UNIPROTKB|Q9GLE3 - symbol:CTSK "Cathepsin K" species:9823...   524  2.2e-50   1
DICTYBASE|DDB_G0278721 - symbol:cprD "cysteine proteinase...   414  2.9e-50   2
ZFIN|ZDB-GENE-040426-1583 - symbol:ctssa "cathepsin S, a"...   522  3.6e-50   1
ZFIN|ZDB-GENE-050522-559 - symbol:ctssb.1 "cathepsin S, b...   522  3.6e-50   1
UNIPROTKB|Q5E968 - symbol:CTSK "Cathepsin K" species:9913...   521  4.6e-50   1
UNIPROTKB|Q5E998 - symbol:CTSL2 "Cathepsin L2" species:99...   521  4.6e-50   1
UNIPROTKB|P43235 - symbol:CTSK "Cathepsin K" species:9606...   521  4.6e-50   1
UNIPROTKB|F1NZ37 - symbol:LOC420160 "Uncharacterized prot...   520  5.8e-50   1
UNIPROTKB|O60911 - symbol:CTSL2 "Cathepsin L2" species:96...   517  1.2e-49   1
RGD|61810 - symbol:Ctsk "cathepsin K" species:10116 "Ratt...   516  1.5e-49   1
MGI|MGI:1922258 - symbol:4930486L24Rik "RIKEN cDNA 493048...   515  2.0e-49   1
DICTYBASE|DDB_G0283867 - symbol:cprC "cysteine proteinase...   514  2.5e-49   1
MGI|MGI:1349426 - symbol:Ctsj "cathepsin J" species:10090...   514  2.5e-49   1
RGD|69241 - symbol:Ctsj "cathepsin J" species:10116 "Ratt...   509  8.5e-49   1
ZFIN|ZDB-GENE-041010-76 - symbol:ctsll "cathepsin L, like...   508  1.1e-48   1
MGI|MGI:107823 - symbol:Ctsk "cathepsin K" species:10090 ...   505  2.3e-48   1
MGI|MGI:107341 - symbol:Ctss "cathepsin S" species:10090 ...   505  2.3e-48   1
DICTYBASE|DDB_G0279185 - symbol:cprF "cysteine proteinase...   395  2.3e-48   2
UNIPROTKB|Q4QRC2 - symbol:Ctsql2 "Protein Ctsql2" species...   503  3.7e-48   1
DICTYBASE|DDB_G0272298 - symbol:DDB_G0272298 species:4468...   502  4.7e-48   1
ZFIN|ZDB-GENE-050626-55 - symbol:ctssb.2 "cathepsin S, b....   501  6.0e-48   1
UNIPROTKB|Q90686 - symbol:CTSK "Cathepsin K" species:9031...   498  1.2e-47   1
UNIPROTKB|E9PSK9 - symbol:Ctsql2 "Protein Ctsql2" species...   489  1.1e-46   1
RGD|621513 - symbol:Ctss "cathepsin S" species:10116 "Rat...   486  2.3e-46   1
ZFIN|ZDB-GENE-001205-4 - symbol:ctsk "cathepsin K" specie...   485  3.0e-46   1
UNIPROTKB|F7B939 - symbol:CTSH "Uncharacterized protein" ...   480  1.0e-45   1
TAIR|locus:2078312 - symbol:AT3G45310 species:3702 "Arabi...   479  1.3e-45   1
UNIPROTKB|Q3T0I2 - symbol:CTSH "Pro-cathepsin H" species:...   478  1.6e-45   1
RGD|2447 - symbol:Ctsh "cathepsin H" species:10116 "Rattu...   476  2.7e-45   1
UNIPROTKB|D3ZZR3 - symbol:D3ZZR3 "Uncharacterized protein...   476  2.7e-45   1
UNIPROTKB|F1NEC8 - symbol:CTSL2 "Uncharacterized protein"...   475  3.4e-45   1
UNIPROTKB|G3SSC1 - symbol:CTSH "Uncharacterized protein" ...   474  4.4e-45   1
UNIPROTKB|F7BRD4 - symbol:CTSH "Uncharacterized protein" ...   471  9.1e-45   1
UNIPROTKB|F6R7P5 - symbol:CTSH "Uncharacterized protein" ...   469  1.5e-44   1
UNIPROTKB|P09668 - symbol:CTSH "Pro-cathepsin H" species:...   468  1.9e-44   1
UNIPROTKB|G1RBY1 - symbol:CTSH "Uncharacterized protein" ...   468  1.9e-44   1
UNIPROTKB|O46427 - symbol:CTSH "Pro-cathepsin H" species:...   467  2.4e-44   1
UNIPROTKB|G1SQF0 - symbol:CTSH "Uncharacterized protein" ...   467  2.4e-44   1
UNIPROTKB|G3R9A7 - symbol:CTSH "Uncharacterized protein" ...   466  3.1e-44   1
FB|FBgn0260462 - symbol:CG12163 species:7227 "Drosophila ...   465  3.9e-44   1
RGD|1562210 - symbol:MGC114246 "similar to cathepsin R" s...   465  3.9e-44   1
MGI|MGI:107285 - symbol:Ctsh "cathepsin H" species:10090 ...   464  5.0e-44   1
UNIPROTKB|P09648 - symbol:CTSL1 "Cathepsin L1" species:90...   463  6.4e-44   1
UNIPROTKB|F6X9C1 - symbol:CTSH "Uncharacterized protein" ...   463  6.4e-44   1
UNIPROTKB|P83443 - symbol:P83443 "Macrodontain-1" species...   462  8.1e-44   1
UNIPROTKB|Q24940 - symbol:Cat-1 "Cathepsin L-like protein...   462  8.1e-44   1
MGI|MGI:1861723 - symbol:Ctsr "cathepsin R" species:10090...   461  1.0e-43   1
UNIPROTKB|F7BJD8 - symbol:CTSH "Uncharacterized protein" ...   459  1.7e-43   1
UNIPROTKB|G1M0X4 - symbol:CTSH "Uncharacterized protein" ...   456  3.5e-43   1

WARNING:  Descriptions of 182 database sequences were not reported due to the
          limiting value of parameter V = 100.


>TAIR|locus:2082881 [details] [associations]
            symbol:AT3G49340 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002686
            GenomeReviews:BA000014_GR EMBL:AC012329 EMBL:AL132956
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 MEROPS:I29.003 HOGENOM:HOG000230773 HSSP:P07711
            KO:K01376 IPI:IPI00520642 PIR:T45839 RefSeq:NP_566920.1
            UniGene:At.53854 ProteinModelPortal:Q9SG15 SMR:Q9SG15
            EnsemblPlants:AT3G49340.1 GeneID:824096 KEGG:ath:AT3G49340
            TAIR:At3g49340 InParanoid:Q9SG15 OMA:PQNDEEA PhylomeDB:Q9SG15
            ProtClustDB:CLSN2688476 Genevestigator:Q9SG15 Uniprot:Q9SG15
        Length = 341

 Score = 789 (282.8 bits), Expect = 1.8e-78, P = 1.8e-78
 Identities = 167/344 (48%), Positives = 221/344 (64%)

Query:     4 YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
             +FL+ +L+ S +    +    F E S  EK EQW +++ R Y + +E + RFEIF +NL 
Sbjct:     6 FFLLAILLSSRTSGVTSRGGLF-EASAVEKHEQWMSRFNRVYSDDSEKTSRFEIFTNNLK 64

Query:    64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGT----PFLYK 119
              VE  N     N++YTL +N+F+DLT +EF A  TG  + +  + +    +     F Y+
Sbjct:    65 FVESINMNT--NKTYTLDVNEFSDLTDEEFKARYTGLVVPEGMTRISTTDSHETVSFRYE 122

Query:   120 S-SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
             +  +   S++WI++GAVT VK+Q QC       AVAAVEG+  I    LVSLSEQQL+DC
Sbjct:   123 NVGETGESMDWIQEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIANGELVSLSEQQLLDC 182

Query:   172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
             +T   NNGC GG M  AF YI +N+GIT +  Y Y+G     C+S      AA I+ YE 
Sbjct:   183 STE--NNGCGGGIMWKAFDYIKENQGITTEDNYPYQGAQQ-TCESNHLA--AATISGYET 237

Query:   232 VPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
             VP NDEE+LLKAV+ QPVSVAI+ S  +F  YSGG+FNG C T L H VT VGYG SEEG
Sbjct:   238 VPQNDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTQLTHAVTIVGYGVSEEG 297

Query:   290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
             IKYWL+KNSWG+ WGE+GY R+ RD+D PQG CG+A  A +PV+
Sbjct:   298 IKYWLLKNSWGESWGENGYMRIMRDVDSPQGMCGLASLAYYPVA 341


>TAIR|locus:2055440 [details] [associations]
            symbol:AT2G34080 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002685 GenomeReviews:CT485783_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 MEROPS:I29.003 EMBL:AC002341
            HOGENOM:HOG000230773 HSSP:P53634 IPI:IPI00530325 PIR:B84752
            RefSeq:NP_565780.1 UniGene:At.28613 UniGene:At.37859
            ProteinModelPortal:O22961 SMR:O22961 EnsemblPlants:AT2G34080.1
            GeneID:817969 KEGG:ath:AT2G34080 TAIR:At2g34080 InParanoid:O22961
            OMA:SENDYSY PhylomeDB:O22961 ProtClustDB:CLSN2688064
            ArrayExpress:O22961 Genevestigator:O22961 Uniprot:O22961
        Length = 345

 Score = 783 (280.7 bits), Expect = 7.9e-78, P = 7.9e-78
 Identities = 171/345 (49%), Positives = 227/345 (65%)

Query:     6 LIVVLII--SGSCASQATYRT--FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
             L+ VLII  +G   SQAT RT  F E S+ +K EQW A++ R Y++  E + R ++FK N
Sbjct:     7 LVTVLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDVFKKN 66

Query:    62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFK-MSDHSSS---LKANGTPFL 117
             L  +E FN    GN+SY L +N+FAD T +EF+A  TG K +++ S S    K   +   
Sbjct:    67 LKFIENFNKK--GNKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVSPSKVVAKTISSQTW 124

Query:   118 YKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVD 170
               S  V  S +W  +GAVTPVKYQGQC       AVAAVEG+  I    LVSLSEQQL+D
Sbjct:   125 NVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLD 184

Query:   171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
             C   + + GC GG M DAF Y++QN+GI ++  YSY+G S G C S  A   AA+I+ ++
Sbjct:   185 C-DREYDRGCDGGIMSDAFNYVVQNRGIASENDYSYQG-SDGGCRS-NARP-AARISGFQ 240

Query:   231 DVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEE 288
              VP N+E +LL+AV+ QPVSV++DA+   F  YSGGV++G C T  NH VT VGYGTS++
Sbjct:   241 TVPSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQD 300

Query:   289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
             G KYWL KNSWG+ WGE GY R++RD+  PQG CG+A +A +PV+
Sbjct:   301 GTKYWLAKNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345


>TAIR|locus:2152445 [details] [associations]
            symbol:SAG12 "senescence-associated gene 12" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
            [GO:0007568 "aging" evidence=IEP;TAS] [GO:0010150 "leaf senescence"
            evidence=IEP;TAS] [GO:0010282 "senescence-associated vacuole"
            evidence=IDA] [GO:0009817 "defense response to fungus, incompatible
            interaction" evidence=IEP] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:CP002688 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0010150 GO:GO:0009817 EMBL:AB016870
            HSSP:O65039 OMA:NDEQALM EMBL:AF370131 EMBL:AY040073 IPI:IPI00544181
            RefSeq:NP_568651.1 UniGene:At.75256 UniGene:At.7710
            ProteinModelPortal:Q9FJ47 SMR:Q9FJ47 IntAct:Q9FJ47 STRING:Q9FJ47
            MEROPS:C01.117 PRIDE:Q9FJ47 ProMEX:Q9FJ47 EnsemblPlants:AT5G45890.1
            GeneID:834629 KEGG:ath:AT5G45890 TAIR:At5g45890 InParanoid:Q9FJ47
            PhylomeDB:Q9FJ47 ProtClustDB:CLSN2917735 ArrayExpress:Q9FJ47
            Genevestigator:Q9FJ47 GO:GO:0010282 Uniprot:Q9FJ47
        Length = 346

 Score = 783 (280.7 bits), Expect = 7.9e-78, P = 7.9e-78
 Identities = 165/341 (48%), Positives = 214/341 (62%)

Query:     5 FLIVVLIISGSCASQATYRTFDEGSIAEKFE-QWKAQYGRTYKESAENSKRFEIFKDNLV 63
             FL V  I S  C S    R  D   I +K   +W  ++GR Y +  E + R+ +FK+N+ 
Sbjct:     9 FLFVA-IFSSFCFSITLSRPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVE 67

Query:    64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFK-MSDHSSSLKANGTPFLYK--- 119
              +E  N+   G R++ L +N+FADLT  EF +  TGFK +S  SS  +   +PF Y+   
Sbjct:    68 RIEHLNSIPAG-RTFKLAVNQFADLTNDEFRSMYTGFKGVSALSSQSQTKMSPFRYQNVS 126

Query:   120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
             S  +P SV+W +KGAVTP+K QG C       AVAA+EG   IK  +L+SLSEQQLVDC 
Sbjct:   127 SGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCD 186

Query:   173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
             TND   GC GG MD AF++I    G+T ++ Y Y+G     C+S K    A  IT YEDV
Sbjct:   187 TNDF--GCEGGLMDTAFEHIKATGGLTTESNYPYKG-EDATCNSKKTNPKATSITGYEDV 243

Query:   233 PPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
             P NDE++L+KAVA+QPVSV I+      QFYS GVF G C T+L+H VTA+GYG S  G 
Sbjct:   244 PVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGS 303

Query:   291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
             KYW+IKNSWG  WGE GY R+Q+D+   QG CG+AM AS+P
Sbjct:   304 KYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYP 344


>TAIR|locus:2038588 [details] [associations]
            symbol:AT2G27420 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002685
            GenomeReviews:CT485783_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC006232
            MEROPS:I29.003 OMA:EEFRATH HOGENOM:HOG000230773 HSSP:P53634
            ProtClustDB:CLSN2688476 EMBL:AY064033 EMBL:AY096388 IPI:IPI00539752
            PIR:F84672 RefSeq:NP_565649.1 UniGene:At.27094
            ProteinModelPortal:Q9ZQH7 SMR:Q9ZQH7 PRIDE:Q9ZQH7
            EnsemblPlants:AT2G27420.1 GeneID:817287 KEGG:ath:AT2G27420
            TAIR:At2g27420 InParanoid:Q9ZQH7 PhylomeDB:Q9ZQH7
            ArrayExpress:Q9ZQH7 Genevestigator:Q9ZQH7 Uniprot:Q9ZQH7
        Length = 348

 Score = 774 (277.5 bits), Expect = 7.1e-77, P = 7.1e-77
 Identities = 170/353 (48%), Positives = 223/353 (63%)

Query:     1 MAKYFLIVVLIISGSCASQATYR-TFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
             MA   + ++ I      S AT R +  E S  EK EQW A++ R Y +  E   RF IFK
Sbjct:     1 MASTIIFILTIFLSYRTSLATSRGSLFEASAIEKHEQWMARFNRVYSDETEKRNRFNIFK 60

Query:    60 DNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD---HSSSLKA--NG 113
              NL  V+ FN N  I   +Y + +N+F+DLT +EF A+ TG  + +     S+L +  N 
Sbjct:    61 KNLEFVQNFNMNNKI---TYKVDINEFSDLTDEEFRATHTGLVVPEAITRISTLSSGKNT 117

Query:   114 TPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSE 165
              PF Y + S    S++W ++GAVTPVKYQG+C       AVAAVEGI  I    LVSLSE
Sbjct:   118 VPFRYGNVSDNGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSE 177

Query:   166 QQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDS---IKAEDH 222
             QQL+DC   D N GC GG M  AF+YII+N+GIT +  Y Y+  S   C S   + +   
Sbjct:   178 QQLLDC-DRDYNQGCRGGIMSKAFEYIIKNQGITTEDNYPYQE-SQQTCSSSTTLSSSFR 235

Query:   223 AAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTA 280
             AA I+ YE VP N+EE+LL+AV+ QPVSV I+ +   F  YSGGVFNG C T L+H VT 
Sbjct:   236 AATISGYETVPMNNEEALLQAVSQQPVSVGIEGTGAAFRHYSGGVFNGECGTDLHHAVTI 295

Query:   281 VGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
             VGYG SEEG KYW++KNSWG+ WGE+GY R++RD+D PQG CG+A+ A +P++
Sbjct:   296 VGYGMSEEGTKYWVVKNSWGETWGENGYMRIKRDVDAPQGMCGLAILAFYPLA 348


>TAIR|locus:2090614 [details] [associations]
            symbol:AT3G19390 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA;ISS] [GO:0000041 "transition metal ion
            transport" evidence=RCA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:CP002686 GenomeReviews:BA000014_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AB025624
            MEROPS:I29.003 OMA:KAMDQKC HSSP:O65039 HOGENOM:HOG000230773
            InterPro:IPR000118 Pfam:PF00396 SMART:SM00277 EMBL:AY062725
            EMBL:AY093350 IPI:IPI00520189 RefSeq:NP_566633.1 UniGene:At.27473
            ProteinModelPortal:Q9LT78 SMR:Q9LT78 IntAct:Q9LT78 STRING:Q9LT78
            PaxDb:Q9LT78 PRIDE:Q9LT78 EnsemblPlants:AT3G19390.1 GeneID:821473
            KEGG:ath:AT3G19390 TAIR:At3g19390 InParanoid:Q9LT78
            PhylomeDB:Q9LT78 ProtClustDB:CLSN2917188 Genevestigator:Q9LT78
            Uniprot:Q9LT78
        Length = 452

 Score = 772 (276.8 bits), Expect = 1.1e-76, P = 1.1e-76
 Identities = 167/359 (46%), Positives = 216/359 (60%)

Query:     9 VLIISGSCAS-QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVER 67
             VL+IS S  S  AT  T +E      +E+W  +  + Y    E  +RFEIFKDNL  VE 
Sbjct:    17 VLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEKERRFEIFKDNLKFVEE 76

Query:    68 FNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPS 126
               +++I NR+Y + L +FADLT  EF A     KM    + +   G  +LYK    +P +
Sbjct:    77 --HSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKME--RTRVPVKGEKYLYKVGDSLPDA 132

Query:   127 VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNG 179
             ++W  KGAV PVK QG C       A+ AVEGIN IK   L+SLSEQ+LVDC T+  N+G
Sbjct:   133 IDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCDTS-YNDG 191

Query:   180 CYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEES 239
             C GG MD AFK+II+N GI  +  Y Y      +C+S K       I  YEDVP NDE+S
Sbjct:   192 CGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNSDKKNTRVVTIDGYEDVPQNDEKS 251

Query:   240 LLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKN 297
             L KA+ANQP+SVAI+A   A Q Y+ GVF G C T L+HGV AVGYG SE G  YW+++N
Sbjct:   252 LKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSLDHGVVAVGYG-SEGGQDYWIVRN 310

Query:   298 SWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSS--------ADKSSAC 348
             SWG +WGE GYF+L+R+I +  G+CG+AM AS+P     + P           DKS+ C
Sbjct:   311 SWGSNWGESGYFKLERNIKESSGKCGVAMMASYPTKSSGSNPPKPPAPSPVVCDKSNTC 369


>TAIR|locus:2029924 [details] [associations]
            symbol:AT1G29090 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002684 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            HOGENOM:HOG000230773 HSSP:P53634 ProtClustDB:CLSN2688064
            EMBL:BT004146 IPI:IPI00545702 RefSeq:NP_564321.2 UniGene:At.40814
            ProteinModelPortal:Q84W75 SMR:Q84W75 MEROPS:C01.A15
            EnsemblPlants:AT1G29090.1 GeneID:839784 KEGG:ath:AT1G29090
            TAIR:At1g29090 InParanoid:Q84W75 OMA:SIRGHED PhylomeDB:Q84W75
            ArrayExpress:Q84W75 Genevestigator:Q84W75 Uniprot:Q84W75
        Length = 355

 Score = 759 (272.2 bits), Expect = 2.7e-75, P = 2.7e-75
 Identities = 166/347 (47%), Positives = 224/347 (64%)

Query:     4 YFLIVVLIISGSC-ASQATYR-TFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
             + L+ + I+S +   SQAT R TF E  +AE  +QW  ++ R Y +  E   RF++FK N
Sbjct:    15 FMLVSLTILSMNLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKN 74

Query:    62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--SSSLKANGTP-FLY 118
             L  +E+FN    G+R+Y L +N+FAD T +EFIA+ TG K  +   SS       P + +
Sbjct:    75 LKFIEKFNKK--GDRTYKLGVNEFADWTREEFIATHTGLKGVNGIPSSEFVDEMIPSWNW 132

Query:   119 KSSQVP--PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
               S V    + +W  +GAVTPVKYQGQC       +VAAVEG+  I  N LVSLSEQQL+
Sbjct:   133 NVSDVAGRETKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLL 192

Query:   170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
             DC   + +NGC GG M DAF YII+N+GI ++A Y Y+  + G C     +  +A I  +
Sbjct:   193 DC-DRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQA-AEGTC-RYNGKP-SAWIRGF 248

Query:   230 EDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNG-YCETFLNHGVTAVGYGTS 286
             + VP N+E +LL+AV+ QPVSV+IDA    F  YSGGV++  YC T +NH VT VGYGTS
Sbjct:   249 QTVPSNNERALLEAVSKQPVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTS 308

Query:   287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
              EGIKYWL KNSWG+ WGE+GY R++RD+  PQG CG+A +A +PV+
Sbjct:   309 PEGIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 355


>TAIR|locus:2157712 [details] [associations]
            symbol:CEP1 "cysteine endopeptidase 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005783 EMBL:CP002688
            GenomeReviews:BA000015_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 PROSITE:PS00014
            EMBL:AB024031 MEROPS:I29.003 EMBL:HM367092 EMBL:AY091087
            IPI:IPI00516991 RefSeq:NP_568722.1 UniGene:At.7918 HSSP:O65039
            ProteinModelPortal:Q9FGR9 SMR:Q9FGR9 PaxDb:Q9FGR9 PRIDE:Q9FGR9
            EnsemblPlants:AT5G50260.1 GeneID:835091 KEGG:ath:AT5G50260
            TAIR:At5g50260 HOGENOM:HOG000230773 InParanoid:Q9FGR9 KO:K16292
            OMA:WHSKKYH PhylomeDB:Q9FGR9 ProtClustDB:CLSN2689970
            Genevestigator:Q9FGR9 Uniprot:Q9FGR9
        Length = 361

 Score = 747 (268.0 bits), Expect = 5.1e-74, P = 5.1e-74
 Identities = 156/327 (47%), Positives = 206/327 (62%)

Query:    27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
             E S+ E +E+W++ +    +   E +KRF +FK N+  +   N     ++SY L+LNKF 
Sbjct:    31 ENSLWELYERWRSHH-TVARSLEEKAKRFNVFKHNVKHIHETNKK---DKSYKLKLNKFG 86

Query:    87 DLTPQEFIASQTGFKMSDH---SSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQG 142
             D+T +EF  +  G  +  H       KA  + F+Y + + +P SV+W + GAVTPVK QG
Sbjct:    87 DMTSEEFRRTYAGSNIKHHRMFQGEKKATKS-FMYANVNTLPTSVDWRKNGAVTPVKNQG 145

Query:   143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
             QC        V AVEGIN I+  +L SLSEQ+LVDC TN N  GC GG MD AF++I + 
Sbjct:   146 QCGSCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCDTNQNQ-GCNGGLMDLAFEFIKEK 204

Query:   196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
              G+T++ VY Y+  S   CD+ K       I  +EDVP N E+ L+KAVANQPVSVAIDA
Sbjct:   205 GGLTSELVYPYKA-SDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDA 263

Query:   256 --SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
               S  QFYS GVF G C T LNHGV  VGYGT+ +G KYW++KNSWG++WGE GY R+QR
Sbjct:   264 GGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQR 323

Query:   314 DIDQPQGQCGIAMFASFPVSKESAQPS 340
              I   +G CGIAM AS+P+   +  PS
Sbjct:   324 GIRHKEGLCGIAMEASYPLKNSNTNPS 350


>TAIR|locus:2029934 [details] [associations]
            symbol:AT1G29080 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002684 GenomeReviews:CT485782_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AC021043 MEROPS:I29.003 HOGENOM:HOG000230773
            HSSP:P53634 ProtClustDB:CLSN2688064 EMBL:DQ056468 IPI:IPI00521747
            PIR:C86413 RefSeq:NP_564320.1 UniGene:At.51814
            ProteinModelPortal:Q9LP39 SMR:Q9LP39 EnsemblPlants:AT1G29080.1
            GeneID:839783 KEGG:ath:AT1G29080 TAIR:At1g29080 InParanoid:Q9LP39
            OMA:KTWGENG PhylomeDB:Q9LP39 Genevestigator:Q9LP39 Uniprot:Q9LP39
        Length = 346

 Score = 721 (258.9 bits), Expect = 2.9e-71, P = 2.9e-71
 Identities = 158/347 (45%), Positives = 220/347 (63%)

Query:     5 FLIVVLII--SGSCASQATYRT--FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
             F+ VVL I       S+AT R   +   SI +  +QW  Q+ R Y +  E   R ++  +
Sbjct:     6 FVCVVLTIFFMDLKISEATSRVALYKPSSIVDYHQQWMIQFSRVYDDEFEKQLRLQVLTE 65

Query:    61 NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKA-NGTPFLYK 119
             NL  +E FNN  +GN+SY L +N+F D T +EF+A+ TG +  + +S  +  N T   + 
Sbjct:    66 NLKFIESFNN--MGNQSYKLGVNEFTDWTKEEFLATYTGLRGVNVTSPFEVVNETKPAWN 123

Query:   120 ---SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
                S  +  + +W  +GAVTPVK QG+C       A+AAVEG+  I    L+SLSEQQL+
Sbjct:   124 WTVSDVLGTNKDWRNEGAVTPVKSQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLL 183

Query:   170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
             DC T + NNGC GG   +AF YII+++GI+++  Y Y+ +  G C S  A   A  I  +
Sbjct:   184 DC-TREQNNGCKGGTFVNAFNYIIKHRGISSENEYPYQ-VKEGPCRS-NARP-AILIRGF 239

Query:   230 EDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGY-CETFLNHGVTAVGYGTS 286
             E+VP N+E +LL+AV+ QPV+VAIDAS   F  YSGGV+N   C T +NH VT VGYGTS
Sbjct:   240 ENVPSNNERALLEAVSRQPVAVAIDASEAGFVHYSGGVYNARNCGTSVNHAVTLVGYGTS 299

Query:   287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
              EG+KYWL KNSWG+ WGE+GY R++RD++ PQG CG+A +AS+PV+
Sbjct:   300 PEGMKYWLAKNSWGKTWGENGYIRIRRDVEWPQGMCGVAQYASYPVA 346


>TAIR|locus:505006391 [details] [associations]
            symbol:CEP3 "cysteine endopeptidase 3" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005783 EMBL:CP002686
            GenomeReviews:BA000014_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 PROSITE:PS00014
            EMBL:AL049659 HSSP:O65039 HOGENOM:HOG000230773 KO:K16292
            EMBL:AK119026 IPI:IPI00525150 PIR:T06707 RefSeq:NP_566901.1
            UniGene:At.3162 ProteinModelPortal:Q9STL5 SMR:Q9STL5 MEROPS:C01.A02
            PRIDE:Q9STL5 EnsemblPlants:AT3G48350.1 GeneID:823993
            KEGG:ath:AT3G48350 TAIR:At3g48350 InParanoid:Q9STL5 OMA:DITHHEF
            PhylomeDB:Q9STL5 ProtClustDB:CLSN2917387 Genevestigator:Q9STL5
            Uniprot:Q9STL5
        Length = 364

 Score = 721 (258.9 bits), Expect = 2.9e-71, P = 2.9e-71
 Identities = 160/359 (44%), Positives = 219/359 (61%)

Query:     1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEK------FEQWKAQYGRTYKESAENSKR 54
             M  +F++++  +S   AS+     FDE  +  +      +E+W+  +  + + S E  KR
Sbjct:     1 MKLFFIVLISFLSLLQASKGF--DFDEKELETEENVWKLYERWRGHHSVS-RASHEAIKR 57

Query:    55 FEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSS--SLKAN 112
             F +F+ N++ V R N     N+ Y L++N+FAD+T  EF +S  G  +  H      K  
Sbjct:    58 FNVFRHNVLHVHRTNKK---NKPYKLKINRFADITHHEFRSSYAGSNVKHHRMLRGPKRG 114

Query:   113 GTPFLYKS-SQVPPSVNWIEKGAVTPVKYQ---GQC----AVAAVEGINAIKINRLVSLS 164
                F+Y++ ++VP SV+W EKGAVT VK Q   G C     VAAVEGIN I+ N+LVSLS
Sbjct:   115 SGGFMYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLS 174

Query:   165 EQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA 224
             EQ+LVDC T +N  GC GG M+ AF++I  N GI  +  Y Y+      C +        
Sbjct:   175 EQELVDCDTEENQ-GCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETV 233

Query:   225 QITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVG 282
              I  +E VP NDEE LLKAVA+QPVSVAIDA  S  Q YS GVF G C T LNHGV  VG
Sbjct:   234 TIDGHEHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVG 293

Query:   283 YGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSS 341
             YG ++ G KYW+++NSWG +WGE GY R++R I + +G+CGIAM AS+P +K S+ PS+
Sbjct:   294 YGETKNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYP-TKLSSTPST 351


>TAIR|locus:2122113 [details] [associations]
            symbol:XCP1 "xylem cysteine peptidase 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0000325 "plant-type vacuole" evidence=IDA] [GO:0005634
            "nucleus" evidence=IDA] [GO:0010623 "developmental programmed cell
            death" evidence=IMP] [GO:0010413 "glucuronoxylan metabolic process"
            evidence=RCA] [GO:0045492 "xylan biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005886
            GO:GO:0005634 EMBL:CP002687 GenomeReviews:CT486007_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0000325
            EMBL:AL022604 EMBL:AL161587 GO:GO:0010623 MEROPS:I29.003
            HOGENOM:HOG000230773 EMBL:AF191027 EMBL:AK117394 EMBL:BT005179
            IPI:IPI00532220 PIR:T06122 RefSeq:NP_567983.1 UniGene:At.2280
            UniGene:At.67622 ProteinModelPortal:O65493 SMR:O65493 STRING:O65493
            PaxDb:O65493 PRIDE:O65493 EnsemblPlants:AT4G35350.1 GeneID:829688
            KEGG:ath:AT4G35350 GeneFarm:5033 TAIR:At4g35350 InParanoid:O65493
            KO:K16290 OMA:FEVFREN PhylomeDB:O65493 ProtClustDB:CLSN2689772
            Genevestigator:O65493 Uniprot:O65493
        Length = 355

 Score = 704 (252.9 bits), Expect = 1.8e-69, P = 1.8e-69
 Identities = 150/312 (48%), Positives = 196/312 (62%)

Query:    30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
             + E FE W +++ + YK   E   RFE+F++NL+ +++ NN      SY L LN+FADLT
Sbjct:    47 LLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEI---NSYWLGLNEFADLT 103

Query:    90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA--- 145
              +EF     G      S   + +   F Y+  + +P SV+W +KGAV PVK QGQC    
Sbjct:   104 HEEFKGRYLGLAKPQFSRKRQPSAN-FRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCW 162

Query:   146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
                 VAAVEGIN I    L SLSEQ+L+DC T  N+ GC GG MD AF+YII   G+  +
Sbjct:   163 AFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNS-GCNGGLMDYAFQYIISTGGLHKE 221

Query:   202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
               Y Y  M  GIC   K +     I+ YEDVP ND+ESL+KA+A+QPVSVAI+AS    Q
Sbjct:   222 DDYPYL-MEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQ 280

Query:   260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
             FY GGVFNG C T L+HGV AVGYG+S+ G  Y ++KNSWG  WGE G+ R++R+  +P+
Sbjct:   281 FYKGGVFNGKCGTDLDHGVAAVGYGSSK-GSDYVIVKNSWGPRWGEKGFIRMKRNTGKPE 339

Query:   320 GQCGIAMFASFP 331
             G CGI   AS+P
Sbjct:   340 GLCGINKMASYP 351


>TAIR|locus:2030427 [details] [associations]
            symbol:XCP2 "xylem cysteine peptidase 2" species:3702
            "Arabidopsis thaliana" [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009507 "chloroplast" evidence=ISM] [GO:0008233 "peptidase
            activity" evidence=ISS] [GO:0005618 "cell wall" evidence=IDA]
            [GO:0010623 "developmental programmed cell death" evidence=IMP]
            [GO:0010075 "regulation of meristem growth" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0005886 GO:GO:0005618 GO:GO:0005773
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AC069251 EMBL:AC007369 GO:GO:0010623
            OMA:YKEIPEG HOGENOM:HOG000230773 KO:K16290 EMBL:AF191028
            EMBL:BT004822 IPI:IPI00526722 PIR:A86341 RefSeq:NP_564126.1
            UniGene:At.21316 ProteinModelPortal:Q9LM66 SMR:Q9LM66 IntAct:Q9LM66
            STRING:Q9LM66 MEROPS:C01.120 PaxDb:Q9LM66 PRIDE:Q9LM66
            ProMEX:Q9LM66 EnsemblPlants:AT1G20850.1 GeneID:838677
            KEGG:ath:AT1G20850 GeneFarm:5034 TAIR:At1g20850 InParanoid:Q9LM66
            PhylomeDB:Q9LM66 ProtClustDB:CLSN2917031 Genevestigator:Q9LM66
            GermOnline:AT1G20850 Uniprot:Q9LM66
        Length = 356

 Score = 696 (250.1 bits), Expect = 1.3e-68, P = 1.3e-68
 Identities = 149/312 (47%), Positives = 192/312 (61%)

Query:    30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
             + E FE W + + + Y+   E   RFE+FKDNL  ++  N      +SY L LN+FADL+
Sbjct:    47 LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG---KSYWLGLNEFADLS 103

Query:    90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA--- 145
              +EF     G K        + +   F Y+  + VP SV+W +KGAV  VK QG C    
Sbjct:   104 HEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCW 163

Query:   146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
                 VAAVEGIN I    L +LSEQ+L+DC T   NNGC GG MD AF+YI++N G+  +
Sbjct:   164 AFSTVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGGLRKE 222

Query:   202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
               Y Y  M  G C+  K E     I  ++DVP NDE+SLLKA+A+QP+SVAIDAS    Q
Sbjct:   223 EDYPYS-MEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQ 281

Query:   260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
             FYSGGVF+G C   L+HGV AVGYG+S+ G  Y ++KNSWG  WGE GY RL+R+  +P+
Sbjct:   282 FYSGGVFDGRCGVDLDHGVAAVGYGSSK-GSDYIIVKNSWGPKWGEKGYIRLKRNTGKPE 340

Query:   320 GQCGIAMFASFP 331
             G CGI   ASFP
Sbjct:   341 GLCGINKMASFP 352


>TAIR|locus:2090629 [details] [associations]
            symbol:AT3G19400 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0019344 "cysteine biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            EMBL:CP002686 GenomeReviews:BA000014_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AB025624
            MEROPS:I29.003 HOGENOM:HOG000230773 EMBL:AK118509 IPI:IPI00543468
            RefSeq:NP_566634.2 UniGene:At.38409 ProteinModelPortal:Q9LT77
            SMR:Q9LT77 PaxDb:Q9LT77 PRIDE:Q9LT77 EnsemblPlants:AT3G19400.1
            GeneID:821474 KEGG:ath:AT3G19400 TAIR:At3g19400 InParanoid:Q9LT77
            OMA:IGEHERR ProtClustDB:CLSN2679975 Genevestigator:Q9LT77
            Uniprot:Q9LT77
        Length = 362

 Score = 693 (249.0 bits), Expect = 2.7e-68, P = 2.7e-68
 Identities = 157/350 (44%), Positives = 207/350 (59%)

Query:     6 LIVVLIISGSCA-SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
             ++ VL++S S   +  T    +E  +   +EQW  +  + Y    E  +RF+IFKDNL  
Sbjct:    15 ILSVLLLSSSLGVATETEIERNETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKF 74

Query:    65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV- 123
             V+  N+  + +R++ + L +FADLT +EF A     KM     S+K     +LYK   V 
Sbjct:    75 VDEHNS--VPDRTFEVGLTRFADLTNEEFRAIYLRKKMERTKDSVKTER--YLYKEGDVL 130

Query:   124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
             P  V+W   GAV  VK QG C       AV AVEGIN I    L+SLSEQ+LVDC     
Sbjct:   131 PDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFV 190

Query:   177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED-HAAQITNYEDVPPN 235
             N GC GG M+ AF++I++N GI  D  Y Y     G+C++ K  +     I  YEDVP +
Sbjct:   191 NAGCDGGIMNYAFEFIMKNGGIETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRD 250

Query:   236 DEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
             DE+SL KAVA+QPVSVAI+AS  A Q Y  GV  G C   L+HGV  VGYG S  G  YW
Sbjct:   251 DEKSLKKAVAHQPVSVAIEASSQAFQLYKSGVMTGTCGISLDHGVVVVGYG-STSGEDYW 309

Query:   294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSAD 343
             +I+NSWG +WG+ GY +LQR+ID P G+CGIAM  S+P   +S+ PSS D
Sbjct:   310 IIRNSWGLNWGDSGYVKLQRNIDDPFGKCGIAMMPSYPT--KSSFPSSFD 357


>TAIR|locus:2097104 [details] [associations]
            symbol:AT3G43960 species:3702 "Arabidopsis thaliana"
            [GO:0005886 "plasma membrane" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0031225 "anchored to
            membrane" evidence=TAS] [GO:0048767 "root hair elongation"
            evidence=IMP] [GO:0016132 "brassinosteroid biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:CP002686
            GenomeReviews:BA000014_GR GO:GO:0031225 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0048767 MEROPS:I29.003 HOGENOM:HOG000230773
            EMBL:AL163975 EMBL:AK118634 IPI:IPI00526842 PIR:T48950
            RefSeq:NP_566867.1 UniGene:At.43352 ProteinModelPortal:Q9LXW3
            SMR:Q9LXW3 STRING:Q9LXW3 PaxDb:Q9LXW3 PRIDE:Q9LXW3
            EnsemblPlants:AT3G43960.1 GeneID:823513 KEGG:ath:AT3G43960
            TAIR:At3g43960 eggNOG:NOG286334 InParanoid:Q9LXW3 KO:K01376
            OMA:MAISFRT PhylomeDB:Q9LXW3 ProtClustDB:CLSN2917367
            Genevestigator:Q9LXW3 GermOnline:AT3G43960 Uniprot:Q9LXW3
        Length = 376

 Score = 686 (246.5 bits), Expect = 1.5e-67, P = 1.5e-67
 Identities = 154/341 (45%), Positives = 197/341 (57%)

Query:     9 VLIISGSCAS-QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVER 67
             VL+IS S     AT    +EG +   +EQW  + G+ Y    E  +RF+IFKDNL  +E 
Sbjct:    15 VLLISISLGVVTATESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEE 74

Query:    68 FNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV-PPS 126
              N+    NRSY   LNKF+DLT  EF AS  G KM   S S  A    + YK   V P  
Sbjct:    75 HNSDP--NRSYERGLNKFSDLTADEFQASYLGGKMEKKSLSDVAER--YQYKEGDVLPDE 130

Query:   127 VNWIEKGAVTP-VKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
             V+W E+GAV P VK QG+C       A  AVEGIN I    LVSLSEQ+L+DC   ++N 
Sbjct:   131 VDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNF 190

Query:   179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED-HAAQITNYEDVPPNDE 237
             GC GG    AF++I +N GI +D VY Y G  T  C +I+ +      I  +E VP NDE
Sbjct:   191 GCAGGGAVWAFEFIKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDE 250

Query:   238 ESLLKAVANQPVSVAIDASALQFYSGGVFNGYCETFL-NHGVTAVGYGTSEEGIKYWLIK 296
              SL KAVA QP+SV I A+ +  Y  GV+ G C     +H V  VGYGTS +   YWLI+
Sbjct:   251 MSLKKAVAYQPISVMISAANMSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIR 310

Query:   297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
             NSWG +WGE GY RLQR+  +P G+C +A+   +P+   S+
Sbjct:   311 NSWGPEWGEGGYLRLQRNFHEPTGKCAVAVAPVYPIKSNSS 351


>TAIR|locus:2030027 [details] [associations]
            symbol:AT1G29110 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            EMBL:CP002684 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            IPI:IPI00544534 RefSeq:NP_564322.1 UniGene:At.51816
            ProteinModelPortal:F4HZW2 SMR:F4HZW2 EnsemblPlants:AT1G29110.1
            GeneID:839786 KEGG:ath:AT1G29110 OMA:SCRANAR Uniprot:F4HZW2
        Length = 334

 Score = 681 (244.8 bits), Expect = 5.1e-67, P = 5.1e-67
 Identities = 145/337 (43%), Positives = 208/337 (61%)

Query:     5 FLIVVLIISGSCASQAT-YRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
             F+ + ++      SQA  + T +E SI +  +QW  Q+ R YK+ +E   R ++FK NL 
Sbjct:     8 FVALTILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKVFKKNLK 67

Query:    64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGT-PFL-YKSS 121
              +E FNN  +GN+SYTL +N+F D   +EF+A+ TG +++  S S   N T P   +  S
Sbjct:    68 FIENFNN--MGNQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRNWNMS 125

Query:   122 QVP---PSVNWIEKGAVTPVKYQGQCAVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
              +     S +W ++GAVTPVKYQG C +  + G N      L++LSEQQL+DC   + N 
Sbjct:   126 DIDMEDESKDWRDEGAVTPVKYQGACRLTKISGKN------LLTLSEQQLIDCDI-EKNG 178

Query:   179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
             GC GG  ++AFKYII+N G++ +  Y Y+        + +   H  QI  ++ VP ++E 
Sbjct:   179 GCNGGEFEEAFKYIIKNGGVSLETEYPYQVKKESCRANARRAPHT-QIRGFQMVPSHNER 237

Query:   239 SLLKAVANQPVSVAIDASALQF--YSGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWLI 295
             +LL+AV  QPVSV IDA A  F  Y GGV+ G  C T +NH VT VGYGT   G+ YW++
Sbjct:   238 ALLEAVRRQPVSVLIDARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGTMS-GLNYWVL 296

Query:   296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             KNSWG+ WGE+GY R++RD++ PQG CGIA  A++PV
Sbjct:   297 KNSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAYPV 333


>TAIR|locus:2825832 [details] [associations]
            symbol:RD21A "responsive to dehydration 21A" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;IMP]
            [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISS;IDA;IMP] [GO:0009507 "chloroplast" evidence=IDA]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0008233 "peptidase
            activity" evidence=IDA] [GO:0048046 "apoplast" evidence=IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005829 "cytosol"
            evidence=RCA] [GO:0009506 "plasmodesma" evidence=IDA] [GO:0050832
            "defense response to fungus" evidence=IMP] [GO:0006096 "glycolysis"
            evidence=RCA] [GO:0006833 "water transport" evidence=RCA]
            [GO:0006972 "hyperosmotic response" evidence=RCA] [GO:0007030
            "Golgi organization" evidence=RCA] [GO:0009266 "response to
            temperature stimulus" evidence=RCA] [GO:0009651 "response to salt
            stress" evidence=RCA] [GO:0015996 "chlorophyll catabolic process"
            evidence=RCA] [GO:0042744 "hydrogen peroxide catabolic process"
            evidence=RCA] [GO:0046686 "response to cadmium ion" evidence=RCA]
            [GO:0009414 "response to water deprivation" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0009506 GO:GO:0009507 GO:GO:0005773
            GO:GO:0050832 GO:GO:0048046 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC083835
            HOGENOM:HOG000230773 KO:K01376 InterPro:IPR000118 Pfam:PF00396
            SMART:SM00277 UniGene:At.43549 EMBL:D13043 EMBL:AY072130
            EMBL:AY133781 IPI:IPI00530094 PIR:JN0719 RefSeq:NP_564497.1
            UniGene:At.47599 UniGene:At.71705 ProteinModelPortal:P43297
            SMR:P43297 IntAct:P43297 STRING:P43297 MEROPS:C01.064 PaxDb:P43297
            PRIDE:P43297 ProMEX:P43297 EnsemblPlants:AT1G47128.1 GeneID:841122
            KEGG:ath:AT1G47128 TAIR:At1g47128 InParanoid:P43297 OMA:EAWLVKH
            PhylomeDB:P43297 ProtClustDB:CLSN2688498 Genevestigator:P43297
            GermOnline:AT1G47128 Uniprot:P43297
        Length = 462

 Score = 677 (243.4 bits), Expect = 1.3e-66, P = 1.3e-66
 Identities = 145/328 (44%), Positives = 195/328 (59%)

Query:    27 EGSIAEKFEQWKAQYGRTYKESA--ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
             E  +   +E W  ++G+   +++  E  +RFEIFKDNL  V+  N     N SY L L +
Sbjct:    43 EAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEK---NLSYRLGLTR 99

Query:    85 FADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS---SQVPPSVNWIEKGAVTPVKYQ 141
             FADLT  E+ +   G KM           T   Y++    ++P S++W +KGAV  VK Q
Sbjct:   100 FADLTNDEYRSKYLGAKMEKKGE----RRTSLRYEARVGDELPESIDWRKKGAVAEVKDQ 155

Query:   142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
             G C        + AVEGIN I    L++LSEQ+LVDC T+  N GC GG MD AF++II+
Sbjct:   156 GGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIK 214

Query:   195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
             N GI  D  Y Y+G+  G CD I+       I +YEDVP   EESL KAVA+QP+S+AI+
Sbjct:   215 NGGIDTDKDYPYKGVD-GTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIE 273

Query:   255 AS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
             A   A Q Y  G+F+G C T L+HGV AVGYGT E G  YW+++NSWG+ WGE GY R+ 
Sbjct:   274 AGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLRMA 332

Query:   313 RDIDQPQGQCGIAMFASFPVSKESAQPS 340
             R+I    G+CGIA+  S+P+      P+
Sbjct:   333 RNIASSSGKCGIAIEPSYPIKNGENPPN 360


>TAIR|locus:2167821 [details] [associations]
            symbol:RD21B "esponsive to dehydration 21B" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS] [GO:0005773
            "vacuole" evidence=IDA] [GO:0009651 "response to salt stress"
            evidence=IEP] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005829 "cytosol" evidence=IDA] [GO:0052541 "plant-type cell
            wall cellulose metabolic process" evidence=RCA] [GO:0052546 "cell
            wall pectin metabolic process" evidence=RCA] [GO:0005783
            "endoplasmic reticulum" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005783 GO:GO:0005829 EMBL:CP002688
            GO:GO:0005773 GO:GO:0009651 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AB008267 HSSP:O65039
            HOGENOM:HOG000230773 KO:K01376 InterPro:IPR000118 Pfam:PF00396
            SMART:SM00277 ProtClustDB:CLSN2688498 EMBL:AY062608 EMBL:AY114661
            IPI:IPI00520971 RefSeq:NP_568620.1 UniGene:At.24130 SMR:Q9FMH8
            IntAct:Q9FMH8 STRING:Q9FMH8 MEROPS:C01.A12
            EnsemblPlants:AT5G43060.1 GeneID:834321 KEGG:ath:AT5G43060
            TAIR:At5g43060 InParanoid:Q9FMH8 OMA:ENSEASL Genevestigator:Q9FMH8
            Uniprot:Q9FMH8
        Length = 463

 Score = 676 (243.0 bits), Expect = 1.7e-66, P = 1.7e-66
 Identities = 154/344 (44%), Positives = 201/344 (58%)

Query:    11 IISGSCASQATYRTFDEGSIAEK-FEQWKAQYGRTYKES----AENSKRFEIFKDNLVAV 65
             IIS       T  T    S  E+ +E W  ++G+         AE  +RFEIFKDNL  +
Sbjct:    26 IISYDENHHITTETSRSDSEVERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFI 85

Query:    66 ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPP 125
             +  N     N SY L L +FADLT +E+ +   G K +     LK +          +P 
Sbjct:    86 DEHNTK---NLSYKLGLTRFADLTNEEYRSMYLGAKPTKRV--LKTSDRYQARVGDALPD 140

Query:   126 SVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
             SV+W ++GAV  VK QG C        + AVEGIN I    L+SLSEQ+LVDC T+  N 
Sbjct:   141 SVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTS-YNQ 199

Query:   179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
             GC GG MD AF++II+N GI  +A Y Y+  + G CD  +       I +YEDVP N E 
Sbjct:   200 GCNGGLMDYAFEFIIKNGGIDTEADYPYKA-ADGRCDQNRKNAKVVTIDSYEDVPENSEA 258

Query:   239 SLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
             SL KA+A+QP+SVAI+A   A Q YS GVF+G C T L+HGV AVGYGT E G  YW+++
Sbjct:   259 SLKKALAHQPISVAIEAGGRAFQLYSSGVFDGLCGTELDHGVVAVGYGT-ENGKDYWIVR 317

Query:   297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
             NSWG  WGE GY ++ R+I+ P G+CGIAM AS+P+ K    P+
Sbjct:   318 NSWGNRWGESGYIKMARNIEAPTGKCGIAMEASYPIKKGQNPPN 361


>TAIR|locus:2024362 [details] [associations]
            symbol:XBCP3 "xylem bark cysteine peptidase 3"
            species:3702 "Arabidopsis thaliana" [GO:0005576 "extracellular
            region" evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0005783 "endoplasmic
            reticulum" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005783 EMBL:CP002684 GO:GO:0005773 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 MEROPS:I29.003
            HOGENOM:HOG000230773 InterPro:IPR000118 Pfam:PF00396 SMART:SM00277
            UniGene:At.10233 OMA:CEIESAV EMBL:BT026490 EMBL:AK226753
            IPI:IPI00536687 RefSeq:NP_563855.1 ProteinModelPortal:Q0WVJ5
            SMR:Q0WVJ5 PRIDE:Q0WVJ5 EnsemblPlants:AT1G09850.1 GeneID:837517
            KEGG:ath:AT1G09850 TAIR:At1g09850 InParanoid:Q0WVJ5
            PhylomeDB:Q0WVJ5 ProtClustDB:CLSN2687747 Genevestigator:Q0WVJ5
            Uniprot:Q0WVJ5
        Length = 437

 Score = 664 (238.8 bits), Expect = 3.2e-65, P = 3.2e-65
 Identities = 145/329 (44%), Positives = 194/329 (58%)

Query:    30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
             I+E F+ W  ++G+TY    E  +R +IFKDN   V + N   I N +Y+L LN FADLT
Sbjct:    28 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHN--LITNATYSLSLNAFADLT 85

Query:    90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC----- 144
               EF AS+ G  +S  S  + + G   L  S +VP SV+W +KGAVT VK QG C     
Sbjct:    86 HHEFKASRLGLSVSAPSVIMASKGQS-LGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWS 144

Query:   145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
               A  A+EGIN I    L+SLSEQ+L+DC     N GC GG MD AF+++I+N GI  + 
Sbjct:   145 FSATGAMEGINQIVTGDLISLSEQELIDC-DKSYNAGCNGGLMDYAFEFVIKNHGIDTEK 203

Query:   203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQF 260
              Y Y+    G C   K +     I +Y  V  NDE++L++AVA QPVSV I  S  A Q 
Sbjct:   204 DYPYQERD-GTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQL 262

Query:   261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
             YS G+F+G C T L+H V  VGYG S+ G+ YW++KNSWG+ WG DG+  +QR+ +   G
Sbjct:   263 YSSGIFSGPCSTSLDHAVLIVGYG-SQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDG 321

Query:   321 QCGIAMFASFPVSKE-SAQPSSADKSSAC 348
              CGI M AS+P+    +  P S    + C
Sbjct:   322 VCGINMLASYPIKTHPNPPPPSPPGPTKC 350


>TAIR|locus:2117979 [details] [associations]
            symbol:AT4G23520 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002687 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            KO:K01376 IPI:IPI00527171 RefSeq:NP_567686.2 UniGene:At.32421
            ProteinModelPortal:F4JNL3 SMR:F4JNL3 MEROPS:C01.A22 PRIDE:F4JNL3
            EnsemblPlants:AT4G23520.1 GeneID:828452 KEGG:ath:AT4G23520
            OMA:PANDEIS ArrayExpress:F4JNL3 Uniprot:F4JNL3
        Length = 356

 Score = 648 (233.2 bits), Expect = 1.6e-63, P = 1.6e-63
 Identities = 147/354 (41%), Positives = 213/354 (60%)

Query:     1 MAKYFLIVVLIISGSCASQ---ATYRTFDEGS--IAEKFEQWKAQYGRTYKES-AENSKR 54
             M   FL++V ++S   ++    AT    +  +  +   F+ W +++G+TY  +  E  +R
Sbjct:     9 MTILFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFIFQMWMSKHGKTYTNALGEKERR 68

Query:    55 FEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGT 114
             F+ FKDNL  +++ +NA   N SY L L +FADLT QE+     G        +LK +  
Sbjct:    69 FQNFKDNLRFIDQ-HNAK--NLSYQLGLTRFADLTVQEYRDLFPG-SPKPKQRNLKTSRR 124

Query:   115 PFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQ 167
                    Q+P SV+W ++GAV+ +K QG C        VAAVEG+N I    L+SLSEQ+
Sbjct:   125 YVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQE 184

Query:   168 LVDCATNDNNNGCYG-GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA-Q 225
             LVDC  N  NNGCYG G MD AF+++I N G+ ++  Y Y+G + G C+  ++  +    
Sbjct:   185 LVDC--NLVNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQG-TQGSCNRKQSTSNKVIT 241

Query:   226 ITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGY 283
             I +YEDVP NDE SL KAVA+QPVSV +D  + +F  Y   ++NG C T L+H +  VGY
Sbjct:   242 IDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGY 301

Query:   284 GTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
             G SE G  YW+++NSWG  WG+ GY ++ R+ + P+G CGIAM AS+P+ K SA
Sbjct:   302 G-SENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPI-KNSA 353


>TAIR|locus:2038515 [details] [associations]
            symbol:AT1G06260 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0048046 "apoplast"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0048046 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC025290
            MEROPS:I29.003 HSSP:O65039 HOGENOM:HOG000230773 OMA:METAFEF
            IPI:IPI00525965 PIR:D86198 RefSeq:NP_563764.1 UniGene:At.24617
            ProteinModelPortal:Q9LNC1 SMR:Q9LNC1 PaxDb:Q9LNC1 PRIDE:Q9LNC1
            EnsemblPlants:AT1G06260.1 GeneID:837137 KEGG:ath:AT1G06260
            TAIR:At1g06260 InParanoid:Q9LNC1 PhylomeDB:Q9LNC1
            ProtClustDB:CLSN2916975 Genevestigator:Q9LNC1 Uniprot:Q9LNC1
        Length = 343

 Score = 645 (232.1 bits), Expect = 3.3e-63, P = 3.3e-63
 Identities = 142/334 (42%), Positives = 192/334 (57%)

Query:     9 VLIISGSCASQATYRTFD-EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVER 67
             VLI S  C+  ++   +D   ++ ++FE+W   + + Y    E   RF I++ N+  ++ 
Sbjct:    19 VLIASKLCSVDSS--VYDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDY 76

Query:    68 FNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSV 127
              N+  +    + L  N+FAD+T  EF A   G   S  S  L     P    +  VP +V
Sbjct:    77 INSLHL---PFKLTDNRFADMTNSEFKAHFLGLNTS--SLRLHKKQRPVCDPAGNVPDAV 131

Query:   128 NWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
             +W  +GAVTP++ QG+C       AVAA+EGIN IK   LVSLSEQQL+DC     N GC
Sbjct:   132 DWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGC 191

Query:   181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL 240
              GG M+ AF++I  N G+  +  Y Y G+  G CD  K+++    I  Y+ V  N E SL
Sbjct:   192 SGGLMETAFEFIKTNGGLATETDYPYTGIE-GTCDQEKSKNKVVTIQGYQKVAQN-EASL 249

Query:   241 LKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNS 298
               A A QPVSV IDA     Q YS GVF  YC T LNHGVT VGYG  E   KYW++KNS
Sbjct:   250 QIAAAQQPVSVGIDAGGFIFQLYSSGVFTNYCGTNLNHGVTVVGYGV-EGDQKYWIVKNS 308

Query:   299 WGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             WG  WGE+GY R++R + +  G+CGIAM AS+P+
Sbjct:   309 WGTGWGEEGYIRMERGVSEDTGKCGIAMMASYPL 342


>TAIR|locus:2128243 [details] [associations]
            symbol:AT4G11310 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005618 "cell wall"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 GO:GO:0005618 EMBL:CP002687
            GenomeReviews:CT486007_GR EMBL:AL096882 EMBL:AL161531
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            HOGENOM:HOG000230773 KO:K01376 EMBL:AY093066 EMBL:BT000099
            IPI:IPI00520496 PIR:T13022 RefSeq:NP_567376.1 UniGene:At.43189
            ProteinModelPortal:Q9SUT0 SMR:Q9SUT0 IntAct:Q9SUT0 STRING:Q9SUT0
            MEROPS:C01.A20 PaxDb:Q9SUT0 PRIDE:Q9SUT0 EnsemblPlants:AT4G11310.1
            GeneID:826733 KEGG:ath:AT4G11310 TAIR:At4g11310 InParanoid:Q9SUT0
            OMA:EVCHGAD PhylomeDB:Q9SUT0 ProtClustDB:CLSN2689395
            Genevestigator:Q9SUT0 GermOnline:AT4G11310 Uniprot:Q9SUT0
        Length = 364

 Score = 630 (226.8 bits), Expect = 1.3e-61, P = 1.3e-61
 Identities = 157/370 (42%), Positives = 216/370 (58%)

Query:     2 AKYFLIVVLIISGSCAS--QATYRTFDEG----SI--AEK---FEQWKAQYGRTYKESAE 50
             A   L+V ++I+ SCA+    +  ++D+     S+  AE    FE W  ++G+ Y   AE
Sbjct:     7 AMLILLVAMVIA-SCATAIDMSVVSYDDNNRLHSVFDAEASLIFESWMVKHGKVYGSVAE 65

Query:    51 NSKRFEIFKDNLVAVERF-NNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSL 109
               +R  IF+DNL    RF NN    N SY L L  FADL+  E+     G       + +
Sbjct:    66 KERRLTIFEDNL----RFINNRNAENLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHV 121

Query:   110 KANGTPFLYKSSQ---VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINR 159
                 +   YK+S    +P SV+W  +GAVT VK QG C        V AVEG+N I    
Sbjct:   122 FMTSSD-RYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGE 180

Query:   160 LVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDS-IK 218
             LV+LSEQ L++C  N  NNGC GG ++ A+++I++N G+  D  Y Y+ ++ G+CD  +K
Sbjct:   181 LVTLSEQDLINC--NKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVN-GVCDGRLK 237

Query:   219 AEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNH 276
               +    I  YE++P NDE +L+KAVA+QPV+  ID+S+ +F  Y  GVF+G C T LNH
Sbjct:   238 ENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNH 297

Query:   277 GVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKES 336
             GV  VGYGT E G  YWL+KNS G  WGE GY ++ R+I  P+G CGIAM AS+P+ K S
Sbjct:   298 GVVVVGYGT-ENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPL-KNS 355

Query:   337 AQPSSADKSS 346
                 S DKSS
Sbjct:   356 F---STDKSS 362


>TAIR|locus:2128253 [details] [associations]
            symbol:AT4G11320 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002687 GenomeReviews:CT486007_GR EMBL:AL096882 EMBL:AL161531
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 OMA:ICHGADP
            HOGENOM:HOG000230773 KO:K01376 ProtClustDB:CLSN2689395
            EMBL:AY035055 EMBL:AY051062 IPI:IPI00520480 PIR:T13023
            RefSeq:NP_567377.1 UniGene:At.25206 ProteinModelPortal:Q9SUS9
            SMR:Q9SUS9 STRING:Q9SUS9 MEROPS:C01.A21 PaxDb:Q9SUS9 PRIDE:Q9SUS9
            EnsemblPlants:AT4G11320.1 GeneID:826734 KEGG:ath:AT4G11320
            TAIR:At4g11320 InParanoid:Q9SUS9 PhylomeDB:Q9SUS9
            Genevestigator:Q9SUS9 GermOnline:AT4G11320 Uniprot:Q9SUS9
        Length = 371

 Score = 618 (222.6 bits), Expect = 2.4e-60, P = 2.4e-60
 Identities = 142/327 (43%), Positives = 195/327 (59%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF-NNAAIGNRSYTLRLNKFADLTPQE 92
             FE W  ++G+ Y   AE  +R  IF+DNL    RF  N    N SY L LN+FADL+  E
Sbjct:    56 FESWMVKHGKVYDSVAEKERRLTIFEDNL----RFITNRNAENLSYRLGLNRFADLSLHE 111

Query:    93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQ---VPPSVNWIEKGAVTPVKYQGQC----- 144
             +     G       + +    +   YK+S    +P SV+W  +GAVT VK QG C     
Sbjct:   112 YGEICHGADPRPPRNHVFMTSSN-RYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWA 170

Query:   145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
                V AVEG+N I    LV+LSEQ L++C  N  NNGC GG ++ A+++I+ N G+  D 
Sbjct:   171 FSTVGAVEGLNKIVTGELVTLSEQDLINC--NKENNGCGGGKVETAYEFIMNNGGLGTDN 228

Query:   203 VYSYEGMSTGICDS-IKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF- 260
              Y Y+ ++ G+C+  +K ++    I  YE++P NDE +L+KAVA+QPV+  +D+S+ +F 
Sbjct:   229 DYPYKALN-GVCEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQ 287

Query:   261 -YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
              Y  GVF+G C T LNHGV  VGYGT E G  YW++KNS G  WGE GY ++ R+I  P+
Sbjct:   288 LYESGVFDGTCGTNLNHGVVVVGYGT-ENGRDYWIVKNSRGDTWGEAGYMKMARNIANPR 346

Query:   320 GQCGIAMFASFPVSKESAQPSSADKSS 346
             G CGIAM AS+P+ K S    S DK S
Sbjct:   347 GLCGIAMRASYPL-KNSF---STDKVS 369


>UNIPROTKB|F1NYJ1 [details] [associations]
            symbol:CTSL2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 IPI:IPI00602255
            OMA:DITHHEF EMBL:AADN02067812 Ensembl:ENSGALT00000020588
            ArrayExpress:F1NYJ1 Uniprot:F1NYJ1
        Length = 339

 Score = 579 (208.9 bits), Expect = 3.3e-56, P = 3.3e-56
 Identities = 138/342 (40%), Positives = 191/342 (55%)

Query:     8 VVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVER 67
             V L I   C   A      +  +   ++ WK+ + + Y E  E+ +R  +++ NL  +E 
Sbjct:     4 VCLTILSLCLGLAFAAPRVDPDLDSHWQLWKSWHSKDYHEREESWRRV-VWEKNLKMIEL 62

Query:    68 FN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QVPP 125
              N + ++G  SY L +N+F D+T +EF     G+K     S  K  G+ FL  S  + P 
Sbjct:    63 HNLDHSLGKHSYKLGMNQFGDMTAEEFRQLMNGYKHK--KSERKYRGSQFLEPSFLEAPR 120

Query:   126 SVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
             SV+W EKG VTPVK QGQC          A+EG +  K  +LVSLSEQ LVDC+  + N 
Sbjct:   121 SVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQ 180

Query:   179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
             GC GG MD AF+Y+  N GI ++  Y Y       C   KAE +AA  T + D+P   E 
Sbjct:   181 GCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDC-RYKAEYNAANDTGFVDIPQGHER 239

Query:   239 SLLKAVANQ-PVSVAIDA--SALQFYSGGVF-NGYCETF-LNHGVTAVGYGTSEE---GI 290
             +L+KAVA+  PVSVAIDA  S+ QFY  G++    C +  L+HGV  VGYG   E   G 
Sbjct:   240 ALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGK 299

Query:   291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             KYW++KNSWG+ WG+ GY  + +D    +  CGIA  AS+P+
Sbjct:   300 KYWIVKNSWGEKWGDKGYIYMAKD---RKNHCGIATAASYPL 338


>UNIPROTKB|F1S4J6 [details] [associations]
            symbol:Ssc.54235 "Cathepsin L1" species:9823 "Sus scrofa"
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197
            GO:GO:0002250 GeneTree:ENSGT00560000076577 GO:GO:0097067
            EMBL:CU571031 RefSeq:XP_003130681.1 Ensembl:ENSSSCT00000011983
            GeneID:100515919 KEGG:ssc:100515919 OMA:IAICATK Uniprot:F1S4J6
        Length = 332

 Score = 567 (204.7 bits), Expect = 6.1e-55, P = 6.1e-55
 Identities = 136/339 (40%), Positives = 188/339 (55%)

Query:    10 LIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN 69
             L+++  C   A+     + S+   + +WKA + + Y  + E  +R  I++ N+  +ER N
Sbjct:     5 LLLAAFCLGIASAAPRHDHSLDADWYKWKATHRKLYGLNEEGRRR-AIWEKNMKMIERHN 63

Query:    70 -NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPP-SV 127
                  G  S+T+ +N F D+T +EF  +  GF+   H       G  FL   S + P SV
Sbjct:    64 WEHRQGKHSFTMAMNAFGDMTNEEFRKTMNGFQNQKHKK-----GKVFLDAGSALTPHSV 118

Query:   128 NWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
             +W EKG VT VK QG C       A  A+EG    K ++L+SLSEQ LVDC+  + N GC
Sbjct:   119 DWREKGYVTAVKNQGHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSWPEGNEGC 178

Query:   181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL 240
              GG MD+AF+YI  N G+ ++  Y Y G   G C   K +  AA  T Y D+P   E++L
Sbjct:   179 NGGLMDNAFQYIKDNGGLDSEESYPYFGKD-GSC-KYKPQSSAANDTGYVDIP-KQEKAL 235

Query:   241 LKAVANQ-PVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVTAVGYGT--SEEGIKYW 293
             +KAVA   P+SV IDAS  + QFYS G+ F   C +  L+HGV  VGYG   +    KYW
Sbjct:   236 MKAVATVGPISVGIDASHESFQFYSTGIYFEPQCSSEDLDHGVLVVGYGVEGAHSNNKYW 295

Query:   294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             L+KNSWG  WG DGY ++ +D       CGIA  AS+PV
Sbjct:   296 LVKNSWGNTWGMDGYIKMTKD---QNNHCGIATMASYPV 331


>UNIPROTKB|Q86GF7 [details] [associations]
            symbol:Cys "Crustapain" species:6703 "Pandalus borealis"
            [GO:0005576 "extracellular region" evidence=IC] [GO:0007586
            "digestion" evidence=NAS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IDA] [GO:0030163 "protein catabolic process"
            evidence=NAS] [GO:0030574 "collagen catabolic process"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005576
            GO:GO:0007586 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0030163 GO:GO:0030574 EMBL:AB091669
            ProteinModelPortal:Q86GF7 SMR:Q86GF7 MEROPS:C01.030 Uniprot:Q86GF7
        Length = 323

 Score = 567 (204.7 bits), Expect = 6.1e-55, P = 6.1e-55
 Identities = 126/313 (40%), Positives = 181/313 (57%)

Query:    33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAI-GNRSYTLRLNKFADLTPQ 91
             ++E +K ++G+ Y  S E S R  +F D L  ++  N     G  +Y L++N F+DLT +
Sbjct:    19 EWENFKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHE 78

Query:    92 EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------- 144
             E +A++TG     H  S+     P    ++ +   V+W  KGAVTPVK QGQC       
Sbjct:    79 EVLATKTGMTRRRHPLSVLPKSAP----TTPMAADVDWRNKGAVTPVKDQGQCGSCWAFS 134

Query:   145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
             AVAA+EG + +K   LVSLSEQ LVDC+++  N GC GG+   A++YII N+GI  ++ Y
Sbjct:   135 AVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRGIDTESSY 194

Query:   205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDASALQF--Y 261
              Y+ +    C    A +  A +++Y +    DE +L  AV N+ PVSV IDA    F  Y
Sbjct:   195 PYKAIDDN-C-RYDAGNIGATVSSYVEPASGDESALQHAVQNEGPVSVCIDAGQSSFGSY 252

Query:   262 SGGVF-NGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
              GGV+    C++ + NH VTAVGYGT   G  YW++KNSWG  WGE GY ++ R+ D   
Sbjct:   253 GGGVYYEPNCDSWYANHAVTAVGYGTDANGGDYWIVKNSWGAWWGESGYIKMARNRDN-- 310

Query:   320 GQCGIAMFASFPV 332
               C IA ++ +PV
Sbjct:   311 -NCAIATYSVYPV 322


>RGD|1560071 [details] [associations]
            symbol:Ctsll3 "cathepsin L-like 3" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1560071 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:CH474032 IPI:IPI00560469 RefSeq:XP_001065834.2
            RefSeq:XP_573976.3 UniGene:Rn.104851 MEROPS:C01.107
            Ensembl:ENSRNOT00000061398 GeneID:498691 KEGG:rno:498691
            UCSC:RGD:1560071 CTD:70202 OMA:NCGIASD OrthoDB:EOG4HDSTZ
            NextBio:700548 Uniprot:D3ZJV2
        Length = 330

 Score = 567 (204.7 bits), Expect = 6.1e-55, P = 6.1e-55
 Identities = 132/345 (38%), Positives = 191/345 (55%)

Query:     1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
             M   FL+  L + G  ++  T+    + S    +E+WK ++G+TY  + E  KR  ++++
Sbjct:     1 MTPIFLLATLCL-GMISAAPTH----DPSFDTVWEEWKTKHGKTYNTNEEGQKR-AVWEN 54

Query:    61 NLVAVERFNNAAI-GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
             N+  +   N   + G   ++L +N F DLT  EF    TGF+    +  +K    PFL  
Sbjct:    55 NMKMINLHNEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQ-GQKTKMMKVFPEPFL-- 111

Query:   120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
                VP +V+W + G VTPVK QG C       AV ++EG    K  +LV LSEQ LVDC+
Sbjct:   112 -GDVPKTVDWRKHGYVTPVKNQGPCGSCWAFSAVGSLEGQVFRKTGKLVPLSEQNLVDCS 170

Query:   173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
              +  N GC GG  D AF+Y+  N G+     Y YE ++ G C     +  AA++  +  +
Sbjct:   171 WSHGNKGCDGGLPDFAFQYVKDNGGLDTSVSYPYEALN-GTC-RYNPKYSAAKVVGFMSI 228

Query:   233 PPNDEESLLKAVANQ-PVSVAIDAS--ALQFYSGGVF-NGYCE-TFLNHGVTAVGYGTSE 287
             PP+ E +L+KAVA   P+SV ID    + QFY GG++    C  T LNH V  VGYG   
Sbjct:   229 PPS-ENALMKAVATVGPISVGIDIKHKSFQFYKGGMYYEPDCSSTNLNHAVLVVGYGEES 287

Query:   288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             +G KYWL+KNSWG+DWG DGY ++ +D +     CGIA  AS+P+
Sbjct:   288 DGRKYWLVKNSWGRDWGMDGYIKMAKDWNN---NCGIASDASYPI 329


>RGD|1308751 [details] [associations]
            symbol:RGD1308751 "similar to Cathepsin L precursor (Major
            excreted protein) (MEP)" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1308751 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:CH474032 IPI:IPI00365697 RefSeq:XP_001065885.2
            RefSeq:XP_225137.5 MEROPS:C01.069 Ensembl:ENSRNOT00000061391
            GeneID:290981 KEGG:rno:290981 UCSC:RGD:1308751 CTD:290981
            OMA:ESYAYEA OrthoDB:EOG42823G NextBio:631921 Uniprot:D3ZKC3
        Length = 330

 Score = 563 (203.2 bits), Expect = 1.6e-54, P = 1.6e-54
 Identities = 132/345 (38%), Positives = 196/345 (56%)

Query:     1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
             M   FL+  L + G  ++  T+    + S    +E+WK ++G+TY  + E  KR  ++++
Sbjct:     1 MTPIFLLATLCL-GMISAAPTH----DPSFDTVWEEWKTKHGKTYNTNEEGQKR-AVWEN 54

Query:    61 NLVAVERFNNAAI-GNRSYTLRLNKFADLTPQEFIASQTGFK-MSDHSSSLKANGTPFLY 118
             N+  +   N   + G   ++L +N F DLT  EF    TGF+ M    +++     PFL 
Sbjct:    55 NMKMINLHNEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQSMGPKETTIFRE--PFL- 111

Query:   119 KSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
                 +P S++W E G VTPVK QGQC       AV ++EG    K  +LVSLSEQ LVDC
Sbjct:   112 --GDIPKSLDWREHGYVTPVKNQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDC 169

Query:   172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
             + +  N GC GG M+ AF+Y+ +N+G+     Y+YE    G+C     +  AA +T +  
Sbjct:   170 SWSYGNLGCNGGLMEFAFQYVKENRGLDTGESYAYEAQD-GLC-RYNPKYSAANVTGFVK 227

Query:   232 VPPNDEESLLKAVANQ-PVSVAIDA--SALQFYSGGVF-NGYCE-TFLNHGVTAVGYGTS 286
             VP + E+ L+ AVA+  PVSV ID+   + +FYSGG++    C  T ++H V  VGYG  
Sbjct:   228 VPLS-EDDLMSAVASVGPVSVGIDSHHQSFRFYSGGMYYEPDCSSTEMDHAVLVVGYGEE 286

Query:   287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
              +G KYWL+KNSWG+DWG DGY ++ +D       CGIA +A +P
Sbjct:   287 SDGGKYWLVKNSWGEDWGMDGYIKMAKD---QNNNCGIATYAIYP 328


>RGD|2448 [details] [associations]
            symbol:Ctsl1 "cathepsin L1" species:10116 "Rattus norvegicus"
          [GO:0002250 "adaptive immune response" evidence=ISO] [GO:0004177
          "aminopeptidase activity" evidence=IDA] [GO:0004197 "cysteine-type
          endopeptidase activity" evidence=ISO;IDA] [GO:0005576 "extracellular
          region" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IDA]
          [GO:0005764 "lysosome" evidence=ISO;IDA] [GO:0005773 "vacuole"
          evidence=IDA] [GO:0005902 "microvillus" evidence=IDA] [GO:0006508
          "proteolysis" evidence=IEP;ISO] [GO:0007154 "cell communication"
          evidence=IDA] [GO:0007283 "spermatogenesis" evidence=IEP] [GO:0008234
          "cysteine-type peptidase activity" evidence=ISO] [GO:0008584 "male
          gonad development" evidence=IEP] [GO:0009267 "cellular response to
          starvation" evidence=IEP] [GO:0009749 "response to glucose stimulus"
          evidence=IEP] [GO:0009897 "external side of plasma membrane"
          evidence=IDA] [GO:0010259 "multicellular organismal aging"
          evidence=IEP] [GO:0014070 "response to organic cyclic compound"
          evidence=IEP] [GO:0021675 "nerve development" evidence=IEP]
          [GO:0030984 "kininogen binding" evidence=IPI] [GO:0032403 "protein
          complex binding" evidence=IPI] [GO:0034698 "response to gonadotropin
          stimulus" evidence=IEP] [GO:0042277 "peptide binding" evidence=IDA]
          [GO:0042393 "histone binding" evidence=ISO] [GO:0043005 "neuron
          projection" evidence=IDA] [GO:0043204 "perikaryon" evidence=IDA]
          [GO:0046697 "decidualization" evidence=IEP] [GO:0048102 "autophagic
          cell death" evidence=IEP] [GO:0051384 "response to glucocorticoid
          stimulus" evidence=IEP] [GO:0060008 "Sertoli cell differentiation"
          evidence=IEP] [GO:0097067 "cellular response to thyroid hormone
          stimulus" evidence=ISO] [GO:0030141 "secretory granule" evidence=IDA]
          [GO:0045177 "apical part of cell" evidence=IDA] [GO:0060441
          "epithelial tube branching involved in lung morphogenesis"
          evidence=ISO] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
          PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:Y00697 RGD:2448
          GO:GO:0005576 GO:GO:0009897 GO:GO:0034698 GO:GO:0043204 GO:GO:0009749
          GO:GO:0051384 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
          InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
          PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
          PROSITE:PS00139 PROSITE:PS00639 GO:GO:0043005 GO:GO:0007283
          GO:GO:0004177 GO:GO:0005764 GO:GO:0042277 GO:GO:0009267 GO:GO:0021675
          GO:GO:0005902 GO:GO:0010259 GO:GO:0004197 GO:GO:0048102 GO:GO:0046697
          GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 KO:K01365
          OrthoDB:EOG48PMKF MEROPS:C01.032 OMA:FDQNLDT CTD:1514
          BRENDA:3.4.22.15 GO:GO:0060008 EMBL:AF025476 EMBL:BC063175
          EMBL:S85184 IPI:IPI00326070 PIR:S07098 RefSeq:NP_037288.1
          UniGene:Rn.1294 ProteinModelPortal:P07154 SMR:P07154 IntAct:P07154
          STRING:P07154 PhosphoSite:P07154 PRIDE:P07154
          Ensembl:ENSRNOT00000025462 GeneID:25697 KEGG:rno:25697 UCSC:RGD:2448
          InParanoid:P07154 SABIO-RK:P07154 BindingDB:P07154 ChEMBL:CHEMBL2305
          NextBio:607715 Genevestigator:P07154 GermOnline:ENSRNOG00000018566
          Uniprot:P07154
        Length = 334

 Score = 560 (202.2 bits), Expect = 3.4e-54, P = 3.4e-54
 Identities = 131/348 (37%), Positives = 196/348 (56%)

Query:     1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
             M    L+ VL +  + A+    +TF+      ++ QWK+ + R Y  + E  +R  +++ 
Sbjct:     1 MTPLLLLAVLCLGTALATPKFDQTFNA-----QWHQWKSTHRRLYGTNEEEWRR-AVWEK 54

Query:    61 NLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
             N+  ++  N   + G   +T+ +N F D+T +EF     G++   H    +    P +  
Sbjct:    55 NMRMIQLHNGEYSNGKHGFTMEMNAFGDMTNEEFRQIVNGYRHQKHKKG-RLFQEPLML- 112

Query:   120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
               Q+P +V+W EKG VTPVK QGQC       A   +EG   +K  +L+SLSEQ LVDC+
Sbjct:   113 --QIPKTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCS 170

Query:   173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
              +  N GC GG MD AF+YI +N G+ ++  Y YE    G C   +AE   A  T + D+
Sbjct:   171 HDQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKD-GSC-KYRAEYAVANDTGFVDI 228

Query:   233 PPNDEESLLKAVANQ-PVSVAIDAS--ALQFYSGGVF-NGYCETF-LNHGVTAVGYG--- 284
             P   E++L+KAVA   P+SVA+DAS  +LQFYS G++    C +  L+HGV  VGYG   
Sbjct:   229 P-QQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEG 287

Query:   285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             T     KYWL+KNSWG++WG DGY ++ +D +     CG+A  AS+P+
Sbjct:   288 TDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNN---HCGLATAASYPI 332


>MGI|MGI:88564 [details] [associations]
            symbol:Ctsl "cathepsin L" species:10090 "Mus musculus"
            [GO:0004177 "aminopeptidase activity" evidence=ISO] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISO] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005730 "nucleolus"
            evidence=NAS] [GO:0005737 "cytoplasm" evidence=ISO] [GO:0005764
            "lysosome" evidence=ISO] [GO:0005773 "vacuole" evidence=ISO]
            [GO:0005902 "microvillus" evidence=ISO] [GO:0006508 "proteolysis"
            evidence=ISO;IDA] [GO:0007154 "cell communication" evidence=ISO]
            [GO:0008152 "metabolic process" evidence=TAS] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISO;TAS] [GO:0009897 "external side of
            plasma membrane" evidence=ISO] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0030141 "secretory granule" evidence=ISO]
            [GO:0030984 "kininogen binding" evidence=ISO] [GO:0032403 "protein
            complex binding" evidence=ISO] [GO:0042277 "peptide binding"
            evidence=ISO] [GO:0042393 "histone binding" evidence=ISO;NAS]
            [GO:0043005 "neuron projection" evidence=ISO] [GO:0043204
            "perikaryon" evidence=ISO] [GO:0045177 "apical part of cell"
            evidence=ISO] [GO:0048863 "stem cell differentiation" evidence=NAS]
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IMP] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:88564 GO:GO:0005730 GO:GO:0009897 GO:GO:0034698
            GO:GO:0043204 GO:GO:0009749 GO:GO:0030141 GO:GO:0048863
            GO:GO:0051384 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045177 GO:GO:0043005
            GO:GO:0007283 GO:GO:0004177 GO:GO:0005764 GO:GO:0042277
            GO:GO:0009267 GO:GO:0021675 GO:GO:0042393 GO:GO:0005902
            GO:GO:0010259 GO:GO:0004197 GO:GO:0048102 GO:GO:0046697
            HOVERGEN:HBG011513 KO:K01365 OMA:EEFRATH OrthoDB:EOG48PMKF
            MEROPS:C01.032 BRENDA:3.4.22.15 ChiTaRS:CTSL1 EMBL:X06086
            EMBL:J02583 EMBL:M20495 EMBL:AF121837 EMBL:AF121838 EMBL:AF121839
            EMBL:BC068163 EMBL:X04392 IPI:IPI00128154 PIR:S01177
            RefSeq:NP_034114.1 UniGene:Mm.930 PDB:1MVV PDBsum:1MVV
            ProteinModelPortal:P06797 SMR:P06797 STRING:P06797
            PhosphoSite:P06797 PaxDb:P06797 PRIDE:P06797
            Ensembl:ENSMUST00000021933 GeneID:13039 KEGG:mmu:13039 CTD:13039
            InParanoid:P06797 BioCyc:MetaCyc:MONOMER-14812 BindingDB:P06797
            ChEMBL:CHEMBL5291 NextBio:282928 Bgee:P06797 CleanEx:MM_CTSL
            Genevestigator:P06797 GermOnline:ENSMUSG00000021477 GO:GO:0060008
            Uniprot:P06797
        Length = 334

 Score = 558 (201.5 bits), Expect = 5.5e-54, P = 5.5e-54
 Identities = 133/342 (38%), Positives = 193/342 (56%)

Query:     8 VVLIISGSCASQATYRT-FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
             ++L+++  C   A     FD+   AE + QWK+ + R Y  + E  +R  I++ N+  ++
Sbjct:     3 LLLLLAVLCLGTALATPKFDQTFSAE-WHQWKSTHRRLYGTNEEEWRR-AIWEKNMRMIQ 60

Query:    67 RFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPP 125
               N   + G   +++ +N F D+T +EF     G++   H    +    P + K   +P 
Sbjct:    61 LHNGEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYRHQKHKKG-RLFQEPLMLK---IPK 116

Query:   126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
             SV+W EKG VTPVK QGQC       A   +EG   +K  +L+SLSEQ LVDC+    N 
Sbjct:   117 SVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQ 176

Query:   179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
             GC GG MD AF+YI +N G+ ++  Y YE    G C   +AE   A  T + D+P   E+
Sbjct:   177 GCNGGLMDFAFQYIKENGGLDSEESYPYEAKD-GSC-KYRAEFAVANDTGFVDIP-QQEK 233

Query:   239 SLLKAVANQ-PVSVAIDAS--ALQFYSGGVF-NGYCETF-LNHGVTAVGYG---TSEEGI 290
             +L+KAVA   P+SVA+DAS  +LQFYS G++    C +  L+HGV  VGYG   T     
Sbjct:   234 ALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKN 293

Query:   291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             KYWL+KNSWG +WG +GY ++ +D D     CG+A  AS+PV
Sbjct:   294 KYWLVKNSWGSEWGMEGYIKIAKDRDN---HCGLATAASYPV 332


>UNIPROTKB|Q9GL24 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 CTD:1515 KO:K01365
            OrthoDB:EOG48PMKF EMBL:AJ279008 RefSeq:NP_001239115.1
            UniGene:Cfa.3571 ProteinModelPortal:Q9GL24 SMR:Q9GL24
            MEROPS:C01.032 Ensembl:ENSCAFT00000001770
            Ensembl:ENSCAFT00000023837 GeneID:100684364 KEGG:cfa:100684364
            InParanoid:Q9GL24 OMA:FDQNLDT NextBio:20817211 Uniprot:Q9GL24
        Length = 333

 Score = 556 (200.8 bits), Expect = 8.9e-54, P = 8.9e-54
 Identities = 132/338 (39%), Positives = 190/338 (56%)

Query:    10 LIISGSCASQATYRT-FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF 68
             L ++  C   A+    FD+ S+  ++ QWKA + R Y  + E  +R  +++ N+  +E  
Sbjct:     5 LFLTALCLGIASAAPKFDQ-SLNAQWYQWKATHRRLYGMNEEGWRR-AVWEKNMKMIELH 62

Query:    69 NNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSV 127
             N   + G   +T+ +N F D+T +EF     GF+   H    K    P     +++P SV
Sbjct:    63 NREYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFQNQKHKKG-KMFQEPLF---AEIPKSV 118

Query:   128 NWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
             +W EKG VTPVK QGQC       A  A+EG    K  +LVSLSEQ LVDC+    N GC
Sbjct:   119 DWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGC 178

Query:   181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL 240
              GG MD+AF+Y+  N G+ ++  Y Y G  T  C+  K E  AA  T + D+P   E++L
Sbjct:   179 NGGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCN-YKPECSAANDTGFVDLPQR-EKAL 236

Query:   241 LKAVANQ-PVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVTAVGYGT--SEEGIKYW 293
             +KAVA   P+SVAIDA   + QFY  G+ F+  C +  L+HGV  VGYG   ++   K+W
Sbjct:   237 MKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKFW 296

Query:   294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
             ++KNSWG +WG +GY ++ +D       CGIA  AS+P
Sbjct:   297 IVKNSWGPEWGWNGYVKMAKD---QNNHCGIATAASYP 331


>FB|FBgn0013770 [details] [associations]
            symbol:Cp1 "Cysteine proteinase-1" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS;NAS] [GO:0030163 "protein catabolic process"
            evidence=NAS] [GO:0005764 "lysosome" evidence=NAS] [GO:0048102
            "autophagic cell death" evidence=IEP] [GO:0035071 "salivary gland
            cell autophagic cell death" evidence=IEP] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0045169 "fusome" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:AE013599 GO:GO:0007586 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0035071 GO:GO:0045169 GeneTree:ENSGT00660000095458 KO:K01365
            EMBL:U75652 EMBL:AF012089 EMBL:BT016071 EMBL:D31970
            RefSeq:NP_523735.2 RefSeq:NP_725347.1 RefSeq:NP_725348.1
            UniGene:Dm.7400 ProteinModelPortal:Q95029 SMR:Q95029 IntAct:Q95029
            MINT:MINT-814156 STRING:Q95029 MEROPS:C01.092 PaxDb:Q95029
            EnsemblMetazoa:FBtr0087593 GeneID:36546 KEGG:dme:Dmel_CG6692
            CTD:36546 FlyBase:FBgn0013770 InParanoid:Q95029 OMA:ICHGADP
            OrthoDB:EOG46M91C PhylomeDB:Q95029 GenomeRNAi:36546 NextBio:799136
            Bgee:Q95029 GermOnline:CG6692 Uniprot:Q95029
        Length = 371

 Score = 554 (200.1 bits), Expect = 1.5e-53, P = 1.5e-53
 Identities = 130/321 (40%), Positives = 180/321 (56%)

Query:    30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
             + E++  +K ++ + Y++  E   R +IF +N   + + N   A G  S+ L +NK+ADL
Sbjct:    55 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 114

Query:    89 TPQEFIASQTGFKMSDHSSSLKAN----GTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
                EF     GF  + H     A+    G  F+  +   +P SV+W  KGAVT VK QG 
Sbjct:   115 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 174

Query:   144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
             C       +  A+EG +  K   LVSLSEQ LVDC+T   NNGC GG MD+AF+YI  N 
Sbjct:   175 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 234

Query:   197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDA 255
             GI  +  Y YE +    C   K    A     + D+P  DE+ + +AVA   PVSVAIDA
Sbjct:   235 GIDTEKSYPYEAIDDS-CHFNKGTVGATD-RGFTDIPQGDEKKMAEAVATVGPVSVAIDA 292

Query:   256 S--ALQFYSGGVFNG-YCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
             S  + QFYS GV+N   C+   L+HGV  VG+GT E G  YWL+KNSWG  WG+ G+ ++
Sbjct:   293 SHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKM 352

Query:   312 QRDIDQPQGQCGIAMFASFPV 332
              R+    + QCGIA  +S+P+
Sbjct:   353 LRN---KENQCGIASASSYPL 370


>UNIPROTKB|F1SS93 [details] [associations]
            symbol:CTSS "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0002250 "adaptive immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0016020 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0002250
            GeneTree:ENSGT00560000076577 GO:GO:0097067 EMBL:CU463875
            Ensembl:ENSSSCT00000007284 OMA:CEIESAV Uniprot:F1SS93
        Length = 342

 Score = 552 (199.4 bits), Expect = 2.4e-53, P = 2.4e-53
 Identities = 139/342 (40%), Positives = 197/342 (57%)

Query:     6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
             L+ VL++  S  +Q  +R   + ++   ++ WK  YG+ YKE  E   R  I++ NL  V
Sbjct:    15 LVWVLLLCSSAMAQL-HR---DPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTV 70

Query:    66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS--- 121
                N   ++G  SY L +N   D+T +E I+  +  ++    S    N T   YKS+   
Sbjct:    71 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVP---SQWPRNVT---YKSNPNQ 124

Query:   122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
             ++P S++W EKG VT VKYQG C       AV A+E    +K  RLVSLS Q LVDC+T 
Sbjct:   125 KLPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTE 184

Query:   175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
                N GC GGFM +AF+YII N GI ++A Y Y+ +  G C    +++ AA  + Y ++P
Sbjct:   185 KYRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVD-GKC-KYDSKNRAATCSRYTELP 242

Query:   234 PNDEESLLKAVANQ-PVSVAIDA--SALQFYSGGVF-NGYCETFLNHGVTAVGYGTSEEG 289
               DE +L +AVAN+ PVSVAIDA  S+  FY  GV+ +  C   +NHGV  VGYG    G
Sbjct:   243 FADEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQNVNHGVLVVGYGNLN-G 301

Query:   290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
               YWL+KNSWG ++G+ GY R+ R+    +  CGIA + S+P
Sbjct:   302 KDYWLVKNSWGLNFGDGGYIRMARN---SENHCGIANYPSYP 340


>UNIPROTKB|A4IFS7 [details] [associations]
            symbol:CTSL1 "CTSL1 protein" species:9913 "Bos taurus"
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197 GO:GO:0002250
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 GO:GO:0097067
            OrthoDB:EOG48PMKF MEROPS:C01.032 CTD:1514 EMBL:DAAA02023987
            EMBL:BC134741 IPI:IPI00708619 RefSeq:NP_001077155.1
            UniGene:Bt.23199 SMR:A4IFS7 Ensembl:ENSBTAT00000000962
            GeneID:515200 KEGG:bta:515200 InParanoid:A4IFS7 OMA:NDEQALM
            NextBio:20871707 Uniprot:A4IFS7
        Length = 333

 Score = 548 (198.0 bits), Expect = 6.3e-53, P = 6.3e-53
 Identities = 123/338 (36%), Positives = 194/338 (57%)

Query:    10 LIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN 69
             L+++  C   A+     + S+  +++ WKA + + Y  + E  ++  ++K N+  +E  N
Sbjct:     5 LLLTALCLGIASAAPKFDHSLDTQWKLWKAAHRKPYDLNEEGWRK-AVWKKNMKMIELHN 63

Query:    70 NA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVN 128
                + G  S+++ +N F D+T +EF  +  GF+   +    + + T F    + +PPSV+
Sbjct:    64 QEYSQGKHSFSMAMNAFGDMTNEEFRHTMNGFQRQKNKKGKEFHETIF----ASIPPSVD 119

Query:   129 WIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCY 181
             W EKG VTPVK QG+C       A  A+EG    K  +LVSLSEQ LVDC+  + N GC+
Sbjct:   120 WREKGYVTPVKNQGKCGSCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQPEGNRGCH 179

Query:   182 GGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLL 241
             GGF+D+AF+Y++   G+ ++  Y Y G+  G C      + AA  T + D+P   E++L+
Sbjct:   180 GGFIDNAFQYVLDVGGLDSEESYPYTGL-VGTC-LYNPNNSAANETGFVDLP-KQEKALM 236

Query:   242 KAVANQ-PVSVAIDAS--ALQFYSGGVF-NGYCET-FLNHGVTAVGYG---TSEEGIKYW 293
             KAVAN  P+SVA+DA   + QFY  G++    C +  ++H V  VGYG      +  KYW
Sbjct:   237 KAVANLGPISVAVDAHNPSFQFYKSGIYYEPNCSSESVDHAVLVVGYGFEGADSDDNKYW 296

Query:   294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
             L+KNSWG+ WG +GY ++ +D +     CGIA  AS+P
Sbjct:   297 LVKNSWGEHWGMNGYIKMAKDRNN---HCGIATMASYP 331


>DICTYBASE|DDB_G0279799 [details] [associations]
            symbol:cprB "cysteine proteinase 2" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0279799 GenomeReviews:CM000152_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            MEROPS:I29.003 KO:K01365 EMBL:AAFI02000033 EMBL:M16039 EMBL:X03344
            PIR:A25439 RefSeq:XP_641494.1 ProteinModelPortal:P04989 SMR:P04989
            EnsemblProtists:DDB0214998 GeneID:8622234 KEGG:ddi:DDB_G0279799
            OMA:YVNITAG Uniprot:P04989
        Length = 376

 Score = 452 (164.2 bits), Expect = 8.7e-53, Sum P(2) = 8.7e-53
 Identities = 113/294 (38%), Positives = 164/294 (55%)

Query:     5 FLIVVLIISGSCAS-QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
             FLI+++ ++ S A+ +   R F E      F +W  ++ R Y  S+E S R+ IFK N+ 
Sbjct:     6 FLILLIFVNFSFANVRPNGRRFSESQYRTAFTEWTLKFNRQYS-SSEFSNRYSIFKSNMD 64

Query:    64 AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL-YKSSQ 122
              V+ +N+   G+    L LN FAD+T +E+  +  G +++ HS +   +G   L  +  Q
Sbjct:    65 YVDNWNSK--GDSQTVLGLNNFADITNEEYRKTYLGTRVNAHSYN-GYDGREVLNVEDLQ 121

Query:   123 V-PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATN 174
               P S++W  K AVTP+K QGQC          + EG +A+K  +LVSLSEQ LVDC+  
Sbjct:   122 TNPKSIDWRTKNAVTPIKDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGP 181

Query:   175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             + N GC GG M++AF YII+NKGI  ++ Y Y   +   C   K+ D  A I  Y ++  
Sbjct:   182 EENFGCDGGLMNNAFDYIIKNKGIDTESSYPYTAETGSTCLFNKS-DIGATIKGYVNITA 240

Query:   235 NDEESLLKAVANQPVSVAIDAS--ALQFYSGGVF-NGYCE-TFLNHGVTAVGYG 284
               E SL     + PVSVAIDAS  + Q Y+ G++    C  T L+HGV  VGYG
Sbjct:   241 GSEISLENGAQHGPVSVAIDASHNSFQLYTSGIYYEPKCSPTELDHGVLVVGYG 294

 Score = 112 (44.5 bits), Expect = 8.7e-53, Sum P(2) = 8.7e-53
 Identities = 18/42 (42%), Positives = 27/42 (64%)

Query:   292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
             YW++KNSWG  WG  GY  + +D    +  CGIA  +S+P++
Sbjct:   338 YWIVKNSWGTSWGIKGYILMSKD---RKNNCGIASVSSYPLA 376


>UNIPROTKB|P25774 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0016020 "membrane"
            evidence=IEA] [GO:0005576 "extracellular region" evidence=NAS]
            [GO:0005764 "lysosome" evidence=IDA;NAS] [GO:0097067 "cellular
            response to thyroid hormone stimulus" evidence=IEP] [GO:0002250
            "adaptive immune response" evidence=IEP] [GO:0019882 "antigen
            processing and presentation" evidence=TAS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=TAS] [GO:0006955
            "immune response" evidence=TAS] [GO:0002474 "antigen processing and
            presentation of peptide antigen via MHC class I" evidence=TAS]
            [GO:0002480 "antigen processing and presentation of exogenous
            peptide antigen via MHC class I, TAP-independent" evidence=TAS]
            [GO:0019886 "antigen processing and presentation of exogenous
            peptide antigen via MHC class II" evidence=TAS] [GO:0036021
            "endolysosome lumen" evidence=TAS] [GO:0042590 "antigen processing
            and presentation of exogenous peptide antigen via MHC class I"
            evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
            [GO:0045087 "innate immune response" evidence=TAS] [GO:0043231
            "intracellular membrane-bounded organelle" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 Reactome:REACT_118779
            Reactome:REACT_6900 GO:GO:0005576 GO:GO:0002480 GO:GO:0016020
            GO:GO:0019886 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087 EMBL:CH471121
            GO:GO:0004197 GO:GO:0002250 HOVERGEN:HBG011513 GO:GO:0097067
            GO:GO:0036021 EMBL:AL356292 CTD:1520 KO:K01368 OMA:KAMDQKC
            OrthoDB:EOG4JM7Q2 EMBL:S93414 EMBL:M86553 EMBL:M90696 EMBL:U07374
            EMBL:U07370 EMBL:U07371 EMBL:U07372 EMBL:U07373 EMBL:CR541676
            EMBL:AK301472 EMBL:AK314482 EMBL:BC002642 IPI:IPI00299150
            IPI:IPI00910216 PIR:A42482 RefSeq:NP_001186668.1 RefSeq:NP_004070.3
            UniGene:Hs.181301 PDB:1BXF PDB:1GLO PDB:1MS6 PDB:1NPZ PDB:1NQC
            PDB:2C0Y PDB:2F1G PDB:2FQ9 PDB:2FRA PDB:2FRQ PDB:2FT2 PDB:2FUD
            PDB:2FYE PDB:2G6D PDB:2G7Y PDB:2H7J PDB:2HH5 PDB:2HHN PDB:2HXZ
            PDB:2OP3 PDB:2R9M PDB:2R9N PDB:2R9O PDB:3IEJ PDB:3KWN PDB:3MPE
            PDB:3MPF PDB:3N3G PDB:3N4C PDB:3OVX PDBsum:1BXF PDBsum:1GLO
            PDBsum:1MS6 PDBsum:1NPZ PDBsum:1NQC PDBsum:2C0Y PDBsum:2F1G
            PDBsum:2FQ9 PDBsum:2FRA PDBsum:2FRQ PDBsum:2FT2 PDBsum:2FUD
            PDBsum:2FYE PDBsum:2G6D PDBsum:2G7Y PDBsum:2H7J PDBsum:2HH5
            PDBsum:2HHN PDBsum:2HXZ PDBsum:2OP3 PDBsum:2R9M PDBsum:2R9N
            PDBsum:2R9O PDBsum:3IEJ PDBsum:3KWN PDBsum:3MPE PDBsum:3MPF
            PDBsum:3N3G PDBsum:3N4C PDBsum:3OVX ProteinModelPortal:P25774
            SMR:P25774 IntAct:P25774 STRING:P25774 MEROPS:I29.004
            PhosphoSite:P25774 DMDM:88984046 PaxDb:P25774 PeptideAtlas:P25774
            PRIDE:P25774 DNASU:1520 Ensembl:ENST00000368985
            Ensembl:ENST00000448301 GeneID:1520 KEGG:hsa:1520 UCSC:uc001evn.3
            GeneCards:GC01M150702 HGNC:HGNC:2545 HPA:CAB000460 HPA:HPA002988
            MIM:116845 neXtProt:NX_P25774 PharmGKB:PA27041 InParanoid:P25774
            PhylomeDB:P25774 BRENDA:3.4.22.27 BindingDB:P25774
            ChEMBL:CHEMBL2954 ChiTaRS:CTSS EvolutionaryTrace:P25774
            GenomeRNAi:1520 NextBio:6291 PMAP-CutDB:P25774 ArrayExpress:P25774
            Bgee:P25774 CleanEx:HS_CTSS Genevestigator:P25774
            GermOnline:ENSG00000163131 Uniprot:P25774
        Length = 331

 Score = 545 (196.9 bits), Expect = 1.3e-52, P = 1.3e-52
 Identities = 137/342 (40%), Positives = 191/342 (55%)

Query:     6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
             L+ VL++   C+S A  +   + ++   +  WK  YG+ YKE  E + R  I++ NL  V
Sbjct:     4 LVCVLLV---CSS-AVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59

Query:    66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-- 122
                N   ++G  SY L +N   D+T +E ++  +  ++    S  + N T   YKS+   
Sbjct:    60 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNPNR 113

Query:   123 -VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
              +P SV+W EKG VT VKYQG C       AV A+E    +K  +LVSLS Q LVDC+T 
Sbjct:   114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173

Query:   175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
                N GC GGFM  AF+YII NKGI +DA Y Y+ M    C    ++  AA  + Y ++P
Sbjct:   174 KYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQK-CQ-YDSKYRAATCSKYTELP 231

Query:   234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
                E+ L +AVAN+ PVSV +DA    F+   SG  +   C   +NHGV  VGYG    G
Sbjct:   232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLN-G 290

Query:   290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
              +YWL+KNSWG ++GE+GY R+ R+       CGIA F S+P
Sbjct:   291 KEYWLVKNSWGHNFGEEGYIRMARN---KGNHCGIASFPSYP 329


>ZFIN|ZDB-GENE-030131-572 [details] [associations]
            symbol:wu:fb37b09 "wu:fb37b09" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030131-572 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:BX465840 IPI:IPI00866294 RefSeq:XP_001923796.1
            UniGene:Dr.25683 PRIDE:E9QBE2 Ensembl:ENSDART00000133962
            GeneID:321853 KEGG:dre:321853 NextBio:20807556 Uniprot:E9QBE2
        Length = 335

 Score = 544 (196.6 bits), Expect = 1.7e-52, P = 1.7e-52
 Identities = 126/345 (36%), Positives = 191/345 (55%)

Query:     4 YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
             + L+V L IS   A+ +     D+      +  WK+Q+G++Y E  E  +R  I+++NL 
Sbjct:     3 FALLVTLYISAVFAAPSIDIQLDD-----HWNSWKSQHGKSYHEDVEVGRRM-IWEENLR 56

Query:    64 AVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY-KSS 121
              +E+ N   ++GN ++ + +N+F D+T +EF  +  G+K   H  +  + G  F+  K  
Sbjct:    57 KIEQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYK---HDPNRTSQGPLFMEPKFF 113

Query:   122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
               P  V+W ++G VTPVK Q QC       +  A+EG    K  +L+S+SEQ LVDC+  
Sbjct:   114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173

Query:   175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
               N GC GG MD AF+Y+ +NKG+ ++  Y Y       C       + A+IT + D+P 
Sbjct:   174 HGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPC-RYDPRFNVAKITGFVDIPK 232

Query:   235 NDEESLLKAVANQ-PVSVAIDAS--ALQFYSGGVF-NGYCETFLNHGVTAVGYG---TSE 287
              +E +L+ AVA   PVSVAIDAS  +LQFY  G++    C + L+H V  VGYG      
Sbjct:   233 GNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSQLDHAVLVVGYGYQGADV 292

Query:   288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
              G +YW++KNSW   WG+ GY  + +D       CGIA  AS+P+
Sbjct:   293 AGNRYWIVKNSWSDKWGDKGYIYMAKD---KNNHCGIATMASYPL 334


>UNIPROTKB|P25326 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9913 "Bos taurus"
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0016020 "membrane" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0002250 "adaptive
            immune response" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0016020 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0002250 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            GO:GO:0097067 EMBL:BC102245 EMBL:M95211 EMBL:X62001 IPI:IPI00702008
            PIR:S15844 RefSeq:NP_001028787.1 UniGene:Bt.7938
            ProteinModelPortal:P25326 SMR:P25326 STRING:P25326 PRIDE:P25326
            Ensembl:ENSBTAT00000022774 GeneID:327711 KEGG:bta:327711 CTD:1520
            InParanoid:P25326 KO:K01368 OMA:KAMDQKC OrthoDB:EOG4JM7Q2
            NextBio:20810175 Uniprot:P25326
        Length = 331

 Score = 540 (195.1 bits), Expect = 4.4e-52, P = 4.4e-52
 Identities = 135/332 (40%), Positives = 185/332 (55%)

Query:    16 CASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIG 74
             C+S A      + ++   ++ WK  YG+ YKE  E   R  I++ NL  V   N   ++G
Sbjct:    11 CSS-AMAHVHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVTLHNLEHSMG 69

Query:    75 NRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS---QVPPSVNWIE 131
               SY L +N   D+T +E I+  +  ++    S    N T   YKS    ++P S++W E
Sbjct:    70 MHSYELGMNHLGDMTSEEVISLMSSLRVP---SQWPRNVT---YKSDPNQKLPDSMDWRE 123

Query:   132 KGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGG 183
             KG VT VKYQG C       AV A+E    +K  +LVSLS Q LVDC+T    N GC GG
Sbjct:   124 KGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGG 183

Query:   184 FMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKA 243
             FM +AF+YII N GI ++A Y Y+ M  G C     ++ AA  + Y ++P   EE+L +A
Sbjct:   184 FMTEAFQYIIDNNGIDSEASYPYKAMD-GKCQ-YDVKNRAATCSRYIELPFGSEEALKEA 241

Query:   244 VANQ-PVSVAIDASALQF--YSGGVF-NGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSW 299
             VAN+ PVSV IDAS   F  Y  GV+ +  C   +NHGV  VGYG  + G  YWL+KNSW
Sbjct:   242 VANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNLD-GKDYWLVKNSW 300

Query:   300 GQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
             G  +G+ GY R+ R+       CGIA + S+P
Sbjct:   301 GLHFGDQGYIRMARNSGN---HCGIANYPSYP 329


>ZFIN|ZDB-GENE-071004-74 [details] [associations]
            symbol:zgc:174855 "zgc:174855" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-071004-74
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 MEROPS:C01.032 EMBL:BX000534 EMBL:BC152282
            IPI:IPI00773140 RefSeq:NP_001096592.1 UniGene:Dr.104905 SMR:A7MCR6
            STRING:A7MCR6 Ensembl:ENSDART00000109968 GeneID:569326
            KEGG:dre:569326 NextBio:20889622 Uniprot:A7MCR6
        Length = 335

 Score = 540 (195.1 bits), Expect = 4.4e-52, P = 4.4e-52
 Identities = 123/343 (35%), Positives = 192/343 (55%)

Query:     6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
             ++  L+I+   ++  T  + D   + + +  WK+Q+G++Y E  E  +R  I+++NL  +
Sbjct:     1 MMFALLITLCISAVFTAPSIDI-QLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58

Query:    66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QV 123
             E+ N   ++GN ++ + +N+F D+T +EF  +  G+K   + +S    G  F+  S    
Sbjct:    59 EQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKQDPNRTS---KGALFMEPSFFAA 115

Query:   124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
             P  V+W ++G VTPVK Q QC       +  A+EG    K  +L+S+SEQ LVDC+    
Sbjct:   116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175

Query:   177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
             N GC GG MD AF+Y+ +NKG+ ++  Y Y       C       + A+IT + D+P  +
Sbjct:   176 NQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPC-RYDPRFNVAKITGFVDIPRGN 234

Query:   237 EESLLKAVANQ-PVSVAIDAS--ALQFYSGGVF-NGYCETFLNHGVTAVGYG---TSEEG 289
             E +L+ AVA   PVSVAIDAS  +LQFY  G++    C + L+H V  VGYG       G
Sbjct:   235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAG 294

Query:   290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
              +YW++KNSW   WG+ GY  + +D       CGIA  AS+P+
Sbjct:   295 NRYWIVKNSWSDKWGDKGYIYMAKD---KNNHCGIATMASYPL 334


>UNIPROTKB|Q28944 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9823 "Sus scrofa"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 KO:K01365 OrthoDB:EOG48PMKF MEROPS:C01.032
            CTD:1514 EMBL:D37917 EMBL:AJ315771 PIR:A58195 RefSeq:NP_999057.1
            UniGene:Ssc.54036 ProteinModelPortal:Q28944 SMR:Q28944
            STRING:Q28944 Ensembl:ENSSSCT00000012233 GeneID:396926
            KEGG:ssc:396926 OMA:DASETGK ArrayExpress:Q28944 Uniprot:Q28944
        Length = 334

 Score = 538 (194.4 bits), Expect = 7.2e-52, P = 7.2e-52
 Identities = 127/338 (37%), Positives = 186/338 (55%)

Query:    10 LIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN 69
             L ++  C   A+     + ++   + +WKA +GR Y  + E  +R  +++ N+  +E  N
Sbjct:     5 LFLTALCLGIASAAPKLDQNLDADWYKWKATHGRLYGMNEEGWRR-AVWEKNMKMIELHN 63

Query:    70 NA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVN 128
                + G   +++ +N F D+T +EF     GF+   H      + +  L    +VP SV+
Sbjct:    64 QEYSQGKHGFSMAMNAFGDMTNEEFRQVMNGFQNQKHKKGKVFHESLVL----EVPKSVD 119

Query:   129 WIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCY 181
             W EKG VT VK QGQC       A  A+EG    K  +LVSLSEQ LVDC+    N GC 
Sbjct:   120 WREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCN 179

Query:   182 GGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLL 241
             GG MD+AF+Y+  N G+  +  Y Y G  T  C + K E  AA  T + D+P   E++L+
Sbjct:   180 GGLMDNAFQYVKDNGGLDTEESYPYLGRETNSC-TYKPECSAANDTGFVDIPQR-EKALM 237

Query:   242 KAVANQ-PVSVAIDA--SALQFYSGGVF-NGYCETF-LNHGVTAVGYG---TSEEGIKYW 293
             KAVA   P+SVAIDA  S+ QFY  G++ +  C +  L+HGV  VGYG   T     K+W
Sbjct:   238 KAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFW 297

Query:   294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
             ++KNSWG +WG +GY ++ +D       CGI+  AS+P
Sbjct:   298 IVKNSWGPEWGWNGYVKMAKD---QNNHCGISTAASYP 332


>ZFIN|ZDB-GENE-030131-106 [details] [associations]
            symbol:ctsl1a "cathepsin L, 1 a" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030131-106 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 HSSP:P43235
            KO:K01365 EMBL:BC066490 IPI:IPI00495935 RefSeq:NP_997749.1
            UniGene:Dr.104499 ProteinModelPortal:Q6NYR5 SMR:Q6NYR5
            MEROPS:C01.074 PRIDE:Q6NYR5 GeneID:321453 KEGG:dre:321453
            CTD:321453 InParanoid:Q6NYR5 NextBio:20807387 ArrayExpress:Q6NYR5
            Bgee:Q6NYR5 Uniprot:Q6NYR5
        Length = 337

 Score = 538 (194.4 bits), Expect = 7.2e-52, P = 7.2e-52
 Identities = 125/326 (38%), Positives = 181/326 (55%)

Query:    24 TFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRL 82
             T D+  + + ++QWK  + + Y  + E  +R  I++ NL  +E  N   ++G  +Y L +
Sbjct:    20 TLDQ-QLNDHWDQWKKWHSKKYHATEEGWRRV-IWEKNLKKIEMHNLEHSMGIHTYRLGM 77

Query:    83 NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQ 141
             N F D+T +EF     GFK   H    +  G+ F+  +  +VP  ++W EKG VTPVK Q
Sbjct:    78 NHFGDMTHEEFRQVMNGFK---HKKDRRFRGSLFMEPNFIEVPNKLDWREKGYVTPVKDQ 134

Query:   142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
             G+C          A+EG    K  +LVSLSEQ LVDC+  + N GC GG MD AF+Y+  
Sbjct:   135 GECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKD 194

Query:   195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAI 253
               G+ ++  Y Y G     C     ++ AA  T + D+P   E +L+KA+A   PVSVAI
Sbjct:   195 QNGLDSEESYPYLGTDDQPCH-FDPKNSAANDTGFVDIPSGKERALMKAIAAVGPVSVAI 253

Query:   254 DAS--ALQFYSGGVF-NGYCETF-LNHGVTAVGYGTSEE---GIKYWLIKNSWGQDWGED 306
             DA   + QFY  G++    C +  L+HGV AVGYG   E   G KYW++KNSW ++WG+ 
Sbjct:   254 DAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVDGKKYWIVKNSWSENWGDK 313

Query:   307 GYFRLQRDIDQPQGQCGIAMFASFPV 332
             GY  + +D       CGIA  AS+P+
Sbjct:   314 GYIYMAKD---RHNHCGIATAASYPL 336


>UNIPROTKB|Q8HY81 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            CTD:1520 KO:K01368 OrthoDB:EOG4JM7Q2 EMBL:AY156692
            RefSeq:NP_001002938.2 UniGene:Cfa.1661 ProteinModelPortal:Q8HY81
            SMR:Q8HY81 STRING:Q8HY81 MEROPS:C01.034 GeneID:403400
            KEGG:cfa:403400 InParanoid:Q8HY81 NextBio:20816922 Uniprot:Q8HY81
        Length = 331

 Score = 536 (193.7 bits), Expect = 1.2e-51, P = 1.2e-51
 Identities = 130/327 (39%), Positives = 180/327 (55%)

Query:    18 SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNR 76
             S A  +   + ++   +  WK  Y + YKE  E   R  I++ NL  V   N   ++G  
Sbjct:    12 SYAVAQVHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMH 71

Query:    77 SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVT 136
             SY L +N   D+T +E I+     ++    S  + N T     + ++P SV+W EKG VT
Sbjct:    72 SYDLGMNHLGDMTGEEVISLMGSLRVP---SQWQRNVTYRSNSNQKLPDSVDWREKGCVT 128

Query:   137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDA 188
              VKYQG C       AV A+E    +K  +LVSLS Q LVDC+T    N GC GGFM  A
Sbjct:   129 EVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTA 188

Query:   189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
             F+YII N GI ++A Y Y+ M+ G C    ++  AA  + Y ++P   E++L +AVAN+ 
Sbjct:   189 FQYIIDNNGIDSEASYPYKAMN-GKC-RYDSKKRAATCSKYTELPFGSEDALKEAVANKG 246

Query:   248 PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
             PVSVAIDAS   F+   SG  +   C   +NHGV  VGYG    G  YWL+KNSWG ++G
Sbjct:   247 PVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNLN-GKDYWLVKNSWGLNFG 305

Query:   305 EDGYFRLQRDIDQPQGQCGIAMFASFP 331
             + GY R+ R+       CGIA + S+P
Sbjct:   306 DQGYIRMARNSGN---HCGIASYPSYP 329


>UNIPROTKB|P07711 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9606 "Homo sapiens"
            [GO:0005576 "extracellular region" evidence=NAS] [GO:0005764
            "lysosome" evidence=IDA;NAS] [GO:0006508 "proteolysis"
            evidence=IDA] [GO:0002250 "adaptive immune response" evidence=IEP]
            [GO:0019882 "antigen processing and presentation" evidence=TAS]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0036021 "endolysosome lumen" evidence=TAS] [GO:0043202
            "lysosomal lumen" evidence=TAS] [GO:0045087 "innate immune
            response" evidence=TAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042393 "histone binding" evidence=IDA] [GO:0005634 "nucleus"
            evidence=TAS] [GO:0071888 "macrophage apoptotic process"
            evidence=NAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            Reactome:REACT_118779 EMBL:X12451 GO:GO:0005634 Reactome:REACT_6900
            GO:GO:0005576 GO:GO:0019886 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087
            GO:GO:0042393 GO:GO:0004197 GO:GO:0002250 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0036021 KO:K01365 OrthoDB:EOG48PMKF EMBL:M20496
            EMBL:CR457053 EMBL:BX537395 EMBL:AL160279 EMBL:BC012612 EMBL:X05256
            IPI:IPI00012887 PIR:S01002 RefSeq:NP_001244900.1
            RefSeq:NP_001244901.1 RefSeq:NP_001903.1 RefSeq:NP_666023.1
            UniGene:Hs.731507 UniGene:Hs.731952 PDB:1CJL PDB:1CS8 PDB:1ICF
            PDB:1MHW PDB:2NQD PDB:2VHS PDB:2XU1 PDB:2XU3 PDB:2XU4 PDB:2XU5
            PDB:2YJ2 PDB:2YJ8 PDB:2YJ9 PDB:2YJB PDB:2YJC PDB:3BC3 PDB:3H89
            PDB:3H8B PDB:3H8C PDB:3HHA PDB:3HWN PDB:3IV2 PDB:3K24 PDB:3KSE
            PDB:3OF8 PDB:3OF9 PDBsum:1CJL PDBsum:1CS8 PDBsum:1ICF PDBsum:1MHW
            PDBsum:2NQD PDBsum:2VHS PDBsum:2XU1 PDBsum:2XU3 PDBsum:2XU4
            PDBsum:2XU5 PDBsum:2YJ2 PDBsum:2YJ8 PDBsum:2YJ9 PDBsum:2YJB
            PDBsum:2YJC PDBsum:3BC3 PDBsum:3H89 PDBsum:3H8B PDBsum:3H8C
            PDBsum:3HHA PDBsum:3HWN PDBsum:3IV2 PDBsum:3K24 PDBsum:3KSE
            PDBsum:3OF8 PDBsum:3OF9 ProteinModelPortal:P07711 SMR:P07711
            IntAct:P07711 STRING:P07711 MEROPS:I29.001 PhosphoSite:P07711
            DMDM:115741 PaxDb:P07711 PeptideAtlas:P07711 PRIDE:P07711
            DNASU:1514 Ensembl:ENST00000340342 Ensembl:ENST00000343150
            GeneID:1514 KEGG:hsa:1514 UCSC:uc004aph.3 CTD:1514
            GeneCards:GC09P090341 H-InvDB:HIX0058839 H-InvDB:HIX0170314
            HGNC:HGNC:2537 HPA:CAB000459 MIM:116880 neXtProt:NX_P07711
            PharmGKB:PA162382890 InParanoid:P07711 OMA:REPLFAQ PhylomeDB:P07711
            BRENDA:3.4.22.15 BindingDB:P07711 ChEMBL:CHEMBL3837 ChiTaRS:CTSL1
            DrugBank:DB00040 EvolutionaryTrace:P07711 GenomeRNAi:1514
            NextBio:6271 PMAP-CutDB:P07711 ArrayExpress:P07711 Bgee:P07711
            CleanEx:HS_CTSL1 Genevestigator:P07711 GermOnline:ENSG00000135047
            GO:GO:0071888 Uniprot:P07711
        Length = 333

 Score = 534 (193.0 bits), Expect = 1.9e-51, P = 1.9e-51
 Identities = 131/339 (38%), Positives = 184/339 (54%)

Query:    10 LIISGSCASQATYR-TFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF 68
             LI++  C   A+   TFD  S+  ++ +WKA + R Y  + E  +R  +++ N+  +E  
Sbjct:     5 LILAAFCLGIASATLTFDH-SLEAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELH 62

Query:    69 NNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSV 127
             N     G  S+T+ +N F D+T +EF     GF+        K    P  Y++   P SV
Sbjct:    63 NQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKG-KVFQEPLFYEA---PRSV 118

Query:   128 NWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
             +W EKG VTPVK QGQC       A  A+EG    K  RL+SLSEQ LVDC+    N GC
Sbjct:   119 DWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGC 178

Query:   181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL 240
              GG MD AF+Y+  N G+ ++  Y YE      C     +   A  T + D+P   E++L
Sbjct:   179 NGGLMDYAFQYVQDNGGLDSEESYPYEATEES-C-KYNPKYSVANDTGFVDIP-KQEKAL 235

Query:   241 LKAVANQ-PVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVTAVGYG---TSEEGIKY 292
             +KAVA   P+SVAIDA   +  FY  G+ F   C +  ++HGV  VGYG   T  +  KY
Sbjct:   236 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKY 295

Query:   293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
             WL+KNSWG++WG  GY ++ +D    +  CGIA  AS+P
Sbjct:   296 WLVKNSWGEEWGMGGYVKMAKD---RRNHCGIASAASYP 331


>UNIPROTKB|F1PAK0 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9615 "Canis lupus
            familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:AAEX03011051 Ensembl:ENSCAFT00000019176 OMA:YEPACTQ
            Uniprot:F1PAK0
        Length = 339

 Score = 532 (192.3 bits), Expect = 3.1e-51, P = 3.1e-51
 Identities = 129/327 (39%), Positives = 180/327 (55%)

Query:    18 SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNR 76
             S A  +   + ++   +  WK  Y + YKE  E   R  I++ NL  V   N   ++G  
Sbjct:    20 SYAVAQVHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMH 79

Query:    77 SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVT 136
             SY L +N   D+T +E I+     ++    S  + N T     + ++P SV+W EKG VT
Sbjct:    80 SYDLGMNHLGDMTGEEVISLMGSLRVP---SQWQRNVTYRSNSNQKLPDSVDWREKGCVT 136

Query:   137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDA 188
              VKYQG C       AV A+E    +K  +LVSLS Q LVDC+T    N GC GGFM  A
Sbjct:   137 EVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTA 196

Query:   189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
             F+YII N GI ++A Y Y+ ++ G C    ++  AA  + Y ++P   E++L +AVAN+ 
Sbjct:   197 FQYIIDNNGIDSEASYPYKAVN-GKC-RYDSKKRAATCSKYTELPFGSEDALKEAVANKG 254

Query:   248 PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
             PVSVAIDAS   F+   SG  +   C   +NHGV  VGYG    G  YWL+KNSWG ++G
Sbjct:   255 PVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNLN-GKDYWLVKNSWGLNFG 313

Query:   305 EDGYFRLQRDIDQPQGQCGIAMFASFP 331
             + GY R+ R+       CGIA + S+P
Sbjct:   314 DQGYIRMARNSGN---HCGIASYPSYP 337


>ZFIN|ZDB-GENE-080215-7 [details] [associations]
            symbol:zgc:174153 "zgc:174153" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-080215-7
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 EMBL:BX000534 EMBL:BX322603
            IPI:IPI00483644 Ensembl:ENSDART00000113654 OMA:ITLCISA Bgee:F1R8Y0
            Uniprot:F1R8Y0
        Length = 336

 Score = 532 (192.3 bits), Expect = 3.1e-51, P = 3.1e-51
 Identities = 123/344 (35%), Positives = 191/344 (55%)

Query:     6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
             ++  LII+   ++  T  + D   + + +  WK+Q+G++Y E  E  +R  I+++NL  +
Sbjct:     1 MMFALIITLCISAVFTAPSIDI-QLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58

Query:    66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QV 123
             E+ N   + GN ++ + +N+F D+T +EF  +  G+K   H  +  + G  F+  S    
Sbjct:    59 EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYK---HDPNQTSQGPLFMEPSFFAA 115

Query:   124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
             P  V+W ++G VTPVK Q QC       +  A+EG    K  +L+S+SEQ LVDC+    
Sbjct:   116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175

Query:   177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
             N GC GG MD AF+Y+ +NKG+ ++  Y Y       C       + A+IT + D+P  +
Sbjct:   176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPC-RYDPRFNVAKITGFVDIPSGN 234

Query:   237 EESLLKAVANQ-PVSVAIDAS--ALQFYSGGVF-NGYCETF-LNHGVTAVGYG---TSEE 288
             E +L+ AVA   PVSVAIDAS  +LQFY  G++    C +  L+H V  VGYG       
Sbjct:   235 EPALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVA 294

Query:   289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             G +YW++KNSW   WG+ GY  + +D       CG+A  AS+P+
Sbjct:   295 GNRYWIVKNSWSDKWGDKGYIYMAKD---KNNHCGVATKASYPL 335


>UNIPROTKB|F1PMM9 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9615 "Canis lupus
            familiaris" [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00639 GO:GO:0004197 GO:GO:0002250
            GeneTree:ENSGT00560000076577 GO:GO:0097067 EMBL:AAEX03000499
            Ensembl:ENSCAFT00000002029 OMA:EFKQVLN Uniprot:F1PMM9
        Length = 341

 Score = 531 (192.0 bits), Expect = 4.0e-51, P = 4.0e-51
 Identities = 127/339 (37%), Positives = 183/339 (53%)

Query:    10 LIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN 69
             L ++  C   A+     + S+   + QWK  +G+ Y +  E  +R  +++ N+  +E+ N
Sbjct:    13 LFLAALCLGIASAAPQQDHSLDAHWSQWKEAHGKLYDKDEEGWRR-TVWERNMEMIEQHN 71

Query:    70 NA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVN 128
                + G  S+TL +N F D+T +EF      FK+  H    K    P     ++VP SV+
Sbjct:    72 QEYSQGEHSFTLAMNAFGDMTNEEFKQVLNDFKIQKHKKG-KVFPAPLF---AEVPSSVD 127

Query:   129 WIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCY 181
             W E+G VTPVK QGQC       A  A+EG    K  +LVSLSEQ LVDC+ +  N GC 
Sbjct:   128 WREQGYVTPVKDQGQCLGCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWSQGNRGCN 187

Query:   182 GGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLL 241
             GG M+ AF+Y+  N G+ ++  Y Y   +   C   + E  AA +T +  +  N+E+ L+
Sbjct:   188 GGLMEYAFQYVKDNGGLDSEESYPYLARNEP-C-KYRPEKSAANVTAFWPIL-NEEDGLM 244

Query:   242 KAVANQ-PVSVAIDAS--ALQFYSGGVF-NGYCET-FLNHGVTAVGYG---TSEEGIKYW 293
               VA   PVS A+D+S  + QFY  G++ +  C    LNHGV  VGYG      +  KYW
Sbjct:   245 TTVATVGPVSAAVDSSPQSFQFYKKGIYYDPKCSNKLLNHGVLVVGYGFEGAESDNKKYW 304

Query:   294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             ++KNSWG +WG  GY  L +D D     CGIA  AS+PV
Sbjct:   305 IVKNSWGTNWGMQGYMLLAKDRDN---HCGIATRASYPV 340


>WB|WBGene00000776 [details] [associations]
            symbol:cpl-1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0040010 "positive regulation
            of growth rate" evidence=IMP] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] [GO:0040011
            "locomotion" evidence=IMP] [GO:0070265 "necrotic cell death"
            evidence=IMP] [GO:0031983 "vesicle lumen" evidence=IDA] [GO:0042718
            "yolk granule" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0009792 GO:GO:0040010 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0040011
            GO:GO:0031983 GO:GO:0070265 GeneTree:ENSGT00660000095458 KO:K01365
            GO:GO:0042718 MEROPS:I29.009 EMBL:Z92812 GeneID:180111
            KEGG:cel:CELE_T03E6.7 CTD:180111 PIR:T24387 RefSeq:NP_001256718.1
            HSSP:P80067 ProteinModelPortal:O45734 SMR:O45734 DIP:DIP-26616N
            IntAct:O45734 MINT:MINT-211563 STRING:O45734 PaxDb:O45734
            EnsemblMetazoa:T03E6.7.1 EnsemblMetazoa:T03E6.7.2 UCSC:T03E6.7.1
            WormBase:T03E6.7a InParanoid:O45734 OMA:HIENHNR NextBio:908128
            Uniprot:O45734
        Length = 337

 Score = 531 (192.0 bits), Expect = 4.0e-51, P = 4.0e-51
 Identities = 130/342 (38%), Positives = 187/342 (54%)

Query:     5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
             F+++ L+ +    + A      E +I EK++ +K  + + Y ES E +   E F  N++ 
Sbjct:     4 FILLALVAAVVAVNSAKLSRQIESAI-EKWDDYKEDFDKEYSESEEQTY-MEAFVKNMIH 61

Query:    65 VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-Q 122
             +E  N +  +G +++ + LN  ADL   ++     G++     S +K N + FL   + Q
Sbjct:    62 IENHNRDHRLGRKTFEMGLNHIADLPFSQY-RKLNGYRRLFGDSRIK-NSSSFLAPFNVQ 119

Query:   123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
             VP  V+W +   VT VK QG C       A  A+EG +A K+ +LVSLSEQ LVDC+T  
Sbjct:   120 VPDEVDWRDTHLVTDVKNQGMCGSCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKY 179

Query:   176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
              N+GC GG MD AF+YI  N G+  +  Y Y+G     C   K +   A    Y D P  
Sbjct:   180 GNHGCNGGLMDQAFEYIRDNHGVDTEESYPYKGRDMK-CHFNK-KTVGADDKGYVDTPEG 237

Query:   236 DEESLLKAVANQ-PVSVAIDAS--ALQFYSGGVF-NGYCETF-LNHGVTAVGYGTSEEGI 290
             DEE L  AVA Q P+S+AIDA   + Q Y  GV+ +  C +  L+HGV  VGYGT  E  
Sbjct:   238 DEEQLKIAVATQGPISIAIDAGHRSFQLYKKGVYYDEECSSEELDHGVLLVGYGTDPEHG 297

Query:   291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
              YW++KNSWG  WGE GY R+ R+ +     CG+A  AS+P+
Sbjct:   298 DYWIVKNSWGAGWGEKGYIRIARNRNN---HCGVATKASYPL 336


>UNIPROTKB|P25975 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 EMBL:X91755 EMBL:BC102312 EMBL:AB017648
            IPI:IPI00687440 PIR:S15845 RefSeq:NP_776457.1 UniGene:Bt.3987
            ProteinModelPortal:P25975 SMR:P25975 STRING:P25975
            Ensembl:ENSBTAT00000022710 Ensembl:ENSBTAT00000036427 GeneID:281108
            KEGG:bta:281108 CTD:1515 InParanoid:P25975 KO:K01365 OMA:EEFRATH
            OrthoDB:EOG48PMKF BindingDB:P25975 ChEMBL:CHEMBL2113
            NextBio:20805179 ArrayExpress:P25975 Uniprot:P25975
        Length = 334

 Score = 530 (191.6 bits), Expect = 5.1e-51, P = 5.1e-51
 Identities = 125/314 (39%), Positives = 174/314 (55%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADLTPQE 92
             + QWKA + R Y  + E  +R  +++ N   ++  N   + G   + + +N F D+T +E
Sbjct:    29 WHQWKATHRRLYGMNEEEWRR-AVWEKNKKIIDLHNQEYSEGKHGFRMAMNAFGDMTNEE 87

Query:    93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
             F     GF+   H    K    P L     VP SV+W +KG VTPVK QGQC       A
Sbjct:    88 FRQVMNGFQNQKHKKG-KLFHEPLLV---DVPKSVDWTKKGYVTPVKNQGQCGSCWAFSA 143

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
               A+EG    K  +LVSLSEQ LVDC+    N GC GG MD+AF+YI  N G+ ++  Y 
Sbjct:   144 TGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGGLDSEESYP 203

Query:   206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDA--SALQFYS 262
             Y    T  C+  K E  AA  T + D+P   E++L+KAVA   P+SVAIDA  ++ QFY 
Sbjct:   204 YLATDTNSCN-YKPECSAANDTGFVDIPQR-EKALMKAVATVGPISVAIDAGHTSFQFYK 261

Query:   263 GGVF-NGYCETF-LNHGVTAVGYG---TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
              G++ +  C +  L+HGV  VGYG   T     K+W++KNSWG +WG +GY ++ +D   
Sbjct:   262 SGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGWNGYVKMAKD--- 318

Query:   318 PQGQCGIAMFASFP 331
                 CGIA  AS+P
Sbjct:   319 QNNHCGIATAASYP 332


>UNIPROTKB|G1K2A7 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9615 "Canis lupus
            familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 PANTHER:PTHR12411:SF55 OMA:LKVPPSH
            EMBL:AAEX03011051 Ensembl:ENSCAFT00000019202 Uniprot:G1K2A7
        Length = 333

 Score = 530 (191.6 bits), Expect = 5.1e-51, P = 5.1e-51
 Identities = 124/326 (38%), Positives = 187/326 (57%)

Query:    20 ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSY 78
             A++  + E  +  +++ WK  Y + Y    +   R  I++ NL  +   N  A++G  +Y
Sbjct:    16 ASFALYPEEILDTQWDLWKKTYRKQYNSKVDELSRRLIWEKNLKHISIHNLEASLGVHTY 75

Query:    79 TLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTP 137
              L +N   D+T +E +   TG K+    S  ++N T ++    S+ P SV++ +KG VTP
Sbjct:    76 ELAMNHLGDMTSEEVVQKMTGLKVPPSHS--RSNDTLYIPDWESRAPDSVDYRKKGYVTP 133

Query:   138 VKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
             VK QGQC       +V A+EG    K  +L++LS Q LVDC +   N+GC GG+M +AF+
Sbjct:   134 VKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDGCGGGYMTNAFQ 191

Query:   191 YIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PV 249
             Y+ +N+GI ++  Y Y G     C        AA+   Y ++P  +E++L +AVA   P+
Sbjct:   192 YVQKNRGIDSEDAYPYVGQDES-C-MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPI 249

Query:   250 SVAIDAS--ALQFYSGGVF-NGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
             SVAIDAS  + QFYS GV+ +  C +  LNH V AVGYG  ++G K+W+IKNSWG++WG 
Sbjct:   250 SVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGI-QKGNKHWIIKNSWGENWGN 308

Query:   306 DGYFRLQRDIDQPQGQCGIAMFASFP 331
              GY  + R+       CGIA  ASFP
Sbjct:   309 KGYILMARN---KNNACGIANLASFP 331


>UNIPROTKB|Q3ZKN1 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:AY738221
            RefSeq:NP_001029168.1 UniGene:Cfa.588 HSSP:P43235
            ProteinModelPortal:Q3ZKN1 SMR:Q3ZKN1 STRING:Q3ZKN1 GeneID:608843
            KEGG:cfa:608843 InParanoid:Q3ZKN1 NextBio:20894470 Uniprot:Q3ZKN1
        Length = 330

 Score = 530 (191.6 bits), Expect = 5.1e-51, P = 5.1e-51
 Identities = 124/326 (38%), Positives = 187/326 (57%)

Query:    20 ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSY 78
             A++  + E  +  +++ WK  Y + Y    +   R  I++ NL  +   N  A++G  +Y
Sbjct:    13 ASFALYPEEILDTQWDLWKKTYRKQYNSKVDELSRRLIWEKNLKHISIHNLEASLGVHTY 72

Query:    79 TLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTP 137
              L +N   D+T +E +   TG K+    S  ++N T ++    S+ P SV++ +KG VTP
Sbjct:    73 ELAMNHLGDMTSEEVVQKMTGLKVPPSHS--RSNDTLYIPDWESRAPDSVDYRKKGYVTP 130

Query:   138 VKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
             VK QGQC       +V A+EG    K  +L++LS Q LVDC +   N+GC GG+M +AF+
Sbjct:   131 VKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDGCGGGYMTNAFQ 188

Query:   191 YIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PV 249
             Y+ +N+GI ++  Y Y G     C        AA+   Y ++P  +E++L +AVA   P+
Sbjct:   189 YVQKNRGIDSEDAYPYVGQDES-C-MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPI 246

Query:   250 SVAIDAS--ALQFYSGGVF-NGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
             SVAIDAS  + QFYS GV+ +  C +  LNH V AVGYG  ++G K+W+IKNSWG++WG 
Sbjct:   247 SVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGI-QKGNKHWIIKNSWGENWGN 305

Query:   306 DGYFRLQRDIDQPQGQCGIAMFASFP 331
              GY  + R+       CGIA  ASFP
Sbjct:   306 KGYILMARN---KNNACGIANLASFP 328


>RGD|708447 [details] [associations]
            symbol:Testin "testin gene" species:10116 "Rattus norvegicus"
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0030054 "cell junction" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 RGD:708447 GO:GO:0005576 GO:GO:0030054 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            MEROPS:C01.972 OMA:RYHAENS OrthoDB:EOG4XWG0N EMBL:U16858
            IPI:IPI00207173 PIR:I52525 PIR:PC1251 RefSeq:NP_775155.1
            UniGene:Rn.10029 ProteinModelPortal:P15242 SMR:P15242
            Ensembl:ENSRNOT00000024467 GeneID:286916 KEGG:rno:286916
            UCSC:RGD:708447 CTD:286916 InParanoid:P15242 NextBio:625036
            Genevestigator:P15242 GermOnline:ENSRNOG00000018028 Uniprot:P15242
        Length = 333

 Score = 530 (191.6 bits), Expect = 5.1e-51, P = 5.1e-51
 Identities = 124/343 (36%), Positives = 186/343 (54%)

Query:     6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
             +I VL ++  C    +     + S+  ++ +W+ ++G+TY  + E  KR  +++ N   +
Sbjct:     1 MIAVLFLAILCLEVDSTAPTPDPSLDVEWNEWRTKHGKTYNMNEERLKR-AVWEKNFKMI 59

Query:    66 ERFNNAAI-GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
             E  N   + G   +T+ +N F DLT  EF+   TGF+      +       FLY    VP
Sbjct:    60 ELHNWEYLEGRHDFTMAMNAFGDLTNIEFVKMMTGFQRQKIKKTHIFQDHQFLY----VP 115

Query:   125 PSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
               V+W + G VTPVK QG CA         ++EG    K  RL+ LSEQ L+DC  ++  
Sbjct:   116 KRVDWRQLGYVTPVKNQGHCASSWAFSATGSLEGQMFRKTERLIPLSEQNLLDCMGSNVT 175

Query:   178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
             +GC GGFM  AF+Y+  N G+  +  Y Y G     C    AE+ AA + ++  +P   E
Sbjct:   176 HGCSGGFMQYAFQYVKDNGGLATEESYPYRGQGRE-C-RYHAENSAANVRDFVQIP-GSE 232

Query:   238 ESLLKAVANQ-PVSVAIDAS--ALQFYSGGVF-NGYCE-TFLNHGVTAVGYG-TSEE--G 289
             E+L+KAVA   P+SVA+DAS  + QFY  G++    C+   LNH V  VGYG   EE  G
Sbjct:   233 EALMKAVAKVGPISVAVDASHGSFQFYGSGIYYEPQCKRVHLNHAVLVVGYGFEGEESDG 292

Query:   290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
               +WL+KNSWG++WG  GY +L +D       CGIA ++++P+
Sbjct:   293 NSFWLVKNSWGEEWGMKGYMKLAKDWSN---HCGIATYSTYPI 332


>ZFIN|ZDB-GENE-040718-61 [details] [associations]
            symbol:ctsl.1 "cathepsin L.1" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-040718-61
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            GeneTree:ENSGT00660000095458 MEROPS:C01.092 EMBL:FP015965
            EMBL:BC075887 IPI:IPI00513499 RefSeq:NP_001002368.1
            UniGene:Dr.85174 SMR:Q6DHT0 Ensembl:ENSDART00000017756
            GeneID:436641 KEGG:dre:436641 CTD:436641 InParanoid:Q6DHT0
            OMA:GGQMENA OrthoDB:EOG41ZFB9 NextBio:20831086 Uniprot:Q6DHT0
        Length = 334

 Score = 527 (190.6 bits), Expect = 1.1e-50, P = 1.1e-50
 Identities = 131/318 (41%), Positives = 180/318 (56%)

Query:    33 KFEQWKAQYGRTYKESAENSKRFEIFKDN--LVAVERFNNAAIGNRSYTLRLNKFADLTP 90
             +F  WK ++G++Y+ + E S R   +  N  LV V     A  G +SY L +  FAD++ 
Sbjct:    25 EFHAWKLKFGKSYRSAEEESHRQLTWLTNRKLVLVHNMM-ADQGLKSYRLGMTYFADMSN 83

Query:    91 QEFIASQTGFK--MSDHSSSLKANGTPF--LYKSSQVPPSVNWIEKGAVTPVKYQGQC-- 144
             +E+   Q  F+  +   +++    G+ F  L K++ VP +V+W +KG VT +K Q QC  
Sbjct:    84 EEY--RQLVFRGCLGSMNNTKARGGSTFFRLRKAAVVPDTVDWRDKGYVTDIKDQKQCGS 141

Query:   145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                  A  ++EG    K  +LVSLSEQQLVDC+ +  N GC GG MD AF+YI  NKG+ 
Sbjct:   142 CWAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGSYGNYGCDGGLMDQAFQYIEANKGLD 201

Query:   200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--S 256
              +  Y YE    G C         A  T Y D+   DE +L +AVA   P+SVAIDA  S
Sbjct:   202 TEDSYPYEAQD-GEC-RFNPSTVGASCTGYVDIASGDESALQEAVATIGPISVAIDAGHS 259

Query:   257 ALQFYSGGVFNGY-CETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
             + Q YS GV+N   C +  L+HGV AVGYG+S  G  YW++KNSWG DWG  GY  + R+
Sbjct:   260 SFQLYSSGVYNEPDCSSSELDHGVLAVGYGSSN-GDDYWIVKNSWGLDWGVQGYILMSRN 318

Query:   315 IDQPQGQCGIAMFASFPV 332
                   QCGIA  AS+P+
Sbjct:   319 ---KSNQCGIATAASYPL 333


>ZFIN|ZDB-GENE-980526-285 [details] [associations]
            symbol:ctsl1b "cathepsin L, 1 b" species:7955
            "Danio rerio" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005576
            "extracellular region" evidence=IDA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-980526-285 GO:GO:0005576 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:BX465840 IPI:IPI00498443 Ensembl:ENSDART00000145570
            Bgee:F1R7B3 Uniprot:F1R7B3
        Length = 352

 Score = 525 (189.9 bits), Expect = 1.7e-50, P = 1.7e-50
 Identities = 124/346 (35%), Positives = 189/346 (54%)

Query:     4 YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
             + L+V L IS   A+ +     D+      +  WK+Q+G++Y E  E  +R  I+++NL 
Sbjct:    19 FALLVTLYISAVFAAPSIDIQLDD-----HWNSWKSQHGKSYHEDVEVGRRM-IWEENLR 72

Query:    64 AVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS- 121
              +E+ N   + GN ++ + +N+F D+T +EF  +  G+    H  +  + G  F+  S  
Sbjct:    73 KIEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYT---HDPNQTSQGPLFMEPSFF 129

Query:   122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
               P  V+W ++G VTPVK Q QC       +  A+EG    K  +L+S+SEQ LVDC+  
Sbjct:   130 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 189

Query:   175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
               N GC GG MD AF+Y+ +NKG+ ++  Y Y       C       + A+IT + D+P 
Sbjct:   190 QGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPC-RYDPRFNVAKITGFVDIPS 248

Query:   235 NDEESLLKAVANQ-PVSVAIDAS--ALQFYSGGVF-NGYCETF-LNHGVTAVGYG---TS 286
              +E +L+ AVA   PVSVAIDAS  +LQFY  G++    C +  L+H V  VGYG     
Sbjct:   249 GNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGAD 308

Query:   287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
               G +YW++KNSW   WG+ GY  + +D       CG+A  AS+P+
Sbjct:   309 VAGNRYWIVKNSWSDKWGDKGYIYMAKD---KNNHCGVATKASYPL 351


>UNIPROTKB|Q9GLE3 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9823 "Sus scrofa"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0005576 "extracellular
            region" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005576 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 MEROPS:I29.007
            CTD:1513 KO:K01371 OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55
            OMA:LKVPPSH EMBL:AF292030 RefSeq:NP_999467.1 UniGene:Ssc.1020
            ProteinModelPortal:Q9GLE3 SMR:Q9GLE3 STRING:Q9GLE3
            Ensembl:ENSSSCT00000007283 GeneID:397569 KEGG:ssc:397569
            ArrayExpress:Q9GLE3 Uniprot:Q9GLE3
        Length = 330

 Score = 524 (189.5 bits), Expect = 2.2e-50, P = 2.2e-50
 Identities = 130/340 (38%), Positives = 191/340 (56%)

Query:     6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
             L VVL++     S A Y    E  +  ++E WK  Y + Y    +   R  I++ NL  +
Sbjct:     4 LKVVLLLP--VMSSALY---PEEILDTQWELWKKTYRKQYNSKVDEISRRLIWEKNLKHI 58

Query:    66 ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQV 123
                N  A++G  +Y L +N   D+T +E +   TG K+    S  ++N T ++     + 
Sbjct:    59 SIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPPSHS--RSNDTLYIPDWEGRT 116

Query:   124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
             P S+++ +KG VTPVK QGQC       +V A+EG    K  +L++LS Q LVDC +   
Sbjct:   117 PDSIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE-- 174

Query:   177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
             N+GC GG+M +AF+Y+ +N+GI ++  Y Y G     C        AA+   Y ++P  +
Sbjct:   175 NDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDEN-C-MYNPTGKAAKCRGYREIPEGN 232

Query:   237 EESLLKAVANQ-PVSVAIDAS--ALQFYSGGVF-NGYCETF-LNHGVTAVGYGTSEEGIK 291
             E++L +AVA   PVSVAIDAS  + QFYS GV+ +  C +  LNH V AVGYG  ++G K
Sbjct:   233 EKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGI-QKGKK 291

Query:   292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
             +W+IKNSWG++WG  GY  + R+       CGIA  ASFP
Sbjct:   292 HWIIKNSWGENWGNKGYILMARN---KNNACGIANLASFP 328


>DICTYBASE|DDB_G0278721 [details] [associations]
            symbol:cprD "cysteine proteinase 4" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0278721 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000024 EMBL:L36204 RefSeq:XP_641963.1
            ProteinModelPortal:P54639 SMR:P54639 MEROPS:C01.A57 PRIDE:P54639
            EnsemblProtists:DDB0214999 GeneID:8621695 KEGG:ddi:DDB_G0278721
            OMA:NAFADIT ProtClustDB:CLSZ2846820 Uniprot:P54639
        Length = 442

 Score = 414 (150.8 bits), Expect = 2.9e-50, Sum P(2) = 2.9e-50
 Identities = 114/293 (38%), Positives = 154/293 (52%)

Query:     7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
             + +L++S + A Q     F E      F  W   + RTY  S E + R++IFK N+  V 
Sbjct:     7 LCLLLVSYASAKQQ----FSELQYRNAFTNWMQAHQRTYS-SEEFNARYQIFKSNMDYVH 61

Query:    67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPS 126
             ++N+   G  +  L LN FAD+T QE+  +  G    D S+ +   GT      S   P+
Sbjct:    62 QWNSK--GGET-VLGLNVFADITNQEYRTTYLGTPF-DGSALI---GTEEEKIFSTPAPT 114

Query:   127 VNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINR---LVSLSEQQLVDCATNDN 176
             V+W  +GAVTP+K QGQC          + EG + I       LVSLSEQ L+DC+ +  
Sbjct:   115 VDWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSEQNLIDCSKSYG 174

Query:   177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
             NNGC GG M  AF+YII NKGI  ++ Y Y       C   K  +  AQI +Y++V    
Sbjct:   175 NNGCEGGLMTLAFEYIINNKGIDTESSYPYTAEDGKEC-KFKTSNIGAQIVSYQNVTSGS 233

Query:   237 EESLLKAVANQPVSVAIDAS--ALQFYSGGVF-NGYCE-TFLNHGVTAVGYGT 285
             E SL  A  N PVSVAIDAS  + Q Y  G++    C  T L+HGV  VGYG+
Sbjct:   234 EASLQSASNNAPVSVAIDASNESFQLYESGIYYEPACSPTQLDHGVLVVGYGS 286

 Score = 126 (49.4 bits), Expect = 2.9e-50, Sum P(2) = 2.9e-50
 Identities = 24/56 (42%), Positives = 31/56 (55%)

Query:   282 GYGTSEEGI-KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKES 336
             G G  E     YW++KNSWG  WG DGY  + +D +     CGIA  ASFP +  +
Sbjct:   390 GSGAVEASSGNYWIVKNSWGTSWGMDGYIFMSKDRNN---NCGIATMASFPTASSN 442

 Score = 39 (18.8 bits), Expect = 6.6e-05, Sum P(2) = 6.6e-05
 Identities = 10/20 (50%), Positives = 13/20 (65%)

Query:    95 ASQTGFKMSDHSSSLKANGT 114
             +S TG K S  SSS KA+ +
Sbjct:   302 SSSTGGKTSSSSSSGKASSS 321


>ZFIN|ZDB-GENE-040426-1583 [details] [associations]
            symbol:ctssa "cathepsin S, a" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-040426-1583
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            GeneTree:ENSGT00660000095458 EMBL:CR548627 IPI:IPI00491948
            UniGene:Dr.81560 SMR:Q1L8W8 Ensembl:ENSDART00000053638 OMA:RNTREER
            OrthoDB:EOG480HX9 Uniprot:Q1L8W8
        Length = 328

 Score = 522 (188.8 bits), Expect = 3.6e-50, P = 3.6e-50
 Identities = 127/316 (40%), Positives = 171/316 (54%)

Query:    30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADL 88
             +  ++  WK+Q+ +TY+ + E   R  ++K NL  +   N AA +G  SYTL LN+ +D+
Sbjct:    23 LTNQWTTWKSQHNKTYRNTREERLRRSVWKQNLQDILLHNEAAAVGLHSYTLGLNQLSDM 82

Query:    89 TPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC--- 144
             T  E +    G    D       N T F   S Q +P  VNW E G V+PV+ QG C   
Sbjct:    83 TADE-VNDMNGLLEEDFPD---VNAT-FSPPSLQTLPQRVNWTEHGMVSPVQNQGPCGSC 137

Query:   145 -AVAAVEGINAIKINR---LVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
              A +AV  + A    R   LV LS Q L+DC+ +  N GC GGF+  AF Y+IQN+GI +
Sbjct:   138 WAFSAVGSLEAQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRAFLYVIQNRGIDS 197

Query:   201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASALQ 259
                Y YE    G+C        A   T +  VP ++E +L  AVAN  PVSV I+A  L 
Sbjct:   198 STFYPYEHKE-GVC-RYSVSGRAGYCTGFRIVPRHNEAALQSAVANIGPVSVGINAKLLS 255

Query:   260 F--YSGGVFNG-YCETFL-NHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
             F  Y  G++N   C + L NH V  VGYG SE G  YWL+KNSWG  WGE+GY R+ R+ 
Sbjct:   256 FHRYRSGIYNDPKCSSALINHAVLVVGYG-SENGQDYWLVKNSWGTAWGENGYIRMARN- 313

Query:   316 DQPQGQCGIAMFASFP 331
                +  CGI+ F  +P
Sbjct:   314 ---KNMCGISSFGIYP 326


>ZFIN|ZDB-GENE-050522-559 [details] [associations]
            symbol:ctssb.1 "cathepsin S, b.1" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-050522-559 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 MEROPS:C01.034
            EMBL:BC095694 IPI:IPI00607338 UniGene:Dr.75553
            ProteinModelPortal:Q502H6 SMR:Q502H6 InParanoid:Q502H6
            ArrayExpress:Q502H6 Uniprot:Q502H6
        Length = 330

 Score = 522 (188.8 bits), Expect = 3.6e-50, P = 3.6e-50
 Identities = 130/330 (39%), Positives = 178/330 (53%)

Query:    16 CASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIG 74
             C S A    F+  ++ + +E WK  YG+ Y    E   R ++++ NL  +   N  A++G
Sbjct:    11 CCSAALAH-FNT-NLDQHWELWKKTYGKIYTTEVEEFGRRQLWERNLQLITVHNLEASMG 68

Query:    75 NRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKG 133
               SY L +N   DLT +E +  QT   ++   S  K      +  S   VP S++W EKG
Sbjct:    69 MHSYDLSMNHMGDLTTEEIL--QT-LALTHVPSGFKRQIANIVGSSGDAVPDSLDWREKG 125

Query:   134 AVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMD 186
              V+ VK QG C       +V A+EG       +LV LS Q LVDC++   N GC GGFM 
Sbjct:   126 YVSSVKMQGACGSCWAFSSVGALEGQLKKTTGKLVDLSPQNLVDCSSKYGNKGCNGGFMS 185

Query:   187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
             DAF+Y+I N GI +D+ Y Y G+    C S  +   AA  T Y  V   DE +L +AVA+
Sbjct:   186 DAFQYVIDNGGIASDSAYPYRGVQQQ-C-SYSSSQRAANCTKYYFVRQGDENALKQAVAS 243

Query:   247 Q-PVSVAIDASALQF--YSGGVFNG-YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
               P+SVAIDA+  QF  Y  GV+N   C   +NH V  VGYGT   G  +WL+KNSWG  
Sbjct:   244 VGPISVAIDATRPQFVLYHSGVYNDPTCSKRVNHAVLVVGYGTLS-GQDHWLVKNSWGTR 302

Query:   303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             +G+ GY R+ R+       CGIA +A +PV
Sbjct:   303 FGDGGYIRMARN---KNNMCGIASYACYPV 329


>UNIPROTKB|Q5E968 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:BT021052
            EMBL:BC109853 IPI:IPI00709374 RefSeq:NP_001029607.1
            UniGene:Bt.23218 ProteinModelPortal:Q5E968 SMR:Q5E968 STRING:Q5E968
            MEROPS:I29.007 PRIDE:Q5E968 Ensembl:ENSBTAT00000028016
            GeneID:513038 KEGG:bta:513038 CTD:1513 InParanoid:Q5E968 KO:K01371
            OrthoDB:EOG4SJ5FC NextBio:20870669 PANTHER:PTHR12411:SF55
            Uniprot:Q5E968
        Length = 329

 Score = 521 (188.5 bits), Expect = 4.6e-50, P = 4.6e-50
 Identities = 122/325 (37%), Positives = 185/325 (56%)

Query:    21 TYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYT 79
             ++  + E  +  ++E WK  Y + Y    +   R  I++ NL  +   N  A++G  +Y 
Sbjct:    13 SFALYPEEILDTQWELWKKTYRKQYNSKGDEISRRLIWEKNLKHISIHNLEASLGVHTYE 72

Query:    80 LRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTPV 138
             L +N   D+T +E +   TG K+   +S  ++N T ++     + P SV++ +KG VTPV
Sbjct:    73 LAMNHLGDMTSEEVVQKMTGLKVP--ASRSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPV 130

Query:   139 KYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
             K QGQC       +V A+EG    K  +L++LS Q LVDC +   N+GC GG+M +AF+Y
Sbjct:   131 KNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDGCGGGYMTNAFQY 188

Query:   192 IIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVS 250
             + +N+GI ++  Y Y G     C        AA+   Y ++P  +E++L +AVA   P+S
Sbjct:   189 VQKNRGIDSEDAYPYVGQDEN-C-MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPIS 246

Query:   251 VAIDAS--ALQFYSGGVF-NGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
             VAIDAS  + QFY  GV+ +  C +  LNH V AVGYG  ++G K+W+IKNSWG++WG  
Sbjct:   247 VAIDASLTSFQFYRKGVYYDENCNSDNLNHAVLAVGYGI-QKGNKHWIIKNSWGENWGNK 305

Query:   307 GYFRLQRDIDQPQGQCGIAMFASFP 331
             GY  + R+       CGIA  ASFP
Sbjct:   306 GYILMARN---KNNACGIANLASFP 327


>UNIPROTKB|Q5E998 [details] [associations]
            symbol:CTSL2 "Cathepsin L2" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            HOVERGEN:HBG011513 UniGene:Bt.3987 MEROPS:C01.032 EMBL:BT021022
            IPI:IPI00711962 ProteinModelPortal:Q5E998 SMR:Q5E998 STRING:Q5E998
            InParanoid:Q5E998 Uniprot:Q5E998
        Length = 334

 Score = 521 (188.5 bits), Expect = 4.6e-50, P = 4.6e-50
 Identities = 124/314 (39%), Positives = 173/314 (55%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADLTPQE 92
             + QWKA + R Y  + E  +R  +++ N   ++  N   + G   + + +N F D+T +E
Sbjct:    29 WHQWKATHRRLYGMNEEEWRR-AVWEKNKKIIDLHNQEYSEGKHGFRMAMNAFGDMTNEE 87

Query:    93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
             F     GF+   H    K    P L     VP SV+W +KG VTPVK QGQC       A
Sbjct:    88 FRQVMNGFQNQKHKKG-KLFHEPLLV---DVPKSVDWTKKGYVTPVKNQGQCGSCWAFSA 143

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
               A+EG    K  +LVSLSEQ LVDC+    N GC GG MD+AF+YI  N  + ++  Y 
Sbjct:   144 TGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGCLDSEESYP 203

Query:   206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDA--SALQFYS 262
             Y    T  C+  K E  AA  T + D+P   E++L+KAVA   P+SVAIDA  ++ QFY 
Sbjct:   204 YLATDTNSCN-YKPECSAANDTGFVDIPQR-EKALMKAVATVGPISVAIDAGHTSFQFYK 261

Query:   263 GGVF-NGYCETF-LNHGVTAVGYG---TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
              G++ +  C +  L+HGV  VGYG   T     K+W++KNSWG +WG +GY ++ +D   
Sbjct:   262 SGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGWNGYVKMAKD--- 318

Query:   318 PQGQCGIAMFASFP 331
                 CGIA  AS+P
Sbjct:   319 QNNHCGIATAASYP 332


>UNIPROTKB|P43235 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0001957
            "intramembranous ossification" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0045453 "bone resorption"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=TAS] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=TAS] [GO:0036021 "endolysosome lumen" evidence=TAS]
            [GO:0045087 "innate immune response" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            Reactome:REACT_118779 Reactome:REACT_6900 GO:GO:0005615
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087 GO:GO:0045453
            EMBL:CH471121 EMBL:AL355860 GO:GO:0004197 GO:GO:0001957
            HOVERGEN:HBG011513 GO:GO:0036021 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:U13665 EMBL:X82153
            EMBL:U20280 EMBL:S79895 EMBL:CR541675 EMBL:AL356292 EMBL:BC016058
            IPI:IPI00300599 PIR:JC2476 RefSeq:NP_000387.1 UniGene:Hs.632466
            PDB:1ATK PDB:1AU0 PDB:1AU2 PDB:1AU3 PDB:1AU4 PDB:1AYU PDB:1AYV
            PDB:1AYW PDB:1BGO PDB:1BY8 PDB:1MEM PDB:1NL6 PDB:1NLJ PDB:1Q6K
            PDB:1SNK PDB:1TU6 PDB:1U9V PDB:1U9W PDB:1U9X PDB:1VSN PDB:1YK7
            PDB:1YK8 PDB:1YT7 PDB:2ATO PDB:2AUX PDB:2AUZ PDB:2BDL PDB:2R6N
            PDB:3C9E PDB:3H7D PDB:3KW9 PDB:3KWB PDB:3KWZ PDB:3KX1 PDB:3O0U
            PDB:3O1G PDB:3OVZ PDB:4DMX PDB:4DMY PDB:7PCK PDBsum:1ATK
            PDBsum:1AU0 PDBsum:1AU2 PDBsum:1AU3 PDBsum:1AU4 PDBsum:1AYU
            PDBsum:1AYV PDBsum:1AYW PDBsum:1BGO PDBsum:1BY8 PDBsum:1MEM
            PDBsum:1NL6 PDBsum:1NLJ PDBsum:1Q6K PDBsum:1SNK PDBsum:1TU6
            PDBsum:1U9V PDBsum:1U9W PDBsum:1U9X PDBsum:1VSN PDBsum:1YK7
            PDBsum:1YK8 PDBsum:1YT7 PDBsum:2ATO PDBsum:2AUX PDBsum:2AUZ
            PDBsum:2BDL PDBsum:2R6N PDBsum:3C9E PDBsum:3H7D PDBsum:3KW9
            PDBsum:3KWB PDBsum:3KWZ PDBsum:3KX1 PDBsum:3O0U PDBsum:3O1G
            PDBsum:3OVZ PDBsum:4DMX PDBsum:4DMY PDBsum:7PCK
            ProteinModelPortal:P43235 SMR:P43235 DIP:DIP-39993N IntAct:P43235
            STRING:P43235 PhosphoSite:P43235 DMDM:1168793 PaxDb:P43235
            PRIDE:P43235 DNASU:1513 Ensembl:ENST00000271651 GeneID:1513
            KEGG:hsa:1513 UCSC:uc001evp.2 GeneCards:GC01M150768 HGNC:HGNC:2536
            MIM:265800 MIM:601105 neXtProt:NX_P43235 Orphanet:763
            PharmGKB:PA27034 InParanoid:P43235 OMA:LKVPPSH PhylomeDB:P43235
            BindingDB:P43235 ChEMBL:CHEMBL268 EvolutionaryTrace:P43235
            GenomeRNAi:1513 NextBio:6267 ArrayExpress:P43235 Bgee:P43235
            CleanEx:HS_CTSK CleanEx:HS_CTSO Genevestigator:P43235
            GermOnline:ENSG00000143387 Uniprot:P43235
        Length = 329

 Score = 521 (188.5 bits), Expect = 4.6e-50, P = 4.6e-50
 Identities = 125/326 (38%), Positives = 185/326 (56%)

Query:    21 TYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYT 79
             ++  + E  +   +E WK  + + Y    +   R  I++ NL  +   N  A++G  +Y 
Sbjct:    13 SFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYE 72

Query:    80 LRLNKFADLTPQEFIASQTGFKMS-DHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTP 137
             L +N   D+T +E +   TG K+   HS S   N T ++ +   + P SV++ +KG VTP
Sbjct:    73 LAMNHLGDMTSEEVVQKMTGLKVPLSHSRS---NDTLYIPEWEGRAPDSVDYRKKGYVTP 129

Query:   138 VKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
             VK QGQC       +V A+EG    K  +L++LS Q LVDC +   N+GC GG+M +AF+
Sbjct:   130 VKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDGCGGGYMTNAFQ 187

Query:   191 YIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PV 249
             Y+ +N+GI ++  Y Y G     C        AA+   Y ++P  +E++L +AVA   PV
Sbjct:   188 YVQKNRGIDSEDAYPYVGQEES-C-MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPV 245

Query:   250 SVAIDAS--ALQFYSGGVF-NGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
             SVAIDAS  + QFYS GV+ +  C +  LNH V AVGYG  ++G K+W+IKNSWG++WG 
Sbjct:   246 SVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGI-QKGNKHWIIKNSWGENWGN 304

Query:   306 DGYFRLQRDIDQPQGQCGIAMFASFP 331
              GY  + R+       CGIA  ASFP
Sbjct:   305 KGYILMARN---KNNACGIANLASFP 327


>UNIPROTKB|F1NZ37 [details] [associations]
            symbol:LOC420160 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 EMBL:AADN02062018
            IPI:IPI00587784 Ensembl:ENSGALT00000006765 OMA:CGVANQA
            Uniprot:F1NZ37
        Length = 340

 Score = 520 (188.1 bits), Expect = 5.8e-50, P = 5.8e-50
 Identities = 120/319 (37%), Positives = 174/319 (54%)

Query:    30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
             + E +E+WK+ Y + Y   AE  +R E++++NL  +E+ N   + G  ++ L +N + DL
Sbjct:    30 LEEAWERWKSLYAKEYPGEAELIRR-EVWENNLRRIEQHNWEESQGQHTFRLGMNHYGDL 88

Query:    89 TPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC--- 144
               +EF     GF    H          F   ++Q  P  V+W  +G VTPVK QG C   
Sbjct:    89 MDEEFNQLLNGFAPVQHEEP----ALTFQASAAQKTPAEVDWRMRGYVTPVKNQGHCGSC 144

Query:   145 ----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
                 A  A+EG+      +L  LSEQ L+DC+    NNGC GG+M  AF+Y+  N G+ +
Sbjct:   145 WAFSATGALEGLVFNWTGKLAVLSEQNLIDCSWKLGNNGCQGGYMTRAFQYVHDNGGMNS 204

Query:   201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDASAL- 258
             + +Y Y+   T  C    A D AA  +    V    E +L +AVA   PVSVA+DAS+  
Sbjct:   205 EHIYPYQATDTSSCRYNPA-DRAANCSTVWLVAQGSEAALEQAVATVGPVSVAVDASSFF 263

Query:   259 -QFYSGGVFNG-YCETFLNHGVTAVGYGTSEEG---IKYWLIKNSWGQDWGEDGYFRLQR 313
               FY  G+FN  +C   +NHG+ AVGYG S+E    + YW++KNSW + WGE GY RL +
Sbjct:   264 FHFYKSGIFNSMFCSQKVNHGMLAVGYGISQEARKNVSYWILKNSWSEVWGEKGYIRLLK 323

Query:   314 DIDQPQGQCGIAMFASFPV 332
              ++     CG+A  ASFP+
Sbjct:   324 GVNN---HCGVANQASFPL 339


>UNIPROTKB|O60911 [details] [associations]
            symbol:CTSL2 "Cathepsin L2" species:9606 "Homo sapiens"
            [GO:0004177 "aminopeptidase activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA] [GO:0005902
            "microvillus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0007283 "spermatogenesis" evidence=IEA] [GO:0009267 "cellular
            response to starvation" evidence=IEA] [GO:0009749 "response to
            glucose stimulus" evidence=IEA] [GO:0009897 "external side of
            plasma membrane" evidence=IEA] [GO:0010259 "multicellular
            organismal aging" evidence=IEA] [GO:0021675 "nerve development"
            evidence=IEA] [GO:0030984 "kininogen binding" evidence=IEA]
            [GO:0032403 "protein complex binding" evidence=IEA] [GO:0034698
            "response to gonadotropin stimulus" evidence=IEA] [GO:0042277
            "peptide binding" evidence=IEA] [GO:0043005 "neuron projection"
            evidence=IEA] [GO:0043204 "perikaryon" evidence=IEA] [GO:0046697
            "decidualization" evidence=IEA] [GO:0048102 "autophagic cell death"
            evidence=IEA] [GO:0051384 "response to glucocorticoid stimulus"
            evidence=IEA] [GO:0060008 "Sertoli cell differentiation"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=TAS] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_118779 Reactome:REACT_6900
            GO:GO:0009897 GO:GO:0019886 GO:GO:0034698 GO:GO:0043204
            GO:GO:0009749 GO:GO:0030141 GO:GO:0051384 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0045177 GO:GO:0043005 GO:GO:0007283
            GO:GO:0004177 GO:GO:0042277 GO:GO:0009267 GO:GO:0021675
            GO:GO:0043202 GO:GO:0005902 GO:GO:0010259 GO:GO:0004197
            GO:GO:0048102 GO:GO:0046697 HOVERGEN:HBG011513 CTD:1515
            OrthoDB:EOG48PMKF OMA:FDQNLDT GO:GO:0060008 EMBL:Y14734
            EMBL:AB001928 EMBL:AF070448 EMBL:AB019534 EMBL:AY358641
            EMBL:AL445670 EMBL:BC023504 EMBL:BC110512 IPI:IPI00000013
            RefSeq:NP_001188504.1 RefSeq:NP_001324.2 UniGene:Hs.610096 PDB:1FH0
            PDB:3H6S PDB:3KFQ PDBsum:1FH0 PDBsum:3H6S PDBsum:3KFQ
            ProteinModelPortal:O60911 SMR:O60911 IntAct:O60911 STRING:O60911
            MEROPS:I29.010 PhosphoSite:O60911 PaxDb:O60911 PeptideAtlas:O60911
            PRIDE:O60911 Ensembl:ENST00000259470 Ensembl:ENST00000538255
            GeneID:1515 KEGG:hsa:1515 UCSC:uc004awt.3 GeneCards:GC09M099794
            HGNC:HGNC:2538 HPA:CAB017112 MIM:603308 neXtProt:NX_O60911
            PharmGKB:PA27036 InParanoid:O60911 KO:K01375 PhylomeDB:O60911
            BRENDA:3.4.22.43 SABIO-RK:O60911 BindingDB:O60911 ChEMBL:CHEMBL3272
            ChiTaRS:CTSL2 EvolutionaryTrace:O60911 GenomeRNAi:1515 NextBio:6277
            Bgee:O60911 CleanEx:HS_CTSL2 Genevestigator:O60911
            GermOnline:ENSG00000136943 Uniprot:O60911
        Length = 334

 Score = 517 (187.1 bits), Expect = 1.2e-49, P = 1.2e-49
 Identities = 127/339 (37%), Positives = 183/339 (53%)

Query:    10 LIISGSCASQAT-YRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF 68
             L+++  C   A+    FD+ ++  K+ QWKA + R Y  + E  +R  +++ N+  +E  
Sbjct:     5 LVLAAFCLGIASAVPKFDQ-NLDTKWYQWKATHRRLYGANEEGWRR-AVWEKNMKMIELH 62

Query:    69 NNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSV 127
             N   + G   +T+ +N F D+T +EF      F+              FL     +P SV
Sbjct:    63 NGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFRNQKFRKGKVFREPLFL----DLPKSV 118

Query:   128 NWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
             +W +KG VTPVK Q QC       A  A+EG    K  +LVSLSEQ LVDC+    N GC
Sbjct:   119 DWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGC 178

Query:   181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL 240
              GGFM  AF+Y+ +N G+ ++  Y Y  +   IC   + E+  A  T +  V P  E++L
Sbjct:   179 NGGFMARAFQYVKENGGLDSEESYPYVAVDE-IC-KYRPENSVANDTGFTVVAPGKEKAL 236

Query:   241 LKAVANQ-PVSVAIDA--SALQFYSGGV-FNGYCETF-LNHGVTAVGYG---TSEEGIKY 292
             +KAVA   P+SVA+DA  S+ QFY  G+ F   C +  L+HGV  VGYG    +    KY
Sbjct:   237 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKY 296

Query:   293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
             WL+KNSWG +WG +GY ++ +D       CGIA  AS+P
Sbjct:   297 WLVKNSWGPEWGSNGYVKIAKD---KNNHCGIATAASYP 332


>RGD|61810 [details] [associations]
            symbol:Ctsk "cathepsin K" species:10116 "Rattus norvegicus"
           [GO:0001957 "intramembranous ossification" evidence=IEP] [GO:0005615
           "extracellular space" evidence=IDA] [GO:0005737 "cytoplasm"
           evidence=IDA] [GO:0005764 "lysosome" evidence=IDA] [GO:0006508
           "proteolysis" evidence=TAS] [GO:0008234 "cysteine-type peptidase
           activity" evidence=TAS] [GO:0045453 "bone resorption" evidence=IMP]
           InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
           Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
           RGD:61810 GO:GO:0005615 eggNOG:COG4870 HOGENOM:HOG000230774
           GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
           InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
           PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
           GO:GO:0045453 GO:GO:0001957 GeneTree:ENSGT00560000076577
           HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
           OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:AF010306 EMBL:BC078793
           IPI:IPI00206378 RefSeq:NP_113748.1 UniGene:Rn.5598
           ProteinModelPortal:O35186 SMR:O35186 STRING:O35186
           PhosphoSite:O35186 PRIDE:O35186 Ensembl:ENSRNOT00000028730
           GeneID:29175 KEGG:rno:29175 UCSC:RGD:61810 InParanoid:O35186
           OMA:YKEIPEG BindingDB:O35186 ChEMBL:CHEMBL3034 NextBio:608248
           Genevestigator:O35186 GermOnline:ENSRNOG00000021155 Uniprot:O35186
        Length = 329

 Score = 516 (186.7 bits), Expect = 1.5e-49, P = 1.5e-49
 Identities = 123/319 (38%), Positives = 179/319 (56%)

Query:    27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKF 85
             E ++  ++E WK  +G+ Y    +   R  I++ NL  +   N  A++G  +Y L +N  
Sbjct:    19 EETLDTQWELWKKTHGKQYNSKVDEISRRLIWEKNLKKISVHNLEASLGAHTYELAMNHL 78

Query:    86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTPVKYQGQC 144
              D+T +E +   TG ++    S   +N T +  +   +VP S+++ +KG VTPVK QGQC
Sbjct:    79 GDMTSEEVVQKMTGLRVPPSRSF--SNDTLYTPEWEGRVPDSIDYRKKGYVTPVKNQGQC 136

Query:   145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                    +  A+EG    K  +L++LS Q LVDC +   N GC GG+M  AF+Y+ QN G
Sbjct:   137 GSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVSE--NYGCGGGYMTTAFQYVQQNGG 194

Query:   198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDAS 256
             I ++  Y Y G     C    A   AA+   Y ++P  +E++L +AVA   PVSV+IDAS
Sbjct:   195 IDSEDAYPYVGQDES-C-MYNATAKAAKCRGYREIPVGNEKALKRAVARVGPVSVSIDAS 252

Query:   257 --ALQFYSGGVF-NGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
               + QFYS GV+ +  C+   +NH V  VGYGT ++G KYW+IKNSWG+ WG  GY  L 
Sbjct:   253 LTSFQFYSRGVYYDENCDRDNVNHAVLVVGYGT-QKGNKYWIIKNSWGESWGNKGYVLLA 311

Query:   313 RDIDQPQGQCGIAMFASFP 331
             R+       CGI   ASFP
Sbjct:   312 RN---KNNACGITNLASFP 327


>MGI|MGI:1922258 [details] [associations]
            symbol:4930486L24Rik "RIKEN cDNA 4930486L24 gene"
            species:10090 "Mus musculus" [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030054 "cell
            junction" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 MGI:MGI:1922258
            GO:GO:0005576 GO:GO:0030054 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 HSSP:P07711
            EMBL:AY146988 EMBL:AK145933 EMBL:BC061218 IPI:IPI00280732
            RefSeq:NP_835199.1 UniGene:Mm.19839 ProteinModelPortal:Q80UB0
            SMR:Q80UB0 MEROPS:C01.972 PRIDE:Q80UB0 Ensembl:ENSMUST00000091569
            GeneID:214639 KEGG:mmu:214639 UCSC:uc007qvs.1 InParanoid:Q80UB0
            OMA:RYHAENS OrthoDB:EOG4XWG0N NextBio:374408 Bgee:Q80UB0
            CleanEx:MM_4930486L24RIK Genevestigator:Q80UB0 Uniprot:Q80UB0
        Length = 333

 Score = 515 (186.3 bits), Expect = 2.0e-49, P = 2.0e-49
 Identities = 126/344 (36%), Positives = 185/344 (53%)

Query:     6 LIVVLIISGSCAS-QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
             +I VL ++  C    +T  T D  S+  ++ +W+ ++G+ Y  + E  +R  +++ N   
Sbjct:     1 MIAVLFLAILCLEIDSTAPTLDP-SLDVQWNEWRTKHGKAYNVNEERLRR-AVWEKNFKM 58

Query:    65 VERFNNAAI-GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV 123
             +E  N   + G   +T+ +N F DLT  EF+   TGF+              FLY    V
Sbjct:    59 IELHNWEYLEGKHDFTMTMNAFGDLTNTEFVKMMTGFRRQKIKRMHVFQDHQFLY----V 114

Query:   124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
             P  V+W   G VTPVK QG CA         ++EG    K  RLV LSEQ L+DC  ++ 
Sbjct:   115 PKYVDWRMLGYVTPVKNQGYCASSWAFSATGSLEGQMFKKTGRLVPLSEQNLLDCMGSNV 174

Query:   177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
              + C GGFM +AF+Y+  N G+  +  Y Y G     C    AE+ AA + ++  +P   
Sbjct:   175 THDCSGGFMQNAFQYVKDNGGLATEESYPYIGPGRK-C-RYHAENSAANVRDFVQIPGR- 231

Query:   237 EESLLKAVANQ-PVSVAIDAS--ALQFYSGGVF-NGYCE-TFLNHGVTAVGYG-TSEE-- 288
             EE+L+KAVA   P+SVA+DAS  + QFY  G++    C+   LNH V  VGYG   EE  
Sbjct:   232 EEALMKAVAKVGPISVAVDASHDSFQFYDSGIYYEPQCKRVHLNHAVLVVGYGFEGEESD 291

Query:   289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             G  YWL+KNSWG++WG  GY ++ +D +     CGIA  A++P+
Sbjct:   292 GNSYWLVKNSWGEEWGMKGYIKIAKDWNN---HCGIATLATYPI 332


>DICTYBASE|DDB_G0283867 [details] [associations]
            symbol:cprC "cysteine proteinase 3" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0283867 GenomeReviews:CM000153_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:AAFI02000057
            KO:K01365 EMBL:X03930 RefSeq:XP_638859.1 ProteinModelPortal:Q23894
            SMR:Q23894 MEROPS:C01.114 EnsemblProtists:DDB0220784 GeneID:8624257
            KEGG:ddi:DDB_G0283867 OMA:NNVEHIN Uniprot:Q23894
        Length = 337

 Score = 514 (186.0 bits), Expect = 2.5e-49, P = 2.5e-49
 Identities = 128/342 (37%), Positives = 186/342 (54%)

Query:     5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
             F ++VL IS   A       F      + F  W     + Y    E   R+E FK N+  
Sbjct:     9 FTLIVLSISFISAGNV----FSHKQYQDSFIDWMRSNNKAYTHK-EFMPRYEEFKKNMDY 63

Query:    65 VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSL-KAN-GTPFLYKSSQ 122
             V  +N+   G+++  L LN+ ADL+ +E+  +  G +     +   K N G        +
Sbjct:    64 VHNWNSK--GSKT-VLGLNQHADLSNEEYRLNYLGTRAHIKLNGYHKRNLGLRLNRPQFK 120

Query:   123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
              P +V+W EK AVTPVK QGQC          +VEG+ AIK  +LVSLSEQ ++DC+++ 
Sbjct:   121 QPLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSF 180

Query:   176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
              N GC GG M +AF+YII+N G+ ++  Y YE      C   +    AA+IT+Y+++   
Sbjct:   181 GNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDEC-KFQEGSVAAKITSYKEIEAG 239

Query:   236 DEESLLKAVANQPVSVAIDAS--ALQFYSGGVF-NGYCETF-LNHGVTAVGYGTSEEGIK 291
             DE  L  A+   PVSVAIDAS  + Q Y+ GV+    C +  L+HGV AVG GT + G  
Sbjct:   240 DENDLQNALLLNPVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHGVLAVGMGT-DNGED 298

Query:   292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
             Y+++KNSWG  WG +GY  + R+ D     CGI+  AS+P++
Sbjct:   299 YYIVKNSWGPSWGLNGYIHMARNKDN---NCGISTMASYPIA 337


>MGI|MGI:1349426 [details] [associations]
            symbol:Ctsj "cathepsin J" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=ISO]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0048471 "perinuclear region
            of cytoplasm" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:1349426 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF136272
            EMBL:AF158182 EMBL:AY034579 EMBL:AK005526 EMBL:AK131661
            EMBL:BC103769 IPI:IPI00126770 RefSeq:NP_036137.1 UniGene:Mm.31948
            ProteinModelPortal:Q9R014 SMR:Q9R014 MEROPS:C01.038 PRIDE:Q9R014
            Ensembl:ENSMUST00000071526 GeneID:26898 KEGG:mmu:26898
            UCSC:uc007qwa.1 CTD:26898 InParanoid:Q9R014 KO:K09599
            NextBio:304745 Bgee:Q9R014 CleanEx:MM_CTSJ Genevestigator:Q9R014
            GermOnline:ENSMUSG00000055298 Uniprot:Q9R014
        Length = 334

 Score = 514 (186.0 bits), Expect = 2.5e-49, P = 2.5e-49
 Identities = 124/341 (36%), Positives = 193/341 (56%)

Query:     7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
             +++LI+    AS A  +  D    AE ++ WK +Y ++Y    E  +R  ++++N+  ++
Sbjct:     5 VLLLILCFGVASGA--QAHDPKLDAE-WKDWKTKYAKSYSPKEEALRR-AVWEENMRMIK 60

Query:    67 RFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPP 125
               N   ++G  ++T+++NKF D T +EF  S     +    +   A      + S  +P 
Sbjct:    61 LHNKENSLGKNNFTMKMNKFGDQTSEEFRKSIDNIPIPAAMTDPHAQN----HVSIGLPD 116

Query:   126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
               +W E+G VTPV+ QG+C       A  A+EG    K   L  LS Q L+DC+    N 
Sbjct:   117 YKDWREEGYVTPVRNQGKCGSCWAFAAAGAIEGQMFWKTGNLTPLSVQNLLDCSKTVGNK 176

Query:   179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
             GC  G    AF+Y+++NKG+  +A Y YEG   G C   ++E+ +A IT+Y ++PPN E 
Sbjct:   177 GCQSGTAHQAFEYVLKNKGLEAEATYPYEGKD-GPC-RYRSENASANITDYVNLPPN-EL 233

Query:   239 SLLKAVAN-QPVSVAIDAS--ALQFYSGGVF-NGYCET-FLNHGVTAVGYGTS---EEGI 290
              L  AVA+  PVS AIDAS  + +FY+GG++    C + F+NH V  VGYG+    ++G 
Sbjct:   234 YLWVAVASIGPVSAAIDASHDSFRFYNGGIYYEPNCSSYFVNHAVLVVGYGSEGDVKDGN 293

Query:   291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
              YWLIKNSWG++WG +GY ++ +D +     CGIA  AS+P
Sbjct:   294 NYWLIKNSWGEEWGMNGYMQIAKDHNN---HCGIASLASYP 331


>RGD|69241 [details] [associations]
            symbol:Ctsj "cathepsin J" species:10116 "Rattus norvegicus"
           [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
           evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
           evidence=IEA] [GO:0048471 "perinuclear region of cytoplasm"
           evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
           PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:L14776
           RGD:69241 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
           InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246
           SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
           GO:GO:0005764 HOVERGEN:HBG011513 MEROPS:C01.038 CTD:26898 KO:K09599
           EMBL:AF310623 EMBL:BC097263 IPI:IPI00205027 PIR:I58002
           RefSeq:NP_058817.1 UniGene:Rn.34875 ProteinModelPortal:Q63088
           SMR:Q63088 PRIDE:Q63088 GeneID:29174 KEGG:rno:29174 NextBio:608244
           Genevestigator:Q63088 Uniprot:Q63088
        Length = 334

 Score = 509 (184.2 bits), Expect = 8.5e-49, P = 8.5e-49
 Identities = 127/341 (37%), Positives = 190/341 (55%)

Query:     7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
             + ++I+    AS A  R  D    AE ++ WK +Y ++Y    E  KR  ++++NL  ++
Sbjct:     5 VFLVILCFGVASGAPAR--DPNLDAE-WQDWKTKYAKSYSPVEEELKR-AVWEENLKMIQ 60

Query:    67 RFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPP 125
               N    +G   +T+ +N FAD T +EF  S +   +     +   N +     S  +P 
Sbjct:    61 LHNKENGLGKNGFTMEMNAFADTTGEEFRKSLSDILIP----AAVTNPSAQKQVSIGLPN 116

Query:   126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
               +W ++G VTPV+ QG+C       AV A+EG    K   L  LS Q L+DC+ ++ NN
Sbjct:   117 FKDWRKEGYVTPVRNQGKCGSCWAFAAVGAIEGQMFSKTGNLTPLSVQNLLDCSKSEGNN 176

Query:   179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
             GC  G    AF Y+++NKG+  +A Y YEG   G C    +E+ +A IT + ++PPN E 
Sbjct:   177 GCRWGTAHQAFNYVLKNKGLEAEATYPYEGKD-GPC-RYHSENASANITGFVNLPPN-EL 233

Query:   239 SLLKAVAN-QPVSVAIDAS--ALQFYSGGVFNG-YCETFL-NHGVTAVGYG---TSEEGI 290
              L  AVA+  PVS AIDAS  + +FYSGGV++   C +++ NH V  VGYG      +G 
Sbjct:   234 YLWVAVASIGPVSAAIDASHDSFRFYSGGVYHEPNCSSYVVNHAVLVVGYGFEGNETDGN 293

Query:   291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
              YWLIKNSWG++WG +G+ ++ +D +     CGIA  ASFP
Sbjct:   294 NYWLIKNSWGEEWGINGFMKIAKDRNN---HCGIASQASFP 331


>ZFIN|ZDB-GENE-041010-76 [details] [associations]
            symbol:ctsll "cathepsin L, like" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-041010-76
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            OrthoDB:EOG48PMKF MEROPS:C01.032 EMBL:BX119902 IPI:IPI00616622
            UniGene:Dr.79994 SMR:A2BEM8 Ensembl:ENSDART00000144226
            InParanoid:A2BEM8 OMA:PRYSAAN Uniprot:A2BEM8
        Length = 337

 Score = 508 (183.9 bits), Expect = 1.1e-48, P = 1.1e-48
 Identities = 122/349 (34%), Positives = 187/349 (53%)

Query:     1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
             M  +  +V L IS   A+     T D+  + + +  WK  + ++Y E  E  +R  +++ 
Sbjct:     1 MLLFASLVTLCISAVFAAP----TLDQ-KLDDHWHLWKRWHEKSYHEKEEGWRRM-VWEK 54

Query:    61 NLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
             NL  +E  N   ++G  ++ L +N+F D+T +EF  +  G+   +   + K+ G+ F+  
Sbjct:    55 NLKKIELHNLEHSVGKHTFRLGMNQFGDMTNEEFRQAMNGY---NRDPNRKSKGSLFIEP 111

Query:   120 SS-QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
             S    P  ++W +KG VTP+K Q +C       +  A+EG    K  +LVSLSEQ L+DC
Sbjct:   112 SFFTAPQQIDWRQKGYVTPIKDQKRCGSCWAFSSTGALEGQVFRKTGKLVSLSEQNLMDC 171

Query:   172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
             +    NNGC GG MD AF+Y+  N G+ ++  Y Y       C        AA +T + D
Sbjct:   172 SRPQGNNGCDGGLMDQAFQYVQDNNGLDSEESYPYLATDDQPCH-YDPRYSAANVTGFVD 230

Query:   232 VPPNDEESLLKAVANQ-PVSVAIDAS--ALQFYSGGVF-NGYCETF-LNHGVTAVGYG-- 284
             +P   E +L+KAVA   PV+VAIDA   + QFY  G++    C T  L+HGV  VGYG  
Sbjct:   231 IPSGKEHALMKAVAAVGPVAVAIDAGHESFQFYQSGIYYEKACSTEELDHGVLVVGYGYE 290

Query:   285 -TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
                  G +YW++KNSW   WG+ GY  + +D+   +  CGIA  AS+P+
Sbjct:   291 GVDVAGRRYWIVKNSWTDRWGDKGYIYMAKDL---KNHCGIATSASYPL 336


>MGI|MGI:107823 [details] [associations]
            symbol:Ctsk "cathepsin K" species:10090 "Mus musculus"
            [GO:0005615 "extracellular space" evidence=ISO] [GO:0005737
            "cytoplasm" evidence=ISO] [GO:0005764 "lysosome" evidence=ISO]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0045453 "bone resorption" evidence=ISO]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:107823 GO:GO:0005615 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0045453
            GO:GO:0001957 HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 OMA:LKVPPSH EMBL:X94444
            EMBL:AJ006033 EMBL:BC046320 IPI:IPI00316575 PIR:S74227
            RefSeq:NP_031828.2 UniGene:Mm.272085 ProteinModelPortal:P55097
            SMR:P55097 MINT:MINT-3089515 STRING:P55097 PhosphoSite:P55097
            PRIDE:P55097 Ensembl:ENSMUST00000015664 GeneID:13038 KEGG:mmu:13038
            InParanoid:P55097 BioCyc:MetaCyc:MONOMER-14811 ChEMBL:CHEMBL1075277
            NextBio:282924 Bgee:P55097 CleanEx:MM_CTSK Genevestigator:P55097
            GermOnline:ENSMUSG00000028111 Uniprot:P55097
        Length = 329

 Score = 505 (182.8 bits), Expect = 2.3e-48, P = 2.3e-48
 Identities = 121/319 (37%), Positives = 177/319 (55%)

Query:    27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKF 85
             E  +  ++E WK  + + Y    +   R  I++ NL  +   N  A++G  +Y L +N  
Sbjct:    19 EEMLDTQWELWKKTHQKQYNSKVDEISRRLIWEKNLKQISAHNLEASLGVHTYELAMNHL 78

Query:    86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTPVKYQGQC 144
              D+T +E +   TG ++    S   +N T +  +   +VP S+++ +KG VTPVK QGQC
Sbjct:    79 GDMTSEEVVQKMTGLRIPPSRSY--SNDTLYTPEWEGRVPDSIDYRKKGYVTPVKNQGQC 136

Query:   145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                    +  A+EG    K  +L++LS Q LVDC T   N GC GG+M  AF+Y+ QN G
Sbjct:   137 GSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVTE--NYGCGGGYMTTAFQYVQQNGG 194

Query:   198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDAS 256
             I ++  Y Y G     C    A   AA+   Y ++P  +E++L +AVA   P+SV+IDAS
Sbjct:   195 IDSEDAYPYVGQDES-C-MYNATAKAAKCRGYREIPVGNEKALKRAVARVGPISVSIDAS 252

Query:   257 --ALQFYSGGVF-NGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
               + QFYS GV+ +  C+   +NH V  VGYGT ++G K+W+IKNSWG+ WG  GY  L 
Sbjct:   253 LASFQFYSRGVYYDENCDRDNVNHAVLVVGYGT-QKGSKHWIIKNSWGESWGNKGYALLA 311

Query:   313 RDIDQPQGQCGIAMFASFP 331
             R+       CGI   ASFP
Sbjct:   312 RN---KNNACGITNMASFP 327


>MGI|MGI:107341 [details] [associations]
            symbol:Ctss "cathepsin S" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=ISO;IDA] [GO:0006508 "proteolysis"
            evidence=IDA] [GO:0008152 "metabolic process" evidence=ISO]
            [GO:0008233 "peptidase activity" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0009986 "cell
            surface" evidence=ISO] [GO:0016020 "membrane" evidence=IDA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0045453 "bone
            resorption" evidence=ISO] [GO:0051930 "regulation of sensory
            perception of pain" evidence=ISO] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:107341 GO:GO:0016020 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0008233 GO:GO:0031905 Reactome:REACT_102124
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 BRENDA:3.4.22.27
            ChiTaRS:CTSS EMBL:AF051732 EMBL:AF051727 EMBL:AF051728
            EMBL:AF051729 EMBL:AF051726 EMBL:AF051730 EMBL:AF051731
            EMBL:AF038546 EMBL:AJ002386 EMBL:AC092203 EMBL:Y18466 EMBL:AJ223208
            IPI:IPI00309520 UniGene:Mm.3619 PDB:1M0H PDBsum:1M0H
            ProteinModelPortal:O70370 SMR:O70370 STRING:O70370
            PhosphoSite:O70370 PaxDb:O70370 PRIDE:O70370
            Ensembl:ENSMUST00000116304 BindingDB:O70370 ChEMBL:CHEMBL4098
            NextBio:282932 Bgee:O70370 CleanEx:MM_CTSS Genevestigator:O70370
            GermOnline:ENSMUSG00000038642 Uniprot:O70370
        Length = 340

 Score = 505 (182.8 bits), Expect = 2.3e-48, P = 2.3e-48
 Identities = 118/312 (37%), Positives = 174/312 (55%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQE 92
             ++ WK  + + YK+  E   R  I++ NL  +   N   ++G  +Y + +N   D+T +E
Sbjct:    36 WDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEE 95

Query:    93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
              +      ++   S       +   Y +  +P +V+W EKG VT VKYQG C       A
Sbjct:    96 ILCRMGALRIPRQSPKTVTFRS---YSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSA 152

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCATNDN--NNGCYGGFMDDAFKYIIQNKGITNDAV 203
             V A+EG   +K  +L+SLS Q LVDC+  +   N GC GG+M +AF+YII N GI  DA 
Sbjct:   153 VGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADAS 212

Query:   204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDAS--ALQF 260
             Y Y+      C    +++ AA  + Y  +P  DE++L +AVA + PVSV IDAS  +  F
Sbjct:   213 YPYKATDEK-CH-YNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFF 270

Query:   261 YSGGVFNG-YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
             Y  GV++   C   +NHGV  VGYGT + G  YWL+KNSWG ++G+ GY R+ R+    +
Sbjct:   271 YKSGVYDDPSCTGNVNHGVLVVGYGTLD-GKDYWLVKNSWGLNFGDQGYIRMARN---NK 326

Query:   320 GQCGIAMFASFP 331
               CGIA + S+P
Sbjct:   327 NHCGIASYCSYP 338


>DICTYBASE|DDB_G0279185 [details] [associations]
            symbol:cprF "cysteine proteinase 6" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0279185 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000030 HSSP:P07711 ProtClustDB:CLSZ2846820 EMBL:U72745
            RefSeq:XP_641725.1 ProteinModelPortal:Q94503 SMR:Q94503
            MEROPS:C01.081 PRIDE:Q94503 EnsemblProtists:DDB0215002
            GeneID:8621921 KEGG:ddi:DDB_G0279185 Uniprot:Q94503
        Length = 434

 Score = 395 (144.1 bits), Expect = 2.3e-48, Sum P(2) = 2.3e-48
 Identities = 110/296 (37%), Positives = 155/296 (52%)

Query:     7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
             + VL++S + A Q       E      F  W   + R Y  S E + RF IFK N+  + 
Sbjct:     7 LCVLLVSVATAKQQ----LSELQYRNAFTNWMIAHQRHYS-SEEFNGRFNIFKANMDYIN 61

Query:    67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPS 126
              +N    G+ +  L LN FAD+T +E+ A+  G      +SSL+   +  ++   Q   S
Sbjct:    62 EWNTK--GSET-VLGLNVFADITNEEYRATYLGTPFD--ASSLEMTPSEKVFGGVQAN-S 115

Query:   127 VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKI--NRLVSLSEQQLVDCATNDNN 177
             V+W  KGAVTP+K QG+C       A  A EG   I    + L S+SEQQL+DC+ +  N
Sbjct:   116 VDWRAKGAVTPIKNQGECGGCWSFSATGATEGAQYIANGDSDLTSVSEQQLIDCSGSYGN 175

Query:   178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
             NGC GG M  AF+YII N GI  ++ Y +   +T  C      +  A++++Y +V    E
Sbjct:   176 NGCEGGLMTLAFEYIINNGGIDTESSYPFTA-NTEKC-KYNPSNIGAELSSYVNVTSGSE 233

Query:   238 ESLLKAVANQPVSVAIDAS--ALQFYSGGVFNG-YCE-TFLNHGVTAVGYGTSEEG 289
               L   V   P SVAIDAS  + QFYS G++N   C  T L+HGV AVG+G+   G
Sbjct:   234 SDLAAKVTQGPTSVAIDASQPSFQFYSSGIYNEPACSSTQLDHGVLAVGFGSGSSG 289

 Score = 127 (49.8 bits), Expect = 2.3e-48, Sum P(2) = 2.3e-48
 Identities = 22/40 (55%), Positives = 27/40 (67%)

Query:   292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
             YW++KNSWG DWG +GY  + +D D    QCGIA  AS P
Sbjct:   389 YWIVKNSWGLDWGINGYILMSKDKDN---QCGIATMASIP 425


>UNIPROTKB|Q4QRC2 [details] [associations]
            symbol:Ctsql2 "Protein Ctsql2" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 HOVERGEN:HBG011513 EMBL:CH474032
            RGD:1303225 EMBL:BC097257 IPI:IPI00421946 RefSeq:NP_001002813.2
            UniGene:Rn.128678 SMR:Q4QRC2 MEROPS:C01.111
            Ensembl:ENSRNOT00000038758 GeneID:408201 KEGG:rno:408201 CTD:408201
            InParanoid:Q4QRC2 OMA:NDEGALM NextBio:696394 Genevestigator:Q4QRC2
            Uniprot:Q4QRC2
        Length = 343

 Score = 503 (182.1 bits), Expect = 3.7e-48, P = 3.7e-48
 Identities = 125/355 (35%), Positives = 198/355 (55%)

Query:     2 AKYFLIVVL--IISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
             A  FLI++   ++SG+ A           S+  ++++WK +Y + Y    E  KR  +++
Sbjct:     3 AALFLIILCLGVVSGASAFNL--------SLDVQWQEWKMKYEKLYSPEEELLKRV-VWE 53

Query:    60 DNLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSS--SL--KANGT 114
             +N+  +E  N   ++G  +Y + +N FADLT +EF    TG  +  +++  SL  +A G+
Sbjct:    54 ENVKKIELHNRENSLGKNTYIMEINNFADLTDEEFKDMITGITLPINNTMKSLWKRALGS 113

Query:   115 PF---LYKSSQVPPSVNWIEKGAVTPVKYQGQCA------VA-AVEGINAIKINRLVSLS 164
             PF    Y    +P S++W ++G VT V+ QG+C       VA A+EG    K  +L  LS
Sbjct:   114 PFPNSWYWRDALPKSIDWRKEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLS 173

Query:   165 EQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA 224
              Q LVDC+    N GC GG   +AF+Y++QN G+ ++A Y Y+G   G+C     ++  A
Sbjct:   174 VQNLVDCSKPQGNKGCRGGTTYNAFQYVLQNGGLESEATYPYKGKE-GLC-KYNPKNAYA 231

Query:   225 QITNYEDVPPNDEESLLKAVANQ-PVSVAIDA--SALQFYSGGVFNG-YCETFLNHGVTA 280
             +IT +  +P  DE+ L+ A+A + PV+  I    S+L+FY  G+++   C   +NH V  
Sbjct:   232 KITRFVALP-EDEDVLMDALATKGPVAAGIHVVYSSLRFYKKGIYHEPKCNNRVNHAVLV 290

Query:   281 VGYG---TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             VGYG      +G  YWLIKNSWG+ WG  GY ++ +D +     CGIA FA +P+
Sbjct:   291 VGYGFEGNETDGNNYWLIKNSWGKQWGLKGYMKIAKDRNN---HCGIATFAQYPI 342


>DICTYBASE|DDB_G0272298 [details] [associations]
            symbol:DDB_G0272298 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0272298 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246
            SMART:SM00848 EMBL:AAFI02000008 KO:K01365 RefSeq:XP_645281.1
            ProteinModelPortal:Q559Q3 MEROPS:C01.A53 EnsemblProtists:DDB0203746
            GeneID:8618447 KEGG:ddi:DDB_G0272298 InParanoid:Q559Q3 OMA:PANINWR
            Uniprot:Q559Q3
        Length = 305

 Score = 502 (181.8 bits), Expect = 4.7e-48, P = 4.7e-48
 Identities = 117/309 (37%), Positives = 179/309 (57%)

Query:    40 QYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTG 99
             +Y + YK + E  KRF+IF+DN   +    N      +  + LN+++DLT +EF A +  
Sbjct:     3 KYNKHYKNNKEYLKRFDIFQDNYNFILNHRNK--NGENIEMDLNEYSDLTQKEF-ADKFF 59

Query:   100 FKMSDHSSSLKAN---GTPFLYK-SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAA 148
              K+     S   N    TPF +  ++ +P S +W + GAV  VK QG CA       + A
Sbjct:    60 EKLVPEPRSGPINDIKATPFKHNVNATIPKSFDWRDHGAVGKVKNQGSCASCWSFSALGA 119

Query:   149 VEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG 208
             +EG   IK   L+ LSEQ LVDCAT     GC  G+M DAFKYII + G+  ++ Y Y G
Sbjct:   120 LEGHYYIKYGELLDLSEQNLVDCATPFGPKGCKTGWMHDAFKYIISSGGVNLESQYPYTG 179

Query:   209 MSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDASALQFY--SGGV 265
                 +C   ++E  A +++ +  +P  DE +L++A+A   PV+V ID S  +F   SGG+
Sbjct:   180 KDE-VCKFNQSEKEA-KVSGFVMIPKFDESALMEAIALYGPVAVPIDTSTKEFQHLSGGI 237

Query:   266 F-NGYCETFLN-HGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
             + +  C+ +   H V A+GYGT E G+ Y+L+KNSWG+ WG +G+F+++R +   +G+CG
Sbjct:   238 YYSDSCDPWNTIHAVLAIGYGTDENGVDYFLMKNSWGKSWGTNGFFKVKRGV---KGKCG 294

Query:   324 IAMFASFPV 332
             I   AS+P+
Sbjct:   295 IVTAASYPI 303


>ZFIN|ZDB-GENE-050626-55 [details] [associations]
            symbol:ctssb.2 "cathepsin S, b.2" species:7955
            "Danio rerio" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-050626-55
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            KO:K01368 EMBL:BC093339 IPI:IPI00507098 RefSeq:NP_001017661.1
            UniGene:Dr.132688 ProteinModelPortal:Q566T8 SMR:Q566T8
            GeneID:337572 KEGG:dre:337572 CTD:337572 InParanoid:Q566T8
            NextBio:20812306 ArrayExpress:Q566T8 Uniprot:Q566T8
        Length = 330

 Score = 501 (181.4 bits), Expect = 6.0e-48, P = 6.0e-48
 Identities = 126/330 (38%), Positives = 184/330 (55%)

Query:    16 CASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIG 74
             C S A    F++ ++ + +E WK ++ + Y    E   R E+++ NL  +   N  A++G
Sbjct:    11 CCSAALAH-FNK-NLDQHWELWKKKHVKLYSCEDEEVGRRELWERNLELIAIHNLEASMG 68

Query:    75 NRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKG 133
               SY L +N  AD+T +E +  QT   ++      K     ++  S + VP +++W +KG
Sbjct:    69 MHSYDLAINHMADMTTEEIL--QT-LAVTRVPPGFKRPTAEYVSSSFAVVPDTLDWRDKG 125

Query:   134 AVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMD 186
              VT VK QG C       +V A+EG       +LV LS Q LVDC++   N GC GG+M 
Sbjct:   126 YVTSVKNQGACGSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGYMS 185

Query:   187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
              AF+Y+I N GI +++ Y Y+G + G C        AA  T+Y+ V   DE++L +A+AN
Sbjct:   186 QAFQYVIDNGGIDSESSYPYQG-TQGSC-RYDPSQRAANCTSYKFVSQGDEQALKEALAN 243

Query:   247 -QPVSVAIDASALQF--YSGGVFNG-YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
               PVSVAIDA+  QF  Y  GV++   C   +NHGV AVGYGT   G  YWL+KNSWG  
Sbjct:   244 IGPVSVAIDATRPQFIFYRSGVYDDPSCTQKVNHGVLAVGYGTLS-GQDYWLVKNSWGAG 302

Query:   303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             +G+ GY R+ R+       CGIA  A +P+
Sbjct:   303 FGDGGYIRIARN---KNNMCGIASEACYPI 329


>UNIPROTKB|Q90686 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9031 "Gallus gallus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            CTD:1513 KO:K01371 PANTHER:PTHR12411:SF55 EMBL:U37691
            IPI:IPI00575213 RefSeq:NP_990302.1 UniGene:Gga.51509
            ProteinModelPortal:Q90686 SMR:Q90686 MEROPS:C01.036 GeneID:395818
            KEGG:gga:395818 NextBio:20815886 Uniprot:Q90686
        Length = 334

 Score = 498 (180.4 bits), Expect = 1.2e-47, P = 1.2e-47
 Identities = 121/319 (37%), Positives = 180/319 (56%)

Query:    27 EGSIAEKFEQWKAQYGRTY-KESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
             E  +  +++ WK    +   ++   N    ++ ++  V       A +G  S+ L +N  
Sbjct:    24 EPELDAQWDLWKRTIQKAVQRQGGRNVPEVDLGEEPEVHRCPQRGARLGKHSFQLAMNYL 83

Query:    86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTPVKYQGQC 144
              D+T +E + + TG ++    S  + NGT ++   SS+ P +V+W  KG VTPVK QGQC
Sbjct:    84 GDMTSEEVVRTMTGLRVP--RSRPRPNGTLYVPDWSSRAPAAVDWRRKGYVTPVKDQGQC 141

Query:   145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                    +V A+EG    +  +L+SLS Q LV C +N  NNGC GG+M +AF+Y+  N+G
Sbjct:   142 GSCWAFSSVGALEGQLKRRTGKLLSLSPQNLVYCVSN--NNGCGGGYMTNAFEYVRLNRG 199

Query:   198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS 256
             I ++  Y Y G       S   +  AA+   Y ++P ++E++L +AVA   PVSV IDAS
Sbjct:   200 IDSEDAYPYIGQDESCMYSPTGK--AAKCRGYREIPEDNEKALKRAVARIGPVSVGIDAS 257

Query:   257 --ALQFYSGGVF--NGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
               + QFYS GV+   G     +NH V AVGYG +++G K+W+IKNSWG +WG  GY  L 
Sbjct:   258 LPSFQFYSRGVYYDTGCNPENINHAVLAVGYG-AQKGTKHWIIKNSWGTEWGNKGYVLLA 316

Query:   313 RDIDQPQGQCGIAMFASFP 331
             R++ Q    CGIA  ASFP
Sbjct:   317 RNMKQT---CGIANLASFP 332


>UNIPROTKB|E9PSK9 [details] [associations]
            symbol:Ctsql2 "Protein Ctsql2" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            IPI:IPI00562656 Ensembl:ENSRNOT00000045847 RGD:1303225
            ArrayExpress:E9PSK9 Uniprot:E9PSK9
        Length = 342

 Score = 489 (177.2 bits), Expect = 1.1e-46, P = 1.1e-46
 Identities = 124/354 (35%), Positives = 192/354 (54%)

Query:     2 AKYFLIVVL--IISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
             A  FLI++   ++SG+ A           S+  ++++WK +Y + Y    E  KR  +++
Sbjct:     3 AALFLIILCLGVVSGASAFNL--------SLDVQWQEWKMKYEKLYSPEEELLKRV-VWE 53

Query:    60 DNLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSS--SL--KANGT 114
             +N+  +E  N   ++G  +Y + +N FADLT +EF    TG  +  +++  SL  +A G+
Sbjct:    54 ENVKKIELHNRENSLGKNTYIMEINNFADLTDEEFKDMITGITLPINNTMKSLWKRALGS 113

Query:   115 PF---LYKSSQVPPSVNWIEKGAVTPVKYQGQCA------VA-AVEGINAIKINRLVSLS 164
             PF    Y    +P S++W ++G VT V+ QG+C       VA A+EG    K  +L  LS
Sbjct:   114 PFPNSWYWRDALPKSIDWRKEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLS 173

Query:   165 EQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA 224
              Q LVDC+    N GC GG   +AF+Y++QN G+ ++A Y Y+G   G+C     ++  A
Sbjct:   174 VQNLVDCSKPQGNKGCRGGTTYNAFQYVLQNGGLESEATYPYKGKE-GLC-KYNPKNAYA 231

Query:   225 QITNYEDVPPNDEESLLKAVANQ-PVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAV 281
             +IT +  +P  DE+ L+ A+A + PV+  I    S   F SG      C   +NH V  V
Sbjct:   232 KITRFVALP-EDEDVLMDALATKGPVAAGIHVVYSYFHFVSGIYHEPKCNNRVNHAVLVV 290

Query:   282 GYG---TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             GYG      +G  YWLIKNSWG+ WG  GY ++ +D +     CGIA FA +P+
Sbjct:   291 GYGFEGNETDGNNYWLIKNSWGKQWGLKGYMKIAKDRNN---HCGIATFAQYPI 341


>RGD|621513 [details] [associations]
            symbol:Ctss "cathepsin S" species:10116 "Rattus norvegicus"
            [GO:0001656 "metanephros development" evidence=IEP] [GO:0002250
            "adaptive immune response" evidence=ISO] [GO:0005764 "lysosome"
            evidence=IEA;ISO] [GO:0006508 "proteolysis" evidence=IEA;ISO]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0009986 "cell
            surface" evidence=IDA] [GO:0016020 "membrane" evidence=ISO]
            [GO:0043231 "intracellular membrane-bounded organelle"
            evidence=ISO] [GO:0045453 "bone resorption" evidence=IMP]
            [GO:0051930 "regulation of sensory perception of pain"
            evidence=IMP] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            RGD:621513 GO:GO:0009986 GO:GO:0051930 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0045453
            GO:GO:0001656 HOVERGEN:HBG011513 CTD:1520 KO:K01368 MEROPS:I29.004
            BRENDA:3.4.22.27 EMBL:L03201 IPI:IPI00210228 PIR:A45087
            RefSeq:NP_059016.1 UniGene:Rn.11347 ProteinModelPortal:Q02765
            PhosphoSite:Q02765 PRIDE:Q02765 GeneID:50654 KEGG:rno:50654
            UCSC:RGD:621513 ChEMBL:CHEMBL1075217 NextBio:610462
            Genevestigator:Q02765 Uniprot:Q02765
        Length = 330

 Score = 486 (176.1 bits), Expect = 2.3e-46, P = 2.3e-46
 Identities = 128/330 (38%), Positives = 180/330 (54%)

Query:    26 DEGSIAEK------FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSY 78
             D G+ AE+      ++ WK    R   +  E   R  I++ NL  +   N   ++G  SY
Sbjct:    12 DNGATAERPTLDHHWDLWKKTRMRRNTDQNEEDVRRLIWEKNLKFIMLHNLEHSMGMHSY 71

Query:    79 TLRLNKFADLTPQEFIASQTGFKMS---DHSSSLKANGTPFLYKSSQVPPSVNWIEKGAV 135
             ++ +N   D+TP+E I      ++    + S +LK++    L      P SV+W EKG V
Sbjct:    72 SVGMNHMGDMTPEEVIGYMGSLRIPRPWNRSGTLKSSSNQTL------PDSVDWREKGCV 125

Query:   136 TPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN--NNGCYGGFMD 186
             T VKYQG C       A  A+EG   +K  +LVSLS Q LVDC+T +   N GC GGFM 
Sbjct:   126 TNVKYQGSCGSCWAFSAEGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMT 185

Query:   187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
             +AF+YII    I ++A Y Y+ M    C     ++ AA  + Y ++P  DEE+L +AVA 
Sbjct:   186 EAFQYIIDTS-IDSEASYPYKAMDEK-C-LYDPKNRAATCSRYIELPFGDEEALKEAVAT 242

Query:   247 Q-PVSVAID-ASALQF--YSGGVFNG-YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQ 301
             + PVSV ID AS   F  Y  GV++   C   +NHGV  VGYGT + G  YWL+KNSWG 
Sbjct:   243 KGPVSVGIDDASHSSFFLYQSGVYDDPSCTENMNHGVLVVGYGTLD-GKDYWLVKNSWGL 301

Query:   302 DWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
              +G+ GY R+ R+    +  CGIA + S+P
Sbjct:   302 HFGDQGYIRMARN---NKNHCGIASYCSYP 328


>ZFIN|ZDB-GENE-001205-4 [details] [associations]
            symbol:ctsk "cathepsin K" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-001205-4 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            CTD:1513 KO:K01371 OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55
            EMBL:BC092901 IPI:IPI00512751 RefSeq:NP_001017778.1
            UniGene:Dr.76224 ProteinModelPortal:Q568D6 SMR:Q568D6 GeneID:550475
            KEGG:dre:550475 InParanoid:Q568D6 NextBio:20879718
            ArrayExpress:Q568D6 Uniprot:Q568D6
        Length = 333

 Score = 485 (175.8 bits), Expect = 3.0e-46, P = 3.0e-46
 Identities = 120/322 (37%), Positives = 174/322 (54%)

Query:    24 TFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRL 82
             + D  S+ E +E WK  + R Y    E S R  I++ N++ +E  N    +G  +Y L +
Sbjct:    20 SLDNLSLDEAWESWKITHKREYNGLNEESIRRTIWEKNMLFIEAHNKEYELGIHTYDLGM 79

Query:    83 NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
             N F D+T +E      G +M  +     AN      +  ++P S+++ + G VT VK QG
Sbjct:    80 NHFGDMTLEEVAEKVMGLQMPMYRDP--ANTFVPDDRVGKLPKSIDYRKLGYVTSVKNQG 137

Query:   143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
              C       +V A+EG       +LV LS Q LVDC T   N+GC GG+M +AF+Y+  N
Sbjct:   138 SCGSCWAFSSVGALEGQLMKTKGQLVDLSPQNLVDCVTE--NDGCGGGYMTNAFRYVSNN 195

Query:   196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAID 254
             +GI ++  Y Y G     C +      AA    Y+++P  +E +L  AVAN  PVSV ID
Sbjct:   196 QGIDSEESYPYVGTDQQ-C-AYNTSGVAASCRGYKEIPQGNERALTAAVANVGPVSVGID 253

Query:   255 A--SALQFYSGGVF-NGYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
             A  S   +Y  GV+ +  C +  +NH V AVGYG +  G KYW++KNSWG++WG+ GY  
Sbjct:   254 AMQSTFLYYKSGVYYDPNCNKEDVNHAVLAVGYGATPRGKKYWIVKNSWGEEWGKKGYVL 313

Query:   311 LQRDIDQPQGQCGIAMFASFPV 332
             + R+ +     CGIA  ASFPV
Sbjct:   314 MARNRNNA---CGIANLASFPV 332


>UNIPROTKB|F7B939 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458 CTD:1512
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 EMBL:ACFV01158341
            EMBL:ACFV01158342 EMBL:ACFV01158343 RefSeq:XP_002753411.1
            Ensembl:ENSCJAT00000004397 GeneID:100413104 Uniprot:F7B939
        Length = 336

 Score = 480 (174.0 bits), Expect = 1.0e-45, P = 1.0e-45
 Identities = 119/341 (34%), Positives = 183/341 (53%)

Query:    10 LIISGSCASQATYRTFDEGSI--AEKF--EQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
             L+ +G C   A  R   E S+   EKF  + W A++ +TY    E  +R + F  N   +
Sbjct:     7 LLCAGVCLLGAPARGAAELSVNSLEKFHFKSWMAKHHKTYSREEEYHQRLQTFASNWRKI 66

Query:    66 ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPP 125
                NN   GN ++ + +N+F+D++  E I  +  +    + S+ K+N   +L  +   PP
Sbjct:    67 NAHNN---GNHTFKMAVNQFSDMSFAE-IKRKYLWSEPQNCSATKSN---YLRGTGPYPP 119

Query:   126 SVNWIEKGA-VTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
             SV+W +KG  V+PVK QG C          A+E   AI   +++SL+EQQLVDCA + NN
Sbjct:   120 SVDWRKKGHFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNN 179

Query:   178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
             +GC GG    AF+YI+ N GI  +  Y Y+G  +  C   +       + +  ++   DE
Sbjct:   180 HGCQGGLPSQAFEYILYNNGIMGEDTYPYQGKDSD-C-KFQPGKAIGFVKDVANITIYDE 237

Query:   238 ESLLKAVA-NQPVSVAIDASA-LQFYSGGVFNGY-CETF---LNHGVTAVGYGTSEEGIK 291
             +++++AVA   PVS A + +     Y  G+++   C      +NH V AVGYG  E GI 
Sbjct:   238 DAMVEAVALYNPVSFAFEVTQDFMMYKRGIYSSTSCHKTPDKVNHAVLAVGYG-EENGIP 296

Query:   292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             YW++KNSWG  WG +GYF ++R     +  CG+A  AS+PV
Sbjct:   297 YWIVKNSWGPQWGMNGYFLIERG----KNMCGLAACASYPV 333


>TAIR|locus:2078312 [details] [associations]
            symbol:AT3G45310 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005773 EMBL:CP002686
            GenomeReviews:BA000014_GR eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AL132953
            EMBL:AY091771 IPI:IPI00540369 PIR:T47471 RefSeq:NP_566880.1
            UniGene:At.25239 ProteinModelPortal:Q8RWQ9 SMR:Q8RWQ9
            MEROPS:C01.162 PaxDb:Q8RWQ9 PRIDE:Q8RWQ9 EnsemblPlants:AT3G45310.1
            GeneID:823669 KEGG:ath:AT3G45310 GeneFarm:5032 TAIR:At3g45310
            InParanoid:Q8RWQ9 KO:K01366 OMA:AFEVVHE PhylomeDB:Q8RWQ9
            ProtClustDB:CLSN2689015 Genevestigator:Q8RWQ9 Uniprot:Q8RWQ9
        Length = 358

 Score = 479 (173.7 bits), Expect = 1.3e-45, P = 1.3e-45
 Identities = 118/312 (37%), Positives = 170/312 (54%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F ++  +YG+ Y+   E   RF +FK+NL  +   N   +   SY L LN+FADLT QEF
Sbjct:    59 FSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGL---SYKLSLNQFADLTWQEF 115

Query:    94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------V 146
                + G    + S++LK  G+  + +++ VP + +W E G V+PVK QG C         
Sbjct:   116 QRYKLG-AAQNCSATLK--GSHKITEAT-VPDTKDWREDGIVSPVKEQGHCGSCWTFSTT 171

Query:   147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
              A+E        + +SLSEQQLVDCA   NN GC+GG    AF+YI  N G+  +  Y Y
Sbjct:   172 GALEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPY 231

Query:   207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA-SALQFYSGG 264
              G   G C    A++   Q+ +  ++    E+ L  AV   +PVSVA +     +FY  G
Sbjct:   232 TGKDGG-C-KFSAKNIGVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVHEFRFYKKG 289

Query:   265 VFN----GYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
             VF     G     +NH V AVGYG  E+ + YWLIKNSWG +WG++GYF+++      + 
Sbjct:   290 VFTSNTCGNTPMDVNHAVLAVGYGV-EDDVPYWLIKNSWGGEWGDNGYFKMEMG----KN 344

Query:   321 QCGIAMFASFPV 332
              CG+A  +S+PV
Sbjct:   345 MCGVATCSSYPV 356


>UNIPROTKB|Q3T0I2 [details] [associations]
            symbol:CTSH "Pro-cathepsin H" species:9913 "Bos taurus"
            [GO:0031638 "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISS] [GO:0010815 "bradykinin
            catabolic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0010813 "neuropeptide catabolic
            process" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0045766 "positive regulation of angiogenesis" evidence=ISS]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=ISS] [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0033619 "membrane protein proteolysis" evidence=ISS]
            [GO:0043066 "negative regulation of apoptotic process"
            evidence=ISS] [GO:0004252 "serine-type endopeptidase activity"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0016505 "apoptotic protease activator activity"
            evidence=ISS] [GO:0010952 "positive regulation of peptidase
            activity" evidence=ISS] [GO:0030108 "HLA-A specific activating MHC
            class I receptor activity" evidence=ISS] [GO:0002764 "immune
            response-regulating signaling pathway" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0070324 "thyroid
            hormone binding" evidence=ISS] [GO:0006508 "proteolysis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0097208
            "alveolar lamellar body" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004175
            "endopeptidase activity" evidence=ISS] [GO:0032526 "response to
            retinoic acid" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 EMBL:BC102386 IPI:IPI00693034
            RefSeq:NP_001029557.1 UniGene:Bt.52393 ProteinModelPortal:Q3T0I2
            SMR:Q3T0I2 STRING:Q3T0I2 MEROPS:C01.040 PRIDE:Q3T0I2
            Ensembl:ENSBTAT00000014593 GeneID:510524 KEGG:bta:510524 CTD:1512
            InParanoid:Q3T0I2 OMA:STSCHKT OrthoDB:EOG4W9J43 NextBio:20869490
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 Uniprot:Q3T0I2
        Length = 335

 Score = 478 (173.3 bits), Expect = 1.6e-45, P = 1.6e-45
 Identities = 117/313 (37%), Positives = 172/313 (54%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F+ W  Q+ + Y  S E   R + F  NL  +   +NA   N ++ + LN+F+D++  E 
Sbjct:    35 FQSWMVQHQKKYS-SEEYYHRLQAFASNLREINA-HNAR--NHTFKMGLNQFSDMSFDE- 89

Query:    94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGA-VTPVKYQGQCA------- 145
             +  +  +    + S+ K+N   +L  +   PPS++W +KG  VTPVK QG C        
Sbjct:    90 LKRKYLWSEPQNCSATKSN---YLRGTGPYPPSMDWRKKGNFVTPVKNQGSCGSCWTFST 146

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
               A+E   AI   +L  L+EQQLVDCA N NN+GC GG    AF+YI  NKGI  +  Y 
Sbjct:   147 TGALESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYP 206

Query:   206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA-NQPVSVAIDASA-LQFYSG 263
             Y G   G C   +     A + +  ++  NDEE++++AVA + PVS A + +A    Y  
Sbjct:   207 YRGQD-GDC-KYQPSKAIAFVKDVANITLNDEEAMVEAVALHNPVSFAFEVTADFMMYRK 264

Query:   264 GVFNGY-CETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
             G+++   C      +NH V AVGYG  E+GI YW++KNSWG +WG  GYF ++R     +
Sbjct:   265 GIYSSTSCHKTPDKVNHAVLAVGYG-EEKGIPYWIVKNSWGPNWGMKGYFLIERG----K 319

Query:   320 GQCGIAMFASFPV 332
               CG+A  ASFP+
Sbjct:   320 NMCGLAACASFPI 332


>RGD|2447 [details] [associations]
            symbol:Ctsh "cathepsin H" species:10116 "Rattus norvegicus"
          [GO:0001520 "outer dense fiber" evidence=IDA] [GO:0001656
          "metanephros development" evidence=IEP] [GO:0001669 "acrosomal
          vesicle" evidence=IDA] [GO:0001913 "T cell mediated cytotoxicity"
          evidence=ISO;ISS] [GO:0002250 "adaptive immune response"
          evidence=ISO] [GO:0002764 "immune response-regulating signaling
          pathway" evidence=ISO;ISS] [GO:0004175 "endopeptidase activity"
          evidence=ISO] [GO:0004177 "aminopeptidase activity" evidence=ISO;IDA]
          [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISO;ISS]
          [GO:0004252 "serine-type endopeptidase activity" evidence=ISO;ISS]
          [GO:0005615 "extracellular space" evidence=ISO;ISS;IDA] [GO:0005764
          "lysosome" evidence=ISO;ISS;IDA] [GO:0005829 "cytosol"
          evidence=ISO;ISS] [GO:0006508 "proteolysis" evidence=IEP;ISO]
          [GO:0007283 "spermatogenesis" evidence=IEP] [GO:0008233 "peptidase
          activity" evidence=ISO] [GO:0008234 "cysteine-type peptidase
          activity" evidence=ISO] [GO:0008284 "positive regulation of cell
          proliferation" evidence=ISO;ISS] [GO:0010628 "positive regulation of
          gene expression" evidence=ISO;ISS] [GO:0010634 "positive regulation
          of epithelial cell migration" evidence=ISO;ISS] [GO:0010813
          "neuropeptide catabolic process" evidence=ISO;ISS] [GO:0010815
          "bradykinin catabolic process" evidence=ISO;ISS] [GO:0010952
          "positive regulation of peptidase activity" evidence=ISO;ISS]
          [GO:0016505 "apoptotic protease activator activity" evidence=ISO;ISS]
          [GO:0030108 "HLA-A specific activating MHC class I receptor activity"
          evidence=ISO;ISS] [GO:0030335 "positive regulation of cell migration"
          evidence=ISO;ISS] [GO:0030984 "kininogen binding" evidence=IPI]
          [GO:0031638 "zymogen activation" evidence=ISO;ISS] [GO:0031648
          "protein destabilization" evidence=ISO;ISS] [GO:0032403 "protein
          complex binding" evidence=IPI] [GO:0032526 "response to retinoic
          acid" evidence=ISO;ISS] [GO:0033619 "membrane protein proteolysis"
          evidence=ISO;ISS] [GO:0035085 "cilium axoneme" evidence=IDA]
          [GO:0043066 "negative regulation of apoptotic process"
          evidence=ISO;ISS] [GO:0043129 "surfactant homeostasis"
          evidence=ISO;ISS] [GO:0043621 "protein self-association"
          evidence=IDA] [GO:0045766 "positive regulation of angiogenesis"
          evidence=ISO;ISS] [GO:0060448 "dichotomous subdivision of terminal
          units involved in lung branching" evidence=ISO;ISS] [GO:0070324
          "thyroid hormone binding" evidence=ISO;ISS] [GO:0070371 "ERK1 and
          ERK2 cascade" evidence=ISO;ISS] [GO:0097067 "cellular response to
          thyroid hormone stimulus" evidence=ISO;IEP] [GO:0097208 "alveolar
          lamellar body" evidence=ISO;ISS;IDA] InterPro:IPR000668
          InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
          InterPro:IPR000169 RGD:2447 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
          GO:GO:0008284 GO:GO:0070371 GO:GO:0001669 eggNOG:COG4870
          HOGENOM:HOG000230774 InterPro:IPR025661 InterPro:IPR025660
          InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
          PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0007283
          GO:GO:0045766 GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
          GO:GO:0043621 GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 KO:K01366
          GO:GO:0016505 GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
          HOVERGEN:HBG011513 GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
          GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT OrthoDB:EOG4W9J43
          GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
          GO:GO:0010813 GO:GO:0043129 MEROPS:I29.003 EMBL:Y00708 EMBL:BC085352
          EMBL:M38135 IPI:IPI00212809 PIR:S00211 RefSeq:NP_037071.1
          UniGene:Rn.1997 ProteinModelPortal:P00786 SMR:P00786 STRING:P00786
          PRIDE:P00786 Ensembl:ENSRNOT00000019285 GeneID:25425 KEGG:rno:25425
          UCSC:RGD:2447 InParanoid:P00786 BindingDB:P00786 NextBio:606599
          Genevestigator:P00786 GermOnline:ENSRNOG00000014064 GO:GO:0035086
          GO:GO:0001520 Uniprot:P00786
        Length = 333

 Score = 476 (172.6 bits), Expect = 2.7e-45, P = 2.7e-45
 Identities = 113/313 (36%), Positives = 173/313 (55%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F  W  Q+ +TY  S E S R ++F +N   ++  N     N ++ + LN+F+D++  E 
Sbjct:    33 FTSWMKQHQKTYS-SREYSHRLQVFANNWRKIQAHNQR---NHTFKMGLNQFSDMSFAE- 87

Query:    94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKG-AVTPVKYQGQCA------- 145
             I  +  +    + S+ K+N   +L  +   P S++W +KG  V+PVK QG C        
Sbjct:    88 IKHKYLWSEPQNCSATKSN---YLRGTGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFST 144

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
               A+E   AI   ++++L+EQQLVDCA N NN+GC GG    AF+YI+ NKGI  +  Y 
Sbjct:   145 TGALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYP 204

Query:   206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA-NQPVSVAIDASA-LQFYSG 263
             Y G + G C     E   A + N  ++  NDE ++++AVA   PVS A + +     Y  
Sbjct:   205 YIGKN-GQC-KFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTEDFMMYKS 262

Query:   264 GVFNGY-CETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
             GV++   C      +NH V AVGYG  + G+ YW++KNSWG +WG +GYF ++R     +
Sbjct:   263 GVYSSNSCHKTPDKVNHAVLAVGYG-EQNGLLYWIVKNSWGSNWGNNGYFLIERG----K 317

Query:   320 GQCGIAMFASFPV 332
               CG+A  AS+P+
Sbjct:   318 NMCGLAACASYPI 330


>UNIPROTKB|D3ZZR3 [details] [associations]
            symbol:D3ZZR3 "Uncharacterized protein" species:10116
            "Rattus norvegicus" [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0016020 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0002250 GeneTree:ENSGT00560000076577 GO:GO:0097067
            OrthoDB:EOG4JM7Q2 IPI:IPI00210228 PRIDE:D3ZZR3
            Ensembl:ENSRNOT00000028732 Uniprot:D3ZZR3
        Length = 331

 Score = 476 (172.6 bits), Expect = 2.7e-45, P = 2.7e-45
 Identities = 122/327 (37%), Positives = 175/327 (53%)

Query:    26 DEGSIAEK-----FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYT 79
             D G+ AE+     ++ WK  + + YK+  E   R  I++ NL  +   N   ++G  SY+
Sbjct:    12 DNGATAERPLDHHWDLWKKTHEKEYKDQNEEDVRRLIWEKNLKFIMLHNLEHSMGMHSYS 71

Query:    80 LRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIE--KGAVTP 137
             + +N   D+  +  I      ++       KA G      +  +P  V W E  KG    
Sbjct:    72 VGMNHMGDMVAETIIGEMGSERLP---RKRKALGLIPSSVNQNLPAGVKWKERTKGCWKN 128

Query:   138 VKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN--NNGCYGGFMDDA 188
             + +QG C       AV A+EG   +K  +LVSLS Q LVDC+T +   N GC GGFM +A
Sbjct:   129 LVFQGSCGSCWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEA 188

Query:   189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
             F+YII N GI ++A Y Y+ M    C     ++ AA  + Y ++P  DEE+L +AVA + 
Sbjct:   189 FQYIIDNGGIDSEASYPYKAMDEK-CH-YDPKNRAATCSRYIELPFGDEEALKEAVATKG 246

Query:   248 PVSVAIDASALQF--YSGGVFNG-YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
             PVSV IDAS   F  Y  GV++   C   +NHGV  VGYGT + G  YWL+KNSWG  +G
Sbjct:   247 PVSVGIDASHSSFFLYQSGVYDDPSCTENVNHGVLVVGYGTLD-GKDYWLVKNSWGLHFG 305

Query:   305 EDGYFRLQRDIDQPQGQCGIAMFASFP 331
             + GY R+ R+    +  CGIA + S+P
Sbjct:   306 DQGYIRMARN---NKNHCGIASYCSYP 329


>UNIPROTKB|F1NEC8 [details] [associations]
            symbol:CTSL2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:AADN02067812 IPI:IPI00820956 Ensembl:ENSGALT00000037988
            ArrayExpress:F1NEC8 Uniprot:F1NEC8
        Length = 218

 Score = 475 (172.3 bits), Expect = 3.4e-45, P = 3.4e-45
 Identities = 105/221 (47%), Positives = 136/221 (61%)

Query:   124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
             P SV+W EKG VTPVK QGQC          A+EG +  K  +LVSLSEQ LVDC+  + 
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEG 61

Query:   177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
             N GC GG MD AF+Y+  N GI ++  Y Y       C   KAE +AA  T + D+P   
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDC-RYKAEYNAANDTGFVDIPQGH 120

Query:   237 EESLLKAVANQ-PVSVAIDA--SALQFYSGGVF-NGYCETF-LNHGVTAVGYGTSEEGIK 291
             E +L+KAVA+  PVSVAIDA  S+ QFY  G++    C +  L+HGV  VGYG  E+G K
Sbjct:   121 ERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGF-EDGKK 179

Query:   292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             YW++KNSWG+ WG+ GY  + +D    +  CGIA  AS+P+
Sbjct:   180 YWIVKNSWGEKWGDKGYIYMAKD---RKNHCGIATAASYPL 217


>UNIPROTKB|G3SSC1 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9785
            "Loxodonta africana" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 RefSeq:XP_003413898.1
            Ensembl:ENSLAFT00000003415 GeneID:100662496 Uniprot:G3SSC1
        Length = 335

 Score = 474 (171.9 bits), Expect = 4.4e-45, P = 4.4e-45
 Identities = 117/313 (37%), Positives = 175/313 (55%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F+ W AQ+ + Y  S E  +R + F  N   +   +NA   N ++ + LN+F+D+T  E 
Sbjct:    35 FQSWMAQHQKKYS-SEEYHQRQQTFVSNWRKINA-HNAR--NHTFKMALNQFSDMTFAE- 89

Query:    94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGA-VTPVKYQGQCA------- 145
             I  +  +    + S+ K N   +L  +   PP V+W +KG  V+PVK QG C        
Sbjct:    90 IKQKYLWSEPQNCSATKGN---YLRGTGPYPPFVDWRKKGHFVSPVKNQGACGSCWTFST 146

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
               A+E   AI   +L+SL+EQQLVDCA + NN+GC GG    AF+YI+ NKGI  +  Y 
Sbjct:   147 TGALESAIAIAGGKLLSLAEQQLVDCAKDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYP 206

Query:   206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA-NQPVSVAIDASA-LQFYSG 263
             Y+G    +C   + +   A + +  ++  NDEE++++AVA   PVS A + +     YS 
Sbjct:   207 YKGQDD-VC-KFQPKKAIAFVKDVANITLNDEEAMVEAVALYNPVSFAFEVTDDFMKYSK 264

Query:   264 GVFNGY-CETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
             G+++   C      +NH V AVGYG  E+GI YW++KNSWG  WG DGYF ++R     +
Sbjct:   265 GIYSSTSCHKTPDKVNHAVLAVGYG-EEKGIPYWIVKNSWGPYWGMDGYFLIERG----K 319

Query:   320 GQCGIAMFASFPV 332
               CG+A  AS+P+
Sbjct:   320 NMCGLAACASYPI 332


>UNIPROTKB|F7BRD4 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001656 "metanephros development"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0001656
            GeneTree:ENSGT00660000095458 EMBL:ACFV01158341 EMBL:ACFV01158342
            EMBL:ACFV01158343 Ensembl:ENSCJAT00000004396 Uniprot:F7BRD4
        Length = 336

 Score = 471 (170.9 bits), Expect = 9.1e-45, P = 9.1e-45
 Identities = 110/313 (35%), Positives = 171/313 (54%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F+ W A++ +TY    E  +R + F  N   +   NN   GN ++ + +N+F+D++  E 
Sbjct:    35 FKSWMAKHHKTYSREEEYHQRLQTFASNWRKINAHNN---GNHTFKMAVNQFSDMSFAE- 90

Query:    94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGA-VTPVKYQGQCA------- 145
             I  +  +    + S+ K+N   +L  +   PPSV+W +KG  V+PVK QG C        
Sbjct:    91 IKRKYLWSEPQNCSATKSN---YLRGTGPYPPSVDWRKKGHFVSPVKNQGACGSCWTFST 147

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
               A+E   AI   +++SL+EQQLVDCA + NN+GC GG    AF+YI+ N GI  +  Y 
Sbjct:   148 TGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNNGIMGEDTYP 207

Query:   206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA-NQPVSVAIDASA-LQFYSG 263
             Y+G  +  C   +       + +  ++   DE+++++AVA   PVS A + +     Y  
Sbjct:   208 YQGKDSD-C-KFQPGKAIGFVKDVANITIYDEDAMVEAVALYNPVSFAFEVTQDFMMYKR 265

Query:   264 GVFNGY-CETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
             G+++   C      +NH V AVGYG  E GI YW++KNSWG  WG +GYF ++R     +
Sbjct:   266 GIYSSTSCHKTPDKVNHAVLAVGYG-EENGIPYWIVKNSWGPQWGMNGYFLIERG----K 320

Query:   320 GQCGIAMFASFPV 332
               CG+A  AS+PV
Sbjct:   321 NMCGLAACASYPV 333


>UNIPROTKB|F6R7P5 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9544 "Macaca
            mulatta" [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=ISS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0004252
            "serine-type endopeptidase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005829 "cytosol" evidence=ISS] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0010815 "bradykinin catabolic
            process" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0016505 "apoptotic protease
            activator activity" evidence=ISS] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=ISS] [GO:0030335
            "positive regulation of cell migration" evidence=ISS] [GO:0031638
            "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0032526 "response to retinoic
            acid" evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            GO:GO:0043066 GO:GO:0005615 GO:GO:0008284 GO:GO:0070371
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458 CTD:1512
            OMA:STSCHKT GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 RefSeq:XP_001108862.1
            UniGene:Mmu.3000 Ensembl:ENSMMUT00000014095 GeneID:711437
            KEGG:mcc:711437 NextBio:19969972 Uniprot:F6R7P5
        Length = 335

 Score = 469 (170.2 bits), Expect = 1.5e-44, P = 1.5e-44
 Identities = 111/313 (35%), Positives = 172/313 (54%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F+ W +++ +TY  + E   R + F  N   +   NN   GN ++ + LN+F+D++  E 
Sbjct:    35 FKSWMSKHHKTYS-TEEYHHRMQTFASNWRKINAHNN---GNHTFKMALNQFSDMSFAE- 89

Query:    94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGA-VTPVKYQGQCA------- 145
             I  +  +    + S+ K+N   +L  +   PPS++W +KG  V+PVK QG C        
Sbjct:    90 IKHKYLWSEPQNCSATKSN---YLRGTGPYPPSMDWRKKGNFVSPVKNQGACGSCWTFST 146

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
               A+E   AI   +++SL+EQQLVDCA + NN+GC GG    AF+YI+ NKGI  +  Y 
Sbjct:   147 TGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYP 206

Query:   206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA-NQPVSVAIDASA-LQFYSG 263
             Y+G   G C   +       + +  ++   DEE++++AVA   PVS A + +     Y  
Sbjct:   207 YQGKD-GDC-KFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMIYKT 264

Query:   264 GVFNGY-CETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
             G+++   C      +NH V AVGYG  E GI YW++KNSWG  WG +GYF ++R     +
Sbjct:   265 GIYSSTSCHKTPDKVNHAVLAVGYG-EENGIPYWIVKNSWGPQWGMNGYFLIERG----K 319

Query:   320 GQCGIAMFASFPV 332
               CG+A  AS+P+
Sbjct:   320 NMCGLAACASYPI 332


>UNIPROTKB|P09668 [details] [associations]
            symbol:CTSH "Pro-cathepsin H" species:9606 "Homo sapiens"
            [GO:0001520 "outer dense fiber" evidence=IEA] [GO:0001669
            "acrosomal vesicle" evidence=IEA] [GO:0007283 "spermatogenesis"
            evidence=IEA] [GO:0030984 "kininogen binding" evidence=IEA]
            [GO:0032403 "protein complex binding" evidence=IEA] [GO:0043621
            "protein self-association" evidence=IEA] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0031648 "protein destabilization"
            evidence=IMP] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=ISS] [GO:0032526 "response to retinoic acid"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0030108 "HLA-A
            specific activating MHC class I receptor activity" evidence=IDA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEP] [GO:0010813 "neuropeptide catabolic process"
            evidence=IDA] [GO:0010815 "bradykinin catabolic process"
            evidence=IDA] [GO:0030335 "positive regulation of cell migration"
            evidence=IDA] [GO:0070371 "ERK1 and ERK2 cascade" evidence=IDA]
            [GO:0010628 "positive regulation of gene expression" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IDA;TAS] [GO:0031638 "zymogen
            activation" evidence=IDA] [GO:0016505 "apoptotic protease activator
            activity" evidence=IDA] [GO:0010952 "positive regulation of
            peptidase activity" evidence=IDA] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=ISS] [GO:0004252 "serine-type endopeptidase
            activity" evidence=ISS] [GO:0043066 "negative regulation of
            apoptotic process" evidence=ISS] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0033619 "membrane protein proteolysis"
            evidence=IDA] [GO:0004175 "endopeptidase activity" evidence=IDA]
            [GO:0004177 "aminopeptidase activity" evidence=IDA] [GO:0005764
            "lysosome" evidence=IDA] [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0002250 "adaptive immune response" evidence=IEP]
            [GO:0019882 "antigen processing and presentation" evidence=TAS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0070324 "thyroid hormone binding" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0008284
            "positive regulation of cell proliferation" evidence=ISS]
            [GO:0045766 "positive regulation of angiogenesis" evidence=ISS]
            [GO:0008233 "peptidase activity" evidence=IDA] [GO:0097208
            "alveolar lamellar body" evidence=IDA] [GO:0043129 "surfactant
            homeostasis" evidence=IDA] [GO:0004197 "cysteine-type endopeptidase
            activity" evidence=IDA;TAS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 Reactome:REACT_6900 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913 MEROPS:C01.040 CTD:1512
            OMA:STSCHKT OrthoDB:EOG4W9J43 GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 EMBL:X16832 EMBL:AF426247 EMBL:AK314698 EMBL:AC011944
            EMBL:BC002479 EMBL:X07549 IPI:IPI00297487 PIR:S12486
            RefSeq:NP_004381.2 UniGene:Hs.148641 PDB:1BZN PDBsum:1BZN
            ProteinModelPortal:P09668 SMR:P09668 IntAct:P09668 STRING:P09668
            PhosphoSite:P09668 DMDM:288558851 PaxDb:P09668 PRIDE:P09668
            DNASU:1512 Ensembl:ENST00000220166 GeneID:1512 KEGG:hsa:1512
            UCSC:uc021srk.1 GeneCards:GC15M079213 H-InvDB:HIX0012481
            HGNC:HGNC:2535 HPA:CAB000458 HPA:HPA003524 MIM:116820
            neXtProt:NX_P09668 PharmGKB:PA27033 InParanoid:P09668
            PhylomeDB:P09668 BRENDA:3.4.22.16 ChEMBL:CHEMBL2225 GenomeRNAi:1512
            NextBio:6261 ArrayExpress:P09668 Bgee:P09668 CleanEx:HS_CTSH
            Genevestigator:P09668 GermOnline:ENSG00000103811 GO:GO:0019882
            Uniprot:P09668
        Length = 335

 Score = 468 (169.8 bits), Expect = 1.9e-44, P = 1.9e-44
 Identities = 111/313 (35%), Positives = 172/313 (54%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F+ W +++ +TY  + E   R + F  N   +   NN   GN ++ + LN+F+D++  E 
Sbjct:    35 FKSWMSKHRKTYS-TEEYHHRLQTFASNWRKINAHNN---GNHTFKMALNQFSDMSFAE- 89

Query:    94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGA-VTPVKYQGQCA------- 145
             I  +  +    + S+ K+N   +L  +   PPSV+W +KG  V+PVK QG C        
Sbjct:    90 IKHKYLWSEPQNCSATKSN---YLRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFST 146

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
               A+E   AI   +++SL+EQQLVDCA + NN+GC GG    AF+YI+ NKGI  +  Y 
Sbjct:   147 TGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYP 206

Query:   206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA-NQPVSVAIDASA-LQFYSG 263
             Y+G   G C   +       + +  ++   DEE++++AVA   PVS A + +     Y  
Sbjct:   207 YQGKD-GYC-KFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRT 264

Query:   264 GVFNGY-CETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
             G+++   C      +NH V AVGYG  + GI YW++KNSWG  WG +GYF ++R     +
Sbjct:   265 GIYSSTSCHKTPDKVNHAVLAVGYG-EKNGIPYWIVKNSWGPQWGMNGYFLIERG----K 319

Query:   320 GQCGIAMFASFPV 332
               CG+A  AS+P+
Sbjct:   320 NMCGLAACASYPI 332


>UNIPROTKB|G1RBY1 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:61853
            "Nomascus leucogenys" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 EMBL:ADFV01087552 RefSeq:XP_003275518.1
            Ensembl:ENSNLET00000011249 GeneID:100584322 Uniprot:G1RBY1
        Length = 335

 Score = 468 (169.8 bits), Expect = 1.9e-44, P = 1.9e-44
 Identities = 110/313 (35%), Positives = 173/313 (55%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F+ W +++ +TY  + E   R ++F  N   +   NN   GN ++ + LN+F+D++  E 
Sbjct:    35 FKSWMSKHHKTYS-TEEYHHRLQMFASNWRKINAHNN---GNHTFKMALNQFSDMSFAE- 89

Query:    94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGA-VTPVKYQGQCA------- 145
             I  +  +    + S+ K+N   +L  +   PPS++W +KG  V+PVK QG C        
Sbjct:    90 IKHKYLWSEPQNCSATKSN---YLRGTGPYPPSMDWRKKGNFVSPVKNQGACGSCWTFST 146

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
               A+E   AI   +++SL+EQQLVDCA + NN+GC GG    AF+YI+ NKGI  +  Y 
Sbjct:   147 TGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYP 206

Query:   206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA-NQPVSVAIDASA-LQFYSG 263
             Y+G   G C   +       + +  ++   DEE++++AVA   PVS A + +     Y  
Sbjct:   207 YQGKD-GYC-KFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRR 264

Query:   264 GVFNGY-CETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
             G+++   C      +NH V AVGYG  + GI YW++KNSWG  WG +GYF ++R     +
Sbjct:   265 GIYSSTSCHKTPDKVNHAVLAVGYG-EKNGIPYWIVKNSWGPQWGMNGYFLIERG----K 319

Query:   320 GQCGIAMFASFPV 332
               CG+A  AS+P+
Sbjct:   320 NMCGLAACASYPI 332


>UNIPROTKB|O46427 [details] [associations]
            symbol:CTSH "Pro-cathepsin H" species:9823 "Sus scrofa"
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0032526 "response to retinoic acid" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0043129
            "surfactant homeostasis" evidence=ISS] [GO:0010815 "bradykinin
            catabolic process" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0005829 "cytosol"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0030335 "positive regulation of cell
            migration" evidence=ISS] [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0016505 "apoptotic protease activator
            activity" evidence=ISS] [GO:0004252 "serine-type endopeptidase
            activity" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=ISS] [GO:0031638 "zymogen activation"
            evidence=ISS] [GO:0030108 "HLA-A specific activating MHC class I
            receptor activity" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0010628 "positive regulation of gene
            expression" evidence=ISS] [GO:0070324 "thyroid hormone binding"
            evidence=ISS] [GO:0006508 "proteolysis" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0060448
            "dichotomous subdivision of terminal units involved in lung
            branching" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] [GO:0005764 "lysosome" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0004177 "aminopeptidase
            activity" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=ISS] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 MEROPS:C01.040 CTD:1512 OMA:STSCHKT
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 EMBL:AF001169
            RefSeq:NP_999094.1 UniGene:Ssc.3593 PDB:1NB3 PDB:1NB5 PDB:8PCH
            PDBsum:1NB3 PDBsum:1NB5 PDBsum:8PCH ProteinModelPortal:O46427
            SMR:O46427 Ensembl:ENSSSCT00000001983 GeneID:396969 KEGG:ssc:396969
            EvolutionaryTrace:O46427 ArrayExpress:O46427 Uniprot:O46427
        Length = 335

 Score = 467 (169.5 bits), Expect = 2.4e-44, P = 2.4e-44
 Identities = 113/313 (36%), Positives = 169/313 (53%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F+ W  Q+ + Y    E   R ++F  N   +   N    GN ++ L LN+F+D++  E 
Sbjct:    35 FKSWMVQHQKKYSLE-EYHHRLQVFVSNWRKINAHN---AGNHTFKLGLNQFSDMSFDE- 89

Query:    94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGA-VTPVKYQGQCA------- 145
             I  +  +    + S+ K N   +L  +   PPS++W +KG  V+PVK QG C        
Sbjct:    90 IRHKYLWSEPQNCSATKGN---YLRGTGPYPPSMDWRKKGNFVSPVKNQGSCGSCWTFST 146

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
               A+E   AI   +++SL+EQQLVDCA N NN+GC GG    AF+YI  NKGI  +  Y 
Sbjct:   147 TGALESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYP 206

Query:   206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA-NQPVSVAIDASA-LQFYSG 263
             Y+G     C   + +   A + +  ++  NDEE++++AVA   PVS A + +     Y  
Sbjct:   207 YKGQDDH-C-KFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFAFEVTNDFLMYRK 264

Query:   264 GVFNGY-CETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
             G+++   C      +NH V AVGYG  E GI YW++KNSWG  WG +GYF ++R     +
Sbjct:   265 GIYSSTSCHKTPDKVNHAVLAVGYG-EENGIPYWIVKNSWGPQWGMNGYFLIERG----K 319

Query:   320 GQCGIAMFASFPV 332
               CG+A  AS+P+
Sbjct:   320 NMCGLAACASYPI 332


>UNIPROTKB|G1SQF0 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9986
            "Oryctolagus cuniculus" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 RefSeq:XP_002721635.1 UniGene:Ocu.7137
            Ensembl:ENSOCUT00000006138 GeneID:100101597 Uniprot:G1SQF0
        Length = 333

 Score = 467 (169.5 bits), Expect = 2.4e-44, P = 2.4e-44
 Identities = 113/313 (36%), Positives = 173/313 (55%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F+ W +Q+ + Y  + E  +R + F  N   +   NN   GN ++ + LN+F+D++  E 
Sbjct:    33 FKSWMSQHHKKYS-AEEYPRRLQTFVRNWRKINAHNN---GNHTFQMGLNQFSDMSFAE- 87

Query:    94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGA-VTPVKYQGQCA------- 145
             I  +  +    + S+ K+N   +L  +   P SV+W +KG  V+PVK QG C        
Sbjct:    88 IKHKYLWTEPQNCSATKSN---YLRGTGPYPSSVDWRKKGNFVSPVKNQGACGSCWTFST 144

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
               A+E   AI   +++SL+EQQLVDCA N NN+GC GG    AF+YI+ NKGI  +  Y 
Sbjct:   145 TGALESAVAIAGGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQAFEYILYNKGIMGEDSYP 204

Query:   206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA-NQPVSVAIDASA-LQFYSG 263
             Y  M  G C   + +   A + +  ++  NDEE++++AVA   PVS A + +     Y  
Sbjct:   205 YRAME-GRC-KFQPQKAIAFVKDVANITLNDEEAMVEAVALYNPVSFAFEVTEDFMQYRK 262

Query:   264 GVFNGY-CETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
             G+++   C      +NH V AVGYG  E G+ YW++KNSWG  WG +GYF ++R     +
Sbjct:   263 GIYSSTSCHKTPDKVNHAVLAVGYG-EENGVPYWIVKNSWGSHWGMNGYFYIERG----K 317

Query:   320 GQCGIAMFASFPV 332
               CG+A  AS+P+
Sbjct:   318 NMCGLAACASYPI 330


>UNIPROTKB|G3R9A7 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9595 "Gorilla
            gorilla gorilla" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 OMA:STSCHKT GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 RefSeq:XP_004056662.1 Ensembl:ENSGGOT00000012331
            GeneID:101144312 Uniprot:G3R9A7
        Length = 335

 Score = 466 (169.1 bits), Expect = 3.1e-44, P = 3.1e-44
 Identities = 111/313 (35%), Positives = 171/313 (54%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F  W +++ +TY  + E   R + F  N   +   NN   GN ++ + LN+F+D++  E 
Sbjct:    35 FRSWMSKHRKTYS-TEEYHHRLQTFASNWRKINAHNN---GNHTFKMALNQFSDMSFAE- 89

Query:    94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGA-VTPVKYQGQCA------- 145
             I  +  +    + S+ K+N   +L  +   PPSV+W +KG  V+PVK QG C        
Sbjct:    90 IKHKYLWSEPQNCSATKSN---YLRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFST 146

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
               A+E   AI   +++SL+EQQLVDCA + NN+GC GG    AF+YI+ NKGI  +  Y 
Sbjct:   147 TGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYP 206

Query:   206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA-NQPVSVAIDASA-LQFYSG 263
             Y+G   G C   +       + +  ++   DEE++++AVA   PVS A + +     Y  
Sbjct:   207 YQGKD-GYC-KFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRT 264

Query:   264 GVFNGY-CETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
             G+++   C      +NH V AVGYG  + GI YW++KNSWG  WG +GYF ++R     +
Sbjct:   265 GIYSSTSCHKTPDKVNHAVLAVGYG-EKNGIPYWIVKNSWGPKWGMNGYFLIERG----K 319

Query:   320 GQCGIAMFASFPV 332
               CG+A  AS+P+
Sbjct:   320 NMCGLAACASYPI 332


>FB|FBgn0260462 [details] [associations]
            symbol:CG12163 species:7227 "Drosophila melanogaster"
            [GO:0035071 "salivary gland cell autophagic cell death"
            evidence=IEP] [GO:0048102 "autophagic cell death" evidence=IEP]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0004869 "cysteine-type
            endopeptidase inhibitor activity" evidence=IEA] [GO:0045169
            "fusome" evidence=IDA] [GO:0035220 "wing disc development"
            evidence=IGI] [GO:0022416 "chaeta development" evidence=IGI]
            InterPro:IPR000010 InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00043 SMART:SM00645
            InterPro:IPR000169 EMBL:AE014297 GO:GO:0004869 eggNOG:COG4870
            KO:K01373 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0022416 GO:GO:0035220 GO:GO:0035071
            GO:GO:0045169 GeneTree:ENSGT00660000095458 EMBL:AY121614
            EMBL:BT003231 RefSeq:NP_649521.1 RefSeq:NP_730901.1
            RefSeq:NP_730902.2 UniGene:Dm.7315 ProteinModelPortal:Q9VN93
            SMR:Q9VN93 DIP:DIP-17491N IntAct:Q9VN93 MINT:MINT-763966
            STRING:Q9VN93 MEROPS:C01.A27 PaxDb:Q9VN93
            EnsemblMetazoa:FBtr0078823 GeneID:40628 KEGG:dme:Dmel_CG12163
            UCSC:CG12163-RA FlyBase:FBgn0260462 InParanoid:Q9VN93 OMA:GPRWGEQ
            OrthoDB:EOG4CC2G9 PhylomeDB:Q9VN93 GenomeRNAi:40628 NextBio:819744
            Bgee:Q9VN93 GermOnline:CG12163 Uniprot:Q9VN93
        Length = 614

 Score = 465 (168.7 bits), Expect = 3.9e-44, P = 3.9e-44
 Identities = 108/325 (33%), Positives = 170/325 (52%)

Query:    25 FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
             FD+  +   F +++ ++GR Y  +AE   R  IF+ NL  +E  N   +G+  Y +   +
Sbjct:   301 FDK--VDHLFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGI--TE 356

Query:    85 FADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC 144
             FAD+T  E+   +TG    D + +   +         ++P   +W +K AVT VK QG C
Sbjct:   357 FADMTSSEY-KERTGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSC 415

Query:   145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                        +EG+ A+K   L   SEQ+L+DC T D+   C GG MD+A+K I    G
Sbjct:   416 GSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDS--ACNGGLMDNAYKAIKDIGG 473

Query:   198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK-AVANQPVSVAIDAS 256
             +  +A Y Y+      C   +   H  Q+  + D+P  +E ++ +  +AN P+S+ I+A+
Sbjct:   474 LEYEAEYPYKAKKNQ-CHFNRTLSHV-QVAGFVDLPKGNETAMQEWLLANGPISIGINAN 531

Query:   257 ALQFYSGGV---FNGYC-ETFLNHGVTAVGYGTSE-----EGIKYWLIKNSWGQDWGEDG 307
             A+QFY GGV   +   C +  L+HGV  VGYG S+     + + YW++KNSWG  WGE G
Sbjct:   532 AMQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQG 591

Query:   308 YFRLQRDIDQPQGQCGIAMFASFPV 332
             Y+R+ R        CG++  A+  V
Sbjct:   592 YYRVYRG----DNTCGVSEMATSAV 612


>RGD|1562210 [details] [associations]
            symbol:MGC114246 "similar to cathepsin R" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1562210 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 EMBL:CH474032 MEROPS:C01.042 KO:K09601
            OMA:ASHESFK OrthoDB:EOG4ZCT6D EMBL:BC091563 IPI:IPI00555186
            RefSeq:NP_001017509.1 UniGene:Rn.198321 SMR:Q5BJA0
            Ensembl:ENSRNOT00000061470 GeneID:498688 KEGG:rno:498688
            UCSC:RGD:1562210 InParanoid:Q5BJA0 NextBio:700535
            Genevestigator:Q5BJA0 Uniprot:Q5BJA0
        Length = 334

 Score = 465 (168.7 bits), Expect = 3.9e-44, P = 3.9e-44
 Identities = 110/334 (32%), Positives = 175/334 (52%)

Query:    16 CASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIG 74
             C   A+     + S+  ++++WK +Y ++Y    E  +R  ++++NL  ++  N    +G
Sbjct:    11 CLGVASGAPILDPSLDAEWQEWKKKYDKSYSLEEEELRR-AVWEENLKMIKLHNGENGLG 69

Query:    75 NRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS--SQVPPSVNWIEK 132
                +T+ +N+F D T +EF      F +  H       G   + ++  S  P  V+W +K
Sbjct:    70 KNGFTMEINEFGDTTGEEFRKMMVEFPVQTHRE-----GKSIMKRAAGSIFPKFVDWRKK 124

Query:   133 GAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFM 185
             G VTPV+ QG C          A+E     +  +L+ LS Q LVDC+    NNGC GG  
Sbjct:   125 GYVTPVRRQGNCNACWAFSVTGAIEAQTIWQSGKLIPLSVQNLVDCSKPQGNNGCLGGDT 184

Query:   186 DDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA 245
              +AF+Y++ N G+ ++A Y YEG   G C     ++ +A+IT +  +P + E+ L+ AVA
Sbjct:   185 YNAFQYVLHNGGLQSEATYPYEGKD-GPC-RYNPKNSSAEITGFVSLPES-EDILMVAVA 241

Query:   246 N-QPVSVAIDAS--ALQFYSGGVFNG-YCET-FLNHGVTAVGYG---TSEEGIKYWLIKN 297
                P+S  IDAS  + +FY  G+++   C +  + HGV  VGYG       G  YWLIKN
Sbjct:   242 TIGPISAGIDASHESFKFYKKGIYHEPNCSSNSVTHGVLVVGYGFKGNDTGGDHYWLIKN 301

Query:   298 SWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
             SWG+ WG  GY ++ +D       C IA +A +P
Sbjct:   302 SWGKQWGIRGYMKITKD---KNNHCAIASYAHYP 332


>MGI|MGI:107285 [details] [associations]
            symbol:Ctsh "cathepsin H" species:10090 "Mus musculus"
            [GO:0001520 "outer dense fiber" evidence=ISO] [GO:0001669
            "acrosomal vesicle" evidence=ISO] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=IGI] [GO:0002764 "immune response-regulating
            signaling pathway" evidence=ISO] [GO:0004175 "endopeptidase
            activity" evidence=ISO;IMP] [GO:0004177 "aminopeptidase activity"
            evidence=ISO] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISO;IDA] [GO:0004252 "serine-type endopeptidase activity"
            evidence=IMP] [GO:0005615 "extracellular space" evidence=ISO]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0005829 "cytosol"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=ISO;IMP]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0008284
            "positive regulation of cell proliferation" evidence=IMP]
            [GO:0010628 "positive regulation of gene expression" evidence=ISO]
            [GO:0010634 "positive regulation of epithelial cell migration"
            evidence=IMP] [GO:0010813 "neuropeptide catabolic process"
            evidence=ISO] [GO:0010815 "bradykinin catabolic process"
            evidence=ISO] [GO:0010952 "positive regulation of peptidase
            activity" evidence=IGI;ISO] [GO:0016505 "apoptotic protease
            activator activity" evidence=IGI;ISO] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0030108 "HLA-A specific activating MHC
            class I receptor activity" evidence=ISO] [GO:0030335 "positive
            regulation of cell migration" evidence=ISO] [GO:0030984 "kininogen
            binding" evidence=ISO] [GO:0031638 "zymogen activation"
            evidence=ISO;IMP] [GO:0031648 "protein destabilization"
            evidence=ISO;IMP] [GO:0032403 "protein complex binding"
            evidence=ISO] [GO:0032526 "response to retinoic acid" evidence=IDA]
            [GO:0033619 "membrane protein proteolysis" evidence=ISO;IMP]
            [GO:0035085 "cilium axoneme" evidence=ISO] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IMP] [GO:0043129
            "surfactant homeostasis" evidence=ISO] [GO:0043621 "protein
            self-association" evidence=ISO] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IMP] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=IMP]
            [GO:0070324 "thyroid hormone binding" evidence=ISO] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISO] [GO:0097208 "alveolar
            lamellar body" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:107285 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 EMBL:CH466560 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            HOVERGEN:HBG011513 GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT OrthoDB:EOG4W9J43
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 BRENDA:3.4.22.16
            EMBL:U06119 EMBL:AK149949 EMBL:AK150583 EMBL:AK157376 EMBL:AK160026
            EMBL:Y18464 IPI:IPI00118987 RefSeq:NP_031827.2 UniGene:Mm.2277
            ProteinModelPortal:P49935 SMR:P49935 STRING:P49935 MEROPS:I29.003
            PhosphoSite:P49935 PaxDb:P49935 PRIDE:P49935
            Ensembl:ENSMUST00000034915 GeneID:13036 KEGG:mmu:13036
            InParanoid:Q3UCD6 ChEMBL:CHEMBL1949491 NextBio:282920 Bgee:P49935
            CleanEx:MM_CTSH Genevestigator:P49935 GermOnline:ENSMUSG00000032359
            Uniprot:P49935
        Length = 333

 Score = 464 (168.4 bits), Expect = 5.0e-44, P = 5.0e-44
 Identities = 111/313 (35%), Positives = 172/313 (54%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F+ W  Q+ +TY  S E + R ++F +N   ++  N     N ++ + LN+F+D++  E 
Sbjct:    33 FKSWMKQHQKTYS-SVEYNHRLQMFANNWRKIQAHNQR---NHTFKMALNQFSDMSFAE- 87

Query:    94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKG-AVTPVKYQGQCA------- 145
             I  +  +    + S+ K+N   +L  +   P S++W +KG  V+PVK QG C        
Sbjct:    88 IKHKFLWSEPQNCSATKSN---YLRGTGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFST 144

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
               A+E   AI   +++SL+EQQLVDCA   NN+GC GG    AF+YI+ NKGI  +  Y 
Sbjct:   145 TGALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYILYNKGIMEEDSYP 204

Query:   206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA-NQPVSVAIDASA-LQFYSG 263
             Y G  +  C     +   A + N  ++  NDE ++++AVA   PVS A + +     Y  
Sbjct:   205 YIGKDSS-C-RFNPQKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTEDFLMYKS 262

Query:   264 GVFNGY-CETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
             GV++   C      +NH V AVGYG  + G+ YW++KNSWG  WGE+GYF ++R     +
Sbjct:   263 GVYSSKSCHKTPDKVNHAVLAVGYG-EQNGLLYWIVKNSWGSQWGENGYFLIERG----K 317

Query:   320 GQCGIAMFASFPV 332
               CG+A  AS+P+
Sbjct:   318 NMCGLAACASYPI 330


>UNIPROTKB|P09648 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9031 "Gallus gallus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            OrthoDB:EOG48PMKF MEROPS:C01.032 IPI:IPI00602255 PIR:S00081
            UniGene:Gga.523 ProteinModelPortal:P09648 SMR:P09648 Uniprot:P09648
        Length = 218

 Score = 463 (168.0 bits), Expect = 6.4e-44, P = 6.4e-44
 Identities = 104/221 (47%), Positives = 134/221 (60%)

Query:   124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
             P SV+W EKG VTPVK QGQC          A+EG +     +LVSLSEQ LVDC+  + 
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRPEG 61

Query:   177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
             N GC GG MD AF+Y+  N GI ++  Y Y       C   KAE +AA  T + D+P   
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDC-RYKAEYNAANDTGFVDIPQGH 120

Query:   237 EESLLKAVANQ-PVSVAIDA--SALQFYSGGVF-NGYCETF-LNHGVTAVGYGTSEEGIK 291
             E +L+KAVA+  PVSVAIDA  S+ QFY  G++    C +  L+HGV  VGYG  E G K
Sbjct:   121 ERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGF-EGGKK 179

Query:   292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             YW++KNSWG+ WG+ GY  + +D    +  CGIA  AS+P+
Sbjct:   180 YWIVKNSWGEKWGDKGYIYMAKD---RKNHCGIATAASYPL 217


>UNIPROTKB|F6X9C1 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00660000095458
            OMA:STSCHKT Ensembl:ENSCAFT00000036196 EMBL:AAEX03002388
            Uniprot:F6X9C1
        Length = 305

 Score = 463 (168.0 bits), Expect = 6.4e-44, P = 6.4e-44
 Identities = 114/313 (36%), Positives = 170/313 (54%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F+ W  Q+ + Y  S E  +R + F  N   +   N    GN ++ + LN+F+D+   E 
Sbjct:     5 FKSWAVQHQKKYS-SEEYLQRLQTFVGNWRKINAHN---AGNHTFKMGLNQFSDMNFAE- 59

Query:    94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGA-VTPVKYQGQCA------- 145
             I  +  +    + S+ K N   +L  +   PP V+W +KG  V+PVK QG C        
Sbjct:    60 IKHKYLWSEPQNCSATKGN---YLRGTGPYPPFVDWRKKGKFVSPVKNQGSCGSCWTFST 116

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
               A+E   AIK  +L+SL+EQQLVDCA N NN+GC GG    AF+YI  NKGI  +  Y 
Sbjct:   117 TGALESAIAIKSGKLLSLAEQQLVDCAQNFNNHGCQGGAPLQAFEYIRYNKGIMGEDSYP 176

Query:   206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA-NQPVSVAIDASA-LQFYSG 263
             Y+G   G C   +     A + +  ++  NDE+++++AVA   PVS A + ++    Y  
Sbjct:   177 YKGQD-GDC-KYQPSKAIAFVKDVANITINDEQAMVEAVALYNPVSFAFEVTSDFMMYRK 234

Query:   264 GVFNGY-CETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
             G+++   C      +NH V AVGYG  + GI YW++KNSWG  WG +GYF ++R     +
Sbjct:   235 GIYSSTSCHKTPDKVNHAVLAVGYG-EQNGIPYWIVKNSWGPQWGMNGYFLMERG----K 289

Query:   320 GQCGIAMFASFPV 332
               CG+A  AS+P+
Sbjct:   290 NMCGLAACASYPI 302


>UNIPROTKB|P83443 [details] [associations]
            symbol:P83443 "Macrodontain-1" species:203992 "Pseudananas
            sagenarius" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0006508 "proteolysis" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197
            ProteinModelPortal:P83443 SMR:P83443 MEROPS:C01.028 Uniprot:P83443
        Length = 213

 Score = 462 (167.7 bits), Expect = 8.1e-44, P = 8.1e-44
 Identities = 95/218 (43%), Positives = 131/218 (60%)

Query:   123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
             VP S++W + GAV  VK QG C       A+A VEGI  I+   LV LSEQ+++DCA + 
Sbjct:     2 VPQSIDWRDYGAVNEVKNQGPCGGCWAFAAIATVEGIYKIRKGNLVYLSEQEVLDCAVS- 60

Query:   176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
                GC GG+++ A+ +II N G+T D  Y Y     G C++     ++A IT Y  V  N
Sbjct:    61 --YGCKGGWVNRAYDFIISNNGVTTDENYPYRAYQ-GTCNA-NYFPNSAYITGYSYVRRN 116

Query:   236 DEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
             DE  ++ AV+NQP++  IDAS    Q+Y GGV++G C   LNH +T +GYG       YW
Sbjct:   117 DESHMMYAVSNQPIAALIDASGDNFQYYKGGVYSGPCGFSLNHAITIIGYGRDS----YW 172

Query:   294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
             +++NSWG  WG+ GY R++RD+    G CGIAM   FP
Sbjct:   173 IVRNSWGSSWGQGGYVRIRRDVSHSGGVCGIAMSPLFP 210


>UNIPROTKB|Q24940 [details] [associations]
            symbol:Cat-1 "Cathepsin L-like proteinase" species:6192
            "Fasciola hepatica" [GO:0004175 "endopeptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005576 "extracellular region" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005576 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0004197 EMBL:L33771 PIR:S43991 PDB:2O6X
            PDBsum:2O6X ProteinModelPortal:Q24940 SMR:Q24940 MEROPS:C01.033
            EvolutionaryTrace:Q24940 Uniprot:Q24940
        Length = 326

 Score = 462 (167.7 bits), Expect = 8.1e-44, P = 8.1e-44
 Identities = 117/318 (36%), Positives = 169/318 (53%)

Query:    28 GSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFA 86
             GS  + + QWK  Y + Y   A++  R  I++ N+  ++  N    +G  +YTL LN+F 
Sbjct:    15 GSNDDLWHQWKRMYNKEYN-GADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFT 73

Query:    87 DLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA- 145
             D+T +EF A     +MS  +S + ++G P+   +  VP  ++W E G VT VK QG C  
Sbjct:    74 DMTFEEFKAKYLT-EMS-RASDILSHGVPYEANNRAVPDKIDWRESGYVTEVKDQGNCGS 131

Query:   146 ------VAAVEGINAIKINRL-VSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                      +EG   +K  R  +S SEQQLVDC+    NNGC GG M++A++Y+ Q  G+
Sbjct:   132 CWAFSTTGTMEG-QYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQF-GL 189

Query:   199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAV-ANQPVSVAIDA-S 256
               ++ Y Y  +  G C   K +   A++T Y  V    E  L   V A +P +VA+D  S
Sbjct:   190 ETESSYPYTAVE-GQCRYNK-QLGVAKVTGYYTVHSGSEVELKNLVGARRPAAVAVDVES 247

Query:   257 ALQFYSGGVFNGY-CETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
                 Y  G++    C    +NH V AVGYGT + G  YW++KNSWG  WGE GY R+ R+
Sbjct:   248 DFMMYRSGIYQSQTCSPLRVNHAVLAVGYGT-QGGTDYWIVKNSWGTYWGERGYIRMARN 306

Query:   315 IDQPQGQCGIAMFASFPV 332
                    CGIA  AS P+
Sbjct:   307 RGN---MCGIASLASLPM 321


>MGI|MGI:1861723 [details] [associations]
            symbol:Ctsr "cathepsin R" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=ISA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=ISA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0030163 "protein
            catabolic process" evidence=ISA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1861723 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0030163
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF245399
            EMBL:AY014778 EMBL:AK014432 EMBL:AK005429 IPI:IPI00120321
            RefSeq:NP_064680.1 UniGene:Mm.315715 ProteinModelPortal:Q9JIA9
            SMR:Q9JIA9 MEROPS:C01.042 PRIDE:Q9JIA9 Ensembl:ENSMUST00000021889
            GeneID:56835 KEGG:mmu:56835 CTD:56835 InParanoid:Q9JIA9 KO:K09601
            OMA:ASHESFK OrthoDB:EOG4ZCT6D NextBio:313379 Bgee:Q9JIA9
            CleanEx:MM_CTSR Genevestigator:Q9JIA9 GermOnline:ENSMUSG00000055679
            Uniprot:Q9JIA9
        Length = 334

 Score = 461 (167.3 bits), Expect = 1.0e-43, P = 1.0e-43
 Identities = 112/323 (34%), Positives = 171/323 (52%)

Query:    27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKF 85
             + S+  +++ WK +Y ++Y    E  KR  ++++ L  ++  N   ++G   +T+++N+F
Sbjct:    22 DSSLDAEWQDWKIKYNKSYSLKEEKLKRV-VWEEKLKMIKLHNRENSLGKNGFTMKMNEF 80

Query:    86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS--SQVPPSVNWIEKGAVTPVKYQGQ 143
              D T +EF        +  H       G   + +   S +P  V+W +KG VTPV+ QG 
Sbjct:    81 GDQTDEEFRKMMIEISVWTHRE-----GKSIMKREAGSILPKFVDWRKKGYVTPVRRQGD 135

Query:   144 C----AVAAVEGINAIKI---NRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
             C    A A    I A  I    +L  LS Q LVDC+    NNGC GG   +AF+Y++ N 
Sbjct:   136 CDACWAFAVTGAIEAQAIWQTGKLTPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVLHNG 195

Query:   197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA 255
             G+ ++A Y YEG   G C     ++  A+IT +  +P + E+ L+ AVA   P++  IDA
Sbjct:   196 GLESEATYPYEGKD-GPC-RYNPKNSKAEITGFVSLPQS-EDILMAAVATIGPITAGIDA 252

Query:   256 SALQF--YSGGVFNG-YCET-FLNHGVTAVGYG---TSEEGIKYWLIKNSWGQDWGEDGY 308
             S   F  Y GG+++   C +  + HGV  VGYG      +G  YWLIKNSWG+ WG  GY
Sbjct:   253 SHESFKNYKGGIYHEPNCSSDTVTHGVLVVGYGFKGIETDGNHYWLIKNSWGKRWGIRGY 312

Query:   309 FRLQRDIDQPQGQCGIAMFASFP 331
              +L +D       CGIA +A +P
Sbjct:   313 MKLAKD---KNNHCGIASYAHYP 332


>UNIPROTKB|F7BJD8 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9796 "Equus
            caballus" [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=ISS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0004252
            "serine-type endopeptidase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005829 "cytosol" evidence=ISS] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0010815 "bradykinin catabolic
            process" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0016505 "apoptotic protease
            activator activity" evidence=ISS] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=ISS] [GO:0030335
            "positive regulation of cell migration" evidence=ISS] [GO:0031638
            "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0032526 "response to retinoic
            acid" evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            GO:GO:0043066 GO:GO:0005615 GO:GO:0008284 GO:GO:0070371
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458
            OMA:STSCHKT GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129
            Ensembl:ENSECAT00000013967 Uniprot:F7BJD8
        Length = 305

 Score = 459 (166.6 bits), Expect = 1.7e-43, P = 1.7e-43
 Identities = 113/313 (36%), Positives = 167/313 (53%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F+ W  Q+ + Y  S E   R + F  N   +   N    GN ++ + LN+F+ +   E 
Sbjct:     5 FKSWMVQHQKKYS-SEEYHHRLQTFVSNWRKINAHNT---GNHTFRMGLNQFSAMNFAE- 59

Query:    94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGA-VTPVKYQGQCA------- 145
             +  +  +    + S+ K N   +L  +   PPSV+W +KG  V+PVK QG C        
Sbjct:    60 LKHKYLWSEPQNCSATKGN---YLRGAGPYPPSVDWRKKGNFVSPVKNQGGCGSCWTFST 116

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
               A+E   AI   +L+SL+EQQLVDCA N NN+GC GG    AF+YI  NKGI  +  Y 
Sbjct:   117 TGALESAVAIASGKLLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYP 176

Query:   206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA-NQPVSVAIDASA-LQFYSG 263
             Y+G   G C   +     A + +  ++  NDE+++++AVA   PVS A + +     Y  
Sbjct:   177 YKGQD-GDC-KFQPNKAIAFVKDVANITLNDEKAMVEAVALYNPVSFAFEVTEDFMMYRK 234

Query:   264 GVFNGY-CETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
             G+++   C      +NH V AVGYG  E GI YW++KNSWG  WG +GYF ++R     +
Sbjct:   235 GIYSSTSCHKTPDKVNHAVLAVGYG-EENGIPYWIVKNSWGPHWGMNGYFLIERG----K 289

Query:   320 GQCGIAMFASFPV 332
               CG+A  AS+P+
Sbjct:   290 NMCGLAACASYPI 302


>UNIPROTKB|G1M0X4 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9646
            "Ailuropoda melanoleuca" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 EMBL:ACTA01057330 EMBL:ACTA01065330
            Ensembl:ENSAMET00000013529 Uniprot:G1M0X4
        Length = 337

 Score = 456 (165.6 bits), Expect = 3.5e-43, P = 3.5e-43
 Identities = 112/313 (35%), Positives = 167/313 (53%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F+ W  Q+ + Y  S E   R   F  N   +   N    GN ++ + LN+F+D++  E 
Sbjct:    37 FKSWMVQHQKKYS-SEEYQHRLRTFVGNWRKINAHN---AGNHTFKMGLNQFSDMSFAE- 91

Query:    94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGA-VTPVKYQGQCA------- 145
             I  +  +    + S+ K N   +L  +   PP V+W +KG  V+PVK QG C        
Sbjct:    92 IKRKYLWSEPQNCSATKGN---YLRGTGPYPPFVDWRKKGKFVSPVKNQGGCGSCWTFST 148

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
               A+E   AIK  +L+SL+EQQLVDCA + NN+GC GG    AF+YI  N+GI  +  Y 
Sbjct:   149 TGALESAIAIKTGKLLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYIRYNRGIMGEDSYP 208

Query:   206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASA-LQFYSG 263
             Y+G   G C   +     A + +  ++  NDE+++++AVA   PVS A + +     Y  
Sbjct:   209 YKGQD-GDC-KFQPSKAIAFVKDVANITINDEQAMVEAVALFNPVSFAFEVTGDFMMYRK 266

Query:   264 GVFNGY-CETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
             GV++   C      +NH V AVGYG  + G+ YW++KNSWG  WG  GYF ++R     +
Sbjct:   267 GVYSSTSCHKTPDKVNHAVLAVGYG-EQNGVPYWIVKNSWGPQWGMHGYFLIERG----K 321

Query:   320 GQCGIAMFASFPV 332
               CG+A  AS+P+
Sbjct:   322 NMCGLAACASYPI 334


>MGI|MGI:1927229 [details] [associations]
            symbol:Ctsm "cathepsin M" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008152 "metabolic process" evidence=ISS] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015645 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1927229 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF202528
            EMBL:AY014777 EMBL:AY057446 EMBL:AK005550 EMBL:AK005428
            IPI:IPI00131133 RefSeq:NP_071721.2 UniGene:Mm.279933
            ProteinModelPortal:Q9JL96 SMR:Q9JL96 STRING:Q9JL96 MEROPS:C01.023
            PRIDE:Q9JL96 DNASU:64139 Ensembl:ENSMUST00000099451 GeneID:64139
            KEGG:mmu:64139 UCSC:uc007qwj.1 CTD:64139 InParanoid:Q9JL96
            KO:K09600 OrthoDB:EOG4TTGKR NextBio:319931 Bgee:Q9JL96
            CleanEx:MM_CTSM Genevestigator:Q9JL96 GermOnline:ENSMUSG00000074484
            GermOnline:ENSMUSG00000074871 PANTHER:PTHR12411:SF58 Uniprot:Q9JL96
        Length = 333

 Score = 456 (165.6 bits), Expect = 3.5e-43, P = 3.5e-43
 Identities = 111/315 (35%), Positives = 168/315 (53%)

Query:    33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADLTPQ 91
             ++++WK +YG+ Y    E  KR  +++DN+  ++  N    +G   +T+ +N F D+T +
Sbjct:    28 EWQKWKIKYGKAYSLEEEGQKR-AVWEDNMKKIKLHNGENGLGKHGFTMEMNAFGDMTLE 86

Query:    92 EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------- 144
             EF        +     ++K   +     S  +P  +NW ++G VTPV+ QG+C       
Sbjct:    87 EFRKVMIEIPVP----TVKKGKSVQKRLSVNLPKFINWKKRGYVTPVQTQGRCNSCWAFS 142

Query:   145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
                A+EG    K  +L+ LS Q LVDC+    N GCY G    A  Y+++N G+ ++A Y
Sbjct:   143 VTGAIEGQMFRKTGQLIPLSVQNLVDCSRPQGNWGCYLGNTYLALHYVMENGGLESEATY 202

Query:   205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--SALQFY 261
              YE    G C     E+  A IT +E VP N E++L+ AVA+  P+SVAIDA  ++  FY
Sbjct:   203 PYEEKD-GSC-RYSPENSTANITGFEFVPKN-EDALMNAVASIGPISVAIDARHASFLFY 259

Query:   262 SGGVF-NGYCET-FLNHGVTAVGYG-TSEE--GIKYWLIKNSWGQDWGEDGYFRLQRDID 316
               G++    C +  + H +  VGYG T  E  G KYWL+KNS G  WG  GY ++ RD  
Sbjct:   260 KRGIYYEPNCSSCVVTHSMLLVGYGFTGRESDGRKYWLVKNSMGTQWGNKGYMKISRD-- 317

Query:   317 QPQGQCGIAMFASFP 331
                  CGIA +A +P
Sbjct:   318 -KGNHCGIATYALYP 331


>TAIR|locus:2175088 [details] [associations]
            symbol:ALP "aleurain-like protease" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0005829 "cytosol"
            evidence=RCA] [GO:0006096 "glycolysis" evidence=RCA] [GO:0006816
            "calcium ion transport" evidence=RCA] [GO:0006833 "water transport"
            evidence=RCA] [GO:0006972 "hyperosmotic response" evidence=RCA]
            [GO:0007030 "Golgi organization" evidence=RCA] [GO:0009266
            "response to temperature stimulus" evidence=RCA] [GO:0009651
            "response to salt stress" evidence=RCA] [GO:0009750 "response to
            fructose stimulus" evidence=RCA] [GO:0042744 "hydrogen peroxide
            catabolic process" evidence=RCA] [GO:0046686 "response to cadmium
            ion" evidence=RCA] [GO:0007568 "aging" evidence=IEP]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002688 GO:GO:0005773
            GO:GO:0007568 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AB011483 KO:K01366
            ProtClustDB:CLSN2689015 UniGene:At.25414 IPI:IPI00846287
            RefSeq:NP_001078774.1 ProteinModelPortal:A8MQZ1 SMR:A8MQZ1
            STRING:A8MQZ1 PRIDE:A8MQZ1 EnsemblPlants:AT5G60360.3 GeneID:836158
            KEGG:ath:AT5G60360 OMA:CGSTPMD Genevestigator:A8MQZ1 Uniprot:A8MQZ1
        Length = 361

 Score = 456 (165.6 bits), Expect = 3.5e-43, P = 3.5e-43
 Identities = 111/292 (38%), Positives = 162/292 (55%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F ++  +YG+ Y+   E   RF IFK+NL  +   N   +   SY L +N+FADLT QEF
Sbjct:    59 FARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGL---SYKLGVNQFADLTWQEF 115

Query:    94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------V 146
               ++ G    + S++LK  G+  + +++ +P + +W E G V+PVK QG C         
Sbjct:   116 QRTKLG-AAQNCSATLK--GSHKVTEAA-LPETKDWREDGIVSPVKDQGGCGSCWTFSTT 171

Query:   147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
              A+E        + +SLSEQQLVDCA   NN GC GG    AF+YI  N G+  +  Y Y
Sbjct:   172 GALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPY 231

Query:   207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA-SALQFYSGG 264
              G     C    AE+   Q+ N  ++    E+ L  AV   +PVS+A +   + + Y  G
Sbjct:   232 TGKDE-TC-KFSAENVGVQVLNSVNITLGAEDELKHAVGLVRPVSIAFEVIHSFRLYKSG 289

Query:   265 VF-NGYCETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
             V+ + +C +    +NH V AVGYG  E+G+ YWLIKNSWG DWG+ GYF+++
Sbjct:   290 VYTDSHCGSTPMDVNHAVLAVGYGV-EDGVPYWLIKNSWGADWGDKGYFKME 340


>UNIPROTKB|Q10991 [details] [associations]
            symbol:CTSL "Cathepsin L1" species:9940 "Ovis aries"
            [GO:0005515 "protein binding" evidence=IPI] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            MEROPS:C01.032 ProteinModelPortal:Q10991 SMR:Q10991 Uniprot:Q10991
        Length = 217

 Score = 455 (165.2 bits), Expect = 4.5e-43, P = 4.5e-43
 Identities = 102/221 (46%), Positives = 135/221 (61%)

Query:   123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
             VP SV+W +KG VTPVK QGQC       A  A+EG    K  +LVSLSEQ LVD +   
Sbjct:     1 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQ 60

Query:   176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
              N GC GG MD+AF+YI +N G+ ++  Y YE   T  C+  K E  AA+ T + D+P  
Sbjct:    61 GNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTS-CN-YKPEYSAAKDTGFVDIPQR 118

Query:   236 DEESLLKAVANQ-PVSVAIDA--SALQFYSGGVF-NGYCETF-LNHGVTAVGYGTSEEGI 290
              E++L+KAVA   P+SVAIDA  S+ QFY  G++ +  C +  L+HGV  VGYG      
Sbjct:   119 -EKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTNN 177

Query:   291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
             K+W++KNSWG +WG  GY ++ +D       CGIA  AS+P
Sbjct:   178 KFWIVKNSWGPEWGNKGYVKMAKD---QNNHCGIATAASYP 215


>RGD|1588248 [details] [associations]
            symbol:Cts8 "cathepsin 8" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1588248 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 IPI:IPI00765053
            RefSeq:NP_001121688.1 UniGene:Rn.220599 Ensembl:ENSRNOT00000061486
            GeneID:680718 KEGG:rno:680718 UCSC:RGD:1588248 CTD:56094
            OMA:DSEWQEW OrthoDB:EOG4JT07C NextBio:719350 Uniprot:D3ZP54
        Length = 333

 Score = 455 (165.2 bits), Expect = 4.5e-43, P = 4.5e-43
 Identities = 118/339 (34%), Positives = 178/339 (52%)

Query:     9 VLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF 68
             V++++  C   A      + S+  ++++WK +Y + Y    E  KR  ++++N+  V++ 
Sbjct:     4 VVLLAILCLGVARATQPSDPSLDSEWQEWKTKYEKNYSLEEEGQKR-AVWEENMKVVKQH 62

Query:    69 N-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSV 127
             N       +++T+ LN FAD+T +EF    T   + +       +   F Y    +P  V
Sbjct:    63 NIEYDQEKKNFTMELNAFADMTGEEFRKMMTNIPVQNLRKKKSIHQPIFRY----LPKFV 118

Query:   128 NWIEKGAVTPVKYQGQC------AVA-AVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
             +W  +G VT VK QG C      +VA A+EG    K  RLVSLS Q LVDC+  + N+GC
Sbjct:   119 DWRRRGYVTSVKNQGTCNSCWAFSVAGAIEGQMFRKTGRLVSLSPQNLVDCSRPEGNHGC 178

Query:   181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL 240
             + G    A KY+  N G+  ++ Y YEG   G C  +     AA++T +  V    EE+L
Sbjct:   179 HMGSTLYALKYVWSNGGLEAESTYPYEGKE-GPCRYLPRRS-AARVTGFSTVA-RSEEAL 235

Query:   241 LKAVAN-QPVSVAIDAS--ALQFYSGGVF-NGYCETF-LNHGVTAVGYG---TSEEGIKY 292
             + AVA   P+SV IDAS  + +FY  G++    C +  +NH V  VGYG      +G KY
Sbjct:   236 MHAVATIGPISVGIDASHVSFRFYRRGIYYEPRCSSNRINHSVLVVGYGYEGRESDGRKY 295

Query:   293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
             WLIKNS G  WG +GY +L R  +     CGIA +  +P
Sbjct:   296 WLIKNSHGVGWGMNGYMKLARGWNN---HCGIATYGFYP 331


>RGD|631421 [details] [associations]
            symbol:Ctsq "cathepsin Q" species:10116 "Rattus norvegicus"
            [GO:0005764 "lysosome" evidence=NAS] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 RGD:631421 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 UniGene:Rn.34875 EMBL:AF187323 IPI:IPI00214897
            PIR:JC7183 RefSeq:NP_640355.1 UniGene:Rn.35820
            ProteinModelPortal:Q9QZE3 SMR:Q9QZE3 STRING:Q9QZE3 MEROPS:C01.039
            PRIDE:Q9QZE3 Ensembl:ENSRNOT00000024208 GeneID:246147
            KEGG:rno:246147 UCSC:RGD:631421 CTD:104002 InParanoid:Q9QZE3
            OMA:ESEDVLM OrthoDB:EOG4HHP48 NextBio:623425 Genevestigator:Q9QZE3
            GermOnline:ENSRNOG00000017946 Uniprot:Q9QZE3
        Length = 343

 Score = 455 (165.2 bits), Expect = 4.5e-43, P = 4.5e-43
 Identities = 107/325 (32%), Positives = 176/325 (54%)

Query:    29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
             S+  ++++WK +Y + Y    E  KR  ++++N+  +E  N   ++G  +YT+ +N FAD
Sbjct:    24 SLDVQWQEWKIKYEKLYSPEEEVLKRV-VWEENVKKIELHNRENSLGKNTYTMEINDFAD 82

Query:    88 LTPQEFIASQTGFKMSDHSSSLK----ANGTPFLYK---SSQVPPSVNWIEKGAVTPVKY 140
             +T +EF     GF++  H++  +    A G+ F         +P  V+W  +G VT V+ 
Sbjct:    83 MTDEEFKDMIIGFQLPVHNTEKRLWKRALGSFFPNSWNWRDALPKFVDWRNEGYVTRVRK 142

Query:   141 QGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
             QG C+         A+EG    K  +L+ LS Q L+DC+    N GC  G   +AF+Y++
Sbjct:   143 QGGCSSCWAFPVTGAIEGQMFKKTGKLIPLSVQNLIDCSKPQGNRGCLWGNTYNAFQYVL 202

Query:   194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVA 252
              N G+  +A Y YE    G+C     ++ +A+IT +  V P  E+ L+ AVA + P++  
Sbjct:   203 HNGGLEAEATYPYE-RKEGVC-RYNPKNSSAKITGFV-VLPESEDVLMDAVATKGPIATG 259

Query:   253 ID--ASALQFYSGGVFNG-YCETFLNHGVTAVGYG---TSEEGIKYWLIKNSWGQDWGED 306
             +   +S+ +FY  GV++   C +++NH V  VGYG      +G  YWLIKNSWG+ WG  
Sbjct:   260 VHVISSSFRFYQKGVYHEPKCSSYVNHAVLVVGYGFEGNETDGNNYWLIKNSWGKRWGLR 319

Query:   307 GYFRLQRDIDQPQGQCGIAMFASFP 331
             GY ++ +D +     C IA  A +P
Sbjct:   320 GYMKIAKDRNN---HCAIASLAQYP 341


>FB|FBgn0034229 [details] [associations]
            symbol:CG4847 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0032504
            "multicellular organism reproduction" evidence=IEP] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0005615 "extracellular space"
            evidence=ISM;IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:AE013599 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 GO:GO:0032504 GeneTree:ENSGT00560000076599
            KO:K01371 EMBL:BT099507 RefSeq:NP_725686.1 UniGene:Dm.4677
            SMR:A1ZAU4 IntAct:A1ZAU4 MEROPS:C01.A28 EnsemblMetazoa:FBtr0086935
            GeneID:36973 KEGG:dme:Dmel_CG4847 UCSC:CG4847-RB
            FlyBase:FBgn0034229 InParanoid:A1ZAU4 OMA:GGFQEYA OrthoDB:EOG4J9KFC
            ChiTaRS:CG4847 GenomeRNAi:36973 NextBio:801302 Uniprot:A1ZAU4
        Length = 420

 Score = 452 (164.2 bits), Expect = 9.3e-43, P = 9.3e-43
 Identities = 115/320 (35%), Positives = 163/320 (50%)

Query:    29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
             S  + F  + +Q G+TY  +A+ +     F      VE  N A A G  ++   +N FAD
Sbjct:   107 SNVQDFGDFLSQSGKTYLSAADRALHEGAFASTKNLVEAGNAAFAQGVHTFKQAVNAFAD 166

Query:    88 LTPQEFIASQTGFKMSDHSSSLKANGTPFL-YKSSQVPPSVNWIEKGAVTPVKYQGQCA- 145
             LT  EF++  TG K S  + +  A     +   +  +P + +W E G VTPVK+QG C  
Sbjct:   167 LTHSEFLSQLTGLKRSPEAKARAAASLKLVNLPAKPIPDAFDWREHGGVTPVKFQGTCGS 226

Query:   146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNN--NGCYGGFMDDAFKYIIQ-NK 196
                     A+EG    K   L +LSEQ LVDC   ++   NGC GGF + AF +I +  K
Sbjct:   227 CWAFATTGAIEGHTFRKTGSLPNLSEQNLVDCGPVEDFGLNGCDGGFQEAAFCFIDEVQK 286

Query:   197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDA 255
             G++ +  Y Y   + G C         A +  +  +PP DEE L K VA   PV+ +++ 
Sbjct:   287 GVSQEGAYPYID-NKGTC-KYDGSKSGATLQGFAAIPPKDEEQLKKVVATLGPVACSVNG 344

Query:   256 -SALQFYSGGVFNG-YCETFL-NHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
                L+ Y+GG++N   C     NH +  VGYG SE+G  YW++KNSW   WGE GYFRL 
Sbjct:   345 LETLKNYAGGIYNDDECNKGEPNHSILVVGYG-SEKGQDYWIVKNSWDDTWGEKGYFRLP 403

Query:   313 RDIDQPQGQCGIAMFASFPV 332
             R     +  C IA   S+PV
Sbjct:   404 RG----KNYCFIAEECSYPV 419


>DICTYBASE|DDB_G0278401 [details] [associations]
            symbol:cprH "cysteine proteinase 8" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0278401 EMBL:AAFI02000023
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 ProtClustDB:CLSZ2430780 RefSeq:XP_642342.1
            ProteinModelPortal:Q54Y60 MEROPS:C01.A62 EnsemblProtists:DDB0205428
            GeneID:8621547 KEGG:ddi:DDB_G0278401 InParanoid:Q54Y60 OMA:FANMENE
            Uniprot:Q54Y60
        Length = 337

 Score = 449 (163.1 bits), Expect = 1.9e-42, P = 1.9e-42
 Identities = 131/350 (37%), Positives = 181/350 (51%)

Query:    11 IISGSCASQATYRT----FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
             ++S  CA   T  T      E    + F  W     ++Y  S+E   R+ IFK N   +E
Sbjct:     3 VLSVLCALLITVATAKQELSESQYRDAFTDWMISNQKSYS-SSEFITRYNIFKTNFDYIE 61

Query:    67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPS 126
              +N+   G+ +  L LNK AD+T +E+ +   G K  D +SSL       L+ S++   +
Sbjct:    62 EWNSK--GSET-VLGLNKMADITNEEYRSLYLG-KPFD-ASSLIGTKEEILF-SNKFSST 115

Query:   127 VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIK---INRLVSLSEQQLVDCATNDN 176
             V+W +KGAVT VK Q  C       A  A EG + +     N LVSLSEQ L+DC+T   
Sbjct:   116 VDWRKKGAVTHVKNQQSCSGCWSFSATGATEGAHKLANNGTNELVSLSEQNLIDCSTPFG 175

Query:   177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
             N GC GG +  AF+YII N GI  +  Y +EG + G C   K+E+  A I++Y +V    
Sbjct:   176 NTGCNGGVITYAFEYIISNGGIDTEKSYPFEG-TDGTC-RYKSENSGATISSYVNVTFGS 233

Query:   237 EESLLKAVANQPVSVAIDAS--ALQFYSGGV-FNGYCE-TFLNHGVTAVGYGT----SEE 288
             E SL  AV   PV+ +IDAS  +  FY  G+ F   C  T L+HGV  VGYGT    S++
Sbjct:   234 ESSLESAVNVNPVACSIDASHSSFLFYKSGIYFEPACSRTNLDHGVLVVGYGTENSQSQD 293

Query:   289 GIK------YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
                      YW+ KNSWG +    GY  + +D D     CGI+  ASFP+
Sbjct:   294 SSSEPNHSNYWIAKNSWGIN----GYILMSKDRDN---MCGISTLASFPI 336


>UNIPROTKB|G3V9F8 [details] [associations]
            symbol:Ctsm "RCG24133" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015645 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 EMBL:CH474032
            PANTHER:PTHR12411:SF58 Ensembl:ENSRNOT00000045830 RGD:631420
            Uniprot:G3V9F8
        Length = 333

 Score = 447 (162.4 bits), Expect = 3.2e-42, P = 3.2e-42
 Identities = 110/316 (34%), Positives = 172/316 (54%)

Query:    33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADLTPQ 91
             ++++WK +Y +TY    E  KR  ++++N+  ++  N    +G   +T+ +N F D+T +
Sbjct:    28 EWQKWKIKYEKTYSLEEEGQKR-AVWEENMKKIKLHNGENGLGKHGFTMEMNAFGDMTIE 86

Query:    92 EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCAVA---- 147
             EF        +     ++K   +    ++  VP  +NW ++G VTPV+ QG+C V     
Sbjct:    87 EFRKLMIEIPIP----TVKKENSVQKRQAVNVPNFINWRKRGYVTPVRRQGRCNVCWAFS 142

Query:   148 ---AVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
                A+EG    K  +L+ LS Q LVDC+    N GCY G    A +Y+ +N G+ ++A Y
Sbjct:   143 VAGAIEGQMFQKTGQLIPLSVQNLVDCSRPQGNLGCYLGNTYLALQYVKENGGLESEATY 202

Query:   205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDA--SALQFY 261
              YE    G C     ++  A IT++E VP N E++L+ AVA   P+SVAIDA   +  FY
Sbjct:   203 PYEEKE-GSC-RYHPDNSTASITDFEFVPKN-EDALMNAVATLGPISVAIDARHESFLFY 259

Query:   262 SGGVFNG-YCET-FLNHGVTAVGYG-TSEE--GIKYWLIKNSWGQDWGEDGYFRLQRDID 316
               G+++   C +  + H +  VGYG   EE  G KYW++KNS G  WG  GY ++ +D  
Sbjct:   260 RNGIYHEPNCSSSVVTHAMLLVGYGFVGEESDGRKYWILKNSMGNKWGNRGYMKIAKD-- 317

Query:   317 QPQGQ-CGIAMFASFP 331
               QG  CGIA +A +P
Sbjct:   318 --QGNHCGIATYALYP 331


>UNIPROTKB|P83654 [details] [associations]
            symbol:P83654 "Ervatamin-C" species:52861 "Tabernaemontana
            divaricata" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0006508 "proteolysis" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005576 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197 PDB:1O0E PDB:2PNS
            PDBsum:1O0E PDBsum:2PNS MEROPS:C01.116 EvolutionaryTrace:P83654
            Uniprot:P83654
        Length = 208

 Score = 442 (160.7 bits), Expect = 1.1e-41, P = 1.1e-41
 Identities = 101/218 (46%), Positives = 130/218 (59%)

Query:   123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
             +P  ++W +KGAVTPVK QG C        V+ VE IN I+   L+SLSEQ+LVDC  + 
Sbjct:     1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDC--DK 58

Query:   176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
              N+GC GG    A++YII N GI   A Y Y+ +  G C   +A      I  Y  VP  
Sbjct:    59 KNHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQ-GPC---QAASKVVSIDGYNGVPFC 114

Query:   236 DEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
             +E +L +AVA QP +VAIDAS+ QF  YS G+F+G C T LNHGVT VGY  +     YW
Sbjct:   115 NEXALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQAN-----YW 169

Query:   294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
             +++NSWG+ WGE GY R+ R      G CGIA    +P
Sbjct:   170 IVRNSWGRYWGEKGYIRMLRV--GGCGLCGIARLPYYP 205


>MGI|MGI:1860262 [details] [associations]
            symbol:Cts7 "cathepsin 7" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005576
            "extracellular region" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005768 "endosome" evidence=IEA]
            [GO:0005794 "Golgi apparatus" evidence=IEA] [GO:0006508
            "proteolysis" evidence=ISA] [GO:0007049 "cell cycle" evidence=IEA]
            [GO:0007067 "mitosis" evidence=IEA] [GO:0008152 "metabolic process"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=ISA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0051301 "cell
            division" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:1860262 GO:GO:0005634 GO:GO:0005794 GO:GO:0048471
            GO:GO:0005615 GO:GO:0051301 GO:GO:0007067 GO:GO:0005768
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GO:GO:0008233 EMBL:CH466546
            EMBL:AY014779 EMBL:CT030645 EMBL:BC064740 EMBL:AF250837
            IPI:IPI00131132 RefSeq:NP_062412.1 UniGene:Mm.3692 HSSP:O60911
            ProteinModelPortal:Q91ZF2 SMR:Q91ZF2 STRING:Q91ZF2 MEROPS:C01.016
            PRIDE:Q91ZF2 Ensembl:ENSMUST00000021892 GeneID:56092 KEGG:mmu:56092
            UCSC:uc007qwi.1 CTD:56092 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 InParanoid:Q91ZF2 OMA:ERRVIWE OrthoDB:EOG44QT2S
            NextBio:311908 Bgee:Q91ZF2 Genevestigator:Q91ZF2 Uniprot:Q91ZF2
        Length = 331

 Score = 442 (160.7 bits), Expect = 1.1e-41, P = 1.1e-41
 Identities = 112/339 (33%), Positives = 179/339 (52%)

Query:    10 LIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVER-F 68
             + +S  C   A      + ++  ++E+WK    RTY    E  +R  +++ N+  +++  
Sbjct:     5 VFLSILCLGVALAAPAPDYNLDAEWEEWKRSNDRTYSPEEEKQRR-AVWEGNVKWIKQHI 63

Query:    69 NNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKA-NGTPFLYKSSQVPPSV 127
                 +   ++T+ +N+F D+T +E        KM   SSS    NG     ++ ++PP++
Sbjct:    64 MENGLWMNNFTIEMNEFGDMTGEEM-------KMLTESSSYPLRNGKHIQKRNPKIPPTL 116

Query:   128 NWIEKGAVTPVKYQGQCAV-------AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
             +W ++G VTPV+ QG C         A +EG    K  +L+ LS Q L+DC+ +    GC
Sbjct:   117 DWRKEGYVTPVRRQGSCGACWAFSVTACIEGQLFKKTGKLIPLSVQNLMDCSVSYGTKGC 176

Query:   181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL 240
              GG   DAF+Y+  N G+  +A Y YE  +   C   + E    ++  +  VP N EE+L
Sbjct:   177 DGGRPYDAFQYVKNNGGLEAEATYPYEAKAKH-C-RYRPERSVVKVNRFFVVPRN-EEAL 233

Query:   241 LKA-VANQPVSVAIDASALQFYS--GGVFNG-YC-ETFLNHGVTAVGYGTS---EEGIKY 292
             L+A V + P++VAID S   F+S  GG+++   C +  L+HG+  VGYG      E  KY
Sbjct:   234 LQALVTHGPIAVAIDGSHASFHSYRGGIYHEPKCRKDTLDHGLLLVGYGYEGHESENRKY 293

Query:   293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
             WL+KNS G+ WGE+GY +L R        CGIA +A +P
Sbjct:   294 WLLKNSHGERWGENGYMKLPRG---QNNYCGIASYAMYP 329


>DICTYBASE|DDB_G0281605 [details] [associations]
            symbol:cfaD "peptidase C1A family protein"
            species:44689 "Dictyostelium discoideum" [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IDA] [GO:0006508 "proteolysis" evidence=IDA] [GO:0031410
            "cytoplasmic vesicle" evidence=IDA] [GO:0031288 "sorocarp
            morphogenesis" evidence=IMP] [GO:0008285 "negative regulation of
            cell proliferation" evidence=IGI;IDA] [GO:0005576 "extracellular
            region" evidence=IEA;IDA] [GO:0005515 "protein binding"
            evidence=IPI] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0281605
            GO:GO:0008285 GO:GO:0005615 GenomeReviews:CM000152_GR
            eggNOG:COG4870 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0031410 EMBL:AAFI02000042
            GO:GO:0031288 RefSeq:XP_640530.1 HSSP:P07711
            ProteinModelPortal:Q54TR1 STRING:Q54TR1 PRIDE:Q54TR1
            EnsemblProtists:DDB0229857 GeneID:8623140 KEGG:ddi:DDB_G0281605
            InParanoid:Q54TR1 OMA:PSAHEHE ProtClustDB:CLSZ2430523
            Uniprot:Q54TR1
        Length = 531

 Score = 440 (159.9 bits), Expect = 1.7e-41, P = 1.7e-41
 Identities = 112/321 (34%), Positives = 168/321 (52%)

Query:    27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
             E   +  F+++KAQY + Y    E+ +RF  FK     +   +NA     SY L +N +A
Sbjct:   218 EEQASNLFKEYKAQYNKEYSSQDEHDERFINFKAARKIIAT-HNAK--ESSYKLGMNHYA 274

Query:    87 DLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC- 144
             DL+ +EF    T  K      S+    +    +S + +P +V+W  +  VTPVK QG C 
Sbjct:   275 DLSNKEF---NTLVKPKVARPSVTGADSVHDDESLRSIPSTVDWRNQNCVTPVKDQGICG 331

Query:   145 ------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                   +  ++EG N +    LVSLSEQQLVDCA    + GC GGF   AF+Y+++   +
Sbjct:   332 SCWTFGSTGSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSL 391

Query:   199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDASA 257
               ++ Y Y  M  G+C           IT Y +V    E +L  A+A   PV++AIDAS 
Sbjct:   392 ATESNYPYL-MQNGLCRDRTVTPSGVSITGYVNVTSGSESALQNAIATTGPVAIAIDASV 450

Query:   258 --LQFYSGGVFNG-YCETFLN---HGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
                ++Y  GV+N   C+  L+   H V A+GYGT + G  Y+L+KNSW  +WG DGY  +
Sbjct:   451 DDFRYYMSGVYNNPACKNGLDDLDHEVLAIGYGTYQ-GQDYFLVKNSWSTNWGMDGYVYM 509

Query:   312 QRDIDQPQGQCGIAMFASFPV 332
              R+       CG++  A++P+
Sbjct:   510 ARN---DNNLCGVSSQATYPI 527


>ZFIN|ZDB-GENE-050208-336 [details] [associations]
            symbol:ctskl "cathepsin K, like" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-050208-336 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:BX465190
            GeneTree:ENSGT00660000095458 IPI:IPI00491185 RefSeq:XP_695425.1
            UniGene:Dr.110795 Ensembl:ENSDART00000062749 GeneID:567046
            KEGG:dre:567046 CTD:567046 NextBio:20888499 Bgee:F1QCP8
            Uniprot:F1QCP8
        Length = 349

 Score = 435 (158.2 bits), Expect = 5.9e-41, P = 5.9e-41
 Identities = 116/312 (37%), Positives = 166/312 (53%)

Query:    37 WKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIA 95
             WK ++  +Y E +E+  R  I++ N+  + + NN  + G   + + +NK+ DLT  E+  
Sbjct:    44 WKKKHEISYDEESEDVHRKTIWETNMQKIWKNNNDFSFGLSMFKMAMNKYGDLTSVEY-K 102

Query:    96 SQTGFKMSDHSSSL-KANGTPFLYKSSQV--PPSVNWIEKGAVTPVKYQGQCA------- 145
                G K+    +   K      L  +++     ++++  KG VT VK QG C        
Sbjct:   103 RLLGSKIKGTGNRKGKITSAQMLRLNAKRLGVTNIDYRAKGYVTEVKDQGYCGSCWSFST 162

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
               A+EG       RLVSLSEQQLVDC+ +    GC G +M +A+ Y+I N   ++D  Y 
Sbjct:   163 TGAIEGQMYKHTGRLVSLSEQQLVDCSRSYGTYGCSGAWMANAYDYVINNALESSDT-YP 221

Query:   206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDAS--ALQFYS 262
             Y  + T  C   K    A  I++Y  VP  +E++L  AVA   PVSVAIDA   +  FYS
Sbjct:   222 YTSVDTQPCFYEKNLAMAG-ISDYRFVPAGNEQALADAVATVGPVSVAIDADNPSFLFYS 280

Query:   263 GGVFN-GYCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
              G++    C    LNH V  VGYG SEEG  YW+IKNSWG  WGE GY R+ R+    + 
Sbjct:   281 SGIYKESNCNPNNLNHAVLVVGYG-SEEGTDYWIIKNSWGTGWGEGGYMRMIRN---GKN 336

Query:   321 QCGIAMFASFPV 332
              CGIA +A +P+
Sbjct:   337 TCGIASYALYPI 348


>ZFIN|ZDB-GENE-030131-3539 [details] [associations]
            symbol:ctsh "cathepsin H" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-030131-3539
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 KO:K01366 HOVERGEN:HBG011513
            CTD:1512 OrthoDB:EOG4W9J43 MEROPS:I29.003 HSSP:P43235 EMBL:BC067615
            IPI:IPI00506892 RefSeq:NP_997853.1 UniGene:Dr.14176
            ProteinModelPortal:Q6NWF2 SMR:Q6NWF2 PRIDE:Q6NWF2 GeneID:324818
            KEGG:dre:324818 InParanoid:Q6NWF2 NextBio:20808976 Bgee:Q6NWF2
            Uniprot:Q6NWF2
        Length = 330

 Score = 434 (157.8 bits), Expect = 7.5e-41, P = 7.5e-41
 Identities = 103/313 (32%), Positives = 166/313 (53%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F+ W +QY + Y E  E  +R +IF +N   +++ N    GN  +++ LN+F+D+T  EF
Sbjct:    30 FKSWMSQYNKKY-EINEFYQRLQIFLENKKRIDQHNE---GNHKFSMGLNQFSDMTFAEF 85

Query:    94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGA-VTPVKYQGQCA------- 145
                +T + +++  +     G   +  +   P +++W  KG  +T VK QG C        
Sbjct:    86 --KKT-YLLTEPQNCSATRGN-HVSSNGLYPDAIDWRTKGHYITDVKNQGPCGSCWTFST 141

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
                +E + AI   +L+ L+EQQL+DCA + +N+GC GG    AF+YI+ NKG+  +  Y 
Sbjct:   142 TGCLESVTAIATGKLLQLAEQQLIDCAGDFDNHGCNGGLPSHAFEYIMYNKGLMTEDDYP 201

Query:   206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASA-LQFYSG 263
             Y+    G C   K +  AA +    ++   DE  ++ AVA   PVS A + ++    Y  
Sbjct:   202 YQAKG-GQC-RFKPQLAAAFVKEVVNITKYDEMGMVDAVARLNPVSFAYEVTSDFMHYKD 259

Query:   264 GVFNGY-CET---FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
             G++    C      +NH V AVGY   E G  YW++KNSWG +WG  GYF ++R     +
Sbjct:   260 GIYTSTECHNTTDMVNHAVLAVGYA-EENGTPYWIVKNSWGTNWGIKGYFYIERG----K 314

Query:   320 GQCGIAMFASFPV 332
               CG+A  +S+P+
Sbjct:   315 NMCGLAACSSYPI 327


>UNIPROTKB|J9P7C5 [details] [associations]
            symbol:J9P7C5 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 EMBL:AAEX03010953
            Ensembl:ENSCAFT00000012925 Uniprot:J9P7C5
        Length = 321

 Score = 431 (156.8 bits), Expect = 1.6e-40, P = 1.6e-40
 Identities = 111/310 (35%), Positives = 160/310 (51%)

Query:    36 QWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFI 94
             QWKA + R Y  + E  +R  +++ N+  +E  N   + G   +T+ +N F D+T +EF 
Sbjct:    26 QWKAMHRRLYGMNEEGWRR-AVWEKNMKMIELHNREYSQGKHGFTMAMNAFGDMTNEEFR 84

Query:    95 ASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVA 147
                 GF+   H    K    P     +++P SV+W EKG VTPVK QGQC       A  
Sbjct:    85 QVINGFQNQKHKKG-KVFQEPLF---AEIPKSVDWREKGYVTPVKNQGQCGSCWAFSATG 140

Query:   148 AVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYE 207
             A EG    K   LV LSEQ L        N GC GG MD+AF+Y+  N+ + ++  Y Y 
Sbjct:   141 AFEGQMFWKTGNLVPLSEQNLAQ-----GNEGCNGGLMDNAFQYVKDNRCLDSEESYPYL 195

Query:   208 GMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDASA--LQFYSGG 264
             G  T  C+  K E  AA  + + D+P   E++L+KA+A    ++VAIDA     QFY   
Sbjct:   196 GRDTDTCN-YKPECSAAHDSGFVDLPQR-EKALMKAMATLGSITVAIDAGHQYFQFYKSS 253

Query:   265 V-FNGYCETF-LNHGVTAVGYG-TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
             + F+  C +  L+HGV  VGYG    +    W++KNSW  +WG + Y ++ +        
Sbjct:   254 IYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKWIVKNSWSPEWGWNSYVKMAKG---QNNH 310

Query:   322 CGIAMFASFP 331
             CGI   AS+P
Sbjct:   311 CGITA-ASYP 319


>UNIPROTKB|H9KYW5 [details] [associations]
            symbol:CTSS "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0002250 "adaptive immune response" evidence=IEA]
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 OMA:YEPACTQ EMBL:AADN02010496
            Ensembl:ENSGALT00000001122 Uniprot:H9KYW5
        Length = 245

 Score = 430 (156.4 bits), Expect = 2.0e-40, P = 2.0e-40
 Identities = 107/255 (41%), Positives = 140/255 (54%)

Query:    89 TPQEFIASQTGFKM-SDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC--- 144
             T ++  A  TG ++ S H      N T    +    P +++W EKG VT VK QG C   
Sbjct:     1 TSEDVAALLTGLRVPSGH------NQTSTYRRRGGAPDAMDWREKGCVTEVKNQGACGAC 54

Query:   145 ----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
                 AV A+E    +K  +LVSLS Q LVDC+    N GC GGFM  AF+YII N GI +
Sbjct:    55 WAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSMMYGNKGCGGGFMTRAFQYIIDNNGIDS 114

Query:   201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDASALQ 259
             +  Y Y   + G C        AA  + Y ++P  DE +L  AVAN  PVSVAIDA+   
Sbjct:   115 EESYPYMAQN-GTCQ-YNVSTRAATCSKYVELPYADEAALKDAVANVGPVSVAIDATQPT 172

Query:   260 F--YSGGVFNG-YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
             F  Y  GV++   C   +NHGV  VGYGT  E   +WL+KNSWG+ +G+ GY R+ R+  
Sbjct:   173 FFLYRSGVYDDPRCTQEVNHGVLVVGYGTLNEK-DFWLVKNSWGERFGDGGYIRMSRN-- 229

Query:   317 QPQGQCGIAMFASFP 331
                  CGIA +AS+P
Sbjct:   230 -HANHCGIASYASYP 243


>WB|WBGene00007055 [details] [associations]
            symbol:tag-196 species:6239 "Caenorhabditis elegans"
            [GO:0004869 "cysteine-type endopeptidase inhibitor activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000010
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00031 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00043 SMART:SM00645 InterPro:IPR000169
            GO:GO:0004869 eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 EMBL:FO080488 PIR:T31871
            RefSeq:NP_505215.2 HSSP:Q9UBX1 ProteinModelPortal:O16454 SMR:O16454
            DIP:DIP-27400N IntAct:O16454 MINT:MINT-1044990 MEROPS:C01.A50
            PaxDb:O16454 EnsemblMetazoa:F41E6.6.1 EnsemblMetazoa:F41E6.6.2
            EnsemblMetazoa:F41E6.6.3 GeneID:179240 KEGG:cel:CELE_F41E6.6
            UCSC:F41E6.6.1 CTD:179240 WormBase:F41E6.6 InParanoid:O16454
            OMA:GGGLMTN NextBio:904514 Uniprot:O16454
        Length = 477

 Score = 428 (155.7 bits), Expect = 3.3e-40, P = 3.3e-40
 Identities = 114/316 (36%), Positives = 159/316 (50%)

Query:    30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
             I   F  +  ++ + Y    E  KRF +FK N   +        G   Y     KF+D+T
Sbjct:   170 IWNSFLDFVDRHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVYGF--TKFSDMT 227

Query:    90 PQEFIASQTGFKMSDHSSSLK-ANGTPF--LYKSSQVPPSVNWIEKGAVTPVKYQGQCA- 145
               EF      ++       ++ AN            +P S +W EKGAVT VK QG C  
Sbjct:   228 TMEFKKIMLPYQWEQPVYPMEQANFEKHDVTINEEDLPESFDWREKGAVTQVKNQGNCGS 287

Query:   146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                      VEG   I  N+LVSLSEQ+LVDC + D   GC GG   +A+K II+  G+ 
Sbjct:   288 CWAFSTTGNVEGAWFIAKNKLVSLSEQELVDCDSMDQ--GCNGGLPSNAYKEIIRMGGLE 345

Query:   200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK-AVANQPVSVAIDASAL 258
              +  Y Y+G     C  ++ +D A  I    ++P +DE  + K  V   P+S+ ++A+ L
Sbjct:   346 PEDAYPYDGRGE-TCHLVR-KDIAVYINGSVELP-HDEVEMQKWLVTKGPISIGLNANTL 402

Query:   259 QFYSGGV---FNGYCETF-LNHGVTAVGYGTSEEGIK-YWLIKNSWGQDWGEDGYFRLQR 313
             QFY  GV   F  +CE F LNHGV  VGYG  ++G K YW++KNSWG +WGE GYF+L R
Sbjct:   403 QFYRHGVVHPFKIFCEPFMLNHGVLIVGYG--KDGRKPYWIVKNSWGPNWGEAGYFKLYR 460

Query:   314 DIDQPQGQCGIAMFAS 329
                  +  CG+   A+
Sbjct:   461 G----KNVCGVQEMAT 472


>RGD|1309226 [details] [associations]
            symbol:Cts7 "cathepsin 7" species:10116 "Rattus norvegicus"
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005768 "endosome" evidence=IEA] [GO:0005794 "Golgi apparatus"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0007067
            "mitosis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IEA] [GO:0051301 "cell division" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 RGD:1309226 GO:GO:0005634
            GO:GO:0005794 GO:GO:0048471 GO:GO:0005615 GO:GO:0051301
            GO:GO:0007067 GO:GO:0005768 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 MEROPS:C01.016 CTD:56092
            GeneTree:ENSGT00560000076577 OrthoDB:EOG44QT2S EMBL:CH474032
            IPI:IPI00870531 RefSeq:NP_001099569.1 UniGene:Rn.218615
            Ensembl:ENSRNOT00000043686 GeneID:290970 KEGG:rno:290970
            UCSC:RGD:1309226 OMA:VESFNAN Uniprot:D3ZZ07
        Length = 331

 Score = 427 (155.4 bits), Expect = 4.2e-40, P = 4.2e-40
 Identities = 111/344 (32%), Positives = 176/344 (51%)

Query:     6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
             + V + ++  C   A      + S+  ++E+WK    +TY    E  +R  ++++N+  +
Sbjct:     1 MTVAVFLAILCLRAALAAPRPDYSLDAEWEEWKRNNAKTYSPEEEKQRR-AVWEENVKMI 59

Query:    66 E--RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV 123
             +     N    N ++T+ +N+F D+T +E         M+D S+    NG     ++ ++
Sbjct:    60 KWHTMQNGLWMN-NFTIEMNEFGDMTGEEMRM------MTDSSALTLRNGKHIQKRNVKI 112

Query:   124 PPSVNWIEKGAVTPVKYQGQC------AVAA-VEGINAIKINRLVSLSEQQLVDCATNDN 176
             P +++W + G V PV+ QG C      +VAA +E     K  +L+ LS Q L+DC     
Sbjct:   113 PKTLDWRDTGCVAPVRSQGGCGACWAFSVAASIESQLFKKTGKLIPLSVQNLIDCTVTYG 172

Query:   177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
             NN C GG    AF+Y+  N G+  +A Y YE      C   + E    +I  +  VP N 
Sbjct:   173 NNDCSGGKPYTAFQYVKNNGGLEAEATYPYEAKLRH-C-RYRPERSVVKIARFFVVPRN- 229

Query:   237 EESLLKAVANQ-PVSVAIDASALQF--YSGGVFNG-YCET-FLNHGVTAVGYGTS---EE 288
             EE+L++A+    P++VAID S   F  Y GG+++   C    L+HG+  VGYG      E
Sbjct:   230 EEALMQALVTYGPIAVAIDGSHASFKRYRGGIYHEPKCRRDTLDHGLLLVGYGYEGHESE 289

Query:   289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
               KYWL+KNS G+ WGE GY +L RD       CGIA +A +P+
Sbjct:   290 NRKYWLLKNSHGEQWGERGYMKLPRD---QNNYCGIASYAMYPL 330


>TAIR|locus:2130180 [details] [associations]
            symbol:AT4G16190 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0042744 "hydrogen peroxide catabolic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005773
            EMBL:CP002687 HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 EMBL:Z97340 EMBL:AL161543 UniGene:At.25555
            EMBL:AY039556 EMBL:AY129473 EMBL:AY136316 EMBL:BT000733
            EMBL:AK226366 IPI:IPI00543588 PIR:D71428 RefSeq:NP_567489.1
            HSSP:P25779 ProteinModelPortal:Q9SUL1 SMR:Q9SUL1 STRING:Q9SUL1
            MEROPS:C01.A06 PRIDE:Q9SUL1 EnsemblPlants:AT4G16190.1 GeneID:827311
            KEGG:ath:AT4G16190 TAIR:At4g16190 InParanoid:Q9SUL1 OMA:NACGINK
            PhylomeDB:Q9SUL1 ProtClustDB:CLSN2917559 Genevestigator:Q9SUL1
            Uniprot:Q9SUL1
        Length = 373

 Score = 370 (135.3 bits), Expect = 4.6e-34, P = 4.6e-34
 Identities = 93/268 (34%), Positives = 140/268 (52%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F  +K++Y +TY    E+  RF +FK NL    R  N  + + S    + +F+DLTP+EF
Sbjct:    55 FTLFKSKYEKTYATQVEHDHRFRVFKANLRRARR--NQLL-DPSAVHGVTQFSDLTPKEF 111

Query:    94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AV 146
                  G K             P L  +S +P   +W E+GAVTPVK QG C       A+
Sbjct:   112 RRKFLGLKRRGFRLPTDTQTAPIL-PTSDLPTEFDWREQGAVTPVKNQGMCGSCWSFSAI 170

Query:   147 AAVEGINAIKINRLVSLSEQQLVDC-------ATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
              A+EG + +    LVSLSEQQLVDC         N  ++GC GG M++AF+Y ++  G+ 
Sbjct:   171 GALEGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALKAGGLM 230

Query:   200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQ 259
              +  Y Y G     C   K++   A ++N+  V  ++++     V + P+++AI+A  +Q
Sbjct:   231 KEEDYPYTGRDHTACKFDKSKI-VASVSNFSVVSSDEDQIAANLVQHGPLAIAINAMWMQ 289

Query:   260 FYSGGVFNGY-CETFLNHGVTAVGYGTS 286
              Y GGV   Y C    +HGV  VG+G+S
Sbjct:   290 TYIGGVSCPYVCSKSQDHGVLLVGFGSS 317

 Score = 310 (114.2 bits), Expect = 1.1e-38, Sum P(2) = 1.1e-38
 Identities = 72/199 (36%), Positives = 109/199 (54%)

Query:   145 AVAAVEGINAIKINRLVSLSEQQLVDC-------ATNDNNNGCYGGFMDDAFKYIIQNKG 197
             A+ A+EG + +    LVSLSEQQLVDC         N  ++GC GG M++AF+Y ++  G
Sbjct:   169 AIGALEGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALKAGG 228

Query:   198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
             +  +  Y Y G     C   K++   A ++N+  V  ++++     V + P+++AI+A  
Sbjct:   229 LMKEEDYPYTGRDHTACKFDKSKI-VASVSNFSVVSSDEDQIAANLVQHGPLAIAINAMW 287

Query:   258 LQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEG---IK---YWLIKNSWGQDWGEDGYFR 310
             +Q Y GGV   Y C    +HGV  VG+G+S      +K   YW+IKNSWG  WGE GY++
Sbjct:   288 MQTYIGGVSCPYVCSKSQDHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMWGEHGYYK 347

Query:   311 LQRDIDQPQGQCGIAMFAS 329
             + R    P   CG+    S
Sbjct:   348 ICRG---PHNMCGMDTMVS 363

 Score = 120 (47.3 bits), Expect = 1.1e-38, Sum P(2) = 1.1e-38
 Identities = 45/156 (28%), Positives = 68/156 (43%)

Query:     4 YFLIVVLIISGSCASQATYRTFDEGSI--------AEKFEQW-KAQYGRT-YKESAENSK 53
             +FLI   +++GS  S        +G +         E  EQ   A++  T +K   E + 
Sbjct:     7 FFLIAATLLAGSLGSTVISGEVTDGFVNPIRQVVPEENDEQLLNAEHHFTLFKSKYEKTY 66

Query:    54 RFEIFKDNLVAVERFN-NAAIGNR----SYTLRLNKFADLTPQEFIASQTGFKMSDHSSS 108
               ++  D+   V + N   A  N+    S    + +F+DLTP+EF     G K       
Sbjct:    67 ATQVEHDHRFRVFKANLRRARRNQLLDPSAVHGVTQFSDLTPKEFRRKFLGLKRRGFRLP 126

Query:   109 LKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC 144
                   P L  +S +P   +W E+GAVTPVK QG C
Sbjct:   127 TDTQTAPIL-PTSDLPTEFDWREQGAVTPVKNQGMC 161


>UNIPROTKB|F1NHB8 [details] [associations]
            symbol:F1NHB8 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 EMBL:AADN02044011
            IPI:IPI00586027 Ensembl:ENSGALT00000021873 OMA:SELDHAV
            Uniprot:F1NHB8
        Length = 329

 Score = 412 (150.1 bits), Expect = 1.6e-38, P = 1.6e-38
 Identities = 120/317 (37%), Positives = 161/317 (50%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F  +K ++G+ Y    E+  R   F  N+  V   N AA+   SY+L LN  AD TPQE 
Sbjct:    26 FHHYKERFGKRYSSEEEHEHRKRTFIHNMRFVHSKNRAAL---SYSLALNHLADRTPQE- 81

Query:    94 IASQTGFKMSDHSSSLKANGTPF---LYKSSQVPPSVNWIEKGAVTPVKYQ---GQC--- 144
             +A+  G + S    S    G PF   LY S  +P S++W   GAVTPVK Q   G C   
Sbjct:    82 MAALRGRRRSGDPKS----GQPFSMQLYASLVLPESLDWRLYGAVTPVKDQAVCGSCWSF 137

Query:   145 -AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
                 A+EG   +K   L  LS+Q L+DC+    N  C GG    A+++I ++ GI +   
Sbjct:   138 ATTGAMEGALFLKTGVLTPLSQQVLIDCSWGFGNYACDGGEEWRAYEWIKKHGGIASTES 197

Query:   204 YS-YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDAS--ALQ 259
             Y  Y G + G C   ++E   A +  Y  V   + E+L  A+    PV+V IDAS  +  
Sbjct:   198 YGPYLGQN-GYCHYNQSE-LVAPLAGYVTVESGNAEALKAALFKHGPVAVNIDASHKSFT 255

Query:   260 FYSGGVFNG-YC--ETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
             FY+ GV+   +C  ET  L+H V AVGYG    G  YWLIKNSW   WG DGY  +    
Sbjct:   256 FYANGVYEEPHCGNETSELDHAVLAVGYGVLH-GKSYWLIKNSWSTYWGNDGYILMA--- 311

Query:   316 DQPQGQCGIAMFASFPV 332
                   CG+A  ASFP+
Sbjct:   312 -MKDNNCGVATAASFPI 327


>DICTYBASE|DDB_G0282991 [details] [associations]
            symbol:DDB_G0282991 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0282991 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            EMBL:AAFI02000049 eggNOG:NOG331187 RefSeq:XP_639299.1
            ProteinModelPortal:Q54RQ2 EnsemblProtists:DDB0185304 GeneID:8623870
            KEGG:ddi:DDB_G0282991 InParanoid:Q54RQ2 OMA:PENGNEY Uniprot:Q54RQ2
        Length = 339

 Score = 409 (149.0 bits), Expect = 3.4e-38, P = 3.4e-38
 Identities = 111/318 (34%), Positives = 161/318 (50%)

Query:    30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
             I   F +W  +Y + Y    E   RF  FK N   V+++N   +      L LN FADL+
Sbjct:    23 IENLFIEWTNKYNKIYSNK-EFYMRFNNFKKNKEYVDQWNEKQLET---ILELNFFADLS 78

Query:    90 PQEFIASQTG--FKMSD-HSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-- 144
               E+I +       +S+    + K  G      ++ +  S++W    AVTPVK QG C  
Sbjct:    79 RNEYINNYLASFIDISNIEQKNTKYEGNLKNNFNNSIK-SIDWRNFDAVTPVKNQGLCSG 137

Query:   145 ------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                   A+  +E  + IK   L++LSEQ ++DC T+  NNGC GG    AF YII+ KGI
Sbjct:   138 AGYSFSAIGVIESSHFIKNKELITLSEQNIIDCTTDMGNNGCMGGLALIAFDYIIKQKGI 197

Query:   199 TNDAVYSYEGM------STGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
              ++  Y YEG         G C    +    A I++Y ++   +E  L +++   PVSV 
Sbjct:   198 DSEFNYPYEGYLIEPYEGRGRC-RYNSFYSKASISSYIEIERFNENELTQSLIKSPVSVM 256

Query:   253 IDASALQF--YSGGVFNG-YCE-TFLNHGVTAVGYG-TSEEGIKYWLIKNSWGQDWGEDG 307
             IDAS L F  Y  GV+    C  T LNHG+  +G+G T E G +Y+++KNS+G  WG  G
Sbjct:   257 IDASQLSFMLYKSGVYKDPSCSSTILNHGILNIGFGVTPENGNEYYILKNSFGSKWGMKG 316

Query:   308 YFRLQRDIDQPQGQCGIA 325
             Y  L R+ +     CGI+
Sbjct:   317 YIYLSRNFNN---HCGIS 331


>UNIPROTKB|E9PTT3 [details] [associations]
            symbol:Ctsr "Protein Ctsr" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            IPI:IPI00627092 Ensembl:ENSRNOT00000024115 RGD:631422
            Uniprot:E9PTT3
        Length = 334

 Score = 408 (148.7 bits), Expect = 4.3e-38, P = 4.3e-38
 Identities = 105/323 (32%), Positives = 163/323 (50%)

Query:    25 FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLN 83
             FD    AE +   K +Y ++Y    E  +R  ++++N+  ++  N   ++G   + + +N
Sbjct:    21 FDPSLDAE-WHDXKTEYEKSYTMEEEGHRR-AVWEENMKMIKLHNRENSLGKNGFIMEMN 78

Query:    84 KFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
             +F DLT +EF        +  H               + +P  V+W +KG VT V+ Q  
Sbjct:    79 EFGDLTAEEFRKMMVNIPIRSHRKGKIIRKRDV---GNVLPKFVDWRKKGYVTRVQNQKF 135

Query:   144 C------AVA-AVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
             C      AV  A+EG    K  +L  LS Q LVDC  +  N GC  G    A++Y++ N 
Sbjct:   136 CNSCWAFAVTGAIEGQMFNKTGQLTPLSVQNLVDCTKSQGNEGCQWGDPHIAYEYVLNNG 195

Query:   197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA 255
             G+  +A Y Y+G   G+C     +   A+IT +  +P + E+ L++AVA   P+SVA+DA
Sbjct:   196 GLEAEATYPYKGKE-GVC-RYNPKHSKAEITGFVSLPES-EDILMEAVATIGPISVAVDA 252

Query:   256 S--ALQFYSGGVFNG-YCET-FLNHGVTAVGYG---TSEEGIKYWLIKNSWGQDWGEDGY 308
             S  +  FY  G+++   C    +NH V  VGYG      +G  YWLIKNSWG+ WG  GY
Sbjct:   253 SFNSFGFYKKGLYDEPNCSNNTVNHSVLVVGYGFEGNETDGNSYWLIKNSWGRKWGLRGY 312

Query:   309 FRLQRDIDQPQGQCGIAMFASFP 331
              ++ +D       C IA +A +P
Sbjct:   313 MKIPKD---QNNFCAIASYAHYP 332


>ZFIN|ZDB-GENE-030131-9831 [details] [associations]
            symbol:ctsf "cathepsin F" species:7955 "Danio
            rerio" [GO:0004869 "cysteine-type endopeptidase inhibitor activity"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000010 InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00031 Pfam:PF00112 PRINTS:PR00705 SMART:SM00043
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-030131-9831
            GO:GO:0004869 eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 HOVERGEN:HBG011513 CTD:8722 OrthoDB:EOG4CC41T
            MEROPS:I25.006 EMBL:BC124243 IPI:IPI00503226 RefSeq:NP_001071036.1
            UniGene:Dr.81265 ProteinModelPortal:Q08CH0 SMR:Q08CH0 GeneID:565588
            KEGG:dre:565588 InParanoid:Q08CH0 NextBio:20885952
            ArrayExpress:Q08CH0 Uniprot:Q08CH0
        Length = 473

 Score = 408 (148.7 bits), Expect = 4.3e-38, P = 4.3e-38
 Identities = 106/307 (34%), Positives = 151/307 (49%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F+ +   Y RTY    E  KR  IF+ N+   +   +   G+  Y +   KF+DLT  EF
Sbjct:   175 FKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGI--TKFSDLTEDEF 232

Query:    94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------V 146
                     +S  S  LK    P +  S+  P + +W + GAV+PVK QG C         
Sbjct:   233 RMMYLNPMLSQWS--LKKEMKPAIPASAPAPDTWDWRDHGAVSPVKNQGMCGSCWAFSVT 290

Query:   147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
               +EG    K  +L+SLSEQ+LVDC   D    C GG   +A++ I    G+  +  YSY
Sbjct:   291 GNIEGQWFKKTGQLLSLSEQELVDCDKLDQ--ACGGGLPSNAYEAIENLGGLETETDYSY 348

Query:   207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQFYSGGVF 266
              G     CD       AA I +  ++P +++E       N PVS A++A A+QFY  GV 
Sbjct:   349 TGHKQS-CD-FSTGKVAAYINSSVELPKDEKEIAAFLAENGPVSAALNAFAMQFYRKGVS 406

Query:   267 NG---YCETFL-NHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
             +    +C  ++ +H V  VG+G    G+ +W IKNSWG+D+GE GY+ L R      G C
Sbjct:   407 HPLKIFCNPWMIDHAVLLVGFG-QRNGVPFWAIKNSWGEDYGEQGYYYLYRG----SGLC 461

Query:   323 GIAMFAS 329
             GI    S
Sbjct:   462 GIHKMCS 468


>UNIPROTKB|F1P3U9 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005829 "cytosol" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=IEA] [GO:0010628 "positive regulation of gene expression"
            evidence=IEA] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=IEA] [GO:0010813 "neuropeptide catabolic
            process" evidence=IEA] [GO:0010815 "bradykinin catabolic process"
            evidence=IEA] [GO:0016505 "apoptotic protease activator activity"
            evidence=IEA] [GO:0030108 "HLA-A specific activating MHC class I
            receptor activity" evidence=IEA] [GO:0031638 "zymogen activation"
            evidence=IEA] [GO:0031648 "protein destabilization" evidence=IEA]
            [GO:0032526 "response to retinoic acid" evidence=IEA] [GO:0033619
            "membrane protein proteolysis" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0043129
            "surfactant homeostasis" evidence=IEA] [GO:0045766 "positive
            regulation of angiogenesis" evidence=IEA] [GO:0060448 "dichotomous
            subdivision of terminal units involved in lung branching"
            evidence=IEA] [GO:0070324 "thyroid hormone binding" evidence=IEA]
            [GO:0070371 "ERK1 and ERK2 cascade" evidence=IEA] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA]
            [GO:0097208 "alveolar lamellar body" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066
            GO:GO:0005615 GO:GO:0008284 GO:GO:0070371 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0032526 GO:GO:0010628
            GO:GO:0070324 GO:GO:0016505 GO:GO:0010634 GO:GO:0004197
            GO:GO:0042599 GO:GO:0031648 GO:GO:0097067 GO:GO:0031638
            GO:GO:0001913 GeneTree:ENSGT00660000095458 OMA:STSCHKT
            GO:GO:0030108 GO:GO:0010815 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 EMBL:AADN02038832 EMBL:AADN02038831 IPI:IPI00594147
            Ensembl:ENSGALT00000013440 Uniprot:F1P3U9
        Length = 261

 Score = 406 (148.0 bits), Expect = 7.0e-38, P = 7.0e-38
 Identities = 100/269 (37%), Positives = 145/269 (53%)

Query:    78 YTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGA-VT 136
             + + LN+F+D+T  EF   +  +  S+  +     G  FL      P +V+W +KG  VT
Sbjct:     1 FLVALNQFSDMTFAEF---KKLYLWSEPQNCSATRGN-FLRSDGPCPEAVDWRKKGNFVT 56

Query:   137 PVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAF 189
             PVK QG C           +E   AI   +L+SL+EQ LVDCA   NN+GC GG    AF
Sbjct:    57 PVKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQLLVDCAQAFNNHGCSGGLPSQAF 116

Query:   190 KYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA-NQP 248
             +YI+ NKG+  +  Y Y   + G C   + +   A + +  ++   DE  +++AV  + P
Sbjct:   117 EYILYNKGLMGEDAYPYRAQN-GTC-KFQPDKAIAFVKDVINITQYDEAGMVEAVGKHNP 174

Query:   249 VSVAIDASA-LQFYSGGVF-NGYCETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDW 303
             VS A + ++    Y  GV+ N  CE     +NH V AVGYG  E+G  YW++KNSWG  W
Sbjct:   175 VSFAFEVTSDFMHYRKGVYSNPRCEHTPDKVNHAVLAVGYG-EEDGRPYWIVKNSWGPLW 233

Query:   304 GEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             G DGYF ++R     +  CG+A  AS+PV
Sbjct:   234 GMDGYFLIERG----KNMCGLAACASYPV 258


>DICTYBASE|DDB_G0272815 [details] [associations]
            symbol:cprE "cysteine proteinase 5" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0272815 GO:GO:0005615
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GenomeReviews:CM000151_GR GO:GO:0005764
            EMBL:AAFI02000008 MEROPS:I29.003 KO:K01376 EMBL:L36205
            RefSeq:XP_644977.1 ProteinModelPortal:P54640 SMR:P54640
            PRIDE:P54640 EnsemblProtists:DDB0185092 GeneID:8618654
            KEGG:ddi:DDB_G0272815 OMA:METAFEF ProtClustDB:CLSZ2430780
            Uniprot:P54640
        Length = 344

 Score = 405 (147.6 bits), Expect = 8.9e-38, P = 8.9e-38
 Identities = 110/290 (37%), Positives = 158/290 (54%)

Query:     7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
             + VL++S + A Q     F E      F  W   + ++Y  S E   R+ IFK N+  V+
Sbjct:     7 LCVLLVSVATAKQQ----FSELQYRNAFTDWMITHQKSYT-SEEFGARYNIFKANMDYVQ 61

Query:    67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPS 126
             ++N+   G+ +  L LN FAD+T +E+  +  G K    +SSL       ++ +S    S
Sbjct:    62 QWNSK--GSET-VLGLNNFADITNEEYRNTYLGTKFD--ASSLIGTQEEKVFTTSSAA-S 115

Query:   127 VNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNG 179
              +W  +GAVTPVK QGQC          + EG +      LVSLSEQ L+DC+T   N+G
Sbjct:   116 KDWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCSTE--NSG 173

Query:   180 CYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEES 239
             C GG M  AF+YII N GI  ++ Y Y+    G C+  K+E+  A +++Y+ V    E S
Sbjct:   174 CDGGLMTYAFEYIINNNGIDTESSYPYKA-ENGKCE-YKSENSGATLSSYKTVTAGSESS 231

Query:   240 LLKAVANQPVSVAIDAS--ALQFYSGGVF-NGYCETF-LNHGVTAVGYGT 285
             L  AV   PVSVAIDAS  + Q Y+ G++    C +  L+HGV AVGYG+
Sbjct:   232 LESAVNVNPVSVAIDASHQSFQLYTSGIYYEPECSSENLDHGVLAVGYGS 281

 Score = 126 (49.4 bits), Expect = 3.0e-05, P = 3.0e-05
 Identities = 40/146 (27%), Positives = 67/146 (45%)

Query:   190 KYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPV 249
             +Y  +N G T   + SY+ ++ G   S+++  +   ++   D      +     +  +P 
Sbjct:   208 EYKSENSGAT---LSSYKTVTAGSESSLESAVNVNPVSVAIDASHQSFQLYTSGIYYEPE 264

Query:   250 --SVAIDASALQF-YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
               S  +D   L   Y  G  +G   +  + G ++     S    +YW++KNSWG  WG +
Sbjct:   265 CSSENLDHGVLAVGYGSG--SG-SSSGQSSGQSSGNLSASSSN-EYWIVKNSWGTSWGIE 320

Query:   307 GYFRLQRDIDQPQGQCGIAMFASFPV 332
             GY  + R+ D     CGIA  ASFPV
Sbjct:   321 GYILMSRNRDN---NCGIASSASFPV 343


>FB|FBgn0033874 [details] [associations]
            symbol:CG6347 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE013599 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 HSSP:P53634 EMBL:AY069609
            RefSeq:NP_610906.1 UniGene:Dm.608 SMR:Q7K0S6 MEROPS:C01.A29
            EnsemblMetazoa:FBtr0087637 GeneID:36531 KEGG:dme:Dmel_CG6347
            UCSC:CG6347-RA FlyBase:FBgn0033874 InParanoid:Q7K0S6 OMA:FEYIRDH
            OrthoDB:EOG4FQZ74 GenomeRNAi:36531 NextBio:799046 Uniprot:Q7K0S6
        Length = 352

 Score = 402 (146.6 bits), Expect = 1.9e-37, P = 1.9e-37
 Identities = 114/345 (33%), Positives = 173/345 (50%)

Query:    10 LIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN 69
             L + G+ + Q   ++F +    + F+ +  Q G+ Y +  E   R  IF   +  +   N
Sbjct:    15 LALLGAVSLQQL-QSFPKLCDVQNFDDFLRQTGKVYSDE-ERVYRESIFAAKMSLITLSN 72

Query:    70 -NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK----SSQVP 124
              NA  G   + L +N  AD+T +E IA+  G K+S+           F+      S+ +P
Sbjct:    73 KNADNGVSGFRLGVNTLADMTRKE-IATLLGSKISEFGERYTNGHINFVTARNPASANLP 131

Query:   125 PSVNWIEKGAVTPVKYQG----QC----AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
                +W EKG VTP  +QG     C       A+EG    +   L SLS+Q LVDCA +  
Sbjct:   132 EMFDWREKGGVTPPGFQGVGCGACWSFATTGALEGHLFRRTGVLASLSQQNLVDCADDYG 191

Query:   177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKA----EDHAAQITNYEDV 232
             N GC GGF +  F+YI ++ G+T    Y Y         +  A     +   +I +Y  +
Sbjct:   192 NMGCDGGFQEYGFEYI-RDHGVTLANKYPYTQTEMQCRQNETAGRPPRESLVKIRDYATI 250

Query:   233 PPNDEESLLKAVANQ-PVSVAIDASALQF--YSGGVFNGY-C-ETFLNHGVTAVGYGTSE 287
              P DEE + + +A   P++ +++A  + F  YSGG++    C +  LNH VT VGYGT E
Sbjct:   251 TPGDEEKMKEVIATLGPLACSMNADTISFEQYSGGIYEDEECNQGELNHSVTVVGYGT-E 309

Query:   288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
              G  YW+IKNS+ Q+WGE G+ R+ R+     G CGIA   S+P+
Sbjct:   310 NGRDYWIIKNSYSQNWGEGGFMRILRNAG---GFCGIASECSYPI 351


>ZFIN|ZDB-GENE-050417-107 [details] [associations]
            symbol:zgc:110239 "zgc:110239" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-050417-107
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 MEROPS:I29.003 OrthoDB:EOG412M56 EMBL:BC092817
            IPI:IPI00503987 RefSeq:NP_001017633.1 UniGene:Dr.39081
            ProteinModelPortal:Q568K7 GeneID:550326 KEGG:dre:550326
            HOGENOM:HOG000007373 HOVERGEN:HBG105018 InParanoid:Q568K7
            NextBio:20879584 ArrayExpress:Q568K7 Uniprot:Q568K7
        Length = 546

 Score = 402 (146.6 bits), Expect = 1.9e-37, P = 1.9e-37
 Identities = 111/312 (35%), Positives = 158/312 (50%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F  +K ++ R Y    E+ +R   F  N+  V   N A +   S++L +N  AD + +E 
Sbjct:   243 FGHYKEKFNRQYDNEMEHEEREHNFVHNIRYVHSMNRAGL---SFSLSVNHLADRSQKE- 298

Query:    94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQ---GQC----AV 146
             ++   G + + H    KA   P   +S   P SV+W   GAVTPVK Q   G C      
Sbjct:   299 LSMMRGCQRT-HKVHRKAQPFPSEIRSIATPNSVDWRLYGAVTPVKDQAVCGSCWSFATT 357

Query:   147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY-S 205
               +EG   +K  +L SLS+Q LVDC     NNGC GG    AF++I+++ GI+    Y +
Sbjct:   358 GTLEGALFLKTGQLTSLSQQMLVDCTWGFGNNGCDGGEEWRAFEWIMKHGGISTAESYGA 417

Query:   206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYS 262
             Y GM+ G+C   K+    AQ+T Y +V   D  +L  A+    PV+V+IDA+  +  FYS
Sbjct:   418 YMGMN-GLCHYDKSS-MVAQLTGYTNVTSGDILALKAAIFKFGPVAVSIDAAHRSFAFYS 475

Query:   263 GGVF-NGYCETFLN---HGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
              GV+    C+  +N   H V AVGYG       YWL+KNSW   WG DGY  +       
Sbjct:   476 NGVYYEPECKNGINDLDHAVLAVGYGIMNNE-SYWLVKNSWSSYWGNDGYILMS----MK 530

Query:   319 QGQCGIAMFASF 330
                CG+A  A +
Sbjct:   531 DNNCGVATDAIY 542


>UNIPROTKB|Q0VCU3 [details] [associations]
            symbol:CTSF "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            HOVERGEN:HBG011513 MEROPS:C01.018 CTD:8722 OMA:LAPPEWD
            OrthoDB:EOG4CC41T GeneTree:ENSGT00660000095458 EMBL:DAAA02063594
            EMBL:BC120003 IPI:IPI00717812 RefSeq:NP_001068884.1 UniGene:Bt.7264
            SMR:Q0VCU3 Ensembl:ENSBTAT00000014587 GeneID:509715 KEGG:bta:509715
            InParanoid:Q0VCU3 NextBio:20869091 Uniprot:Q0VCU3
        Length = 460

 Score = 400 (145.9 bits), Expect = 3.0e-37, P = 3.0e-37
 Identities = 109/315 (34%), Positives = 150/315 (47%)

Query:    30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
             +A  F+ +   Y RTY    E S R  +F +N+V  ++      G   Y +   KF+DLT
Sbjct:   159 MASIFKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTARYGV--TKFSDLT 216

Query:    90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
              +EF        + D       N  P    +   PP  +W  KGAVT VK QG C     
Sbjct:   217 EEEFRTIYLNPLLKDAPGR---NMRPAQPVTDVPPPQWDWRNKGAVTNVKDQGMCGSCWA 273

Query:   146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
                   VEG   +K   L+SLSEQ+L+DC   D    C GG   +A+  I    G+  + 
Sbjct:   274 FSVTGNVEGQWFLKRGTLLSLSEQELLDCDKTDK--ACLGGLPSNAYSAIRTLGGLETED 331

Query:   203 VYSYEG-MSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQFY 261
              YSY G + T  C S  AE     I +  ++  N+++       N PVS+AI+A  +QFY
Sbjct:   332 DYSYRGRLQT--C-SFSAEKAKVYINDSVELSKNEQKLAAWLAKNGPVSIAINAFGMQFY 388

Query:   262 SGGV---FNGYCETFL-NHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
               G+       C  +L +H V  VGYG +   I +W IKNSWG DWGE+GY+ L R    
Sbjct:   389 RHGISHPLRPLCSPWLIDHAVLLVGYG-NRSAIPFWAIKNSWGTDWGEEGYYYLHRG--- 444

Query:   318 PQGQCGIAMFASFPV 332
               G CG+ + AS  V
Sbjct:   445 -SGACGVNIMASSAV 458


>WB|WBGene00019986 [details] [associations]
            symbol:R09F10.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            EMBL:FO081137 HSSP:P53634 PIR:D89588 RefSeq:NP_509408.1
            ProteinModelPortal:Q23030 SMR:Q23030 STRING:Q23030 MEROPS:C01.A44
            PaxDb:Q23030 EnsemblMetazoa:R09F10.1 GeneID:181087
            KEGG:cel:CELE_R09F10.1 UCSC:R09F10.1 CTD:181087 WormBase:R09F10.1
            InParanoid:Q23030 OMA:EYPYSAL NextBio:912346 Uniprot:Q23030
        Length = 383

 Score = 400 (145.9 bits), Expect = 3.0e-37, P = 3.0e-37
 Identities = 107/343 (31%), Positives = 167/343 (48%)

Query:     6 LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
             L+ +LI+      Q      +     + F  +  ++ R Y    E   R++IF  N++  
Sbjct:    54 LLTMLILLSFFVFQRLNHKMENLKHEQMFNDFILKFDRKYTSVEEFEYRYQIFLRNVIEF 113

Query:    66 ERFNNAAIGNRSYTLRLNKFADLTPQEF--IASQTGFKMSDHSSSLKANGTPFLYKSSQV 123
             E      +G     L +N+F D T +E   +  +  +   D  +  K  G+ +L      
Sbjct:   114 EAEEERNLG---LDLDVNEFTDWTDEELQKMVQENKYTKYDFDTP-KFEGS-YLETGVIR 168

Query:   124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
             P S++W E+G +TP+K QGQC        VA+VE  NAIK  +LVSLSEQ++VDC  +  
Sbjct:   169 PASIDWREQGKLTPIKNQGQCGSCWAFATVASVEAQNAIKKGKLVSLSEQEMVDC--DGR 226

Query:   177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
             NNGC GG+   A K++ +N G+ ++  Y Y  +    C  +K  D    I ++  +  N+
Sbjct:   227 NNGCSGGYRPYAMKFVKEN-GLESEKEYPYSALKHDQC-FLKENDTRVFIDDFRMLS-NN 283

Query:   237 EESLLKAVANQ-PVSVAIDA-SALQFYSGGVFNGYCETFLN-----HGVTAVGYGTSEEG 289
             EE +   V  + PV+  ++   A+  Y  G+FN   E         H +T +GYG   E 
Sbjct:   284 EEDIANWVGTKGPVTFGMNVVKAMYSYRSGIFNPSVEDCTEKSMGAHALTIIGYGGEGES 343

Query:   290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
               YW++KNSWG  WG  GYFRL R ++     CG+A     P+
Sbjct:   344 A-YWIVKNSWGTSWGASGYFRLARGVNS----CGLANTVVAPI 381


>FB|FBgn0032228 [details] [associations]
            symbol:CG5367 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE014134 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 HSSP:P80067
            RefSeq:NP_609387.1 UniGene:Dm.26782 ProteinModelPortal:Q9VKY4
            SMR:Q9VKY4 MEROPS:C01.A30 EnsemblMetazoa:FBtr0080055 GeneID:34401
            KEGG:dme:Dmel_CG5367 UCSC:CG5367-RA FlyBase:FBgn0032228
            InParanoid:Q9VKY4 OMA:QIVDCSV OrthoDB:EOG4THT8X PhylomeDB:Q9VKY4
            GenomeRNAi:34401 NextBio:788324 ArrayExpress:Q9VKY4 Bgee:Q9VKY4
            Uniprot:Q9VKY4
        Length = 338

 Score = 393 (143.4 bits), Expect = 1.7e-36, P = 1.7e-36
 Identities = 96/317 (30%), Positives = 167/317 (52%)

Query:    33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQ 91
             +FE++K    R Y  + +  + ++ F++N   +E  N N   G  S+ L+ N FAD++  
Sbjct:    35 EFEKFKNNNNRKYLRTYDEMRSYKAFEENFKVIEEHNQNYKEGQTSFRLKPNIFADMSTD 94

Query:    92 EFIASQTGFKMS--DHSSSLKAN--GTPFLYKSSQVPPSVNWIEKGAVTPVKYQ---GQC 144
              ++        S  + S+   A   G+P +   + VP S++W  KG +TP   Q   G C
Sbjct:    95 GYLKGFLRLLKSNIEDSADNMAEIVGSPLM---ANVPESLDWRSKGFITPPYNQLSCGSC 151

Query:   145 -AVAAVEGI--NAIK-INRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
              A +  E I     K   +++SLS+QQ+VDC+ +  N GC GG + +   Y+    GI  
Sbjct:   152 YAFSIAESIMGQVFKRTGKILSLSKQQIVDCSVSHGNQGCVGGSLRNTLSYLQSTGGIMR 211

Query:   201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--A 257
             D  Y Y     G C  +  +     +T++  +P  DE+++  AV +  PV+++I+AS   
Sbjct:   212 DQDYPYVARK-GKCQFVP-DLSVVNVTSWAILPVRDEQAIQAAVTHIGPVAISINASPKT 269

Query:   258 LQFYSGGVFNG-YCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
              Q YS G+++   C +  +NH +  +G+G       YW++KN WGQ+WGE+GY R+++ +
Sbjct:   270 FQLYSDGIYDDPLCSSASVNHAMVVIGFGKD-----YWILKNWWGQNWGENGYIRIRKGV 324

Query:   316 DQPQGQCGIAMFASFPV 332
             +     CGIA +A++ +
Sbjct:   325 NM----CGIANYAAYAI 337


>UNIPROTKB|E2RR02 [details] [associations]
            symbol:CTSF "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            OMA:LAPPEWD GeneTree:ENSGT00660000095458 EMBL:AAEX03011628
            Ensembl:ENSCAFT00000019742 Uniprot:E2RR02
        Length = 460

 Score = 392 (143.0 bits), Expect = 2.1e-36, P = 2.1e-36
 Identities = 104/314 (33%), Positives = 151/314 (48%)

Query:    30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
             +A  F+++   Y RTY+   E   R  +F +N+V  ++      G   Y +   KF+DLT
Sbjct:   158 MASVFKEFVTTYNRTYETKEEAEWRMSVFSNNMVRAQKIQALDRGTAQYGI--TKFSDLT 215

Query:    90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
              +EF        + ++    K      +   +  PP  +W  KGAVT VK QG C     
Sbjct:   216 EEEFRTIYLNPLLRENRGK-KMRLAKSISDHAP-PPEWDWRSKGAVTKVKDQGMCGSCWA 273

Query:   146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
                   VEG   +K   L+SLSEQ+L+DC   D    C GG   +A+  I+   G+  + 
Sbjct:   274 FSVTGNVEGQWFLKEGTLLSLSEQELLDCDKVDK--ACLGGLPSNAYSAIMTLGGLETED 331

Query:   203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQFYS 262
              YSY+G     C S  A+     I +  ++  N+++         P+SVAI+A  +QFY 
Sbjct:   332 DYSYQGHLQA-C-SFSAKKARVYINDSMELSQNEQKLAAWLAKKGPISVAINAFGMQFYR 389

Query:   263 GGV---FNGYCETFL-NHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
              G+       C  +L +H V  VGYG +  GI +W IKNSWG DWGE+GY+ L R     
Sbjct:   390 HGISHPLRPLCSPWLIDHAVLLVGYG-NRSGIPFWAIKNSWGTDWGEEGYYYLHRG---- 444

Query:   319 QGQCGIAMFASFPV 332
              G CG+   AS  V
Sbjct:   445 SGACGVNTMASSAV 458


>DICTYBASE|DDB_G0279187 [details] [associations]
            symbol:cprG "cysteine proteinase 7" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0279187 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000030 ProtClustDB:CLSZ2846820 MEROPS:C01.081
            EMBL:U72746 RefSeq:XP_641720.2 ProteinModelPortal:Q94504 SMR:Q94504
            PRIDE:Q94504 EnsemblProtists:DDB0215005 GeneID:8621915
            KEGG:ddi:DDB_G0279187 OMA:INTETEK Uniprot:Q94504
        Length = 460

 Score = 390 (142.3 bits), Expect = 3.5e-36, P = 3.5e-36
 Identities = 108/292 (36%), Positives = 152/292 (52%)

Query:     7 IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
             + VL++S + A Q       E      F  W   + R Y  S E + R+ IFK N+  V 
Sbjct:     7 LCVLLVSVATAKQQ----LSEVEYRNAFTNWMIAHQRHYS-SEEFNGRYNIFKANMDYVN 61

Query:    67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPS 126
              +N    G+ +  L LN FAD++ +E+ A+  G      +SSL+   +  ++ +S     
Sbjct:    62 EWNTK--GSET-VLGLNVFADISNEEYRATYLGTPFD--ASSLEMTESDKIFDASA---Q 113

Query:   127 VNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINR--LVSLSEQQLVDCATNDNN 177
             V+W  +GAVTP+K QGQC          A EG   +   +  LVSLSEQ L+DC+ +  N
Sbjct:   114 VDWRTQGAVTPIKNQGQCGGCWSFSTTGATEGAQYLANGKKNLVSLSEQNLIDCSGSYGN 173

Query:   178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
             NGC GG M  AF+YII NKGI  ++ Y Y       C     ++ AAQ+++Y +V    E
Sbjct:   174 NGCEGGLMTLAFEYIINNKGIDTESSYPYTAEDGKKC-KFNPKNVAAQLSSYVNVTSGSE 232

Query:   238 ESLLKAVANQPVSVAIDAS--ALQFYSGGVFNG-YCE-TFLNHGVTAVGYGT 285
               L   V   P SVAIDAS  + Q Y  G++N   C  T L+HGV AVG+GT
Sbjct:   233 SDLAAKVTQGPTSVAIDASNQSFQLYVSGIYNEPACSSTQLDHGVLAVGFGT 284

 Score = 115 (45.5 bits), Expect = 0.00084, P = 0.00084
 Identities = 31/87 (35%), Positives = 42/87 (48%)

Query:   250 SVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
             S ++  SA    SG        +  N GV    Y T+ +   YW++KNSWG  WG DGY 
Sbjct:   383 SGSVSGSASGSASGSASGSSSGSNSNGGV----YPTAGD---YWIVKNSWGTSWGMDGYI 435

Query:   310 RLQRDIDQPQGQCGIAMFASFPVSKES 336
              + +  +    QCGIA  AS P +  S
Sbjct:   436 LMTKGNNN---QCGIATMASRPTAVAS 459


>DICTYBASE|DDB_G0281077 [details] [associations]
            symbol:DDB_G0281077 species:44689 "Dictyostelium
            discoideum" [GO:0030246 "carbohydrate binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR008965 InterPro:IPR013128 InterPro:IPR019028
            Pfam:PF00112 Pfam:PF09478 PRINTS:PR00705 SMART:SM00645
            SMART:SM01063 InterPro:IPR000169 dictyBase:DDB_G0281077
            GO:GO:0030246 EMBL:AAFI02000040 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 SUPFAM:SSF49384 ProtClustDB:CLSZ2430562
            RefSeq:XP_640803.1 ProteinModelPortal:Q54UH3
            EnsemblProtists:DDB0203998 GeneID:8622857 KEGG:ddi:DDB_G0281077
            InParanoid:Q54UH3 OMA:LINDFNF Uniprot:Q54UH3
        Length = 662

 Score = 331 (121.6 bits), Expect = 4.7e-36, Sum P(2) = 4.7e-36
 Identities = 90/243 (37%), Positives = 129/243 (53%)

Query:    69 NNAAIGNRSYTLR-LNKFADLTPQEFIASQTGFKMSDHSSSLKAN--GTPFLYKSSQVPP 125
             N+  +G+ S+T   + K   + P   +        S  SS++  +      L K S+ P 
Sbjct:   416 NSIEVGS-SHTFGYIQKAYSINPLLSVNKVCQESSSSSSSNITTDEPSKSRLLKWSR-PI 473

Query:   126 SVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
             S++W   G V+ VK QG C        V A+E     K NR+++LSEQ LVDC  N  N 
Sbjct:   474 SIDWRTWGMVSKVKNQGSCGSCYAFSTVGALEAHYYRKNNRMLNLSEQNLVDCTRNYGNG 533

Query:   179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
              C GG+M + F+YI +N GI   + Y YEG   G+C    + D  ++I+NY  +  +DEE
Sbjct:   534 ECSGGWMHNCFRYIKENGGINLQSTYPYEGR-VGLC-RYNSGDAQSRISNYVMIKQHDEE 591

Query:   239 SLLKAVANQ-PVSVAIDASALQF--YSGGVFNG-YCETF-LNHGVTAVGYGTSEEGIKYW 293
              L  AVA+  PVSVA DAS  +F  YS G++N   C+ +   H V  VGYG  E G+ +W
Sbjct:   592 DLANAVASVGPVSVAYDASTREFMYYSSGIYNSDSCDKYRTTHAVVVVGYGI-ENGVDFW 650

Query:   294 LIK 296
             +IK
Sbjct:   651 IIK 653

 Score = 89 (36.4 bits), Expect = 4.7e-36, Sum P(2) = 4.7e-36
 Identities = 20/61 (32%), Positives = 34/61 (55%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F QW  Q+ RTY+   +   ++E FKD+   +E++      N +  L L +F+D+T  EF
Sbjct:   162 FIQWSNQFNRTYRAD-QFLLKYEAFKDSSRFIEQYKREN-QNSTMELGLTQFSDMTHDEF 219

Query:    94 I 94
             +
Sbjct:   220 L 220


>UNIPROTKB|F1RU48 [details] [associations]
            symbol:CTSF "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            OMA:LAPPEWD GeneTree:ENSGT00660000095458 EMBL:CU928034
            EMBL:FP565364 Ensembl:ENSSSCT00000014140 Ensembl:ENSSSCT00000014154
            Uniprot:F1RU48
        Length = 460

 Score = 388 (141.6 bits), Expect = 5.7e-36, P = 5.7e-36
 Identities = 107/315 (33%), Positives = 151/315 (47%)

Query:    30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
             +A  F+++   Y RTY    E   R  +F +N+V  ++      G   Y +   KF+DLT
Sbjct:   159 MASIFKEFVTTYNRTYDTKEEARWRMSVFANNMVRAQKIQALDTGTARYGV--TKFSDLT 216

Query:    90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
              +EF        + +     K      +  SS  PP  +W +KGAVT VK QG C     
Sbjct:   217 EEEFRTIYLNPLLQEEPGR-KMRLAKSV--SSLPPPEWDWRKKGAVTKVKDQGMCGSCWA 273

Query:   146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
                   VEG   +K   L+SLSEQ+L+DC   D   GC GG   +A+  I    G+  + 
Sbjct:   274 FSVTGNVEGQWFLKQGTLLSLSEQELLDCDKVDK--GCMGGLPSNAYSAIKTLGGLETEE 331

Query:   203 VYSYEG-MSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQFY 261
              YSY G + T  C S  AE     I +  ++  N+++         P+SVAI+A  +QFY
Sbjct:   332 DYSYRGHLQT--C-SFNAEKAKVYINDSVELSQNEQKLAAWLAEKGPISVAINAFGMQFY 388

Query:   262 SGGV---FNGYCETFL-NHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
               G+       C  +L +H V  VGYG +     +W IKNSWG DWGE+GY+ L R    
Sbjct:   389 RHGISHPLRPLCSPWLIDHAVLLVGYG-NRSATPFWAIKNSWGTDWGEEGYYYLYRG--- 444

Query:   318 PQGQCGIAMFASFPV 332
               G CG+ + AS  V
Sbjct:   445 -SGACGVNIMASSAV 458


>RGD|1309354 [details] [associations]
            symbol:Ctsw "cathepsin W" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1309354 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 MEROPS:C01.037
            CTD:1521 eggNOG:NOG288820 HOVERGEN:HBG100117 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 EMBL:CH473953 EMBL:BC093401 IPI:IPI00371471
            RefSeq:NP_001019413.1 UniGene:Rn.34406 Ensembl:ENSRNOT00000037404
            GeneID:293676 KEGG:rno:293676 UCSC:RGD:1309354 InParanoid:Q561Q9
            NextBio:636716 Genevestigator:Q561Q9 Uniprot:Q561Q9
        Length = 371

 Score = 283 (104.7 bits), Expect = 5.7e-36, Sum P(2) = 5.7e-36
 Identities = 85/307 (27%), Positives = 143/307 (46%)

Query:     2 AKYFLIVVLIISGSCASQATYRTFDEG----SIAEKFEQWKAQYGRTYKESAENSKRFEI 57
             A  F  + L+++G   S +   T D G     + E F+ ++ Q+ R+Y   AE ++R  I
Sbjct:     5 AHLFYFLALLLAGQGLSDSLL-TKDAGPRPLELKEVFKLFQIQFNRSYSNPAEYTRRLGI 63

Query:    58 FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL 117
             F  NL   +R     +G   +      F+DLT +EF     G + +       A      
Sbjct:    64 FAHNLAQAQRLQEEDLGTAEFGQ--TPFSDLTEEEF-GQLYGHQRAPERILNMAKKVKSE 120

Query:   118 YKSSQVPPSVNWIE-KGAVTPVKYQGQC----AVAAVEGINA---IKINRLVSLSEQQLV 169
                  VPP+ +W + K  ++ +K QG C    A+AA + I     IK  + V +S Q+L+
Sbjct:   121 RWGESVPPTCDWRKVKNIISSIKNQGNCRCCWAIAAADNIQTLWRIKTQQFVDVSVQELL 180

Query:   170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG-MSTGICDSIKAEDHAAQITN 228
             DC  +   NGC GGF+ DA+  ++ N G+ ++  Y ++G      C + K     A I +
Sbjct:   181 DC--DRCGNGCNGGFVWDAYITVLNNSGLASEEDYPFQGHQKPHRCLADKYRK-VAWIQD 237

Query:   229 YEDVPPNDEESLLKAVANQPVSVAIDASALQFYSGGVFNGY---CETFL-NHGVTAVGYG 284
             +  +  N++        + P++V I+   LQ+Y  GV       C+  L NH V  VG+G
Sbjct:   238 FTMLSSNEQVIAGYLAIHGPITVTINMKLLQYYQKGVIKATPSTCDPHLVNHSVLLVGFG 297

Query:   285 TSEEGIK 291
               + G++
Sbjct:   298 KEKGGMQ 304

 Score = 121 (47.7 bits), Expect = 5.7e-36, Sum P(2) = 5.7e-36
 Identities = 29/82 (35%), Positives = 41/82 (50%)

Query:   274 LNHGVTAVGYGTSEEGIK----------------YWLIKNSWGQDWGEDGYFRLQRDIDQ 317
             +NH V  VG+G  + G++                YW++KNSWG +WGE GYFRL R    
Sbjct:   287 VNHSVLLVGFGKEKGGMQTGTLLSHSRKPRRSTPYWILKNSWGAEWGEKGYFRLYRG--- 343

Query:   318 PQGQCGIAMFASFPVSKESAQP 339
                 CGIA    +P++    +P
Sbjct:   344 -NNTCGIA---KYPITARVDRP 361


>FB|FBgn0250848 [details] [associations]
            symbol:26-29-p "26-29kD-proteinase" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005811
            "lipid particle" evidence=IDA] [GO:0005875 "microtubule associated
            complex" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005875 EMBL:AE014296 GO:GO:0005811 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 MEROPS:I29.003 HSSP:O65039
            EMBL:AY122222 EMBL:AB011376 RefSeq:NP_620470.1 UniGene:Dm.3049
            SMR:Q9V3U6 MINT:MINT-890485 STRING:Q9V3U6
            EnsemblMetazoa:FBtr0075766 GeneID:39547 KEGG:dme:Dmel_CG8947
            UCSC:CG8947-RA CTD:39547 FlyBase:FBgn0250848 InParanoid:Q9V3U6
            OMA:IHSKNRA OrthoDB:EOG4BVQ8T GenomeRNAi:39547 NextBio:814210
            Uniprot:Q9V3U6
        Length = 549

 Score = 384 (140.2 bits), Expect = 1.5e-35, P = 1.5e-35
 Identities = 110/325 (33%), Positives = 160/325 (49%)

Query:    26 DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
             DE  + + F  +K ++G  Y    E+  R  IF+ NL  +   N A +   +YTL +N  
Sbjct:   238 DE-HVDKAFHHFKRKHGVAYHSDTEHEHRKNIFRQNLRYIHSKNRAKL---TYTLAVNHL 293

Query:    86 ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK----SSQVPPSVNWIEKGAVTPVKYQ 141
             AD T +E + ++ G+K    SS +   G PF Y       ++P   +W   GAVTPVK Q
Sbjct:   294 ADKTEEE-LKARRGYK----SSGIYNTGKPFPYDVPKYKDEIPDQYDWRLYGAVTPVKDQ 348

Query:   142 ---GQC----AVAAVEGINAIKIN-RLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
                G C     +  +EG   +K    LV LS+Q L+DC+    NNGC GG     +++++
Sbjct:   349 SVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDCSWAYGNNGCDGGEDFRVYQWML 408

Query:   194 QNKGITNDAVYS-YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL-LKAVANQPVSV 251
             Q+ G+  +  Y  Y G   G C  +      A I  + +V  ND  +  L  + + P+SV
Sbjct:   409 QSGGVPTEEEYGPYLGQD-GYCH-VNNVTLVAPIKGFVNVTSNDPNAFKLALLKHGPLSV 466

Query:   252 AIDAS--ALQFYSGGVF-NGYCETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
             AIDAS     FYS GV+    C+     L+H V AVGYG S  G  YWL+KNSW   WG 
Sbjct:   467 AIDASPKTFSFYSHGVYYEPTCKNDVDGLDHAVLAVGYG-SINGEDYWLVKNSWSTYWGN 525

Query:   306 DGYFRLQRDIDQPQGQCGIAMFASF 330
             DGY  +       +  CG+    ++
Sbjct:   526 DGYILMSAK----KNNCGVMTMPTY 546


>MGI|MGI:1861434 [details] [associations]
            symbol:Ctsf "cathepsin F" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=ISS]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISS] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:1861434 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513 MEROPS:C01.018
            CTD:8722 OMA:LAPPEWD OrthoDB:EOG4CC41T EMBL:AF136280 EMBL:AF217224
            EMBL:AJ131851 EMBL:AK075862 EMBL:BC058758 IPI:IPI00126769
            RefSeq:NP_063914.1 UniGene:Mm.29561 ProteinModelPortal:Q9R013
            SMR:Q9R013 STRING:Q9R013 PhosphoSite:Q9R013 PaxDb:Q9R013
            PRIDE:Q9R013 Ensembl:ENSMUST00000119694 GeneID:56464 KEGG:mmu:56464
            UCSC:uc008gbc.1 GeneTree:ENSGT00660000095458 InParanoid:Q9R013
            NextBio:312722 Bgee:Q9R013 CleanEx:MM_CTSF Genevestigator:Q9R013
            GermOnline:ENSMUSG00000006458 Uniprot:Q9R013
        Length = 462

 Score = 382 (139.5 bits), Expect = 2.4e-35, P = 2.4e-35
 Identities = 103/314 (32%), Positives = 146/314 (46%)

Query:    30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
             +A  F+ +   Y RTY+   E   R  +F  N++  ++      G   Y +   KF+DLT
Sbjct:   161 MAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGI--TKFSDLT 218

Query:    90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
              +EF        +    S  K +  P    +   PP  +W +KGAVT VK QG C     
Sbjct:   219 EEEFHTIYLN-PLLQKESGRKMS--PAKSINDLAPPEWDWRKKGAVTEVKNQGMCGSCWA 275

Query:   146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
                   VEG   +    L+SLSEQ+L+DC   D    C GG   +A+  I    G+  + 
Sbjct:   276 FSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDK--ACLGGLPSNAYAAIKNLGGLETED 333

Query:   203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQFYS 262
              Y Y+G     C+   A+     I +  ++  N+ +         P+SVAI+A  +QFY 
Sbjct:   334 DYGYQG-HVQTCN-FSAQMAKVYINDSVELSRNENKIAAWLAQKGPISVAINAFGMQFYR 391

Query:   263 GGV---FNGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
              G+   F   C   F++H V  VGYG +   I YW IKNSWG DWGE+GY+ L R     
Sbjct:   392 HGIAHPFRPLCSPWFIDHAVLLVGYG-NRSNIPYWAIKNSWGSDWGEEGYYYLYRG---- 446

Query:   319 QGQCGIAMFASFPV 332
              G CG+   AS  V
Sbjct:   447 SGACGVNTMASSAV 460


>UNIPROTKB|F1MHV4 [details] [associations]
            symbol:CTSW "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 OMA:GRCGDGC EMBL:DAAA02063574
            IPI:IPI00716321 Ensembl:ENSBTAT00000027681 Uniprot:F1MHV4
        Length = 375

 Score = 274 (101.5 bits), Expect = 3.9e-35, Sum P(2) = 3.9e-35
 Identities = 84/300 (28%), Positives = 153/300 (51%)

Query:     7 IVVLIISGSCAS-QATYRTFDEG----SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
             ++ L+++G     + + R  D G     + E F  ++ QY R+Y   AE ++R +IF  N
Sbjct:    10 LLALLVAGLAQGIKDSLRGQDPGPQPLELKEVFRLFQMQYNRSYPNPAEYARRLDIFAQN 69

Query:    62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS 121
             L   +R     +G   + +   +F+DLT +EF+    G +++  +  +        +  S
Sbjct:    70 LAKAQRLQEEDLGTAEFGV--TQFSDLTEEEFVQLY-GSQVAGEALGVSRKVGSEEWGES 126

Query:   122 QVPPSVNWIEKGAVTPVKYQGQC----AVAA---VEGINAIKINRLVSLSEQ-QLVDCAT 173
             + P + +W + G ++PV+ Q  C    A+AA   +E + AIK    V +S Q +L+DC  
Sbjct:   127 E-PQTCDWRKVGTISPVRDQRNCNCCWAMAAAGNIEALWAIKFRHFVEVSVQPELLDC-- 183

Query:   174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMS-TGICDSIKAEDHAAQITNYEDV 232
             +   NGC GGF+ DAF  ++ N G+ ++  Y + G   T  C + K +   A I ++  +
Sbjct:   184 DRCGNGCRGGFVWDAFLTVLNNSGLASEKDYPFNGSGKTHRCLAKKYKK-VAWIQDFI-I 241

Query:   233 PPNDEESLLKAVANQ-PVSVAIDASALQFYSGGVFNGY---CE-TFLNHGVTAVGYGTSE 287
                 E+S+ + +A + P++V I+ + LQ Y  GV       C+ T ++H V  VG+G ++
Sbjct:   242 LQACEQSMARHLATEGPITVTINMTLLQQYQKGVIKATPTTCDPTQVDHSVLLVGFGKTK 301

 Score = 122 (48.0 bits), Expect = 3.9e-35, Sum P(2) = 3.9e-35
 Identities = 30/84 (35%), Positives = 41/84 (48%)

Query:   268 GYCETFLNHGVT--AVGYGTS---EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
             G+ +T L  G    A  +G+       + YW++KNSWG  WGE+GYFRL R        C
Sbjct:   296 GFGKTKLVEGRQGKAASFGSHARPRRSMAYWILKNSWGPQWGEEGYFRLHRG----SNTC 351

Query:   323 GIAMFASFPVSKESAQPSSADKSS 346
             GI     FPV+    +P    + S
Sbjct:   352 GIT---KFPVTARVDKPKKQHQVS 372


>RGD|1308181 [details] [associations]
            symbol:Ctsf "cathepsin F" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1308181 eggNOG:COG4870 HOGENOM:HOG000230774
            KO:K01373 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 MEROPS:C01.018
            CTD:8722 OMA:LAPPEWD OrthoDB:EOG4CC41T GeneTree:ENSGT00660000095458
            EMBL:CH473953 EMBL:BC099780 EMBL:EU253481 IPI:IPI00201100
            RefSeq:NP_001029282.1 UniGene:Rn.25087 SMR:Q499S6
            Ensembl:ENSRNOT00000026718 GeneID:361704 KEGG:rno:361704
            UCSC:RGD:1308181 InParanoid:Q499S6 NextBio:677325
            Genevestigator:Q499S6 Uniprot:Q499S6
        Length = 462

 Score = 378 (138.1 bits), Expect = 6.5e-35, P = 6.5e-35
 Identities = 106/317 (33%), Positives = 149/317 (47%)

Query:    30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
             +A  F+ +   Y RTY+   E   R  +F  N++  ++      G   Y +   KF+DLT
Sbjct:   161 MATLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGI--TKFSDLT 218

Query:    90 PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ--VPPSVNWIEKGAVTPVKYQGQCA-- 145
              +EF        +   S      G   L KS     PP  +W +KGAVT VK QG C   
Sbjct:   219 EEEFHTIYLNPLLQKESG-----GKMSLAKSINDLAPPEWDWRKKGAVTEVKDQGMCGSC 273

Query:   146 -----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
                     VEG   +    L+SLSEQ+L+DC   D    C GG   +A+  I    G+  
Sbjct:   274 WAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKMDK--ACMGGLPSNAYTAIKNLGGLET 331

Query:   201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDASALQ 259
             +  Y Y+G     C+    +     I +  ++   DE  +   +A + P+SVAI+A  +Q
Sbjct:   332 EDDYGYQG-HVQACN-FSTQMAKVYINDSVELS-RDENKIAAWLAQKGPISVAINAFGMQ 388

Query:   260 FYSGGV---FNGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
             FY  G+   F   C   F++H V  VGYG +   I YW IKNSWG+DWGE+GY+ L R  
Sbjct:   389 FYRHGIAHPFRPLCSPWFIDHAVLLVGYG-NRSNIPYWAIKNSWGRDWGEEGYYYLYRG- 446

Query:   316 DQPQGQCGIAMFASFPV 332
                 G CG+   AS  V
Sbjct:   447 ---SGACGVNTMASSAV 460


>UNIPROTKB|Q9UBX1 [details] [associations]
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=TAS] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_6900 GO:GO:0019886 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0043202
            GO:GO:0004197 HOVERGEN:HBG011513 EMBL:AJ007331 EMBL:AF088886
            EMBL:AF132894 EMBL:AF136279 EMBL:AF071748 EMBL:AF071749
            EMBL:AK313657 EMBL:BC011682 EMBL:BC036451 EMBL:AL137742
            IPI:IPI00002816 RefSeq:NP_003784.2 UniGene:Hs.11590 PDB:1D5U
            PDB:1M6D PDBsum:1D5U PDBsum:1M6D ProteinModelPortal:Q9UBX1
            SMR:Q9UBX1 STRING:Q9UBX1 MEROPS:C01.018 PhosphoSite:Q9UBX1
            DMDM:12643325 PaxDb:Q9UBX1 PeptideAtlas:Q9UBX1 PRIDE:Q9UBX1
            DNASU:8722 Ensembl:ENST00000310325 GeneID:8722 KEGG:hsa:8722
            UCSC:uc001oip.3 CTD:8722 GeneCards:GC11M066332 HGNC:HGNC:2531
            HPA:CAB002141 MIM:603539 neXtProt:NX_Q9UBX1 PharmGKB:PA27031
            InParanoid:Q9UBX1 OMA:LAPPEWD OrthoDB:EOG4CC41T PhylomeDB:Q9UBX1
            BindingDB:Q9UBX1 ChEMBL:CHEMBL2517 ChiTaRS:CTSF
            EvolutionaryTrace:Q9UBX1 GenomeRNAi:8722 NextBio:32715
            ArrayExpress:Q9UBX1 Bgee:Q9UBX1 CleanEx:HS_CTSF
            Genevestigator:Q9UBX1 GermOnline:ENSG00000174080 Uniprot:Q9UBX1
        Length = 484

 Score = 376 (137.4 bits), Expect = 1.1e-34, P = 1.1e-34
 Identities = 104/316 (32%), Positives = 147/316 (46%)

Query:    30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
             +A  F+ +   Y RTY+   E   R  +F +N+V  ++      G   Y +   KF+DLT
Sbjct:   183 MASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGV--TKFSDLT 240

Query:    90 PQEF--IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-- 145
              +EF  I   T  +    +   +A     L      PP  +W  KGAVT VK QG C   
Sbjct:   241 EEEFRTIYLNTLLRKEPGNKMKQAKSVGDL-----APPEWDWRSKGAVTKVKDQGMCGSC 295

Query:   146 -----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
                     VEG   +    L+SLSEQ+L+DC   D    C GG   +A+  I    G+  
Sbjct:   296 WAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDK--ACMGGLPSNAYSAIKNLGGLET 353

Query:   201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF 260
             +  YSY+G     C+   AE     I +  ++  N+++         P+SVAI+A  +QF
Sbjct:   354 EDDYSYQGHMQS-CN-FSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQF 411

Query:   261 YSGGV---FNGYCETFL-NHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
             Y  G+       C  +L +H V  VGYG   + + +W IKNSWG DWGE GY+ L R   
Sbjct:   412 YRHGISRPLRPLCSPWLIDHAVLLVGYGNRSD-VPFWAIKNSWGTDWGEKGYYYLHRG-- 468

Query:   317 QPQGQCGIAMFASFPV 332
                G CG+   AS  V
Sbjct:   469 --SGACGVNTMASSAV 482


>DICTYBASE|DDB_G0291191 [details] [associations]
            symbol:DDB_G0291191 "cysteine protease" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0291191
            GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AAFI02000175 MEROPS:C01.022
            ProtClustDB:CLSZ2429603 RefSeq:XP_635374.1
            ProteinModelPortal:Q54F16 PRIDE:Q54F16 EnsemblProtists:DDB0252831
            GeneID:8628022 KEGG:ddi:DDB_G0291191 OMA:NETQIAS Uniprot:Q54F16
        Length = 352

 Score = 373 (136.4 bits), Expect = 2.2e-34, P = 2.2e-34
 Identities = 90/237 (37%), Positives = 124/237 (52%)

Query:   112 NGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLS 164
             + TP  +       S  + +   VT VK QGQC           VEG + +    LV LS
Sbjct:   114 SATPAAFDWRNTGGSTKFPQGTPVTAVKNQGQCGSCWSFSTTGNVEGQHYLSTGTLVGLS 173

Query:   165 EQQLVDC----ATNDNNN----GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDS 216
             EQ LVDC     T +N N    GC GG   +A+ YII+N GI  +A Y Y  +  G C  
Sbjct:   174 EQNLVDCDHTCMTYENENVCNAGCDGGLQPNAYNYIIKNGGIQTEATYPYTAVD-GECKF 232

Query:   217 IKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQFYSGGVFNGYCETFLNH 276
               A+   A+I+++  VP N+ +       N P+++A DA   QFY GGVF+  C   L+H
Sbjct:   233 NSAQV-GAKISSFTMVPQNETQIASYLFNNGPLAIAADAEEWQFYMGGVFDFPCGQTLDH 291

Query:   277 GVTAVGYGTSE----EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFAS 329
             G+  VGYG  +    +   YW+IKNSWG DWGE GY +++R+ D+    CG+A F S
Sbjct:   292 GILIVGYGAQDTIVGKNTPYWIIKNSWGADWGEAGYLKVERNTDK----CGVANFVS 344

 Score = 142 (55.0 bits), Expect = 4.9e-07, P = 4.9e-07
 Identities = 56/186 (30%), Positives = 84/186 (45%)

Query:     3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
             ++ L  VL+++   A +   R   E S   +F  ++ +Y + Y  + E   +FE FK NL
Sbjct:     2 RFILFFVLMLTALAAGR---RLSVEES---QFIAFQNKYNKIYS-AEEYLVKFETFKSNL 54

Query:    63 VAVERFNNAAIGNRSYT-LRLNKFADLTPQEF---IASQTGFKMSDHSSSLK------AN 112
             + ++  N  A    S T   +NKFADL+ +EF     S    +++D    L        +
Sbjct:    55 LNIDALNKQATTIGSDTKFGVNKFADLSKEEFKKYYLSSKEARLTDDLPMLPNLSDDIIS 114

Query:   113 GTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSE 165
              TP  +       S  + +   VT VK QGQC           VEG + +    LV LSE
Sbjct:   115 ATPAAFDWRNTGGSTKFPQGTPVTAVKNQGQCGSCWSFSTTGNVEGQHYLSTGTLVGLSE 174

Query:   166 QQLVDC 171
             Q LVDC
Sbjct:   175 QNLVDC 180


>MGI|MGI:1338045 [details] [associations]
            symbol:Ctsw "cathepsin W" species:10090 "Mus musculus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 MGI:MGI:1338045 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 MEROPS:C01.037
            CTD:1521 eggNOG:NOG288820 HOVERGEN:HBG100117 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 EMBL:AF014941 EMBL:AC122861 IPI:IPI00111727
            RefSeq:NP_034115.2 UniGene:Mm.113590 ProteinModelPortal:P56203
            SMR:P56203 PhosphoSite:P56203 PRIDE:P56203 DNASU:13041
            Ensembl:ENSMUST00000025844 GeneID:13041 KEGG:mmu:13041
            InParanoid:P56203 NextBio:282936 Bgee:P56203 CleanEx:MM_CTSW
            Genevestigator:P56203 GermOnline:ENSMUSG00000024910 Uniprot:P56203
        Length = 371

 Score = 269 (99.8 bits), Expect = 9.0e-34, Sum P(2) = 9.0e-34
 Identities = 87/305 (28%), Positives = 146/305 (47%)

Query:     4 YFLIVVLIISGSCASQATYRTFDEG----SIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
             YFL  VL+++G   S +   T D G     + E F+ ++ ++ R+Y   AE ++R  IF 
Sbjct:     9 YFL--VLLLAGQGLSDSLL-TKDAGPRPLELKEVFKLFQIRFNRSYWNPAEYTRRLSIFA 65

Query:    60 DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
              NL   +R     +G   +      F+DLT +EF     G + S   +            
Sbjct:    66 HNLAQAQRLQQEDLGTAEFGE--TPFSDLTEEEF-GQLYGQERSPERTPNMTKKVESNTW 122

Query:   120 SSQVPPSVNWIE-KGAVTPVKYQGQC----AVAAVEGINA---IKINRLVSLSEQQLVDC 171
                VP + +W + K  ++ VK QG C    A+AA + I A   IK  + V +S Q+L+DC
Sbjct:   123 GESVPRTCDWRKAKNIISSVKNQGSCKCCWAMAAADNIQALWRIKHQQFVDVSVQELLDC 182

Query:   172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
                   NGC GGF+ DA+  ++ N G+ ++  Y ++G         K     A I ++  
Sbjct:   183 ERC--GNGCNGGFVWDAYLTVLNNSGLASEKDYPFQGDRKPHRCLAKKYKKVAWIQDFTM 240

Query:   232 VPPNDEESLLKAVA-NQPVSVAIDASALQFYSGGVFNGY---CETF-LNHGVTAVGYGTS 286
             +  N+E+++   +A + P++V I+   LQ Y  GV       C+   ++H V  VG+G  
Sbjct:   241 LS-NNEQAIAHYLAVHGPITVTINMKLLQHYQKGVIKATPSSCDPRQVDHSVLLVGFGKE 299

Query:   287 EEGIK 291
             +EG++
Sbjct:   300 KEGMQ 304

 Score = 114 (45.2 bits), Expect = 9.0e-34, Sum P(2) = 9.0e-34
 Identities = 28/89 (31%), Positives = 42/89 (47%)

Query:   274 LNHGVTAVGYGTSEEGIK----------------YWLIKNSWGQDWGEDGYFRLQRDIDQ 317
             ++H V  VG+G  +EG++                YW++KNSWG  WGE GYFRL R    
Sbjct:   287 VDHSVLLVGFGKEKEGMQTGTVLSHSRKRRHSSPYWILKNSWGAHWGEKGYFRLYRG--- 343

Query:   318 PQGQCGIAMFASFPVSKESAQPSSADKSS 346
                 CG+     +P + +   P    ++S
Sbjct:   344 -NNTCGVT---KYPFTAQVDSPVKKARTS 368


>FB|FBgn0037396 [details] [associations]
            symbol:CG11459 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE014297 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 KO:K01365 HSSP:P07711 EMBL:AY060710
            RefSeq:NP_649608.1 UniGene:Dm.3894 SMR:Q9VNK6 MEROPS:C01.A31
            EnsemblMetazoa:FBtr0078623 GeneID:40741 KEGG:dme:Dmel_CG11459
            UCSC:CG11459-RA FlyBase:FBgn0037396 InParanoid:Q9VNK6 OMA:NYDEREL
            OrthoDB:EOG4MGQPX ChiTaRS:CG11459 GenomeRNAi:40741 NextBio:820359
            Uniprot:Q9VNK6
        Length = 336

 Score = 367 (134.2 bits), Expect = 9.5e-34, P = 9.5e-34
 Identities = 103/316 (32%), Positives = 160/316 (50%)

Query:    33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAI-GNRSYTLRLNKFADLTPQ 91
             +++Q+KA+Y + Y+   +  +   +++  ++AVE  N   + G  ++ + LNKF+D T Q
Sbjct:    29 EWDQYKAKYNKQYRNRDKYHRA--LYEQRVLAVESHNQLYLQGKVAFKMGLNKFSD-TDQ 85

Query:    92 EFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQG----QCAV 146
               + +      +   +S  A      YK   Q+   ++W + G ++PV  QG     C  
Sbjct:    86 RILFNYRSSIPAPLETSTNALTETVNYKRYDQITEGIDWRQYGYISPVGDQGTECLSCWA 145

Query:   147 AAVEGI----NAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
              +  G+     A K   LV LS + LVDC    NN GC GG++  AF Y  ++ GI    
Sbjct:   146 FSTSGVLEAHMAKKYGNLVPLSPKHLVDCVPYPNN-GCSGGWVSVAFNYT-RDHGIATKE 203

Query:   203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASALQF- 260
              Y YE +S G C   K++  A  ++ Y  +   DE  L + V N  PV+V+ID    +F 
Sbjct:   204 SYPYEPVS-GEC-LWKSDRSAGTLSGYVTLGNYDERELAEVVYNIGPVAVSIDHLHEEFD 261

Query:   261 -YSGGVFN-GYCETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
              YSGGV +   C +    L H V  VG+GT  +   YW+IKNS+G DWGE GY +L R+ 
Sbjct:   262 QYSGGVLSIPACRSKRQDLTHSVLLVGFGTHRKWGDYWIIKNSYGTDWGESGYLKLARNA 321

Query:   316 DQPQGQCGIAMFASFP 331
             +     CG+A    +P
Sbjct:   322 NN---MCGVASLPQYP 334


>UNIPROTKB|F1NT07 [details] [associations]
            symbol:LOC100857883 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 EMBL:AADN02044012
            EMBL:AADN02044013 EMBL:AADN02044014 IPI:IPI00577314
            Ensembl:ENSGALT00000000192 OMA:IYKHGPV Uniprot:F1NT07
        Length = 317

 Score = 367 (134.2 bits), Expect = 9.5e-34, P = 9.5e-34
 Identities = 108/315 (34%), Positives = 155/315 (49%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F  ++ + GR Y  + E   R  IF  ++  V   N AA+   SY+L LN  AD TPQE 
Sbjct:    12 FHHYRRRLGRPYGSAREMEHRQRIFAHHMRFVHSKNRAAL---SYSLALNHLADRTPQEM 68

Query:    94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQ---GQC----AV 146
              A +   +  D +  L        Y    +P S++W   GAVTPVK Q   G C      
Sbjct:    69 AALRGRRRSGDPNHGLPFPAEH--YTGIILPESLDWRMYGAVTPVKDQAVCGSCWSFATT 126

Query:   147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN-DAVYS 205
              A+EG   +K   L  LS+Q L+DC+    N  C GG    A  +I ++ GI + ++  S
Sbjct:   127 GAMEGALFLKTGVLTPLSQQVLIDCSWGKGNYACDGGEEWRAKGWIKKHGGIASTESPPS 186

Query:   206 YE-GMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDAS--ALQFY 261
             +   +  G+C   ++E   A+IT Y +V   +  ++  A+    PV+V+IDAS     FY
Sbjct:   187 FPLVLQNGLCHYNQSE-MLAKITGYVNVTSGNITAVKTAIYKHGPVAVSIDASHKTFSFY 245

Query:   262 SGGVF-NGYCETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
             S G++    C      L+H V AVGYG  + G  YWLIKNSW   WG DGY  +      
Sbjct:   246 SNGIYYEPKCANKPGQLDHAVLAVGYGVLQ-GETYWLIKNSWSTYWGNDGYILMA----M 300

Query:   318 PQGQCGIAMFASFPV 332
                 CG+A  A++P+
Sbjct:   301 KDNNCGVATEATYPI 315


>RGD|1564827 [details] [associations]
            symbol:RGD1564827 "similar to cathepsin M" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 IPI:IPI00192321
            Ensembl:ENSRNOT00000023990 ArrayExpress:D3ZY04 Uniprot:D3ZY04
        Length = 338

 Score = 367 (134.2 bits), Expect = 9.5e-34, P = 9.5e-34
 Identities = 107/350 (30%), Positives = 173/350 (49%)

Query:     5 FLIVVLI--ISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
             FLI++ +  +SG+ A       FD  S+  ++++WK +Y + Y       K  ++    +
Sbjct:     6 FLIILCVGVVSGASA-------FDL-SLDVQWQEWKMKYEKLYSP-VRIQKTVQMVP-RV 55

Query:    63 VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN--GTPFLYKS 120
              A        +   +     N+   L  Q  + S T  ++  +  + K N     F+  S
Sbjct:    56 KASNPLEQQVV-KVTMCRSFNRMGCLPDQAGLES-TEIQLPLNPKTFKVNWRDCKFMPPS 113

Query:   121 SQV----PPSVN-WIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQL 168
             + +    PP ++  +    V     QG+C        V A+EG    K  +L  LS Q L
Sbjct:   114 NFINNPSPPQMSVCVCVCYVHTASTQGRCNSCWAFPVVGAIEGQMFKKTGKLTPLSVQNL 173

Query:   169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
             VDC+    N GC GG   +AF+Y++QN G+ ++A Y YEG   G+C      + +A+IT 
Sbjct:   174 VDCSKPQGNKGCRGGTTYNAFQYVLQNGGLESEATYPYEGKE-GLCRY--NPNSSAKITX 230

Query:   229 YEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNG-YCETFLNHGVTAVGYG- 284
                 P  +E+ L+ AVA +PV+  I    S+L+FY  G+++   C  ++NH V  VGYG 
Sbjct:   231 ICAPPQKNEDVLMDAVATKPVAAGIHVVHSSLRFYKKGIYHEPKCNNYVNHAVLVVGYGF 290

Query:   285 --TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
                  +G  YWLI+NSWG+ WG +GY ++ +D +     CGIA FA +P+
Sbjct:   291 EGNETDGNNYWLIQNSWGERWGLNGYMKIAKDRNN---HCGIATFAQYPI 337


>UNIPROTKB|P56202 [details] [associations]
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 GO:GO:0006955 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AF013611
            EMBL:AF015954 EMBL:AF055903 EMBL:AP001201 EMBL:BC048255
            IPI:IPI00328978 RefSeq:NP_001326.2 UniGene:Hs.416848
            ProteinModelPortal:P56202 SMR:P56202 STRING:P56202 MEROPS:C01.037
            PhosphoSite:P56202 DMDM:259016196 PaxDb:P56202 PRIDE:P56202
            Ensembl:ENST00000307886 GeneID:1521 KEGG:hsa:1521 UCSC:uc001ogc.1
            CTD:1521 GeneCards:GC11P065647 HGNC:HGNC:2546 HPA:CAB016345
            MIM:602364 neXtProt:NX_P56202 PharmGKB:PA27042 eggNOG:NOG288820
            HOVERGEN:HBG100117 InParanoid:P56202 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 PhylomeDB:P56202 GenomeRNAi:1521 NextBio:6295
            ArrayExpress:P56202 Bgee:P56202 CleanEx:HS_CTSW
            Genevestigator:P56202 GermOnline:ENSG00000172543 Uniprot:P56202
        Length = 376

 Score = 257 (95.5 bits), Expect = 1.0e-32, Sum P(2) = 1.0e-32
 Identities = 85/307 (27%), Positives = 152/307 (49%)

Query:     7 IVVLIISGSCAS-QATYRTFDEG----SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
             ++ L+++G     +   R  D G     + E F+ ++ Q+ R+Y    E++ R +IF  N
Sbjct:    10 LLALLVAGLAQGIRGPLRAQDLGPQPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHN 69

Query:    62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS 121
             L   +R     +G   + +    F+DLT +EF     G++ +  +  + + G     +  
Sbjct:    70 LAQAQRLQEEDLGTAEFGV--TPFSDLTEEEF-GQLYGYRRA--AGGVPSMGREIRSEEP 124

Query:   122 Q--VPPSVNWIE-KGAVTPVKYQGQC----AVAAVEGINAI-KIN--RLVSLSEQQLVDC 171
             +  VP S +W +   A++P+K Q  C    A+AA   I  + +I+    V +S Q+L+DC
Sbjct:   125 EESVPFSCDWRKVASAISPIKDQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVQELLDC 184

Query:   172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG-MSTGICDSIKAEDHAAQITNYE 230
                   +GC+GGF+ DAF  ++ N G+ ++  Y ++G +    C   K +   A I ++ 
Sbjct:   185 GRC--GDGCHGGFVWDAFITVLNNSGLASEKDYPFQGKVRAHRCHPKKYQK-VAWIQDFI 241

Query:   231 DVPPNDEESLLKAVANQ-PVSVAIDASALQFYSGGVFNGY---CETFL-NHGVTAVGYGT 285
              +  N+E  + + +A   P++V I+   LQ Y  GV       C+  L +H V  VG+G+
Sbjct:   242 MLQ-NNEHRIAQYLATYGPITVTINMKPLQLYRKGVIKATPTTCDPQLVDHSVLLVGFGS 300

Query:   286 --SEEGI 290
               SEEGI
Sbjct:   301 VKSEEGI 307

 Score = 116 (45.9 bits), Expect = 1.0e-32, Sum P(2) = 1.0e-32
 Identities = 23/49 (46%), Positives = 27/49 (55%)

Query:   292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFA-SFPVSKESAQP 339
             YW++KNSWG  WGE GYFRL R        CGI  F  +  V K   +P
Sbjct:   326 YWILKNSWGAQWGEKGYFRLHRG----SNTCGITKFPLTARVQKPDMKP 370


>TAIR|locus:2050145 [details] [associations]
            symbol:AT2G21430 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002685 GenomeReviews:CT485783_GR
            EMBL:AC006841 EMBL:X74359 IPI:IPI00519637 PIR:B84601
            RefSeq:NP_565512.1 UniGene:At.14069 ProteinModelPortal:P43295
            SMR:P43295 MEROPS:C01.A04 PRIDE:P43295 EnsemblPlants:AT2G21430.1
            GeneID:816682 KEGG:ath:AT2G21430 TAIR:At2g21430 eggNOG:COG4870
            HOGENOM:HOG000230774 InParanoid:P43295 KO:K01373 OMA:GSIEEHY
            PhylomeDB:P43295 ProtClustDB:CLSN2688311 Genevestigator:P43295
            GermOnline:AT2G21430 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 Uniprot:P43295
        Length = 361

 Score = 353 (129.3 bits), Expect = 2.9e-32, P = 2.9e-32
 Identities = 94/304 (30%), Positives = 150/304 (49%)

Query:    29 SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
             S  + F  +K ++G+ Y    E+  RF +FK NL+   R        R    + +     
Sbjct:    43 SSEDHFTLFKKKFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRS 102

Query:    89 T-PQEFIASQTGFKM---SDHSSSLKANGTP--FLYKS-SQVPPSVNWIEKGAVTPVKYQ 141
                ++ +  + GFK+   ++ +  L     P  F ++    V P  N    G+       
Sbjct:   103 EFRRKHLGVKGGFKLPKDANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTT 162

Query:   142 GQCAVAAVEGINAIKINRLVSLSEQQLVDC-------ATNDNNNGCYGGFMDDAFKYIIQ 194
             G     A+EG + +   +LVSLSEQQLVDC            ++GC GG M+ AF+Y ++
Sbjct:   163 G-----ALEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLK 217

Query:   195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
               G+  +  Y Y G   G C  +      A ++N+  V  N+++     + N P++VAI+
Sbjct:   218 TGGLMREKDYPYTGTDGGSC-KLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAIN 276

Query:   255 ASALQFYSGGVFNGY-CETFLNHGVTAVGYGT---SEEGIK---YWLIKNSWGQDWGEDG 307
             A+ +Q Y GGV   Y C   LNHGV  VGYG+   S+  +K   YW+IKNSWG+ WGE+G
Sbjct:   277 AAYMQTYIGGVSCPYICSRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENG 336

Query:   308 YFRL 311
             ++++
Sbjct:   337 FYKI 340


>TAIR|locus:2120222 [details] [associations]
            symbol:RD19 "RESPONSIVE TO DEHYDRATION 19" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009269 "response to desiccation" evidence=IEP] [GO:0006970
            "response to osmotic stress" evidence=IGI] [GO:0005634 "nucleus"
            evidence=IDA] [GO:0005773 "vacuole" evidence=IDA] [GO:0042742
            "defense response to bacterium" evidence=IMP] [GO:0006096
            "glycolysis" evidence=RCA] [GO:0006833 "water transport"
            evidence=RCA] [GO:0006972 "hyperosmotic response" evidence=RCA]
            [GO:0007030 "Golgi organization" evidence=RCA] [GO:0009266
            "response to temperature stimulus" evidence=RCA] [GO:0009651
            "response to salt stress" evidence=IEP;RCA] [GO:0046686 "response
            to cadmium ion" evidence=RCA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISS] [GO:0009414 "response to
            water deprivation" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005634 GO:GO:0005773 EMBL:CP002687
            GenomeReviews:CT486007_GR GO:GO:0009651 GO:GO:0042742
            eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            ProtClustDB:CLSN2688311 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AL035679 EMBL:AL161594 GO:GO:0004197
            MEROPS:C01.022 EMBL:D13042 EMBL:AY080598 EMBL:AY133844
            IPI:IPI00544363 PIR:JN0718 RefSeq:NP_568052.1 UniGene:At.2850
            UniGene:At.74924 ProteinModelPortal:P43296 SMR:P43296 STRING:P43296
            PaxDb:P43296 PRIDE:P43296 EnsemblPlants:AT4G39090.1 GeneID:830064
            KEGG:ath:AT4G39090 TAIR:At4g39090 InParanoid:P43296 OMA:EDFDWRD
            PhylomeDB:P43296 Genevestigator:P43296 GermOnline:AT4G39090
            Uniprot:P43296
        Length = 368

 Score = 348 (127.6 bits), Expect = 9.8e-32, P = 9.8e-32
 Identities = 96/305 (31%), Positives = 153/305 (50%)

Query:    32 EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN----NAAIGNRSYT-LRLNKFA 86
             + F  +K ++G+ Y  + E+  RF +FK NL    R      +A  G   ++ L  ++F 
Sbjct:    49 DHFSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQFSDLTRSEFR 108

Query:    87 DLTPQEFIASQTGFKMSDHSSS---LKANGTP--FLYKS-SQVPPSVNWIEKGAVTPVKY 140
                 ++ +  ++GFK+   ++    L     P  F ++    V P  N    G+      
Sbjct:   109 ----KKHLGVRSGFKLPKDANKAPILPTENLPEDFDWRDHGAVTPVKNQGSCGSCWSFS- 163

Query:   141 QGQCAVAAVEGINAIKINRLVSLSEQQLVDC-------ATNDNNNGCYGGFMDDAFKYII 193
                 A  A+EG N +   +LVSLSEQQLVDC         +  ++GC GG M+ AF+Y +
Sbjct:   164 ----ATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTL 219

Query:   194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
             +  G+  +  Y Y G     C   K++   A ++N+  +  ++E+     V N P++VAI
Sbjct:   220 KTGGLMKEEDYPYTGKDGKTCKLDKSKI-VASVSNFSVISIDEEQIAANLVKNGPLAVAI 278

Query:   254 DASALQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEG---IK---YWLIKNSWGQDWGED 306
             +A  +Q Y GGV   Y C   LNHGV  VGYG +       K   YW+IKNSWG+ WGE+
Sbjct:   279 NAGYMQTYIGGVSCPYICTRRLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGETWGEN 338

Query:   307 GYFRL 311
             G++++
Sbjct:   339 GFYKI 343


>DICTYBASE|DDB_G0272742 [details] [associations]
            symbol:DDB_G0272742 species:44689 "Dictyostelium
            discoideum" [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0272742 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639 EMBL:AAFI02000008
            eggNOG:NOG331187 RefSeq:XP_644986.1 ProteinModelPortal:Q7KWP5
            PRIDE:Q7KWP5 EnsemblProtists:DDB0168242 GeneID:8618663
            KEGG:ddi:DDB_G0272742 InParanoid:Q7KWP5 OMA:ATESAHF Uniprot:Q7KWP5
        Length = 345

 Score = 336 (123.3 bits), Expect = 1.8e-30, P = 1.8e-30
 Identities = 103/305 (33%), Positives = 160/305 (52%)

Query:     1 MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
             M KY L+++++      S+ T     E     +F  W     RTY  S+E + R+  FK 
Sbjct:     1 MKKYSLLILILFINCSFSKLT-----EIQYRNEFTAWMTSNQRTYA-SSEFTNRYNTFKS 54

Query:    61 NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS----SSLKANGTPF 116
             NL  + ++N+   G+++  L LN+FAD++ +E+   +  +  +D++    SSL  N    
Sbjct:    55 NLDFINQWNSK--GSKT-VLALNEFADISNEEY---RKNYLRNDNNINKLSSLLINDKED 108

Query:   117 L-YKSSQVPPS----VNWIEKGAVTPVKYQ-GQC------AVAAVEGINAIKINR--LVS 162
                KSS    S    ++W +KGAV  VK Q G C      AV A E  + +   +   +S
Sbjct:   109 KEIKSSSSSGSGSSGIDWRKKGAVPSVKSQIGGCGSWPITAVGATESAHFLANPKDPFIS 168

Query:   163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH 222
             LS Q L+DC+  + N  CY G +++AF+YII+N GI ++  Y + G   G C    + + 
Sbjct:   169 LSMQNLIDCS--NLNKQCYQGTVNEAFQYIIENGGIDSEESYKFSGGEPGKC-KYNSSNS 225

Query:   223 AAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVF-NGYCE-TFLNHGV 278
              A+IT+YE V    E SL  AV+ +PV+  IDAS  + QFYS G++    C  T LNH +
Sbjct:   226 VAKITSYEKVKSGSESSLESAVSLKPVAAYIDASLSSFQFYSSGIYYEPSCNSTDLNHSI 285

Query:   279 TAVGY 283
               VG+
Sbjct:   286 LIVGF 290

 Score = 319 (117.4 bits), Expect = 1.2e-28, P = 1.2e-28
 Identities = 75/202 (37%), Positives = 116/202 (57%)

Query:   145 AVAAVEGINAIKINR--LVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
             AV A E  + +   +   +SLS Q L+DC+  + N  CY G +++AF+YII+N GI ++ 
Sbjct:   149 AVGATESAHFLANPKDPFISLSMQNLIDCS--NLNKQCYQGTVNEAFQYIIENGGIDSEE 206

Query:   203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQF 260
              Y + G   G C    + +  A+IT+YE V    E SL  AV+ +PV+  IDAS  + QF
Sbjct:   207 SYKFSGGEPGKC-KYNSSNSVAKITSYEKVKSGSESSLESAVSLKPVAAYIDASLSSFQF 265

Query:   261 YSGGVF-NGYCE-TFLNHGVTAVGYG----TSEEGIK----YWLIKNSWGQDWGEDGYFR 310
             YS G++    C  T LNH +  VG+     T  + +K    YW+++NS+G++WGE+GY  
Sbjct:   266 YSSGIYYEPSCNSTDLNHSILIVGFSDFSTTPTDSLKHSSNYWIVQNSFGKNWGENGYIF 325

Query:   311 LQRDIDQPQGQCGIAMFASFPV 332
             + +D D     CGI+  AS+ +
Sbjct:   326 MSKDRDD---NCGISKMASYVI 344


>DICTYBASE|DDB_G0290957 [details] [associations]
            symbol:cprA "cysteine proteinase 1" species:44689
            "Dictyostelium discoideum" [GO:0006972 "hyperosmotic response"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0290957
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GenomeReviews:CM000154_GR GO:GO:0005764
            GO:GO:0006972 EMBL:AAFI02000174 KO:K01376 EMBL:X02407 PIR:A22827
            RefSeq:XP_635417.1 ProteinModelPortal:P04988 MEROPS:C01.022
            GlycoSuiteDB:P04988 SWISS-2DPAGE:P04988 EnsemblProtists:DDB0201647
            GeneID:8627918 KEGG:ddi:DDB_G0290957 OMA:KISNFTM
            ProtClustDB:CLSZ2429603 Uniprot:P04988
        Length = 343

 Score = 333 (122.3 bits), Expect = 3.8e-30, P = 3.8e-30
 Identities = 105/349 (30%), Positives = 163/349 (46%)

Query:     3 KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRT-YKESAE--NSKRFEIFK 59
             K  L+ VL +     S       ++    E  +++  +Y    Y E  E   S   +I +
Sbjct:     2 KVILLFVLAVFTVFVSSRGIPLEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEE 61

Query:    60 DNLVAVERFNNAAIG-NRSYTLRLNKFAD--LTPQEFIASQTGFKMSDHSSSLKANGTP- 115
              NL+A+    +   G N+   L  ++F +  L  +E I +     ++D+      N  P 
Sbjct:    62 LNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDD-LPVADYLDDEFINSIPT 120

Query:   116 -FLYKS-SQVPPSVNWIEKGAVTPVKYQGQCAVAAVEGINAIKINRLVSLSEQQLVDC-- 171
              F +++   V P  N  + G+       G      VEG + I  N+LVSLSEQ LVDC  
Sbjct:   121 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGN-----VEGQHFISQNKLVSLSEQNLVDCDH 175

Query:   172 ------ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQ 225
                        + GC GG   +A+ YII+N GI  ++ Y Y    TG   +  + +  A+
Sbjct:   176 ECMEYEGEQACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTA-ETGTQCNFNSANIGAK 234

Query:   226 ITNYEDVPPNDEESLLKAVANQPVSVAIDASALQFYSGGVFNGYCE-TFLNHGVTAVGYG 284
             I+N+  +P N+       V+  P+++A DA   QFY GGVF+  C    L+HG+  VGY 
Sbjct:   235 ISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYS 294

Query:   285 TSE----EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFAS 329
                    + + YW++KNSWG DWGE GY  L+R     +  CG++ F S
Sbjct:   295 AKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG----KNTCGVSNFVS 339


>ZFIN|ZDB-GENE-080724-8 [details] [associations]
            symbol:ctso "cathepsin O" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            ZFIN:ZDB-GENE-080724-8 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 CTD:1519 KO:K01374 EMBL:CR931784
            IPI:IPI00513613 RefSeq:XP_695717.3 UniGene:Dr.88386
            Ensembl:ENSDART00000074786 GeneID:567333 KEGG:dre:567333
            NextBio:20888622 Uniprot:E7FA09
        Length = 334

 Score = 332 (121.9 bits), Expect = 4.9e-30, P = 4.9e-30
 Identities = 87/303 (28%), Positives = 154/303 (50%)

Query:    38 KAQYGRTYKESAENS--KRFEIFKDNLVAVERFNNAAIG--NRSYTLRLNKFADLTPQEF 93
             + Q+  T+++   N   +R+  ++ +L   + F N+A+G  N+S    +N+F+ L+ ++F
Sbjct:    35 RLQHSDTFQQDVNNELYQRWINYQSSLQR-QAFLNSALGKSNQSAQYGVNQFSYLSQKQF 93

Query:    94 IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------V 146
                Q     ++ +     + +    K++  PP  +W + G V PV  QG C        V
Sbjct:    94 -KEQYLTARAEAAPKFDQSKSEIKVKANN-PPRFDWRDHGVVGPVHNQGSCGGCWAFSIV 151

Query:   147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK-GITNDAVYS 205
              A+E ++A    +L  LS QQ++DC+    N GC GG   +A  ++ Q+K  + ++A Y 
Sbjct:   152 EAIESVSAKGGEKLQQLSVQQVIDCSYQ--NQGCNGGSPVEALYWLTQSKLKLVSEAEYP 209

Query:   206 YEGMSTGICDSIKAEDHAAQITNYEDVP-PNDEESLLKAVAN-QPVSVAIDASALQFYSG 263
             ++G + G+C           + NY        EE ++ A+ +  P+ V +DA + Q Y G
Sbjct:   210 FKG-ADGVCQFFPQAHAGVAVRNYSAYDFSGQEEVMMSALVDFGPLVVIVDAISWQDYLG 268

Query:   264 GVFNGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
             G+   +C +   NH V   GY T+ E + YW+++NSWG  WG+DGY  ++   D     C
Sbjct:   269 GIIQHHCSSHKANHAVLITGYDTTGE-VPYWIVRNSWGTSWGDDGYAYIKIGNDV----C 323

Query:   323 GIA 325
             G+A
Sbjct:   324 GVA 326


>UNIPROTKB|E2RPX3 [details] [associations]
            symbol:CTSW "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 CTD:1521 KO:K08569 OMA:GRCGDGC
            EMBL:AAEX03011632 RefSeq:XP_540846.2 Ensembl:ENSCAFT00000020910
            GeneID:483725 KEGG:cfa:483725 Uniprot:E2RPX3
        Length = 374

 Score = 248 (92.4 bits), Expect = 6.0e-30, Sum P(2) = 6.0e-30
 Identities = 74/272 (27%), Positives = 128/272 (47%)

Query:    30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
             + + F  ++ QY R+Y    E ++R +IF  NL   ++  +  +G   + +    F+DLT
Sbjct:    38 LKQVFALFQIQYNRSYSNPEEYARRLDIFAHNLAQAQQLEDEDLGTAEFGV--TPFSDLT 95

Query:    90 PQEFIASQTGFKMSDHSSSL--KANGTPFLYKSSQVPPSVNWIE-KGAVTPVKYQGQC-- 144
              +EF       +M+  + S+  K     +      VPP+ +W +  G ++P+K QG C  
Sbjct:    96 EEEFGQFYGHQRMAGEAPSVGRKVESEEW---GEPVPPTCDWRKLPGIISPIKQQGNCRC 152

Query:   145 --AVAAVEGINA---IKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
               A+AA   I A   I+ ++ V +S Q+L+DC      +GC GGF  DAF  ++ N G+ 
Sbjct:   153 CWAMAAAGNIEALWGIRYHQPVEVSVQELLDCGRC--GDGCKGGFTWDAFITVLNNSGLA 210

Query:   200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQ 259
             +   Y + G +       K     A I ++  +  N++          P++V I+   LQ
Sbjct:   211 SAKDYPFLGNTKPHRCLAKKYKKVAWIQDFIMLQGNEQAIAWYLATKGPITVTINMKLLQ 270

Query:   260 FYSGGVFNGY---CETF-LNHGVTAVGYGTSE 287
              Y  GV       C+   ++H V  VG+G S+
Sbjct:   271 HYQKGVIQATHTTCDPQRVDHSVLLVGFGKSK 302

 Score = 119 (46.9 bits), Expect = 6.0e-30, Sum P(2) = 6.0e-30
 Identities = 20/38 (52%), Positives = 25/38 (65%)

Query:   290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMF 327
             I YW++KNSWG +WGE+GYFRL R        CGI  +
Sbjct:   322 IPYWILKNSWGAEWGEEGYFRLHRG----NNTCGITKY 355


>GENEDB_PFALCIPARUM|PF11_0165 [details] [associations]
            symbol:PF11_0165 "falcipain 2 precursor"
            species:5833 "Plasmodium falciparum" [GO:0020020 "food vacuole"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014186 HSSP:O65039 GO:GO:0020020
            RefSeq:XP_001347836.1 ProteinModelPortal:Q8I6U4 SMR:Q8I6U4
            IntAct:Q8I6U4 MINT:MINT-1559493 MEROPS:C01.046
            EnsemblProtists:PF11_0165:mRNA GeneID:810712 KEGG:pfa:PF11_0165
            EuPathDB:PlasmoDB:PF3D7_1115700 HOGENOM:HOG000065857 OMA:NESLHAN
            ProtClustDB:PTZ00021 BindingDB:Q8I6U4 ChEMBL:CHEMBL3470
            Uniprot:Q8I6U4
        Length = 484

 Score = 331 (121.6 bits), Expect = 6.5e-30, P = 6.5e-30
 Identities = 95/317 (29%), Positives = 154/317 (48%)

Query:    32 EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
             E  E+++      +K +  N+ +  ++K  L            N+  +LR +K   L   
Sbjct:   181 EMKERFQVFLQNAHKVNMHNNNKNSLYKKELNRFADLTYHEFKNKYLSLRSSK--PLKNS 238

Query:    92 EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQ---GQC---- 144
             +++  Q  ++  +     K N   F + +       +W     VTPVK Q   G C    
Sbjct:   239 KYLLDQMNYE--EVIKKYKGNEN-FDHAA------YDWRLHSGVTPVKDQKNCGSCWAFS 289

Query:   145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
             ++ +VE   AI+ N+L++LSEQ+LVDC+    N GC GG +++AF+ +I+  GI  D  Y
Sbjct:   290 SIGSVESQYAIRKNKLITLSEQELVDCSFK--NYGCNGGLINNAFEDMIELGGICTDDDY 347

Query:   205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQFYSGG 264
              Y   +  +C+  +  +    I NY  VP N  +  L+ +    +SVA+ +    FY  G
Sbjct:   348 PYVSDAPNLCNIDRCTEKYG-IKNYLSVPDNKLKEALRFLGPISISVAV-SDDFAFYKEG 405

Query:   265 VFNGYCETFLNHGVTAVGYGTSE-------EGIK--YWLIKNSWGQDWGEDGYFRLQRDI 315
             +F+G C   LNH V  VG+G  E       +G K  Y++IKNSWGQ WGE G+  ++ D 
Sbjct:   406 IFDGECGDQLNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDE 465

Query:   316 DQPQGQCGIAMFASFPV 332
                  +CG+   A  P+
Sbjct:   466 SGLMRKCGLGTDAFIPL 482

 Score = 299 (110.3 bits), Expect = 3.6e-26, P = 3.6e-26
 Identities = 87/261 (33%), Positives = 129/261 (49%)

Query:    43 RTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKM 102
             + Y    E  +RF++F  N   V   NN    N  Y   LN+FADLT  EF       + 
Sbjct:   174 KQYNSPNEMKERFQVFLQNAHKVNMHNNNK--NSLYKKELNRFADLTYHEFKNKYLSLRS 231

Query:   103 SD---HSSSL--KANGTPFL--YKSSQV--PPSVNWIEKGAVTPVKYQ---GQC----AV 146
             S    +S  L  + N    +  YK ++     + +W     VTPVK Q   G C    ++
Sbjct:   232 SKPLKNSKYLLDQMNYEEVIKKYKGNENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSI 291

Query:   147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
              +VE   AI+ N+L++LSEQ+LVDC+    N GC GG +++AF+ +I+  GI  D  Y Y
Sbjct:   292 GSVESQYAIRKNKLITLSEQELVDCSFK--NYGCNGGLINNAFEDMIELGGICTDDDYPY 349

Query:   207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQFYSGGVF 266
                +  +C+  +  +    I NY  VP N  +  L+ +    +SVA+ +    FY  G+F
Sbjct:   350 VSDAPNLCNIDRCTEKYG-IKNYLSVPDNKLKEALRFLGPISISVAV-SDDFAFYKEGIF 407

Query:   267 NGYCETFLNHGVTAVGYGTSE 287
             +G C   LNH V  VG+G  E
Sbjct:   408 DGECGDQLNHAVMLVGFGMKE 428


>UNIPROTKB|Q8I6U4 [details] [associations]
            symbol:PF11_0165 "Falcipain-2A" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            EMBL:AE014186 HSSP:O65039 GO:GO:0020020 RefSeq:XP_001347836.1
            ProteinModelPortal:Q8I6U4 SMR:Q8I6U4 IntAct:Q8I6U4
            MINT:MINT-1559493 MEROPS:C01.046 EnsemblProtists:PF11_0165:mRNA
            GeneID:810712 KEGG:pfa:PF11_0165 EuPathDB:PlasmoDB:PF3D7_1115700
            HOGENOM:HOG000065857 OMA:NESLHAN ProtClustDB:PTZ00021
            BindingDB:Q8I6U4 ChEMBL:CHEMBL3470 Uniprot:Q8I6U4
        Length = 484

 Score = 331 (121.6 bits), Expect = 6.5e-30, P = 6.5e-30
 Identities = 95/317 (29%), Positives = 154/317 (48%)

Query:    32 EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
             E  E+++      +K +  N+ +  ++K  L            N+  +LR +K   L   
Sbjct:   181 EMKERFQVFLQNAHKVNMHNNNKNSLYKKELNRFADLTYHEFKNKYLSLRSSK--PLKNS 238

Query:    92 EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQ---GQC---- 144
             +++  Q  ++  +     K N   F + +       +W     VTPVK Q   G C    
Sbjct:   239 KYLLDQMNYE--EVIKKYKGNEN-FDHAA------YDWRLHSGVTPVKDQKNCGSCWAFS 289

Query:   145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
             ++ +VE   AI+ N+L++LSEQ+LVDC+    N GC GG +++AF+ +I+  GI  D  Y
Sbjct:   290 SIGSVESQYAIRKNKLITLSEQELVDCSFK--NYGCNGGLINNAFEDMIELGGICTDDDY 347

Query:   205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQFYSGG 264
              Y   +  +C+  +  +    I NY  VP N  +  L+ +    +SVA+ +    FY  G
Sbjct:   348 PYVSDAPNLCNIDRCTEKYG-IKNYLSVPDNKLKEALRFLGPISISVAV-SDDFAFYKEG 405

Query:   265 VFNGYCETFLNHGVTAVGYGTSE-------EGIK--YWLIKNSWGQDWGEDGYFRLQRDI 315
             +F+G C   LNH V  VG+G  E       +G K  Y++IKNSWGQ WGE G+  ++ D 
Sbjct:   406 IFDGECGDQLNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDE 465

Query:   316 DQPQGQCGIAMFASFPV 332
                  +CG+   A  P+
Sbjct:   466 SGLMRKCGLGTDAFIPL 482

 Score = 299 (110.3 bits), Expect = 3.6e-26, P = 3.6e-26
 Identities = 87/261 (33%), Positives = 129/261 (49%)

Query:    43 RTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKM 102
             + Y    E  +RF++F  N   V   NN    N  Y   LN+FADLT  EF       + 
Sbjct:   174 KQYNSPNEMKERFQVFLQNAHKVNMHNNNK--NSLYKKELNRFADLTYHEFKNKYLSLRS 231

Query:   103 SD---HSSSL--KANGTPFL--YKSSQV--PPSVNWIEKGAVTPVKYQ---GQC----AV 146
             S    +S  L  + N    +  YK ++     + +W     VTPVK Q   G C    ++
Sbjct:   232 SKPLKNSKYLLDQMNYEEVIKKYKGNENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSI 291

Query:   147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
              +VE   AI+ N+L++LSEQ+LVDC+    N GC GG +++AF+ +I+  GI  D  Y Y
Sbjct:   292 GSVESQYAIRKNKLITLSEQELVDCSFK--NYGCNGGLINNAFEDMIELGGICTDDDYPY 349

Query:   207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQFYSGGVF 266
                +  +C+  +  +    I NY  VP N  +  L+ +    +SVA+ +    FY  G+F
Sbjct:   350 VSDAPNLCNIDRCTEKYG-IKNYLSVPDNKLKEALRFLGPISISVAV-SDDFAFYKEGIF 407

Query:   267 NGYCETFLNHGVTAVGYGTSE 287
             +G C   LNH V  VG+G  E
Sbjct:   408 DGECGDQLNHAVMLVGFGMKE 428


>GENEDB_PFALCIPARUM|PF11_0161 [details] [associations]
            symbol:PF11_0161 "falcipain-2 precursor,
            putative" species:5833 "Plasmodium falciparum" [GO:0020020 "food
            vacuole" evidence=TAS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014186 HSSP:O65039 OMA:NNVEHIN GO:GO:0020020
            MEROPS:C01.046 HOGENOM:HOG000065857 ProtClustDB:PTZ00021
            RefSeq:XP_001347832.1 ProteinModelPortal:Q8I6U5 SMR:Q8I6U5
            IntAct:Q8I6U5 MINT:MINT-1546851 EnsemblProtists:PF11_0161:mRNA
            GeneID:810708 KEGG:pfa:PF11_0161 EuPathDB:PlasmoDB:PF3D7_1115300
            Uniprot:Q8I6U5
        Length = 482

 Score = 329 (120.9 bits), Expect = 1.1e-29, P = 1.1e-29
 Identities = 81/222 (36%), Positives = 120/222 (54%)

Query:   128 NWIEKGAVTPVKYQ---GQC----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
             +W     VTPVK Q   G C    ++ +VE   AI+ N+L++LSEQ+LVDC+    N GC
Sbjct:   264 DWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFK--NYGC 321

Query:   181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL 240
              GG +++AF+ +I+  GI  D  Y Y   +  +C+  +  +    I NY  VP N  +  
Sbjct:   322 NGGLINNAFEDMIELGGICTDDDYPYVSDAPNLCNIDRCTEKYG-IKNYLSVPDNKLKEA 380

Query:   241 LKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSE-------EGIK- 291
             L+ +   P+S++I  S    FY  G+F+G C   LNH V  VG+G  E       +G K 
Sbjct:   381 LRFLG--PISISIAVSDDFPFYKEGIFDGECGDELNHAVMLVGFGMKEIVNPLTKKGEKH 438

Query:   292 -YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
              Y++IKNSWGQ WGE G+  ++ D      +CG+   A  P+
Sbjct:   439 YYYIIKNSWGQQWGERGFINIETDESGLMRKCGLGTDAFIPL 480

 Score = 297 (109.6 bits), Expect = 5.9e-26, P = 5.9e-26
 Identities = 87/263 (33%), Positives = 131/263 (49%)

Query:    43 RTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRS-YTLRLNKFADLTPQEF------IA 95
             + Y    E  +RF++F  N   V+  NN     +S Y   LN+FADLT  EF      + 
Sbjct:   172 KQYNSPNEMKERFQVFLQNAHKVKMHNN---NKKSLYKKELNRFADLTYHEFKSKYLTLR 228

Query:    96 SQTGFKMSDHS-SSLKANGTPFLYKSSQV--PPSVNWIEKGAVTPVKYQ---GQC----A 145
             S    K S +    +  +     YK ++     + +W     VTPVK Q   G C    +
Sbjct:   229 SSKPLKNSKYLLDQINYDAVIKKYKGNENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSS 288

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
             + +VE   AI+ N+L++LSEQ+LVDC+    N GC GG +++AF+ +I+  GI  D  Y 
Sbjct:   289 IGSVESQYAIRKNKLITLSEQELVDCSFK--NYGCNGGLINNAFEDMIELGGICTDDDYP 346

Query:   206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-LQFYSGG 264
             Y   +  +C+  +  +    I NY  VP N  +  L+ +   P+S++I  S    FY  G
Sbjct:   347 YVSDAPNLCNIDRCTEKYG-IKNYLSVPDNKLKEALRFLG--PISISIAVSDDFPFYKEG 403

Query:   265 VFNGYCETFLNHGVTAVGYGTSE 287
             +F+G C   LNH V  VG+G  E
Sbjct:   404 IFDGECGDELNHAVMLVGFGMKE 426


>UNIPROTKB|Q8I6U5 [details] [associations]
            symbol:PF11_0161 "Falcipain-2B" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            EMBL:AE014186 HSSP:O65039 OMA:NNVEHIN GO:GO:0020020 MEROPS:C01.046
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347832.1
            ProteinModelPortal:Q8I6U5 SMR:Q8I6U5 IntAct:Q8I6U5
            MINT:MINT-1546851 EnsemblProtists:PF11_0161:mRNA GeneID:810708
            KEGG:pfa:PF11_0161 EuPathDB:PlasmoDB:PF3D7_1115300 Uniprot:Q8I6U5
        Length = 482

 Score = 329 (120.9 bits), Expect = 1.1e-29, P = 1.1e-29
 Identities = 81/222 (36%), Positives = 120/222 (54%)

Query:   128 NWIEKGAVTPVKYQ---GQC----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
             +W     VTPVK Q   G C    ++ +VE   AI+ N+L++LSEQ+LVDC+    N GC
Sbjct:   264 DWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFK--NYGC 321

Query:   181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL 240
              GG +++AF+ +I+  GI  D  Y Y   +  +C+  +  +    I NY  VP N  +  
Sbjct:   322 NGGLINNAFEDMIELGGICTDDDYPYVSDAPNLCNIDRCTEKYG-IKNYLSVPDNKLKEA 380

Query:   241 LKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSE-------EGIK- 291
             L+ +   P+S++I  S    FY  G+F+G C   LNH V  VG+G  E       +G K 
Sbjct:   381 LRFLG--PISISIAVSDDFPFYKEGIFDGECGDELNHAVMLVGFGMKEIVNPLTKKGEKH 438

Query:   292 -YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
              Y++IKNSWGQ WGE G+  ++ D      +CG+   A  P+
Sbjct:   439 YYYIIKNSWGQQWGERGFINIETDESGLMRKCGLGTDAFIPL 480

 Score = 297 (109.6 bits), Expect = 5.9e-26, P = 5.9e-26
 Identities = 87/263 (33%), Positives = 131/263 (49%)

Query:    43 RTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRS-YTLRLNKFADLTPQEF------IA 95
             + Y    E  +RF++F  N   V+  NN     +S Y   LN+FADLT  EF      + 
Sbjct:   172 KQYNSPNEMKERFQVFLQNAHKVKMHNN---NKKSLYKKELNRFADLTYHEFKSKYLTLR 228

Query:    96 SQTGFKMSDHS-SSLKANGTPFLYKSSQV--PPSVNWIEKGAVTPVKYQ---GQC----A 145
             S    K S +    +  +     YK ++     + +W     VTPVK Q   G C    +
Sbjct:   229 SSKPLKNSKYLLDQINYDAVIKKYKGNENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSS 288

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
             + +VE   AI+ N+L++LSEQ+LVDC+    N GC GG +++AF+ +I+  GI  D  Y 
Sbjct:   289 IGSVESQYAIRKNKLITLSEQELVDCSFK--NYGCNGGLINNAFEDMIELGGICTDDDYP 346

Query:   206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-LQFYSGG 264
             Y   +  +C+  +  +    I NY  VP N  +  L+ +   P+S++I  S    FY  G
Sbjct:   347 YVSDAPNLCNIDRCTEKYG-IKNYLSVPDNKLKEALRFLG--PISISIAVSDDFPFYKEG 403

Query:   265 VFNGYCETFLNHGVTAVGYGTSE 287
             +F+G C   LNH V  VG+G  E
Sbjct:   404 IFDGECGDELNHAVMLVGFGMKE 426


>UNIPROTKB|Q5T8F0 [details] [associations]
            symbol:CTSL1 "Cathepsin L1 light chain" species:9606 "Homo
            sapiens" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            EMBL:AL160279 UniGene:Hs.731507 UniGene:Hs.731952 HGNC:HGNC:2537
            ChiTaRS:CTSL1 IPI:IPI00640540 SMR:Q5T8F0 Ensembl:ENST00000342020
            ChEMBL:CHEMBL1293261 Uniprot:Q5T8F0
        Length = 225

 Score = 326 (119.8 bits), Expect = 2.1e-29, P = 2.1e-29
 Identities = 80/212 (37%), Positives = 113/212 (53%)

Query:    10 LIISGSCASQATYR-TFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF 68
             LI++  C   A+   TFD  S+  ++ +WKA + R Y  + E  +R  +++ N+  +E  
Sbjct:     5 LILAAFCLGIASATLTFDH-SLEAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELH 62

Query:    69 NNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSV 127
             N     G  S+T+ +N F D+T +EF     GF+        K    P  Y++   P SV
Sbjct:    63 NQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKG-KVFQEPLFYEA---PRSV 118

Query:   128 NWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
             +W EKG VTPVK QGQC       A  A+EG    K  RL+SLSEQ LVDC+    N GC
Sbjct:   119 DWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGC 178

Query:   181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTG 212
              GG MD AF+Y+  N G+ ++  Y YE   +G
Sbjct:   179 NGGLMDYAFQYVQDNGGLDSEESYPYEATVSG 210


>WB|WBGene00012747 [details] [associations]
            symbol:Y40H7A.10 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000230773 EMBL:AL033510
            HSSP:P80067 MEROPS:C01.A48 PIR:T26792 RefSeq:NP_502836.1
            ProteinModelPortal:Q9XWA4 SMR:Q9XWA4 STRING:Q9XWA4
            EnsemblMetazoa:Y40H7A.10 GeneID:189809 KEGG:cel:CELE_Y40H7A.10
            UCSC:Y40H7A.10 CTD:189809 WormBase:Y40H7A.10 eggNOG:NOG286423
            InParanoid:Q9XWA4 OMA:NGPMIVC NextBio:943702 Uniprot:Q9XWA4
        Length = 343

 Score = 323 (118.8 bits), Expect = 4.4e-29, P = 4.4e-29
 Identities = 103/305 (33%), Positives = 147/305 (48%)

Query:    34 FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
             F+ +  +Y R Y    E  KRF IF  NL  VER+N    G  +Y   LN F+DLT +E+
Sbjct:    51 FQNFLVKYLREYPNEYEIVKRFTIFSRNLDLVERYNKEDAGKVTY--ELNDFSDLTEEEW 108

Query:    94 IASQTGFKMSDHSS-SLKANGTPFLYKSSQVPPSVNWIE-KGA--VTPVKYQGQCA---- 145
                    K  DHS  SLK      L     +P SV+W    G   VT +KYQG C     
Sbjct:   109 KKYLMTPK-PDHSEKSLKPKT---LIDKKNLPNSVDWRNVNGTNHVTGIKYQGPCGSCWA 164

Query:   146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
                 AA+E   +I    L SLS QQL+DC    +   C GG   +A KY  Q+ GIT   
Sbjct:   165 FATAAAIESAVSISGGGLQSLSSQQLLDCTVVSDK--CGGGEPVEALKYA-QSHGITTAH 221

Query:   203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID--ASALQF 260
              Y Y   +T   +++      A+I+++      DE + + A+ N P+ V  +   +  +F
Sbjct:   222 NYPYYFWTTKCRETVPT---VARISSWMKAESEDEMAQIVAL-NGPMIVCANFATNKNRF 277

Query:   261 YSGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
             Y  G+     C T   H +  +GYG       YW++KN++ + WGE GY R++RD++   
Sbjct:   278 YHSGIAEDPDCGTEPTHALIVIGYGPD-----YWILKNTYSKVWGEKGYMRVKRDVNW-- 330

Query:   320 GQCGI 324
               CGI
Sbjct:   331 --CGI 333


>UNIPROTKB|E9PI30 [details] [associations]
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            EMBL:AP001201 HGNC:HGNC:2546 IPI:IPI00984532
            ProteinModelPortal:E9PI30 SMR:E9PI30 Ensembl:ENST00000528419
            ArrayExpress:E9PI30 Bgee:E9PI30 Uniprot:E9PI30
        Length = 364

 Score = 257 (95.5 bits), Expect = 2.0e-28, Sum P(2) = 2.0e-28
 Identities = 85/307 (27%), Positives = 152/307 (49%)

Query:     7 IVVLIISGSCAS-QATYRTFDEG----SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
             ++ L+++G     +   R  D G     + E F+ ++ Q+ R+Y    E++ R +IF  N
Sbjct:    10 LLALLVAGLAQGIRGPLRAQDLGPQPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHN 69

Query:    62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS 121
             L   +R     +G   + +    F+DLT +EF     G++ +  +  + + G     +  
Sbjct:    70 LAQAQRLQEEDLGTAEFGV--TPFSDLTEEEF-GQLYGYRRA--AGGVPSMGREIRSEEP 124

Query:   122 Q--VPPSVNWIE-KGAVTPVKYQGQC----AVAAVEGINAI-KIN--RLVSLSEQQLVDC 171
             +  VP S +W +   A++P+K Q  C    A+AA   I  + +I+    V +S Q+L+DC
Sbjct:   125 EESVPFSCDWRKVASAISPIKDQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVQELLDC 184

Query:   172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG-MSTGICDSIKAEDHAAQITNYE 230
                   +GC+GGF+ DAF  ++ N G+ ++  Y ++G +    C   K +   A I ++ 
Sbjct:   185 GRC--GDGCHGGFVWDAFITVLNNSGLASEKDYPFQGKVRAHRCHPKKYQK-VAWIQDFI 241

Query:   231 DVPPNDEESLLKAVANQ-PVSVAIDASALQFYSGGVFNGY---CETFL-NHGVTAVGYGT 285
              +  N+E  + + +A   P++V I+   LQ Y  GV       C+  L +H V  VG+G+
Sbjct:   242 MLQ-NNEHRIAQYLATYGPITVTINMKPLQLYRKGVIKATPTTCDPQLVDHSVLLVGFGS 300

Query:   286 --SEEGI 290
               SEEGI
Sbjct:   301 VKSEEGI 307

 Score = 75 (31.5 bits), Expect = 2.0e-28, Sum P(2) = 2.0e-28
 Identities = 10/14 (71%), Positives = 12/14 (85%)

Query:   292 YWLIKNSWGQDWGE 305
             YW++KNSWG  WGE
Sbjct:   326 YWILKNSWGAQWGE 339


>GENEDB_PFALCIPARUM|PF11_0162 [details] [associations]
            symbol:PF11_0162 "falcipain-3" species:5833
            "Plasmodium falciparum" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 HSSP:P43235 EMBL:AE014186 GO:GO:0020020
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347833.1
            ProteinModelPortal:Q8IIL0 SMR:Q8IIL0 MEROPS:C01.063
            EnsemblProtists:PF11_0162:mRNA GeneID:810709 KEGG:pfa:PF11_0162
            EuPathDB:PlasmoDB:PF3D7_1115400 OMA:ENDEDYW ChEMBL:CHEMBL1250373
            Uniprot:Q8IIL0
        Length = 492

 Score = 314 (115.6 bits), Expect = 7.8e-28, P = 7.8e-28
 Identities = 80/222 (36%), Positives = 113/222 (50%)

Query:   128 NWIEKGAVTPVKYQ---GQC----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
             +W   G VTPVK Q   G C    +V +VE   AI+   L   SEQ+LVDC+    NNGC
Sbjct:   274 DWRLHGGVTPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCSVK--NNGC 331

Query:   181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL 240
             YGG++ +AF  +I   G+ +   Y Y       C+ +K  +    I +Y  +P +  +  
Sbjct:   332 YGGYITNAFDDMIDLGGLCSQDDYPYVSNLPETCN-LKRCNERYTIKSYVSIPDDKFKEA 390

Query:   241 LKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSE---------EGI 290
             L+ +   P+S++I AS    FY GG ++G C    NH V  VGYG  +         E  
Sbjct:   391 LRYLG--PISISIAASDDFAFYRGGFYDGECGAAPNHAVILVGYGMKDIYNEDTGRMEKF 448

Query:   291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
              Y++IKNSWG DWGE GY  L+ D +  +  C I   A  P+
Sbjct:   449 YYYIIKNSWGSDWGEGGYINLETDENGYKKTCSIGTEAYVPL 490

 Score = 246 (91.7 bits), Expect = 4.2e-19, P = 4.2e-19
 Identities = 76/262 (29%), Positives = 117/262 (44%)

Query:    43 RTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF------IAS 96
             + Y+ S E  KRF IF +N   +E  N     N  Y   +NKF DL+P+EF      + +
Sbjct:   180 KKYETSEEMQKRFIIFSENYRKIELHNKKT--NSLYKRGMNKFGDLSPEEFRSKYLNLKT 237

Query:    97 QTGFKMSDHSSSLKANGTPFL--YKSSQVPP---SVNWIEKGAVTPVKYQGQCAVA-AVE 150
                FK      S +AN    +  YK +       + +W   G VTPVK Q  C    A  
Sbjct:   238 HGPFKTLSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGGVTPVKDQALCGSCWAFS 297

Query:   151 GINAIKINRLVSLSEQQLVD----CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
              + +++    +      L         +  NNGCYGG++ +AF  +I   G+ +   Y Y
Sbjct:   298 SVGSVESQYAIRKKALFLFSEQELVDCSVKNNGCYGGYITNAFDDMIDLGGLCSQDDYPY 357

Query:   207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGV 265
                    C+ +K  +    I +Y  +P +  +  L+ +   P+S++I AS    FY GG 
Sbjct:   358 VSNLPETCN-LKRCNERYTIKSYVSIPDDKFKEALRYLG--PISISIAASDDFAFYRGGF 414

Query:   266 FNGYCETFLNHGVTAVGYGTSE 287
             ++G C    NH V  VGYG  +
Sbjct:   415 YDGECGAAPNHAVILVGYGMKD 436


>UNIPROTKB|Q8IIL0 [details] [associations]
            symbol:PF11_0162 "Falcipain-3" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 HSSP:P43235 EMBL:AE014186 GO:GO:0020020
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347833.1
            ProteinModelPortal:Q8IIL0 SMR:Q8IIL0 MEROPS:C01.063
            EnsemblProtists:PF11_0162:mRNA GeneID:810709 KEGG:pfa:PF11_0162
            EuPathDB:PlasmoDB:PF3D7_1115400 OMA:ENDEDYW ChEMBL:CHEMBL1250373
            Uniprot:Q8IIL0
        Length = 492

 Score = 314 (115.6 bits), Expect = 7.8e-28, P = 7.8e-28
 Identities = 80/222 (36%), Positives = 113/222 (50%)

Query:   128 NWIEKGAVTPVKYQ---GQC----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
             +W   G VTPVK Q   G C    +V +VE   AI+   L   SEQ+LVDC+    NNGC
Sbjct:   274 DWRLHGGVTPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCSVK--NNGC 331

Query:   181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL 240
             YGG++ +AF  +I   G+ +   Y Y       C+ +K  +    I +Y  +P +  +  
Sbjct:   332 YGGYITNAFDDMIDLGGLCSQDDYPYVSNLPETCN-LKRCNERYTIKSYVSIPDDKFKEA 390

Query:   241 LKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSE---------EGI 290
             L+ +   P+S++I AS    FY GG ++G C    NH V  VGYG  +         E  
Sbjct:   391 LRYLG--PISISIAASDDFAFYRGGFYDGECGAAPNHAVILVGYGMKDIYNEDTGRMEKF 448

Query:   291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
              Y++IKNSWG DWGE GY  L+ D +  +  C I   A  P+
Sbjct:   449 YYYIIKNSWGSDWGEGGYINLETDENGYKKTCSIGTEAYVPL 490

 Score = 246 (91.7 bits), Expect = 4.2e-19, P = 4.2e-19
 Identities = 76/262 (29%), Positives = 117/262 (44%)

Query:    43 RTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF------IAS 96
             + Y+ S E  KRF IF +N   +E  N     N  Y   +NKF DL+P+EF      + +
Sbjct:   180 KKYETSEEMQKRFIIFSENYRKIELHNKKT--NSLYKRGMNKFGDLSPEEFRSKYLNLKT 237

Query:    97 QTGFKMSDHSSSLKANGTPFL--YKSSQVPP---SVNWIEKGAVTPVKYQGQCAVA-AVE 150
                FK      S +AN    +  YK +       + +W   G VTPVK Q  C    A  
Sbjct:   238 HGPFKTLSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGGVTPVKDQALCGSCWAFS 297

Query:   151 GINAIKINRLVSLSEQQLVD----CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
              + +++    +      L         +  NNGCYGG++ +AF  +I   G+ +   Y Y
Sbjct:   298 SVGSVESQYAIRKKALFLFSEQELVDCSVKNNGCYGGYITNAFDDMIDLGGLCSQDDYPY 357

Query:   207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGV 265
                    C+ +K  +    I +Y  +P +  +  L+ +   P+S++I AS    FY GG 
Sbjct:   358 VSNLPETCN-LKRCNERYTIKSYVSIPDDKFKEALRYLG--PISISIAASDDFAFYRGGF 414

Query:   266 FNGYCETFLNHGVTAVGYGTSE 287
             ++G C    NH V  VGYG  +
Sbjct:   415 YDGECGAAPNHAVILVGYGMKD 436


>DICTYBASE|DDB_G0274385 [details] [associations]
            symbol:DDB_G0274385 "Cysteine proteinase 1,
            mitochondrial" species:44689 "Dictyostelium discoideum" [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0274385 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 EMBL:AAFI02000012 RefSeq:XP_644301.1
            ProteinModelPortal:Q86KD4 EnsemblProtists:DDB0167535 GeneID:8619729
            KEGG:ddi:DDB_G0274385 InParanoid:Q86KD4 OMA:SICVDAS Uniprot:Q86KD4
        Length = 358

 Score = 310 (114.2 bits), Expect = 1.0e-27, P = 1.0e-27
 Identities = 80/212 (37%), Positives = 111/212 (52%)

Query:   126 SVNWIEKGAVTPVKYQGQCA----VAAVEGINA--IKI-NRLVSLSEQQLVDCATNDNNN 178
             S++W +KG VTPVK QGQC      +AVE I    IK  N+ + LSEQQ VDC   D   
Sbjct:   148 SIDWRKKGLVTPVKDQGQCGSCYIFSAVEQIETAWIKAGNKPILLSEQQAVDCDPYDGQ- 206

Query:   179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN-DE 237
              C GG     ++Y  Q  G++ +A Y Y   + G C ++     A  + +Y  V    DE
Sbjct:   207 -CGGGDPYTVYEYFSQVGGVSTNAQYPYTA-TDGTCVNMS---RAVPVVSYHYVTQGGDE 261

Query:   238 ESLLKAVANQ-PVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG----IKY 292
              +L+K + N  PVS+ +DAS  Q YSGG+    C   ++H V  VG    +      ++Y
Sbjct:   262 NTLIKTIVNDGPVSICVDASTWQSYSGGIITTGCGKNIDHCVQVVGLEVDKTDPSNPVQY 321

Query:   293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
             ++I+NSWG DWG DGY  +    D     CGI
Sbjct:   322 YIIRNSWGTDWGIDGYIYVATGSDL----CGI 349

 Score = 214 (80.4 bits), Expect = 1.8e-15, P = 1.8e-15
 Identities = 77/277 (27%), Positives = 120/277 (43%)

Query:    27 EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
             + S+ + F  W  ++ + YK+S E   RF  FK+N+      N+   G   +    N F+
Sbjct:    37 DSSMRDTFNHWAKKHSKIYKDSIEMENRFSNFKENMKKNIELNSMHAGKAKF--ESNGFS 94

Query:    87 DLTPQEF--IASQTGFK-MSDH-SSSLKANGTPFL-----YKSSQVPP-----SVNWIEK 132
             DL+ +EF        FK    H  +S+K   TP       YK  +        S++W +K
Sbjct:    95 DLSEEEFSNFHLNKAFKGKPSHLRNSIKPQPTPHHSLINGYKEMENGDLNELYSIDWRKK 154

Query:   133 GAVTPVKYQGQCAVAAV-EGINAIKINRLVSLSEQQLVD---CATNDNNNGCYGGFMDDA 188
             G VTPVK QGQC    +   +  I+   + + ++  L+        D  +G  GG     
Sbjct:   155 GLVTPVKDQGQCGSCYIFSAVEQIETAWIKAGNKPILLSEQQAVDCDPYDGQCGGGDPYT 214

Query:   189 FKYIIQNKGITNDAVYSYEGMST-GICDSIKAEDHAAQITNYEDVPPN-DEESLLKAVAN 246
                     G  +     Y   +T G C ++     A  + +Y  V    DE +L+K + N
Sbjct:   215 VYEYFSQVGGVSTNA-QYPYTATDGTCVNMS---RAVPVVSYHYVTQGGDENTLIKTIVN 270

Query:   247 Q-PVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVG 282
               PVS+ +DAS  Q YSGG+    C   ++H V  VG
Sbjct:   271 DGPVSICVDASTWQSYSGGIITTGCGKNIDHCVQVVG 307


>UNIPROTKB|F1RU23 [details] [associations]
            symbol:CTSW "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 KO:K08569 EMBL:CU928325
            RefSeq:XP_003122571.1 UniGene:Ssc.28940 Ensembl:ENSSSCT00000014177
            GeneID:100525853 KEGG:ssc:100525853 OMA:CWAMAAV Uniprot:F1RU23
        Length = 367

 Score = 305 (112.4 bits), Expect = 3.5e-27, P = 3.5e-27
 Identities = 94/304 (30%), Positives = 154/304 (50%)

Query:     7 IVVLIISGSCAS-QATYRTFDEG----SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
             ++VL+++G     +   R+ D G     + E F  ++ QY R+Y   AE+++R +IF  N
Sbjct:    10 LLVLVVAGPAQGLKDALRSQDPGPQPMGLKEVFTLFQIQYNRSYSNPAEHARRLDIFAQN 69

Query:    62 LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS 121
             L   +R     +G   + +    F+DLT +EF     G     H  + KA        S 
Sbjct:    70 LAKAQRLQEEDLGTAEFGV--TPFSDLTEEEF-----GQLHGHHWGAGKAPSMGIKVGSE 122

Query:   122 Q----VPPSVNWIEK-GAVTPVKYQGQC----AVAAVEGINA---IKINRLVSLSEQQLV 169
             +    VP S +W +K G ++ +K+Q  C    A+AAV+ + A   IK ++ V LS QQ++
Sbjct:   123 ESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQQVL 182

Query:   170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG-MSTGICDSIKAEDHAAQITN 228
             DC  +   NGC GGF+ DAF  ++   G+ ++  Y Y+G + T  C + K     A I +
Sbjct:   183 DC--DRCGNGCNGGFVWDAFLTVLNTSGLASEQDYPYKGTVKTHRCLA-KQHRKVAWIQD 239

Query:   229 YEDVPPNDEESLLKAVANQ-PVSVAIDASALQFYSGGVFNGY---CETFL-NHGVTAVGY 283
             +  +    E+S+ + +A + P++V I+A  LQ Y  GV       C+  L NH V  VG+
Sbjct:   240 FLMLQ-FCEQSIARYLATEGPITVTINAGLLQQYKRGVIRATPATCDPHLVNHSVLLVGF 298

Query:   284 GTSE 287
             G S+
Sbjct:   299 GKSK 302

 Score = 285 (105.4 bits), Expect = 4.6e-25, P = 4.6e-25
 Identities = 79/214 (36%), Positives = 116/214 (54%)

Query:   145 AVAAVEGINA---IKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
             A+AAV+ + A   IK ++ V LS QQ++DC  +   NGC GGF+ DAF  ++   G+ ++
Sbjct:   155 AMAAVDNVEAQWAIKYHQAVQLSVQQVLDC--DRCGNGCNGGFVWDAFLTVLNTSGLASE 212

Query:   202 AVYSYEG-MSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDASALQ 259
               Y Y+G + T  C + K     A I ++  +    E+S+ + +A + P++V I+A  LQ
Sbjct:   213 QDYPYKGTVKTHRCLA-KQHRKVAWIQDFLMLQ-FCEQSIARYLATEGPITVTINAGLLQ 270

Query:   260 FYSGGVFNGY---CETFL-NHGVTAVGYGTSE--EG--------IKYWLIKNSWGQDWGE 305
              Y  GV       C+  L NH V  VG+G S+  EG        I YW++KNSWG DWGE
Sbjct:   271 QYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGRRPRPGHSIPYWILKNSWGPDWGE 330

Query:   306 DGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQP 339
             +GYFRL R        CGI     +PV+    +P
Sbjct:   331 EGYFRLHRG----SNTCGIT---KYPVTARVDKP 357


>TAIR|locus:2082687 [details] [associations]
            symbol:AT3G54940 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002686 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HSSP:P53634
            OMA:GGGLMTN EMBL:AY070063 IPI:IPI00528988 RefSeq:NP_567010.5
            UniGene:At.28412 ProteinModelPortal:Q8VYS0 SMR:Q8VYS0 PRIDE:Q8VYS0
            EnsemblPlants:AT3G54940.2 GeneID:824659 KEGG:ath:AT3G54940
            TAIR:At3g54940 PhylomeDB:Q8VYS0 ProtClustDB:CLSN2718801
            ArrayExpress:Q8VYS0 Genevestigator:Q8VYS0 Uniprot:Q8VYS0
        Length = 367

 Score = 303 (111.7 bits), Expect = 5.7e-27, P = 5.7e-27
 Identities = 88/291 (30%), Positives = 133/291 (45%)

Query:    33 KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
             KF  + + YG+ Y    E   R  IF  N++         + + S    + +F+DLT +E
Sbjct:    50 KFRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAAEHQ---MMDPSAVHGVTQFSDLTEEE 106

Query:    93 FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------- 145
             F    TG      S          + +   +P   +W EKG VT VK QG C        
Sbjct:   107 FKRMYTGVADVGGSRGGTVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFST 166

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCAT----NDN---NNGCYGGFMDDAFKYIIQNKGI 198
               A EG + +   +L+SLSEQQLVDC       D    +NGC GG M +A++Y+++  G+
Sbjct:   167 TGAAEGAHFVSTGKLLSLSEQQLVDCDQACDPKDKKACDNGCGGGLMTNAYEYLMEAGGL 226

Query:   199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL 258
               +  Y Y G   G C     E  A ++ N+  +P ++ +     V + P++V ++A  +
Sbjct:   227 EEERSYPYTG-KRGHC-KFDPEKVAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVFM 284

Query:   259 QFYSGGVFNGY-C-ETFLNHGVTAVGYGTSEEGIKYWLIKNSW--GQDWGE 305
             Q Y GGV     C +  +NHGV  VGYG+    I     K  W     WG+
Sbjct:   285 QTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSILRLSNKPYWIIKNSWGK 335

 Score = 294 (108.6 bits), Expect = 5.2e-26, P = 5.2e-26
 Identities = 73/205 (35%), Positives = 110/205 (53%)

Query:   148 AVEGINAIKINRLVSLSEQQLVDCAT----NDN---NNGCYGGFMDDAFKYIIQNKGITN 200
             A EG + +   +L+SLSEQQLVDC       D    +NGC GG M +A++Y+++  G+  
Sbjct:   169 AAEGAHFVSTGKLLSLSEQQLVDCDQACDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEE 228

Query:   201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF 260
             +  Y Y G   G C     E  A ++ N+  +P ++ +     V + P++V ++A  +Q 
Sbjct:   229 ERSYPYTG-KRGHC-KFDPEKVAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQT 286

Query:   261 YSGGVFNGY-C-ETFLNHGVTAVGYGTSEEGI------KYWLIKNSWGQDWGEDGYFRLQ 312
             Y GGV     C +  +NHGV  VGYG+    I       YW+IKNSWG+ WGE+GY++L 
Sbjct:   287 YIGGVSCPLICSKRNVNHGVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLC 346

Query:   313 RDIDQPQGQCGIAMFASFPVSKESA 337
             R  D     CGI    S   ++ S+
Sbjct:   347 RGHDI----CGINSMVSAVATQVSS 367


>UNIPROTKB|F1P0K2 [details] [associations]
            symbol:CTSO "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            OMA:SNVCGIA EMBL:AADN02016534 IPI:IPI00651180
            Ensembl:ENSGALT00000015270 Uniprot:F1P0K2
        Length = 320

 Score = 290 (107.1 bits), Expect = 1.4e-25, P = 1.4e-25
 Identities = 84/298 (28%), Positives = 135/298 (45%)

Query:    42 GRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIA---SQT 98
             GR   +     +     +++   +   N+ +  N S     N+F+ L P+EF A      
Sbjct:    30 GRPPWDGGGREEEAAALRESAKRIRLLNSPSNDNGSAFYGKNQFSHLFPEEFKAIYLRSI 89

Query:    99 GFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEG 151
              +K+  +    K    P       +P   +W +K  +  V+ Q  C        V  +E 
Sbjct:    90 PYKLPRYIKVPKGEEKP-------LPKKFDWRDKKVIAEVRNQQTCGGCWAFSVVGGIES 142

Query:   152 INAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK-GITNDAVYSYEGMS 210
               AIK + L  LS QQ++DC+ +  N GC GG    A  ++ Q K  +  D+ Y+++   
Sbjct:   143 AYAIKGHNLEELSVQQVIDCSYS--NYGCSGGSTITALSWLNQTKVKLVRDSEYTFKAQ- 199

Query:   211 TGICDSIKAEDHAAQITNYE--DVPPNDEESLLKAVANQPVSVAIDASALQFYSGGVFNG 268
             TG+C      D    IT +   D    +EE +   V   P++V +DA + Q Y GG+   
Sbjct:   200 TGLCHYFPHSDFGVSITGFAAYDFSGQEEEMMRVLVDWGPLAVTVDAVSWQDYLGGIIQY 259

Query:   269 YCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIA 325
             +C +   NH V   G+ T+   I YW+++NSWG+ WG DGY R++         CGIA
Sbjct:   260 HCSSGKANHAVLITGFDTTGI-IPYWIVQNSWGRTWGIDGYVRVKIG----SNVCGIA 312


>DICTYBASE|DDB_G0281079 [details] [associations]
            symbol:DDB_G0281079 species:44689 "Dictyostelium
            discoideum" [GO:0030246 "carbohydrate binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR008965 InterPro:IPR013128 InterPro:IPR019028
            Pfam:PF00112 Pfam:PF09478 PRINTS:PR00705 SMART:SM00645
            SMART:SM01063 InterPro:IPR000169 dictyBase:DDB_G0281079
            GO:GO:0030246 EMBL:AAFI02000040 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 SUPFAM:SSF49384 RefSeq:XP_640804.1
            ProteinModelPortal:Q54UH2 EnsemblProtists:DDB0204000 GeneID:8622858
            KEGG:ddi:DDB_G0281079 InParanoid:Q54UH2 OMA:ALESHYY
            ProtClustDB:CLSZ2430562 Uniprot:Q54UH2
        Length = 664

 Score = 296 (109.3 bits), Expect = 2.5e-25, P = 2.5e-25
 Identities = 87/245 (35%), Positives = 127/245 (51%)

Query:    69 NNAAIGNRSYTLR-LNKFADLTPQEFIASQTGFKMSDHSSSLKAN--GTPFLYKSSQVPP 125
             N+  +G+ S+T   + K   + P   +        S  SS++  +      L K S+ P 
Sbjct:   415 NSIEVGS-SHTFGYIQKAYSINPLLSVNKVCQESSSSSSSNITTDEPSKSRLLKWSR-PI 472

Query:   126 SVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC-ATND-N 176
             S++W   G V+ VK QG C        V A+E     K NR++ LSEQ LVDC A+N   
Sbjct:   473 SIDWRTWGMVSKVKNQGSCGSCYAFSTVGALESHYYRKNNRMLDLSEQNLVDCTASNKYR 532

Query:   177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
             N GC GG+M + + YI +N GI  ++ Y YEG   G C    + D  ++I+ +  +  +D
Sbjct:   533 NGGCSGGWMHNCYSYIQENGGINQESTYPYEG-KFGQC-RYNSGDAQSRISKFVMIKQHD 590

Query:   237 EESLLKAVANQ-PVSVAIDASALQF--YSGGVF-NGYCETF-LNHGVTAVGYGTSEEGIK 291
             EE L   VA+  PVSVA DAS  +F  YS G++ +  C  +   H V  VGY  +E G+ 
Sbjct:   591 EEDLADTVASVGPVSVAYDASTREFMYYSRGIYYSDNCNKYRTTHAVVVVGYD-NENGVD 649

Query:   292 YWLIK 296
             YW+IK
Sbjct:   650 YWIIK 654


>WB|WBGene00011102 [details] [associations]
            symbol:R07E3.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            EMBL:Z49207 HSSP:P53634 PIR:T24030 RefSeq:NP_001041280.1
            ProteinModelPortal:Q21810 SMR:Q21810 STRING:Q21810 MEROPS:C01.A43
            PaxDb:Q21810 EnsemblMetazoa:R07E3.1a GeneID:181242
            KEGG:cel:CELE_R07E3.1 UCSC:R07E3.1a CTD:181242 WormBase:R07E3.1a
            HOGENOM:HOG000021028 InParanoid:Q21810 OMA:ACKNEVI NextBio:913066
            ArrayExpress:Q21810 Uniprot:Q21810
        Length = 402

 Score = 284 (105.0 bits), Expect = 5.9e-25, P = 5.9e-25
 Identities = 95/319 (29%), Positives = 151/319 (47%)

Query:    29 SIAEKFEQWKAQYGRTYKESAENSKRFEIF--KDNLVAVERFNNAAIGNRSYTLR-LNKF 85
             +IA+++  +  ++ ++Y  S E+ KR   +   D  +A     N   G+  Y    ++ +
Sbjct:    85 NIAKEYIAYTEKFDKSYATSQESLKRLNAYYNTDENIANWNIQNEH-GSAEYGHNDMSDW 143

Query:    86 ADLTPQEFIASQTGFK-MSDHSSSLKANGTPFLYK----SSQVPPSVNWIEKGAVTPVKY 140
              D   ++ +  ++ +K +   +  ++        K    SS  P   +W +K  +TPVK 
Sbjct:   144 TDEEFEKTLLPKSFYKRLHKEAEFIEPIPESLTAKKGESSSPFPDFFDWRDKNVITPVKA 203

Query:   141 QGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
             QGQC       + A VE   AI      +LSEQ L+DC   DN   C GG  D AF+YI 
Sbjct:   204 QGQCGSCWAFASTATVEAAWAIAHGEKRNLSEQTLLDCDLVDN--ACDGGDEDKAFRYIH 261

Query:   194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVA 252
             +N G+ N     Y       C ++    +  +I     +  +DE+S++  + N  PV++ 
Sbjct:   262 RN-GLANAVDLPYVAHRQNGC-AVNDHWNTTRIKAAYFLH-HDEDSIINWLVNFGPVNIG 318

Query:   253 IDA-SALQFYSGGVF--NGY-C--ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG-E 305
             +     ++ Y GGVF  + Y C  E    H +   GYGTS+ G KYW++KNSWG  WG E
Sbjct:   319 MAVIQPMRAYKGGVFTPSEYACKNEVIGLHALLITGYGTSKTGEKYWIVKNSWGNTWGVE 378

Query:   306 DGYFRLQRDIDQPQGQCGI 324
              GY    R I+     CGI
Sbjct:   379 HGYIYFARGINA----CGI 393


>UNIPROTKB|P43234 [details] [associations]
            symbol:CTSO "Cathepsin O" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 Reactome:REACT_6900
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0004197
            CleanEx:HS_CTSO EMBL:X77383 EMBL:BC049206 IPI:IPI00017257
            PIR:A55090 RefSeq:NP_001325.1 UniGene:Hs.75262
            ProteinModelPortal:P43234 SMR:P43234 IntAct:P43234 STRING:P43234
            MEROPS:C01.035 PhosphoSite:P43234 DMDM:1168795 PRIDE:P43234
            DNASU:1519 Ensembl:ENST00000433477 GeneID:1519 KEGG:hsa:1519
            UCSC:uc003ipg.3 CTD:1519 GeneCards:GC04M156845 HGNC:HGNC:2542
            HPA:HPA002041 MIM:600550 neXtProt:NX_P43234 PharmGKB:PA27040
            HOVERGEN:HBG105050 InParanoid:P43234 KO:K01374 OMA:SNVCGIA
            OrthoDB:EOG4V6ZH1 PhylomeDB:P43234 BindingDB:P43234
            ChEMBL:CHEMBL3035 GenomeRNAi:1519 NextBio:6287 Bgee:P43234
            Genevestigator:P43234 GermOnline:ENSG00000151792 Uniprot:P43234
        Length = 321

 Score = 271 (100.5 bits), Expect = 1.4e-23, P = 1.4e-23
 Identities = 88/302 (29%), Positives = 138/302 (45%)

Query:    38 KAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTL--RLNKFADLTPQEFIA 95
             +A +  T+  S E  +    F+++L    R+ N+   + + T    +N+F+ L P+EF A
Sbjct:    26 RAPFTPTWPRSRE--REAAAFRESLNR-HRYLNSLFPSENSTAFYGINQFSYLFPEEFKA 82

Query:    96 SQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAA 148
                  K S       A        +  +P   +W +K  VT V+ Q  C        V A
Sbjct:    83 IYLRSKPSKFPR-YSAE-VHMSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGA 140

Query:   149 VEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI--IQNKGITNDAVYSY 206
             VE   AIK   L  LS QQ++DC+ N  N GC GG   +A  ++  +Q K +  D+ Y +
Sbjct:   141 VESAYAIKGKPLEDLSVQQVIDCSYN--NYGCNGGSTLNALNWLNKMQVK-LVKDSEYPF 197

Query:   207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEES-LLKAVAN-QPVSVAIDASALQFYSGG 264
             +  + G+C           I  Y     +D+E  + KA+    P+ V +DA + Q Y GG
Sbjct:   198 KAQN-GLCHYFSGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYLGG 256

Query:   265 VFNGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
             +   +C +   NH V   G+  +     YW+++NSWG  WG DGY  ++         CG
Sbjct:   257 IIQHHCSSGEANHAVLITGFDKTGS-TPYWIVRNSWGSSWGVDGYAHVKMG----SNVCG 311

Query:   324 IA 325
             IA
Sbjct:   312 IA 313


>UNIPROTKB|Q5QP40 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015644 Pfam:PF00112
            InterPro:IPR000169 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 EMBL:AL355860 HOVERGEN:HBG011513
            PANTHER:PTHR12411:SF55 EMBL:AL356292 UniGene:Hs.632466
            HGNC:HGNC:2536 IPI:IPI00514633 SMR:Q5QP40 STRING:Q5QP40
            Ensembl:ENST00000443913 Uniprot:Q5QP40
        Length = 258

 Score = 270 (100.1 bits), Expect = 1.8e-23, P = 1.8e-23
 Identities = 65/191 (34%), Positives = 106/191 (55%)

Query:    21 TYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYT 79
             ++  + E  +   +E WK  + + Y    +   R  I++ NL  +   N  A++G  +Y 
Sbjct:    72 SFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYE 131

Query:    80 LRLNKFADLTPQEFIASQTGFKMS-DHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTP 137
             L +N   D+T +E +   TG K+   HS S   N T ++ +   + P SV++ +KG VTP
Sbjct:   132 LAMNHLGDMTSEEVVQKMTGLKVPLSHSRS---NDTLYIPEWEGRAPDSVDYRKKGYVTP 188

Query:   138 VKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
             VK QGQC       +V A+EG    K  +L++LS Q LVDC +   N+GC GG+M +AF+
Sbjct:   189 VKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDGCGGGYMTNAFQ 246

Query:   191 YIIQNKGITND 201
             Y+ +N+GI ++
Sbjct:   247 YVQKNRGIDSE 257


>UNIPROTKB|F1PGK4 [details] [associations]
            symbol:CTSO "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 OMA:SNVCGIA
            EMBL:AAEX03010073 Ensembl:ENSCAFT00000013638 Uniprot:F1PGK4
        Length = 316

 Score = 266 (98.7 bits), Expect = 4.8e-23, P = 4.8e-23
 Identities = 82/281 (29%), Positives = 130/281 (46%)

Query:    58 FKDNLVAVERFNNAAIG--NRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP 115
             F+++L    R+ N+     N S    +N+F+ L+P+EF A     K S  S    A    
Sbjct:    39 FRESLNR-HRYLNSVFPRENSSAVYGINQFSYLSPEEFKAIYLRSKPS-RSPRYPAEVRT 96

Query:   116 FLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQL 168
              + ++  +P   +W +K  VT V+ Q  C        V AVE   AIK   L  +S QQ+
Sbjct:    97 SI-RNVSLPLRFDWRDKRVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGKPLADISVQQV 155

Query:   169 VDCATNDNNNGCYGGFMDDAFKYIIQNK-GITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
             +DC+ N  N GC GG   +A  ++ + +  +  D+ Y ++  + G+C           I 
Sbjct:   156 IDCSYN--NYGCSGGSTLNALNWLNKTQVKLVRDSEYPFKAQN-GLCHYFSDSYSGFSIR 212

Query:   228 NYEDVPPNDEES-LLKAVAN-QPVSVAIDASALQFYSGGVFNGYCETF-LNHGVTAVGYG 284
              Y     +D+E  + K +    P+ V +DA + Q Y GG+   +C +   NH V   G+ 
Sbjct:   213 GYSAYDFSDQEDEMAKVLLTFGPLVVVVDAVSWQDYLGGIIQHHCSSGEANHAVLITGFD 272

Query:   285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIA 325
                    YW+++NSWG  WG DGY  ++         CGIA
Sbjct:   273 KIGS-TPYWIVRNSWGSSWGVDGYAHVKMG----GNICGIA 308


>UNIPROTKB|E1BPI9 [details] [associations]
            symbol:CTSO "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 CTD:1519 KO:K01374 OMA:SNVCGIA
            EMBL:DAAA02044933 IPI:IPI01004081 RefSeq:XP_002694471.2
            RefSeq:XP_874012.4 Ensembl:ENSBTAT00000014691 GeneID:616804
            KEGG:bta:616804 Uniprot:E1BPI9
        Length = 313

 Score = 265 (98.3 bits), Expect = 6.1e-23, P = 6.1e-23
 Identities = 84/283 (29%), Positives = 135/283 (47%)

Query:    58 FKDNLVAVERFNNAAIG--NRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP 115
             F+++L   +R+ N+     N +    +N+F+ L P+EF A    +  S  S   +     
Sbjct:    36 FRESLNR-QRYLNSLFPYENSTAVYGINQFSYLFPEEFKAI---YLRSSPSRFPRFPAEE 91

Query:   116 FLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQ 167
             +   S+  +P   +W +K  VT V+ Q  C        V AVE + AIK   L  LS QQ
Sbjct:    92 YTSISNLSLPLRFDWRDKHVVTQVRNQKTCGGCWAFSVVGAVESVCAIKGQPLEVLSVQQ 151

Query:   168 LVDCATNDNNNGCYGGFMDDAFKYI--IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQ 225
             ++DC+ +  N GC GG    A  ++  +Q K +  D+ Y ++  + G+C         + 
Sbjct:   152 VIDCSYS--NYGCNGGSPLSALYWLNKLQVK-LVRDSEYPFQAQN-GLCRYFSDSHSGSS 207

Query:   226 ITNYEDVP-PNDEESLLKAV-ANQPVSVAIDASALQFYSGGVFNGYCETF-LNHGVTAVG 282
             I  Y        E+ + +A+ A  P+ V +DA + Q Y GG+   +C +   NH V   G
Sbjct:   208 IKGYSAYDFSGQEDKMAEALLALGPLIVVVDAMSWQDYLGGIIQHHCSSGEANHAVLVTG 267

Query:   283 YGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIA 325
             +  +   I YW+++NSWG  WG DGY R++         CGIA
Sbjct:   268 FDKTGS-IPYWIVRNSWGTSWGIDGYVRVKMG----GNVCGIA 305


>MGI|MGI:2139628 [details] [associations]
            symbol:Ctso "cathepsin O" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:2139628 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 GeneTree:ENSGT00560000076599 MEROPS:C01.035 CTD:1519
            HOVERGEN:HBG105050 KO:K01374 OMA:SNVCGIA OrthoDB:EOG4V6ZH1
            EMBL:AK034490 EMBL:AK049470 EMBL:AK165930 EMBL:AK166103
            EMBL:BC044664 IPI:IPI00453524 RefSeq:NP_808330.1 UniGene:Mm.254642
            ProteinModelPortal:Q8BM88 SMR:Q8BM88 STRING:Q8BM88
            PhosphoSite:Q8BM88 PRIDE:Q8BM88 Ensembl:ENSMUST00000029649
            GeneID:229445 KEGG:mmu:229445 UCSC:uc008pon.1 InParanoid:Q8BM88
            NextBio:379433 Bgee:Q8BM88 CleanEx:MM_CTSO Genevestigator:Q8BM88
            GermOnline:ENSMUSG00000028015 Uniprot:Q8BM88
        Length = 312

 Score = 254 (94.5 bits), Expect = 9.0e-22, P = 9.0e-22
 Identities = 72/255 (28%), Positives = 123/255 (48%)

Query:    82 LNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQ 141
             +N+F+ L P+EF A   G K +  +    A G   +   S +P   +W +K  V PV+ Q
Sbjct:    60 VNQFSYLFPEEFKALYLGSKYA-WAPRYPAEGQRPIPNVS-LPLRFDWRDKHVVNPVRNQ 117

Query:   142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
               C        V+A+E   AI+   L  LS QQ++DC+ N  N+GC GG    A +++ +
Sbjct:   118 EMCGGCWAFSVVSAIESARAIQGKSLDYLSVQQVIDCSFN--NSGCLGGSPLCALRWLNE 175

Query:   195 NK-GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP-PNDEESLLKAVAN-QPVSV 251
              +  +  D+ Y ++ ++ G C           + ++        E+ + +A+ +  P+ V
Sbjct:   176 TQLKLVADSQYPFKAVN-GQCRHFPQSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVV 234

Query:   252 AIDASALQFYSGGVFNGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
              +DA + Q Y GG+   +C +   NH V   G+  +     YW+++NSWG  WG +GY  
Sbjct:   235 IVDAMSWQDYLGGIIQHHCSSGEANHAVLITGFDRTGN-TPYWMVRNSWGSSWGVEGYAH 293

Query:   311 LQRDIDQPQGQCGIA 325
             ++         CGIA
Sbjct:   294 VKMG----GNVCGIA 304


>WB|WBGene00013764 [details] [associations]
            symbol:Y113G7B.15 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 GeneTree:ENSGT00560000076599
            EMBL:AL110477 HOGENOM:HOG000019851 RefSeq:NP_507904.2
            ProteinModelPortal:Q9U2X1 SMR:Q9U2X1 DIP:DIP-25339N IntAct:Q9U2X1
            MINT:MINT-1058673 STRING:Q9U2X1 MEROPS:C01.A47
            EnsemblMetazoa:Y113G7B.15 GeneID:190976 KEGG:cel:CELE_Y113G7B.15
            UCSC:Y113G7B.15 CTD:190976 WormBase:Y113G7B.15 eggNOG:NOG302449
            OMA:AEEDIME Uniprot:Q9U2X1
        Length = 362

 Score = 222 (83.2 bits), Expect = 1.1e-21, Sum P(2) = 1.1e-21
 Identities = 56/182 (30%), Positives = 88/182 (48%)

Query:   147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
             A  E  N +      SLS+Q++ DCA + +  GC GG   +  K ++  +G ++D  Y Y
Sbjct:   167 AITEAANTLYSKSFTSLSDQEICDCADSGDTPGCVGGDPRNGLK-MVHLRGQSSDGDYPY 225

Query:   207 E---GMSTGIC--DSIKAEDHAAQITNYEDVPPNDEESLLKAV-ANQ-PVSVAIDASA-L 258
             E     +TG C  D          +  Y       EE +++ +  N  P +V        
Sbjct:   226 EEYRANTTGNCVGDEKSTVIQPETLNVYRFDQDYAEEDIMENLYLNHIPTAVYFRVGENF 285

Query:   259 QFYSGGVFNG---YCETFLN-HGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
             ++Y+ GV      Y  T    H V  VGYGTS++G+ YWL++NSW  DWG  GY +++R 
Sbjct:   286 EWYTSGVLQSEDCYQMTPAEWHSVAIVGYGTSDDGVPYWLVRNSWNSDWGLHGYVKIRRG 345

Query:   315 ID 316
             ++
Sbjct:   346 VN 347

 Score = 82 (33.9 bits), Expect = 1.1e-21, Sum P(2) = 1.1e-21
 Identities = 25/103 (24%), Positives = 41/103 (39%)

Query:     5 FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
             F+I  L +   C    T  + +   +   F  +   + + Y+  AE  +R   F  N   
Sbjct:     4 FIISPLFLLSLCQPTVTQHSQE---VLSHFNNFTMHHKKHYRTPAEKDRRLAHFAKNHQK 60

Query:    65 VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS 106
             ++  N  A    R+ T   NKFAD   QE  A  +     +H+
Sbjct:    61 IQELNAKARREGRNVTFGWNKFADKNRQELSARNSKIHPKNHT 103


>UNIPROTKB|H0YD65 [details] [associations]
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 EMBL:AP002748
            HGNC:HGNC:2531 ChiTaRS:CTSF Ensembl:ENST00000524994 Uniprot:H0YD65
        Length = 283

 Score = 249 (92.7 bits), Expect = 3.0e-21, P = 3.0e-21
 Identities = 81/262 (30%), Positives = 118/262 (45%)

Query:    30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
             +A  F+ +   Y RTY ES E   R  +F +N+V  ++      G   Y +   KF+DLT
Sbjct:    32 MASIFKNFVITYNRTY-ESKEARWRLSVFVNNMVRAQKIQALDRGTAQYGV--TKFSDLT 88

Query:    90 PQEF--IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-- 145
              +EF  I   T  +    +   +A     L      PP  +W  KGAVT VK QG C   
Sbjct:    89 EEEFRTIYLNTLLRKEPGNKMKQAKSVGDL-----APPEWDWRSKGAVTKVKDQGMCGSC 143

Query:   146 -----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
                     VEG   +    L+SLSEQ+L+DC   D    C GG   +A+  I    G+  
Sbjct:   144 WAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDK--ACMGGLPSNAYSAIKNLGGLET 201

Query:   201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF 260
             +  YSY+G     C+   AE     I +  ++  N+++         P+SVAI+A  +QF
Sbjct:   202 EDDYSYQGHMQS-CN-FSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQF 259

Query:   261 YSGGV---FNGYCETFL-NHGV 278
             Y  G+       C  +L +H V
Sbjct:   260 YRHGISRPLRPLCSPWLIDHAV 281


>WB|WBGene00044760 [details] [associations]
            symbol:Y71H2AM.25 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0004181
            "metallocarboxypeptidase activity" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            GeneTree:ENSGT00560000076599 EMBL:FO081822 eggNOG:NOG331187
            HOGENOM:HOG000114005 RefSeq:NP_001040887.1
            ProteinModelPortal:Q2AAB9 SMR:Q2AAB9 EnsemblMetazoa:Y71H2AM.25
            GeneID:4363054 KEGG:cel:CELE_Y71H2AM.25 UCSC:Y71H2AM.25 CTD:4363054
            WormBase:Y71H2AM.25 InParanoid:Q2AAB9 NextBio:959635 Uniprot:Q2AAB9
        Length = 299

 Score = 246 (91.7 bits), Expect = 6.3e-21, P = 6.3e-21
 Identities = 78/223 (34%), Positives = 117/223 (52%)

Query:   127 VNWIEKGAVTPVKYQGQC------AVAA-VEGINAIKIN-RLVSLSEQQLVDCATNDNN- 177
             ++W +KG V PVK QG+C      A+++ +E + A   N  L+S SEQQL+DC  +D+  
Sbjct:    86 LDWRDKGIVGPVKDQGKCNASHAFAISSSIESMYAKATNGSLLSFSEQQLIDC--DDHGF 143

Query:   178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGIC--DSIKAEDHAAQITNYEDVPPN 235
              GC      +A  Y I + GI  +A Y Y G   G C  DS K++    Q+ + E V  N
Sbjct:   144 KGCEEQPAINAVSYFIFH-GIETEADYPYAGKENGKCTFDSTKSK---IQLKDAEFVVSN 199

Query:   236 DEESLLKAVANQ-PVSVAIDAS-ALQFYSGGVFNGYCETFLN-HGVTA---VGYGTSEEG 289
             + +   + V N  P    + A  +L  Y  G++N   E   + H + +   VGYG   EG
Sbjct:   200 ETQGK-ELVTNYGPAFFTMRAPPSLYDYKIGIYNPSIEECTSTHEIRSMVIVGYGI--EG 256

Query:   290 I-KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
             + KYW++K S+G  WGE GY +L RD++     C +A F + P
Sbjct:   257 VQKYWIVKGSFGTSWGEQGYMKLARDVNA----CAMADFITVP 295


>WB|WBGene00013076 [details] [associations]
            symbol:Y51A2D.8 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            EMBL:AL021497 HSSP:P53634 HOGENOM:HOG000019851 PIR:T27079
            RefSeq:NP_507627.1 ProteinModelPortal:Q9XXQ7 SMR:Q9XXQ7
            MEROPS:C01.A49 EnsemblMetazoa:Y51A2D.8 GeneID:180208
            KEGG:cel:CELE_Y51A2D.8 UCSC:Y51A2D.8 CTD:180208 WormBase:Y51A2D.8
            eggNOG:NOG307864 InParanoid:Q9XXQ7 OMA:VAVYFKV NextBio:908434
            Uniprot:Q9XXQ7
        Length = 386

 Score = 191 (72.3 bits), Expect = 8.8e-20, Sum P(2) = 8.8e-20
 Identities = 59/189 (31%), Positives = 86/189 (45%)

Query:   142 GQCAVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
             G    A VE + A    +  SLS+Q++ DC T +   GC GG +    +Y+ +  G++ D
Sbjct:   178 GFAVTALVETVYAAHSGKFKSLSDQEVCDCGT-EGTPGCKGGSLTLGVQYV-KKYGLSGD 235

Query:   202 AVYSYEG--MSTGICDSIKAEDHA--AQITNYEDVPPND-EESLLKAVANQPVSVAIDAS 256
               Y Y+    + G    ++  D    A+  N+  + P   EE +++ +    V VA+   
Sbjct:   236 EDYPYDQNRANQGRRCRLRETDRIVPARAFNFAVINPRRAEEQIIQVLTEWKVPVAVYFK 295

Query:   257 AL-QF--YSGGVF-NGYCETFLN-HGVTAVGYGTSEEGI----KYWLIKNSWGQDWGEDG 307
                QF  Y  GV     C      H    VGY T E+       YW+IKNSWG DW E G
Sbjct:   296 VGDQFKEYKEGVIIEDDCRRATQWHAGAIVGYDTVEDSRGRSHDYWIIKNSWGGDWAESG 355

Query:   308 YFRLQRDID 316
             Y R+ R  D
Sbjct:   356 YVRVVRGRD 364

 Score = 130 (50.8 bits), Expect = 1.8e-12, Sum P(2) = 1.8e-12
 Identities = 50/171 (29%), Positives = 76/171 (44%)

Query:   135 VTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
             V P+K QGQCA        A VE + A    +  SLS+Q++ DC T +   GC GG +  
Sbjct:   164 VGPIKDQGQCACCWGFAVTALVETVYAAHSGKFKSLSDQEVCDCGT-EGTPGCKGGSLTL 222

Query:   188 AFKYIIQNKGITNDAVYSYEG--MSTGICDSIKAEDHA--AQITNYEDVPPND-EESLLK 242
               +Y+ +  G++ D  Y Y+    + G    ++  D    A+  N+  + P   EE +++
Sbjct:   223 GVQYV-KKYGLSGDEDYPYDQNRANQGRRCRLRETDRIVPARAFNFAVINPRRAEEQIIQ 281

Query:   243 AVANQPVSVAIDASAL-QF--YSGGVF-NGYCETFLN-HGVTAVGYGTSEE 288
              +    V VA+      QF  Y  GV     C      H    VGY T E+
Sbjct:   282 VLTEWKVPVAVYFKVGDQFKEYKEGVIIEDDCRRATQWHAGAIVGYDTVED 332

 Score = 106 (42.4 bits), Expect = 8.8e-20, Sum P(2) = 8.8e-20
 Identities = 33/96 (34%), Positives = 48/96 (50%)

Query:     7 IVVLIISGSCASQATYRTFDEGSI----AEK----FEQWKAQYGRTYKESAENSKRFEIF 58
             +VVL ISG  +   T   F E +I     EK    FE +K +Y R YK+ +EN +RF  F
Sbjct:     8 LVVLPISGVVSINITEPEFFEINIDRDHPEKLYKAFEDFKKKYNRKYKDESENQQRFNNF 67

Query:    59 KDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEF 93
               +   V++ N  +          +NKF+DL+  EF
Sbjct:    68 VKSYNNVDKLNAKSKAAGYDTQFGINKFSDLSTAEF 103


>WB|WBGene00019314 [details] [associations]
            symbol:K02E7.10 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 SMART:SM00645 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 HSSP:P07711
            EMBL:FO080411 PIR:T32392 RefSeq:NP_493904.1 UniGene:Cel.14828
            ProteinModelPortal:O17255 SMR:O17255 EnsemblMetazoa:K02E7.10
            GeneID:186889 KEGG:cel:CELE_K02E7.10 UCSC:K02E7.10 CTD:186889
            WormBase:K02E7.10 eggNOG:NOG331187 HOGENOM:HOG000114005
            InParanoid:O17255 OMA:GNANEAR NextBio:933344 Uniprot:O17255
        Length = 299

 Score = 235 (87.8 bits), Expect = 9.2e-20, P = 9.2e-20
 Identities = 73/223 (32%), Positives = 109/223 (48%)

Query:   127 VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKIN-RLVSLSEQQLVDCATNDNNN 178
             ++W EKG V PVK QG+C       A+AA+E + A   N +L+S SEQQ++DCA  +  N
Sbjct:    84 LDWREKGIVGPVKDQGKCNASYAFAAIAAIESMYAKANNGKLLSFSEQQIIDCA--NFTN 141

Query:   179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGM-STGIC--DSIKAEDHAAQITNYEDVPPN 235
              C     +      ++  G+  +A Y Y G  + G C  DS K +        Y DV PN
Sbjct:   142 PCQENLENVLSNRFLKENGVGTEADYPYVGKENVGKCEYDSSKMKLRPT----YIDVYPN 197

Query:   236 DEESLLKAVANQPVSVAIDASALQF-YSGGVFNGYCETFLN----HGVTAVGYGTSEEGI 290
             +E +             + +    F Y  G++N   E   N      +  VGYG  ++G 
Sbjct:   198 EEWARAHITTFGTGYFRMRSPPSFFHYKTGIYNPTKEECGNANEARSLAIVGYG--KDGA 255

Query:   291 -KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
              KYW++K S+G  WGE GY +L R+++     CG+A   S P+
Sbjct:   256 EKYWIVKGSFGTSWGEHGYMKLARNVNA----CGMAESISIPI 294


>UNIPROTKB|E9PKT6 [details] [associations]
            symbol:CTSH "Cathepsin H" species:9606 "Homo sapiens"
            [GO:0001520 "outer dense fiber" evidence=IEA] [GO:0001656
            "metanephros development" evidence=IEA] [GO:0001669 "acrosomal
            vesicle" evidence=IEA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0007283 "spermatogenesis" evidence=IEA] [GO:0008284 "positive
            regulation of cell proliferation" evidence=IEA] [GO:0010634
            "positive regulation of epithelial cell migration" evidence=IEA]
            [GO:0016505 "apoptotic protease activator activity" evidence=IEA]
            [GO:0030984 "kininogen binding" evidence=IEA] [GO:0031638 "zymogen
            activation" evidence=IEA] [GO:0031648 "protein destabilization"
            evidence=IEA] [GO:0032403 "protein complex binding" evidence=IEA]
            [GO:0032526 "response to retinoic acid" evidence=IEA] [GO:0033619
            "membrane protein proteolysis" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0043621 "protein
            self-association" evidence=IEA] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IEA] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            InterPro:IPR000169 GO:GO:0043066 GO:GO:0008284 PANTHER:PTHR12411
            PROSITE:PS00139 GO:GO:0045766 GO:GO:0004252 GO:GO:0032526
            GO:GO:0016505 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 GO:GO:0060448 GO:GO:0033619
            EMBL:AC011944 HGNC:HGNC:2535 IPI:IPI00375426
            ProteinModelPortal:E9PKT6 SMR:E9PKT6 PRIDE:E9PKT6
            Ensembl:ENST00000528741 ArrayExpress:E9PKT6 Bgee:E9PKT6
            Uniprot:E9PKT6
        Length = 134

 Score = 234 (87.4 bits), Expect = 1.2e-19, P = 1.2e-19
 Identities = 55/137 (40%), Positives = 81/137 (59%)

Query:    80 LRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGA-VTPV 138
             + LN+F+D++  E I  +  +    + S+ K+N   +L  +   PPSV+W +KG  V+PV
Sbjct:     1 MALNQFSDMSFAE-IKHKYLWSEPQNCSATKSN---YLRGTGPYPPSVDWRKKGNFVSPV 56

Query:   139 KYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
             K QG C          A+E   AI   +++SL+EQQLVDCA + NN+GC GG    AF+Y
Sbjct:    57 KNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEY 116

Query:   192 IIQNKGITNDAVYSYEG 208
             I+ NKGI  +  Y Y+G
Sbjct:   117 ILYNKGIMGEDTYPYQG 133


>GENEDB_PFALCIPARUM|PF14_0553 [details] [associations]
            symbol:PF14_0553 "cysteine proteinase
            falcipain-1" species:5833 "Plasmodium falciparum" [GO:0042540
            "hemoglobin catabolic process" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014187 KO:K01376 HSSP:P07688 GO:GO:0042540
            RefSeq:XP_001348727.1 ProteinModelPortal:Q8I6V0 PRIDE:Q8I6V0
            EnsemblProtists:PF14_0553:mRNA GeneID:812135 KEGG:pfa:PF14_0553
            EuPathDB:PlasmoDB:PF3D7_1458000 HOGENOM:HOG000065906
            ProtClustDB:CLSZ2457715 ChEMBL:CHEMBL1250371 Uniprot:Q8I6V0
        Length = 569

 Score = 202 (76.2 bits), Expect = 1.6e-19, Sum P(2) = 1.6e-19
 Identities = 59/175 (33%), Positives = 90/175 (51%)

Query:   121 SQVPPSVNWIEKGAVTPVKYQGQC----AVAAVEGINAI--KINR-LVSLSEQQLVDCAT 173
             S+VP  +++ EKG V   K QG C    A A+V  I ++  K N+ ++S SEQ++VDC+ 
Sbjct:   331 SKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSK 390

Query:   174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
             +  N GC GG    +F Y++QN+    D  Y Y+      C + + +     +++   V 
Sbjct:   391 D--NFGCDGGHPFYSFLYVLQNELCLGDE-YKYKAKDDMFCLNYRCK-RKVSLSSIGAVK 446

Query:   234 PNDEESLLKAVANQPVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
              N     L  V    V+V ++   +  YS GV+NG C   LNH V  VGYG  E+
Sbjct:   447 ENQLILALNEVGPLSVNVGVNNDFVA-YSEGVYNGTCSEELNHSVLLVGYGQVEK 500

 Score = 117 (46.2 bits), Expect = 1.2e-09, Sum P(2) = 1.2e-09
 Identities = 34/105 (32%), Positives = 48/105 (45%)

Query:   235 NDEESLLKAVANQPVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG----- 289
             ND  +  + V N   S  ++ S L    G V     +T LN+      Y T E       
Sbjct:   468 NDFVAYSEGVYNGTCSEELNHSVLLVGYGQVE----KTKLNYNNKIQTYNTKENSNQPDD 523

Query:   290 --IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
               I YW+IKNSW + WGE+G+ RL R+ +     CGI     +P+
Sbjct:   524 NIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPI 568

 Score = 98 (39.6 bits), Expect = 1.6e-19, Sum P(2) = 1.6e-19
 Identities = 19/62 (30%), Positives = 36/62 (58%)

Query:    31 AEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTP 90
             A KF ++  ++ + YK   E  ++FEIFK N ++++  N     N  Y  ++N+F+D + 
Sbjct:   222 ASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLN-KNAMYKKKVNQFSDYSE 280

Query:    91 QE 92
             +E
Sbjct:   281 EE 282


>UNIPROTKB|Q8I6V0 [details] [associations]
            symbol:PF14_0553 "Cysteine proteinase falcipain-1"
            species:36329 "Plasmodium falciparum 3D7" [GO:0042540 "hemoglobin
            catabolic process" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014187 KO:K01376 HSSP:P07688 GO:GO:0042540
            RefSeq:XP_001348727.1 ProteinModelPortal:Q8I6V0 PRIDE:Q8I6V0
            EnsemblProtists:PF14_0553:mRNA GeneID:812135 KEGG:pfa:PF14_0553
            EuPathDB:PlasmoDB:PF3D7_1458000 HOGENOM:HOG000065906
            ProtClustDB:CLSZ2457715 ChEMBL:CHEMBL1250371 Uniprot:Q8I6V0
        Length = 569

 Score = 202 (76.2 bits), Expect = 1.6e-19, Sum P(2) = 1.6e-19
 Identities = 59/175 (33%), Positives = 90/175 (51%)

Query:   121 SQVPPSVNWIEKGAVTPVKYQGQC----AVAAVEGINAI--KINR-LVSLSEQQLVDCAT 173
             S+VP  +++ EKG V   K QG C    A A+V  I ++  K N+ ++S SEQ++VDC+ 
Sbjct:   331 SKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSK 390

Query:   174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
             +  N GC GG    +F Y++QN+    D  Y Y+      C + + +     +++   V 
Sbjct:   391 D--NFGCDGGHPFYSFLYVLQNELCLGDE-YKYKAKDDMFCLNYRCK-RKVSLSSIGAVK 446

Query:   234 PNDEESLLKAVANQPVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
              N     L  V    V+V ++   +  YS GV+NG C   LNH V  VGYG  E+
Sbjct:   447 ENQLILALNEVGPLSVNVGVNNDFVA-YSEGVYNGTCSEELNHSVLLVGYGQVEK 500

 Score = 117 (46.2 bits), Expect = 1.2e-09, Sum P(2) = 1.2e-09
 Identities = 34/105 (32%), Positives = 48/105 (45%)

Query:   235 NDEESLLKAVANQPVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG----- 289
             ND  +  + V N   S  ++ S L    G V     +T LN+      Y T E       
Sbjct:   468 NDFVAYSEGVYNGTCSEELNHSVLLVGYGQVE----KTKLNYNNKIQTYNTKENSNQPDD 523

Query:   290 --IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
               I YW+IKNSW + WGE+G+ RL R+ +     CGI     +P+
Sbjct:   524 NIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPI 568

 Score = 98 (39.6 bits), Expect = 1.6e-19, Sum P(2) = 1.6e-19
 Identities = 19/62 (30%), Positives = 36/62 (58%)

Query:    31 AEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTP 90
             A KF ++  ++ + YK   E  ++FEIFK N ++++  N     N  Y  ++N+F+D + 
Sbjct:   222 ASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLN-KNAMYKKKVNQFSDYSE 280

Query:    91 QE 92
             +E
Sbjct:   281 EE 282


>DICTYBASE|DDB_G0276111 [details] [associations]
            symbol:DDB_G0276111 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0276111 Pfam:PF00188
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            PROSITE:PS00139 EMBL:AAFI02000014 Gene3D:3.40.33.10
            InterPro:IPR001283 InterPro:IPR014044 PRINTS:PR00837 SMART:SM00198
            SUPFAM:SSF55797 ProtClustDB:CLSZ2429919 RefSeq:XP_643261.1
            ProteinModelPortal:Q75JH0 EnsemblProtists:DDB0169514 GeneID:8620304
            KEGG:ddi:DDB_G0276111 InParanoid:Q75JH0 OMA:GFVTSIK Uniprot:Q75JH0
        Length = 415

 Score = 244 (91.0 bits), Expect = 4.4e-19, P = 4.4e-19
 Identities = 80/267 (29%), Positives = 124/267 (46%)

Query:    74 GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKG 133
             G + YT ++   + LT        T  + S    S+  +   F+  +S     V+W   G
Sbjct:   164 GIKPYTPKVQ--SQLTENGSGGGDTKLEPSVKPISINVSSR-FILPTSSTG-DVDWKSLG 219

Query:   134 AVTPVKYQGQCA-------VAAVEGINAIKINRL----VSLSEQQLVDCATNDNNNGCYG 182
              VT +K QGQC         AA+E    IK N L    + LSEQ  V C     N GC G
Sbjct:   220 FVTSIKNQGQCGGCYSFATCAALESAYLIK-NNLPNTDIDLSEQNFVSCV----NYGCGG 274

Query:   183 GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK 242
             G        + ++ GI  +  Y Y+ + TG C ++       + T Y ++  N +E+ L 
Sbjct:   275 GNGQSCLDKL-KSTGIMYETSYPYKAV-TGSCPNVIQSPQPFKWTGYSNIQGN-KEAFLN 331

Query:   243 AVANQPV--SVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWG 300
             A+ + P+  S+ +D S  Q Y  G+++    +  NH +T VGY +++     +LIKNSWG
Sbjct:   332 ALKSGPIYASLYVD-SGFQLYKSGIYSCSQSSTPNHAITIVGYSSADNS---YLIKNSWG 387

Query:   301 QDWGEDGYFRLQRDIDQPQGQCGIAMF 327
               +GE GY RL+      +G C +  F
Sbjct:   388 TIYGESGYIRLK------EGSCNLYSF 408


>UNIPROTKB|F1RWA9 [details] [associations]
            symbol:CTSO "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 EMBL:CU855637
            Ensembl:ENSSSCT00000009707 OMA:WAFSIVG Uniprot:F1RWA9
        Length = 194

 Score = 225 (84.3 bits), Expect = 1.4e-18, P = 1.4e-18
 Identities = 53/167 (31%), Positives = 84/167 (50%)

Query:   146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK-GITNDAVY 204
             V+AVE   AIK   L  LS QQ++DC+ N  N GC GG   +A  ++ + +  + +D+ Y
Sbjct:    11 VSAVESAYAIKGQPLEVLSVQQVIDCSYN--NYGCNGGSTLNALYWLNKTQVKVVSDSEY 68

Query:   205 SYEGMSTGICDSIKAEDHAAQITNYE--DVPPNDEESLLKAVANQPVSVAIDASALQFYS 262
              ++  + G+C           I +Y   D    ++E     +   P+ V +DA + Q Y 
Sbjct:    69 PFKAQN-GLCHYFSCSHSGVSIKDYSAYDFSGQEDEMAKTLLTLGPLIVIVDAVSWQDYL 127

Query:   263 GGVFNGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
             GG+   +C +   NH V   G+  +     YW+++NSWG  WG DGY
Sbjct:   128 GGIIQHHCSSGEANHAVLVTGFDKTGS-TPYWIVRNSWGSAWGIDGY 173


>WB|WBGene00022189 [details] [associations]
            symbol:Y71H2AR.2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0004181
            "metallocarboxypeptidase activity" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] [GO:0008340 "determination of adult lifespan"
            evidence=IMP] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0008340 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 HSSP:P07711
            eggNOG:NOG331187 HOGENOM:HOG000114005 EMBL:FO081570
            RefSeq:NP_497627.1 UniGene:Cel.28419 ProteinModelPortal:Q9BL26
            SMR:Q9BL26 EnsemblMetazoa:Y71H2AR.2 GeneID:190615
            KEGG:cel:CELE_Y71H2AR.2 UCSC:Y71H2AR.2 CTD:190615
            WormBase:Y71H2AR.2 InParanoid:Q9BL26 OMA:CAMATTI NextBio:946382
            Uniprot:Q9BL26
        Length = 345

 Score = 230 (86.0 bits), Expect = 1.1e-17, P = 1.1e-17
 Identities = 74/208 (35%), Positives = 105/208 (50%)

Query:   127 VNWIEKGAVTPVKYQGQC------AV-AAVEGINAIKIN-RLVSLSEQQLVDCATNDNN- 177
             ++W EKG V PVK QG+C      A+ +++E + A   N  L+S SEQQL+DC  ND   
Sbjct:    86 LDWREKGIVGPVKDQGKCNASHAFAITSSIESMYAKATNGTLLSFSEQQLIDC--NDQGY 143

Query:   178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGIC--DSIKAEDHAAQITNYEDVPPN 235
              GC   F  +A  Y+  + GI  +A Y Y   +   C  DS K++ H  +      V   
Sbjct:   144 KGCEEQFAMNAIGYLATH-GIETEADYPYVDKTNEKCTFDSTKSKIHLKK-----GVVAE 197

Query:   236 DEESLLKA-VANQ-PVSVAIDAS-ALQFYSGGVFNGYCETFLN-HGVTA---VGYGTSEE 288
               E L K  V N  P    + A  +L  Y  G++N   E   + H + +   VGYG   E
Sbjct:   198 GNEVLGKVYVTNYGPAFFTMRAPPSLYDYKIGIYNPSIEECTSTHEIRSMVIVGYGIEGE 257

Query:   289 GIKYWLIKNSWGQDWGEDGYFRLQRDID 316
               KYW++K S+G  WGE GY +L RD++
Sbjct:   258 Q-KYWIVKGSFGTSWGEQGYMKLARDVN 284


>UNIPROTKB|E2QV47 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0097208 "alveolar lamellar body"
            evidence=IEA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0070371 "ERK1 and ERK2 cascade"
            evidence=IEA] [GO:0070324 "thyroid hormone binding" evidence=IEA]
            [GO:0060448 "dichotomous subdivision of terminal units involved in
            lung branching" evidence=IEA] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IEA] [GO:0043129 "surfactant homeostasis"
            evidence=IEA] [GO:0043066 "negative regulation of apoptotic
            process" evidence=IEA] [GO:0033619 "membrane protein proteolysis"
            evidence=IEA] [GO:0032526 "response to retinoic acid" evidence=IEA]
            [GO:0031648 "protein destabilization" evidence=IEA] [GO:0031638
            "zymogen activation" evidence=IEA] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=IEA] [GO:0016505
            "apoptotic protease activator activity" evidence=IEA] [GO:0010815
            "bradykinin catabolic process" evidence=IEA] [GO:0010813
            "neuropeptide catabolic process" evidence=IEA] [GO:0010634
            "positive regulation of epithelial cell migration" evidence=IEA]
            [GO:0010628 "positive regulation of gene expression" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=IEA] [GO:0005829 "cytosol" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0004252 "serine-type endopeptidase activity"
            evidence=IEA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IEA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 GO:GO:0016505 GO:GO:0010634
            GO:GO:0004197 GO:GO:0042599 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 Ensembl:ENSCAFT00000036196 Uniprot:E2QV47
        Length = 136

 Score = 215 (80.7 bits), Expect = 2.0e-17, P = 2.0e-17
 Identities = 48/135 (35%), Positives = 76/135 (56%)

Query:   204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA-NQPVSVAIDASA-LQFY 261
             Y Y+G   G C   +     A + +  ++  NDE+++++AVA   PVS A + ++    Y
Sbjct:     6 YPYKGQD-GDC-KYQPSKAIAFVKDVANITINDEQAMVEAVALYNPVSFAFEVTSDFMMY 63

Query:   262 SGGVFNGY-CETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
               G+++   C      +NH V AVGYG  + GI YW++KNSWG  WG +GYF ++R    
Sbjct:    64 RKGIYSSTSCHKTPDKVNHAVLAVGYG-EQNGIPYWIVKNSWGPQWGMNGYFLMERG--- 119

Query:   318 PQGQCGIAMFASFPV 332
              +  CG+A  AS+P+
Sbjct:   120 -KNMCGLAACASYPI 133


>DICTYBASE|DDB_G0288221 [details] [associations]
            symbol:DDB_G0288221 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0288221 Pfam:PF00188 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 PANTHER:PTHR12411 EMBL:AAFI02000109 Gene3D:3.40.33.10
            InterPro:IPR001283 InterPro:IPR014044 SMART:SM00198 SUPFAM:SSF55797
            MEROPS:C01.A52 ProtClustDB:CLSZ2429919 RefSeq:XP_636852.1
            ProteinModelPortal:Q54J84 EnsemblProtists:DDB0187839 GeneID:8626520
            KEGG:ddi:DDB_G0288221 InParanoid:Q54J84 Uniprot:Q54J84
        Length = 395

 Score = 220 (82.5 bits), Expect = 5.1e-16, P = 5.1e-16
 Identities = 66/218 (30%), Positives = 111/218 (50%)

Query:   126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGI----NAIKINRLVSLSEQQLVDCATN 174
             SV+W +    TPV+ QG+C       ++AA+E      N +     + LS Q  ++C T+
Sbjct:   191 SVDWSDYQ--TPVRDQGECKSCWVFGSLAALESRYLIKNGVSEKSTLHLSAQNAMNCITS 248

Query:   175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
                 GC  G+  + F Y  ++ GI  +  Y Y+ + +  C S     +  + + Y+ V  
Sbjct:   249 ----GCESGWPANVFDYF-ESSGIAFEKDYPYDAIGSDNCTS---SSNKFEYSGYDSVE- 299

Query:   235 NDEESLLKAVANQPVSVAIDA-SALQFYSGGVFNGYCE-TFLNHGVTAVGYGTSEEGIKY 292
             N ++SL++ + N P+++A+ + +A Q Y+GG+++   E   +NH V  VGY    +    
Sbjct:   300 NTKDSLIQELKNGPITIALYSDTAFQSYAGGIYDSVEEYKDVNHIVLLVGYDKPTDS--- 356

Query:   293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASF 330
             W IKNS G  WGE GY R+    D+     GI ++ SF
Sbjct:   357 WKIKNSLGTKWGELGYARITASNDK----LGILLYNSF 390


>WB|WBGene00008231 [details] [associations]
            symbol:tag-329 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            eggNOG:NOG288820 EMBL:Z70750 HSSP:P53634 HOGENOM:HOG000019851
            PIR:T20110 RefSeq:NP_505458.1 ProteinModelPortal:Q18740 SMR:Q18740
            MEROPS:C01.A36 EnsemblMetazoa:C50F4.3 GeneID:183677
            KEGG:cel:CELE_C50F4.3 UCSC:C50F4.3 CTD:183677 WormBase:C50F4.3
            InParanoid:Q18740 OMA:WIFRNSW NextBio:921986 Uniprot:Q18740
        Length = 374

 Score = 215 (80.7 bits), Expect = 1.7e-15, P = 1.7e-15
 Identities = 62/191 (32%), Positives = 90/191 (47%)

Query:   142 GQCAVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
             G  A A  E    + + + ++LSEQ++ DCA   +  GC GG   D  +YI +  G+T  
Sbjct:   171 GFAATAVAEAALTVHLKKAMNLSEQEVCDCAPK-HGPGCNGGDPVDGLEYI-KEMGLTGG 228

Query:   202 AVYSYE-GMST--GICDSIK--AEDHAAQITNYEDVPPNDEESLLKAV--ANQPVSVAID 254
               Y +    ST  G C+S K   E +  ++  Y   P N E  +   +   N P+SVA  
Sbjct:   229 KEYPFNVNRSTQLGRCESEKYDRELNPLELDYYAIDPFNAEYQMTHHLYLLNLPISVAFR 288

Query:   255 ASA-LQFYSGGVFN-GYCETFLN---HGVTAVGYGTSEEG----IKYWLIKNSWGQDWGE 305
               A L  Y  G+     C+       H    VGYGT++      + YW+ +NSW  DWG+
Sbjct:   289 TGASLSSYLSGILELADCDDEKGGHWHSGAIVGYGTTKNSAGRTVDYWIFRNSWWTDWGD 348

Query:   306 DGYFRLQRDID 316
             DGY R+ R  D
Sbjct:   349 DGYARIVRGED 359

 Score = 144 (55.7 bits), Expect = 3.3e-07, P = 3.3e-07
 Identities = 51/191 (26%), Positives = 85/191 (44%)

Query:    32 EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAI-GNRSYTLRLNKFADLTP 90
             ++FE +  +Y R YK+  E   RF+ F      V + N AA          +NKF+DL+ 
Sbjct:    45 KEFEDFIVKYKRNYKDEIEKKFRFQQFVATHNRVGKMNKAAKKAGHDTKYGINKFSDLSK 104

Query:    91 QEFIASQTGFKMSDHSSSL-KANGTPFLYKSSQ--VPPSVNWIEK--GA---VTPVKYQG 142
             +E     + F    +++++ K N      K     +P + +   K  G    + P+K Q 
Sbjct:   105 KEIHGMYSKFGPPKNNTNVPKFNLKNLRVKRQMEGLPKTFDLRNKKVGGHYIIGPIKTQD 164

Query:   143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
              CA        A  E    + + + ++LSEQ++ DCA   +  GC GG   D  +YI + 
Sbjct:   165 SCACCWGFAATAVAEAALTVHLKKAMNLSEQEVCDCAPK-HGPGCNGGDPVDGLEYI-KE 222

Query:   196 KGITNDAVYSY 206
              G+T    Y +
Sbjct:   223 MGLTGGKEYPF 233


>UNIPROTKB|F1NWG2 [details] [associations]
            symbol:CTSC "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 InterPro:IPR014882 Pfam:PF08773
            OMA:YDDFLHY GO:GO:0001913 EMBL:AADN02004805 IPI:IPI00577371
            Ensembl:ENSGALT00000027869 Uniprot:F1NWG2
        Length = 463

 Score = 213 (80.0 bits), Expect = 6.1e-15, P = 6.1e-15
 Identities = 69/214 (32%), Positives = 103/214 (48%)

Query:   142 GQCAVAAVEGINAIKINRLVS------LSEQQLVDCATNDNNNGCYGGFMD-DAFKYIIQ 194
             G C   A  G+   +I  L +       S QQ+V C+    + GC GGF    A KY+ Q
Sbjct:   256 GSCYAFASMGMLEARIRILTNNTQKPVFSPQQVVSCS--QYSQGCDGGFPYLIAGKYV-Q 312

Query:   195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP-----NDEESLLKAVANQPV 249
             + G+  +  + Y    T  C   K   +    + Y  V       N+    L+ V + P+
Sbjct:   313 DFGVVEEDCFPYTAKDTP-C-LFKRSCYHYYTSEYHYVGGFYGACNEALMKLELVLSGPM 370

Query:   250 SVAIDA-SALQFYSGGVFN--GYCETF-----LNHGVTAVGYGTS-EEGIKYWLIKNSWG 300
             +VA +  +   FY  G+++  G  + F      NH V  VGYG   E G K+W++KNSWG
Sbjct:   371 AVAFEVYNDFMFYKEGIYHHTGLKDEFNPFELTNHAVLLVGYGKDPESGEKFWIVKNSWG 430

Query:   301 QDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
               WGEDGYFR++R  D+   +  IA+ A+ P+ K
Sbjct:   431 TSWGEDGYFRIRRGTDECAIE-SIAVAAT-PIPK 462


>UNIPROTKB|F1PIF2 [details] [associations]
            symbol:CTSZ "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0060441 "epithelial tube branching involved
            in lung morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0005783 GO:GO:0005615 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GO:GO:0060441 GeneTree:ENSGT00560000076599 OMA:QCGTCTE
            EMBL:AAEX03014054 Ensembl:ENSCAFT00000019357 Uniprot:F1PIF2
        Length = 261

 Score = 193 (73.0 bits), Expect = 6.7e-15, P = 6.7e-15
 Identities = 52/165 (31%), Positives = 84/165 (50%)

Query:   163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMST--------GIC 214
             LS Q ++DCA   N   C GG     + Y  ++ GI ++   +Y+            G C
Sbjct:    76 LSVQHVLDCA---NAGSCEGGNDLPVWSYAHEH-GIPDETCNNYQAKDQECNKFNQCGTC 131

Query:   215 DSIKAEDHAAQ------ITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF-YSGGVFN 267
                K E HA Q      + +Y  +    E+ + +  AN P+S  I A+     Y+GG+  
Sbjct:   132 TEFK-ECHAIQNYTLWRVGDYGSLSGR-EKMMAEIYANGPISCGIMATEKMVNYTGGIHA 189

Query:   268 GYCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
              Y E  ++NH ++ VG+G S+ G +YW+++NSWG+ WGE G+ R+
Sbjct:   190 EYQEQAYINHVISVVGWGVSD-GTEYWIVRNSWGEPWGERGWMRI 233


>DICTYBASE|DDB_G0292462 [details] [associations]
            symbol:DDB_G0292462 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0292462 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            EMBL:AAFI02000190 RefSeq:XP_629634.1 MEROPS:C01.A56
            EnsemblProtists:DDB0184413 GeneID:8628698 KEGG:ddi:DDB_G0292462
            InParanoid:Q54D62 OMA:NTQVESH Uniprot:Q54D62
        Length = 323

 Score = 206 (77.6 bits), Expect = 9.6e-15, P = 9.6e-15
 Identities = 62/204 (30%), Positives = 96/204 (47%)

Query:   163 LSEQQLVDC---ATNDN----NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICD 215
             LS Q L+DC     +D     NNGC GGF+  A   +I N+GI +D   SY+      C 
Sbjct:    97 LSPQYLMDCDGSCVSDGVSGCNNGCKGGFVGLALTRLI-NEGIVSDECLSYQASKDSSCP 155

Query:   216 SIKAEDHAAQITN---YEDVP----PNDEESLLKAVANQPVSVAIDA-SALQFYSGGVFN 267
             +    D  + I+N   Y+       P  +++  + + N PV       S  + +   V+ 
Sbjct:   156 TTC--DDGSPISNTTIYKATSCRAFPTVQDAQYEIMTNGPVIATFMLYSDFKPHKWDVYI 213

Query:   268 GYCETFL-NHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI-- 324
                 T + +H V  VG+GT+ +G+ YW+  NSWG  WG+ GYF+++R  D+   + G   
Sbjct:   214 KSSNTQVESHAVRVVGWGTTSDGVDYWIAANSWGTGWGDKGYFKIRRGSDEAAFEEGFIT 273

Query:   325 --AMFASFPVSKESAQPSSADKSS 346
               A  AS P S+   +      SS
Sbjct:   274 VTADTASVPTSQYGLEYQFGGNSS 297


>FB|FBgn0033873 [details] [associations]
            symbol:CG6337 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 EMBL:AE013599
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 HSSP:P80067 EMBL:AY084123
            RefSeq:NP_610905.1 UniGene:Dm.5230 SMR:Q7JYA0 IntAct:Q7JYA0
            EnsemblMetazoa:FBtr0087646 GeneID:36530 KEGG:dme:Dmel_CG6337
            UCSC:CG6337-RA FlyBase:FBgn0033873 eggNOG:NOG310593
            InParanoid:Q7JYA0 OMA:NRTTYRE OrthoDB:EOG4MCVFZ GenomeRNAi:36530
            NextBio:799041 Uniprot:Q7JYA0
        Length = 340

 Score = 205 (77.2 bits), Expect = 1.8e-14, P = 1.8e-14
 Identities = 92/333 (27%), Positives = 147/333 (44%)

Query:    25 FDEGSIAEKFEQWKAQYGRTYKE-SAEN-SKRFEIFKDNLVAVERFNNAAIGNRS-YTLR 81
             F+ G     F+ ++  + +TY   SA N +  + I+  N VA  + N  A  NR+ Y   
Sbjct:    19 FNHGQDLVDFQTYEDNFNKTYASTSARNFANYYFIYNRNQVA--QHNAQADRNRTTYREA 76

Query:    82 LNKFADLTPQEFIA------SQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAV 135
             +N+F+D+   +F A      +      SD  +S  A+ +  +     +  +V   ++G  
Sbjct:    77 VNQFSDIRLIQFAALLPKAVNTVTSAASDPPASQAASASFDIITDFGLTVAVE--DQGVN 134

Query:   136 TPVKYQGQCAVAAVEGINAIKI-NRLVS-LSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
                 +       AVE +NA++  N L S LS QQL+DCA      GC       A  Y+ 
Sbjct:   135 CSSSW-AYATAKAVEIMNAVQTANPLPSSLSAQQLLDCA--GMGTGCSTQTPLAALNYLT 191

Query:   194 QNKGITNDAVY---SY---EGMST-GICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
             Q   +T+  +Y    Y     + T G+C    +     ++  Y  V  ND+ ++++ V+N
Sbjct:   192 Q---LTDAYLYPEVDYPNNNSLKTPGMCQPPSSVSVGVKLAGYSTVADNDDAAVMRYVSN 248

Query:   247 Q-PVSVAIDASALQF--YSGGVFNGYCETFLN----HGVTAVGYGTS-EEGIKYWLIKNS 298
               PV V  + +   F  YS GV+        N      +  VGY    +  + YW   NS
Sbjct:   249 GFPVIVEYNPATFGFMQYSSGVYVQETRALTNPKSSQFLVVVGYDHDVDSNLDYWRCLNS 308

Query:   299 WGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
             +G  WGE+GY R+ R  +QP     IA  A FP
Sbjct:   309 FGDTWGEEGYIRIVRRSNQP-----IAKNAVFP 336


>WB|WBGene00008861 [details] [associations]
            symbol:F15D4.4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 SMART:SM00848 PROSITE:PS00640
            GeneTree:ENSGT00560000076599 EMBL:Z80344 HSSP:P53634
            eggNOG:NOG310593 PIR:T20981 ProteinModelPortal:Q93512 SMR:Q93512
            MEROPS:C01.A45 EnsemblMetazoa:F15D4.4 KEGG:cel:CELE_F15D4.4
            UCSC:F15D4.4 CTD:184530 WormBase:F15D4.4 InParanoid:Q93512
            OMA:ITMEQNI NextBio:925068 Uniprot:Q93512
        Length = 608

 Score = 211 (79.3 bits), Expect = 1.9e-14, P = 1.9e-14
 Identities = 78/287 (27%), Positives = 125/287 (43%)

Query:    50 ENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSS 108
             E  KRF ++      V+  N    +G  SY +  N+F+     E             +++
Sbjct:   150 EGLKRFNVYSKVKKEVDEHNIMYELGMSSYKMSTNQFSVALDGEVAPLTLNLDALTPTAT 209

Query:   109 LKANGTPFLYKSSQVPPSVNWIE-KGAVTPVKYQGQC----AVAAVEGINAIKINRLVSL 163
             +    T    K     P+V+W      +      G C     ++ +E   AI+     SL
Sbjct:   210 V-IPATISSRKKRDTEPTVDWRPFLKPILDQSTCGGCWAFSMISMIESFFAIQGYNTSSL 268

Query:   164 SEQQLVDCATN-DN-----NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSI 217
             S QQL+ C T  D+     N GC GG+   A  Y+ +     + ++  ++   T  CDS 
Sbjct:   269 SVQQLLTCDTKVDSTYGLANVGCKGGYFQIAGSYL-EVSAARDASLIPFDLEDTS-CDSS 326

Query:   218 KAEDHAAQITNYED--VPPND--------EESLLKAVANQPVSVAIDASA-LQFYSGGVF 266
                     I  ++D  +  N         E+++   V   P++V + A   +  YS GV+
Sbjct:   327 FFPPVVPTILLFDDGYISGNFTAAQLITMEQNIEDKVRKGPIAVGMAAGPDIYKYSEGVY 386

Query:   267 NGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
             +G C T +NH V  VG+ T +    YW+I+NSWG  WGE GYFR++R
Sbjct:   387 DGDCGTIINHAVVIVGF-TDD----YWIIRNSWGASWGEAGYFRVKR 428


>UNIPROTKB|F1PSK8 [details] [associations]
            symbol:F1PSK8 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 InterPro:IPR014882
            Pfam:PF08773 EMBL:AAEX03012741 Ensembl:ENSCAFT00000007054
            Uniprot:F1PSK8
        Length = 405

 Score = 205 (77.2 bits), Expect = 3.7e-14, P = 3.7e-14
 Identities = 62/184 (33%), Positives = 94/184 (51%)

Query:   163 LSEQQLVDCATNDNNNGCYGGFMD-DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED 221
             LS Q++V C+      GC GGF    A KY  Q+ G+  +A + Y G  +    +     
Sbjct:   226 LSPQEIVSCS--QYAQGCEGGFPYLIAGKYA-QDFGLVEEACFPYAGSDSPCKPNDCFRY 282

Query:   222 HAAQITNYEDVPPNDEESLLKA--VANQPVSVAIDASALQF-YSGGVF--NGYCETF--- 273
             ++++            E+L+K   V + P++VA +     F Y  G++   G  + F   
Sbjct:   283 YSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPF 342

Query:   274 --LNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASF 330
                NH V  VGYGT S  G+ YW++KNSWG  WGEDGYFR++R  D+   +  IA+ A+ 
Sbjct:   343 ELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIE-SIAVAAT- 400

Query:   331 PVSK 334
             P+ K
Sbjct:   401 PIPK 404


>UNIPROTKB|J9P219 [details] [associations]
            symbol:J9P219 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 InterPro:IPR014882
            Pfam:PF08773 OMA:YDDFLHY EMBL:AAEX03012741
            Ensembl:ENSCAFT00000050015 Uniprot:J9P219
        Length = 406

 Score = 205 (77.2 bits), Expect = 3.7e-14, P = 3.7e-14
 Identities = 62/184 (33%), Positives = 94/184 (51%)

Query:   163 LSEQQLVDCATNDNNNGCYGGFMD-DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED 221
             LS Q++V C+      GC GGF    A KY  Q+ G+  +A + Y G  +    +     
Sbjct:   227 LSPQEIVSCS--QYAQGCEGGFPYLIAGKYA-QDFGLVEEACFPYAGSDSPCKPNDCFRY 283

Query:   222 HAAQITNYEDVPPNDEESLLKA--VANQPVSVAIDASALQF-YSGGVF--NGYCETF--- 273
             ++++            E+L+K   V + P++VA +     F Y  G++   G  + F   
Sbjct:   284 YSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPF 343

Query:   274 --LNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASF 330
                NH V  VGYGT S  G+ YW++KNSWG  WGEDGYFR++R  D+   +  IA+ A+ 
Sbjct:   344 ELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIE-SIAVAAT- 401

Query:   331 PVSK 334
             P+ K
Sbjct:   402 PIPK 405


>UNIPROTKB|F1STR1 [details] [associations]
            symbol:CTSC "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0004252
            "serine-type endopeptidase activity" evidence=IEA] [GO:0001913 "T
            cell mediated cytotoxicity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 KO:K01275 InterPro:IPR014882
            Pfam:PF08773 OMA:YDDFLHY GO:GO:0001913 EMBL:CU855751
            RefSeq:XP_003129789.1 UniGene:Ssc.6155 Ensembl:ENSSSCT00000016280
            GeneID:100522387 KEGG:ssc:100522387 Uniprot:F1STR1
        Length = 463

 Score = 206 (77.6 bits), Expect = 4.1e-14, P = 4.1e-14
 Identities = 70/214 (32%), Positives = 104/214 (48%)

Query:   142 GQCAVAAVEGINAIKINRLVS------LSEQQLVDCATNDNNNGCYGGFMD-DAFKYIIQ 194
             G C   A  G+   +I  L +      LS Q++V C+      GC GGF    A KY  Q
Sbjct:   256 GSCYSFASMGMMEARIRILTNNTQTPILSPQEVVSCS--QYAQGCAGGFPYLIAGKYA-Q 312

Query:   195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP---NDEESLLKA--VANQPV 249
             + G+  +A + Y G  +  C ++K        + Y  V        E+L+K   V + P+
Sbjct:   313 DFGLVEEACFPYTGTDSP-C-TVKEGCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPM 370

Query:   250 SVAIDA-SALQFYSGGVFN--GYCETF-----LNHGVTAVGYGTS-EEGIKYWLIKNSWG 300
             +VA +       Y  G+++  G  + F      NH V  VGYGT    G+ YW++KNSWG
Sbjct:   371 AVAFEVYDDFLHYRKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDLASGMDYWIVKNSWG 430

Query:   301 QDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
               WGEDGYFR++R  D+   +  IA+ A+ P+ K
Sbjct:   431 TSWGEDGYFRIRRGTDECAIE-SIAVAAT-PIPK 462


>UNIPROTKB|O97578 [details] [associations]
            symbol:CTSC "Dipeptidyl peptidase 1" species:9615 "Canis
            lupus familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 EMBL:AF060171 RefSeq:NP_001182763.1
            UniGene:Cfa.28653 ProteinModelPortal:O97578 SMR:O97578
            MEROPS:C01.070 PRIDE:O97578 GeneID:403458 KEGG:cfa:403458
            InParanoid:O97578 NextBio:20816976 Uniprot:O97578
        Length = 435

 Score = 205 (77.2 bits), Expect = 4.6e-14, P = 4.6e-14
 Identities = 62/184 (33%), Positives = 94/184 (51%)

Query:   163 LSEQQLVDCATNDNNNGCYGGFMD-DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED 221
             LS Q++V C+      GC GGF    A KY  Q+ G+  +A + Y G  +    +     
Sbjct:   256 LSPQEIVSCS--QYAQGCEGGFPYLIAGKYA-QDFGLVEEACFPYAGSDSPCKPNDCFRY 312

Query:   222 HAAQITNYEDVPPNDEESLLKA--VANQPVSVAIDASALQF-YSGGVF--NGYCETF--- 273
             ++++            E+L+K   V + P++VA +     F Y  G++   G  + F   
Sbjct:   313 YSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPF 372

Query:   274 --LNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASF 330
                NH V  VGYGT S  G+ YW++KNSWG  WGEDGYFR++R  D+   +  IA+ A+ 
Sbjct:   373 ELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIE-SIAVAAT- 430

Query:   331 PVSK 334
             P+ K
Sbjct:   431 PIPK 434


>TAIR|locus:2204873 [details] [associations]
            symbol:AT1G02300 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002684 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 KO:K01363
            PANTHER:PTHR12411:SF16 OMA:ADDINAC IPI:IPI00534431
            RefSeq:NP_563647.1 UniGene:At.43952 ProteinModelPortal:F4HVZ1
            SMR:F4HVZ1 MEROPS:C01.A10 EnsemblPlants:AT1G02300.1 GeneID:839576
            KEGG:ath:AT1G02300 ArrayExpress:F4HVZ1 Uniprot:F4HVZ1
        Length = 379

 Score = 173 (66.0 bits), Expect = 4.6e-14, Sum P(2) = 4.6e-14
 Identities = 35/107 (32%), Positives = 57/107 (53%)

Query:   232 VPPNDEESLLKAVANQPVSVAIDA-SALQFYSGGVFNGYCETFLN-HGVTAVGYGTSEEG 289
             + P+ ++ + +   N PV VA         Y  GV+     T +  H V  +G+GTS++G
Sbjct:   260 INPDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDG 319

Query:   290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI--AMFASFPVSK 334
               YWL+ N W + WG+DGYF+++R  ++    CGI  ++ A  P  K
Sbjct:   320 EDYWLLANQWNRSWGDDGYFKIRRGTNE----CGIEQSVVAGLPSEK 362

 Score = 72 (30.4 bits), Expect = 4.6e-14, Sum P(2) = 4.6e-14
 Identities = 26/80 (32%), Positives = 35/80 (43%)

Query:   142 GQC-AVAAVEGIN---AIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
             G C A  AVE ++    IK N  VSLS   ++ C       GC GGF   A+ Y   +  
Sbjct:   149 GSCWAFGAVESLSDRFCIKYNLNVSLSANDVIACCGLLCGFGCNGGFPMGAWLYFKYHGV 208

Query:   198 ITN--DAVYSYEGMSTGICD 215
             +T   D  +   G S   C+
Sbjct:   209 VTQECDPYFDNTGCSHPGCE 228


>UNIPROTKB|Q9UBR2 [details] [associations]
            symbol:CTSZ "Cathepsin Z" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0060441 "epithelial tube
            branching involved in lung morphogenesis" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            Reactome:REACT_11123 Reactome:REACT_17015 InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 EMBL:CH471077 GO:GO:0005615 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AL109840 GO:GO:0060441 MEROPS:C01.013 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN
            BRENDA:3.4.18.1 EMBL:AF073890 EMBL:AF032906 EMBL:AF136273
            EMBL:AF136276 EMBL:AF136274 EMBL:AF136275 EMBL:AK314931
            EMBL:BC042168 EMBL:AF009923 IPI:IPI00002745 RefSeq:NP_001327.2
            UniGene:Hs.252549 PDB:1DEU PDB:1EF7 PDBsum:1DEU PDBsum:1EF7
            ProteinModelPortal:Q9UBR2 SMR:Q9UBR2 STRING:Q9UBR2 DMDM:12643324
            PaxDb:Q9UBR2 PeptideAtlas:Q9UBR2 PRIDE:Q9UBR2 DNASU:1522
            Ensembl:ENST00000217131 GeneID:1522 KEGG:hsa:1522 UCSC:uc002yai.2
            GeneCards:GC20M057570 HGNC:HGNC:2547 HPA:CAB025114 MIM:603169
            neXtProt:NX_Q9UBR2 PharmGKB:PA27043 InParanoid:Q9UBR2 OMA:QCGTCTE
            PhylomeDB:Q9UBR2 BindingDB:Q9UBR2 ChEMBL:CHEMBL4160 ChiTaRS:CTSZ
            EvolutionaryTrace:Q9UBR2 GenomeRNAi:1522 NextBio:6299 Bgee:Q9UBR2
            CleanEx:HS_CTSZ Genevestigator:Q9UBR2 GermOnline:ENSG00000101160
            Uniprot:Q9UBR2
        Length = 303

 Score = 199 (75.1 bits), Expect = 4.7e-14, P = 4.7e-14
 Identities = 63/197 (31%), Positives = 98/197 (49%)

Query:   139 KYQGQC-AVAAVEGI-NAIKINRLVS-----LSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
             +Y G C A A+   + + I I R  +     LS Q ++DC    N   C GG     + Y
Sbjct:    87 QYCGSCWAHASTSAMADRINIKRKGAWPSTLLSVQNVIDCG---NAGSCEGGNDLSVWDY 143

Query:   192 IIQNKGITNDAVYSYEGMST--------GICDSIKAEDHAAQITNYEDVPPND------- 236
               Q+ GI ++   +Y+            G C+  K E HA  I NY      D       
Sbjct:   144 AHQH-GIPDETCNNYQAKDQECDKFNQCGTCNEFK-ECHA--IRNYTLWRVGDYGSLSGR 199

Query:   237 EESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCET-FLNHGVTAVGYGTSEEGIKYWL 294
             E+ + +  AN P+S  I A+  L  Y+GG++  Y +T ++NH V+  G+G S+ G +YW+
Sbjct:   200 EKMMAEIYANGPISCGIMATERLANYTGGIYAEYQDTTYINHVVSVAGWGISD-GTEYWI 258

Query:   295 IKNSWGQDWGEDGYFRL 311
             ++NSWG+ WGE G+ R+
Sbjct:   259 VRNSWGEPWGERGWLRI 275


>ZFIN|ZDB-GENE-030619-9 [details] [associations]
            symbol:ctsc "cathepsin C" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030619-9 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 MEROPS:C01.070 HSSP:P43235
            EMBL:BC064286 IPI:IPI00486570 RefSeq:NP_999887.1 UniGene:Dr.32463
            ProteinModelPortal:Q6P2V1 SMR:Q6P2V1 PRIDE:Q6P2V1 GeneID:368704
            KEGG:dre:368704 InParanoid:Q6P2V1 NextBio:20813127
            ArrayExpress:Q6P2V1 Bgee:Q6P2V1 Uniprot:Q6P2V1
        Length = 455

 Score = 205 (77.2 bits), Expect = 5.2e-14, P = 5.2e-14
 Identities = 61/185 (32%), Positives = 95/185 (51%)

Query:   164 SEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA 223
             S QQ+V C+    + GC GGF     KYI Q+ GI  +  + Y G S   C+ + A+   
Sbjct:   277 SPQQVVSCS--QYSQGCDGGFPYLIGKYI-QDFGIVEEDCFPYTG-SDSPCN-LPAKCTK 331

Query:   224 AQITNYEDVPP-----NDEESLLKAVANQPVSVAIDASA-LQFYSGGVFN--GYCET--- 272
                ++Y  V       ++   +L+ V N P+ VA++       Y  G+++  G  +    
Sbjct:   332 YYASDYHYVGGFYGGCSESAMMLELVKNGPMGVALEVYPDFMNYKEGIYHHTGLRDANNP 391

Query:   273 --FLNHGVTAVGYGTSEE-GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFAS 329
                 NH V  VGYG   + G KYW++KNSWG  WGE+G+FR++R  D+   +  IA+ A+
Sbjct:   392 FELTNHAVLLVGYGQCHKTGEKYWIVKNSWGSGWGENGFFRIRRGTDECAIE-SIAVAAT 450

Query:   330 FPVSK 334
              P+ K
Sbjct:   451 -PIPK 454


>UNIPROTKB|P53634 [details] [associations]
            symbol:CTSC "Dipeptidyl peptidase 1" species:9606 "Homo
            sapiens" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0005794
            "Golgi apparatus" evidence=IEA] [GO:0007568 "aging" evidence=IEA]
            [GO:0010033 "response to organic substance" evidence=IEA]
            [GO:0031404 "chloride ion binding" evidence=IEA] [GO:0042802
            "identical protein binding" evidence=IEA] [GO:0043621 "protein
            self-association" evidence=IEA] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0006508 "proteolysis" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0006955
            "immune response" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005783 GO:GO:0005794 Reactome:REACT_6900
            GO:GO:0006955 GO:GO:0007568 GO:GO:0010033 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004252 GO:GO:0005764 GO:GO:0031404 CTD:1075
            HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ InterPro:IPR014882
            Pfam:PF08773 MEROPS:C01.070 EMBL:X87212 EMBL:U79415 EMBL:AF234263
            EMBL:AF234264 EMBL:AF254757 EMBL:AF525032 EMBL:AF525033
            EMBL:AK292117 EMBL:AK311923 EMBL:AK223038 EMBL:BX537913
            EMBL:AC011088 EMBL:CH471185 EMBL:BC054028 EMBL:BC100891
            EMBL:BC100892 EMBL:BC100893 EMBL:BC100894 EMBL:BC109386
            EMBL:BC110071 EMBL:BC113850 EMBL:BC113897 IPI:IPI00022810
            IPI:IPI00171323 IPI:IPI00872258 PIR:S23941 PIR:S66504
            RefSeq:NP_001107645.1 RefSeq:NP_001805.3 RefSeq:NP_680475.1
            UniGene:Hs.128065 PDB:1K3B PDB:2DJF PDB:2DJG PDB:3PDF PDBsum:1K3B
            PDBsum:2DJF PDBsum:2DJG PDBsum:3PDF ProteinModelPortal:P53634
            SMR:P53634 IntAct:P53634 MINT:MINT-4655964 STRING:P53634
            PhosphoSite:P53634 DMDM:1705632 PaxDb:P53634 PRIDE:P53634
            DNASU:1075 Ensembl:ENST00000227266 Ensembl:ENST00000524463
            Ensembl:ENST00000529974 GeneID:1075 KEGG:hsa:1075 UCSC:uc001pck.4
            UCSC:uc001pcm.4 GeneCards:GC11M088026 HGNC:HGNC:2528 HPA:CAB025364
            MIM:170650 MIM:245000 MIM:245010 MIM:602365 neXtProt:NX_P53634
            Orphanet:2342 Orphanet:678 PharmGKB:PA27028 HOGENOM:HOG000127503
            InParanoid:P53634 OMA:YDDFLHY PhylomeDB:P53634
            BioCyc:MetaCyc:HS03265-MONOMER SABIO-RK:P53634 BindingDB:P53634
            ChEMBL:CHEMBL2252 EvolutionaryTrace:P53634 GenomeRNAi:1075
            NextBio:4488 PMAP-CutDB:P53634 ArrayExpress:P53634 Bgee:P53634
            Genevestigator:P53634 GermOnline:ENSG00000109861 GO:GO:0001913
            Uniprot:P53634
        Length = 463

 Score = 205 (77.2 bits), Expect = 5.4e-14, P = 5.4e-14
 Identities = 64/187 (34%), Positives = 96/187 (51%)

Query:   163 LSEQQLVDCATNDNNNGCYGGFMD-DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED 221
             LS Q++V C+      GC GGF    A KY  Q+ G+  +A + Y G  +  C  +K + 
Sbjct:   283 LSPQEVVSCS--QYAQGCEGGFPYLIAGKYA-QDFGLVEEACFPYTGTDSP-C-KMKEDC 337

Query:   222 HAAQITNYEDVPP---NDEESLLKA--VANQPVSVAIDA-SALQFYSGGVFN--GYCETF 273
                  + Y  V        E+L+K   V + P++VA +       Y  G+++  G  + F
Sbjct:   338 FRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPF 397

Query:   274 -----LNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMF 327
                   NH V  VGYGT S  G+ YW++KNSWG  WGE+GYFR++R  D+   +  IA+ 
Sbjct:   398 NPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIE-SIAVA 456

Query:   328 ASFPVSK 334
             A+ P+ K
Sbjct:   457 AT-PIPK 462


>DICTYBASE|DDB_G0283401 [details] [associations]
            symbol:ctsZ "cathepsin Z precursor" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            dictyBase:DDB_G0283401 GO:GO:0005615 GenomeReviews:CM000153_GR
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 EMBL:AAFI02000055 KO:K08568 OMA:QCGTCTE
            eggNOG:NOG275763 RefSeq:XP_639036.1 ProteinModelPortal:Q54R55
            IntAct:Q54R55 MEROPS:C01.A60 PRIDE:Q54R55
            EnsemblProtists:DDB0233836 GeneID:8624061 KEGG:ddi:DDB_G0283401
            InParanoid:Q54R55 Uniprot:Q54R55
        Length = 296

 Score = 197 (74.4 bits), Expect = 6.6e-14, P = 6.6e-14
 Identities = 58/197 (29%), Positives = 98/197 (49%)

Query:   139 KYQGQC-AVAAVEGIN-AIKINRL-----VSLSEQQLVDCATNDNNNG-CYGGFMDDAFK 190
             +Y G C A A+   I+  IKI R      V+++ Q L+DC    N  G C GG   DAF 
Sbjct:    83 QYCGGCWAFASTSSISDRIKIQRKAAFPDVNVAPQHLIDC----NGGGTCDGGDPGDAFA 138

Query:   191 YIIQNKGITNDAVYSYEGMST--------------GICDSIKAEDHAAQITNYEDVPPND 236
             +I +N GI ++    Y+  +               G C +I    +   +T Y  V    
Sbjct:   139 FINEN-GIVDETCKPYQAKNLPDECSPACKTCNPDGTCQAIPVHTNIT-VTEYGSVR-GA 195

Query:   237 EESLLKAVANQPVSVAIDA-SALQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWL 294
             ++ + +  A  P++ +IDA S L+ Y+ G+F  +  +   NH ++ +G+G  ++   YW+
Sbjct:   196 KDMMAEIYARGPIACSIDATSKLEAYTSGIFKEFKLDPLPNHIISVIGWGV-QDSTPYWI 254

Query:   295 IKNSWGQDWGEDGYFRL 311
             ++NSWG  +GE G+F +
Sbjct:   255 VRNSWGSYYGEGGFFNI 271


>RGD|708479 [details] [associations]
            symbol:Ctsz "cathepsin Z" species:10116 "Rattus norvegicus"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=TAS]
            [GO:0005615 "extracellular space" evidence=IEA;ISO] [GO:0005783
            "endoplasmic reticulum" evidence=IEA;ISO] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0060441 "epithelial tube branching involved in
            lung morphogenesis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:708479 GO:GO:0005576 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004197 MEROPS:C01.013 CTD:1522 HOVERGEN:HBG004456 KO:K08568
            EMBL:AB023781 EMBL:BC091110 IPI:IPI00207663 RefSeq:NP_899159.1
            UniGene:Rn.1475 ProteinModelPortal:Q9R1T3 SMR:Q9R1T3 PRIDE:Q9R1T3
            GeneID:252929 KEGG:rno:252929 BindingDB:Q9R1T3 NextBio:624097
            Genevestigator:Q9R1T3 Uniprot:Q9R1T3
        Length = 306

 Score = 198 (74.8 bits), Expect = 6.7e-14, P = 6.7e-14
 Identities = 49/165 (29%), Positives = 84/165 (50%)

Query:   163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMST--------GIC 214
             LS Q ++DC    N   C GG     ++Y  ++ GI ++   +Y+            G C
Sbjct:   120 LSVQNVIDCG---NAGSCEGGNDLPVWEYAHKH-GIPDETCNNYQAKDQECDKFNQCGTC 175

Query:   215 DSIKAEDHAAQ------ITNYEDVPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFN 267
                K E H  Q      + +Y  +    E+ + +  AN P+S  I A+  +  Y+GG++ 
Sbjct:   176 TEFK-ECHTIQNYTLWRVGDYGSLSGR-EKMMAEIYANGPISCGIMATERMSNYTGGIYT 233

Query:   268 GYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
              Y  +  +NH ++  G+G S +GI+YW+++NSWG+ WGE G+ R+
Sbjct:   234 EYQNQAIINHIISVAGWGVSNDGIEYWIVRNSWGEPWGERGWMRI 278


>TAIR|locus:2133402 [details] [associations]
            symbol:AT4G01610 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0005829 "cytosol" evidence=IDA] [GO:0000902 "cell
            morphogenesis" evidence=RCA] [GO:0006635 "fatty acid
            beta-oxidation" evidence=RCA] [GO:0010162 "seed dormancy process"
            evidence=RCA] [GO:0016049 "cell growth" evidence=RCA] [GO:0048193
            "Golgi vesicle transport" evidence=RCA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0005773 EMBL:CP002687
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 eggNOG:NOG315657
            HOGENOM:HOG000241341 KO:K01363 PANTHER:PTHR12411:SF16 OMA:DAIPDHF
            HSSP:P07858 ProtClustDB:CLSN2687619 EMBL:AF370193 EMBL:AY065167
            EMBL:AY114015 EMBL:AY086034 EMBL:AF083797 EMBL:BT001190
            EMBL:AK175280 EMBL:AK175481 EMBL:AK175539 EMBL:AK176165
            EMBL:AK176244 EMBL:AK176281 EMBL:AK176330 EMBL:AK176416
            EMBL:AK176433 EMBL:AK176487 EMBL:AK221398 EMBL:AK230235
            IPI:IPI00530811 RefSeq:NP_567215.1 UniGene:At.24471
            ProteinModelPortal:Q94K85 SMR:Q94K85 STRING:Q94K85 MEROPS:C01.144
            PaxDb:Q94K85 PRIDE:Q94K85 EnsemblPlants:AT4G01610.1 GeneID:826792
            KEGG:ath:AT4G01610 TAIR:At4g01610 InParanoid:Q94K85
            PhylomeDB:Q94K85 Genevestigator:Q94K85 Uniprot:Q94K85
        Length = 359

 Score = 163 (62.4 bits), Expect = 8.6e-14, Sum P(2) = 8.6e-14
 Identities = 38/120 (31%), Positives = 58/120 (48%)

Query:   219 AEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA-SALQFYSGGVFNGYCETFLN-H 276
             +E     ++ Y  V  N ++ + +   N PV V+         Y  GV+     + +  H
Sbjct:   228 SESKHYSVSTYT-VKSNPQDIMAEVYKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGH 286

Query:   277 GVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIA--MFASFPVSK 334
              V  +G+GTS EG  YWL+ N W + WG+DGYF ++R  ++    CGI     A  P SK
Sbjct:   287 AVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRRGTNE----CGIEDEPVAGLPSSK 342

 Score = 80 (33.2 bits), Expect = 8.6e-14, Sum P(2) = 8.6e-14
 Identities = 39/156 (25%), Positives = 63/156 (40%)

Query:    66 ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPP 125
             +RF+NA +      L +      TP++      G  +  H  SLK    P  + +    P
Sbjct:    65 DRFSNATVAEFKRLLGVKP----TPKKHFL---GVPIVSHDPSLKL---PKAFDARTAWP 114

Query:   126 SVNWIEKGAVTPVKYQGQC-AVAAVEGIN---AIKINRLVSLSEQQLVDCATNDNNNGCY 181
                 I  G +    + G C A  AVE ++    I+    +SLS   L+ C      +GC 
Sbjct:   115 QCTSI--GNILDQGHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLACCGFRCGDGCD 172

Query:   182 GGFMDDAFKYIIQNKGITN--DAVYSYEGMSTGICD 215
             GG+   A++Y   +  +T   D  +   G S   C+
Sbjct:   173 GGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCE 208


>ZFIN|ZDB-GENE-041010-139 [details] [associations]
            symbol:ctsz "cathepsin Z" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0001525 "angiogenesis"
            evidence=IMP] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 ZFIN:ZDB-GENE-041010-139 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0001525
            CTD:1522 HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568
            OrthoDB:EOG42Z4QN UniGene:Dr.935 eggNOG:NOG275763 EMBL:BC083369
            IPI:IPI00483065 RefSeq:NP_001006043.1 ProteinModelPortal:Q5XJD4
            SMR:Q5XJD4 STRING:Q5XJD4 GeneID:450022 KEGG:dre:450022
            InParanoid:Q5XJD4 NextBio:20833005 ArrayExpress:Q5XJD4
            Uniprot:Q5XJD4
        Length = 301

 Score = 195 (73.7 bits), Expect = 1.4e-13, P = 1.4e-13
 Identities = 48/165 (29%), Positives = 85/165 (51%)

Query:   163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYE------------GMS 210
             LS Q ++DC    +   C GG     ++Y   NKGI ++   +Y+            G  
Sbjct:   110 LSVQNVIDCG---DAGSCSGGDHSGVWEYA-HNKGIPDETCNNYQAKDQDCKPFNQCGTC 165

Query:   211 T--GICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS-ALQFYSGGVFN 267
             T  G+C+ +K      ++ +Y      D+    +  +  P+S  I A+  L  Y+GG+++
Sbjct:   166 TTFGVCNIVK-NFTLWKVGDYGSASGLDKMKA-EIYSGGPISCGIMATDKLDAYTGGLYS 223

Query:   268 GYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
              Y  E ++NH V+  G+G  E G+++W+++NSWG+ WGE G+ R+
Sbjct:   224 EYVQEPYINHIVSVAGWGVDENGVEFWVVRNSWGEPWGEKGWLRI 268


>UNIPROTKB|J9NSE7 [details] [associations]
            symbol:CTSC "Dipeptidyl peptidase 1" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 InterPro:IPR014882 Pfam:PF08773
            EMBL:AAEX03017125 Ensembl:ENSCAFT00000014269 OMA:INGQICH
            Uniprot:J9NSE7
        Length = 458

 Score = 199 (75.1 bits), Expect = 2.7e-13, P = 2.7e-13
 Identities = 63/185 (34%), Positives = 93/185 (50%)

Query:   163 LSEQQLVDCATNDNNNGCYGGFMD-DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED 221
             LS Q++V C+      GC GGF    A KY  Q+ G+ ++A +SY G S   C       
Sbjct:   279 LSPQEIVSCS--QYAQGCEGGFPYLIAGKYA-QDFGLVDEACFSYAG-SDSPCKPNDCFH 334

Query:   222 HAAQITNYEDV---PPNDEESLLKAVANQPVSVAIDASALQF-YSGGVF--NGYCETF-- 273
             + +   +Y        N+    L+ V + P++VA +     F Y  G++   G  +    
Sbjct:   335 YYSSEYHYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPINP 394

Query:   274 ---LNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFAS 329
                 NH V  VGYGT S  G+ YW++KNSWG  WGEDGYF++ R  D+   +  IA+ A+
Sbjct:   395 FELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFQICRGTDECAIE-SIAVAAT 453

Query:   330 FPVSK 334
              P+ K
Sbjct:   454 -PIPK 457


>MGI|MGI:1891190 [details] [associations]
            symbol:Ctsz "cathepsin Z" species:10090 "Mus musculus"
            [GO:0005615 "extracellular space" evidence=ISO] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1891190 GO:GO:0005783 GO:GO:0005615 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0060441 GeneTree:ENSGT00560000076599 MEROPS:C01.013 CTD:1522
            HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN OMA:QCGTCTE
            ChiTaRS:CTSZ EMBL:AJ242663 EMBL:AF136277 EMBL:AF136278
            EMBL:BC008619 IPI:IPI00986833 RefSeq:NP_071720.1 UniGene:Mm.156919
            ProteinModelPortal:Q9WUU7 SMR:Q9WUU7 IntAct:Q9WUU7 STRING:Q9WUU7
            PaxDb:Q9WUU7 PRIDE:Q9WUU7 Ensembl:ENSMUST00000016400 GeneID:64138
            KEGG:mmu:64138 InParanoid:Q9WUU7 NextBio:319927 Bgee:Q9WUU7
            CleanEx:MM_CTSZ Genevestigator:Q9WUU7 GermOnline:ENSMUSG00000016256
            Uniprot:Q9WUU7
        Length = 306

 Score = 193 (73.0 bits), Expect = 3.0e-13, P = 3.0e-13
 Identities = 48/167 (28%), Positives = 85/167 (50%)

Query:   161 VSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMST--------G 212
             + LS Q ++DC    N   C GG     ++Y  ++ GI ++   +Y+            G
Sbjct:   118 ILLSVQNVIDCG---NAGSCEGGNDLPVWEYAHKH-GIPDETCNNYQAKDQDCDKFNQCG 173

Query:   213 ICDSIKAEDHAAQ------ITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF-YSGGV 265
              C   K E H  Q      + +Y  +    E+ + +  AN P+S  I A+ +   Y+GG+
Sbjct:   174 TCTEFK-ECHTIQNYTLWRVGDYGSLSGR-EKMMAEIYANGPISCGIMATEMMSNYTGGI 231

Query:   266 FNGYCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
             +  + +   +NH ++  G+G S +GI+YW+++NSWG+ WGE G+ R+
Sbjct:   232 YAEHQDQAVINHIISVAGWGVSNDGIEYWIVRNSWGEPWGEKGWMRI 278


>TAIR|locus:2060420 [details] [associations]
            symbol:AT2G22160 "AT2G22160" species:3702 "Arabidopsis
            thaliana" [GO:0005575 "cellular_component" evidence=ND] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] EMBL:CP002685
            GenomeReviews:CT485783_GR InterPro:IPR013201 Pfam:PF08246
            SMART:SM00848 EMBL:AC007168 IPI:IPI00544896 PIR:F84609
            RefSeq:NP_179806.1 UniGene:At.66231 HSSP:P25774
            ProteinModelPortal:Q9SIE8 SMR:Q9SIE8 EnsemblPlants:AT2G22160.1
            GeneID:816750 KEGG:ath:AT2G22160 TAIR:At2g22160 eggNOG:NOG297278
            InParanoid:Q9SIE8 OMA:HRCITLA PhylomeDB:Q9SIE8 ArrayExpress:Q9SIE8
            Genevestigator:Q9SIE8 Uniprot:Q9SIE8
        Length = 105

 Score = 178 (67.7 bits), Expect = 3.2e-13, P = 3.2e-13
 Identities = 40/94 (42%), Positives = 56/94 (59%)

Query:    50 ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSL 109
             +    F++FK N   + + N      + Y L+LNKFA+LT  EF+ + T F MSDH   L
Sbjct:    10 QTESSFDVFKKNAEYIVKTNKE---RKPYKLKLNKFANLTDVEFVNAHTCFDMSDHKKIL 66

Query:   110 KANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQG 142
              +   PF Y++ +Q P S++W EKGAVT VK QG
Sbjct:    67 DSK--PFFYENMTQAPDSLDWREKGAVTNVKDQG 98


>UNIPROTKB|F1N9D7 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9031 "Gallus gallus"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005730 "nucleolus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005739 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0005764
            GO:GO:0004197 GeneTree:ENSGT00560000076599 OMA:GYPSGAW
            GO:GO:0097067 PANTHER:PTHR12411:SF16 IPI:IPI00573387
            EMBL:AADN02018292 Ensembl:ENSGALT00000026896
            Ensembl:ENSGALT00000036723 Uniprot:F1N9D7
        Length = 340

 Score = 156 (60.0 bits), Expect = 3.6e-13, Sum P(2) = 3.6e-13
 Identities = 40/115 (34%), Positives = 56/115 (48%)

Query:   212 GICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA-IDASALQFYSGGVFNGYC 270
             G   S K + H   IT+Y  VP +++E + +   N PV  A I       Y  GV+    
Sbjct:   215 GYSPSYKEDKHYG-ITSY-GVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVS 272

Query:   271 -ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
              E    H +  +G+G  E G  YWL  NSW  DWG++G+F++ R  D     CGI
Sbjct:   273 GEQVGGHAIRILGWGV-ENGTPYWLAANSWNTDWGDNGFFKILRGEDH----CGI 322

 Score = 81 (33.6 bits), Expect = 3.6e-13, Sum P(2) = 3.6e-13
 Identities = 25/90 (27%), Positives = 44/90 (48%)

Query:   129 WIEKGAVTPVKYQGQC----AVAAVEGIN---AIKINRLVSL--SEQQLVDCATNDNNNG 179
             W     ++ ++ QG C    A  AVE I+    +  N  VS+  S + L+ C   +   G
Sbjct:    90 WPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLLSCCGFECGMG 149

Query:   180 CYGGFMDDAFKYIIQNKGITNDAVY-SYEG 208
             C GG+   A++Y  + +G+ +  +Y S+ G
Sbjct:   150 CNGGYPSGAWRYWTE-RGLVSGGLYDSHVG 178


>UNIPROTKB|E2R6Q7 [details] [associations]
            symbol:CTSB "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730 "nucleolus"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790
            GO:GO:0005764 GO:GO:0004197 CTD:1508 GeneTree:ENSGT00560000076599
            KO:K01363 OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16
            EMBL:AAEX03014318 RefSeq:XP_543203.3 Ensembl:ENSCAFT00000012692
            GeneID:486077 KEGG:cfa:486077 NextBio:20859923 Uniprot:E2R6Q7
        Length = 339

 Score = 160 (61.4 bits), Expect = 3.8e-13, Sum P(2) = 3.8e-13
 Identities = 39/115 (33%), Positives = 56/115 (48%)

Query:   212 GICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA-SALQFYSGGVFNGYC 270
             G   S K + H    ++Y  V  N++E + +   N PV  A    S    Y  GV+    
Sbjct:   214 GYSPSYKEDKHYG-CSSYS-VSDNEKEIMAEIYKNGPVEAAFTVYSDFLLYKSGVYQHVT 271

Query:   271 -ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
              E    H V  +G+G  E+G  YWL+ NSW  DWG++G+F++ R  D     CGI
Sbjct:   272 GEMMGGHAVRILGWGV-EDGTPYWLVGNSWNTDWGDNGFFKILRGRDH----CGI 321

 Score = 76 (31.8 bits), Expect = 3.8e-13, Sum P(2) = 3.8e-13
 Identities = 24/90 (26%), Positives = 45/90 (50%)

Query:   129 WIEKGAVTPVKYQGQC----AVAAVEGIN---AIKINRLVSL--SEQQLVDCATNDNNNG 179
             W     +  ++ QG C    A  AVE I+    I+ N  V++  S + ++ C  +   +G
Sbjct:    90 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEVSAEDMLTCCGDQCGDG 149

Query:   180 CYGGFMDDAFKYIIQNKGITNDAVY-SYEG 208
             C GGF  +A+ +  + +G+ +  +Y S+ G
Sbjct:   150 CNGGFPAEAWNFWTK-QGLVSGGLYDSHVG 178


>WB|WBGene00000781 [details] [associations]
            symbol:cpr-1 species:6239 "Caenorhabditis elegans"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008340 "determination
            of adult lifespan" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008340 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            eggNOG:NOG315657 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            KO:K01363 PANTHER:PTHR12411:SF16 EMBL:M74797 EMBL:Z78012 PIR:T20148
            RefSeq:NP_506002.2 ProteinModelPortal:P25807 SMR:P25807
            DIP:DIP-25619N MINT:MINT-1058393 STRING:P25807 MEROPS:C01.A32
            PaxDb:P25807 EnsemblMetazoa:C52E4.1 GeneID:179637
            KEGG:cel:CELE_C52E4.1 UCSC:C52E4.1 CTD:179637 WormBase:C52E4.1
            InParanoid:P25807 OMA:CSLSCQS NextBio:906250 Uniprot:P25807
        Length = 329

 Score = 165 (63.1 bits), Expect = 6.6e-13, Sum P(2) = 6.6e-13
 Identities = 39/108 (36%), Positives = 53/108 (49%)

Query:   219 AEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA-SALQFYSGGVFNGYCETFLN-H 276
             A+D    ++ Y  VP N      +  AN PV  A         Y  GV+      +L  H
Sbjct:   217 AKDKHFGVSAYA-VPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSGVYKHTAGKYLGGH 275

Query:   277 GVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
              +  +G+GT E G  YWL+ NSWG +WGE G+F++ R  DQ    CGI
Sbjct:   276 AIKIIGWGT-ESGSPYWLVANSWGVNWGESGFFKIYRGDDQ----CGI 318

 Score = 67 (28.6 bits), Expect = 6.6e-13, Sum P(2) = 6.6e-13
 Identities = 37/162 (22%), Positives = 64/162 (39%)

Query:    64 AVERFNNAAIG---NRSYTLRLNKFADLTPQEF-IASQTGFKMSDHSSSLKANGTPFLYK 119
             AVE     A+    N + +L   +  ++T +E       G   + HS  ++A     +  
Sbjct:    24 AVETLTGQALVDYVNSAQSLFKTEHVEITEEEMKFKLMDGKYAAAHSDEIRATEQEVVLA 83

Query:   120 SSQVPPSVN----WIEKGAVTPVKYQGQC----AVAAVEGIN---AIKINRLVS--LSEQ 166
             S  VP + +    W E  ++  ++ Q  C    A  A E I+    I+        +S  
Sbjct:    84 S--VPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPD 141

Query:   167 QLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG 208
              L+ C  +   NGC GG+   A ++   +KG+     Y   G
Sbjct:   142 DLLSCCGSSCGNGCEGGYPIQALRWW-DSKGVVTGGDYHGAG 182


>UNIPROTKB|P07688 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9913 "Bos taurus"
            [GO:0042470 "melanosome" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730
            "nucleolus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0042470
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 EMBL:L06075 EMBL:M64620
            EMBL:U16336 EMBL:U16337 EMBL:U16338 EMBL:U16339 EMBL:U16341
            EMBL:U16342 EMBL:U16343 EMBL:BC102997 IPI:IPI00692061 PIR:S38328
            RefSeq:NP_776456.1 UniGene:Bt.393 PDB:1ITO PDB:1QDQ PDB:1SP4
            PDB:2DC6 PDB:2DC7 PDB:2DC8 PDB:2DC9 PDB:2DCA PDB:2DCB PDB:2DCC
            PDB:2DCD PDBsum:1ITO PDBsum:1QDQ PDBsum:1SP4 PDBsum:2DC6
            PDBsum:2DC7 PDBsum:2DC8 PDBsum:2DC9 PDBsum:2DCA PDBsum:2DCB
            PDBsum:2DCC PDBsum:2DCD ProteinModelPortal:P07688 SMR:P07688
            STRING:P07688 MEROPS:C01.060 PRIDE:P07688
            Ensembl:ENSBTAT00000036795 GeneID:281105 KEGG:bta:281105 CTD:1508
            eggNOG:NOG315657 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 InParanoid:P07688 KO:K01363 OMA:GYPSGAW
            OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 BindingDB:P07688
            ChEMBL:CHEMBL2323 EvolutionaryTrace:P07688 NextBio:20805177
            ArrayExpress:P07688 GO:GO:0097067 PANTHER:PTHR12411:SF16
            Uniprot:P07688
        Length = 335

 Score = 155 (59.6 bits), Expect = 6.9e-13, Sum P(2) = 6.9e-13
 Identities = 38/115 (33%), Positives = 55/115 (47%)

Query:   212 GICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA-SALQFYSGGVFNGYC 270
             G   S K + H    ++Y  V  N++E + +   N PV  A    S    Y  GV+    
Sbjct:   214 GYSPSYKEDKHFG-CSSYS-VANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVS 271

Query:   271 -ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
              E    H +  +G+G  E G  YWL+ NSW  DWG++G+F++ R     Q  CGI
Sbjct:   272 GEIMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRG----QDHCGI 321

 Score = 79 (32.9 bits), Expect = 6.9e-13, Sum P(2) = 6.9e-13
 Identities = 24/86 (27%), Positives = 42/86 (48%)

Query:   129 WIEKGAVTPVKYQGQC----AVAAVEGIN---AIKIN-RL-VSLSEQQLVDCATNDNNNG 179
             W     +  ++ QG C    A  AVE I+    I  N R+ V +S + ++ C   +  +G
Sbjct:    90 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCGGECGDG 149

Query:   180 CYGGFMDDAFKYIIQNKGITNDAVYS 205
             C GGF   A+ +  + KG+ +  +Y+
Sbjct:   150 CNGGFPSGAWNFWTK-KGLVSGGLYN 174


>UNIPROTKB|F1N455 [details] [associations]
            symbol:CTSC "Dipeptidyl peptidase 1 exclusion domain chain"
            species:9913 "Bos taurus" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 IPI:IPI00697314 UniGene:Bt.49573
            InterPro:IPR014882 Pfam:PF08773 OMA:YDDFLHY GO:GO:0001913
            EMBL:DAAA02062487 EMBL:DAAA02062488 Ensembl:ENSBTAT00000014735
            Uniprot:F1N455
        Length = 463

 Score = 195 (73.7 bits), Expect = 7.9e-13, P = 7.9e-13
 Identities = 69/214 (32%), Positives = 104/214 (48%)

Query:   142 GQCAVAAVEGINAIKINRLVS------LSEQQLVDCATNDNNNGCYGGFMD-DAFKYIIQ 194
             G C   A  G+   +I  L +      LS Q++V C+      GC GGF    A KY  Q
Sbjct:   256 GSCYSFASMGMMEARIRILTNNTQTPILSPQEVVSCS--QYAQGCEGGFPYLIAGKYA-Q 312

Query:   195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP---NDEESLLKA-VANQ-PV 249
             + G+  +  + Y G  +  C  +K        + Y  V        E+L+K  + +Q P+
Sbjct:   313 DFGLVEEDCFPYTGTDSP-C-RLKEGCFRYYSSEYHYVGGFYGGCNEALMKLELVHQGPM 370

Query:   250 SVAIDA-SALQFYSGGVFN--GYCETF-----LNHGVTAVGYGT-SEEGIKYWLIKNSWG 300
             +VA +       Y  GV++  G  + F      NH V  VGYGT +  G+ YW++KNSWG
Sbjct:   371 AVAFEVYDDFLHYRKGVYHHTGLRDPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWG 430

Query:   301 QDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
               WGE+GYFR++R  D+   +  IA+ A+ P+ K
Sbjct:   431 TSWGENGYFRIRRGTDECAIE-SIALAAT-PIPK 462


>UNIPROTKB|Q3ZCJ8 [details] [associations]
            symbol:CTSC "Dipeptidyl peptidase 1" species:9913 "Bos
            taurus" [GO:0031638 "zymogen activation" evidence=IDA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 EMBL:BC102115 IPI:IPI00697314 RefSeq:NP_001028789.1
            UniGene:Bt.49573 ProteinModelPortal:Q3ZCJ8 SMR:Q3ZCJ8 STRING:Q3ZCJ8
            PRIDE:Q3ZCJ8 GeneID:352958 KEGG:bta:352958 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 InParanoid:Q3ZCJ8 KO:K01275
            OrthoDB:EOG4H19VZ BindingDB:Q3ZCJ8 ChEMBL:CHEMBL1075050
            NextBio:20812686 GO:GO:0031638 InterPro:IPR014882 Pfam:PF08773
            Uniprot:Q3ZCJ8
        Length = 463

 Score = 195 (73.7 bits), Expect = 7.9e-13, P = 7.9e-13
 Identities = 69/214 (32%), Positives = 104/214 (48%)

Query:   142 GQCAVAAVEGINAIKINRLVS------LSEQQLVDCATNDNNNGCYGGFMD-DAFKYIIQ 194
             G C   A  G+   +I  L +      LS Q++V C+      GC GGF    A KY  Q
Sbjct:   256 GSCYSFASMGMMEARIRILTNNTQTPILSPQEVVSCS--QYAQGCEGGFPYLIAGKYA-Q 312

Query:   195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP---NDEESLLKA-VANQ-PV 249
             + G+  +  + Y G  +  C  +K        + Y  V        E+L+K  + +Q P+
Sbjct:   313 DFGLVEEDCFPYTGTDSP-C-RLKEGCFRYYSSEYHYVGGFYGGCNEALMKLELVHQGPM 370

Query:   250 SVAIDA-SALQFYSGGVFN--GYCETF-----LNHGVTAVGYGT-SEEGIKYWLIKNSWG 300
             +VA +       Y  GV++  G  + F      NH V  VGYGT +  G+ YW++KNSWG
Sbjct:   371 AVAFEVYDDFLHYRKGVYHHTGLRDPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWG 430

Query:   301 QDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
               WGE+GYFR++R  D+   +  IA+ A+ P+ K
Sbjct:   431 TSWGENGYFRIRRGTDECAIE-SIALAAT-PIPK 462


>UNIPROTKB|A1E295 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9823 "Sus scrofa"
            [GO:0005615 "extracellular space" evidence=IDA] [GO:0042470
            "melanosome" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730
            "nucleolus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0042470
            GO:GO:0005615 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 MEROPS:C01.060 CTD:1508
            GeneTree:ENSGT00560000076599 HOVERGEN:HBG003480 KO:K01363
            OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16 EMBL:EF095956
            RefSeq:NP_001090927.1 UniGene:Ssc.53773 ProteinModelPortal:A1E295
            SMR:A1E295 PRIDE:A1E295 Ensembl:ENSSSCT00000026923 GeneID:100037961
            KEGG:ssc:100037961 Uniprot:A1E295
        Length = 335

 Score = 150 (57.9 bits), Expect = 1.1e-12, Sum P(2) = 1.1e-12
 Identities = 37/116 (31%), Positives = 58/116 (50%)

Query:   212 GICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA--IDASALQFYSGGVFNGY 269
             G   S K + H    ++Y  +  N++E + +   N PV  A  + +  LQ Y  GV+   
Sbjct:   214 GYTPSYKEDKHFG-CSSYS-ISRNEKEIMAEIYKNGPVEGAFTVYSDFLQ-YKSGVYQHV 270

Query:   270 CETFLN-HGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
                 +  H +  +G+G  E G  YWL+ NSW  DWG++G+F++ R     Q  CGI
Sbjct:   271 TGDLMGGHAIRILGWGV-ENGTPYWLVGNSWNTDWGDNGFFKILRG----QDHCGI 321

 Score = 83 (34.3 bits), Expect = 1.1e-12, Sum P(2) = 1.1e-12
 Identities = 26/90 (28%), Positives = 46/90 (51%)

Query:   129 WIEKGAVTPVKYQGQC----AVAAVEGIN---AIKIN-RL-VSLSEQQLVDCATNDNNNG 179
             W     +  ++ QG C    A  AVE I+    I+ N R+ V +S + ++ C  ++  +G
Sbjct:    90 WPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTCCGDECGDG 149

Query:   180 CYGGFMDDAFKYIIQNKGITNDAVY-SYEG 208
             C GGF   A+ +  + KG+ +  +Y S+ G
Sbjct:   150 CNGGFPSGAWNFWTK-KGLVSGGLYDSHVG 178


>DICTYBASE|DDB_G0286015 [details] [associations]
            symbol:gmsA species:44689 "Dictyostelium discoideum"
            [GO:0005615 "extracellular space" evidence=IDA] [GO:0019953 "sexual
            reproduction" evidence=IEP] [GO:0009897 "external side of plasma
            membrane" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA;ISS] [GO:0000747 "conjugation with cellular
            fusion" evidence=IMP] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005576 "extracellular
            region" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0286015 Pfam:PF00188 GO:GO:0005615
            GenomeReviews:CM000153_GR GO:GO:0009897 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AAFI02000085 GO:GO:0000747
            Gene3D:3.40.33.10 InterPro:IPR001283 InterPro:IPR014044
            SMART:SM00198 SUPFAM:SSF55797 HSSP:P07688 RefSeq:XP_637893.1
            ProteinModelPortal:Q54ME1 MEROPS:C01.A52 EnsemblProtists:DDB0191145
            GeneID:8625403 KEGG:ddi:DDB_G0286015 InParanoid:Q54ME1 OMA:PGIAYEK
            ProtClustDB:CLSZ2429919 Uniprot:Q54ME1
        Length = 448

 Score = 192 (72.6 bits), Expect = 1.6e-12, P = 1.6e-12
 Identities = 62/204 (30%), Positives = 95/204 (46%)

Query:   126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRL----VSLSEQQLVDCATN 174
             +V+W      TP++ QGQC       + AA+E    IK        + LS Q  V+C  +
Sbjct:   243 TVDWTSYQ--TPIRDQGQCGSCWAFASSAALESRYLIKYGTAQKSTLQLSNQNAVNCIAS 300

Query:   175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
                 GC GG+  + F +  +  GI  +    Y+ ++   C +  +     + TNY     
Sbjct:   301 ----GCNGGWSGNYFNFF-KTPGIAYEKDDPYKAVTGTSCITTSSVARF-KYTNY-GYTE 353

Query:   235 NDEESLLKAVANQPVSVAIDA-SALQFYSGGVFNGYCE-TFLNHGVTAVGYGTSEEGIKY 292
               + +LL  +   PV++A+   SA Q Y  G++N   + T +NH V  VGY  + +  K 
Sbjct:   354 KTKAALLAELKKGPVTIAVYVDSAFQNYKSGIYNSATKYTGINHLVLLVGYDQATDAYK- 412

Query:   293 WLIKNSWGQDWGEDGYFRLQRDID 316
               IKNSWG  WGE GY R+    D
Sbjct:   413 --IKNSWGSWWGESGYMRITASND 434


>MGI|MGI:109553 [details] [associations]
            symbol:Ctsc "cathepsin C" species:10090 "Mus musculus"
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IGI]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IMP]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0005783 "endoplasmic
            reticulum" evidence=ISO] [GO:0005794 "Golgi apparatus"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=ISO;IMP]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0010033
            "response to organic substance" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0031404 "chloride ion
            binding" evidence=ISO] [GO:0042802 "identical protein binding"
            evidence=ISO] [GO:0043621 "protein self-association" evidence=ISO]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 MGI:MGI:109553 GO:GO:0005783
            GO:GO:0005794 GO:GO:0007568 GO:GO:0010033 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004252 GO:GO:0005764 GO:GO:0031404 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 MEROPS:C01.070 OMA:YDDFLHY
            GO:GO:0001913 EMBL:U89269 EMBL:U74683 EMBL:BC067063 IPI:IPI00130015
            RefSeq:NP_034112.3 UniGene:Mm.322945 ProteinModelPortal:P97821
            SMR:P97821 STRING:P97821 PhosphoSite:P97821 PaxDb:P97821
            PRIDE:P97821 Ensembl:ENSMUST00000032779 GeneID:13032 KEGG:mmu:13032
            InParanoid:P97821 BindingDB:P97821 ChEMBL:CHEMBL3454 ChiTaRS:CTSC
            NextBio:282904 Bgee:P97821 CleanEx:MM_CTSC Genevestigator:P97821
            Uniprot:P97821
        Length = 462

 Score = 191 (72.3 bits), Expect = 2.3e-12, P = 2.3e-12
 Identities = 62/189 (32%), Positives = 96/189 (50%)

Query:   163 LSEQQLVDCATNDNNNGCYGGFMD-DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED 221
             LS Q++V C+      GC GGF    A KY  Q+ G+  ++ + Y    +  C   K  +
Sbjct:   282 LSPQEVVSCSPYAQ--GCDGGFPYLIAGKYA-QDFGVVEESCFPYTAKDSP-C---KPRE 334

Query:   222 HAAQI--TNYEDVPP---NDEESLLKA--VANQPVSVAIDA-SALQFYSGGVFN--GYCE 271
             +  +   ++Y  V        E+L+K   V + P++VA +       Y  G+++  G  +
Sbjct:   335 NCLRYYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSD 394

Query:   272 TF-----LNHGVTAVGYGTSE-EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIA 325
              F      NH V  VGYG     GI+YW+IKNSWG +WGE GYFR++R  D+   +  IA
Sbjct:   395 PFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRGTDECAIE-SIA 453

Query:   326 MFASFPVSK 334
             + A+ P+ K
Sbjct:   454 V-AAIPIPK 461


>UNIPROTKB|F1M8U6 [details] [associations]
            symbol:F1M8U6 "Uncharacterized protein" species:10116
            "Rattus norvegicus" [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            IPI:IPI00782277 Ensembl:ENSRNOT00000055587 OMA:EREIAAW
            Uniprot:F1M8U6
        Length = 163

 Score = 170 (64.9 bits), Expect = 2.5e-12, P = 2.5e-12
 Identities = 55/174 (31%), Positives = 82/174 (47%)

Query:   165 EQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA 224
             +++L+DC   D    C GG   +A+  I    G+  +  Y YEG     C+ + A+    
Sbjct:     1 KKELLDCDKMDK--ACLGGLPSNAYTAIKNLGGLETEDGYGYEGHFQA-CNFL-AQMTKV 56

Query:   225 QITNYEDVPPNDEESLLKAVANQP-VSVAIDASALQFYSGGVFNGY---CET-FLNHGVT 279
              I++  ++  N E S+   +A +  +SVAI    +QF+  G  +     C   F +H V 
Sbjct:    57 YISDSVELSQN-ESSIAALLAQKGLISVAI----MQFHRYGTVHPLRPLCSPGFTDHSVL 111

Query:   280 AVGYGTS-EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
              VGYG      I YW IKN  G DWGE+G++ L R      G  G+   AS  V
Sbjct:   112 LVGYGNRPRSNIPYWAIKNIQGSDWGEEGHYYLYRG----SGDRGVNTMASSAV 161


>UNIPROTKB|P43233 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9031 "Gallus gallus"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 MEROPS:C01.060 CTD:1508
            HOGENOM:HOG000241341 HOVERGEN:HBG003480 KO:K01363 OrthoDB:EOG4K6G4C
            PANTHER:PTHR12411:SF16 EMBL:U18083 IPI:IPI00573387 PIR:S58770
            RefSeq:NP_990702.1 UniGene:Gga.3854 ProteinModelPortal:P43233
            SMR:P43233 STRING:P43233 PRIDE:P43233 GeneID:396329 KEGG:gga:396329
            InParanoid:P43233 NextBio:20816377 Uniprot:P43233
        Length = 340

 Score = 149 (57.5 bits), Expect = 2.5e-12, Sum P(2) = 2.5e-12
 Identities = 40/115 (34%), Positives = 54/115 (46%)

Query:   212 GICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA-IDASALQFYSGGVFNGYC 270
             G   S K + H   IT+Y  VP +++E + +   N PV  A I       Y  GV+    
Sbjct:   215 GYSPSYKEDKHYG-ITSY-GVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVS 272

Query:   271 -ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
              E    H +  +G+G  E G  YWL  NSW  DWG  G+F++ R  D     CGI
Sbjct:   273 GEQVGGHAIRILGWGV-ENGTPYWLAANSWNTDWGITGFFKILRGEDH----CGI 322

 Score = 81 (33.6 bits), Expect = 2.5e-12, Sum P(2) = 2.5e-12
 Identities = 25/90 (27%), Positives = 44/90 (48%)

Query:   129 WIEKGAVTPVKYQGQC----AVAAVEGIN---AIKINRLVSL--SEQQLVDCATNDNNNG 179
             W     ++ ++ QG C    A  AVE I+    +  N  VS+  S + L+ C   +   G
Sbjct:    90 WPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLLSCCGFECGMG 149

Query:   180 CYGGFMDDAFKYIIQNKGITNDAVY-SYEG 208
             C GG+   A++Y  + +G+ +  +Y S+ G
Sbjct:   150 CNGGYPSGAWRYWTE-RGLVSGGLYDSHVG 178


>UNIPROTKB|A5GFX7 [details] [associations]
            symbol:CTSZ "Cathepsin Z" species:9823 "Sus scrofa"
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GO:GO:0060441 GeneTree:ENSGT00560000076599 MEROPS:C01.013 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN
            OMA:QCGTCTE EMBL:CR956646 RefSeq:NP_001116576.1 UniGene:Ssc.16769
            ProteinModelPortal:A5GFX7 SMR:A5GFX7 STRING:A5GFX7
            Ensembl:ENSSSCT00000008249 GeneID:100141405 KEGG:ssc:100141405
            ArrayExpress:A5GFX7 Uniprot:A5GFX7
        Length = 304

 Score = 185 (70.2 bits), Expect = 2.8e-12, P = 2.8e-12
 Identities = 51/166 (30%), Positives = 83/166 (50%)

Query:   163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK-GITNDAVYSYEGMST--------GI 213
             LS Q ++DC    N   C GG  DD   +   ++ GI ++   +Y+            G 
Sbjct:   119 LSVQHVIDCG---NAGSCEGG--DDLPVWAYAHRHGIPDETCNNYQAKDQVCDKFNQCGT 173

Query:   214 CDSIKAEDHAAQ------ITNYEDVPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVF 266
             C   K E H  Q      + +Y  V    E+ + +  AN P+S  I A+  +  Y+GG++
Sbjct:   174 CTEFK-ECHVIQNYTLWKVGDYGSVSGR-EKMMAEIYANGPISCGIMATEKMSNYTGGIY 231

Query:   267 NGYCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
               Y +  ++NH V+  G+G S  G +YW+++NSWG+ WGE G+ R+
Sbjct:   232 AEYKDQAYINHIVSVAGWGVSG-GTEYWIVRNSWGEPWGERGWMRI 276


>WB|WBGene00010204 [details] [associations]
            symbol:F57F5.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0006898
            "receptor-mediated endocytosis" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0009792 GO:GO:0006898
            GO:GO:0040007 GO:GO:0002119 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0040011
            EMBL:Z75953 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            PANTHER:PTHR12411:SF16 RefSeq:NP_506011.2 ProteinModelPortal:Q20950
            SMR:Q20950 DIP:DIP-24447N IntAct:Q20950 MINT:MINT-211137
            STRING:Q20950 MEROPS:C01.A42 EnsemblMetazoa:F57F5.1 GeneID:179645
            KEGG:cel:CELE_F57F5.1 UCSC:F57F5.1 CTD:179645 WormBase:F57F5.1
            OMA:ADDINAC Uniprot:Q20950
        Length = 351

 Score = 153 (58.9 bits), Expect = 3.2e-12, Sum P(2) = 3.2e-12
 Identities = 33/79 (41%), Positives = 44/79 (55%)

Query:   248 PVSVAIDA-SALQFYSGGVFNGYCETFLN-HGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
             PV VA       + YSGGV+       L  H V  +G+G  + G  YWL  NSW +DWGE
Sbjct:   267 PVEVAFTVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWGV-DNGTPYWLCANSWNEDWGE 325

Query:   306 DGYFRLQRDIDQPQGQCGI 324
             +GYFR+ R +++    CGI
Sbjct:   326 NGYFRIIRGVNE----CGI 340

 Score = 76 (31.8 bits), Expect = 3.2e-12, Sum P(2) = 3.2e-12
 Identities = 29/118 (24%), Positives = 52/118 (44%)

Query:   129 WIEKGAVTPVKYQGQC----AVAAVEGIN-----AIKINRLVSLSEQQLVDCATNDNNNG 179
             W    +++ ++ Q  C    AV+A E I+     A     ++S+S   +  C      NG
Sbjct:   107 WPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNAKTILSISADDINACCGMVCGNG 166

Query:   180 CYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAE--DHAAQITNYEDVPPN 235
             C GG+  +A+++ ++   +T     SY+   TG C        +H    T+Y+  P N
Sbjct:   167 CNGGYPIEAWRHYVKKGYVTGG---SYQD-KTG-CKPYPYPPCEHHVNGTHYKPCPSN 219


>ZFIN|ZDB-GENE-070323-1 [details] [associations]
            symbol:ctsbb "capthepsin B, b" species:7955 "Danio
            rerio" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-070323-1 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197
            GeneTree:ENSGT00560000076599 PANTHER:PTHR12411:SF16 OMA:CCGFLCG
            EMBL:CU207296 EMBL:CABZ01037785 IPI:IPI00877452
            Ensembl:ENSDART00000097263 Bgee:F1QZT5 Uniprot:F1QZT5
        Length = 326

 Score = 159 (61.0 bits), Expect = 3.4e-12, Sum P(2) = 3.4e-12
 Identities = 42/131 (32%), Positives = 64/131 (48%)

Query:   211 TGICD---SI--KAEDH-AAQITNYEDVPPNDEESLLKAVANQPVSVAIDA-SALQFYSG 263
             TG+C    S+  K + H  +++ N   VP + ++ + +   N PV  A         Y  
Sbjct:   202 TGVCIPKYSVPYKQDKHFGSKVYN---VPSDQQQIMTELYTNGPVEAAFTVYEDFPLYKS 258

Query:   264 GVFNGYCETFLN-HGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
             GV+     + L  H V  +G+G  E G  +WL+ NSW  DWG++GYF++ R  D+    C
Sbjct:   259 GVYQHLTGSALGGHAVKILGWG-EENGTPFWLVANSWNSDWGDNGYFKILRGHDE----C 313

Query:   323 GIA--MFASFP 331
             GI   M A  P
Sbjct:   314 GIESEMVAGLP 324

 Score = 67 (28.6 bits), Expect = 3.4e-12, Sum P(2) = 3.4e-12
 Identities = 41/154 (26%), Positives = 68/154 (44%)

Query:    68 FNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS---QVP 124
             F NAA    ++T  +N F D  P++++ S  G       + LK    P   K S   ++P
Sbjct:    28 FINAA--RSTWTAGVN-F-DNVPKKYLKSLCG-------TVLKGPRLPHTVKHSTNVKLP 76

Query:   125 PSVN----WIEKGAVTPVKYQGQC----AVAAVEGIN-AIKIN----RLVSLSEQQLVDC 171
              S +    W     +  ++ QG C    A  AVE I+  I I+    +   +S + L+ C
Sbjct:    77 DSFDLRDQWPNCKTLNQIRDQGSCGSCWAFGAVESISDRICIHSKGKQSPEISAEDLLSC 136

Query:   172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
               +    GC GGF  +A+ Y  +  G+    +Y+
Sbjct:   137 C-DQCGFGCSGGFPAEAWDYW-RRSGLVTGGLYN 168


>UNIPROTKB|P05689 [details] [associations]
            symbol:CTSZ "Cathepsin Z" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:BC122603
            EMBL:X01809 IPI:IPI00708474 PIR:A29172 RefSeq:NP_001071303.1
            UniGene:Bt.4902 ProteinModelPortal:P05689 SMR:P05689 MEROPS:C01.013
            PRIDE:P05689 GeneID:404187 KEGG:bta:404187 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 InParanoid:P05689 KO:K08568
            OrthoDB:EOG42Z4QN BRENDA:3.4.18.1 NextBio:20817615 Uniprot:P05689
        Length = 304

 Score = 184 (69.8 bits), Expect = 3.7e-12, P = 3.7e-12
 Identities = 50/166 (30%), Positives = 84/166 (50%)

Query:   163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMST--------GIC 214
             LS Q ++DC    +   C GG     ++Y  ++ GI ++   +Y+            G C
Sbjct:   119 LSVQHVIDCG---DAGSCEGGNDLPVWEYAHRH-GIPDETCNNYQAKDQECDKFNQCGTC 174

Query:   215 DSIKAEDHAAQITNYEDVPPND-------EESLLKAVANQPVSVAIDASA-LQFYSGGVF 266
                K E H   I NY      D       E+ + +   N P+S  I A+  +  Y+GG++
Sbjct:   175 TEFK-ECHV--IKNYTLWKVGDYGSLSGREKMMAEIYTNGPISCGIMATEKMSNYTGGIY 231

Query:   267 NGYCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
             + Y +  F+NH V+  G+G S+ G++YW+++NSWG+ WGE G+ R+
Sbjct:   232 SEYNDQAFINHIVSVAGWGVSD-GMEYWIVRNSWGEPWGEHGWMRI 276


>RGD|621509 [details] [associations]
            symbol:Ctsb "cathepsin B" species:10116 "Rattus norvegicus"
            [GO:0004175 "endopeptidase activity" evidence=IMP;IDA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA;ISO;IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005576
            "extracellular region" evidence=IDA] [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0005730 "nucleolus" evidence=IEA;ISO]
            [GO:0005737 "cytoplasm" evidence=IDA] [GO:0005739 "mitochondrion"
            evidence=IEA;ISO;IDA] [GO:0005764 "lysosome" evidence=IEA;ISO;IDA]
            [GO:0006508 "proteolysis" evidence=IEA;IEP;ISO;IMP;IDA;TAS]
            [GO:0006914 "autophagy" evidence=IEP] [GO:0006950 "response to
            stress" evidence=IEP] [GO:0007283 "spermatogenesis" evidence=IEP]
            [GO:0007519 "skeletal muscle tissue development" evidence=IEP]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0009611
            "response to wounding" evidence=IEP] [GO:0009612 "response to
            mechanical stimulus" evidence=IEP] [GO:0009749 "response to glucose
            stimulus" evidence=IEP] [GO:0009897 "external side of plasma
            membrane" evidence=IDA] [GO:0009986 "cell surface" evidence=IDA]
            [GO:0014070 "response to organic cyclic compound" evidence=IEP]
            [GO:0014075 "response to amine stimulus" evidence=IEP] [GO:0016324
            "apical plasma membrane" evidence=IDA] [GO:0030984 "kininogen
            binding" evidence=IPI] [GO:0032403 "protein complex binding"
            evidence=IPI] [GO:0034097 "response to cytokine stimulus"
            evidence=IEP] [GO:0042277 "peptide binding" evidence=IDA]
            [GO:0042383 "sarcolemma" evidence=IDA] [GO:0042470 "melanosome"
            evidence=IEA] [GO:0043231 "intracellular membrane-bounded
            organelle" evidence=ISO] [GO:0043434 "response to peptide hormone
            stimulus" evidence=IEP] [GO:0043621 "protein self-association"
            evidence=IDA] [GO:0045471 "response to ethanol" evidence=IEP]
            [GO:0048471 "perinuclear region of cytoplasm" evidence=ISO;IDA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0060548 "negative regulation of cell death" evidence=IMP]
            [GO:0070670 "response to interleukin-4" evidence=IEP] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA;ISO]
            [GO:0005901 "caveola" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:621509 GO:GO:0005739 GO:GO:0042470
            GO:GO:0005615 GO:GO:0009612 GO:GO:0009611 GO:GO:0009897
            GO:GO:0045471 GO:GO:0016324 GO:GO:0009749 GO:GO:0006914
            GO:GO:0043434 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0007283
            GO:GO:0005764 GO:GO:0042383 GO:GO:0043621 GO:GO:0014070
            GO:GO:0042277 GO:GO:0060548 GO:GO:0014075 GO:GO:0004197
            GO:GO:0070670 GO:GO:0007519 MEROPS:C01.060 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 GO:GO:0097067
            PANTHER:PTHR12411:SF16 EMBL:X82396 EMBL:M11305 IPI:IPI00212811
            PIR:S51041 UniGene:Rn.100909 PDB:1CPJ PDB:1CTE PDB:1MIR PDB:1THE
            PDBsum:1CPJ PDBsum:1CTE PDBsum:1MIR PDBsum:1THE
            ProteinModelPortal:P00787 SMR:P00787 STRING:P00787 PRIDE:P00787
            UCSC:RGD:621509 InParanoid:P00787 SABIO-RK:P00787 BindingDB:P00787
            ChEMBL:CHEMBL2602 EvolutionaryTrace:P00787 ArrayExpress:P00787
            Genevestigator:P00787 GermOnline:ENSRNOG00000010331 Uniprot:P00787
        Length = 339

 Score = 151 (58.2 bits), Expect = 4.7e-12, Sum P(2) = 4.7e-12
 Identities = 36/115 (31%), Positives = 56/115 (48%)

Query:   212 GICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA-SALQFYSGGVFNGYC 270
             G   S K + H    T+Y  V  +++E + +   N PV  A    S    Y  GV+    
Sbjct:   214 GYSTSYKEDKHYGY-TSYS-VSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEA 271

Query:   271 ETFLN-HGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
                +  H +  +G+G  E G+ YWL+ NSW  DWG++G+F++ R     +  CGI
Sbjct:   272 GDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNGFFKILRG----ENHCGI 321

 Score = 76 (31.8 bits), Expect = 4.7e-12, Sum P(2) = 4.7e-12
 Identities = 24/86 (27%), Positives = 41/86 (47%)

Query:   129 WIEKGAVTPVKYQGQC----AVAAVEGIN---AIKIN-RL-VSLSEQQLVDCATNDNNNG 179
             W     +  ++ QG C    A  AVE ++    I  N R+ V +S + L+ C      +G
Sbjct:    90 WSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDG 149

Query:   180 CYGGFMDDAFKYIIQNKGITNDAVYS 205
             C GG+   A+ +  + KG+ +  VY+
Sbjct:   150 CNGGYPSGAWNFWTR-KGLVSGGVYN 174


>UNIPROTKB|Q6IN22 [details] [associations]
            symbol:Ctsb "Cathepsin B" species:10116 "Rattus norvegicus"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 RGD:621509 GO:GO:0005739
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 CTD:1508
            GeneTree:ENSGT00560000076599 HOVERGEN:HBG003480 KO:K01363
            OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16 UniGene:Rn.100909
            EMBL:CH474023 HSSP:P00785 EMBL:BC072490 IPI:IPI00562653
            RefSeq:NP_072119.2 SMR:Q6IN22 IntAct:Q6IN22 STRING:Q6IN22
            Ensembl:ENSRNOT00000014177 GeneID:64529 KEGG:rno:64529
            InParanoid:Q6IN22 NextBio:613362 Genevestigator:Q6IN22
            Uniprot:Q6IN22
        Length = 339

 Score = 151 (58.2 bits), Expect = 4.7e-12, Sum P(2) = 4.7e-12
 Identities = 36/115 (31%), Positives = 56/115 (48%)

Query:   212 GICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA-SALQFYSGGVFNGYC 270
             G   S K + H    T+Y  V  +++E + +   N PV  A    S    Y  GV+    
Sbjct:   214 GYSTSYKEDKHYGY-TSYS-VSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEA 271

Query:   271 ETFLN-HGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
                +  H +  +G+G  E G+ YWL+ NSW  DWG++G+F++ R     +  CGI
Sbjct:   272 GDVMGGHAIRILGWGI-ENGVPYWLVANSWNVDWGDNGFFKILRG----ENHCGI 321

 Score = 76 (31.8 bits), Expect = 4.7e-12, Sum P(2) = 4.7e-12
 Identities = 24/86 (27%), Positives = 41/86 (47%)

Query:   129 WIEKGAVTPVKYQGQC----AVAAVEGIN---AIKIN-RL-VSLSEQQLVDCATNDNNNG 179
             W     +  ++ QG C    A  AVE ++    I  N R+ V +S + L+ C      +G
Sbjct:    90 WSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDG 149

Query:   180 CYGGFMDDAFKYIIQNKGITNDAVYS 205
             C GG+   A+ +  + KG+ +  VY+
Sbjct:   150 CNGGYPSGAWNFWTR-KGLVSGGVYN 174


>UNIPROTKB|F1MW68 [details] [associations]
            symbol:CTSZ "Cathepsin Z" species:9913 "Bos taurus"
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0060441
            GeneTree:ENSGT00560000076599 IPI:IPI00708474 UniGene:Bt.4902
            OMA:QCGTCTE EMBL:DAAA02036315 PRIDE:F1MW68
            Ensembl:ENSBTAT00000025007 Uniprot:F1MW68
        Length = 304

 Score = 182 (69.1 bits), Expect = 6.6e-12, P = 6.6e-12
 Identities = 50/166 (30%), Positives = 84/166 (50%)

Query:   163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMST--------GIC 214
             LS Q ++DC    +   C GG     ++Y  ++ GI ++   +Y+            G C
Sbjct:   119 LSVQHVLDCG---DAGSCEGGNDLPVWEYAHRH-GIPDETCNNYQAKDQECDKFNQCGTC 174

Query:   215 DSIKAEDHAAQITNYEDVPPND-------EESLLKAVANQPVSVAIDASA-LQFYSGGVF 266
                K E H   I NY      D       E+ + +   N P+S  I A+  +  Y+GG++
Sbjct:   175 TEFK-ECHV--IKNYTLWKVGDYGSLSGREKMMAEIYTNGPISCGIMATEKMSNYTGGIY 231

Query:   267 NGYCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
             + Y +  F+NH V+  G+G S+ G++YW+++NSWG+ WGE G+ R+
Sbjct:   232 SEYNDQAFINHIVSVAGWGVSD-GMEYWIVRNSWGEPWGEHGWMRI 276


>UNIPROTKB|E1C4M3 [details] [associations]
            symbol:CTSZ "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0060441 "epithelial tube branching
            involved in lung morphogenesis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GeneTree:ENSGT00560000076599 CTD:1522 KO:K08568 OMA:QCGTCTE
            EMBL:AADN02019004 IPI:IPI00596430 RefSeq:XP_417483.3
            Ensembl:ENSGALT00000012067 GeneID:419311 KEGG:gga:419311
            Uniprot:E1C4M3
        Length = 305

 Score = 181 (68.8 bits), Expect = 8.8e-12, P = 8.8e-12
 Identities = 50/165 (30%), Positives = 82/165 (49%)

Query:   163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMST--------GIC 214
             LS Q ++DCA   N   C GG     + Y   + GI ++   +Y+  +         G C
Sbjct:   119 LSVQNVIDCA---NAGSCEGGDHTGVWMYA-HDHGIPDETCNNYQAKNQKCKKFNQCGTC 174

Query:   215 DSIKAEDHAAQ------ITNYEDVPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFN 267
              +   E H  +      + +Y  V    E+ + +  AN P+S  I A+  L  Y+GG++ 
Sbjct:   175 VTF-GECHVIKNYTLWKVADYGAVSGR-EKMMAEIYANGPISCGIMATEKLDAYTGGLYT 232

Query:   268 GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
              Y  +  +NH V+  G+G  E G +YW+++NSWG+ WGE G+ R+
Sbjct:   233 EYNPSPTVNHIVSVAGWGV-ENGTEYWIVRNSWGEPWGERGWLRI 276


>RGD|2445 [details] [associations]
            symbol:Ctsc "cathepsin C" species:10116 "Rattus norvegicus"
          [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA;ISO]
          [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
          [GO:0004252 "serine-type endopeptidase activity" evidence=IEA;ISO]
          [GO:0005764 "lysosome" evidence=IDA;TAS] [GO:0005783 "endoplasmic
          reticulum" evidence=IDA] [GO:0005794 "Golgi apparatus" evidence=IDA]
          [GO:0006508 "proteolysis" evidence=IEP;ISO;TAS] [GO:0007568 "aging"
          evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
          evidence=ISO] [GO:0010033 "response to organic substance"
          evidence=IDA] [GO:0031404 "chloride ion binding" evidence=IDA]
          [GO:0042802 "identical protein binding" evidence=IDA] [GO:0043621
          "protein self-association" evidence=IDA] InterPro:IPR000668
          InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
          InterPro:IPR000169 RGD:2445 GO:GO:0005783 GO:GO:0005794 GO:GO:0007568
          GO:GO:0010033 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
          InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139
          PROSITE:PS00639 GO:GO:0004252 GO:GO:0005764 GO:GO:0043621
          GO:GO:0042802 GO:GO:0031404 GO:GO:0004197
          GeneTree:ENSGT00560000076599 CTD:1075 HOGENOM:HOG000068022
          HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ InterPro:IPR014882
          Pfam:PF08773 MEROPS:C01.070 OMA:YDDFLHY GO:GO:0001913 EMBL:D90404
          IPI:IPI00193765 PIR:A41158 RefSeq:NP_058793.1 UniGene:Rn.203177
          PDB:1JQP PDBsum:1JQP ProteinModelPortal:P80067 SMR:P80067
          STRING:P80067 PhosphoSite:P80067 PRIDE:P80067
          Ensembl:ENSRNOT00000022342 GeneID:25423 KEGG:rno:25423
          InParanoid:P80067 SABIO-RK:P80067 EvolutionaryTrace:P80067
          NextBio:606591 ArrayExpress:P80067 Genevestigator:P80067
          GermOnline:ENSRNOG00000016496 Uniprot:P80067
        Length = 462

 Score = 185 (70.2 bits), Expect = 1.1e-11, P = 1.1e-11
 Identities = 61/189 (32%), Positives = 92/189 (48%)

Query:   163 LSEQQLVDCATNDNNNGCYGGFMD-DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED 221
             LS Q++V C+      GC GGF    A KY  Q+ G+  +  + Y       C   K ++
Sbjct:   282 LSPQEVVSCSPYAQ--GCDGGFPYLIAGKYA-QDFGVVEENCFPYTATDAP-C---KPKE 334

Query:   222 HAAQI--TNYEDVPP---NDEESLLKA--VANQPVSVAIDA-SALQFYSGGVFN--GYCE 271
             +  +   + Y  V        E+L+K   V + P++VA +       Y  G+++  G  +
Sbjct:   335 NCLRYYSSEYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSD 394

Query:   272 TF-----LNHGVTAVGYGTSE-EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIA 325
              F      NH V  VGYG     G+ YW++KNSWG  WGE GYFR++R  D+   +  IA
Sbjct:   395 PFNPFELTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRIRRGTDECAIE-SIA 453

Query:   326 MFASFPVSK 334
             M A+ P+ K
Sbjct:   454 M-AAIPIPK 461


>UNIPROTKB|P07858 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9606 "Homo sapiens"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0005739 "mitochondrion" evidence=IEA] [GO:0042470 "melanosome"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042981 "regulation of apoptotic process" evidence=TAS]
            [GO:0006508 "proteolysis" evidence=IDA] [GO:0005764 "lysosome"
            evidence=IDA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEP] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IDA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IDA] [GO:0005622 "intracellular" evidence=TAS]
            [GO:0036021 "endolysosome lumen" evidence=TAS] [GO:0045087 "innate
            immune response" evidence=TAS] [GO:0008233 "peptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0005730 "nucleolus" evidence=IDA] [GO:0043231 "intracellular
            membrane-bounded organelle" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_118779 GO:GO:0005739
            GO:GO:0042470 GO:GO:0048471 Reactome:REACT_6900 GO:GO:0005615
            GO:GO:0009612 GO:GO:0009611 GO:GO:0005730 GO:GO:0042981
            GO:GO:0009897 GO:GO:0045471 GO:GO:0016324 GO:GO:0009749
            GO:GO:0006914 GO:GO:0043434 eggNOG:COG4870 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087
            GO:GO:0050790 GO:GO:0042383 GO:GO:0014070 GO:GO:0042277
            GO:GO:0060548 GO:GO:0005901 GO:GO:0014075 GO:GO:0004197
            GO:GO:0070670 EMBL:CH471157 GO:GO:0007519 MEROPS:C01.060 CTD:1508
            HOGENOM:HOG000241341 HOVERGEN:HBG003480 KO:K01363 OMA:GYPSGAW
            OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 GO:GO:0097067
            PANTHER:PTHR12411:SF16 EMBL:M14221 EMBL:L16510 EMBL:AK092070
            EMBL:AK075393 EMBL:BC010240 EMBL:BC095408 EMBL:M13230
            IPI:IPI00295741 PIR:A26498 RefSeq:NP_001899.1 RefSeq:NP_680090.1
            RefSeq:NP_680091.1 RefSeq:NP_680092.1 RefSeq:NP_680093.1
            UniGene:Hs.520898 PDB:1CSB PDB:1GMY PDB:1HUC PDB:1PBH PDB:2IPP
            PDB:2PBH PDB:3AI8 PDB:3CBJ PDB:3CBK PDB:3K9M PDB:3PBH PDBsum:1CSB
            PDBsum:1GMY PDBsum:1HUC PDBsum:1PBH PDBsum:2IPP PDBsum:2PBH
            PDBsum:3AI8 PDBsum:3CBJ PDBsum:3CBK PDBsum:3K9M PDBsum:3PBH
            ProteinModelPortal:P07858 SMR:P07858 DIP:DIP-42785N IntAct:P07858
            MINT:MINT-1397666 STRING:P07858 PhosphoSite:P07858 DMDM:68067549
            SWISS-2DPAGE:P07858 UCD-2DPAGE:P07858 PaxDb:P07858
            PeptideAtlas:P07858 PRIDE:P07858 DNASU:1508 Ensembl:ENST00000345125
            Ensembl:ENST00000353047 Ensembl:ENST00000434271
            Ensembl:ENST00000453527 Ensembl:ENST00000530640
            Ensembl:ENST00000531089 Ensembl:ENST00000533455
            Ensembl:ENST00000534510 GeneID:1508 KEGG:hsa:1508 UCSC:uc003wum.3
            GeneCards:GC08M011700 H-InvDB:HIX0007320 HGNC:HGNC:2527
            HPA:CAB000457 HPA:HPA018156 MIM:116810 neXtProt:NX_P07858
            PharmGKB:PA27027 InParanoid:P07858 PhylomeDB:P07858
            BindingDB:P07858 ChEMBL:CHEMBL4072 ChiTaRS:CTSB
            EvolutionaryTrace:P07858 GenomeRNAi:1508 NextBio:6235
            PMAP-CutDB:P07858 ArrayExpress:P07858 Bgee:P07858 CleanEx:HS_CTSB
            Genevestigator:P07858 GermOnline:ENSG00000164733 GO:GO:0036021
            Uniprot:P07858
        Length = 339

 Score = 145 (56.1 bits), Expect = 1.2e-11, Sum P(2) = 1.2e-11
 Identities = 35/115 (30%), Positives = 53/115 (46%)

Query:   212 GICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA-SALQFYSGGVFNGYC 270
             G   + K + H     N   V  ++++ + +   N PV  A    S    Y  GV+    
Sbjct:   214 GYSPTYKQDKHYGY--NSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVT 271

Query:   271 -ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
              E    H +  +G+G  E G  YWL+ NSW  DWG++G+F++ R     Q  CGI
Sbjct:   272 GEMMGGHAIRILGWGV-ENGTPYWLVANSWNTDWGDNGFFKILRG----QDHCGI 321

 Score = 79 (32.9 bits), Expect = 1.2e-11, Sum P(2) = 1.2e-11
 Identities = 26/90 (28%), Positives = 45/90 (50%)

Query:   129 WIEKGAVTPVKYQGQC----AVAAVEGIN---AIKINRLVSL--SEQQLVDCATNDNNNG 179
             W +   +  ++ QG C    A  AVE I+    I  N  VS+  S + L+ C  +   +G
Sbjct:    90 WPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDG 149

Query:   180 CYGGFMDDAFKYIIQNKGITNDAVY-SYEG 208
             C GG+  +A+ +  + KG+ +  +Y S+ G
Sbjct:   150 CNGGYPAEAWNFWTR-KGLVSGGLYESHVG 178


>WB|WBGene00009158 [details] [associations]
            symbol:F26E4.3 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005576
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005044
            GeneTree:ENSGT00560000076599 HSSP:P07711 EMBL:Z81070
            eggNOG:NOG310046 HOGENOM:HOG000241342 OMA:DNCNRCT PIR:T21421
            RefSeq:NP_492593.2 ProteinModelPortal:P90850 SMR:P90850
            PaxDb:P90850 EnsemblMetazoa:F26E4.3.1 EnsemblMetazoa:F26E4.3.2
            GeneID:172827 KEGG:cel:CELE_F26E4.3 UCSC:F26E4.3.1 CTD:172827
            WormBase:F26E4.3 InParanoid:P90850 NextBio:877161 Uniprot:P90850
        Length = 452

 Score = 129 (50.5 bits), Expect = 1.7e-11, Sum P(2) = 1.7e-11
 Identities = 40/127 (31%), Positives = 54/127 (42%)

Query:   214 CDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF-YSGGVFN----- 267
             C S   +  A ++T    V   +E+   + + N PV          F Y+GGV+      
Sbjct:   302 CPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLA 361

Query:   268 ---GYCETFLN-HGVTAVGYGTSEE-G--IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
                G        H V  +G+G     G  IKYWL  NSWG  WGEDGYF++ R     + 
Sbjct:   362 AQKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGYFKVLRG----EN 417

Query:   321 QCGIAMF 327
              C I  F
Sbjct:   418 HCEIESF 424

 Score = 100 (40.3 bits), Expect = 1.7e-11, Sum P(2) = 1.7e-11
 Identities = 34/111 (30%), Positives = 48/111 (43%)

Query:   112 NGTPFLYKSSQVPPSVNWIEKGA--VTPVKYQGQCAVAAVEGINAIKINRLVSLSE---- 165
             N    L K  ++P   +  +K    + PV  QG C  +      AI  +RL  +SE    
Sbjct:   173 NMNEILIKPRELPEHFDARDKWGPLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRIN 232

Query:   166 -----QQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY-EGMS 210
                  QQL+ C       GC GG++D A+ YI +  G+  D  Y Y  G S
Sbjct:   233 STLSSQQLLSC-NQHRQKGCEGGYLDRAWWYI-RKLGVVGDHCYPYVSGQS 281


>MGI|MGI:88561 [details] [associations]
            symbol:Ctsb "cathepsin B" species:10090 "Mus musculus"
            [GO:0004175 "endopeptidase activity" evidence=ISO] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISO] [GO:0005576
            "extracellular region" evidence=ISO] [GO:0005615 "extracellular
            space" evidence=ISO] [GO:0005737 "cytoplasm" evidence=ISO]
            [GO:0005739 "mitochondrion" evidence=ISO;IDA] [GO:0005764
            "lysosome" evidence=ISO;IDA] [GO:0005901 "caveola" evidence=ISO]
            [GO:0006508 "proteolysis" evidence=ISO] [GO:0008233 "peptidase
            activity" evidence=ISO] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISO] [GO:0009897 "external side of plasma
            membrane" evidence=ISO] [GO:0009986 "cell surface" evidence=ISO]
            [GO:0016324 "apical plasma membrane" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0030984 "kininogen binding"
            evidence=ISO] [GO:0032403 "protein complex binding" evidence=ISO]
            [GO:0042277 "peptide binding" evidence=ISO] [GO:0042383
            "sarcolemma" evidence=ISO] [GO:0043621 "protein self-association"
            evidence=ISO] [GO:0048471 "perinuclear region of cytoplasm"
            evidence=ISO] [GO:0050790 "regulation of catalytic activity"
            evidence=IEA] [GO:0060548 "negative regulation of cell death"
            evidence=ISO] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 MGI:MGI:88561
            GO:GO:0005739 GO:GO:0042470 GO:GO:0048471 GO:GO:0005615
            GO:GO:0009612 GO:GO:0009611 GO:GO:0009897 GO:GO:0045471
            GO:GO:0016324 GO:GO:0009749 GO:GO:0006914 GO:GO:0043434
            eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0042383 GO:GO:0014070
            GO:GO:0042277 GO:GO:0060548 GO:GO:0005901 GO:GO:0014075
            GO:GO:0004197 GO:GO:0070670 GO:GO:0007519 MEROPS:C01.060 CTD:1508
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 KO:K01363 OMA:GYPSGAW OrthoDB:EOG4K6G4C
            BRENDA:3.4.22.1 GO:GO:0097067 PANTHER:PTHR12411:SF16 ChiTaRS:CTSB
            EMBL:M65270 EMBL:M65263 EMBL:M65264 EMBL:M65265 EMBL:M65266
            EMBL:M65267 EMBL:M65268 EMBL:M65269 EMBL:M14222 EMBL:X54966
            EMBL:S69034 EMBL:AK083393 EMBL:AK147192 EMBL:AK149884 EMBL:AK151790
            EMBL:AK167361 EMBL:BC006656 IPI:IPI00113517 PIR:A38458
            RefSeq:NP_031824.1 UniGene:Mm.236553 UniGene:Mm.489070
            ProteinModelPortal:P10605 SMR:P10605 IntAct:P10605 STRING:P10605
            PhosphoSite:P10605 SWISS-2DPAGE:P10605 PaxDb:P10605 PRIDE:P10605
            Ensembl:ENSMUST00000006235 GeneID:13030 KEGG:mmu:13030
            UCSC:uc007uhh.1 InParanoid:P10605 BioCyc:MetaCyc:MONOMER-14810
            BindingDB:P10605 ChEMBL:CHEMBL5187 NextBio:282900 Bgee:P10605
            CleanEx:MM_CTSB Genevestigator:P10605 GermOnline:ENSMUSG00000021939
            Uniprot:P10605
        Length = 339

 Score = 143 (55.4 bits), Expect = 3.3e-11, Sum P(2) = 3.3e-11
 Identities = 36/115 (31%), Positives = 54/115 (46%)

Query:   212 GICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA-SALQFYSGGVFNGYC 270
             G   S K + H    T+Y  V  + +E + +   N PV  A    S    Y  GV+    
Sbjct:   214 GYSPSYKEDKHFGY-TSYS-VSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEA 271

Query:   271 ETFLN-HGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
                +  H +  +G+G  E G+ YWL  NSW  DWG++G+F++ R     +  CGI
Sbjct:   272 GDMMGGHAIRILGWGV-ENGVPYWLAANSWNLDWGDNGFFKILRG----ENHCGI 321

 Score = 77 (32.2 bits), Expect = 3.3e-11, Sum P(2) = 3.3e-11
 Identities = 25/86 (29%), Positives = 41/86 (47%)

Query:   129 WIEKGAVTPVKYQGQC----AVAAVEGIN---AIKIN-RL-VSLSEQQLVDCATNDNNNG 179
             W     +  ++ QG C    A  AVE I+    I  N R+ V +S + L+ C      +G
Sbjct:    90 WSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGDG 149

Query:   180 CYGGFMDDAFKYIIQNKGITNDAVYS 205
             C GG+   A+ +  + KG+ +  VY+
Sbjct:   150 CNGGYPSGAWSFWTK-KGLVSGGVYN 174


>UNIPROTKB|Q9GZM7 [details] [associations]
            symbol:TINAGL1 "Tubulointerstitial nephritis antigen-like"
            species:9606 "Homo sapiens" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=IEA] [GO:0043236 "laminin binding" evidence=IEA]
            [GO:0016197 "endosomal transport" evidence=TAS] [GO:0005201
            "extracellular matrix structural constituent" evidence=NAS]
            [GO:0005576 "extracellular region" evidence=NAS] [GO:0031012
            "extracellular matrix" evidence=IDA] [GO:0005615 "extracellular
            space" evidence=ISS] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0005737 GO:GO:0005615
            GO:GO:0006955 GO:GO:0030247 EMBL:CH471059 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0016197 EMBL:AC114488 GO:GO:0005044 GO:GO:0005201
            eggNOG:NOG310046 HOGENOM:HOG000241342 HOVERGEN:HBG053961
            EMBL:AF236155 EMBL:AF236151 EMBL:AF236152 EMBL:AF236153
            EMBL:AF236154 EMBL:AF236150 EMBL:AF205436 EMBL:AB050716
            EMBL:AB050719 EMBL:AK074124 EMBL:AY358421 EMBL:AF289569
            EMBL:AK027839 EMBL:AK292770 EMBL:AK298382 EMBL:AK075398
            EMBL:BC009048 EMBL:BC064633 IPI:IPI00005563 IPI:IPI00439435
            IPI:IPI00910801 RefSeq:NP_001191343.1 RefSeq:NP_001191344.1
            RefSeq:NP_071447.1 UniGene:Hs.199368 ProteinModelPortal:Q9GZM7
            SMR:Q9GZM7 IntAct:Q9GZM7 MINT:MINT-253718 STRING:Q9GZM7
            MEROPS:C01.975 PhosphoSite:Q9GZM7 DMDM:61213628 PaxDb:Q9GZM7
            PRIDE:Q9GZM7 Ensembl:ENST00000271064 Ensembl:ENST00000457433
            GeneID:64129 KEGG:hsa:64129 UCSC:uc001bta.3 CTD:64129
            GeneCards:GC01P032042 HGNC:HGNC:19168 HPA:HPA048695
            neXtProt:NX_Q9GZM7 PharmGKB:PA38810 InParanoid:Q9GZM7 OMA:DNCNRCT
            OrthoDB:EOG4BG8W0 PhylomeDB:Q9GZM7 ChiTaRS:TINAGL1 GenomeRNAi:64129
            NextBio:66016 ArrayExpress:Q9GZM7 Bgee:Q9GZM7 CleanEx:HS_TINAGL1
            Genevestigator:Q9GZM7 GermOnline:ENSG00000142910 Uniprot:Q9GZM7
        Length = 467

 Score = 140 (54.3 bits), Expect = 9.8e-11, Sum P(2) = 9.8e-11
 Identities = 38/117 (32%), Positives = 59/117 (50%)

Query:   225 QITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF-YSGGVFN------GYCETFLNHG 277
             Q+T    +  ND+E + + + N PV   ++     F Y GG+++      G  E +  HG
Sbjct:   339 QVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHG 398

Query:   278 VTAV---GYG--TSEEG--IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMF 327
               +V   G+G  T  +G  +KYW   NSWG  WGE G+FR+ R +++    C I  F
Sbjct:   399 THSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNE----CDIESF 451

 Score = 81 (33.6 bits), Expect = 9.8e-11, Sum P(2) = 9.8e-11
 Identities = 16/46 (34%), Positives = 27/46 (58%)

Query:   163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG 208
             LS Q L+ C T+    GC GG +D A+ + ++ +G+ +D  Y + G
Sbjct:   254 LSPQNLLSCDTHQQQ-GCRGGRLDGAW-WFLRRRGVVSDHCYPFSG 297


>WB|WBGene00021072 [details] [associations]
            symbol:W07B8.4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:FO081739 PIR:T31728 RefSeq:NP_503382.1
            HSSP:P53634 ProteinModelPortal:O16288 SMR:O16288 STRING:O16288
            MEROPS:C01.A39 PaxDb:O16288 EnsemblMetazoa:W07B8.4 GeneID:178611
            KEGG:cel:CELE_W07B8.4 UCSC:W07B8.4 CTD:178611 WormBase:W07B8.4
            InParanoid:O16288 OMA:ESQYGCK NextBio:901836 Uniprot:O16288
        Length = 335

 Score = 133 (51.9 bits), Expect = 1.1e-10, Sum P(2) = 1.1e-10
 Identities = 34/92 (36%), Positives = 44/92 (47%)

Query:   244 VANQPVSVA-IDASALQFYSGGVFNGYCETFLN-HGVTAVGYGTSEEGIKYWLIKNSWGQ 301
             +A+ PV V  I       Y  G++       L  H V  +G+G  + G  YWL  NSW  
Sbjct:   243 LAHGPVEVGFIVYEDFYLYKTGIYTHVAGGELGGHAVKMLGWGV-DNGTPYWLAANSWNT 301

Query:   302 DWGEDGYFRLQRDIDQPQGQCGI--AMFASFP 331
              WGE GYFR+ R +D+    CGI  A  A  P
Sbjct:   302 VWGEKGYFRILRGVDE----CGIESAAVAGMP 329

 Score = 83 (34.3 bits), Expect = 1.1e-10, Sum P(2) = 1.1e-10
 Identities = 28/92 (30%), Positives = 46/92 (50%)

Query:   128 NWIEKGAVTPVKYQGQC----AVAAVEGIN---AIKINRLVS--LSEQQLVDCATNDNN- 177
             +W +  +V  ++ Q  C    AVAA E I+    I  N  V+  LS + ++ C T   N 
Sbjct:    82 HWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVNTLLSAEDILTCCTGKFNC 141

Query:   178 -NGCYGGFMDDAFKYIIQNKGITNDAVYSYEG 208
              +GC GG+   A++Y ++N  +T  +  S  G
Sbjct:   142 GDGCEGGYPIQAWRYWVKNGLVTGGSFESQYG 173


>UNIPROTKB|E2QXH3 [details] [associations]
            symbol:TINAGL1 "Uncharacterized protein" species:9615
            "Canis lupus familiaris" [GO:0043236 "laminin binding"
            evidence=IEA] [GO:0031012 "extracellular matrix" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006955 "immune response" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044 "scavenger
            receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005737
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639 GO:GO:0031012
            GO:GO:0005044 GeneTree:ENSGT00560000076599 CTD:64129 OMA:DNCNRCT
            EMBL:AAEX03001668 RefSeq:XP_535330.3 Ensembl:ENSCAFT00000035659
            GeneID:478155 KEGG:cfa:478155 NextBio:20853523 Uniprot:E2QXH3
        Length = 467

 Score = 135 (52.6 bits), Expect = 1.4e-10, Sum P(2) = 1.4e-10
 Identities = 42/134 (31%), Positives = 65/134 (48%)

Query:   210 STGICDS--IKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF-YSGGVF 266
             +T  C S  + A D   Q+T    +  N++E + + + N PV   ++     F Y GG++
Sbjct:   323 ATARCPSSHVHAND-IYQVTPAYRLGTNEKEIMKELMENGPVQALMEVHEDFFLYQGGIY 381

Query:   267 N------GYCETFLNHGVTAV---GYG--TSEEG--IKYWLIKNSWGQDWGEDGYFRLQR 313
             +      G  E +  HG  +V   G+G  T  +G  +KYW   NSWG  WGE G+FR+ R
Sbjct:   382 SHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVR 441

Query:   314 DIDQPQGQCGIAMF 327
               ++    C I  F
Sbjct:   442 GANE----CDIESF 451

 Score = 85 (35.0 bits), Expect = 1.4e-10, Sum P(2) = 1.4e-10
 Identities = 17/46 (36%), Positives = 28/46 (60%)

Query:   163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG 208
             LS Q L+ C T+ N  GC GG +D A+ + ++ +G+ +D  Y + G
Sbjct:   254 LSPQNLLSCDTH-NQQGCRGGRLDGAW-WFLRRRGVVSDHCYPFVG 297


>WB|WBGene00000789 [details] [associations]
            symbol:cpz-2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GeneTree:ENSGT00560000076599 KO:K08568 EMBL:Z81103
            HSSP:P80067 PIR:T23720 RefSeq:NP_506318.1 ProteinModelPortal:P92005
            SMR:P92005 STRING:P92005 MEROPS:C01.A41 PaxDb:P92005
            EnsemblMetazoa:M04G12.2 GeneID:179818 KEGG:cel:CELE_M04G12.2
            UCSC:M04G12.2 CTD:179818 WormBase:M04G12.2 eggNOG:NOG275763
            InParanoid:P92005 OMA:VEYWIAR NextBio:906990 Uniprot:P92005
        Length = 467

 Score = 175 (66.7 bits), Expect = 1.6e-10, P = 1.6e-10
 Identities = 52/196 (26%), Positives = 91/196 (46%)

Query:   137 PVKYQGQCAVAAVEG-----INAIKINR--LVSLSEQQLVDCATNDNNNGCYGGFMDDAF 189
             PV Y G C V    G      N  +  R  +  LS Q+++DC  N   N C GG + +  
Sbjct:   245 PV-YCGSCWVFGTTGALNDRFNVARKGRWPMTQLSPQEIIDC--NGKGN-CQGGEIGNVL 300

Query:   190 KYIIQNKGITNDAVYSYEGMSTGICDSIKA-----EDHAAQITNYEDVPPND------EE 238
             ++  + +G+  +    Y   + G C+          +    +TNY      D       +
Sbjct:   301 EHA-KIQGLVEEGCNVYRA-TNGECNPYHRCGSCWPNECFSLTNYTRYYVKDYGQVQGRD 358

Query:   239 SLLKAVANQ-PVSVAIDASA-LQF-YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLI 295
              ++  +    P++ AI A+   ++ Y  GV++   +   NH ++  G+G  E G++YW+ 
Sbjct:   359 KIMSEIKKGGPIACAIGATKKFEYEYVKGVYSEKSDLESNHIISLTGWGVDENGVEYWIA 418

Query:   296 KNSWGQDWGEDGYFRL 311
             +NSWG+ WGE G+FR+
Sbjct:   419 RNSWGEAWGELGWFRV 434


>WB|WBGene00013072 [details] [associations]
            symbol:Y51A2D.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            EMBL:AL021497 RefSeq:NP_001256811.1 ProteinModelPortal:O62484
            SMR:O62484 MEROPS:C01.A37 EnsemblMetazoa:Y51A2D.1 GeneID:180204
            KEGG:cel:CELE_Y51A2D.1 UCSC:Y51A2D.1 CTD:180204 WormBase:Y51A2D.1a
            HOGENOM:HOG000019851 NextBio:908416 Uniprot:O62484
        Length = 314

 Score = 111 (44.1 bits), Expect = 1.8e-10, Sum P(2) = 1.8e-10
 Identities = 34/109 (31%), Positives = 52/109 (47%)

Query:   228 NYEDVPPNDEESLLKAVANQ---PVSV--AIDASALQFYSGGVFNGYCETF--LNHGVTA 280
             +Y  + P + ES +  + N    PV+V  A   + LQ+ SG +    C+    + H    
Sbjct:   195 DYHFIRPENAESEIIEILNTWKTPVAVYFAAGTAFLQYKSGVLVTEDCDLAGTVWHAGAI 254

Query:   281 VGYGTSEE----GIKYWLIKNSWG-QDWGEDGYFRLQRDIDQPQGQCGI 324
             VGYG   +      ++W++KNSWG   WG  GY +L R     +  CGI
Sbjct:   255 VGYGEENDLRGRSQRFWIMKNSWGVSGWGTGGYVKLIRG----KNWCGI 299

 Score = 105 (42.0 bits), Expect = 1.8e-10, Sum P(2) = 1.8e-10
 Identities = 37/129 (28%), Positives = 62/129 (48%)

Query:    30 IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
             + ++F ++K ++ RTYK  AEN  R + F  +   V R N NA    R+    +N+F+DL
Sbjct:    40 VYQEFVEFKKKFSRTYKSEAENQLRLQNFVKSRNNVVRLNKNAQKAGRNSNFAVNQFSDL 99

Query:    89 TPQEFIASQTGF--KMSD----HSSSLKANG-TPFLYKSSQVPPSVNWIEKGA-----VT 136
             T  E     + F   +++    H +  K  G T    ++S+   + +   +       V 
Sbjct:   100 TTSELHQRLSRFPPNLTENSVFHKNFKKLLGKTRTKRQNSEFARNFDLRSQKVNGRYIVG 159

Query:   137 PVKYQGQCA 145
             P+K QGQCA
Sbjct:   160 PIKNQGQCA 168


>UNIPROTKB|E1B9H1 [details] [associations]
            symbol:TINAGL1 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0043236 "laminin binding" evidence=IEA] [GO:0031012
            "extracellular matrix" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0005044 "scavenger receptor
            activity" evidence=IEA] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0005737 GO:GO:0006955
            GO:GO:0030247 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00639 GO:GO:0031012 GO:GO:0005044
            GeneTree:ENSGT00560000076599 OMA:DNCNRCT EMBL:DAAA02006255
            IPI:IPI00732137 Ensembl:ENSBTAT00000038022 Uniprot:E1B9H1
        Length = 469

 Score = 128 (50.1 bits), Expect = 4.4e-10, Sum P(2) = 4.4e-10
 Identities = 41/134 (30%), Positives = 64/134 (47%)

Query:   210 STGICDS--IKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF-YSGGVF 266
             +T  C +  + A D   Q+T    +  N++E + + + N PV   ++     F Y  G++
Sbjct:   325 ATARCPNSYVHAND-IYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIY 383

Query:   267 N------GYCETFLNHGVTAV---GYG--TSEEG--IKYWLIKNSWGQDWGEDGYFRLQR 313
             +      G  E +  HG  +V   G+G  T  +G  IKYW   NSWG  WGE G+FR+ R
Sbjct:   384 SHTPVSLGRPERYRRHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPAWGERGHFRIVR 443

Query:   314 DIDQPQGQCGIAMF 327
               ++    C I  F
Sbjct:   444 GANE----CDIESF 453

 Score = 88 (36.0 bits), Expect = 4.4e-10, Sum P(2) = 4.4e-10
 Identities = 27/78 (34%), Positives = 42/78 (53%)

Query:   141 QGQCA------VAAVEGINAIKINRL--VS--LSEQQLVDCATNDNNNGCYGGFMDDAFK 190
             QG CA       AAV   + + I+ L  +S  LS Q L+ C T+ N  GC GG +D A+ 
Sbjct:   225 QGNCAGSWAFSTAAVAS-DRVSIHSLGHMSPVLSPQNLLSCDTH-NQQGCRGGRLDGAW- 281

Query:   191 YIIQNKGITNDAVYSYEG 208
             + ++ +G+ +D  Y + G
Sbjct:   282 WFLRRRGVVSDHCYPFSG 299


>UNIPROTKB|F1SVA2 [details] [associations]
            symbol:TINAGL1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005615 "extracellular space" evidence=IDA] [GO:0043236
            "laminin binding" evidence=IEA] [GO:0031012 "extracellular matrix"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044
            "scavenger receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005737
            GO:GO:0005615 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0031012 GO:GO:0005044 GeneTree:ENSGT00560000076599
            OMA:DNCNRCT EMBL:CU856262 Ensembl:ENSSSCT00000003995 Uniprot:F1SVA2
        Length = 467

 Score = 124 (48.7 bits), Expect = 1.2e-09, Sum P(2) = 1.2e-09
 Identities = 39/134 (29%), Positives = 64/134 (47%)

Query:   210 STGICDS--IKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF-YSGGVF 266
             +T  C +  + A D   Q+T    +  N+++ + + + N PV   ++     F Y  G++
Sbjct:   323 ATARCPNSYVHAND-IYQVTPAYRLGSNEKDIMKELMENGPVQALMEVHEDFFLYQSGIY 381

Query:   267 N------GYCETFLNHGVTAV---GYG--TSEEG--IKYWLIKNSWGQDWGEDGYFRLQR 313
             +      G  E +  HG  +V   G+G  T  +G  +KYW   NSWG  WGE G+FR+ R
Sbjct:   382 SHTPVSHGRPERYRRHGTHSVKITGWGEETLPDGRMLKYWTAANSWGPGWGERGHFRIVR 441

Query:   314 DIDQPQGQCGIAMF 327
               ++    C I  F
Sbjct:   442 GANE----CDIESF 451

 Score = 88 (36.0 bits), Expect = 1.2e-09, Sum P(2) = 1.2e-09
 Identities = 17/46 (36%), Positives = 28/46 (60%)

Query:   163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG 208
             LS Q L+ C T+ N  GC GG +D A+ + ++ +G+ +D  Y + G
Sbjct:   254 LSPQNLLSCDTH-NQQGCQGGRLDGAW-WFLRRRGVVSDHCYPFSG 297


>WB|WBGene00000785 [details] [associations]
            symbol:cpr-5 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39896 EMBL:L39927 EMBL:FO081739
            PIR:T37277 RefSeq:NP_503383.1 UniGene:Cel.19730
            ProteinModelPortal:P43509 SMR:P43509 DIP:DIP-25329N IntAct:P43509
            MINT:MINT-1051285 STRING:P43509 MEROPS:C01.A35 PaxDb:P43509
            EnsemblMetazoa:W07B8.5 GeneID:178612 KEGG:cel:CELE_W07B8.5
            UCSC:W07B8.5.1 CTD:178612 WormBase:W07B8.5 InParanoid:P43509
            OMA:DAIPDHF NextBio:901840 Uniprot:P43509
        Length = 344

 Score = 132 (51.5 bits), Expect = 1.4e-09, Sum P(2) = 1.4e-09
 Identities = 31/90 (34%), Positives = 45/90 (50%)

Query:   237 EESLLKAVANQPVSVAIDA-SALQFYSGGVFNGYCETFLN-HGVTAVGYGTSEEGIKYWL 294
             E+   + + N P+ VA         Y+ GV+       L  H V  +G+G  + G  YWL
Sbjct:   245 EQIQTEILTNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGV-DNGTPYWL 303

Query:   295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
             + NSW   WGE GYFR+ R +++    CGI
Sbjct:   304 VANSWNVAWGEKGYFRIIRGLNE----CGI 329

 Score = 74 (31.1 bits), Expect = 1.4e-09, Sum P(2) = 1.4e-09
 Identities = 26/74 (35%), Positives = 37/74 (50%)

Query:   142 GQC-AVAAVEGIN---AIKINRLVS--LSEQQLVDCATN--DNNNGCYGGFMDDAFKYII 193
             G C A AA E I+    I  N  V+  LS + L+ C T      NGC GG+   A+K+ +
Sbjct:   108 GSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLLSCCTGMFSCGNGCEGGYPIQAWKWWV 167

Query:   194 QNKGITNDAVYSYE 207
             ++  +T     SYE
Sbjct:   168 KHGLVTGG---SYE 178

 Score = 44 (20.5 bits), Expect = 1.7e-06, Sum P(2) = 1.7e-06
 Identities = 16/57 (28%), Positives = 26/57 (45%)

Query:   129 WIEKGAVTPVKYQGQ--CAVAAV----EGINAIKINRLVSLSEQ--QLVDCATNDNN 177
             W++ G VT   Y+ Q  C   ++    E +N +K       +E   + VD  T+ NN
Sbjct:   166 WVKHGLVTGGSYETQFGCKPYSIAPCGETVNGVKWPACPEDTEPTPKCVDSCTSKNN 222


>TAIR|locus:505006093 [details] [associations]
            symbol:AT1G02305 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0005829 "cytosol" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684 GO:GO:0005773
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 HOGENOM:HOG000241341
            KO:K01363 PANTHER:PTHR12411:SF16 OMA:CCGFLCG UniGene:At.23486
            UniGene:At.42610 UniGene:At.43952 EMBL:AY039887 EMBL:AF428337
            EMBL:BT002227 IPI:IPI00524601 RefSeq:NP_563648.1 HSSP:P07858
            ProteinModelPortal:Q93VC9 SMR:Q93VC9 IntAct:Q93VC9 STRING:Q93VC9
            MEROPS:C01.049 PRIDE:Q93VC9 ProMEX:Q93VC9 EnsemblPlants:AT1G02305.1
            GeneID:839538 KEGG:ath:AT1G02305 TAIR:At1g02305 InParanoid:Q93VC9
            PhylomeDB:Q93VC9 ProtClustDB:CLSN2687619 Genevestigator:Q93VC9
            Uniprot:Q93VC9
        Length = 362

 Score = 164 (62.8 bits), Expect = 1.6e-09, P = 1.6e-09
 Identities = 33/101 (32%), Positives = 55/101 (54%)

Query:   226 ITNYEDVPPNDEESLLKAVANQPVSVAIDA-SALQFYSGGVFNGYCETFLN-HGVTAVGY 283
             ++ Y+ V  + ++ + +   N PV VA         Y  GV+     T +  H V  +G+
Sbjct:   238 VSAYK-VRSHPDDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGW 296

Query:   284 GTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
             GTS++G  YWL+ N W + WG+DGYF+++R  ++    CGI
Sbjct:   297 GTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNE----CGI 333


>WB|WBGene00000788 [details] [associations]
            symbol:cpz-1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] [GO:0009792 "embryo development ending in
            birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0010171 "body morphogenesis" evidence=IMP]
            [GO:0018996 "molting cycle, collagen and cuticulin-based cuticle"
            evidence=IMP] [GO:0031012 "extracellular matrix" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0009792 GO:GO:0040007 GO:GO:0002119
            GO:GO:0018996 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0010171 GO:GO:0031012
            GeneTree:ENSGT00560000076599 KO:K08568 OMA:QCGTCTE EMBL:FO081275
            EMBL:BK001409 PIR:T29872 RefSeq:NP_491023.2 HSSP:Q9UBR2
            ProteinModelPortal:G5EGP8 SMR:G5EGP8 IntAct:G5EGP8 MEROPS:C01.A38
            EnsemblMetazoa:F32B5.8 GeneID:171829 KEGG:cel:CELE_F32B5.8
            CTD:171829 WormBase:F32B5.8 NextBio:872879 Uniprot:G5EGP8
        Length = 306

 Score = 161 (61.7 bits), Expect = 2.2e-09, P = 2.2e-09
 Identities = 44/164 (26%), Positives = 81/164 (49%)

Query:   163 LSEQQLVDCATNDNNNGCY-GGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKA-- 219
             LS Q+++DC+       C  GG     +KY  ++ GI ++   +Y+    G CD      
Sbjct:   121 LSVQEVIDCS---GAGTCVMGGEPGGVYKYAHEH-GIPHETCNNYQARD-GKCDPYNRCG 175

Query:   220 ---EDHAAQITNY------EDVPPNDEESLLKAVANQ-PVSVAIDAS-ALQFYSGGVFNG 268
                      I NY      E    +  E +   + ++ P++  I A+ A + Y+GG++  
Sbjct:   176 SCWPGECFSIKNYTLYKVSEYGTVHGYEKMKAEIYHKGPIACGIAATKAFETYAGGIYKE 235

Query:   269 YCETFLNHGVTAVGYGTSEE-GIKYWLIKNSWGQDWGEDGYFRL 311
               +  ++H ++  G+G   E G++YW+ +NSWG+ WGE G+F++
Sbjct:   236 VTDEDIDHIISVHGWGVDHESGVEYWIGRNSWGEPWGEHGWFKI 279


>RGD|70956 [details] [associations]
            symbol:Tinagl1 "tubulointerstitial nephritis antigen-like 1"
           species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
           activity" evidence=IEA] [GO:0005576 "extracellular region"
           evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA;ISO] [GO:0006508
           "proteolysis" evidence=IEA] [GO:0006955 "immune response"
           evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
           evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
           [GO:0031012 "extracellular matrix" evidence=IEA;ISO] [GO:0043236
           "laminin binding" evidence=IEA;ISO] InterPro:IPR000668
           InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
           PROSITE:PS50958 SMART:SM00201 SMART:SM00645 RGD:70956 GO:GO:0005737
           GO:GO:0005576 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
           GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
           GO:GO:0031012 GO:GO:0005044 eggNOG:NOG310046 HOGENOM:HOG000241342
           HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OrthoDB:EOG4BG8W0
           EMBL:AB050717 IPI:IPI00190428 RefSeq:NP_446034.1 UniGene:Rn.1256
           ProteinModelPortal:Q9EQT5 PRIDE:Q9EQT5 GeneID:94174 KEGG:rno:94174
           UCSC:RGD:70956 InParanoid:Q9EQT5 NextBio:617830 ArrayExpress:Q9EQT5
           Genevestigator:Q9EQT5 GermOnline:ENSRNOG00000013179 Uniprot:Q9EQT5
        Length = 467

 Score = 127 (49.8 bits), Expect = 2.3e-09, Sum P(2) = 2.3e-09
 Identities = 39/133 (29%), Positives = 65/133 (48%)

Query:   210 STGICDSIKAEDHAA-QITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF-YSGGVFN 267
             +T  C + + + +   Q+T    +  +++E + + + N PV   ++     F Y  G+++
Sbjct:   323 ATSRCPNSQVDSNDIYQVTPVYRLASDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYS 382

Query:   268 ------GYCETFLNHGVTAV---GYG--TSEEG--IKYWLIKNSWGQDWGEDGYFRLQRD 314
                   G  E +  HG  +V   G+G  T  +G  IKYW   NSWG  WGE G+FR+ R 
Sbjct:   383 HTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRG 442

Query:   315 IDQPQGQCGIAMF 327
             I++    C I  F
Sbjct:   443 INE----CDIETF 451

 Score = 82 (33.9 bits), Expect = 2.3e-09, Sum P(2) = 2.3e-09
 Identities = 16/46 (34%), Positives = 28/46 (60%)

Query:   163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG 208
             LS Q L+ C T+ +  GC GG +D A+ + ++ +G+ +D  Y + G
Sbjct:   253 LSPQNLLSCDTH-HQKGCRGGRLDGAW-WFLRRRGVVSDNCYPFSG 296


>UNIPROTKB|Q9EQT5 [details] [associations]
            symbol:Tinagl1 "Tubulointerstitial nephritis antigen-like"
            species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 RGD:70956 GO:GO:0005737
            GO:GO:0005576 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0031012 GO:GO:0005044 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OrthoDB:EOG4BG8W0
            EMBL:AB050717 IPI:IPI00190428 RefSeq:NP_446034.1 UniGene:Rn.1256
            ProteinModelPortal:Q9EQT5 PRIDE:Q9EQT5 GeneID:94174 KEGG:rno:94174
            UCSC:RGD:70956 InParanoid:Q9EQT5 NextBio:617830 ArrayExpress:Q9EQT5
            Genevestigator:Q9EQT5 GermOnline:ENSRNOG00000013179 Uniprot:Q9EQT5
        Length = 467

 Score = 127 (49.8 bits), Expect = 2.3e-09, Sum P(2) = 2.3e-09
 Identities = 39/133 (29%), Positives = 65/133 (48%)

Query:   210 STGICDSIKAEDHAA-QITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF-YSGGVFN 267
             +T  C + + + +   Q+T    +  +++E + + + N PV   ++     F Y  G+++
Sbjct:   323 ATSRCPNSQVDSNDIYQVTPVYRLASDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYS 382

Query:   268 ------GYCETFLNHGVTAV---GYG--TSEEG--IKYWLIKNSWGQDWGEDGYFRLQRD 314
                   G  E +  HG  +V   G+G  T  +G  IKYW   NSWG  WGE G+FR+ R 
Sbjct:   383 HTPVSQGRPEQYRRHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRG 442

Query:   315 IDQPQGQCGIAMF 327
             I++    C I  F
Sbjct:   443 INE----CDIETF 451

 Score = 82 (33.9 bits), Expect = 2.3e-09, Sum P(2) = 2.3e-09
 Identities = 16/46 (34%), Positives = 28/46 (60%)

Query:   163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG 208
             LS Q L+ C T+ +  GC GG +D A+ + ++ +G+ +D  Y + G
Sbjct:   253 LSPQNLLSCDTH-HQKGCRGGRLDGAW-WFLRRRGVVSDNCYPFSG 296


>WB|WBGene00000784 [details] [associations]
            symbol:cpr-4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39895 EMBL:L39926 EMBL:FO081381
            PIR:T37280 RefSeq:NP_504682.1 UniGene:Cel.5404
            ProteinModelPortal:P43508 SMR:P43508 DIP:DIP-25376N
            MINT:MINT-1069892 STRING:P43508 MEROPS:C01.A34 PaxDb:P43508
            EnsemblMetazoa:F44C4.3 GeneID:179053 KEGG:cel:CELE_F44C4.3
            UCSC:F44C4.3 CTD:179053 WormBase:F44C4.3 InParanoid:P43508
            OMA:CCGFLCG NextBio:903704 Uniprot:P43508
        Length = 335

 Score = 131 (51.2 bits), Expect = 5.5e-09, Sum P(2) = 5.5e-09
 Identities = 28/83 (33%), Positives = 44/83 (53%)

Query:   244 VANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQ 301
             +A+ PV  A        Q+ +G   +   +    H +  +G+GT + G  YWL+ NSW  
Sbjct:   247 IAHGPVEAAFTVYEDFYQYKTGVYVHTTGQELGGHAIRILGWGT-DNGTPYWLVANSWNV 305

Query:   302 DWGEDGYFRLQRDIDQPQGQCGI 324
             +WGE+GYFR+ R  ++    CGI
Sbjct:   306 NWGENGYFRIIRGTNE----CGI 324

 Score = 69 (29.3 bits), Expect = 5.5e-09, Sum P(2) = 5.5e-09
 Identities = 24/72 (33%), Positives = 37/72 (51%)

Query:   142 GQC-AVAAVEGIN---AIKINRLVS--LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
             G C A AA E  +    I  N  V+  LS + ++ C +N    GC GG+  +A+KY++++
Sbjct:   107 GSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCSNCGY-GCEGGYPINAWKYLVKS 165

Query:   196 KGITNDAVYSYE 207
                T     SYE
Sbjct:   166 GFCTGG---SYE 174


>DICTYBASE|DDB_G0280187 [details] [associations]
            symbol:DDB_G0280187 "cathepsin Z-like protein"
            species:44689 "Dictyostelium discoideum" [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            dictyBase:DDB_G0280187 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            EMBL:AAFI02000035 KO:K08568 RefSeq:XP_641294.1
            ProteinModelPortal:Q54VR1 MEROPS:C01.A61 PRIDE:Q54VR1
            EnsemblProtists:DDB0233838 GeneID:8622427 KEGG:ddi:DDB_G0280187
            InParanoid:Q54VR1 OMA:VWKVGDY Uniprot:Q54VR1
        Length = 291

 Score = 156 (60.0 bits), Expect = 7.0e-09, P = 7.0e-09
 Identities = 53/174 (30%), Positives = 88/174 (50%)

Query:   161 VSLSEQQLVDCATNDNN-NGC--YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSI 217
             V L+ Q L++CA  DN  +G      +   A K I        +A+ + E  + GIC + 
Sbjct:   103 VVLAPQVLLNCAGPDNTCDGGDPTEAYAYMAAKGITDETCAPYEAIDN-ECNAEGICKNC 161

Query:   218 KAE------DHAAQ--ITNY---EDVPPNDEESLLKAV-ANQPVSVAIDAS-ALQFYSGG 264
               +      D  AQ   T Y   E    N   ++++ + A  P++  ++ + A + Y+ G
Sbjct:   162 NFDLSNPTADCFAQPTYTTYFVEEHGQVNGSVAMMQEIFARGPIACGMEVTDAFESYTSG 221

Query:   265 VFNGYCETF--LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
             VF     +   +NH ++ +G+GT E G+ YW+ +NSWG  +GE G+FR+QR ID
Sbjct:   222 VFTSSVGSTGEINHEISIIGWGT-ENGVDYWIGRNSWGTYFGELGFFRIQRGID 274


>ZFIN|ZDB-GENE-040426-2650 [details] [associations]
            symbol:ctsba "cathepsin B, a" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0031101 "fin regeneration"
            evidence=IEP] [GO:0008233 "peptidase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 ZFIN:ZDB-GENE-040426-2650 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790
            GO:GO:0004197 GO:GO:0031101 MEROPS:C01.060 HOVERGEN:HBG003480
            PANTHER:PTHR12411:SF16 HSSP:P07688 EMBL:BC044517 IPI:IPI00485996
            UniGene:Dr.3374 ProteinModelPortal:Q803E4 SMR:Q803E4 STRING:Q803E4
            PRIDE:Q803E4 InParanoid:Q803E4 ArrayExpress:Q803E4 Bgee:Q803E4
            Uniprot:Q803E4
        Length = 330

 Score = 157 (60.3 bits), Expect = 8.1e-09, P = 8.1e-09
 Identities = 37/109 (33%), Positives = 52/109 (47%)

Query:   218 KAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA-SALQFYSGGVFNGYCETFLN- 275
             K + H  + T+Y  VP N    + +   N PV  A         Y  GV+     + L  
Sbjct:   219 KEDKHFGK-TSYS-VPSNQNGIMAELFKNGPVEAAFTVYEDFLLYKSGVYQHMSGSALGG 276

Query:   276 HGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
             H +  +G+G  E G+ YWL  NSW  DWG++GYF++ R  D     CGI
Sbjct:   277 HAIKILGWG-EENGVPYWLAANSWNTDWGDNGYFKILRGEDH----CGI 320


>MGI|MGI:2137617 [details] [associations]
            symbol:Tinagl1 "tubulointerstitial nephritis antigen-like 1"
            species:10090 "Mus musculus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            [GO:0043236 "laminin binding" evidence=IPI] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 MGI:MGI:2137617
            GO:GO:0005737 GO:GO:0005576 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GO:GO:0031012 CleanEx:MM_ARG1 GO:GO:0005044
            GeneTree:ENSGT00560000076599 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OMA:DNCNRCT
            OrthoDB:EOG4BG8W0 EMBL:AB047402 EMBL:AB050626 EMBL:BC005738
            EMBL:BC018539 IPI:IPI00115458 RefSeq:NP_001161805.1
            RefSeq:NP_075965.2 UniGene:Mm.15801 ProteinModelPortal:Q99JR5
            SMR:Q99JR5 STRING:Q99JR5 PhosphoSite:Q99JR5 PaxDb:Q99JR5
            PRIDE:Q99JR5 Ensembl:ENSMUST00000030560 Ensembl:ENSMUST00000105998
            Ensembl:ENSMUST00000105999 GeneID:94242 KEGG:mmu:94242
            InParanoid:Q99JR5 NextBio:352247 Bgee:Q99JR5 Genevestigator:Q99JR5
            GermOnline:ENSMUSG00000028776 Uniprot:Q99JR5
        Length = 466

 Score = 120 (47.3 bits), Expect = 1.4e-08, Sum P(2) = 1.4e-08
 Identities = 36/108 (33%), Positives = 54/108 (50%)

Query:   235 NDEESLLKAVA-NQPVSVAIDASALQF-YSGGVFN------GYCETFLNHGVTAV---GY 283
             +DE+ ++K +  N PV   ++     F Y  G+++      G  E +  HG  +V   G+
Sbjct:   347 SDEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGW 406

Query:   284 G--TSEEG--IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMF 327
             G  T  +G  IKYW   NSWG  WGE G+FR+ R  ++    C I  F
Sbjct:   407 GEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNE----CDIETF 450

 Score = 82 (33.9 bits), Expect = 1.4e-08, Sum P(2) = 1.4e-08
 Identities = 16/46 (34%), Positives = 28/46 (60%)

Query:   163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG 208
             LS Q L+ C T+ +  GC GG +D A+ + ++ +G+ +D  Y + G
Sbjct:   253 LSPQNLLSCDTH-HQQGCRGGRLDGAW-WFLRRRGVVSDNCYPFSG 296


>FB|FBgn0030521 [details] [associations]
            symbol:CtsB1 "Cathepsin B1" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS] [GO:0035071 "salivary gland cell autophagic cell
            death" evidence=IEP] [GO:0048102 "autophagic cell death"
            evidence=IEP] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:AE014298 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0035071
            GO:GO:0004197 MEROPS:C01.060 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 KO:K01363 PANTHER:PTHR12411:SF16
            HSSP:P07688 EMBL:AY060640 RefSeq:NP_572920.1 UniGene:Dm.3926
            SMR:Q9VY87 IntAct:Q9VY87 MINT:MINT-932864 STRING:Q9VY87
            EnsemblMetazoa:FBtr0073838 GeneID:32341 KEGG:dme:Dmel_CG10992
            UCSC:CG10992-RA FlyBase:FBgn0030521 InParanoid:Q9VY87 OMA:TEGHIRR
            OrthoDB:EOG48W9HM ChiTaRS:CG10992 GenomeRNAi:32341 NextBio:778020
            Uniprot:Q9VY87
        Length = 340

 Score = 137 (53.3 bits), Expect = 1.6e-08, Sum P(2) = 1.6e-08
 Identities = 34/93 (36%), Positives = 46/93 (49%)

Query:   235 NDEESLLKAVANQPVSVAIDA-SALQFYSGGVF-NGYCETFLNHGVTAVGYGT-SEEGIK 291
             N  E   + + N PV  A      L  Y  GV+ + + +    H +  +G+G   EE I 
Sbjct:   241 NVREIQEEIMTNGPVEGAFTVYEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGEEKIP 300

Query:   292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
             YWLI NSW  DWG+ G+FR+ R     Q  CGI
Sbjct:   301 YWLIGNSWNTDWGDHGFFRILRG----QDHCGI 329

 Score = 58 (25.5 bits), Expect = 1.6e-08, Sum P(2) = 1.6e-08
 Identities = 27/90 (30%), Positives = 38/90 (42%)

Query:   129 WIEKGAVTPVKYQGQC----AVAAVEGIN---AIKINRLVSL--SEQQLVDCATNDNNNG 179
             W     +  ++ QG C    A  AVE ++    I     V+   S   LV C  +    G
Sbjct:    97 WPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCC-HTCGFG 155

Query:   180 CYGGFMDDAFKYIIQNKGITNDAVY-SYEG 208
             C GGF   A+ Y  + KGI +   Y S +G
Sbjct:   156 CNGGFPGAAWSYWTR-KGIVSGGPYGSNQG 184


>WB|WBGene00022026 [details] [associations]
            symbol:Y65B4A.2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 SMART:SM00645 GO:GO:0008234
            GO:GO:0006508 PANTHER:PTHR12411 GeneTree:ENSGT00560000076599
            PANTHER:PTHR12411:SF16 HSSP:P07688 EMBL:FO081482 RefSeq:NP_490763.1
            ProteinModelPortal:Q9BL59 MEROPS:C01.A46 PaxDb:Q9BL59
            EnsemblMetazoa:Y65B4A.2.1 EnsemblMetazoa:Y65B4A.2.2 GeneID:171655
            KEGG:cel:CELE_Y65B4A.2 UCSC:Y65B4A.2 CTD:171655 WormBase:Y65B4A.2
            eggNOG:NOG311760 HOGENOM:HOG000017674 InParanoid:Q9BL59 OMA:DRIVYWH
            NextBio:872169 Uniprot:Q9BL59
        Length = 421

 Score = 121 (47.7 bits), Expect = 1.6e-08, Sum P(2) = 1.6e-08
 Identities = 32/96 (33%), Positives = 48/96 (50%)

Query:   226 ITNYEDVPPNDEESLLKAVANQPVSVAIDA-SALQFYSGGVFNGY-CETFLN-----HGV 278
             +T Y D+    +E LL      P ++A         YS GVF  Y  + F +     H V
Sbjct:   318 VTEYRDIIK--KEILLYG----PTTMAFPVPEEFLHYSSGVFRPYPTDGFDDRIVYWHVV 371

Query:   279 TAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
               +G+G S++G  YWL  NS+G  WG++G F++  D
Sbjct:   372 RLIGWGESDDGTHYWLAVNSFGNHWGDNGLFKINTD 407

 Score = 79 (32.9 bits), Expect = 1.6e-08, Sum P(2) = 1.6e-08
 Identities = 45/167 (26%), Positives = 74/167 (44%)

Query:    67 RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL-YKSSQVPP 125
             +FN   + NRSY  +  +      +E++     F  SD   ++K +      + SS VP 
Sbjct:    85 KFNKFGVKNRSYGFKYTR-NQTAVEEYVEQIRKFFESD---AMKRHLDELENFNSSDVPK 140

Query:   126 SVN----WIEKGAVTPVKYQGQC----AVAAVEGINA----IKINRLVS--LSEQQLVDC 171
             + +    W    +++ V  QG C    AVAA  G+ +    I  N      LSE+ ++ C
Sbjct:   141 NFDARQKWPNCPSISNVPNQGGCGSCFAVAAA-GVASDRACIHSNGTFKSLLSEEDIIGC 199

Query:   172 ATNDNNNGCYGGFMDDAFKYIIQNKGITN---DAV--YSYEGMSTGI 213
              +   N  CYGG    A  Y + N+G+     D    YS++ +S G+
Sbjct:   200 CSVCGN--CYGGDPLKALTYWV-NQGLVTGGRDGCRPYSFD-LSCGV 242


>ZFIN|ZDB-GENE-060503-240 [details] [associations]
            symbol:tinagl1 "tubulointerstitial nephritis
            antigen-like 1" species:7955 "Danio rerio" [GO:0005044 "scavenger
            receptor activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0030414 "peptidase inhibitor activity" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0002040 "sprouting
            angiogenesis" evidence=IMP] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR008037 InterPro:IPR013128 Pfam:PF00112 Pfam:PF05375
            PROSITE:PS00524 PROSITE:PS50958 SMART:SM00201 SMART:SM00645
            ZFIN:ZDB-GENE-060503-240 GO:GO:0006955 GO:GO:0030247 GO:GO:0030414
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 GO:GO:0002040
            GO:GO:0005044 GeneTree:ENSGT00560000076599 GO:GO:0010466
            SUPFAM:SSF57283 HOVERGEN:HBG053961 MEROPS:C01.975 OMA:DNCNRCT
            EMBL:BX950864 IPI:IPI00609339 UniGene:Dr.103937
            Ensembl:ENSDART00000087096 Ensembl:ENSDART00000126228
            InParanoid:Q1LUC6 Uniprot:Q1LUC6
        Length = 471

 Score = 126 (49.4 bits), Expect = 3.3e-08, Sum P(2) = 3.3e-08
 Identities = 34/107 (31%), Positives = 51/107 (47%)

Query:   235 NDEESLLKAVANQPVSVAIDASALQF-YSGGVFN----GYCET-----FLNHGVTAVGYG 284
             N+ E + + + N PV   ++     F Y  G+F      Y +         H V   G+G
Sbjct:   345 NENEIMKEIMDNGPVQAIMEVHEDFFVYKSGIFRHTDVNYHKPSQYRKHATHSVRITGWG 404

Query:   285 TSEE--GI--KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMF 327
                +  G   KYW+  NSWG++WGEDGYFR+ R +++    C I  F
Sbjct:   405 EERDYSGRTRKYWIGANSWGKNWGEDGYFRIARGVNE----CDIETF 447

 Score = 72 (30.4 bits), Expect = 3.3e-08, Sum P(2) = 3.3e-08
 Identities = 25/98 (25%), Positives = 46/98 (46%)

Query:   120 SSQVPPSVNWIEK--GAVTPVKYQGQC------AVAAV--EGINAIKINRLV-SLSEQQL 168
             +  +P   N ++K  G +     QG C      + AAV  + I+   +  +   LS Q L
Sbjct:   197 NDHLPSYFNAVDKWPGKIHEPLDQGNCNASWAFSTAAVASDRISIQSMGHMTPQLSPQNL 256

Query:   169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
             + C T  + +GC GG +D A+ + ++ +G+     Y +
Sbjct:   257 ISCDTR-HQDGCAGGRIDGAW-WFMRRRGVVTQDCYPF 292


>UNIPROTKB|Q3SZI1 [details] [associations]
            symbol:TINAG "Tubulointerstitial nephritis antigen"
            species:9913 "Bos taurus" [GO:0005604 "basement membrane"
            evidence=IEA] [GO:0007155 "cell adhesion" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044
            "scavenger receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 Pfam:PF01033
            PROSITE:PS00524 PROSITE:PS50958 SMART:SM00201 SMART:SM00645
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0007155
            GO:GO:0005604 GO:GO:0005044 GeneTree:ENSGT00560000076599
            EMBL:BC102843 IPI:IPI00689615 RefSeq:NP_001030279.1
            UniGene:Bt.29080 ProteinModelPortal:Q3SZI1 MEROPS:C01.973
            PRIDE:Q3SZI1 Ensembl:ENSBTAT00000016790 GeneID:512517
            KEGG:bta:512517 CTD:27283 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 InParanoid:Q3SZI1 OMA:WGQLTSS OrthoDB:EOG47PX5P
            NextBio:20870427 Uniprot:Q3SZI1
        Length = 476

 Score = 131 (51.2 bits), Expect = 3.9e-08, Sum P(2) = 3.9e-08
 Identities = 36/123 (29%), Positives = 58/123 (47%)

Query:   210 STGIC-DSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF-YSGGVFN 267
             +T  C +SI+  +   Q +    V  N+ E + + + N PV   +      F Y  G++ 
Sbjct:   334 ATTPCPNSIEKSNRIYQCSPPYRVSSNETEIMREIMQNGPVQAIMQVHEDFFNYKTGIYR 393

Query:   268 GYCET---------FLNHGVTAVGYGT--SEEGIK--YWLIKNSWGQDWGEDGYFRLQRD 314
                 T         F  H V   G+GT    +G K  +W+  NSWG+ WGE+GYFR+ R 
Sbjct:   394 HITSTNEDSEKYRKFRTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRG 453

Query:   315 IDQ 317
             +++
Sbjct:   454 VNE 456

 Score = 66 (28.3 bits), Expect = 3.9e-08, Sum P(2) = 3.9e-08
 Identities = 14/43 (32%), Positives = 24/43 (55%)

Query:   162 SLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
             +LS Q L+ C      +GC  G +D A+ Y+ + +G+ + A Y
Sbjct:   267 NLSPQNLISCCAK-KRHGCNSGSVDRAWWYL-RKRGLVSHACY 307


>WB|WBGene00000786 [details] [associations]
            symbol:cpr-6 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            PANTHER:PTHR12411:SF16 EMBL:L39894 EMBL:L39939 EMBL:FO080666
            PIR:T37274 RefSeq:NP_741818.1 UniGene:Cel.18138
            ProteinModelPortal:P43510 SMR:P43510 DIP:DIP-25139N
            MINT:MINT-1074025 STRING:P43510 MEROPS:C01.A51 PaxDb:P43510
            PRIDE:P43510 EnsemblMetazoa:C25B8.3a GeneID:180931
            KEGG:cel:CELE_C25B8.3 UCSC:C25B8.3a CTD:180931 WormBase:C25B8.3a
            InParanoid:P43510 OMA:KAKWGLM NextBio:911608 ArrayExpress:P43510
            Uniprot:P43510
        Length = 379

 Score = 152 (58.6 bits), Expect = 4.2e-08, P = 4.2e-08
 Identities = 33/93 (35%), Positives = 52/93 (55%)

Query:   235 NDEESLLKAVANQ-PVSVAIDA-SALQFYSGGVF-NGYCETFLNHGVTAVGYGTSEEGIK 291
             +D E++ K +    P+ +A +       Y GGV+ +   +    H V  +G+G  ++GI 
Sbjct:   261 DDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGI-DDGIP 319

Query:   292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
             YW + NSW  DWGEDG+FR+ R +D+    CGI
Sbjct:   320 YWTVANSWNTDWGEDGFFRILRGVDE----CGI 348


>UNIPROTKB|I3L9E7 [details] [associations]
            symbol:LOC100153159 "Uncharacterized protein" species:9823
            "Sus scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GeneTree:ENSGT00560000076599 OMA:WGQLTSS
            Ensembl:ENSSSCT00000031207 Uniprot:I3L9E7
        Length = 358

 Score = 121 (47.7 bits), Expect = 5.0e-08, Sum P(2) = 5.0e-08
 Identities = 30/100 (30%), Positives = 48/100 (48%)

Query:   232 VPPNDEESLLKAVANQPVSVAIDASALQF-YSGGVFNGYCET---------FLNHGVTAV 281
             V  N+ E + + + N PV   +      F Y  G++     T            H V   
Sbjct:   239 VSSNETEIMREIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNEESDKYRKLRTHAVKLT 298

Query:   282 GYGTSE--EGIK--YWLIKNSWGQDWGEDGYFRLQRDIDQ 317
             G+GT +  +G K  +W+  NSWG+ WGE+GYFR+ R +++
Sbjct:   299 GWGTLKGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNE 338

 Score = 72 (30.4 bits), Expect = 5.0e-08, Sum P(2) = 5.0e-08
 Identities = 15/43 (34%), Positives = 25/43 (58%)

Query:   162 SLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
             +LS Q L+ C    N +GC  G +D A+ Y+ + +G+ + A Y
Sbjct:   149 NLSPQNLISCCAK-NRHGCNSGSIDRAWWYL-RKRGLVSHACY 189

WARNING:  HSPs involving 32 database sequences were not reported due to the
          limiting value of parameter B = 250.


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.315   0.131   0.388    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      348       348   0.00099  116 3  11 22  0.43    34
                                                     33  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  282
  No. of states in DFA:  614 (65 KB)
  Total size of DFA:  246 KB (2132 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  28.58u 0.09s 28.67t   Elapsed:  00:00:03
  Total cpu time:  28.66u 0.09s 28.75t   Elapsed:  00:00:03
  Start:  Tue May 21 01:25:49 2013   End:  Tue May 21 01:25:52 2013
WARNINGS ISSUED:  2

Back to top