BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>019112
MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA
MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP
STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE
QQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK
YEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTA
EEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEASYPVAM

High Scoring Gene Products

Symbol | Full name | Information | P value
AT2G27420 | - | protein from Arabidopsis thaliana | 2.4e-92
AT3G49340 | - | protein from Arabidopsis thaliana | 3.9e-92
SAG12 | senescence-associated gene 12 | protein from Arabidopsis thaliana | 3.2e-88
AT1G29090 | - | protein from Arabidopsis thaliana | 2.9e-87
AT2G34080 | - | protein from Arabidopsis thaliana | 4.3e-86
XCP1 | xylem cysteine peptidase 1 | protein from Arabidopsis thaliana | 1.0e-84
CEP1 | cysteine endopeptidase 1 | protein from Arabidopsis thaliana | 1.5e-83
AT1G29080 | - | protein from Arabidopsis thaliana | 1.7e-82
XCP2 | AT1G20850 | protein from Arabidopsis thaliana | 1.7e-82
RD21A | responsive to dehydration 21A | protein from Arabidopsis thaliana | 9.7e-80
RD21B | responsive to dehydration 21B | protein from Arabidopsis thaliana | 3.3e-79
AT3G19390 | - | protein from Arabidopsis thaliana | 1.3e-75
CEP3 | cysteine endopeptidase 3 | protein from Arabidopsis thaliana | 2.7e-75
AT3G19400 | - | protein from Arabidopsis thaliana | 1.5e-74
CP1 | cysteine protease 1 | protein from Arabidopsis thaliana | 1.7e-73
XBCP3 | xylem bark cysteine peptidase 3 | protein from Arabidopsis thaliana | 2.8e-73
AT4G23520 | - | protein from Arabidopsis thaliana | 7.5e-73
CP2 | cysteine protease 2 | protein from Arabidopsis thaliana | 3.2e-72
Cp1 | Cysteine proteinase-1 | protein from Drosophila melanogaster | 2.1e-70
AT1G06260 | - | protein from Arabidopsis thaliana | 4.3e-70
AT1G29110 | - | protein from Arabidopsis thaliana | 2.1e-68
cprE | cysteine proteinase 5 | gene from Dictyostelium discoideum | 4.2e-67
Ssc.54235 | Uncharacterized protein | protein from Sus scrofa | 1.3e-66
CTSL2 | Uncharacterized protein | protein from Gallus gallus | 2.8e-66
ctsl1a | cathepsin L, 1 a | gene_product from Danio rerio | 3.2e-65
Ctss | cathepsin S | protein from Mus musculus | 5.2e-65
Ctsll3 | cathepsin L-like 3 | gene from Rattus norvegicus | 1.1e-64
zgc:174855 | - | gene_product from Danio rerio | 1.8e-64
wu:fb37b09 | - | gene_product from Danio rerio | 2.3e-64
RGD1308751 | similar to Cathepsin L precursor (Major excreted protein) (MEP) | gene from Rattus norvegicus | 2.9e-64
zgc:174153 | - | gene_product from Danio rerio | 6.0e-64
CTSL1 | Cathepsin L1 | protein from Canis lupus familiaris | 1.6e-63
CTSL1 | Cathepsin L1 | protein from Homo sapiens | 1.6e-63
ctsl1b | cathepsin L, 1 b | gene_product from Danio rerio | 1.6e-63
CTSL1 | Cathepsin L1 | protein from Sus scrofa | 2.0e-63
AT3G43960 | - | protein from Arabidopsis thaliana | 2.0e-63
CTSS | Uncharacterized protein | protein from Sus scrofa | 2.6e-63
Ctsl | cathepsin L | protein from Mus musculus | 2.6e-63
CTSS | Cathepsin S | protein from Canis lupus familiaris | 3.3e-63
CTSS | Cathepsin S | protein from Canis lupus familiaris | 3.3e-63
CTSS | Cathepsin S | protein from Homo sapiens | 4.2e-63
ctsll | cathepsin L, like | gene_product from Danio rerio | 4.2e-63
Ctsl1 | cathepsin L1 | gene from Rattus norvegicus | 5.4e-63
CTSL1 | CTSL1 protein | protein from Bos taurus | 1.1e-62
CTSS | Cathepsin S | protein from Bos taurus | 1.4e-62
cprC | cysteine proteinase 3 | gene from Dictyostelium discoideum | 1.0e-61
cpl-1 | - | gene from Caenorhabditis elegans | 1.6e-61
CTSL1 | Cathepsin L1 | protein from Bos taurus | 2.1e-61
cprB | cysteine proteinase 2 | gene from Dictyostelium discoideum | 2.6e-61
CTSL1 | Cathepsin L1 | protein from Canis lupus familiaris | 4.3e-61
cprF | cysteine proteinase 6 | gene from Dictyostelium discoideum | 8.8e-61
CTSL2 | Cathepsin L2 | protein from Bos taurus | 1.9e-60
ctsl.1 | cathepsin L.1 | gene_product from Danio rerio | 2.4e-60
LOC420160 | Uncharacterized protein | protein from Gallus gallus | 3.1e-60
CTSL2 | Cathepsin L2 | protein from Homo sapiens | 5.0e-60
Ctsk | cathepsin K | gene from Rattus norvegicus | 1.3e-59
ctsk | cathepsin K | gene_product from Danio rerio | 1.3e-59
ctssb.2 | cathepsin S, b.2 | gene_product from Danio rerio | 1.7e-59
CTSK | Cathepsin K | protein from Homo sapiens | 2.2e-59
Ctss | cathepsin S | gene from Rattus norvegicus | 9.3e-59
ctsh | cathepsin H | gene_product from Danio rerio | 1.2e-58
Cys | Crustapain | protein from Pandalus borealis | 1.5e-58
Ctsk | cathepsin K | protein from Mus musculus | 4.0e-58
Testin | testin gene | gene from Rattus norvegicus | 1.1e-57
CTSK | Cathepsin K | protein from Sus scrofa | 1.7e-57
Ctsj | cathepsin J | protein from Mus musculus | 1.7e-57
cprH | cysteine proteinase 8 | gene from Dictyostelium discoideum | 4.6e-57
4930486L24Rik | RIKEN cDNA 4930486L24 gene | protein from Mus musculus | 4.6e-57
ctssb.1 | cathepsin S, b.1 | gene_product from Danio rerio | 7.5e-57
CTSH | Uncharacterized protein | protein from Oryctolagus cuniculus | 9.6e-57
D3ZZR3 | Uncharacterized protein | protein from Rattus norvegicus | 9.6e-57
CTSK | Cathepsin K | protein from Bos taurus | 1.2e-56
CTSH | Uncharacterized protein | protein from Callithrix jacchus | 1.6e-56
CTSH | Uncharacterized protein | protein from Callithrix jacchus | 1.6e-56
CTSK | Cathepsin K | protein from Canis lupus familiaris | 2.0e-56
CTSK | Cathepsin K | protein from Canis lupus familiaris | 2.0e-56
Ctsh | cathepsin H | gene from Rattus norvegicus | 5.3e-56
Ctsj | cathepsin J | gene from Rattus norvegicus | 6.8e-56
CG12163 | - | protein from Drosophila melanogaster | 8.6e-56
AT3G45310 | - | protein from Arabidopsis thaliana | 8.6e-56
CTSH | Pro-cathepsin H | protein from Sus scrofa | 2.3e-55
MGC114246 | similar to cathepsin R | gene from Rattus norvegicus | 2.9e-55
CTSH | Uncharacterized protein | protein from Nomascus leucogenys | 3.7e-55
CTSH | Uncharacterized protein | protein from Ailuropoda melanoleuca | 4.8e-55
CTSH | Pro-cathepsin H | protein from Bos taurus | 6.1e-55
CTSH | Uncharacterized protein | protein from Macaca mulatta | 6.1e-55
CTSH | Pro-cathepsin H | protein from Homo sapiens | 9.9e-55
Ctsh | cathepsin H | protein from Mus musculus | 1.3e-54
CTSS | Uncharacterized protein | protein from Gallus gallus | 1.6e-54
CTSH | Uncharacterized protein | protein from Gorilla gorilla gorilla | 1.6e-54
CTSL2 | Uncharacterized protein | protein from Gallus gallus | 2.6e-54
CTSL1 | Cathepsin L1 | protein from Gallus gallus | 3.4e-54
LOC100662496 | Uncharacterized protein | protein from Loxodonta africana | 4.3e-54
Ctsq | cathepsin Q | gene from Rattus norvegicus | 5.5e-54
CTSH | Uncharacterized protein | protein from Equus caballus | 1.9e-53
cfaD | peptidase C1A family protein | gene from Dictyostelium discoideum | 2.4e-53
J9P7C5 | Uncharacterized protein | protein from Canis lupus familiaris | 3.0e-53
CTSH | Uncharacterized protein | protein from Canis lupus familiaris | 1.0e-52

The BLAST search returned 2 gene products which did not match your query constraints. Please see the full BLAST report below for the details.


Raw BLAST Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  019112
        (346 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2038588 - symbol:AT2G27420 species:3702 "Arabi...   920  2.4e-92   1
TAIR|locus:2082881 - symbol:AT3G49340 species:3702 "Arabi...   918  3.9e-92   1
TAIR|locus:2152445 - symbol:SAG12 "senescence-associated ...   881  3.2e-88   1
TAIR|locus:2029924 - symbol:AT1G29090 species:3702 "Arabi...   872  2.9e-87   1
TAIR|locus:2055440 - symbol:AT2G34080 species:3702 "Arabi...   861  4.3e-86   1
TAIR|locus:2122113 - symbol:XCP1 "xylem cysteine peptidas...   848  1.0e-84   1
TAIR|locus:2157712 - symbol:CEP1 "cysteine endopeptidase ...   837  1.5e-83   1
TAIR|locus:2029934 - symbol:AT1G29080 species:3702 "Arabi...   827  1.7e-82   1
TAIR|locus:2030427 - symbol:XCP2 "xylem cysteine peptidas...   827  1.7e-82   1
TAIR|locus:2825832 - symbol:RD21A "responsive to dehydrat...   801  9.7e-80   1
TAIR|locus:2167821 - symbol:RD21B "esponsive to dehydrati...   796  3.3e-79   1
TAIR|locus:2090614 - symbol:AT3G19390 species:3702 "Arabi...   762  1.3e-75   1
TAIR|locus:505006391 - symbol:CEP3 "cysteine endopeptidas...   759  2.7e-75   1
TAIR|locus:2090629 - symbol:AT3G19400 species:3702 "Arabi...   752  1.5e-74   1
TAIR|locus:2128243 - symbol:AT4G11310 species:3702 "Arabi...   742  1.7e-73   1
TAIR|locus:2024362 - symbol:XBCP3 "xylem bark cysteine pe...   740  2.8e-73   1
TAIR|locus:2117979 - symbol:AT4G23520 species:3702 "Arabi...   736  7.5e-73   1
TAIR|locus:2128253 - symbol:AT4G11320 species:3702 "Arabi...   730  3.2e-72   1
FB|FBgn0013770 - symbol:Cp1 "Cysteine proteinase-1" speci...   713  2.1e-70   1
TAIR|locus:2038515 - symbol:AT1G06260 species:3702 "Arabi...   710  4.3e-70   1
TAIR|locus:2030027 - symbol:AT1G29110 species:3702 "Arabi...   694  2.1e-68   1
DICTYBASE|DDB_G0272815 - symbol:cprE "cysteine proteinase...   563  4.2e-67   2
UNIPROTKB|F1S4J6 - symbol:Ssc.54235 "Cathepsin L1" specie...   677  1.3e-66   1
UNIPROTKB|F1NYJ1 - symbol:CTSL2 "Uncharacterized protein"...   674  2.8e-66   1
ZFIN|ZDB-GENE-030131-106 - symbol:ctsl1a "cathepsin L, 1 ...   664  3.2e-65   1
MGI|MGI:107341 - symbol:Ctss "cathepsin S" species:10090 ...   662  5.2e-65   1
RGD|1560071 - symbol:Ctsll3 "cathepsin L-like 3" species:...   659  1.1e-64   1
ZFIN|ZDB-GENE-071004-74 - symbol:zgc:174855 "zgc:174855" ...   657  1.8e-64   1
ZFIN|ZDB-GENE-030131-572 - symbol:wu:fb37b09 "wu:fb37b09"...   656  2.3e-64   1
RGD|1308751 - symbol:RGD1308751 "similar to Cathepsin L p...   655  2.9e-64   1
ZFIN|ZDB-GENE-080215-7 - symbol:zgc:174153 "zgc:174153" s...   652  6.0e-64   1
UNIPROTKB|Q9GL24 - symbol:CTSL1 "Cathepsin L1" species:96...   648  1.6e-63   1
UNIPROTKB|P07711 - symbol:CTSL1 "Cathepsin L1" species:96...   648  1.6e-63   1
ZFIN|ZDB-GENE-980526-285 - symbol:ctsl1b "cathepsin L, 1 ...   648  1.6e-63   1
UNIPROTKB|Q28944 - symbol:CTSL1 "Cathepsin L1" species:98...   647  2.0e-63   1
TAIR|locus:2097104 - symbol:AT3G43960 species:3702 "Arabi...   647  2.0e-63   1
UNIPROTKB|F1SS93 - symbol:CTSS "Uncharacterized protein" ...   646  2.6e-63   1
MGI|MGI:88564 - symbol:Ctsl "cathepsin L" species:10090 "...   646  2.6e-63   1
UNIPROTKB|F1PAK0 - symbol:CTSS "Cathepsin S" species:9615...   645  3.3e-63   1
UNIPROTKB|Q8HY81 - symbol:CTSS "Cathepsin S" species:9615...   645  3.3e-63   1
UNIPROTKB|P25774 - symbol:CTSS "Cathepsin S" species:9606...   644  4.2e-63   1
ZFIN|ZDB-GENE-041010-76 - symbol:ctsll "cathepsin L, like...   644  4.2e-63   1
RGD|2448 - symbol:Ctsl1 "cathepsin L1" species:10116 "Rat...   643  5.4e-63   1
UNIPROTKB|A4IFS7 - symbol:CTSL1 "CTSL1 protein" species:9...   640  1.1e-62   1
UNIPROTKB|P25326 - symbol:CTSS "Cathepsin S" species:9913...   639  1.4e-62   1
DICTYBASE|DDB_G0283867 - symbol:cprC "cysteine proteinase...   631  1.0e-61   1
WB|WBGene00000776 - symbol:cpl-1 species:6239 "Caenorhabd...   629  1.6e-61   1
UNIPROTKB|P25975 - symbol:CTSL1 "Cathepsin L1" species:99...   628  2.1e-61   1
DICTYBASE|DDB_G0279799 - symbol:cprB "cysteine proteinase...   519  2.6e-61   2
UNIPROTKB|F1PMM9 - symbol:CTSL1 "Cathepsin L1" species:96...   625  4.3e-61   1
DICTYBASE|DDB_G0279185 - symbol:cprF "cysteine proteinase...   515  8.8e-61   2
UNIPROTKB|Q5E998 - symbol:CTSL2 "Cathepsin L2" species:99...   619  1.9e-60   1
ZFIN|ZDB-GENE-040718-61 - symbol:ctsl.1 "cathepsin L.1" s...   618  2.4e-60   1
UNIPROTKB|F1NZ37 - symbol:LOC420160 "Uncharacterized prot...   617  3.1e-60   1
UNIPROTKB|O60911 - symbol:CTSL2 "Cathepsin L2" species:96...   615  5.0e-60   1
RGD|61810 - symbol:Ctsk "cathepsin K" species:10116 "Ratt...   611  1.3e-59   1
ZFIN|ZDB-GENE-001205-4 - symbol:ctsk "cathepsin K" specie...   611  1.3e-59   1
ZFIN|ZDB-GENE-050626-55 - symbol:ctssb.2 "cathepsin S, b....   610  1.7e-59   1
UNIPROTKB|P43235 - symbol:CTSK "Cathepsin K" species:9606...   609  2.2e-59   1
RGD|621513 - symbol:Ctss "cathepsin S" species:10116 "Rat...   603  9.3e-59   1
ZFIN|ZDB-GENE-030131-3539 - symbol:ctsh "cathepsin H" spe...   602  1.2e-58   1
UNIPROTKB|Q86GF7 - symbol:Cys "Crustapain" species:6703 "...   601  1.5e-58   1
MGI|MGI:107823 - symbol:Ctsk "cathepsin K" species:10090 ...   597  4.0e-58   1
RGD|708447 - symbol:Testin "testin gene" species:10116 "R...   593  1.1e-57   1
UNIPROTKB|Q4QRC2 - symbol:Ctsql2 "Protein Ctsql2" species...   593  1.1e-57   1
UNIPROTKB|Q9GLE3 - symbol:CTSK "Cathepsin K" species:9823...   591  1.7e-57   1
MGI|MGI:1349426 - symbol:Ctsj "cathepsin J" species:10090...   591  1.7e-57   1
DICTYBASE|DDB_G0278401 - symbol:cprH "cysteine proteinase...   587  4.6e-57   1
MGI|MGI:1922258 - symbol:4930486L24Rik "RIKEN cDNA 493048...   587  4.6e-57   1
ZFIN|ZDB-GENE-050522-559 - symbol:ctssb.1 "cathepsin S, b...   585  7.5e-57   1
UNIPROTKB|G1SQF0 - symbol:CTSH "Uncharacterized protein" ...   584  9.6e-57   1
UNIPROTKB|D3ZZR3 - symbol:D3ZZR3 "Uncharacterized protein...   584  9.6e-57   1
UNIPROTKB|Q5E968 - symbol:CTSK "Cathepsin K" species:9913...   583  1.2e-56   1
UNIPROTKB|F7B939 - symbol:CTSH "Uncharacterized protein" ...   582  1.6e-56   1
UNIPROTKB|F7BRD4 - symbol:CTSH "Uncharacterized protein" ...   582  1.6e-56   1
UNIPROTKB|G1K2A7 - symbol:CTSK "Cathepsin K" species:9615...   581  2.0e-56   1
UNIPROTKB|Q3ZKN1 - symbol:CTSK "Cathepsin K" species:9615...   581  2.0e-56   1
RGD|2447 - symbol:Ctsh "cathepsin H" species:10116 "Rattu...   577  5.3e-56   1
RGD|69241 - symbol:Ctsj "cathepsin J" species:10116 "Ratt...   576  6.8e-56   1
FB|FBgn0260462 - symbol:CG12163 species:7227 "Drosophila ...   575  8.6e-56   1
TAIR|locus:2078312 - symbol:AT3G45310 species:3702 "Arabi...   575  8.6e-56   1
UNIPROTKB|O46427 - symbol:CTSH "Pro-cathepsin H" species:...   571  2.3e-55   1
RGD|1562210 - symbol:MGC114246 "similar to cathepsin R" s...   570  2.9e-55   1
UNIPROTKB|G1RBY1 - symbol:CTSH "Uncharacterized protein" ...   569  3.7e-55   1
UNIPROTKB|G1M0X4 - symbol:CTSH "Uncharacterized protein" ...   568  4.8e-55   1
UNIPROTKB|Q3T0I2 - symbol:CTSH "Pro-cathepsin H" species:...   567  6.1e-55   1
UNIPROTKB|F6R7P5 - symbol:CTSH "Uncharacterized protein" ...   567  6.1e-55   1
UNIPROTKB|E9PSK9 - symbol:Ctsql2 "Protein Ctsql2" species...   567  6.1e-55   1
UNIPROTKB|P09668 - symbol:CTSH "Pro-cathepsin H" species:...   565  9.9e-55   1
MGI|MGI:107285 - symbol:Ctsh "cathepsin H" species:10090 ...   564  1.3e-54   1
UNIPROTKB|H9KYW5 - symbol:CTSS "Uncharacterized protein" ...   563  1.6e-54   1
UNIPROTKB|G3R9A7 - symbol:CTSH "Uncharacterized protein" ...   563  1.6e-54   1
UNIPROTKB|F1NEC8 - symbol:CTSL2 "Uncharacterized protein"...   561  2.6e-54   1
UNIPROTKB|P09648 - symbol:CTSL1 "Cathepsin L1" species:90...   560  3.4e-54   1
UNIPROTKB|G3SSC1 - symbol:CTSH "Uncharacterized protein" ...   559  4.3e-54   1
RGD|631421 - symbol:Ctsq "cathepsin Q" species:10116 "Rat...   558  5.5e-54   1
UNIPROTKB|F7BJD8 - symbol:CTSH "Uncharacterized protein" ...   553  1.9e-53   1
DICTYBASE|DDB_G0281605 - symbol:cfaD "peptidase C1A famil...   552  2.4e-53   1
UNIPROTKB|J9P7C5 - symbol:J9P7C5 "Uncharacterized protein...   551  3.0e-53   1
UNIPROTKB|F6X9C1 - symbol:CTSH "Uncharacterized protein" ...   546  1.0e-52   1

WARNING:  Descriptions of 200 database sequences were not reported due to the
          limiting value of parameter V = 100.
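
The one-line hit summaries above can be read programmatically. The following is a minimal sketch, assuming each row ends with three whitespace-separated fields (Score, P(N), N) as in this report. The function name `parse_hit_rows` and the dictionary keys are illustrative and not part of any BLAST tool. The final filter reuses the 0.001 threshold listed under Parameters.

```python
import re

# A hit row is a free-text description followed by Score, P(N) and N.
# The description may itself contain spaces and digits, so the pattern is
# anchored at the end of the line and the description is matched lazily.
HIT_ROW = re.compile(
    r"^(?P<desc>\S.*?)\s+(?P<score>\d+)\s+(?P<p>[\d.]+(?:e[-+]?\d+)?)\s+(?P<n>\d+)\s*$"
)
SYMBOL = re.compile(r"symbol:(\S+)")


def parse_hit_rows(lines):
    """Return one dict per hit row; lines that do not look like hits are skipped."""
    hits = []
    for line in lines:
        match = HIT_ROW.match(line)
        if match is None:
            continue
        desc = match.group("desc")
        database, _, rest = desc.partition("|")
        symbol = SYMBOL.search(desc)
        hits.append({
            "db": database,
            "accession": rest.split(" - ", 1)[0],
            "symbol": symbol.group(1) if symbol else None,
            "score": int(match.group("score")),
            "p": float(match.group("p")),
            "n": int(match.group("n")),
        })
    return hits


rows = [
    'Sequences producing High-scoring Segment Pairs:              Score  P(N)      N',
    'TAIR|locus:2038588 - symbol:AT2G27420 species:3702 "Arabi...   920  2.4e-92   1',
    'DICTYBASE|DDB_G0272815 - symbol:cprE "cysteine proteinase...   563  4.2e-67   2',
]
hits = parse_hit_rows(rows)
# Keep only hits at or below the P-value threshold from the Parameters section.
significant = [hit for hit in hits if hit["p"] <= 0.001]
```

The column header row is skipped because its last three fields are not all numeric.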


>TAIR|locus:2038588 [details] [associations]
            symbol:AT2G27420 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002685
            GenomeReviews:CT485783_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC006232
            MEROPS:I29.003 OMA:EEFRATH HOGENOM:HOG000230773 HSSP:P53634
            ProtClustDB:CLSN2688476 EMBL:AY064033 EMBL:AY096388 IPI:IPI00539752
            PIR:F84672 RefSeq:NP_565649.1 UniGene:At.27094
            ProteinModelPortal:Q9ZQH7 SMR:Q9ZQH7 PRIDE:Q9ZQH7
            EnsemblPlants:AT2G27420.1 GeneID:817287 KEGG:ath:AT2G27420
            TAIR:At2g27420 InParanoid:Q9ZQH7 PhylomeDB:Q9ZQH7
            ArrayExpress:Q9ZQH7 Genevestigator:Q9ZQH7 Uniprot:Q9ZQH7
        Length = 348

 Score = 920 (328.9 bits), Expect = 2.4e-92, P = 2.4e-92
 Identities = 179/333 (53%), Positives = 226/333 (67%)

Query:    25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTY 84
             S   S  S+ E S +EKHEQWMA+  R Y DE EK  R  IFK+NLE+++  N     TY
Sbjct:    18 SLATSRGSLFEASAIEKHEQWMARFNRVYSDETEKRNRFNIFKKNLEFVQNFNMNNKITY 77

Query:    85 KLGTNEFSDLTNEEFRASYTGY----NXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREK 140
             K+  NEFSDLT+EEFRA++TG                     F+Y NV+D   S+DWR++
Sbjct:    78 KVDINEFSDLTDEEFRATHTGLVVPEAITRISTLSSGKNTVPFRYGNVSDNGESMDWRQE 137

Query:   141 GAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMD 199
             GAVT +K QG CG CWAFSAVAAVEGIT+IT G+L+ LSEQQL+DC  D N GC GG+M 
Sbjct:   138 GAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQLLDCDRDYNQGCRGGIMS 197

Query:   200 KAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAA---AATIGKYEDLPKGDEHALLQAV 256
             KAFEYII+N+G+ TE +YPYQ+ Q TC      ++   AATI  YE +P  +E ALLQAV
Sbjct:   198 KAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATISGYETVPMNNEEALLQAV 257

Query:   257 TKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWG 316
             ++QPVSV +E +G AFR Y  GV N ECG +  H V +VG+G +EE  G KYW++KNSWG
Sbjct:   258 SQQPVSVGIEGTGAAFRHYSGGVFNGECGTDLHHAVTIVGYGMSEE--GTKYWVVKNSWG 315

Query:   317 ETWGESGYIRILRD----EGLCGIATEASYPVA 345
             ETWGE+GY+RI RD    +G+CG+A  A YP+A
Sbjct:   316 ETWGENGYMRIKRDVDAPQGMCGLAILAFYPLA 348
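
The statistics lines of the record above can be checked with a few lines of arithmetic. This is a sketch under two assumptions that the report itself does not state: that the printed percentages are truncated rather than rounded (179/333 is 53.75%, shown as 53%), and that P and Expect follow the usual Poisson relation P = 1 - exp(-E), which makes them practically identical for very small values. The helper names are illustrative.

```python
import math
import re

SCORE_RE = re.compile(
    r"Score = (\d+) \(([\d.]+) bits\), Expect = ([^,\s]+), P = ([^,\s]+)"
)
IDENT_RE = re.compile(
    r"Identities = (\d+)/(\d+) \(\d+%\), Positives = (\d+)/(\d+) \(\d+%\)"
)


def parse_hsp_stats(score_line, identity_line):
    """Extract the numeric fields from the two statistics lines of one alignment."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("unrecognized statistics lines")
    identical, aligned, positive, aligned_pos = map(int, i.groups())
    return {
        "score": int(s.group(1)),
        "bits": float(s.group(2)),
        "expect": float(s.group(3)),
        "p": float(s.group(4)),
        # Integer division reproduces the truncated percentages in the report.
        "identity_pct": 100 * identical // aligned,
        "positive_pct": 100 * positive // aligned_pos,
    }


def p_from_expect(expect):
    """P = 1 - exp(-E); expm1 keeps precision when E is tiny."""
    return -math.expm1(-expect)


stats = parse_hsp_stats(
    " Score = 920 (328.9 bits), Expect = 2.4e-92, P = 2.4e-92",
    " Identities = 179/333 (53%), Positives = 226/333 (67%)",
)
```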


>TAIR|locus:2082881 [details] [associations]
            symbol:AT3G49340 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002686
            GenomeReviews:BA000014_GR EMBL:AC012329 EMBL:AL132956
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 MEROPS:I29.003 HOGENOM:HOG000230773 HSSP:P07711
            KO:K01376 IPI:IPI00520642 PIR:T45839 RefSeq:NP_566920.1
            UniGene:At.53854 ProteinModelPortal:Q9SG15 SMR:Q9SG15
            EnsemblPlants:AT3G49340.1 GeneID:824096 KEGG:ath:AT3G49340
            TAIR:At3g49340 InParanoid:Q9SG15 OMA:PQNDEEA PhylomeDB:Q9SG15
            ProtClustDB:CLSN2688476 Genevestigator:Q9SG15 Uniprot:Q9SG15
        Length = 341

 Score = 918 (328.2 bits), Expect = 3.9e-92, P = 3.9e-92
 Identities = 174/328 (53%), Positives = 224/328 (68%)

Query:    25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTY 84
             S V S   + E S VEKHEQWM++  R Y D+ EK  R  IF  NL+++E  N   N+TY
Sbjct:    18 SGVTSRGGLFEASAVEKHEQWMSRFNRVYSDDSEKTSRFEIFTNNLKFVESINMNTNKTY 77

Query:    85 KLGTNEFSDLTNEEFRASYTGY---NXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKG 141
              L  NEFSDLT+EEF+A YTG                   +F+Y+NV +   S+DW ++G
Sbjct:    78 TLDVNEFSDLTDEEFKARYTGLVVPEGMTRISTTDSHETVSFRYENVGETGESMDWIQEG 137

Query:   142 AVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKA 201
             AVT +K+Q  CG CWAFSAVAAVEG+T+I  G+L+ LSEQQL+DCST+NNGC GG+M KA
Sbjct:   138 AVTSVKHQQQCGCCWAFSAVAAVEGMTKIANGELVSLSEQQLLDCSTENNGCGGGIMWKA 197

Query:   202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
             F+YI EN+G+ TE +YPYQ  Q TC+      AAATI  YE +P+ DE ALL+AV++QPV
Sbjct:   198 FDYIKENQGITTEDNYPYQGAQQTCESNH--LAAATISGYETVPQNDEEALLKAVSQQPV 255

Query:   262 SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGE 321
             SV +E SG  F  Y  G+ N ECG    H V +VG+G +EE  G KYWL+KNSWGE+WGE
Sbjct:   256 SVAIEGSGYEFIHYSGGIFNGECGTQLTHAVTIVGYGVSEE--GIKYWLLKNSWGESWGE 313

Query:   322 SGYIRILRD----EGLCGIATEASYPVA 345
             +GY+RI+RD    +G+CG+A+ A YPVA
Sbjct:   314 NGYMRIMRDVDSPQGMCGLASLAYYPVA 341
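
The run of `X` characters in the Query rows above most likely reflects the "BLAST filter: on" setting, which replaces low-complexity query residues before the search. As a hedged illustration, the sketch below removes the gap characters from one aligned Query row and compares it with the original query to recover the masked positions. The function name `masked_positions` is hypothetical, and the row is rebuilt from the Query line starting at position 85 in the record above.

```python
QUERY = (
    "MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA"
    "MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP"
    "STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE"
    "QQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK"
    "YEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTA"
    "EEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEASYPVAM"
)


def masked_positions(query, aligned_row, start):
    """Return the 1-based query positions displayed as 'X' in an aligned row.

    aligned_row is the Query line of an alignment block (gaps shown as '-'),
    and start is the query coordinate printed at the left of that line.
    """
    residues = aligned_row.replace("-", "")  # gaps consume no query residues
    masked = []
    for offset, shown in enumerate(residues):
        position = start + offset
        original = query[position - 1]
        if shown == "X" and original != "X":
            masked.append(position)
        elif shown != "X" and shown != original:
            raise ValueError(f"row disagrees with query at position {position}")
    return masked


# Query row covering positions 85..141; the 14 masked residues are written
# as "X" * 14 to keep the count explicit.
row = "KLGTNEFSDLTNEEFRASYTGY" + "---" + "N" + "X" * 14 + "TFKYQNVTDVPTSIDWREKG"
positions = masked_positions(QUERY, row, 85)
masked_residues = "".join(QUERY[p - 1] for p in positions)
```

For this row the masked stretch is the serine- and proline-rich segment between the propeptide-like region and the start of the peptidase domain alignment.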


>TAIR|locus:2152445 [details] [associations]
            symbol:SAG12 "senescence-associated gene 12" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
            [GO:0007568 "aging" evidence=IEP;TAS] [GO:0010150 "leaf senescence"
            evidence=IEP;TAS] [GO:0010282 "senescence-associated vacuole"
            evidence=IDA] [GO:0009817 "defense response to fungus, incompatible
            interaction" evidence=IEP] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:CP002688 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0010150 GO:GO:0009817 EMBL:AB016870
            HSSP:O65039 OMA:NDEQALM EMBL:AF370131 EMBL:AY040073 IPI:IPI00544181
            RefSeq:NP_568651.1 UniGene:At.75256 UniGene:At.7710
            ProteinModelPortal:Q9FJ47 SMR:Q9FJ47 IntAct:Q9FJ47 STRING:Q9FJ47
            MEROPS:C01.117 PRIDE:Q9FJ47 ProMEX:Q9FJ47 EnsemblPlants:AT5G45890.1
            GeneID:834629 KEGG:ath:AT5G45890 TAIR:At5g45890 InParanoid:Q9FJ47
            PhylomeDB:Q9FJ47 ProtClustDB:CLSN2917735 ArrayExpress:Q9FJ47
            Genevestigator:Q9FJ47 GO:GO:0010282 Uniprot:Q9FJ47
        Length = 346

 Score = 881 (315.2 bits), Expect = 3.2e-88, P = 3.2e-88
 Identities = 171/329 (51%), Positives = 216/329 (65%)

Query:    23 CASQVVSGRSMHEPSIVEK-HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK-EG 80
             C S  +S R +    I++K H +WM +HGR Y D  E+  R  +FK N+E IE  N    
Sbjct:    19 CFSITLS-RPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPA 77

Query:    81 NRTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTD--VPTSIDWR 138
              RT+KL  N+F+DLTN+EFR+ YTG+                F+YQNV+   +P S+DWR
Sbjct:    78 GRTFKLAVNQFADLTNDEFRSMYTGFKGVSALSSQSQTKMSPFRYQNVSSGALPVSVDWR 137

Query:   139 EKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLM 198
             +KGAVT IKNQG CG CWAFSAVAA+EG TQI  GKLI LSEQQLVDC T++ GC GGLM
Sbjct:   138 KKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTNDFGCEGGLM 197

Query:   199 DKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK 258
             D AFE+I    GL TE++YPY+ E  TC+ +K    A +I  YED+P  DE AL++AV  
Sbjct:   198 DTAFEHIKATGGLTTESNYPYKGEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAH 257

Query:   259 QPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGET 318
             QPVSV +E  G  F+FY  GV   EC    DH V  +G+G  E  +G+KYW+IKNSWG  
Sbjct:   258 QPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYG--ESTNGSKYWIIKNSWGTK 315

Query:   319 WGESGYIRILRD----EGLCGIATEASYP 343
             WGESGY+RI +D    +GLCG+A +ASYP
Sbjct:   316 WGESGYMRIQKDVKDKQGLCGLAMKASYP 344
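
Each record header above embeds Gene Ontology annotations in the form `[GO:id "term name" evidence=CODES]`, wrapped across several lines. A minimal sketch for pulling them out follows. It assumes the bracketed layout seen in this report, and the function name `go_annotations` is illustrative.

```python
import re

# One bracketed annotation: GO identifier, quoted term name, evidence codes
# separated by semicolons.
GO_TERM = re.compile(r'\[(GO:\d{7}) "([^"]+)" evidence=([A-Z;]+)\]')


def go_annotations(header_text):
    """Return (go_id, term_name, [evidence codes]) tuples from a record header."""
    # Collapse the hard line wraps so that names split across lines match.
    flat = " ".join(header_text.split())
    return [
        (go_id, name, codes.split(";"))
        for go_id, name, codes in GO_TERM.findall(flat)
    ]


header = '''
            symbol:SAG12 "senescence-associated gene 12" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
'''
annotations = go_annotations(header)
```

Splitting the evidence field keeps codes such as IEA (inferred from electronic annotation) separate from codes such as IDA (inferred from direct assay), which is useful when only experimentally supported annotations are wanted.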


>TAIR|locus:2029924 [details] [associations]
            symbol:AT1G29090 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002684 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            HOGENOM:HOG000230773 HSSP:P53634 ProtClustDB:CLSN2688064
            EMBL:BT004146 IPI:IPI00545702 RefSeq:NP_564321.2 UniGene:At.40814
            ProteinModelPortal:Q84W75 SMR:Q84W75 MEROPS:C01.A15
            EnsemblPlants:AT1G29090.1 GeneID:839784 KEGG:ath:AT1G29090
            TAIR:At1g29090 InParanoid:Q84W75 OMA:SIRGHED PhylomeDB:Q84W75
            ArrayExpress:Q84W75 Genevestigator:Q84W75 Uniprot:Q84W75
        Length = 355

 Score = 872 (312.0 bits), Expect = 2.9e-87, P = 2.9e-87
 Identities = 172/330 (52%), Positives = 222/330 (67%)

Query:    25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTY 84
             SQ  S  + HEP + E H+QWM +  R Y DELEK MR  +FK+NL++IEK NK+G+RTY
Sbjct:    30 SQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTY 89

Query:    85 KLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQ-NVTDVP--TSIDWREKG 141
             KLG NEF+D T EEF A++TG                   +  NV+DV    + DWR +G
Sbjct:    90 KLGVNEFADWTREEFIATHTGLKGVNGIPSSEFVDEMIPSWNWNVSDVAGRETKDWRYEG 149

Query:   142 AVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDK 200
             AVT +K QG CG CWAFS+VAAVEG+T+I G  L+ LSEQQL+DC  + +NGC+GG+M  
Sbjct:   150 AVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMSD 209

Query:   201 AFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQP 260
             AF YII+N+G+A+EA YPYQ  +GTC +   K +A   G ++ +P  +E ALL+AV+KQP
Sbjct:   210 AFSYIIKNRGIASEASYPYQAAEGTC-RYNGKPSAWIRG-FQTVPSNNERALLEAVSKQP 267

Query:   261 VSVCVEASGQAFRFYKRGVLNAE-CGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETW 319
             VSV ++A G  F  Y  GV +   CG N +H V  VG+GT+ E  G KYWL KNSWGETW
Sbjct:   268 VSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTSPE--GIKYWLAKNSWGETW 325

Query:   320 GESGYIRILRD----EGLCGIATEASYPVA 345
             GE+GYIRI RD    +G+CG+A  A YPVA
Sbjct:   326 GENGYIRIRRDVAWPQGMCGVAQYAFYPVA 355


>TAIR|locus:2055440 [details] [associations]
            symbol:AT2G34080 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002685 GenomeReviews:CT485783_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 MEROPS:I29.003 EMBL:AC002341
            HOGENOM:HOG000230773 HSSP:P53634 IPI:IPI00530325 PIR:B84752
            RefSeq:NP_565780.1 UniGene:At.28613 UniGene:At.37859
            ProteinModelPortal:O22961 SMR:O22961 EnsemblPlants:AT2G34080.1
            GeneID:817969 KEGG:ath:AT2G34080 TAIR:At2g34080 InParanoid:O22961
            OMA:SENDYSY PhylomeDB:O22961 ProtClustDB:CLSN2688064
            ArrayExpress:O22961 Genevestigator:O22961 Uniprot:O22961
        Length = 345

 Score = 861 (308.1 bits), Expect = 4.3e-86, P = 4.3e-86
 Identities = 167/319 (52%), Positives = 216/319 (67%)

Query:    35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
             E S+V+KHEQWMA+  R Y+DELEK MR  +FK+NL++IE  NK+GN++YKLG NEF+D 
Sbjct:    32 EQSMVDKHEQWMARFSREYRDELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADW 91

Query:    95 TNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQ--NVTD-VPTSIDWREKGAVTHIKNQGH 151
             TNEEF A +TG                T   Q  NV+D V  S DWR +GAVT +K QG 
Sbjct:    92 TNEEFLAIHTGLKGLTEVSPSKVVAK-TISSQTWNVSDMVVESKDWRAEGAVTPVKYQGQ 150

Query:   152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKG 210
             CG CWAFSAVAAVEG+ +I GG L+ LSEQQL+DC  + + GC GG+M  AF Y+++N+G
Sbjct:   151 CGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCDREYDRGCDGGIMSDAFNYVVQNRG 210

Query:   211 LATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQ 270
             +A+E DY YQ   G C  +     AA I  ++ +P  +E ALL+AV++QPVSV ++A+G 
Sbjct:   211 IASENDYSYQGSDGGC--RSNARPAARISGFQTVPSNNERALLEAVSRQPVSVSMDATGD 268

Query:   271 AFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
              F  Y  GV +  CG + +H V  VG+GT++  DG KYWL KNSWGETWGE GYIRI RD
Sbjct:   269 GFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQ--DGTKYWLAKNSWGETWGEKGYIRIRRD 326

Query:   331 ----EGLCGIATEASYPVA 345
                 +G+CG+A  A YPVA
Sbjct:   327 VAWPQGMCGVAQYAFYPVA 345


>TAIR|locus:2122113 [details] [associations]
            symbol:XCP1 "xylem cysteine peptidase 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0000325 "plant-type vacuole" evidence=IDA] [GO:0005634
            "nucleus" evidence=IDA] [GO:0010623 "developmental programmed cell
            death" evidence=IMP] [GO:0010413 "glucuronoxylan metabolic process"
            evidence=RCA] [GO:0045492 "xylan biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005886
            GO:GO:0005634 EMBL:CP002687 GenomeReviews:CT486007_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0000325
            EMBL:AL022604 EMBL:AL161587 GO:GO:0010623 MEROPS:I29.003
            HOGENOM:HOG000230773 EMBL:AF191027 EMBL:AK117394 EMBL:BT005179
            IPI:IPI00532220 PIR:T06122 RefSeq:NP_567983.1 UniGene:At.2280
            UniGene:At.67622 ProteinModelPortal:O65493 SMR:O65493 STRING:O65493
            PaxDb:O65493 PRIDE:O65493 EnsemblPlants:AT4G35350.1 GeneID:829688
            KEGG:ath:AT4G35350 GeneFarm:5033 TAIR:At4g35350 InParanoid:O65493
            KO:K16290 OMA:FEVFREN PhylomeDB:O65493 ProtClustDB:CLSN2689772
            Genevestigator:O65493 Uniprot:O65493
        Length = 355

 Score = 848 (303.6 bits), Expect = 1.0e-84, P = 1.0e-84
 Identities = 160/311 (51%), Positives = 213/311 (68%)

Query:    38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
             ++E  E WM++H + YK   EK  R  +F++NL +I++ N E N +Y LG NEF+DLT+E
Sbjct:    47 LLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFADLTHE 105

Query:    98 EFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
             EF+  Y G                 F+Y+++TD+P S+DWR+KGAV  +K+QG CGSCWA
Sbjct:   106 EFKGRYLGL--AKPQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWA 163

Query:   158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
             FS VAAVEGI QIT G L  LSEQ+L+DC T  N+GC+GGLMD AF+YII   GL  E D
Sbjct:   164 FSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLHKEDD 223

Query:   217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
             YPY  E+G C +QKE     TI  YED+P+ D+ +L++A+  QPVSV +EASG+ F+FYK
Sbjct:   224 YPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYK 283

Query:   277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
              GV N +CG + DHGVA VG+G+++   G+ Y ++KNSWG  WGE G+IR+ R+    EG
Sbjct:   284 GGVFNGKCGTDLDHGVAAVGYGSSK---GSDYVIVKNSWGPRWGEKGFIRMKRNTGKPEG 340

Query:   333 LCGIATEASYP 343
             LCGI   ASYP
Sbjct:   341 LCGINKMASYP 351


>TAIR|locus:2157712 [details] [associations]
            symbol:CEP1 "cysteine endopeptidase 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005783 EMBL:CP002688
            GenomeReviews:BA000015_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 PROSITE:PS00014
            EMBL:AB024031 MEROPS:I29.003 EMBL:HM367092 EMBL:AY091087
            IPI:IPI00516991 RefSeq:NP_568722.1 UniGene:At.7918 HSSP:O65039
            ProteinModelPortal:Q9FGR9 SMR:Q9FGR9 PaxDb:Q9FGR9 PRIDE:Q9FGR9
            EnsemblPlants:AT5G50260.1 GeneID:835091 KEGG:ath:AT5G50260
            TAIR:At5g50260 HOGENOM:HOG000230773 InParanoid:Q9FGR9 KO:K16292
            OMA:WHSKKYH PhylomeDB:Q9FGR9 ProtClustDB:CLSN2689970
            Genevestigator:Q9FGR9 Uniprot:Q9FGR9
        Length = 361

 Score = 837 (299.7 bits), Expect = 1.5e-83, P = 1.5e-83
 Identities = 164/316 (51%), Positives = 206/316 (65%)

Query:    35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
             E S+ E +E+W + H      E EKA R  +FK N+++I + NK+ +++YKL  N+F D+
Sbjct:    31 ENSLWELYERWRSHHTVARSLE-EKAKRFNVFKHNVKHIHETNKK-DKSYKLKLNKFGDM 88

Query:    95 TNEEFRASYTGYNXXXXXXXXXXXXXX-TFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
             T+EEFR +Y G N               +F Y NV  +PTS+DWR+ GAVT +KNQG CG
Sbjct:    89 TSEEFRRTYAGSNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVKNQGQCG 148

Query:   154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIENKGLA 212
             SCWAFS V AVEGI QI   KL  LSEQ+LVDC T+ N GC+GGLMD AFE+I E  GL 
Sbjct:   149 SCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCDTNQNQGCNGGLMDLAFEFIKEKGGLT 208

Query:   213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
             +E  YPY+    TCD  KE A   +I  +ED+PK  E  L++AV  QPVSV ++A G  F
Sbjct:   209 SELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDF 268

Query:   273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-- 330
             +FY  GV    CG   +HGVAVVG+GT    DG KYW++KNSWGE WGE GYIR+ R   
Sbjct:   269 QFYSEGVFTGRCGTELNHGVAVVGYGTTI--DGTKYWIVKNSWGEEWGEKGYIRMQRGIR 326

Query:   331 --EGLCGIATEASYPV 344
               EGLCGIA EASYP+
Sbjct:   327 HKEGLCGIAMEASYPL 342


>TAIR|locus:2029934 [details] [associations]
            symbol:AT1G29080 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002684 GenomeReviews:CT485782_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AC021043 MEROPS:I29.003 HOGENOM:HOG000230773
            HSSP:P53634 ProtClustDB:CLSN2688064 EMBL:DQ056468 IPI:IPI00521747
            PIR:C86413 RefSeq:NP_564320.1 UniGene:At.51814
            ProteinModelPortal:Q9LP39 SMR:Q9LP39 EnsemblPlants:AT1G29080.1
            GeneID:839783 KEGG:ath:AT1G29080 TAIR:At1g29080 InParanoid:Q9LP39
            OMA:KTWGENG PhylomeDB:Q9LP39 Genevestigator:Q9LP39 Uniprot:Q9LP39
        Length = 346

 Score = 827 (296.2 bits), Expect = 1.7e-82, P = 1.7e-82
 Identities = 163/330 (49%), Positives = 221/330 (66%)

Query:    25 SQVVSGRSMHEPS-IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRT 83
             S+  S  ++++PS IV+ H+QWM Q  R Y DE EK +RL +  +NL++IE  N  GN++
Sbjct:    21 SEATSRVALYKPSSIVDYHQQWMIQFSRVYDDEFEKQLRLQVLTENLKFIESFNNMGNQS 80

Query:    84 YKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQ-NVTDV-PTSIDWREKG 141
             YKLG NEF+D T EEF A+YTG                   +   V+DV  T+ DWR +G
Sbjct:    81 YKLGVNEFTDWTKEEFLATYTGLRGVNVTSPFEVVNETKPAWNWTVSDVLGTNKDWRNEG 140

Query:   142 AVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDK 200
             AVT +K+QG CG CWAFSA+AAVEG+T+I  G LI LSEQQL+DC+ + NNGC GG    
Sbjct:   141 AVTPVKSQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCTREQNNGCKGGTFVN 200

Query:   201 AFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQP 260
             AF YII+++G+++E +YPYQ ++G C  +     A  I  +E++P  +E ALL+AV++QP
Sbjct:   201 AFNYIIKHRGISSENEYPYQVKEGPC--RSNARPAILIRGFENVPSNNERALLEAVSRQP 258

Query:   261 VSVCVEASGQAFRFYKRGVLNAE-CGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETW 319
             V+V ++AS   F  Y  GV NA  CG + +H V +VG+GT+ E  G KYWL KNSWG+TW
Sbjct:   259 VAVAIDASEAGFVHYSGGVYNARNCGTSVNHAVTLVGYGTSPE--GMKYWLAKNSWGKTW 316

Query:   320 GESGYIRILRD----EGLCGIATEASYPVA 345
             GE+GYIRI RD    +G+CG+A  ASYPVA
Sbjct:   317 GENGYIRIRRDVEWPQGMCGVAQYASYPVA 346


>TAIR|locus:2030427 [details] [associations]
            symbol:XCP2 "xylem cysteine peptidase 2" species:3702
            "Arabidopsis thaliana" [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009507 "chloroplast" evidence=ISM] [GO:0008233 "peptidase
            activity" evidence=ISS] [GO:0005618 "cell wall" evidence=IDA]
            [GO:0010623 "developmental programmed cell death" evidence=IMP]
            [GO:0010075 "regulation of meristem growth" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0005886 GO:GO:0005618 GO:GO:0005773
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AC069251 EMBL:AC007369 GO:GO:0010623
            OMA:YKEIPEG HOGENOM:HOG000230773 KO:K16290 EMBL:AF191028
            EMBL:BT004822 IPI:IPI00526722 PIR:A86341 RefSeq:NP_564126.1
            UniGene:At.21316 ProteinModelPortal:Q9LM66 SMR:Q9LM66 IntAct:Q9LM66
            STRING:Q9LM66 MEROPS:C01.120 PaxDb:Q9LM66 PRIDE:Q9LM66
            ProMEX:Q9LM66 EnsemblPlants:AT1G20850.1 GeneID:838677
            KEGG:ath:AT1G20850 GeneFarm:5034 TAIR:At1g20850 InParanoid:Q9LM66
            PhylomeDB:Q9LM66 ProtClustDB:CLSN2917031 Genevestigator:Q9LM66
            GermOnline:AT1G20850 Uniprot:Q9LM66
        Length = 356

 Score = 827 (296.2 bits), Expect = 1.7e-82, P = 1.7e-82
 Identities = 155/315 (49%), Positives = 214/315 (67%)

Query:    34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
             H+  ++E  E W++   + Y+   EK +R  +FK NL++I++ NK+G ++Y LG NEF+D
Sbjct:    44 HD-KLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFAD 101

Query:    94 LTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
             L++EEF+  Y G                 F Y++V  VP S+DWR+KGAV  +KNQG CG
Sbjct:   102 LSHEEFKKMYLGLKTDIVRRDEERSYAE-FAYRDVEAVPKSVDWRKKGAVAEVKNQGSCG 160

Query:   154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLA 212
             SCWAFS VAAVEGI +I  G L  LSEQ+L+DC T  NNGC+GGLMD AFEYI++N GL 
Sbjct:   161 SCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLR 220

Query:   213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
              E DYPY  E+GTC+ QK+++   TI  ++D+P  DE +LL+A+  QP+SV ++ASG+ F
Sbjct:   221 KEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREF 280

Query:   273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-- 330
             +FY  GV +  CG + DHGVA VG+G+++   G+ Y ++KNSWG  WGE GYIR+ R+  
Sbjct:   281 QFYSGGVFDGRCGVDLDHGVAAVGYGSSK---GSDYIIVKNSWGPKWGEKGYIRLKRNTG 337

Query:   331 --EGLCGIATEASYP 343
               EGLCGI   AS+P
Sbjct:   338 KPEGLCGINKMASFP 352


>TAIR|locus:2825832 [details] [associations]
            symbol:RD21A "responsive to dehydration 21A" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;IMP]
            [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISS;IDA;IMP] [GO:0009507 "chloroplast" evidence=IDA]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0008233 "peptidase
            activity" evidence=IDA] [GO:0048046 "apoplast" evidence=IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005829 "cytosol"
            evidence=RCA] [GO:0009506 "plasmodesma" evidence=IDA] [GO:0050832
            "defense response to fungus" evidence=IMP] [GO:0006096 "glycolysis"
            evidence=RCA] [GO:0006833 "water transport" evidence=RCA]
            [GO:0006972 "hyperosmotic response" evidence=RCA] [GO:0007030
            "Golgi organization" evidence=RCA] [GO:0009266 "response to
            temperature stimulus" evidence=RCA] [GO:0009651 "response to salt
            stress" evidence=RCA] [GO:0015996 "chlorophyll catabolic process"
            evidence=RCA] [GO:0042744 "hydrogen peroxide catabolic process"
            evidence=RCA] [GO:0046686 "response to cadmium ion" evidence=RCA]
            [GO:0009414 "response to water deprivation" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0009506 GO:GO:0009507 GO:GO:0005773
            GO:GO:0050832 GO:GO:0048046 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC083835
            HOGENOM:HOG000230773 KO:K01376 InterPro:IPR000118 Pfam:PF00396
            SMART:SM00277 UniGene:At.43549 EMBL:D13043 EMBL:AY072130
            EMBL:AY133781 IPI:IPI00530094 PIR:JN0719 RefSeq:NP_564497.1
            UniGene:At.47599 UniGene:At.71705 ProteinModelPortal:P43297
            SMR:P43297 IntAct:P43297 STRING:P43297 MEROPS:C01.064 PaxDb:P43297
            PRIDE:P43297 ProMEX:P43297 EnsemblPlants:AT1G47128.1 GeneID:841122
            KEGG:ath:AT1G47128 TAIR:At1g47128 InParanoid:P43297 OMA:EAWLVKH
            PhylomeDB:P43297 ProtClustDB:CLSN2688498 Genevestigator:P43297
            GermOnline:AT1G47128 Uniprot:P43297
        Length = 462

 Score = 801 (287.0 bits), Expect = 9.7e-80, P = 9.7e-80
 Identities = 154/324 (47%), Positives = 210/324 (64%)

Query:    30 GRSMHEPSIVEKHEQWMAQHGRTYKDE--LEKAMRLTIFKQNLEYIEKANKEGNRTYKLG 87
             GRS  E  ++  +E W+ +HG+       +EK  R  IFK NL ++++ N E N +Y+LG
Sbjct:    40 GRS--EAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSYRLG 96

Query:    88 TNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQ-NVTD-VPTSIDWREKGAVTH 145
                F+DLTN+E+R+ Y G                + +Y+  V D +P SIDWR+KGAV  
Sbjct:    97 LTRFADLTNDEYRSKYLG-----AKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAE 151

Query:   146 IKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEY 204
             +K+QG CGSCWAFS + AVEGI QI  G LI LSEQ+LVDC T  N GC+GGLMD AFE+
Sbjct:   152 VKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEF 211

Query:   205 IIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVC 264
             II+N G+ T+ DYPY+   GTCD+ ++ A   TI  YED+P   E +L +AV  QP+S+ 
Sbjct:   212 IIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIA 271

Query:   265 VEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGY 324
             +EA G+AF+ Y  G+ +  CG   DHGV  VG+GT   E+G  YW+++NSWG++WGESGY
Sbjct:   272 IEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYGT---ENGKDYWIVRNSWGKSWGESGY 328

Query:   325 IRILRD----EGLCGIATEASYPV 344
             +R+ R+     G CGIA E SYP+
Sbjct:   329 LRMARNIASSSGKCGIAIEPSYPI 352


>TAIR|locus:2167821 [details] [associations]
            symbol:RD21B "responsive to dehydration 21B" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS] [GO:0005773
            "vacuole" evidence=IDA] [GO:0009651 "response to salt stress"
            evidence=IEP] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005829 "cytosol" evidence=IDA] [GO:0052541 "plant-type cell
            wall cellulose metabolic process" evidence=RCA] [GO:0052546 "cell
            wall pectin metabolic process" evidence=RCA] [GO:0005783
            "endoplasmic reticulum" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005783 GO:GO:0005829 EMBL:CP002688
            GO:GO:0005773 GO:GO:0009651 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AB008267 HSSP:O65039
            HOGENOM:HOG000230773 KO:K01376 InterPro:IPR000118 Pfam:PF00396
            SMART:SM00277 ProtClustDB:CLSN2688498 EMBL:AY062608 EMBL:AY114661
            IPI:IPI00520971 RefSeq:NP_568620.1 UniGene:At.24130 SMR:Q9FMH8
            IntAct:Q9FMH8 STRING:Q9FMH8 MEROPS:C01.A12
            EnsemblPlants:AT5G43060.1 GeneID:834321 KEGG:ath:AT5G43060
            TAIR:At5g43060 InParanoid:Q9FMH8 OMA:ENSEASL Genevestigator:Q9FMH8
            Uniprot:Q9FMH8
        Length = 463

 Score = 796 (285.3 bits), Expect = 3.3e-79, P = 3.3e-79
 Identities = 157/329 (47%), Positives = 211/329 (64%)

Query:    28 VSGRSMHEPSIVEK-HEQWMAQHGRTYKDE----LEKAMRLTIFKQNLEYIEKANKEGNR 82
             ++  +    S VE+ +E WM +HG+   ++     EK  R  IFK NL +I++ N + N 
Sbjct:    35 ITTETSRSDSEVERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTK-NL 93

Query:    83 TYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQ-NVTD-VPTSIDWREK 140
             +YKLG   F+DLTNEE+R+ Y G                  +YQ  V D +P S+DWR++
Sbjct:    94 SYKLGLTRFADLTNEEYRSMYLGAKPTKRVLKTSD------RYQARVGDALPDSVDWRKE 147

Query:   141 GAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMD 199
             GAV  +K+QG CGSCWAFS + AVEGI +I  G LI LSEQ+LVDC T  N GC+GGLMD
Sbjct:   148 GAVADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMD 207

Query:   200 KAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ 259
              AFE+II+N G+ TEADYPY+   G CD+ ++ A   TI  YED+P+  E +L +A+  Q
Sbjct:   208 YAFEFIIKNGGIDTEADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQ 267

Query:   260 PVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETW 319
             P+SV +EA G+AF+ Y  GV +  CG   DHGV  VG+GT   E+G  YW+++NSWG  W
Sbjct:   268 PISVAIEAGGRAFQLYSSGVFDGLCGTELDHGVVAVGYGT---ENGKDYWIVRNSWGNRW 324

Query:   320 GESGYIRILRD----EGLCGIATEASYPV 344
             GESGYI++ R+     G CGIA EASYP+
Sbjct:   325 GESGYIKMARNIEAPTGKCGIAMEASYPI 353


>TAIR|locus:2090614 [details] [associations]
            symbol:AT3G19390 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA;ISS] [GO:0000041 "transition metal ion
            transport" evidence=RCA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:CP002686 GenomeReviews:BA000014_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AB025624
            MEROPS:I29.003 OMA:KAMDQKC HSSP:O65039 HOGENOM:HOG000230773
            InterPro:IPR000118 Pfam:PF00396 SMART:SM00277 EMBL:AY062725
            EMBL:AY093350 IPI:IPI00520189 RefSeq:NP_566633.1 UniGene:At.27473
            ProteinModelPortal:Q9LT78 SMR:Q9LT78 IntAct:Q9LT78 STRING:Q9LT78
            PaxDb:Q9LT78 PRIDE:Q9LT78 EnsemblPlants:AT3G19390.1 GeneID:821473
            KEGG:ath:AT3G19390 TAIR:At3g19390 InParanoid:Q9LT78
            PhylomeDB:Q9LT78 ProtClustDB:CLSN2917188 Genevestigator:Q9LT78
            Uniprot:Q9LT78
        Length = 452

 Score = 762 (273.3 bits), Expect = 1.3e-75, P = 1.3e-75
 Identities = 145/308 (47%), Positives = 195/308 (63%)

Query:    42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
             +E+W+ ++ + Y    EK  R  IFK NL+++E+ +   NRTY++G   F+DLTN+EFRA
Sbjct:    43 YERWLVENRKNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRA 102

Query:   102 SYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAV 161
              Y                   + Y+    +P +IDWR KGAV  +K+QG CGSCWAFSA+
Sbjct:   103 IYL---RSKMERTRVPVKGEKYLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAI 159

Query:   162 AAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPY- 219
              AVEGI QI  G+LI LSEQ+LVDC T  N+GC GGLMD AF++IIEN G+ TE DYPY 
Sbjct:   160 GAVEGINQIKTGELISLSEQELVDCDTSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYI 219

Query:   220 QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGV 279
               +   C+  K+     TI  YED+P+ DE +L +A+  QP+SV +EA G+AF+ Y  GV
Sbjct:   220 ATDVNVCNSDKKNTRVVTIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYTSGV 279

Query:   280 LNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCG 335
                 CG + DHGV  VG+G+   E G  YW+++NSWG  WGESGY ++ R+     G CG
Sbjct:   280 FTGTCGTSLDHGVVAVGYGS---EGGQDYWIVRNSWGSNWGESGYFKLERNIKESSGKCG 336

Query:   336 IATEASYP 343
             +A  ASYP
Sbjct:   337 VAMMASYP 344


>TAIR|locus:505006391 [details] [associations]
            symbol:CEP3 "cysteine endopeptidase 3" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005783 EMBL:CP002686
            GenomeReviews:BA000014_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 PROSITE:PS00014
            EMBL:AL049659 HSSP:O65039 HOGENOM:HOG000230773 KO:K16292
            EMBL:AK119026 IPI:IPI00525150 PIR:T06707 RefSeq:NP_566901.1
            UniGene:At.3162 ProteinModelPortal:Q9STL5 SMR:Q9STL5 MEROPS:C01.A02
            PRIDE:Q9STL5 EnsemblPlants:AT3G48350.1 GeneID:823993
            KEGG:ath:AT3G48350 TAIR:At3g48350 InParanoid:Q9STL5 OMA:DITHHEF
            PhylomeDB:Q9STL5 ProtClustDB:CLSN2917387 Genevestigator:Q9STL5
            Uniprot:Q9STL5
        Length = 364

 Score = 759 (272.2 bits), Expect = 2.7e-75, P = 2.7e-75
 Identities = 148/319 (46%), Positives = 196/319 (61%)

Query:    35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
             E ++ + +E+W   H  +     E   R  +F+ N+ ++ + NK+ N+ YKL  N F+D+
Sbjct:    31 EENVWKLYERWRGHHSVSRASH-EAIKRFNVFRHNVLHVHRTNKK-NKPYKLKINRFADI 88

Query:    95 TNEEFRASYTGYNXXXXXXXXX-XXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
             T+ EFR+SY G N                F Y+NVT VP+S+DWREKGAVT +KNQ  CG
Sbjct:    89 THHEFRSSYAGSNVKHHRMLRGPKRGSGGFMYENVTRVPSSVDWREKGAVTEVKNQQDCG 148

Query:   154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIENKGLA 212
             SCWAFS VAAVEGI +I   KL+ LSEQ+LVDC T+ N GC+GGLM+ AFE+I  N G+ 
Sbjct:   149 SCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQGCAGGLMEPAFEFIKNNGGIK 208

Query:   213 TEADYPYQQEQGT-CDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQA 271
             TE  YPY       C          TI  +E +P+ DE  LL+AV  QPVSV ++A    
Sbjct:   209 TEETYPYDSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAVAHQPVSVAIDAGSSD 268

Query:   272 FRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR-- 329
             F+ Y  GV   ECG   +HGV +VG+G  E ++G KYW+++NSWG  WGE GY+RI R  
Sbjct:   269 FQLYSEGVFIGECGTQLNHGVVIVGYG--ETKNGTKYWIVRNSWGPEWGEGGYVRIERGI 326

Query:   330 --DEGLCGIATEASYPVAM 346
               +EG CGIA EASYP  +
Sbjct:   327 SENEGRCGIAMEASYPTKL 345


>TAIR|locus:2090629 [details] [associations]
            symbol:AT3G19400 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0019344 "cysteine biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            EMBL:CP002686 GenomeReviews:BA000014_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AB025624
            MEROPS:I29.003 HOGENOM:HOG000230773 EMBL:AK118509 IPI:IPI00543468
            RefSeq:NP_566634.2 UniGene:At.38409 ProteinModelPortal:Q9LT77
            SMR:Q9LT77 PaxDb:Q9LT77 PRIDE:Q9LT77 EnsemblPlants:AT3G19400.1
            GeneID:821474 KEGG:ath:AT3G19400 TAIR:At3g19400 InParanoid:Q9LT77
            OMA:IGEHERR ProtClustDB:CLSN2679975 Genevestigator:Q9LT77
            Uniprot:Q9LT77
        Length = 362

 Score = 752 (269.8 bits), Expect = 1.5e-74, P = 1.5e-74
 Identities = 148/318 (46%), Positives = 200/318 (62%)

Query:    34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
             +E  +   +EQW+ ++ + Y    EK  R  IFK NL+++++ N   +RT+++G   F+D
Sbjct:    36 NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95

Query:    94 LTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
             LTNEEFRA Y                   + Y+    +P  +DWR  GAV  +K+QG+CG
Sbjct:    96 LTNEEFRAIYL---RKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCG 152

Query:   154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGL 211
             SCWAFSAV AVEGI QIT G+LI LSEQ+LVDC     N GC GG+M+ AFE+I++N G+
Sbjct:   153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212

Query:   212 ATEADYPYQ-QEQGTCDKQKEK-AAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASG 269
              T+ DYPY   + G C+  K       TI  YED+P+ DE +L +AV  QPVSV +EAS 
Sbjct:   213 ETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASS 272

Query:   270 QAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR 329
             QAF+ YK GV+   CG + DHGV VVG+G+   ED   YW+I+NSWG  WG+SGY+++ R
Sbjct:   273 QAFQLYKSGVMTGTCGISLDHGVVVVGYGSTSGED---YWIIRNSWGLNWGDSGYVKLQR 329

Query:   330 --DE--GLCGIATEASYP 343
               D+  G CGIA   SYP
Sbjct:   330 NIDDPFGKCGIAMMPSYP 347


>TAIR|locus:2128243 [details] [associations]
            symbol:AT4G11310 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005618 "cell wall"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 GO:GO:0005618 EMBL:CP002687
            GenomeReviews:CT486007_GR EMBL:AL096882 EMBL:AL161531
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            HOGENOM:HOG000230773 KO:K01376 EMBL:AY093066 EMBL:BT000099
            IPI:IPI00520496 PIR:T13022 RefSeq:NP_567376.1 UniGene:At.43189
            ProteinModelPortal:Q9SUT0 SMR:Q9SUT0 IntAct:Q9SUT0 STRING:Q9SUT0
            MEROPS:C01.A20 PaxDb:Q9SUT0 PRIDE:Q9SUT0 EnsemblPlants:AT4G11310.1
            GeneID:826733 KEGG:ath:AT4G11310 TAIR:At4g11310 InParanoid:Q9SUT0
            OMA:EVCHGAD PhylomeDB:Q9SUT0 ProtClustDB:CLSN2689395
            Genevestigator:Q9SUT0 GermOnline:AT4G11310 Uniprot:Q9SUT0
        Length = 364

 Score = 742 (266.3 bits), Expect = 1.7e-73, P = 1.7e-73
 Identities = 144/316 (45%), Positives = 201/316 (63%)

Query:    35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
             E S++   E WM +HG+ Y    EK  RLTIF+ NL +I   N E N +Y+LG   F+DL
Sbjct:    44 EASLI--FESWMVKHGKVYGSVAEKERRLTIFEDNLRFINNRNAE-NLSYRLGLTGFADL 100

Query:    95 TNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDV-PTSIDWREKGAVTHIKNQGHCG 153
             +  E++    G +               +K  +  DV P S+DWR +GAVT +K+QGHC 
Sbjct:   101 SLHEYKEVCHGADPRPPRNHVFMTSSDRYK-TSADDVLPKSVDWRNEGAVTEVKDQGHCR 159

Query:   154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLAT 213
             SCWAFS V AVEG+ +I  G+L+ LSEQ L++C+ +NNGC GG ++ A+E+I++N GL T
Sbjct:   160 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKLETAYEFIMKNGGLGT 219

Query:   214 EADYPYQQEQGTCD-KQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
             + DYPY+   G CD + KE      I  YE+LP  DE AL++AV  QPV+  +++S + F
Sbjct:   220 DNDYPYKAVNGVCDGRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREF 279

Query:   273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-- 330
             + Y+ GV +  CG N +HGV VVG+GT   E+G  YWL+KNS G TWGE+GY+++ R+  
Sbjct:   280 QLYESGVFDGSCGTNLNHGVVVVGYGT---ENGRDYWLVKNSRGITWGEAGYMKMARNIA 336

Query:   331 --EGLCGIATEASYPV 344
                GLCGIA  ASYP+
Sbjct:   337 NPRGLCGIAMRASYPL 352


>TAIR|locus:2024362 [details] [associations]
            symbol:XBCP3 "xylem bark cysteine peptidase 3"
            species:3702 "Arabidopsis thaliana" [GO:0005576 "extracellular
            region" evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0005783 "endoplasmic
            reticulum" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005783 EMBL:CP002684 GO:GO:0005773 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 MEROPS:I29.003
            HOGENOM:HOG000230773 InterPro:IPR000118 Pfam:PF00396 SMART:SM00277
            UniGene:At.10233 OMA:CEIESAV EMBL:BT026490 EMBL:AK226753
            IPI:IPI00536687 RefSeq:NP_563855.1 ProteinModelPortal:Q0WVJ5
            SMR:Q0WVJ5 PRIDE:Q0WVJ5 EnsemblPlants:AT1G09850.1 GeneID:837517
            KEGG:ath:AT1G09850 TAIR:At1g09850 InParanoid:Q0WVJ5
            PhylomeDB:Q0WVJ5 ProtClustDB:CLSN2687747 Genevestigator:Q0WVJ5
            Uniprot:Q0WVJ5
        Length = 437

 Score = 740 (265.6 bits), Expect = 2.8e-73, P = 2.8e-73
 Identities = 142/323 (43%), Positives = 198/323 (61%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKL 86
             +VS  S  +  I E  + W  +HG+TY  E E+  R+ IFK N +++ + N   N TY L
Sbjct:    18 LVSSSSSSD-DISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSL 76

Query:    87 GTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHI 146
               N F+DLT+ EF+AS  G +                       VP S+DWR+KGAVT++
Sbjct:    77 SLNAFADLTHHEFKASRLGLSVSAPSVIMASKGQSL---GGSVKVPDSVDWRKKGAVTNV 133

Query:   147 KNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYI 205
             K+QG CG+CW+FSA  A+EGI QI  G LI LSEQ+L+DC    N GC+GGLMD AFE++
Sbjct:   134 KDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFV 193

Query:   206 IENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCV 265
             I+N G+ TE DYPYQ+  GTC K K K    TI  Y  +   DE AL++AV  QPVSV +
Sbjct:   194 IKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGI 253

Query:   266 EASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYI 325
               S +AF+ Y  G+ +  C  + DH V +VG+G+   ++G  YW++KNSWG++WG  G++
Sbjct:   254 CGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGS---QNGVDYWIVKNSWGKSWGMDGFM 310

Query:   326 RILRD----EGLCGIATEASYPV 344
              + R+    +G+CGI   ASYP+
Sbjct:   311 HMQRNTENSDGVCGINMLASYPI 333


>TAIR|locus:2117979 [details] [associations]
            symbol:AT4G23520 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002687 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            KO:K01376 IPI:IPI00527171 RefSeq:NP_567686.2 UniGene:At.32421
            ProteinModelPortal:F4JNL3 SMR:F4JNL3 MEROPS:C01.A22 PRIDE:F4JNL3
            EnsemblPlants:AT4G23520.1 GeneID:828452 KEGG:ath:AT4G23520
            OMA:PANDEIS ArrayExpress:F4JNL3 Uniprot:F4JNL3
        Length = 356

 Score = 736 (264.1 bits), Expect = 7.5e-73, P = 7.5e-73
 Identities = 148/321 (46%), Positives = 201/321 (62%)

Query:    31 RSMHEPSIVEKHEQWMAQHGRTYKDEL-EKAMRLTIFKQNLEYIEKANKEGNRTYKLGTN 89
             RS  E   +   + WM++HG+TY + L EK  R   FK NL +I++ N + N +Y+LG  
Sbjct:    38 RSNEEVEFI--FQMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHNAK-NLSYQLGLT 94

Query:    90 EFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQ 149
              F+DLT +E+R  + G                      +   P S+DWR++GAV+ IK+Q
Sbjct:    95 RFADLTVQEYRDLFPGSPKPKQRNLKTSRRYVPLAGDQL---PESVDWRQEGAVSEIKDQ 151

Query:   150 GHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSG-GLMDKAFEYIIEN 208
             G C SCWAFS VAAVEG+ +I  G+LI LSEQ+LVDC+  NNGC G GLMD AF+++I N
Sbjct:   152 GTCNSCWAFSTVAAVEGLNKIVTGELISLSEQELVDCNLVNNGCYGSGLMDTAFQFLINN 211

Query:   209 KGLATEADYPYQQEQGTCD-KQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEA 267
              GL +E DYPYQ  QG+C+ KQ       TI  YED+P  DE +L +AV  QPVSV V+ 
Sbjct:   212 NGLDSEKDYPYQGTQGSCNRKQSTSNKVITIDSYEDVPANDEISLQKAVAHQPVSVGVDK 271

Query:   268 SGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI 327
               Q F  Y+  + N  CG N DH + +VG+G+   E+G  YW+++NSWG TWG++GYI+I
Sbjct:   272 KSQEFMLYRSCIYNGPCGTNLDHALVIVGYGS---ENGQDYWIVRNSWGTTWGDAGYIKI 328

Query:   328 LRD----EGLCGIATEASYPV 344
              R+    +GLCGIA  ASYP+
Sbjct:   329 ARNFEDPKGLCGIAMLASYPI 349


>TAIR|locus:2128253 [details] [associations]
            symbol:AT4G11320 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002687 GenomeReviews:CT486007_GR EMBL:AL096882 EMBL:AL161531
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 OMA:ICHGADP
            HOGENOM:HOG000230773 KO:K01376 ProtClustDB:CLSN2689395
            EMBL:AY035055 EMBL:AY051062 IPI:IPI00520480 PIR:T13023
            RefSeq:NP_567377.1 UniGene:At.25206 ProteinModelPortal:Q9SUS9
            SMR:Q9SUS9 STRING:Q9SUS9 MEROPS:C01.A21 PaxDb:Q9SUS9 PRIDE:Q9SUS9
            EnsemblPlants:AT4G11320.1 GeneID:826734 KEGG:ath:AT4G11320
            TAIR:At4g11320 InParanoid:Q9SUS9 PhylomeDB:Q9SUS9
            Genevestigator:Q9SUS9 GermOnline:AT4G11320 Uniprot:Q9SUS9
        Length = 371

 Score = 730 (262.0 bits), Expect = 3.2e-72, P = 3.2e-72
 Identities = 139/307 (45%), Positives = 195/307 (63%)

Query:    43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
             E WM +HG+ Y    EK  RLTIF+ NL +I   N E N +Y+LG N F+DL+  E+   
Sbjct:    57 ESWMVKHGKVYDSVAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYGEI 115

Query:   103 YTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVA 162
               G +               +K  +   +P S+DWR +GAVT +K+QG C SCWAFS V 
Sbjct:   116 CHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVG 175

Query:   163 AVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQE 222
             AVEG+ +I  G+L+ LSEQ L++C+ +NNGC GG ++ A+E+I+ N GL T+ DYPY+  
Sbjct:   176 AVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIMNNGGLGTDNDYPYKAL 235

Query:   223 QGTCD-KQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLN 281
              G C+ + KE      I  YE+LP  DE AL++AV  QPV+  V++S + F+ Y+ GV +
Sbjct:   236 NGVCEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFD 295

Query:   282 AECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIA 337
               CG N +HGV VVG+GT   E+G  YW++KNS G+TWGE+GY+++ R+     GLCGIA
Sbjct:   296 GTCGTNLNHGVVVVGYGT---ENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGLCGIA 352

Query:   338 TEASYPV 344
               ASYP+
Sbjct:   353 MRASYPL 359


>FB|FBgn0013770
            symbol:Cp1 "Cysteine proteinase-1" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS;NAS] [GO:0030163 "protein catabolic process"
            evidence=NAS] [GO:0005764 "lysosome" evidence=NAS] [GO:0048102
            "autophagic cell death" evidence=IEP] [GO:0035071 "salivary gland
            cell autophagic cell death" evidence=IEP] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0045169 "fusome" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:AE013599 GO:GO:0007586 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0035071 GO:GO:0045169 GeneTree:ENSGT00660000095458 KO:K01365
            EMBL:U75652 EMBL:AF012089 EMBL:BT016071 EMBL:D31970
            RefSeq:NP_523735.2 RefSeq:NP_725347.1 RefSeq:NP_725348.1
            UniGene:Dm.7400 ProteinModelPortal:Q95029 SMR:Q95029 IntAct:Q95029
            MINT:MINT-814156 STRING:Q95029 MEROPS:C01.092 PaxDb:Q95029
            EnsemblMetazoa:FBtr0087593 GeneID:36546 KEGG:dme:Dmel_CG6692
            CTD:36546 FlyBase:FBgn0013770 InParanoid:Q95029 OMA:ICHGADP
            OrthoDB:EOG46M91C PhylomeDB:Q95029 GenomeRNAi:36546 NextBio:799136
            Bgee:Q95029 GermOnline:CG6692 Uniprot:Q95029
        Length = 371

 Score = 713 (256.0 bits), Expect = 2.1e-70, P = 2.1e-70
 Identities = 146/319 (45%), Positives = 196/319 (61%)

Query:    38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
             ++E+   +  +H + Y+DE E+  RL IF +N   I K N+   EG  ++KL  N+++DL
Sbjct:    55 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 114

Query:    95 TNEEFRASYTGYNXXXXXXXXXXXXXX---TFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
              + EFR    G+N                 TF       +P S+DWR KGAVT +K+QGH
Sbjct:   115 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 174

Query:   152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENK 209
             CGSCWAFS+  A+EG      G L+ LSEQ LVDCST   NNGC+GGLMD AF YI +N 
Sbjct:   175 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 234

Query:   210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEAS 268
             G+ TE  YPY+    +C   K    A   G + D+P+GDE  + +AV T  PVSV ++AS
Sbjct:   235 GIDTEKSYPYEAIDDSCHFNKGTVGATDRG-FTDIPQGDEKKMAEAVATVGPVSVAIDAS 293

Query:   269 GQAFRFYKRGVLNA-EC-GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
              ++F+FY  GV N  +C   N DHGV VVGFGT  +E G  YWL+KNSWG TWG+ G+I+
Sbjct:   294 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGT--DESGEDYWLVKNSWGTTWGDKGFIK 351

Query:   327 ILRD-EGLCGIATEASYPV 344
             +LR+ E  CGIA+ +SYP+
Sbjct:   352 MLRNKENQCGIASASSYPL 370


>TAIR|locus:2038515
            symbol:AT1G06260 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0048046 "apoplast"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0048046 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC025290
            MEROPS:I29.003 HSSP:O65039 HOGENOM:HOG000230773 OMA:METAFEF
            IPI:IPI00525965 PIR:D86198 RefSeq:NP_563764.1 UniGene:At.24617
            ProteinModelPortal:Q9LNC1 SMR:Q9LNC1 PaxDb:Q9LNC1 PRIDE:Q9LNC1
            EnsemblPlants:AT1G06260.1 GeneID:837137 KEGG:ath:AT1G06260
            TAIR:At1g06260 InParanoid:Q9LNC1 PhylomeDB:Q9LNC1
            ProtClustDB:CLSN2916975 Genevestigator:Q9LNC1 Uniprot:Q9LNC1
        Length = 343

 Score = 710 (255.0 bits), Expect = 4.3e-70, P = 4.3e-70
 Identities = 148/330 (44%), Positives = 201/330 (60%)

Query:    24 ASQVVS-GRSMHEP--SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEG 80
             AS++ S   S+++P  ++ ++ E+W+  H + Y    E  +R  I++ N++ I+  N   
Sbjct:    22 ASKLCSVDSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSL- 80

Query:    81 NRTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREK 140
             +  +KL  N F+D+TN EF+A + G N                      +VP ++DWR +
Sbjct:    81 HLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHKKQRPVC----DPAGNVPDAVDWRTQ 136

Query:   141 GAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS--TDNNGCSGGLM 198
             GAVT I+NQG CG CWAFSAVAA+EGI +I  G L+ LSEQQL+DC   T N GCSGGLM
Sbjct:   137 GAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLM 196

Query:   199 DKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK 258
             + AFE+I  N GLATE DYPY   +GTCD++K K    TI  Y+ + + +E +L  A  +
Sbjct:   197 ETAFEFIKTNGGLATETDYPYTGIEGTCDQEKSKNKVVTIQGYQKVAQ-NEASLQIAAAQ 255

Query:   259 QPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGET 318
             QPVSV ++A G  F+ Y  GV    CG N +HGV VVG+G    E   KYW++KNSWG  
Sbjct:   256 QPVSVGIDAGGFIFQLYSSGVFTNYCGTNLNHGVTVVGYGV---EGDQKYWIVKNSWGTG 312

Query:   319 WGESGYIRILR----DEGLCGIATEASYPV 344
             WGE GYIR+ R    D G CGIA  ASYP+
Sbjct:   313 WGEEGYIRMERGVSEDTGKCGIAMMASYPL 342


>TAIR|locus:2030027
            symbol:AT1G29110 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            EMBL:CP002684 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            IPI:IPI00544534 RefSeq:NP_564322.1 UniGene:At.51816
            ProteinModelPortal:F4HZW2 SMR:F4HZW2 EnsemblPlants:AT1G29110.1
            GeneID:839786 KEGG:ath:AT1G29110 OMA:SCRANAR Uniprot:F4HZW2
        Length = 334

 Score = 694 (249.4 bits), Expect = 2.1e-68, P = 2.1e-68
 Identities = 140/322 (43%), Positives = 201/322 (62%)

Query:    32 SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEF 91
             +++E SIV+ H+QWM Q  R YKDE EK MRL +FK+NL++IE  N  GN++Y LG NEF
Sbjct:    28 TLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKVFKKNLKFIENFNNMGNQSYTLGVNEF 87

Query:    92 SDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPT---SIDWREKGAVTHIKN 148
             +D   EEF A++TG                  +  N++D+     S DWR++GAVT +K 
Sbjct:    88 TDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRNWNMSDIDMEDESKDWRDEGAVTPVKY 147

Query:   149 QGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNG-CSGGLMDKAFEYIIE 207
             QG C              +T+I+G  L+ LSEQQL+DC  + NG C+GG  ++AF+YII+
Sbjct:   148 QGACR-------------LTKISGKNLLTLSEQQLIDCDIEKNGGCNGGEFEEAFKYIIK 194

Query:   208 NKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEA 267
             N G++ E +YPYQ ++ +C     +A    I  ++ +P  +E ALL+AV +QPVSV ++A
Sbjct:   195 NGGVSLETEYPYQVKKESCRANARRAPHTQIRGFQMVPSHNERALLEAVRRQPVSVLIDA 254

Query:   268 SGQAFRFYKRGVL-NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
                +F  YK GV    +CG + +H V +VG+GT     G  YW++KNSWGE+WGE+GY+R
Sbjct:   255 RADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGTMS---GLNYWVLKNSWGESWGENGYMR 311

Query:   327 ILRD----EGLCGIATEASYPV 344
             I RD    +G+CGIA  A+YPV
Sbjct:   312 IRRDVEWPQGMCGIAQVAAYPV 333


>DICTYBASE|DDB_G0272815
            symbol:cprE "cysteine proteinase 5" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0272815 GO:GO:0005615
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GenomeReviews:CM000151_GR GO:GO:0005764
            EMBL:AAFI02000008 MEROPS:I29.003 KO:K01376 EMBL:L36205
            RefSeq:XP_644977.1 ProteinModelPortal:P54640 SMR:P54640
            PRIDE:P54640 EnsemblProtists:DDB0185092 GeneID:8618654
            KEGG:ddi:DDB_G0272815 OMA:METAFEF ProtClustDB:CLSZ2430780
            Uniprot:P54640
        Length = 344

 Score = 563 (203.2 bits), Expect = 4.2e-67, Sum P(2) = 4.2e-67
 Identities = 112/257 (43%), Positives = 153/257 (59%)

Query:    45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYT 104
             WM  H ++Y  E E   R  IFK N++Y+++ N +G+ T  LG N F+D+TNEE+R +Y 
Sbjct:    33 WMITHQKSYTSE-EFGARYNIFKANMDYVQQWNSKGSETV-LGLNNFADITNEEYRNTYL 90

Query:   105 GYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAV 164
             G                 F     T    S DWR +GAVT +KNQG CG CW+FS   + 
Sbjct:    91 G-TKFDASSLIGTQEEKVF----TTSSAASKDWRSEGAVTPVKNQGQCGGCWSFSTTGST 145

Query:   165 EGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQG 224
             EG    + G+L+ LSEQ L+DCST+N+GC GGLM  AFEYII N G+ TE+ YPY+ E G
Sbjct:   146 EGAHFQSKGELVSLSEQNLIDCSTENSGCDGGLMTYAFEYIINNNGIDTESSYPYKAENG 205

Query:   225 TCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVL-NAE 283
              C+ + E + A T+  Y+ +  G E +L  AV   PVSV ++AS Q+F+ Y  G+    E
Sbjct:   206 KCEYKSENSGA-TLSSYKTVTAGSESSLESAVNVNPVSVAIDASHQSFQLYTSGIYYEPE 264

Query:   284 CG-DNCDHGVAVVGFGT 299
             C  +N DHGV  VG+G+
Sbjct:   265 CSSENLDHGVLAVGYGS 281

 Score = 137 (53.3 bits), Expect = 4.2e-67, Sum P(2) = 4.2e-67
 Identities = 30/76 (39%), Positives = 43/76 (56%)

Query:   286 DNCDHGVAVVGFGTAE-----EEDGA-----------KYWLIKNSWGETWGESGYIRILR 329
             +N DHGV  VG+G+       +  G            +YW++KNSWG +WG  GYI + R
Sbjct:   268 ENLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNSWGTSWGIEGYILMSR 327

Query:   330 D-EGLCGIATEASYPV 344
             + +  CGIA+ AS+PV
Sbjct:   328 NRDNNCGIASSASFPV 343


>UNIPROTKB|F1S4J6
            symbol:Ssc.54235 "Cathepsin L1" species:9823 "Sus scrofa"
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197
            GO:GO:0002250 GeneTree:ENSGT00560000076577 GO:GO:0097067
            EMBL:CU571031 RefSeq:XP_003130681.1 Ensembl:ENSSSCT00000011983
            GeneID:100515919 KEGG:ssc:100515919 OMA:IAICATK Uniprot:F1S4J6
        Length = 332

 Score = 677 (243.4 bits), Expect = 1.3e-66, P = 1.3e-66
 Identities = 144/327 (44%), Positives = 195/327 (59%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRT 83
             + S    H+ S+     +W A H + Y    E+  R  I+++N++ IE+ N   ++G  +
Sbjct:    14 IASAAPRHDHSLDADWYKWKATHRKLYGLN-EEGRRRAIWEKNMKMIERHNWEHRQGKHS 72

Query:    84 YKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAV 143
             + +  N F D+TNEEFR +  G+                F        P S+DWREKG V
Sbjct:    73 FTMAMNAFGDMTNEEFRKTMNGFQNQKHKKGK------VFLDAGSALTPHSVDWREKGYV 126

Query:   144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS--TDNNGCSGGLMDKA 201
             T +KNQGHCGSCWAFSA  A+EG       KLI LSEQ LVDCS    N GC+GGLMD A
Sbjct:   127 TAVKNQGHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSWPEGNEGCNGGLMDNA 186

Query:   202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQP 260
             F+YI +N GL +E  YPY  + G+C K K +++AA    Y D+PK  E AL++AV T  P
Sbjct:   187 FQYIKDNGGLDSEESYPYFGKDGSC-KYKPQSSAANDTGYVDIPK-QEKALMKAVATVGP 244

Query:   261 VSVCVEASGQAFRFYKRGV-LNAECG-DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGET 318
             +SV ++AS ++F+FY  G+    +C  ++ DHGV VVG+G        KYWL+KNSWG T
Sbjct:   245 ISVGIDASHESFQFYSTGIYFEPQCSSEDLDHGVLVVGYGVEGAHSNNKYWLVKNSWGNT 304

Query:   319 WGESGYIRILRDEGL-CGIATEASYPV 344
             WG  GYI++ +D+   CGIAT ASYPV
Sbjct:   305 WGMDGYIKMTKDQNNHCGIATMASYPV 331


>UNIPROTKB|F1NYJ1
            symbol:CTSL2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 IPI:IPI00602255
            OMA:DITHHEF EMBL:AADN02067812 Ensembl:ENSGALT00000020588
            ArrayExpress:F1NYJ1 Uniprot:F1NYJ1
        Length = 339

 Score = 674 (242.3 bits), Expect = 2.8e-66, P = 2.8e-66
 Identities = 139/321 (43%), Positives = 195/321 (60%)

Query:    35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
             +P +    + W + H + Y  E E++ R  ++++NL+ IE  N +   G  +YKLG N+F
Sbjct:    23 DPDLDSHWQLWKSWHSKDYH-EREESWRRVVWEKNLKMIELHNLDHSLGKHSYKLGMNQF 81

Query:    92 SDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
              D+T EEFR    GY                F   +  + P S+DWREKG VT +K+QG 
Sbjct:    82 GDMTAEEFRQLMNGYKHKKSERKYRGSQ---FLEPSFLEAPRSVDWREKGYVTPVKDQGQ 138

Query:   152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENK 209
             CGSCWAFS   A+EG      GKL+ LSEQ LVDCS    N GC+GGLMD+AF+Y+ +N 
Sbjct:   139 CGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNG 198

Query:   210 GLATEADYPYQ-QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVSVCVEA 267
             G+ +E  YPY  ++   C  + E  AA   G + D+P+G E AL++AV    PVSV ++A
Sbjct:   199 GIDSEESYPYTAKDDEDCRYKAEYNAANDTG-FVDIPQGHERALMKAVASVGPVSVAIDA 257

Query:   268 SGQAFRFYKRGVL-NAECG-DNCDHGVAVVGFG-TAEEEDGAKYWLIKNSWGETWGESGY 324
                +F+FY+ G+    +C  ++ DHGV VVG+G   E+ DG KYW++KNSWGE WG+ GY
Sbjct:   258 GHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGY 317

Query:   325 IRILRD-EGLCGIATEASYPV 344
             I + +D +  CGIAT ASYP+
Sbjct:   318 IYMAKDRKNHCGIATAASYPL 338


>ZFIN|ZDB-GENE-030131-106
            symbol:ctsl1a "cathepsin L, 1 a" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030131-106 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 HSSP:P43235
            KO:K01365 EMBL:BC066490 IPI:IPI00495935 RefSeq:NP_997749.1
            UniGene:Dr.104499 ProteinModelPortal:Q6NYR5 SMR:Q6NYR5
            MEROPS:C01.074 PRIDE:Q6NYR5 GeneID:321453 KEGG:dre:321453
            CTD:321453 InParanoid:Q6NYR5 NextBio:20807387 ArrayExpress:Q6NYR5
            Bgee:Q6NYR5 Uniprot:Q6NYR5
        Length = 337

 Score = 664 (238.8 bits), Expect = 3.2e-65, P = 3.2e-65
 Identities = 140/333 (42%), Positives = 193/333 (57%)

Query:    23 CASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE--- 79
             C S V +  ++ +  + +  +QW   H + Y    E+  R  I+++NL+ IE  N E   
Sbjct:    11 CLSAVFAAPTLDQ-QLNDHWDQWKKWHSKKYH-ATEEGWRRVIWEKNLKKIEMHNLEHSM 68

Query:    80 GNRTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWRE 139
             G  TY+LG N F D+T+EEFR    G+                F   N  +VP  +DWRE
Sbjct:    69 GIHTYRLGMNHFGDMTHEEFRQVMNGFKHKKDRRFRGSL----FMEPNFIEVPNKLDWRE 124

Query:   140 KGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGL 197
             KG VT +K+QG CGSCWAFS   A+EG      GKL+ LSEQ LVDCS    N GC+GGL
Sbjct:   125 KGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGL 184

Query:   198 MDKAFEYIIENKGLATEADYPYQ-QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV 256
             MD+AF+Y+ +  GL +E  YPY   +   C    + +AA   G + D+P G E AL++A+
Sbjct:   185 MDQAFQYVKDQNGLDSEESYPYLGTDDQPCHFDPKNSAANDTG-FVDIPSGKERALMKAI 243

Query:   257 TKQ-PVSVCVEASGQAFRFYKRGVL-NAECG-DNCDHGVAVVGFG-TAEEEDGAKYWLIK 312
                 PVSV ++A  ++F+FY+ G+    EC  +  DHGV  VG+G   E+ DG KYW++K
Sbjct:   244 AAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVDGKKYWIVK 303

Query:   313 NSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
             NSW E WG+ GYI + +D    CGIAT ASYP+
Sbjct:   304 NSWSENWGDKGYIYMAKDRHNHCGIATAASYPL 336


>MGI|MGI:107341
            symbol:Ctss "cathepsin S" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=ISO;IDA] [GO:0006508 "proteolysis"
            evidence=IDA] [GO:0008152 "metabolic process" evidence=ISO]
            [GO:0008233 "peptidase activity" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0009986 "cell
            surface" evidence=ISO] [GO:0016020 "membrane" evidence=IDA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0045453 "bone
            resorption" evidence=ISO] [GO:0051930 "regulation of sensory
            perception of pain" evidence=ISO] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:107341 GO:GO:0016020 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0008233 GO:GO:0031905 Reactome:REACT_102124
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 BRENDA:3.4.22.27
            ChiTaRS:CTSS EMBL:AF051732 EMBL:AF051727 EMBL:AF051728
            EMBL:AF051729 EMBL:AF051726 EMBL:AF051730 EMBL:AF051731
            EMBL:AF038546 EMBL:AJ002386 EMBL:AC092203 EMBL:Y18466 EMBL:AJ223208
            IPI:IPI00309520 UniGene:Mm.3619 PDB:1M0H PDBsum:1M0H
            ProteinModelPortal:O70370 SMR:O70370 STRING:O70370
            PhosphoSite:O70370 PaxDb:O70370 PRIDE:O70370
            Ensembl:ENSMUST00000116304 BindingDB:O70370 ChEMBL:CHEMBL4098
            NextBio:282932 Bgee:O70370 CleanEx:MM_CTSS Genevestigator:O70370
            GermOnline:ENSMUSG00000038642 Uniprot:O70370
        Length = 340

 Score = 662 (238.1 bits), Expect = 5.2e-65, P = 5.2e-65
 Identities = 144/320 (45%), Positives = 192/320 (60%)

Query:    35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
             +P++    + W   H + YKD+ E+ +R  I+++NL++I   N E   G  TY++G N+ 
Sbjct:    29 DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 88

Query:    92 SDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFK-YQNVTDVPTSIDWREKGAVTHIKNQG 150
              D+TNEE                       TF+ Y N T +P ++DWREKG VT +K QG
Sbjct:    89 GDMTNEEILC-----RMGALRIPRQSPKTVTFRSYSNRT-LPDTVDWREKGCVTEVKYQG 142

Query:   151 HCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD----NNGCSGGLMDKAFEYII 206
              CG+CWAFSAV A+EG  ++  GKLI LS Q LVDCS +    N GC GG M +AF+YII
Sbjct:   143 SCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYII 202

Query:   207 ENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCV 265
             +N G+  +A YPY+     C     K  AAT  +Y  LP GDE AL +AV TK PVSV +
Sbjct:   203 DNGGIEADASYPYKATDEKCH-YNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGI 261

Query:   266 EASGQAFRFYKRGVLN-AECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGY 324
             +AS  +F FYK GV +   C  N +HGV VVG+GT    DG  YWL+KNSWG  +G+ GY
Sbjct:   262 DASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTL---DGKDYWLVKNSWGLNFGDQGY 318

Query:   325 IRILRD-EGLCGIATEASYP 343
             IR+ R+ +  CGIA+  SYP
Sbjct:   319 IRMARNNKNHCGIASYCSYP 338


>RGD|1560071
            symbol:Ctsll3 "cathepsin L-like 3" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1560071 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:CH474032 IPI:IPI00560469 RefSeq:XP_001065834.2
            RefSeq:XP_573976.3 UniGene:Rn.104851 MEROPS:C01.107
            Ensembl:ENSRNOT00000061398 GeneID:498691 KEGG:rno:498691
            UCSC:RGD:1560071 CTD:70202 OMA:NCGIASD OrthoDB:EOG4HDSTZ
            NextBio:700548 Uniprot:D3ZJV2
        Length = 330

 Score = 659 (237.0 bits), Expect = 1.1e-64, P = 1.1e-64
 Identities = 138/327 (42%), Positives = 192/327 (58%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRT 83
             ++S    H+PS     E+W  +HG+TY    E+  +  +++ N++ I   N++   G   
Sbjct:    14 MISAAPTHDPSFDTVWEEWKTKHGKTYNTN-EEGQKRAVWENNMKMINLHNEDYLKGKHG 72

Query:    84 YKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAV 143
             + L  N F DLTN EFR   TG+                F    + DVP ++DWR+ G V
Sbjct:    73 FSLEMNAFGDLTNTEFRELMTGFQGQKTKMMK------VFPEPFLGDVPKTVDWRKHGYV 126

Query:   144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKA 201
             T +KNQG CGSCWAFSAV ++EG      GKL+ LSEQ LVDCS    N GC GGL D A
Sbjct:   127 TPVKNQGPCGSCWAFSAVGSLEGQVFRKTGKLVPLSEQNLVDCSWSHGNKGCDGGLPDFA 186

Query:   202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQP 260
             F+Y+ +N GL T   YPY+   GTC    + +AA  +G +  +P   E+AL++AV T  P
Sbjct:   187 FQYVKDNGGLDTSVSYPYEALNGTCRYNPKYSAAKVVG-FMSIPPS-ENALMKAVATVGP 244

Query:   261 VSVCVEASGQAFRFYKRGVL-NAECGD-NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGET 318
             +SV ++   ++F+FYK G+    +C   N +H V VVG+G  EE DG KYWL+KNSWG  
Sbjct:   245 ISVGIDIKHKSFQFYKGGMYYEPDCSSTNLNHAVLVVGYG--EESDGRKYWLVKNSWGRD 302

Query:   319 WGESGYIRILRD-EGLCGIATEASYPV 344
             WG  GYI++ +D    CGIA++ASYP+
Sbjct:   303 WGMDGYIKMAKDWNNNCGIASDASYPI 329


>ZFIN|ZDB-GENE-071004-74
            symbol:zgc:174855 "zgc:174855" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-071004-74
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 MEROPS:C01.032 EMBL:BX000534 EMBL:BC152282
            IPI:IPI00773140 RefSeq:NP_001096592.1 UniGene:Dr.104905 SMR:A7MCR6
            STRING:A7MCR6 Ensembl:ENSDART00000109968 GeneID:569326
            KEGG:dre:569326 NextBio:20889622 Uniprot:A7MCR6
        Length = 335

 Score = 657 (236.3 bits), Expect = 1.8e-64, P = 1.8e-64
 Identities = 135/331 (40%), Positives = 196/331 (59%)

Query:    23 CASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE--- 79
             C S V +  S+ +  + +    W +QHG++Y +++E   R+ I+++NL  IE+ N E   
Sbjct:    10 CISAVFTAPSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSL 67

Query:    80 GNRTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWRE 139
             GN T+K+G N+F D+TNEEFR +  GY                F   +    P  +DWR+
Sbjct:    68 GNHTFKMGMNQFGDMTNEEFRQAMNGYKQDPNRTSKGAL----FMEPSFFAAPQQVDWRQ 123

Query:   140 KGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGL 197
             +G VT +K+Q  CGSCW+FS+  A+EG      GKLI +SEQ LVDCS    N GC+GG+
Sbjct:   124 RGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGI 183

Query:   198 MDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVT 257
             MD+AF+Y+ ENKGL +E  YPY        +   +   A I  + D+P+G+E AL+ AV 
Sbjct:   184 MDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGNELALMNAVA 243

Query:   258 KQ-PVSVCVEASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFG-TAEEEDGAKYWLIKNS 314
                PVSV ++AS Q+ +FY+ G+     C    DH V VVG+G    +  G +YW++KNS
Sbjct:   244 AVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAGNRYWIVKNS 303

Query:   315 WGETWGESGYIRILRDEGL-CGIATEASYPV 344
             W + WG+ GYI + +D+   CGIAT ASYP+
Sbjct:   304 WSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334


>ZFIN|ZDB-GENE-030131-572
            symbol:wu:fb37b09 "wu:fb37b09" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030131-572 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:BX465840 IPI:IPI00866294 RefSeq:XP_001923796.1
            UniGene:Dr.25683 PRIDE:E9QBE2 Ensembl:ENSDART00000133962
            GeneID:321853 KEGG:dre:321853 NextBio:20807556 Uniprot:E9QBE2
        Length = 335

 Score = 656 (236.0 bits), Expect = 2.3e-64, P = 2.3e-64
 Identities = 137/322 (42%), Positives = 192/322 (59%)

Query:    36 PSI---VEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGT 88
             PSI   ++ H   W +QHG++Y +++E   R+ I+++NL  IE+ N E   GN T+K+G 
Sbjct:    18 PSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSLGNHTFKMGM 76

Query:    89 NEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKN 148
             N+F D+TNEEFR +  GY                F        P  +DWR++G VT +K+
Sbjct:    77 NQFGDMTNEEFRQAMNGYKHDPNRTSQGPL----FMEPKFFAAPQQVDWRQRGYVTPVKD 132

Query:   149 QGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYII 206
             Q  CGSCW+FS+  A+EG      GKLI +SEQ LVDCS    N GC+GGLMD+AF+Y+ 
Sbjct:   133 QKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHGNQGCNGGLMDQAFQYVK 192

Query:   207 ENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVSVCV 265
             ENKGL +E  YPY        +   +   A I  + D+PKG+E AL+ AV    PVSV +
Sbjct:   193 ENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGNELALMNAVAAVGPVSVAI 252

Query:   266 EASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFG-TAEEEDGAKYWLIKNSWGETWGESG 323
             +AS Q+ +FY+ G+     C    DH V VVG+G    +  G +YW++KNSW + WG+ G
Sbjct:   253 DASHQSLQFYQSGIYYERACTSQLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKG 312

Query:   324 YIRILRDEGL-CGIATEASYPV 344
             YI + +D+   CGIAT ASYP+
Sbjct:   313 YIYMAKDKNNHCGIATMASYPL 334


>RGD|1308751
            symbol:RGD1308751 "similar to Cathepsin L precursor (Major
            excreted protein) (MEP)" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1308751 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:CH474032 IPI:IPI00365697 RefSeq:XP_001065885.2
            RefSeq:XP_225137.5 MEROPS:C01.069 Ensembl:ENSRNOT00000061391
            GeneID:290981 KEGG:rno:290981 UCSC:RGD:1308751 CTD:290981
            OMA:ESYAYEA OrthoDB:EOG42823G NextBio:631921 Uniprot:D3ZKC3
        Length = 330

 Score = 655 (235.6 bits), Expect = 2.9e-64, P = 2.9e-64
 Identities = 135/325 (41%), Positives = 188/325 (57%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRT 83
             ++S    H+PS     E+W  +HG+TY    E+  +  +++ N++ I   N++   G   
Sbjct:    14 MISAAPTHDPSFDTVWEEWKTKHGKTYNTN-EEGQKRAVWENNMKMINLHNEDYLKGKHG 72

Query:    84 YKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAV 143
             + L  N F DLTN EFR   TG+                F+   + D+P S+DWRE G V
Sbjct:    73 FSLEMNAFGDLTNTEFRELMTGFQSMGPKETTI------FREPFLGDIPKSLDWREHGYV 126

Query:   144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKA 201
             T +KNQG CGSCWAFSAV ++EG      GKL+ LSEQ LVDCS    N GC+GGLM+ A
Sbjct:   127 TPVKNQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDCSWSYGNLGCNGGLMEFA 186

Query:   202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
             F+Y+ EN+GL T   Y Y+ + G C +   K +AA +  +  +P  ++  +    +  PV
Sbjct:   187 FQYVKENRGLDTGESYAYEAQDGLC-RYNPKYSAANVTGFVKVPLSEDDLMSAVASVGPV 245

Query:   262 SVCVEASGQAFRFYKRGVL-NAECGDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETW 319
             SV +++  Q+FRFY  G+    +C     DH V VVG+G  EE DG KYWL+KNSWGE W
Sbjct:   246 SVGIDSHHQSFRFYSGGMYYEPDCSSTEMDHAVLVVGYG--EESDGGKYWLVKNSWGEDW 303

Query:   320 GESGYIRILRDEGL-CGIATEASYP 343
             G  GYI++ +D+   CGIAT A YP
Sbjct:   304 GMDGYIKMAKDQNNNCGIATYAIYP 328


>ZFIN|ZDB-GENE-080215-7
            symbol:zgc:174153 "zgc:174153" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-080215-7
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 EMBL:BX000534 EMBL:BX322603
            IPI:IPI00483644 Ensembl:ENSDART00000113654 OMA:ITLCISA Bgee:F1R8Y0
            Uniprot:F1R8Y0
        Length = 336

 Score = 652 (234.6 bits), Expect = 6.0e-64, P = 6.0e-64
 Identities = 135/332 (40%), Positives = 197/332 (59%)

Query:    23 CASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE--- 79
             C S V +  S+ +  + +    W +QHG++Y +++E   R+ I+++NL  IE+ N E   
Sbjct:    10 CISAVFTAPSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSY 67

Query:    80 GNRTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWRE 139
             GN T+K+G N+F D+TNEEFR +  GY                F   +    P  +DWR+
Sbjct:    68 GNHTFKMGMNQFGDMTNEEFRQAMNGYKHDPNQTSQGPL----FMEPSFFAAPQQVDWRQ 123

Query:   140 KGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGL 197
             +G VT +K+Q  CGSCW+FS+  A+EG      GKLI +SEQ LVDCS    N GC+GGL
Sbjct:   124 RGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGL 183

Query:   198 MDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVT 257
             MD+AF+Y+ ENKGL +E  YPY        +   +   A I  + D+P G+E AL+ AV 
Sbjct:   184 MDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNEPALMNAVA 243

Query:   258 KQ-PVSVCVEASGQAFRFYKRGVL-NAECGDN-CDHGVAVVGFG-TAEEEDGAKYWLIKN 313
                PVSV ++AS Q+ +FY+ G+     C  +  DH V VVG+G    +  G +YW++KN
Sbjct:   244 AVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAGNRYWIVKN 303

Query:   314 SWGETWGESGYIRILRDEGL-CGIATEASYPV 344
             SW + WG+ GYI + +D+   CG+AT+ASYP+
Sbjct:   304 SWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335


>UNIPROTKB|Q9GL24
            symbol:CTSL1 "Cathepsin L1" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 CTD:1515 KO:K01365
            OrthoDB:EOG48PMKF EMBL:AJ279008 RefSeq:NP_001239115.1
            UniGene:Cfa.3571 ProteinModelPortal:Q9GL24 SMR:Q9GL24
            MEROPS:C01.032 Ensembl:ENSCAFT00000001770
            Ensembl:ENSCAFT00000023837 GeneID:100684364 KEGG:cfa:100684364
            InParanoid:Q9GL24 OMA:FDQNLDT NextBio:20817211 Uniprot:Q9GL24
        Length = 333

 Score = 648 (233.2 bits), Expect = 1.6e-63, P = 1.6e-63
 Identities = 138/327 (42%), Positives = 194/327 (59%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRT 83
             + S     + S+  +  QW A H R Y    E+  R  ++++N++ IE  N+E   G   
Sbjct:    14 IASAAPKFDQSLNAQWYQWKATHRRLYGMN-EEGWRRAVWEKNMKMIELHNREYSQGKHG 72

Query:    84 YKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAV 143
             + +  N F D+TNEEFR    G+                F+     ++P S+DWREKG V
Sbjct:    73 FTMAMNAFGDMTNEEFRQVMNGFQNQKHKKGKM------FQEPLFAEIPKSVDWREKGYV 126

Query:   144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKA 201
             T +KNQG CGSCWAFSA  A+EG      GKL+ LSEQ LVDCS    N GC+GGLMD A
Sbjct:   127 TPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNA 186

Query:   202 FEYIIENKGLATEADYPYQ-QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQ 259
             F Y+ +N GL +E  YPY  ++  TC+ + E +AA   G + DLP+  E AL++AV T  
Sbjct:   187 FRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTG-FVDLPQR-EKALMKAVATLG 244

Query:   260 PVSVCVEASGQAFRFYKRGV-LNAECGD-NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGE 317
             P+SV ++A  Q+F+FYK G+  + +C   + DHGV VVG+G    +   K+W++KNSWG 
Sbjct:   245 PISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKFWIVKNSWGP 304

Query:   318 TWGESGYIRILRDEGL-CGIATEASYP 343
              WG +GY+++ +D+   CGIAT ASYP
Sbjct:   305 EWGWNGYVKMAKDQNNHCGIATAASYP 331


>UNIPROTKB|P07711
            symbol:CTSL1 "Cathepsin L1" species:9606 "Homo sapiens"
            [GO:0005576 "extracellular region" evidence=NAS] [GO:0005764
            "lysosome" evidence=IDA;NAS] [GO:0006508 "proteolysis"
            evidence=IDA] [GO:0002250 "adaptive immune response" evidence=IEP]
            [GO:0019882 "antigen processing and presentation" evidence=TAS]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0036021 "endolysosome lumen" evidence=TAS] [GO:0043202
            "lysosomal lumen" evidence=TAS] [GO:0045087 "innate immune
            response" evidence=TAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042393 "histone binding" evidence=IDA] [GO:0005634 "nucleus"
            evidence=TAS] [GO:0071888 "macrophage apoptotic process"
            evidence=NAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            Reactome:REACT_118779 EMBL:X12451 GO:GO:0005634 Reactome:REACT_6900
            GO:GO:0005576 GO:GO:0019886 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087
            GO:GO:0042393 GO:GO:0004197 GO:GO:0002250 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0036021 KO:K01365 OrthoDB:EOG48PMKF EMBL:M20496
            EMBL:CR457053 EMBL:BX537395 EMBL:AL160279 EMBL:BC012612 EMBL:X05256
            IPI:IPI00012887 PIR:S01002 RefSeq:NP_001244900.1
            RefSeq:NP_001244901.1 RefSeq:NP_001903.1 RefSeq:NP_666023.1
            UniGene:Hs.731507 UniGene:Hs.731952 PDB:1CJL PDB:1CS8 PDB:1ICF
            PDB:1MHW PDB:2NQD PDB:2VHS PDB:2XU1 PDB:2XU3 PDB:2XU4 PDB:2XU5
            PDB:2YJ2 PDB:2YJ8 PDB:2YJ9 PDB:2YJB PDB:2YJC PDB:3BC3 PDB:3H89
            PDB:3H8B PDB:3H8C PDB:3HHA PDB:3HWN PDB:3IV2 PDB:3K24 PDB:3KSE
            PDB:3OF8 PDB:3OF9 PDBsum:1CJL PDBsum:1CS8 PDBsum:1ICF PDBsum:1MHW
            PDBsum:2NQD PDBsum:2VHS PDBsum:2XU1 PDBsum:2XU3 PDBsum:2XU4
            PDBsum:2XU5 PDBsum:2YJ2 PDBsum:2YJ8 PDBsum:2YJ9 PDBsum:2YJB
            PDBsum:2YJC PDBsum:3BC3 PDBsum:3H89 PDBsum:3H8B PDBsum:3H8C
            PDBsum:3HHA PDBsum:3HWN PDBsum:3IV2 PDBsum:3K24 PDBsum:3KSE
            PDBsum:3OF8 PDBsum:3OF9 ProteinModelPortal:P07711 SMR:P07711
            IntAct:P07711 STRING:P07711 MEROPS:I29.001 PhosphoSite:P07711
            DMDM:115741 PaxDb:P07711 PeptideAtlas:P07711 PRIDE:P07711
            DNASU:1514 Ensembl:ENST00000340342 Ensembl:ENST00000343150
            GeneID:1514 KEGG:hsa:1514 UCSC:uc004aph.3 CTD:1514
            GeneCards:GC09P090341 H-InvDB:HIX0058839 H-InvDB:HIX0170314
            HGNC:HGNC:2537 HPA:CAB000459 MIM:116880 neXtProt:NX_P07711
            PharmGKB:PA162382890 InParanoid:P07711 OMA:REPLFAQ PhylomeDB:P07711
            BRENDA:3.4.22.15 BindingDB:P07711 ChEMBL:CHEMBL3837 ChiTaRS:CTSL1
            DrugBank:DB00040 EvolutionaryTrace:P07711 GenomeRNAi:1514
            NextBio:6271 PMAP-CutDB:P07711 ArrayExpress:P07711 Bgee:P07711
            CleanEx:HS_CTSL1 Genevestigator:P07711 GermOnline:ENSG00000135047
            GO:GO:0071888 Uniprot:P07711
        Length = 333

 Score = 648 (233.2 bits), Expect = 1.6e-63, P = 1.6e-63
 Identities = 137/327 (41%), Positives = 192/327 (58%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRT 83
             + S     + S+  +  +W A H R Y    E+  R  ++++N++ IE  N   +EG  +
Sbjct:    14 IASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKMIELHNQEYREGKHS 72

Query:    84 YKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAV 143
             + +  N F D+T+EEFR    G+                F+     + P S+DWREKG V
Sbjct:    73 FTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGK------VFQEPLFYEAPRSVDWREKGYV 126

Query:   144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS--TDNNGCSGGLMDKA 201
             T +KNQG CGSCWAFSA  A+EG      G+LI LSEQ LVDCS    N GC+GGLMD A
Sbjct:   127 TPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYA 186

Query:   202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQP 260
             F+Y+ +N GL +E  YPY+  + +C K   K + A    + D+PK  E AL++AV T  P
Sbjct:   187 FQYVQDNGGLDSEESYPYEATEESC-KYNPKYSVANDTGFVDIPK-QEKALMKAVATVGP 244

Query:   261 VSVCVEASGQAFRFYKRGV-LNAECG-DNCDHGVAVVGFG-TAEEEDGAKYWLIKNSWGE 317
             +SV ++A  ++F FYK G+    +C  ++ DHGV VVG+G  + E D  KYWL+KNSWGE
Sbjct:   245 ISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGE 304

Query:   318 TWGESGYIRILRDE-GLCGIATEASYP 343
              WG  GY+++ +D    CGIA+ ASYP
Sbjct:   305 EWGMGGYVKMAKDRRNHCGIASAASYP 331


>ZFIN|ZDB-GENE-980526-285
            symbol:ctsl1b "cathepsin L, 1 b" species:7955
            "Danio rerio" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005576
            "extracellular region" evidence=IDA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-980526-285 GO:GO:0005576 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:BX465840 IPI:IPI00498443 Ensembl:ENSDART00000145570
            Bgee:F1R7B3 Uniprot:F1R7B3
        Length = 352

 Score = 648 (233.2 bits), Expect = 1.6e-63, P = 1.6e-63
 Identities = 135/323 (41%), Positives = 194/323 (60%)

Query:    36 PSI---VEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGT 88
             PSI   ++ H   W +QHG++Y +++E   R+ I+++NL  IE+ N E   GN T+K+G 
Sbjct:    34 PSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSYGNHTFKMGM 92

Query:    89 NEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKN 148
             N+F D+TNEEFR +  GY                F   +    P  +DWR++G VT +K+
Sbjct:    93 NQFGDMTNEEFRQAMNGYTHDPNQTSQGPL----FMEPSFFAAPQQVDWRQRGYVTPVKD 148

Query:   149 QGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYII 206
             Q  CGSCW+FS+  A+EG      GKLI +SEQ LVDCS    N GC+GGLMD+AF+Y+ 
Sbjct:   149 QKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGLMDQAFQYVK 208

Query:   207 ENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVSVCV 265
             ENKGL +E  YPY        +   +   A I  + D+P G+E AL+ AV    PVSV +
Sbjct:   209 ENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNELALMNAVAAVGPVSVAI 268

Query:   266 EASGQAFRFYKRGVL-NAECGDN-CDHGVAVVGFG-TAEEEDGAKYWLIKNSWGETWGES 322
             +AS Q+ +FY+ G+     C  +  DH V VVG+G    +  G +YW++KNSW + WG+ 
Sbjct:   269 DASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDK 328

Query:   323 GYIRILRDEGL-CGIATEASYPV 344
             GYI + +D+   CG+AT+ASYP+
Sbjct:   329 GYIYMAKDKNNHCGVATKASYPL 351


>UNIPROTKB|Q28944
            symbol:CTSL1 "Cathepsin L1" species:9823 "Sus scrofa"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 KO:K01365 OrthoDB:EOG48PMKF MEROPS:C01.032
            CTD:1514 EMBL:D37917 EMBL:AJ315771 PIR:A58195 RefSeq:NP_999057.1
            UniGene:Ssc.54036 ProteinModelPortal:Q28944 SMR:Q28944
            STRING:Q28944 Ensembl:ENSSSCT00000012233 GeneID:396926
            KEGG:ssc:396926 OMA:DASETGK ArrayExpress:Q28944 Uniprot:Q28944
        Length = 334

 Score = 647 (232.8 bits), Expect = 2.0e-63, P = 2.0e-63
 Identities = 136/311 (43%), Positives = 190/311 (61%)

Query:    44 QWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
             +W A HGR Y    E+  R  ++++N++ IE  N+E   G   + +  N F D+TNEEFR
Sbjct:    31 KWKATHGRLYGMN-EEGWRRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAFGDMTNEEFR 89

Query:   101 ASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSA 160
                 G+                F    V +VP S+DWREKG VT +KNQG CGSCWAFSA
Sbjct:    90 QVMNGFQNQKHKKGK------VFHESLVLEVPKSVDWREKGYVTAVKNQGQCGSCWAFSA 143

Query:   161 VAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYP 218
               A+EG      GKL+ LSEQ LVDCS    N GC+GGLMD AF+Y+ +N GL TE  YP
Sbjct:   144 TGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEESYP 203

Query:   219 YQ-QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYK 276
             Y  +E  +C  + E +AA   G + D+P+  E AL++AV T  P+SV ++A   +F+FYK
Sbjct:   204 YLGRETNSCTYKPECSAANDTG-FVDIPQR-EKALMKAVATVGPISVAIDAGHSSFQFYK 261

Query:   277 RGVL-NAECGD-NCDHGVAVVGFG-TAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGL 333
              G+  + +C   + DHGV VVG+G    + + +K+W++KNSWG  WG +GY+++ +D+  
Sbjct:   262 SGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGPEWGWNGYVKMAKDQNN 321

Query:   334 -CGIATEASYP 343
              CGI+T ASYP
Sbjct:   322 HCGISTAASYP 332


>TAIR|locus:2097104
            symbol:AT3G43960 species:3702 "Arabidopsis thaliana"
            [GO:0005886 "plasma membrane" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0031225 "anchored to
            membrane" evidence=TAS] [GO:0048767 "root hair elongation"
            evidence=IMP] [GO:0016132 "brassinosteroid biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:CP002686
            GenomeReviews:BA000014_GR GO:GO:0031225 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0048767 MEROPS:I29.003 HOGENOM:HOG000230773
            EMBL:AL163975 EMBL:AK118634 IPI:IPI00526842 PIR:T48950
            RefSeq:NP_566867.1 UniGene:At.43352 ProteinModelPortal:Q9LXW3
            SMR:Q9LXW3 STRING:Q9LXW3 PaxDb:Q9LXW3 PRIDE:Q9LXW3
            EnsemblPlants:AT3G43960.1 GeneID:823513 KEGG:ath:AT3G43960
            TAIR:At3g43960 eggNOG:NOG286334 InParanoid:Q9LXW3 KO:K01376
            OMA:MAISFRT PhylomeDB:Q9LXW3 ProtClustDB:CLSN2917367
            Genevestigator:Q9LXW3 GermOnline:AT3G43960 Uniprot:Q9LXW3
        Length = 376

 Score = 647 (232.8 bits), Expect = 2.0e-63, P = 2.0e-63
 Identities = 135/321 (42%), Positives = 188/321 (58%)

Query:    34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
             +E  ++  +EQW+ ++G+ Y    EK  R  IFK NL+ IE+ N + NR+Y+ G N+FSD
Sbjct:    33 NEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSD 92

Query:    94 LTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVT-HIKNQGHC 152
             LT +EF+ASY G                 ++Y+    +P  +DWRE+GAV   +K QG C
Sbjct:    93 LTADEFQASYLG---GKMEKKSLSDVAERYQYKEGDVLPDEVDWRERGAVVPRVKRQGEC 149

Query:   153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS--TDNNGCSGGLMDKAFEYIIENKG 210
             GSCWAF+A  AVEGI QIT G+L+ LSEQ+L+DC    DN GC+GG    AFE+I EN G
Sbjct:   150 GSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGG 209

Query:   211 LATEADYPYQQEQ-GTCDKQKEKAA-AATIGKYEDLPKGDEHALLQAVTKQPVSVCVEAS 268
             + ++  Y Y  E    C   + K     TI  +E +P  DE +L +AV  QP+SV + A+
Sbjct:   210 IVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVMISAA 269

Query:   269 GQAFRFYKRGVLNAECGDNC-DHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI 327
               +   YK GV    C +   DH V +VG+GT+ +E    YWLI+NSWG  WGE GY+R+
Sbjct:   270 NMSD--YKSGVYKGACSNLWGDHNVLIVGYGTSSDE--GDYWLIRNSWGPEWGEGGYLRL 325

Query:   328 LRD----EGLCGIATEASYPV 344
              R+     G C +A    YP+
Sbjct:   326 QRNFHEPTGKCAVAVAPVYPI 346


>UNIPROTKB|F1SS93
            symbol:CTSS "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0002250 "adaptive immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0016020 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0002250
            GeneTree:ENSGT00560000076577 GO:GO:0097067 EMBL:CU463875
            Ensembl:ENSSSCT00000007284 OMA:CEIESAV Uniprot:F1SS93
        Length = 342

 Score = 646 (232.5 bits), Expect = 2.6e-63, P = 2.6e-63
 Identities = 139/331 (41%), Positives = 195/331 (58%)

Query:    23 CASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE-- 79
             C+S +     +H    +++H + W   +G+ YK++ E+  R  I+++NL+ +   N E  
Sbjct:    22 CSSAMAQ---LHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHS 78

Query:    80 -GNRTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWR 138
              G  +Y LG N   D+T+EE  +  +                 T+K      +P S+DWR
Sbjct:    79 MGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNV-----TYKSNPNQKLPDSMDWR 133

Query:   139 EKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD---NNGCSG 195
             EKG VT +K QG CGSCWAFSAV A+E   ++  G+L+ LS Q LVDCST+   N GC+G
Sbjct:   134 EKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNG 193

Query:   196 GLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQA 255
             G M +AF+YII+N G+ +EA YPY+   G C K   K  AAT  +Y +LP  DE+AL +A
Sbjct:   194 GFMTEAFQYIIDNNGIDSEASYPYKAVDGKC-KYDSKNRAATCSRYTELPFADEYALKEA 252

Query:   256 VT-KQPVSVCVEASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKN 313
             V  K PVSV ++A   +F FY+ GV  +  C  N +HGV VVG+G    +D   YWL+KN
Sbjct:   253 VANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQNVNHGVLVVGYGNLNGKD---YWLVKN 309

Query:   314 SWGETWGESGYIRILRD-EGLCGIATEASYP 343
             SWG  +G+ GYIR+ R+ E  CGIA   SYP
Sbjct:   310 SWGLNFGDGGYIRMARNSENHCGIANYPSYP 340


>MGI|MGI:88564
            symbol:Ctsl "cathepsin L" species:10090 "Mus musculus"
            [GO:0004177 "aminopeptidase activity" evidence=ISO] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISO] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005730 "nucleolus"
            evidence=NAS] [GO:0005737 "cytoplasm" evidence=ISO] [GO:0005764
            "lysosome" evidence=ISO] [GO:0005773 "vacuole" evidence=ISO]
            [GO:0005902 "microvillus" evidence=ISO] [GO:0006508 "proteolysis"
            evidence=ISO;IDA] [GO:0007154 "cell communication" evidence=ISO]
            [GO:0008152 "metabolic process" evidence=TAS] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISO;TAS] [GO:0009897 "external side of
            plasma membrane" evidence=ISO] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0030141 "secretory granule" evidence=ISO]
            [GO:0030984 "kininogen binding" evidence=ISO] [GO:0032403 "protein
            complex binding" evidence=ISO] [GO:0042277 "peptide binding"
            evidence=ISO] [GO:0042393 "histone binding" evidence=ISO;NAS]
            [GO:0043005 "neuron projection" evidence=ISO] [GO:0043204
            "perikaryon" evidence=ISO] [GO:0045177 "apical part of cell"
            evidence=ISO] [GO:0048863 "stem cell differentiation" evidence=NAS]
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IMP] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:88564 GO:GO:0005730 GO:GO:0009897 GO:GO:0034698
            GO:GO:0043204 GO:GO:0009749 GO:GO:0030141 GO:GO:0048863
            GO:GO:0051384 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045177 GO:GO:0043005
            GO:GO:0007283 GO:GO:0004177 GO:GO:0005764 GO:GO:0042277
            GO:GO:0009267 GO:GO:0021675 GO:GO:0042393 GO:GO:0005902
            GO:GO:0010259 GO:GO:0004197 GO:GO:0048102 GO:GO:0046697
            HOVERGEN:HBG011513 KO:K01365 OMA:EEFRATH OrthoDB:EOG48PMKF
            MEROPS:C01.032 BRENDA:3.4.22.15 ChiTaRS:CTSL1 EMBL:X06086
            EMBL:J02583 EMBL:M20495 EMBL:AF121837 EMBL:AF121838 EMBL:AF121839
            EMBL:BC068163 EMBL:X04392 IPI:IPI00128154 PIR:S01177
            RefSeq:NP_034114.1 UniGene:Mm.930 PDB:1MVV PDBsum:1MVV
            ProteinModelPortal:P06797 SMR:P06797 STRING:P06797
            PhosphoSite:P06797 PaxDb:P06797 PRIDE:P06797
            Ensembl:ENSMUST00000021933 GeneID:13039 KEGG:mmu:13039 CTD:13039
            InParanoid:P06797 BioCyc:MetaCyc:MONOMER-14812 BindingDB:P06797
            ChEMBL:CHEMBL5291 NextBio:282928 Bgee:P06797 CleanEx:MM_CTSL
            Genevestigator:P06797 GermOnline:ENSMUSG00000021477 GO:GO:0060008
            Uniprot:P06797
        Length = 334

 Score = 646 (232.5 bits), Expect = 2.6e-63, P = 2.6e-63
 Identities = 142/332 (42%), Positives = 189/332 (56%)

Query:    23 CASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE--- 79
             C    ++     +    E H QW + H R Y    E+  R  I+++N+  I+  N E   
Sbjct:    11 CLGTALATPKFDQTFSAEWH-QWKSTHRRLYGTN-EEEWRRAIWEKNMRMIQLHNGEYSN 68

Query:    80 GNRTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWRE 139
             G   + +  N F D+TNEEFR    GY                F+   +  +P S+DWRE
Sbjct:    69 GQHGFSMEMNAFGDMTNEEFRQVVNGYRHQKHKKGRL------FQEPLMLKIPKSVDWRE 122

Query:   140 KGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS--TDNNGCSGGL 197
             KG VT +KNQG CGSCWAFSA   +EG   +  GKLI LSEQ LVDCS    N GC+GGL
Sbjct:   123 KGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGL 182

Query:   198 MDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV- 256
             MD AF+YI EN GL +E  YPY+ + G+C  + E A A   G + D+P+  E AL++AV 
Sbjct:   183 MDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTG-FVDIPQ-QEKALMKAVA 240

Query:   257 TKQPVSVCVEASGQAFRFYKRGVL-NAECGD-NCDHGVAVVGFG-TAEEEDGAKYWLIKN 313
             T  P+SV ++AS  + +FY  G+     C   N DHGV +VG+G    + +  KYWL+KN
Sbjct:   241 TVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKN 300

Query:   314 SWGETWGESGYIRILRD-EGLCGIATEASYPV 344
             SWG  WG  GYI+I +D +  CG+AT ASYPV
Sbjct:   301 SWGSEWGMEGYIKIAKDRDNHCGLATAASYPV 332


>UNIPROTKB|F1PAK0
            symbol:CTSS "Cathepsin S" species:9615 "Canis lupus
            familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:AAEX03011051 Ensembl:ENSCAFT00000019176 OMA:YEPACTQ
            Uniprot:F1PAK0
        Length = 339

 Score = 645 (232.1 bits), Expect = 3.3e-63, P = 3.3e-63
 Identities = 138/321 (42%), Positives = 188/321 (58%)

Query:    33 MHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGT 88
             +H+   ++ H   W   + + YK+E E+  R  I+++NL+++   N E   G  +Y LG 
Sbjct:    26 VHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGM 85

Query:    89 NEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKN 148
             N   D+T EE   S  G                T++  +   +P S+DWREKG VT +K 
Sbjct:    86 NHLGDMTGEEV-ISLMG----SLRVPSQWQRNVTYRSNSNQKLPDSVDWREKGCVTEVKY 140

Query:   149 QGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD---NNGCSGGLMDKAFEYI 205
             QG CG+CWAFSAV A+E   ++  GKL+ LS Q LVDCST+   N GC+GG M  AF+YI
Sbjct:   141 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYI 200

Query:   206 IENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVT-KQPVSVC 264
             I+N G+ +EA YPY+   G C +   K  AAT  KY +LP G E AL +AV  K PVSV 
Sbjct:   201 IDNNGIDSEASYPYKAVNGKC-RYDSKKRAATCSKYTELPFGSEDALKEAVANKGPVSVA 259

Query:   265 VEASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESG 323
             ++AS  +F  Y+ GV     C  N +HGV VVG+G    +D   YWL+KNSWG  +G+ G
Sbjct:   260 IDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNLNGKD---YWLVKNSWGLNFGDQG 316

Query:   324 YIRILRDEGL-CGIATEASYP 343
             YIR+ R+ G  CGIA+  SYP
Sbjct:   317 YIRMARNSGNHCGIASYPSYP 337


>UNIPROTKB|Q8HY81
            symbol:CTSS "Cathepsin S" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            CTD:1520 KO:K01368 OrthoDB:EOG4JM7Q2 EMBL:AY156692
            RefSeq:NP_001002938.2 UniGene:Cfa.1661 ProteinModelPortal:Q8HY81
            SMR:Q8HY81 STRING:Q8HY81 MEROPS:C01.034 GeneID:403400
            KEGG:cfa:403400 InParanoid:Q8HY81 NextBio:20816922 Uniprot:Q8HY81
        Length = 331

 Score = 645 (232.1 bits), Expect = 3.3e-63, P = 3.3e-63
 Identities = 138/321 (42%), Positives = 188/321 (58%)

Query:    33 MHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGT 88
             +H+   ++ H   W   + + YK+E E+  R  I+++NL+++   N E   G  +Y LG 
Sbjct:    18 VHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGM 77

Query:    89 NEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKN 148
             N   D+T EE   S  G                T++  +   +P S+DWREKG VT +K 
Sbjct:    78 NHLGDMTGEEV-ISLMG----SLRVPSQWQRNVTYRSNSNQKLPDSVDWREKGCVTEVKY 132

Query:   149 QGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD---NNGCSGGLMDKAFEYI 205
             QG CG+CWAFSAV A+E   ++  GKL+ LS Q LVDCST+   N GC+GG M  AF+YI
Sbjct:   133 QGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYI 192

Query:   206 IENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVT-KQPVSVC 264
             I+N G+ +EA YPY+   G C +   K  AAT  KY +LP G E AL +AV  K PVSV 
Sbjct:   193 IDNNGIDSEASYPYKAMNGKC-RYDSKKRAATCSKYTELPFGSEDALKEAVANKGPVSVA 251

Query:   265 VEASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESG 323
             ++AS  +F  Y+ GV     C  N +HGV VVG+G    +D   YWL+KNSWG  +G+ G
Sbjct:   252 IDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNLNGKD---YWLVKNSWGLNFGDQG 308

Query:   324 YIRILRDEGL-CGIATEASYP 343
             YIR+ R+ G  CGIA+  SYP
Sbjct:   309 YIRMARNSGNHCGIASYPSYP 329


>UNIPROTKB|P25774
            symbol:CTSS "Cathepsin S" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0016020 "membrane"
            evidence=IEA] [GO:0005576 "extracellular region" evidence=NAS]
            [GO:0005764 "lysosome" evidence=IDA;NAS] [GO:0097067 "cellular
            response to thyroid hormone stimulus" evidence=IEP] [GO:0002250
            "adaptive immune response" evidence=IEP] [GO:0019882 "antigen
            processing and presentation" evidence=TAS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=TAS] [GO:0006955
            "immune response" evidence=TAS] [GO:0002474 "antigen processing and
            presentation of peptide antigen via MHC class I" evidence=TAS]
            [GO:0002480 "antigen processing and presentation of exogenous
            peptide antigen via MHC class I, TAP-independent" evidence=TAS]
            [GO:0019886 "antigen processing and presentation of exogenous
            peptide antigen via MHC class II" evidence=TAS] [GO:0036021
            "endolysosome lumen" evidence=TAS] [GO:0042590 "antigen processing
            and presentation of exogenous peptide antigen via MHC class I"
            evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
            [GO:0045087 "innate immune response" evidence=TAS] [GO:0043231
            "intracellular membrane-bounded organelle" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 Reactome:REACT_118779
            Reactome:REACT_6900 GO:GO:0005576 GO:GO:0002480 GO:GO:0016020
            GO:GO:0019886 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087 EMBL:CH471121
            GO:GO:0004197 GO:GO:0002250 HOVERGEN:HBG011513 GO:GO:0097067
            GO:GO:0036021 EMBL:AL356292 CTD:1520 KO:K01368 OMA:KAMDQKC
            OrthoDB:EOG4JM7Q2 EMBL:S93414 EMBL:M86553 EMBL:M90696 EMBL:U07374
            EMBL:U07370 EMBL:U07371 EMBL:U07372 EMBL:U07373 EMBL:CR541676
            EMBL:AK301472 EMBL:AK314482 EMBL:BC002642 IPI:IPI00299150
            IPI:IPI00910216 PIR:A42482 RefSeq:NP_001186668.1 RefSeq:NP_004070.3
            UniGene:Hs.181301 PDB:1BXF PDB:1GLO PDB:1MS6 PDB:1NPZ PDB:1NQC
            PDB:2C0Y PDB:2F1G PDB:2FQ9 PDB:2FRA PDB:2FRQ PDB:2FT2 PDB:2FUD
            PDB:2FYE PDB:2G6D PDB:2G7Y PDB:2H7J PDB:2HH5 PDB:2HHN PDB:2HXZ
            PDB:2OP3 PDB:2R9M PDB:2R9N PDB:2R9O PDB:3IEJ PDB:3KWN PDB:3MPE
            PDB:3MPF PDB:3N3G PDB:3N4C PDB:3OVX PDBsum:1BXF PDBsum:1GLO
            PDBsum:1MS6 PDBsum:1NPZ PDBsum:1NQC PDBsum:2C0Y PDBsum:2F1G
            PDBsum:2FQ9 PDBsum:2FRA PDBsum:2FRQ PDBsum:2FT2 PDBsum:2FUD
            PDBsum:2FYE PDBsum:2G6D PDBsum:2G7Y PDBsum:2H7J PDBsum:2HH5
            PDBsum:2HHN PDBsum:2HXZ PDBsum:2OP3 PDBsum:2R9M PDBsum:2R9N
            PDBsum:2R9O PDBsum:3IEJ PDBsum:3KWN PDBsum:3MPE PDBsum:3MPF
            PDBsum:3N3G PDBsum:3N4C PDBsum:3OVX ProteinModelPortal:P25774
            SMR:P25774 IntAct:P25774 STRING:P25774 MEROPS:I29.004
            PhosphoSite:P25774 DMDM:88984046 PaxDb:P25774 PeptideAtlas:P25774
            PRIDE:P25774 DNASU:1520 Ensembl:ENST00000368985
            Ensembl:ENST00000448301 GeneID:1520 KEGG:hsa:1520 UCSC:uc001evn.3
            GeneCards:GC01M150702 HGNC:HGNC:2545 HPA:CAB000460 HPA:HPA002988
            MIM:116845 neXtProt:NX_P25774 PharmGKB:PA27041 InParanoid:P25774
            PhylomeDB:P25774 BRENDA:3.4.22.27 BindingDB:P25774
            ChEMBL:CHEMBL2954 ChiTaRS:CTSS EvolutionaryTrace:P25774
            GenomeRNAi:1520 NextBio:6291 PMAP-CutDB:P25774 ArrayExpress:P25774
            Bgee:P25774 CleanEx:HS_CTSS Genevestigator:P25774
            GermOnline:ENSG00000163131 Uniprot:P25774
        Length = 331

 Score = 644 (231.8 bits), Expect = 4.2e-63, P = 4.2e-63
 Identities = 140/331 (42%), Positives = 196/331 (59%)

Query:    23 CASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE-- 79
             C+S V     +H+   ++ H   W   +G+ YK++ E+A+R  I+++NL+++   N E  
Sbjct:    11 CSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHS 67

Query:    80 -GNRTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWR 138
              G  +Y LG N   D+T+EE  +  +                 T+K      +P S+DWR
Sbjct:    68 MGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNI-----TYKSNPNRILPDSVDWR 122

Query:   139 EKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD---NNGCSG 195
             EKG VT +K QG CG+CWAFSAV A+E   ++  GKL+ LS Q LVDCST+   N GC+G
Sbjct:   123 EKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNG 182

Query:   196 GLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQA 255
             G M  AF+YII+NKG+ ++A YPY+     C +   K  AAT  KY +LP G E  L +A
Sbjct:   183 GFMTTAFQYIIDNKGIDSDASYPYKAMDQKC-QYDSKYRAATCSKYTELPYGREDVLKEA 241

Query:   256 VT-KQPVSVCVEASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKN 313
             V  K PVSV V+A   +F  Y+ GV     C  N +HGV VVG+G   + +G +YWL+KN
Sbjct:   242 VANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DLNGKEYWLVKN 298

Query:   314 SWGETWGESGYIRILRDEGL-CGIATEASYP 343
             SWG  +GE GYIR+ R++G  CGIA+  SYP
Sbjct:   299 SWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329


>ZFIN|ZDB-GENE-041010-76 [details] [associations]
            symbol:ctsll "cathepsin L, like" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-041010-76
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            OrthoDB:EOG48PMKF MEROPS:C01.032 EMBL:BX119902 IPI:IPI00616622
            UniGene:Dr.79994 SMR:A2BEM8 Ensembl:ENSDART00000144226
            InParanoid:A2BEM8 OMA:PRYSAAN Uniprot:A2BEM8
        Length = 337

 Score = 644 (231.8 bits), Expect = 4.2e-63, P = 4.2e-63
 Identities = 137/334 (41%), Positives = 197/334 (58%)

Query:    23 CASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE-- 79
             C S V +  ++ +   ++ H   W   H ++Y ++ E+  R  ++++NL+ IE  N E  
Sbjct:    11 CISAVFAAPTLDQK--LDDHWHLWKRWHEKSYHEK-EEGWRRMVWEKNLKKIELHNLEHS 67

Query:    80 -GNRTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWR 138
              G  T++LG N+F D+TNEEFR +  GYN               F   +    P  IDWR
Sbjct:    68 VGKHTFRLGMNQFGDMTNEEFRQAMNGYNRDPNRKSKGSL----FIEPSFFTAPQQIDWR 123

Query:   139 EKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGG 196
             +KG VT IK+Q  CGSCWAFS+  A+EG      GKL+ LSEQ L+DCS    NNGC GG
Sbjct:   124 QKGYVTPIKDQKRCGSCWAFSSTGALEGQVFRKTGKLVSLSEQNLMDCSRPQGNNGCDGG 183

Query:   197 LMDKAFEYIIENKGLATEADYPY-QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQA 255
             LMD+AF+Y+ +N GL +E  YPY   +   C      +AA   G + D+P G EHAL++A
Sbjct:   184 LMDQAFQYVQDNNGLDSEESYPYLATDDQPCHYDPRYSAANVTG-FVDIPSGKEHALMKA 242

Query:   256 VTKQ-PVSVCVEASGQAFRFYKRGVLNAE-CG-DNCDHGVAVVGFG-TAEEEDGAKYWLI 311
             V    PV+V ++A  ++F+FY+ G+   + C  +  DHGV VVG+G    +  G +YW++
Sbjct:   243 VAAVGPVAVAIDAGHESFQFYQSGIYYEKACSTEELDHGVLVVGYGYEGVDVAGRRYWIV 302

Query:   312 KNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
             KNSW + WG+ GYI + +D +  CGIAT ASYP+
Sbjct:   303 KNSWTDRWGDKGYIYMAKDLKNHCGIATSASYPL 336


>RGD|2448 [details] [associations]
            symbol:Ctsl1 "cathepsin L1" species:10116 "Rattus norvegicus"
          [GO:0002250 "adaptive immune response" evidence=ISO] [GO:0004177
          "aminopeptidase activity" evidence=IDA] [GO:0004197 "cysteine-type
          endopeptidase activity" evidence=ISO;IDA] [GO:0005576 "extracellular
          region" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IDA]
          [GO:0005764 "lysosome" evidence=ISO;IDA] [GO:0005773 "vacuole"
          evidence=IDA] [GO:0005902 "microvillus" evidence=IDA] [GO:0006508
          "proteolysis" evidence=IEP;ISO] [GO:0007154 "cell communication"
          evidence=IDA] [GO:0007283 "spermatogenesis" evidence=IEP] [GO:0008234
          "cysteine-type peptidase activity" evidence=ISO] [GO:0008584 "male
          gonad development" evidence=IEP] [GO:0009267 "cellular response to
          starvation" evidence=IEP] [GO:0009749 "response to glucose stimulus"
          evidence=IEP] [GO:0009897 "external side of plasma membrane"
          evidence=IDA] [GO:0010259 "multicellular organismal aging"
          evidence=IEP] [GO:0014070 "response to organic cyclic compound"
          evidence=IEP] [GO:0021675 "nerve development" evidence=IEP]
          [GO:0030984 "kininogen binding" evidence=IPI] [GO:0032403 "protein
          complex binding" evidence=IPI] [GO:0034698 "response to gonadotropin
          stimulus" evidence=IEP] [GO:0042277 "peptide binding" evidence=IDA]
          [GO:0042393 "histone binding" evidence=ISO] [GO:0043005 "neuron
          projection" evidence=IDA] [GO:0043204 "perikaryon" evidence=IDA]
          [GO:0046697 "decidualization" evidence=IEP] [GO:0048102 "autophagic
          cell death" evidence=IEP] [GO:0051384 "response to glucocorticoid
          stimulus" evidence=IEP] [GO:0060008 "Sertoli cell differentiation"
          evidence=IEP] [GO:0097067 "cellular response to thyroid hormone
          stimulus" evidence=ISO] [GO:0030141 "secretory granule" evidence=IDA]
          [GO:0045177 "apical part of cell" evidence=IDA] [GO:0060441
          "epithelial tube branching involved in lung morphogenesis"
          evidence=ISO] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
          PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:Y00697 RGD:2448
          GO:GO:0005576 GO:GO:0009897 GO:GO:0034698 GO:GO:0043204 GO:GO:0009749
          GO:GO:0051384 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
          InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
          PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
          PROSITE:PS00139 PROSITE:PS00639 GO:GO:0043005 GO:GO:0007283
          GO:GO:0004177 GO:GO:0005764 GO:GO:0042277 GO:GO:0009267 GO:GO:0021675
          GO:GO:0005902 GO:GO:0010259 GO:GO:0004197 GO:GO:0048102 GO:GO:0046697
          GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 KO:K01365
          OrthoDB:EOG48PMKF MEROPS:C01.032 OMA:FDQNLDT CTD:1514
          BRENDA:3.4.22.15 GO:GO:0060008 EMBL:AF025476 EMBL:BC063175
          EMBL:S85184 IPI:IPI00326070 PIR:S07098 RefSeq:NP_037288.1
          UniGene:Rn.1294 ProteinModelPortal:P07154 SMR:P07154 IntAct:P07154
          STRING:P07154 PhosphoSite:P07154 PRIDE:P07154
          Ensembl:ENSRNOT00000025462 GeneID:25697 KEGG:rno:25697 UCSC:RGD:2448
          InParanoid:P07154 SABIO-RK:P07154 BindingDB:P07154 ChEMBL:CHEMBL2305
          NextBio:607715 Genevestigator:P07154 GermOnline:ENSRNOG00000018566
          Uniprot:P07154
        Length = 334

 Score = 643 (231.4 bits), Expect = 5.4e-63, P = 5.4e-63
 Identities = 137/311 (44%), Positives = 184/311 (59%)

Query:    44 QWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
             QW + H R Y    E+  R  ++++N+  I+  N E   G   + +  N F D+TNEEFR
Sbjct:    31 QWKSTHRRLYGTN-EEEWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTNEEFR 89

Query:   101 ASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSA 160
                 GY                F+   +  +P ++DWREKG VT +KNQG CGSCWAFSA
Sbjct:    90 QIVNGYRHQKHKKGRL------FQEPLMLQIPKTVDWREKGCVTPVKNQGQCGSCWAFSA 143

Query:   161 VAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYP 218
                +EG   +  GKLI LSEQ LVDCS D  N GC+GGLMD AF+YI EN GL +E  YP
Sbjct:   144 SGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 203

Query:   219 YQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKR 277
             Y+ + G+C  + E A A   G + D+P+  E AL++AV T  P+SV ++AS  + +FY  
Sbjct:   204 YEAKDGSCKYRAEYAVANDTG-FVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSS 261

Query:   278 GVL-NAECGD-NCDHGVAVVGFG-TAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGL- 333
             G+     C   + DHGV VVG+G    + +  KYWL+KNSWG+ WG  GYI+I +D    
Sbjct:   262 GIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNH 321

Query:   334 CGIATEASYPV 344
             CG+AT ASYP+
Sbjct:   322 CGLATAASYPI 332


>UNIPROTKB|A4IFS7 [details] [associations]
            symbol:CTSL1 "CTSL1 protein" species:9913 "Bos taurus"
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197 GO:GO:0002250
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 GO:GO:0097067
            OrthoDB:EOG48PMKF MEROPS:C01.032 CTD:1514 EMBL:DAAA02023987
            EMBL:BC134741 IPI:IPI00708619 RefSeq:NP_001077155.1
            UniGene:Bt.23199 SMR:A4IFS7 Ensembl:ENSBTAT00000000962
            GeneID:515200 KEGG:bta:515200 InParanoid:A4IFS7 OMA:NDEQALM
            NextBio:20871707 Uniprot:A4IFS7
        Length = 333

 Score = 640 (230.4 bits), Expect = 1.1e-62, P = 1.1e-62
 Identities = 141/328 (42%), Positives = 190/328 (57%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRT 83
             + S     + S+  + + W A H + Y D  E+  R  ++K+N++ IE  N+E   G  +
Sbjct:    14 IASAAPKFDHSLDTQWKLWKAAHRKPY-DLNEEGWRKAVWKKNMKMIELHNQEYSQGKHS 72

Query:    84 YKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAV 143
             + +  N F D+TNEEFR +  G+                F       +P S+DWREKG V
Sbjct:    73 FSMAMNAFGDMTNEEFRHTMNGFQRQKNKKGKE------FHETIFASIPPSVDWREKGYV 126

Query:   144 THIKNQGHCGSCWAFSAVAAVEG-ITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDK 200
             T +KNQG CGSCWAFSA  A+EG + Q TG KL+ LSEQ LVDCS    N GC GG +D 
Sbjct:   127 TPVKNQGKCGSCWAFSATGALEGQMFQKTG-KLVSLSEQNLVDCSQPEGNRGCHGGFIDN 185

Query:   201 AFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ- 259
             AF+Y+++  GL +E  YPY    GTC      +AA   G + DLPK  E AL++AV    
Sbjct:   186 AFQYVLDVGGLDSEESYPYTGLVGTCLYNPNNSAANETG-FVDLPK-QEKALMKAVANLG 243

Query:   260 PVSVCVEASGQAFRFYKRGVL-NAECG-DNCDHGVAVVGFG-TAEEEDGAKYWLIKNSWG 316
             P+SV V+A   +F+FYK G+     C  ++ DH V VVG+G    + D  KYWL+KNSWG
Sbjct:   244 PISVAVDAHNPSFQFYKSGIYYEPNCSSESVDHAVLVVGYGFEGADSDDNKYWLVKNSWG 303

Query:   317 ETWGESGYIRILRDEGL-CGIATEASYP 343
             E WG +GYI++ +D    CGIAT ASYP
Sbjct:   304 EHWGMNGYIKMAKDRNNHCGIATMASYP 331


>UNIPROTKB|P25326 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9913 "Bos taurus"
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0016020 "membrane" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0002250 "adaptive
            immune response" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0016020 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0002250 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            GO:GO:0097067 EMBL:BC102245 EMBL:M95211 EMBL:X62001 IPI:IPI00702008
            PIR:S15844 RefSeq:NP_001028787.1 UniGene:Bt.7938
            ProteinModelPortal:P25326 SMR:P25326 STRING:P25326 PRIDE:P25326
            Ensembl:ENSBTAT00000022774 GeneID:327711 KEGG:bta:327711 CTD:1520
            InParanoid:P25326 KO:K01368 OMA:KAMDQKC OrthoDB:EOG4JM7Q2
            NextBio:20810175 Uniprot:P25326
        Length = 331

 Score = 639 (230.0 bits), Expect = 1.4e-62, P = 1.4e-62
 Identities = 139/321 (43%), Positives = 189/321 (58%)

Query:    33 MHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGT 88
             +H    ++ H + W   +G+ YK++ E+  R  I+++NL+ +   N E   G  +Y+LG 
Sbjct:    18 VHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVTLHNLEHSMGMHSYELGM 77

Query:    89 NEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKN 148
             N   D+T+EE  +  +                 T+K      +P S+DWREKG VT +K 
Sbjct:    78 NHLGDMTSEEVISLMSSLRVPSQWPRNV-----TYKSDPNQKLPDSMDWREKGCVTEVKY 132

Query:   149 QGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST---DNNGCSGGLMDKAFEYI 205
             QG CGSCWAFSAV A+E   ++  GKL+ LS Q LVDCST    N GC+GG M +AF+YI
Sbjct:   133 QGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYI 192

Query:   206 IENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVT-KQPVSVC 264
             I+N G+ +EA YPY+   G C +   K  AAT  +Y +LP G E AL +AV  K PVSV 
Sbjct:   193 IDNNGIDSEASYPYKAMDGKC-QYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVG 251

Query:   265 VEASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESG 323
             ++AS  +F  YK GV  +  C  N +HGV VVG+G     DG  YWL+KNSWG  +G+ G
Sbjct:   252 IDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNL---DGKDYWLVKNSWGLHFGDQG 308

Query:   324 YIRILRDEGL-CGIATEASYP 343
             YIR+ R+ G  CGIA   SYP
Sbjct:   309 YIRMARNSGNHCGIANYPSYP 329


>DICTYBASE|DDB_G0283867 [details] [associations]
            symbol:cprC "cysteine proteinase 3" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0283867 GenomeReviews:CM000153_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:AAFI02000057
            KO:K01365 EMBL:X03930 RefSeq:XP_638859.1 ProteinModelPortal:Q23894
            SMR:Q23894 MEROPS:C01.114 EnsemblProtists:DDB0220784 GeneID:8624257
            KEGG:ddi:DDB_G0283867 OMA:NNVEHIN Uniprot:Q23894
        Length = 337

 Score = 631 (227.2 bits), Expect = 1.0e-61, P = 1.0e-61
 Identities = 129/306 (42%), Positives = 185/306 (60%)

Query:    45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYT 104
             WM  + + Y  + E   R   FK+N++Y+   N +G++T  LG N+ +DL+NEE+R +Y 
Sbjct:    37 WMRSNNKAYTHK-EFMPRYEEFKKNMDYVHNWNSKGSKTV-LGLNQHADLSNEEYRLNYL 94

Query:   105 GYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAV 164
             G                    +     P ++DWREK AVT +K+QG CGSC++FS   +V
Sbjct:    95 GTRAHIKLNGYHKRNLGLRLNRPQFKQPLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSV 154

Query:   165 EGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQE 222
             EG+T I  GKL+ LSEQ ++DCS+   N GC+GGLM  AFEYII+N GL +E  YPY+ +
Sbjct:   155 EGVTAIKTGKLVSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMK 214

Query:   223 QGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVL-N 281
                  K +E + AA I  Y+++  GDE+ L  A+   PVSV ++AS  +F+ Y  GV   
Sbjct:   215 VNDECKFQEGSVAAKITSYKEIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGVYYE 274

Query:   282 AECG-DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATE 339
               C  ++ DHGV  VG GT   ED   Y+++KNSWG +WG +GYI + R+ +  CGI+T 
Sbjct:   275 PACSSEDLDHGVLAVGMGTDNGED---YYIVKNSWGPSWGLNGYIHMARNKDNNCGISTM 331

Query:   340 ASYPVA 345
             ASYP+A
Sbjct:   332 ASYPIA 337


>WB|WBGene00000776 [details] [associations]
            symbol:cpl-1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0040010 "positive regulation
            of growth rate" evidence=IMP] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] [GO:0040011
            "locomotion" evidence=IMP] [GO:0070265 "necrotic cell death"
            evidence=IMP] [GO:0031983 "vesicle lumen" evidence=IDA] [GO:0042718
            "yolk granule" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0009792 GO:GO:0040010 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0040011
            GO:GO:0031983 GO:GO:0070265 GeneTree:ENSGT00660000095458 KO:K01365
            GO:GO:0042718 MEROPS:I29.009 EMBL:Z92812 GeneID:180111
            KEGG:cel:CELE_T03E6.7 CTD:180111 PIR:T24387 RefSeq:NP_001256718.1
            HSSP:P80067 ProteinModelPortal:O45734 SMR:O45734 DIP:DIP-26616N
            IntAct:O45734 MINT:MINT-211563 STRING:O45734 PaxDb:O45734
            EnsemblMetazoa:T03E6.7.1 EnsemblMetazoa:T03E6.7.2 UCSC:T03E6.7.1
            WormBase:T03E6.7a InParanoid:O45734 OMA:HIENHNR NextBio:908128
            Uniprot:O45734
        Length = 337

 Score = 629 (226.5 bits), Expect = 1.6e-61, P = 1.6e-61
 Identities = 135/317 (42%), Positives = 187/317 (58%)

Query:    37 SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSD 93
             S +EK + +     + Y  E E+   +  F +N+ +IE  N++   G +T+++G N  +D
Sbjct:    27 SAIEKWDDYKEDFDKEYS-ESEEQTYMEAFVKNMIHIENHNRDHRLGRKTFEMGLNHIAD 85

Query:    94 LTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
             L   ++R    GY                  + NV  VP  +DWR+   VT +KNQG CG
Sbjct:    86 LPFSQYR-KLNGYRRLFGDSRIKNSSSFLAPF-NV-QVPDEVDWRDTHLVTDVKNQGMCG 142

Query:   154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGL 211
             SCWAFSA  A+EG      G+L+ LSEQ LVDCST   N+GC+GGLMD+AFEYI +N G+
Sbjct:   143 SCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFEYIRDNHGV 202

Query:   212 ATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVSVCVEASGQ 270
              TE  YPY+     C   K+   A   G Y D P+GDE  L  AV  Q P+S+ ++A  +
Sbjct:   203 DTEESYPYKGRDMKCHFNKKTVGADDKG-YVDTPEGDEEQLKIAVATQGPISIAIDAGHR 261

Query:   271 AFRFYKRGVL-NAECG-DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRIL 328
             +F+ YK+GV  + EC  +  DHGV +VG+GT + E G  YW++KNSWG  WGE GYIRI 
Sbjct:   262 SFQLYKKGVYYDEECSSEELDHGVLLVGYGT-DPEHG-DYWIVKNSWGAGWGEKGYIRIA 319

Query:   329 RDEGL-CGIATEASYPV 344
             R+    CG+AT+ASYP+
Sbjct:   320 RNRNNHCGVATKASYPL 336


>UNIPROTKB|P25975 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 EMBL:X91755 EMBL:BC102312 EMBL:AB017648
            IPI:IPI00687440 PIR:S15845 RefSeq:NP_776457.1 UniGene:Bt.3987
            ProteinModelPortal:P25975 SMR:P25975 STRING:P25975
            Ensembl:ENSBTAT00000022710 Ensembl:ENSBTAT00000036427 GeneID:281108
            KEGG:bta:281108 CTD:1515 InParanoid:P25975 KO:K01365 OMA:EEFRATH
            OrthoDB:EOG48PMKF BindingDB:P25975 ChEMBL:CHEMBL2113
            NextBio:20805179 ArrayExpress:P25975 Uniprot:P25975
        Length = 334

 Score = 628 (226.1 bits), Expect = 2.1e-61, P = 2.1e-61
 Identities = 136/328 (41%), Positives = 193/328 (58%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRT 83
             V S     +P++     QW A H R Y    E+  R  ++++N + I+  N+E   G   
Sbjct:    14 VASAAPKLDPNLDAHWHQWKATHRRLYGMN-EEEWRRAVWEKNKKIIDLHNQEYSEGKHG 72

Query:    84 YKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAV 143
             +++  N F D+TNEEFR    G+                F    + DVP S+DW +KG V
Sbjct:    73 FRMAMNAFGDMTNEEFRQVMNGFQNQKHKKGKL------FHEPLLVDVPKSVDWTKKGYV 126

Query:   144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKA 201
             T +KNQG CGSCWAFSA  A+EG      GKL+ LSEQ LVDCS    N GC+GGLMD A
Sbjct:   127 TPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNA 186

Query:   202 FEYIIENKGLATEADYPY-QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQ 259
             F+YI +N GL +E  YPY   +  +C+ + E +AA   G + D+P+  E AL++AV T  
Sbjct:   187 FQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTG-FVDIPQR-EKALMKAVATVG 244

Query:   260 PVSVCVEASGQAFRFYKRGVL-NAECGD-NCDHGVAVVGFG-TAEEEDGAKYWLIKNSWG 316
             P+SV ++A   +F+FYK G+  + +C   + DHGV VVG+G    + +  K+W++KNSWG
Sbjct:   245 PISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWG 304

Query:   317 ETWGESGYIRILRDEGL-CGIATEASYP 343
               WG +GY+++ +D+   CGIAT ASYP
Sbjct:   305 PEWGWNGYVKMAKDQNNHCGIATAASYP 332


>DICTYBASE|DDB_G0279799 [details] [associations]
            symbol:cprB "cysteine proteinase 2" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0279799 GenomeReviews:CM000152_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            MEROPS:I29.003 KO:K01365 EMBL:AAFI02000033 EMBL:M16039 EMBL:X03344
            PIR:A25439 RefSeq:XP_641494.1 ProteinModelPortal:P04989 SMR:P04989
            EnsemblProtists:DDB0214998 GeneID:8622234 KEGG:ddi:DDB_G0279799
            OMA:YVNITAG Uniprot:P04989
        Length = 376

 Score = 519 (187.8 bits), Expect = 2.6e-61, Sum P(2) = 2.6e-61
 Identities = 112/281 (39%), Positives = 153/281 (54%)

Query:    29 SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGT 88
             +GR   E        +W  +  R Y    E + R +IFK N++Y++  N +G+    LG 
Sbjct:    23 NGRRFSESQYRTAFTEWTLKFNRQYSSS-EFSNRYSIFKSNMDYVDNWNSKGDSQTVLGL 81

Query:    89 NEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKN 148
             N F+D+TNEE+R +Y G                    +++   P SIDWR K AVT IK+
Sbjct:    82 NNFADITNEEYRKTYLG-TRVNAHSYNGYDGREVLNVEDLQTNPKSIDWRTKNAVTPIKD 140

Query:   149 QGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS--TDNNGCSGGLMDKAFEYII 206
             QG CGSCW+FS   + EG   +   KL+ LSEQ LVDCS   +N GC GGLM+ AF+YII
Sbjct:   141 QGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAFDYII 200

Query:   207 ENKGLATEADYPYQQEQG-TCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCV 265
             +NKG+ TE+ YPY  E G TC   K    A TI  Y ++  G E +L       PVSV +
Sbjct:   201 KNKGIDTESSYPYTAETGSTCLFNKSDIGA-TIKGYVNITAGSEISLENGAQHGPVSVAI 259

Query:   266 EASGQAFRFYKRGVL-NAECGDN-CDHGVAVVGFGTAEEED 304
             +AS  +F+ Y  G+    +C     DHGV VVG+G   ++D
Sbjct:   260 DASHNSFQLYTSGIYYEPKCSPTELDHGVLVVGYGVQGKDD 300

 Score = 126 (49.4 bits), Expect = 2.6e-61, Sum P(2) = 2.6e-61
 Identities = 21/39 (53%), Positives = 30/39 (76%)

Query:   308 YWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPVA 345
             YW++KNSWG +WG  GYI + +D +  CGIA+ +SYP+A
Sbjct:   338 YWIVKNSWGTSWGIKGYILMSKDRKNNCGIASVSSYPLA 376


>UNIPROTKB|F1PMM9 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9615 "Canis lupus
            familiaris" [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00639 GO:GO:0004197 GO:GO:0002250
            GeneTree:ENSGT00560000076577 GO:GO:0097067 EMBL:AAEX03000499
            Ensembl:ENSCAFT00000002029 OMA:EFKQVLN Uniprot:F1PMM9
        Length = 341

 Score = 625 (225.1 bits), Expect = 4.3e-61, P = 4.3e-61
 Identities = 134/328 (40%), Positives = 188/328 (57%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRT 83
             + S     + S+     QW   HG+ Y D+ E+  R T++++N+E IE+ N+E   G  +
Sbjct:    22 IASAAPQQDHSLDAHWSQWKEAHGKLY-DKDEEGWRRTVWERNMEMIEQHNQEYSQGEHS 80

Query:    84 YKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAV 143
             + L  N F D+TNEEF+     +                F      +VP+S+DWRE+G V
Sbjct:    81 FTLAMNAFGDMTNEEFKQVLNDFKIQKHKKGK------VFPAPLFAEVPSSVDWREQGYV 134

Query:   144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKA 201
             T +K+QG C  CWAFSA  A+EG      GKL+ LSEQ LVDCS    N GC+GGLM+ A
Sbjct:   135 TPVKDQGQCLGCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWSQGNRGCNGGLMEYA 194

Query:   202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQP 260
             F+Y+ +N GL +E  YPY      C  + EK+AA     +  L   +E  L+  V T  P
Sbjct:   195 FQYVKDNGGLDSEESYPYLARNEPCKYRPEKSAANVTAFWPIL--NEEDGLMTTVATVGP 252

Query:   261 VSVCVEASGQAFRFYKRGVL-NAECGDNC-DHGVAVVGFG-TAEEEDGAKYWLIKNSWGE 317
             VS  V++S Q+F+FYK+G+  + +C +   +HGV VVG+G    E D  KYW++KNSWG 
Sbjct:   253 VSAAVDSSPQSFQFYKKGIYYDPKCSNKLLNHGVLVVGYGFEGAESDNKKYWIVKNSWGT 312

Query:   318 TWGESGYIRILRD-EGLCGIATEASYPV 344
              WG  GY+ + +D +  CGIAT ASYPV
Sbjct:   313 NWGMQGYMLLAKDRDNHCGIATRASYPV 340


>DICTYBASE|DDB_G0279185 [details] [associations]
            symbol:cprF "cysteine proteinase 6" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0279185 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000030 HSSP:P07711 ProtClustDB:CLSZ2846820 EMBL:U72745
            RefSeq:XP_641725.1 ProteinModelPortal:Q94503 SMR:Q94503
            MEROPS:C01.081 PRIDE:Q94503 EnsemblProtists:DDB0215002
            GeneID:8621921 KEGG:ddi:DDB_G0279185 Uniprot:Q94503
        Length = 434

 Score = 515 (186.3 bits), Expect = 8.8e-61, Sum P(2) = 8.8e-61
 Identities = 118/285 (41%), Positives = 149/285 (52%)

Query:    45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYT 104
             WM  H R Y  E E   R  IFK N++YI + N +G+ T  LG N F+D+TNEE+RA+Y 
Sbjct:    33 WMIAHQRHYSSE-EFNGRFNIFKANMDYINEWNTKGSETV-LGLNVFADITNEEYRATYL 90

Query:   105 GYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAV 164
             G                    Q       S+DWR KGAVT IKNQG CG CW+FSA  A 
Sbjct:    91 GTPFDASSLEMTPSEKVFGGVQ-----ANSVDWRAKGAVTPIKNQGECGGCWSFSATGAT 145

Query:   165 EGITQITGGK--LIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQ 220
             EG   I  G   L  +SEQQL+DCS    NNGC GGLM  AFEYII N G+ TE+ YP+ 
Sbjct:   146 EGAQYIANGDSDLTSVSEQQLIDCSGSYGNNGCEGGLMTLAFEYIINNGGIDTESSYPFT 205

Query:   221 QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVL 280
                  C K       A +  Y ++  G E  L   VT+ P SV ++AS  +F+FY  G+ 
Sbjct:   206 ANTEKC-KYNPSNIGAELSSYVNVTSGSESDLAAKVTQGPTSVAIDASQPSFQFYSSGIY 264

Query:   281 NAE-CGDN-CDHGVAVVGFGTAE-----EEDGAKYWLIKNSWGET 318
             N   C     DHGV  VGFG+       +  G++     N+W E+
Sbjct:   265 NEPACSSTQLDHGVLAVGFGSGSSGSQSQSAGSQSQSSNNNWSES 309

 Score = 125 (49.1 bits), Expect = 8.8e-61, Sum P(2) = 8.8e-61
 Identities = 24/44 (54%), Positives = 31/44 (70%)

Query:   304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPVAM 346
             DG  YW++KNSWG  WG +GYI + +D +  CGIAT AS P A+
Sbjct:   386 DG-NYWIVKNSWGLDWGINGYILMSKDKDNQCGIATMASIPQAI 428


>UNIPROTKB|Q5E998 [details] [associations]
            symbol:CTSL2 "Cathepsin L2" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            HOVERGEN:HBG011513 UniGene:Bt.3987 MEROPS:C01.032 EMBL:BT021022
            IPI:IPI00711962 ProteinModelPortal:Q5E998 SMR:Q5E998 STRING:Q5E998
            InParanoid:Q5E998 Uniprot:Q5E998
        Length = 334

 Score = 619 (223.0 bits), Expect = 1.9e-60, P = 1.9e-60
 Identities = 135/328 (41%), Positives = 192/328 (58%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRT 83
             V S     +P++     QW A H R Y    E+  R  ++++N + I+  N+E   G   
Sbjct:    14 VASAAPKLDPNLDAHWHQWKATHRRLYGMN-EEEWRRAVWEKNKKIIDLHNQEYSEGKHG 72

Query:    84 YKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAV 143
             +++  N F D+TNEEFR    G+                F    + DVP S+DW +KG V
Sbjct:    73 FRMAMNAFGDMTNEEFRQVMNGFQNQKHKKGKL------FHEPLLVDVPKSVDWTKKGYV 126

Query:   144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKA 201
             T +KNQG CGSCWAFSA  A+EG      GKL+ LSEQ LVDCS    N GC+GGLMD A
Sbjct:   127 TPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNA 186

Query:   202 FEYIIENKGLATEADYPY-QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQ 259
             F+YI +N  L +E  YPY   +  +C+ + E +AA   G + D+P+  E AL++AV T  
Sbjct:   187 FQYIKDNGCLDSEESYPYLATDTNSCNYKPECSAANDTG-FVDIPQR-EKALMKAVATVG 244

Query:   260 PVSVCVEASGQAFRFYKRGVL-NAECGD-NCDHGVAVVGFG-TAEEEDGAKYWLIKNSWG 316
             P+SV ++A   +F+FYK G+  + +C   + DHGV VVG+G    + +  K+W++KNSWG
Sbjct:   245 PISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWG 304

Query:   317 ETWGESGYIRILRDEGL-CGIATEASYP 343
               WG +GY+++ +D+   CGIAT ASYP
Sbjct:   305 PEWGWNGYVKMAKDQNNHCGIATAASYP 332


>ZFIN|ZDB-GENE-040718-61
            symbol:ctsl.1 "cathepsin L.1" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-040718-61
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            GeneTree:ENSGT00660000095458 MEROPS:C01.092 EMBL:FP015965
            EMBL:BC075887 IPI:IPI00513499 RefSeq:NP_001002368.1
            UniGene:Dr.85174 SMR:Q6DHT0 Ensembl:ENSDART00000017756
            GeneID:436641 KEGG:dre:436641 CTD:436641 InParanoid:Q6DHT0
            OMA:GGQMENA OrthoDB:EOG41ZFB9 NextBio:20831086 Uniprot:Q6DHT0
        Length = 334

 Score = 618 (222.6 bits), Expect = 2.4e-60, P = 2.4e-60
 Identities = 132/315 (41%), Positives = 183/315 (58%)

Query:    39 VEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK---EGNRTYKLGTNEFSDLT 95
             +E H  W  + G++Y+   E++ R   +  N + +   N    +G ++Y+LG   F+D++
Sbjct:    24 MEFHA-WKLKFGKSYRSAEEESHRQLTWLTNRKLVLVHNMMADQGLKSYRLGMTYFADMS 82

Query:    96 NEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSC 155
             NEE+R                      F+ +    VP ++DWR+KG VT IK+Q  CGSC
Sbjct:    83 NEEYRQLVFRGCLGSMNNTKARGGSTFFRLRKAAVVPDTVDWRDKGYVTDIKDQKQCGSC 142

Query:   156 WAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLAT 213
             WAFSA  ++EG T    GKL+ LSEQQLVDCS    N GC GGLMD+AF+YI  NKGL T
Sbjct:   143 WAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGSYGNYGCDGGLMDQAFQYIEANKGLDT 202

Query:   214 EADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAF 272
             E  YPY+ + G C        A+  G Y D+  GDE AL +AV T  P+SV ++A   +F
Sbjct:   203 EDSYPYEAQDGECRFNPSTVGASCTG-YVDIASGDESALQEAVATIGPISVAIDAGHSSF 261

Query:   273 RFYKRGVLNA-ECGDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
             + Y  GV N  +C  +  DHGV  VG+G++  +D   YW++KNSWG  WG  GYI + R+
Sbjct:   262 QLYSSGVYNEPDCSSSELDHGVLAVGYGSSNGDD---YWIVKNSWGLDWGVQGYILMSRN 318

Query:   331 EG-LCGIATEASYPV 344
             +   CGIAT ASYP+
Sbjct:   319 KSNQCGIATAASYPL 333


>UNIPROTKB|F1NZ37
            symbol:LOC420160 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 EMBL:AADN02062018
            IPI:IPI00587784 Ensembl:ENSGALT00000006765 OMA:CGVANQA
            Uniprot:F1NZ37
        Length = 340

 Score = 617 (222.3 bits), Expect = 3.1e-60, P = 3.1e-60
 Identities = 130/319 (40%), Positives = 179/319 (56%)

Query:    35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
             +P + E  E+W + + + Y  E E  +R  +++ NL  IE+ N E   G  T++LG N +
Sbjct:    27 DPVLEEAWERWKSLYAKEYPGEAE-LIRREVWENNLRRIEQHNWEESQGQHTFRLGMNHY 85

Query:    92 SDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
              DL +EEF     G+               TF+       P  +DWR +G VT +KNQGH
Sbjct:    86 GDLMDEEFNQLLNGF-----APVQHEEPALTFQASAAQKTPAEVDWRMRGYVTPVKNQGH 140

Query:   152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS--TDNNGCSGGLMDKAFEYIIENK 209
             CGSCWAFSA  A+EG+     GKL  LSEQ L+DCS    NNGC GG M +AF+Y+ +N 
Sbjct:   141 CGSCWAFSATGALEGLVFNWTGKLAVLSEQNLIDCSWKLGNNGCQGGYMTRAFQYVHDNG 200

Query:   210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEAS 268
             G+ +E  YPYQ    +  +      AA       + +G E AL QAV T  PVSV V+AS
Sbjct:   201 GMNSEHIYPYQATDTSSCRYNPADRAANCSTVWLVAQGSEAALEQAVATVGPVSVAVDAS 260

Query:   269 GQAFRFYKRGVLNAE-CGDNCDHGVAVVGFGTAEE-EDGAKYWLIKNSWGETWGESGYIR 326
                F FYK G+ N+  C    +HG+  VG+G ++E      YW++KNSW E WGE GYIR
Sbjct:   261 SFFFHFYKSGIFNSMFCSQKVNHGMLAVGYGISQEARKNVSYWILKNSWSEVWGEKGYIR 320

Query:   327 ILRD-EGLCGIATEASYPV 344
             +L+     CG+A +AS+P+
Sbjct:   321 LLKGVNNHCGVANQASFPL 339


>UNIPROTKB|O60911
            symbol:CTSL2 "Cathepsin L2" species:9606 "Homo sapiens"
            [GO:0004177 "aminopeptidase activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA] [GO:0005902
            "microvillus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0007283 "spermatogenesis" evidence=IEA] [GO:0009267 "cellular
            response to starvation" evidence=IEA] [GO:0009749 "response to
            glucose stimulus" evidence=IEA] [GO:0009897 "external side of
            plasma membrane" evidence=IEA] [GO:0010259 "multicellular
            organismal aging" evidence=IEA] [GO:0021675 "nerve development"
            evidence=IEA] [GO:0030984 "kininogen binding" evidence=IEA]
            [GO:0032403 "protein complex binding" evidence=IEA] [GO:0034698
            "response to gonadotropin stimulus" evidence=IEA] [GO:0042277
            "peptide binding" evidence=IEA] [GO:0043005 "neuron projection"
            evidence=IEA] [GO:0043204 "perikaryon" evidence=IEA] [GO:0046697
            "decidualization" evidence=IEA] [GO:0048102 "autophagic cell death"
            evidence=IEA] [GO:0051384 "response to glucocorticoid stimulus"
            evidence=IEA] [GO:0060008 "Sertoli cell differentiation"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=TAS] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_118779 Reactome:REACT_6900
            GO:GO:0009897 GO:GO:0019886 GO:GO:0034698 GO:GO:0043204
            GO:GO:0009749 GO:GO:0030141 GO:GO:0051384 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0045177 GO:GO:0043005 GO:GO:0007283
            GO:GO:0004177 GO:GO:0042277 GO:GO:0009267 GO:GO:0021675
            GO:GO:0043202 GO:GO:0005902 GO:GO:0010259 GO:GO:0004197
            GO:GO:0048102 GO:GO:0046697 HOVERGEN:HBG011513 CTD:1515
            OrthoDB:EOG48PMKF OMA:FDQNLDT GO:GO:0060008 EMBL:Y14734
            EMBL:AB001928 EMBL:AF070448 EMBL:AB019534 EMBL:AY358641
            EMBL:AL445670 EMBL:BC023504 EMBL:BC110512 IPI:IPI00000013
            RefSeq:NP_001188504.1 RefSeq:NP_001324.2 UniGene:Hs.610096 PDB:1FH0
            PDB:3H6S PDB:3KFQ PDBsum:1FH0 PDBsum:3H6S PDBsum:3KFQ
            ProteinModelPortal:O60911 SMR:O60911 IntAct:O60911 STRING:O60911
            MEROPS:I29.010 PhosphoSite:O60911 PaxDb:O60911 PeptideAtlas:O60911
            PRIDE:O60911 Ensembl:ENST00000259470 Ensembl:ENST00000538255
            GeneID:1515 KEGG:hsa:1515 UCSC:uc004awt.3 GeneCards:GC09M099794
            HGNC:HGNC:2538 HPA:CAB017112 MIM:603308 neXtProt:NX_O60911
            PharmGKB:PA27036 InParanoid:O60911 KO:K01375 PhylomeDB:O60911
            BRENDA:3.4.22.43 SABIO-RK:O60911 BindingDB:O60911 ChEMBL:CHEMBL3272
            ChiTaRS:CTSL2 EvolutionaryTrace:O60911 GenomeRNAi:1515 NextBio:6277
            Bgee:O60911 CleanEx:HS_CTSL2 Genevestigator:O60911
            GermOnline:ENSG00000136943 Uniprot:O60911
        Length = 334

 Score = 615 (221.5 bits), Expect = 5.0e-60, P = 5.0e-60
 Identities = 134/327 (40%), Positives = 183/327 (55%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRT 83
             + S     + ++  K  QW A H R Y    E+  R  ++++N++ IE  N E   G   
Sbjct:    14 IASAVPKFDQNLDTKWYQWKATHRRLYGAN-EEGWRRAVWEKNMKMIELHNGEYSQGKHG 72

Query:    84 YKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAV 143
             + +  N F D+TNEEFR     +                F+     D+P S+DWR+KG V
Sbjct:    73 FTMAMNAFGDMTNEEFRQMMGCFRNQKFRKGK------VFREPLFLDLPKSVDWRKKGYV 126

Query:   144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKA 201
             T +KNQ  CGSCWAFSA  A+EG      GKL+ LSEQ LVDCS    N GC+GG M +A
Sbjct:   127 TPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARA 186

Query:   202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQP 260
             F+Y+ EN GL +E  YPY      C  + E + A   G +  +  G E AL++AV T  P
Sbjct:   187 FQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTG-FTVVAPGKEKALMKAVATVGP 245

Query:   261 VSVCVEASGQAFRFYKRGV-LNAECGD-NCDHGVAVVGFG-TAEEEDGAKYWLIKNSWGE 317
             +SV ++A   +F+FYK G+    +C   N DHGV VVG+G      + +KYWL+KNSWG 
Sbjct:   246 ISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGP 305

Query:   318 TWGESGYIRILRDEGL-CGIATEASYP 343
              WG +GY++I +D+   CGIAT ASYP
Sbjct:   306 EWGSNGYVKIAKDKNNHCGIATAASYP 332


>RGD|61810
            symbol:Ctsk "cathepsin K" species:10116 "Rattus norvegicus"
            [GO:0001957 "intramembranous ossification" evidence=IEP] [GO:0005615
            "extracellular space" evidence=IDA] [GO:0005737 "cytoplasm"
            evidence=IDA] [GO:0005764 "lysosome" evidence=IDA] [GO:0006508
            "proteolysis" evidence=TAS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=TAS] [GO:0045453 "bone resorption" evidence=IMP]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            RGD:61810 GO:GO:0005615 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0045453 GO:GO:0001957 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:AF010306 EMBL:BC078793
            IPI:IPI00206378 RefSeq:NP_113748.1 UniGene:Rn.5598
            ProteinModelPortal:O35186 SMR:O35186 STRING:O35186
            PhosphoSite:O35186 PRIDE:O35186 Ensembl:ENSRNOT00000028730
            GeneID:29175 KEGG:rno:29175 UCSC:RGD:61810 InParanoid:O35186
            OMA:YKEIPEG BindingDB:O35186 ChEMBL:CHEMBL3034 NextBio:608248
            Genevestigator:O35186 GermOnline:ENSRNOG00000021155 Uniprot:O35186
        Length = 329

 Score = 611 (220.1 bits), Expect = 1.3e-59, P = 1.3e-59
 Identities = 135/324 (41%), Positives = 188/324 (58%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRT 83
             VVS     E ++  + E W   HG+ Y  ++++  R  I+++NL+ I   N E   G  T
Sbjct:    11 VVSFALSPEETLDTQWELWKKTHGKQYNSKVDEISRRLIWEKNLKKISVHNLEASLGAHT 70

Query:    84 YKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAV 143
             Y+L  N   D+T+EE     TG                T +++    VP SID+R+KG V
Sbjct:    71 YELAMNHLGDMTSEEVVQKMTGLRVPPSRSFSNDTLY-TPEWEG--RVPDSIDYRKKGYV 127

Query:   144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFE 203
             T +KNQG CGSCWAFS+  A+EG  +   GKL+ LS Q LVDC ++N GC GG M  AF+
Sbjct:   128 TPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVSENYGCGGGYMTTAFQ 187

Query:   204 YIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVS 262
             Y+ +N G+ +E  YPY  +  +C      A AA    Y ++P G+E AL +AV +  PVS
Sbjct:   188 YVQQNGGIDSEDAYPYVGQDESC-MYNATAKAAKCRGYREIPVGNEKALKRAVARVGPVS 246

Query:   263 VCVEASGQAFRFYKRGVLNAE-CG-DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWG 320
             V ++AS  +F+FY RGV   E C  DN +H V VVG+GT   + G KYW+IKNSWGE+WG
Sbjct:   247 VSIDASLTSFQFYSRGVYYDENCDRDNVNHAVLVVGYGT---QKGNKYWIIKNSWGESWG 303

Query:   321 ESGYIRILRDEG-LCGIATEASYP 343
               GY+ + R++   CGI   AS+P
Sbjct:   304 NKGYVLLARNKNNACGITNLASFP 327


>ZFIN|ZDB-GENE-001205-4
            symbol:ctsk "cathepsin K" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-001205-4 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            CTD:1513 KO:K01371 OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55
            EMBL:BC092901 IPI:IPI00512751 RefSeq:NP_001017778.1
            UniGene:Dr.76224 ProteinModelPortal:Q568D6 SMR:Q568D6 GeneID:550475
            KEGG:dre:550475 InParanoid:Q568D6 NextBio:20879718
            ArrayExpress:Q568D6 Uniprot:Q568D6
        Length = 333

 Score = 611 (220.1 bits), Expect = 1.3e-59, P = 1.3e-59
 Identities = 134/321 (41%), Positives = 182/321 (56%)

Query:    32 SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGT 88
             S+   S+ E  E W   H R Y    E+++R TI+++N+ +IE  NKE   G  TY LG 
Sbjct:    20 SLDNLSLDEAWESWKITHKREYNGLNEESIRRTIWEKNMLFIEAHNKEYELGIHTYDLGM 79

Query:    89 NEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQN-VTDVPTSIDWREKGAVTHIK 147
             N F D+T EE      G                TF   + V  +P SID+R+ G VT +K
Sbjct:    80 NHFGDMTLEEVAEKVMGLQMPMYRDPAN-----TFVPDDRVGKLPKSIDYRKLGYVTSVK 134

Query:   148 NQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIE 207
             NQG CGSCWAFS+V A+EG    T G+L++LS Q LVDC T+N+GC GG M  AF Y+  
Sbjct:   135 NQGSCGSCWAFSSVGALEGQLMKTKGQLVDLSPQNLVDCVTENDGCGGGYMTNAFRYVSN 194

Query:   208 NKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVSVCVE 266
             N+G+ +E  YPY      C       AA+  G Y+++P+G+E AL  AV    PVSV ++
Sbjct:   195 NQGIDSEESYPYVGTDQQCAYNTSGVAASCRG-YKEIPQGNERALTAAVANVGPVSVGID 253

Query:   267 ASGQAFRFYKRGVL-NAECG-DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGY 324
             A    F +YK GV  +  C  ++ +H V  VG+G      G KYW++KNSWGE WG+ GY
Sbjct:   254 AMQSTFLYYKSGVYYDPNCNKEDVNHAVLAVGYGATPR--GKKYWIVKNSWGEEWGKKGY 311

Query:   325 IRILRDEG-LCGIATEASYPV 344
             + + R+    CGIA  AS+PV
Sbjct:   312 VLMARNRNNACGIANLASFPV 332


>ZFIN|ZDB-GENE-050626-55
            symbol:ctssb.2 "cathepsin S, b.2" species:7955
            "Danio rerio" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-050626-55
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            KO:K01368 EMBL:BC093339 IPI:IPI00507098 RefSeq:NP_001017661.1
            UniGene:Dr.132688 ProteinModelPortal:Q566T8 SMR:Q566T8
            GeneID:337572 KEGG:dre:337572 CTD:337572 InParanoid:Q566T8
            NextBio:20812306 ArrayExpress:Q566T8 Uniprot:Q566T8
        Length = 330

 Score = 610 (219.8 bits), Expect = 1.7e-59, P = 1.7e-59
 Identities = 130/315 (41%), Positives = 186/315 (59%)

Query:    39 VEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDL 94
             +++H E W  +H + Y  E E+  R  ++++NLE I   N E   G  +Y L  N  +D+
Sbjct:    23 LDQHWELWKKKHVKLYSCEDEEVGRRELWERNLELIAIHNLEASMGMHSYDLAINHMADM 82

Query:    95 TNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGS 154
             T EE   +                    +   +   VP ++DWR+KG VT +KNQG CGS
Sbjct:    83 TTEEILQTLA----VTRVPPGFKRPTAEYVSSSFAVVPDTLDWRDKGYVTSVKNQGACGS 138

Query:   155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLA 212
             CWAFS+V A+EG    T GKL++LS Q LVDCS+   N GC+GG M +AF+Y+I+N G+ 
Sbjct:   139 CWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGYMSQAFQYVIDNGGID 198

Query:   213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQA 271
             +E+ YPYQ  QG+C +      AA    Y+ + +GDE AL +A+    PVSV ++A+   
Sbjct:   199 SESSYPYQGTQGSC-RYDPSQRAANCTSYKFVSQGDEQALKEALANIGPVSVAIDATRPQ 257

Query:   272 FRFYKRGVLN-AECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
             F FY+ GV +   C    +HGV  VG+GT   +D   YWL+KNSWG  +G+ GYIRI R+
Sbjct:   258 FIFYRSGVYDDPSCTQKVNHGVLAVGYGTLSGQD---YWLVKNSWGAGFGDGGYIRIARN 314

Query:   331 EG-LCGIATEASYPV 344
             +  +CGIA+EA YP+
Sbjct:   315 KNNMCGIASEACYPI 329


>UNIPROTKB|P43235
            symbol:CTSK "Cathepsin K" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0001957
            "intramembranous ossification" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0045453 "bone resorption"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=TAS] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=TAS] [GO:0036021 "endolysosome lumen" evidence=TAS]
            [GO:0045087 "innate immune response" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            Reactome:REACT_118779 Reactome:REACT_6900 GO:GO:0005615
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087 GO:GO:0045453
            EMBL:CH471121 EMBL:AL355860 GO:GO:0004197 GO:GO:0001957
            HOVERGEN:HBG011513 GO:GO:0036021 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:U13665 EMBL:X82153
            EMBL:U20280 EMBL:S79895 EMBL:CR541675 EMBL:AL356292 EMBL:BC016058
            IPI:IPI00300599 PIR:JC2476 RefSeq:NP_000387.1 UniGene:Hs.632466
            PDB:1ATK PDB:1AU0 PDB:1AU2 PDB:1AU3 PDB:1AU4 PDB:1AYU PDB:1AYV
            PDB:1AYW PDB:1BGO PDB:1BY8 PDB:1MEM PDB:1NL6 PDB:1NLJ PDB:1Q6K
            PDB:1SNK PDB:1TU6 PDB:1U9V PDB:1U9W PDB:1U9X PDB:1VSN PDB:1YK7
            PDB:1YK8 PDB:1YT7 PDB:2ATO PDB:2AUX PDB:2AUZ PDB:2BDL PDB:2R6N
            PDB:3C9E PDB:3H7D PDB:3KW9 PDB:3KWB PDB:3KWZ PDB:3KX1 PDB:3O0U
            PDB:3O1G PDB:3OVZ PDB:4DMX PDB:4DMY PDB:7PCK PDBsum:1ATK
            PDBsum:1AU0 PDBsum:1AU2 PDBsum:1AU3 PDBsum:1AU4 PDBsum:1AYU
            PDBsum:1AYV PDBsum:1AYW PDBsum:1BGO PDBsum:1BY8 PDBsum:1MEM
            PDBsum:1NL6 PDBsum:1NLJ PDBsum:1Q6K PDBsum:1SNK PDBsum:1TU6
            PDBsum:1U9V PDBsum:1U9W PDBsum:1U9X PDBsum:1VSN PDBsum:1YK7
            PDBsum:1YK8 PDBsum:1YT7 PDBsum:2ATO PDBsum:2AUX PDBsum:2AUZ
            PDBsum:2BDL PDBsum:2R6N PDBsum:3C9E PDBsum:3H7D PDBsum:3KW9
            PDBsum:3KWB PDBsum:3KWZ PDBsum:3KX1 PDBsum:3O0U PDBsum:3O1G
            PDBsum:3OVZ PDBsum:4DMX PDBsum:4DMY PDBsum:7PCK
            ProteinModelPortal:P43235 SMR:P43235 DIP:DIP-39993N IntAct:P43235
            STRING:P43235 PhosphoSite:P43235 DMDM:1168793 PaxDb:P43235
            PRIDE:P43235 DNASU:1513 Ensembl:ENST00000271651 GeneID:1513
            KEGG:hsa:1513 UCSC:uc001evp.2 GeneCards:GC01M150768 HGNC:HGNC:2536
            MIM:265800 MIM:601105 neXtProt:NX_P43235 Orphanet:763
            PharmGKB:PA27034 InParanoid:P43235 OMA:LKVPPSH PhylomeDB:P43235
            BindingDB:P43235 ChEMBL:CHEMBL268 EvolutionaryTrace:P43235
            GenomeRNAi:1513 NextBio:6267 ArrayExpress:P43235 Bgee:P43235
            CleanEx:HS_CTSK CleanEx:HS_CTSO Genevestigator:P43235
            GermOnline:ENSG00000143387 Uniprot:P43235
        Length = 329

 Score = 609 (219.4 bits), Expect = 2.2e-59, P = 2.2e-59
 Identities = 129/320 (40%), Positives = 190/320 (59%)

Query:    32 SMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLG 87
             +++   I++ H E W   H + Y +++++  R  I+++NL+YI   N E   G  TY+L 
Sbjct:    15 ALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELA 74

Query:    88 TNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIK 147
              N   D+T+EE     TG                  +++     P S+D+R+KG VT +K
Sbjct:    75 MNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIP-EWEG--RAPDSVDYRKKGYVTPVK 131

Query:   148 NQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIE 207
             NQG CGSCWAFS+V A+EG  +   GKL+ LS Q LVDC ++N+GC GG M  AF+Y+ +
Sbjct:   132 NQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQK 191

Query:   208 NKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVSVCVE 266
             N+G+ +E  YPY  ++ +C       AA   G Y ++P+G+E AL +AV +  PVSV ++
Sbjct:   192 NRGIDSEDAYPYVGQEESCMYNPTGKAAKCRG-YREIPEGNEKALKRAVARVGPVSVAID 250

Query:   267 ASGQAFRFYKRGVLNAE-CG-DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGY 324
             AS  +F+FY +GV   E C  DN +H V  VG+G    + G K+W+IKNSWGE WG  GY
Sbjct:   251 ASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGI---QKGNKHWIIKNSWGENWGNKGY 307

Query:   325 IRILRDEG-LCGIATEASYP 343
             I + R++   CGIA  AS+P
Sbjct:   308 ILMARNKNNACGIANLASFP 327


>RGD|621513
            symbol:Ctss "cathepsin S" species:10116 "Rattus norvegicus"
            [GO:0001656 "metanephros development" evidence=IEP] [GO:0002250
            "adaptive immune response" evidence=ISO] [GO:0005764 "lysosome"
            evidence=IEA;ISO] [GO:0006508 "proteolysis" evidence=IEA;ISO]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0009986 "cell
            surface" evidence=IDA] [GO:0016020 "membrane" evidence=ISO]
            [GO:0043231 "intracellular membrane-bounded organelle"
            evidence=ISO] [GO:0045453 "bone resorption" evidence=IMP]
            [GO:0051930 "regulation of sensory perception of pain"
            evidence=IMP] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            RGD:621513 GO:GO:0009986 GO:GO:0051930 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0045453
            GO:GO:0001656 HOVERGEN:HBG011513 CTD:1520 KO:K01368 MEROPS:I29.004
            BRENDA:3.4.22.27 EMBL:L03201 IPI:IPI00210228 PIR:A45087
            RefSeq:NP_059016.1 UniGene:Rn.11347 ProteinModelPortal:Q02765
            PhosphoSite:Q02765 PRIDE:Q02765 GeneID:50654 KEGG:rno:50654
            UCSC:RGD:621513 ChEMBL:CHEMBL1075217 NextBio:610462
            Genevestigator:Q02765 Uniprot:Q02765
        Length = 330

 Score = 603 (217.3 bits), Expect = 9.3e-59, P = 9.3e-59
 Identities = 138/326 (42%), Positives = 188/326 (57%)

Query:    29 SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYK 85
             +G +   P++    + W     R   D+ E+ +R  I+++NL++I   N E   G  +Y 
Sbjct:    13 NGATAERPTLDHHWDLWKKTRMRRNTDQNEEDVRRLIWEKNLKFIMLHNLEHSMGMHSYS 72

Query:    86 LGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTH 145
             +G N   D+T EE    Y G                T K  +   +P S+DWREKG VT+
Sbjct:    73 VGMNHMGDMTPEEV-IGYMG----SLRIPRPWNRSGTLKSSSNQTLPDSVDWREKGCVTN 127

Query:   146 IKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD----NNGCSGGLMDKA 201
             +K QG CGSCWAFSA  A+EG  ++  GKL+ LS Q LVDCST+    N GC GG M +A
Sbjct:   128 VKYQGSCGSCWAFSAEGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEA 187

Query:   202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQP 260
             F+YII+   + +EA YPY+     C     K  AAT  +Y +LP GDE AL +AV TK P
Sbjct:   188 FQYIIDTS-IDSEASYPYKAMDEKC-LYDPKNRAATCSRYIELPFGDEEALKEAVATKGP 245

Query:   261 VSVCVE-ASGQAFRFYKRGVLN-AECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGET 318
             VSV ++ AS  +F  Y+ GV +   C +N +HGV VVG+GT    DG  YWL+KNSWG  
Sbjct:   246 VSVGIDDASHSSFFLYQSGVYDDPSCTENMNHGVLVVGYGTL---DGKDYWLVKNSWGLH 302

Query:   319 WGESGYIRILRD-EGLCGIATEASYP 343
             +G+ GYIR+ R+ +  CGIA+  SYP
Sbjct:   303 FGDQGYIRMARNNKNHCGIASYCSYP 328


>ZFIN|ZDB-GENE-030131-3539
            symbol:ctsh "cathepsin H" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-030131-3539
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 KO:K01366 HOVERGEN:HBG011513
            CTD:1512 OrthoDB:EOG4W9J43 MEROPS:I29.003 HSSP:P43235 EMBL:BC067615
            IPI:IPI00506892 RefSeq:NP_997853.1 UniGene:Dr.14176
            ProteinModelPortal:Q6NWF2 SMR:Q6NWF2 PRIDE:Q6NWF2 GeneID:324818
            KEGG:dre:324818 InParanoid:Q6NWF2 NextBio:20808976 Bgee:Q6NWF2
            Uniprot:Q6NWF2
        Length = 330

 Score = 602 (217.0 bits), Expect = 1.2e-58, P = 1.2e-58
 Identities = 130/316 (41%), Positives = 182/316 (57%)

Query:    40 EKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEE 98
             E H + WM+Q+ + Y+   E   RL IF +N + I++ N EGN  + +G N+FSD+T  E
Sbjct:    27 EYHFKSWMSQYNKKYEIN-EFYQRLQIFLENKKRIDQHN-EGNHKFSMGLNQFSDMTFAE 84

Query:    99 FRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGA-VTHIKNQGHCGSCWA 157
             F+ +Y                     Y      P +IDWR KG  +T +KNQG CGSCW 
Sbjct:    85 FKKTYLLTEPQNCSATRGNHVSSNGLY------PDAIDWRTKGHYITDVKNQGPCGSCWT 138

Query:   158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEA 215
             FS    +E +T I  GKL++L+EQQL+DC+ D  N+GC+GGL   AFEYI+ NKGL TE 
Sbjct:   139 FSTTGCLESVTAIATGKLLQLAEQQLIDCAGDFDNHGCNGGLPSHAFEYIMYNKGLMTED 198

Query:   216 DYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAFRF 274
             DYPYQ + G C + K + AAA + +  ++ K DE  ++ AV +  PVS   E +   F  
Sbjct:   199 DYPYQAKGGQC-RFKPQLAAAFVKEVVNITKYDEMGMVDAVARLNPVSFAYEVTSD-FMH 256

Query:   275 YKRGVLNA-ECGDNCD---HGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
             YK G+  + EC +  D   H V  VG+    EE+G  YW++KNSWG  WG  GY  I R 
Sbjct:   257 YKDGIYTSTECHNTTDMVNHAVLAVGYA---EENGTPYWIVKNSWGTNWGIKGYFYIERG 313

Query:   331 EGLCGIATEASYPVAM 346
             + +CG+A  +SYP+ +
Sbjct:   314 KNMCGLAACSSYPIPL 329


>UNIPROTKB|Q86GF7
            symbol:Cys "Crustapain" species:6703 "Pandalus borealis"
            [GO:0005576 "extracellular region" evidence=IC] [GO:0007586
            "digestion" evidence=NAS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IDA] [GO:0030163 "protein catabolic process"
            evidence=NAS] [GO:0030574 "collagen catabolic process"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005576
            GO:GO:0007586 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0030163 GO:GO:0030574 EMBL:AB091669
            ProteinModelPortal:Q86GF7 SMR:Q86GF7 MEROPS:C01.030 Uniprot:Q86GF7
        Length = 323

 Score = 601 (216.6 bits), Expect = 1.5e-58, P = 1.5e-58
 Identities = 130/317 (41%), Positives = 183/317 (57%)

Query:    37 SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK---EGNRTYKLGTNEFSD 93
             S + + E +  + G+ Y +  E++ R+++F   L++I++ N+   +G  TY L  N FSD
Sbjct:    15 SAIGEWENFKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSD 74

Query:    94 LTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
             LT+EE  A+ TG                T      T +   +DWR KGAVT +K+QG CG
Sbjct:    75 LTHEEVLATKTGMTRRRHPLSVLPKSAPT------TPMAADVDWRNKGAVTPVKDQGQCG 128

Query:   154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGL 211
             SCWAFSAVAA+EG   +  G L+ LSEQ LVDCS+   N GC+GG   +A++YII N+G+
Sbjct:   129 SCWAFSAVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRGI 188

Query:   212 ATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVSVCVEASGQ 270
              TE+ YPY+     C +       AT+  Y +   GDE AL  AV  + PVSVC++A   
Sbjct:   189 DTESSYPYKAIDDNC-RYDAGNIGATVSSYVEPASGDESALQHAVQNEGPVSVCIDAGQS 247

Query:   271 AFRFYKRGVL-NAECGD-NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRIL 328
             +F  Y  GV     C     +H V  VG+GT  + +G  YW++KNSWG  WGESGYI++ 
Sbjct:   248 SFGSYGGGVYYEPNCDSWYANHAVTAVGYGT--DANGGDYWIVKNSWGAWWGESGYIKMA 305

Query:   329 RD-EGLCGIATEASYPV 344
             R+ +  C IAT + YPV
Sbjct:   306 RNRDNNCAIATYSVYPV 322


>MGI|MGI:107823
            symbol:Ctsk "cathepsin K" species:10090 "Mus musculus"
            [GO:0005615 "extracellular space" evidence=ISO] [GO:0005737
            "cytoplasm" evidence=ISO] [GO:0005764 "lysosome" evidence=ISO]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0045453 "bone resorption" evidence=ISO]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:107823 GO:GO:0005615 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0045453
            GO:GO:0001957 HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 OMA:LKVPPSH EMBL:X94444
            EMBL:AJ006033 EMBL:BC046320 IPI:IPI00316575 PIR:S74227
            RefSeq:NP_031828.2 UniGene:Mm.272085 ProteinModelPortal:P55097
            SMR:P55097 MINT:MINT-3089515 STRING:P55097 PhosphoSite:P55097
            PRIDE:P55097 Ensembl:ENSMUST00000015664 GeneID:13038 KEGG:mmu:13038
            InParanoid:P55097 BioCyc:MetaCyc:MONOMER-14811 ChEMBL:CHEMBL1075277
            NextBio:282924 Bgee:P55097 CleanEx:MM_CTSK Genevestigator:P55097
            GermOnline:ENSMUSG00000028111 Uniprot:P55097
        Length = 329

 Score = 597 (215.2 bits), Expect = 4.0e-58, P = 4.0e-58
 Identities = 129/308 (41%), Positives = 180/308 (58%)

Query:    43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEF 99
             E W   H + Y  ++++  R  I+++NL+ I   N E   G  TY+L  N   D+T+EE 
Sbjct:    27 ELWKKTHQKQYNSKVDEISRRLIWEKNLKQISAHNLEASLGVHTYELAMNHLGDMTSEEV 86

Query:   100 RASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFS 159
                 TG                T +++    VP SID+R+KG VT +KNQG CGSCWAFS
Sbjct:    87 VQKMTGLRIPPSRSYSNDTLY-TPEWEG--RVPDSIDYRKKGYVTPVKNQGQCGSCWAFS 143

Query:   160 AVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPY 219
             +  A+EG  +   GKL+ LS Q LVDC T+N GC GG M  AF+Y+ +N G+ +E  YPY
Sbjct:   144 SAGALEGQLKKKTGKLLALSPQNLVDCVTENYGCGGGYMTTAFQYVQQNGGIDSEDAYPY 203

Query:   220 QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVSVCVEASGQAFRFYKRG 278
               +  +C      A AA    Y ++P G+E AL +AV +  P+SV ++AS  +F+FY RG
Sbjct:   204 VGQDESC-MYNATAKAAKCRGYREIPVGNEKALKRAVARVGPISVSIDASLASFQFYSRG 262

Query:   279 VLNAE-CG-DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEG-LCG 335
             V   E C  DN +H V VVG+GT   + G+K+W+IKNSWGE+WG  GY  + R++   CG
Sbjct:   263 VYYDENCDRDNVNHAVLVVGYGT---QKGSKHWIIKNSWGESWGNKGYALLARNKNNACG 319

Query:   336 IATEASYP 343
             I   AS+P
Sbjct:   320 ITNMASFP 327


>RGD|708447
            symbol:Testin "testin gene" species:10116 "Rattus norvegicus"
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0030054 "cell junction" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 RGD:708447 GO:GO:0005576 GO:GO:0030054 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            MEROPS:C01.972 OMA:RYHAENS OrthoDB:EOG4XWG0N EMBL:U16858
            IPI:IPI00207173 PIR:I52525 PIR:PC1251 RefSeq:NP_775155.1
            UniGene:Rn.10029 ProteinModelPortal:P15242 SMR:P15242
            Ensembl:ENSRNOT00000024467 GeneID:286916 KEGG:rno:286916
            UCSC:RGD:708447 CTD:286916 InParanoid:P15242 NextBio:625036
            Genevestigator:P15242 GermOnline:ENSRNOG00000018028 Uniprot:P15242
        Length = 333

 Score = 593 (213.8 bits), Expect = 1.1e-57, P = 1.1e-57
 Identities = 130/320 (40%), Positives = 182/320 (56%)

Query:    35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRTYKLGTNEF 91
             +PS+  +  +W  +HG+TY    E+  R  ++++N + IE  N    EG   + +  N F
Sbjct:    22 DPSLDVEWNEWRTKHGKTYNMNEERLKR-AVWEKNFKMIELHNWEYLEGRHDFTMAMNAF 80

Query:    92 SDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
              DLTN EF    TG+                F Y     VP  +DWR+ G VT +KNQGH
Sbjct:    81 GDLTNIEFVKMMTGFQRQKIKKTHIFQDHQ-FLY-----VPKRVDWRQLGYVTPVKNQGH 134

Query:   152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDN--NGCSGGLMDKAFEYIIENK 209
             C S WAFSA  ++EG       +LI LSEQ L+DC   N  +GCSGG M  AF+Y+ +N 
Sbjct:   135 CASSWAFSATGSLEGQMFRKTERLIPLSEQNLLDCMGSNVTHGCSGGFMQYAFQYVKDNG 194

Query:   210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVSVCVEAS 268
             GLATE  YPY+ +   C    E +AA  +  +  +P G E AL++AV K  P+SV V+AS
Sbjct:   195 GLATEESYPYRGQGRECRYHAENSAA-NVRDFVQIP-GSEEALMKAVAKVGPISVAVDAS 252

Query:   269 GQAFRFYKRGVL-NAECGD-NCDHGVAVVGFG-TAEEEDGAKYWLIKNSWGETWGESGYI 325
               +F+FY  G+    +C   + +H V VVG+G   EE DG  +WL+KNSWGE WG  GY+
Sbjct:   253 HGSFQFYGSGIYYEPQCKRVHLNHAVLVVGYGFEGEESDGNSFWLVKNSWGEEWGMKGYM 312

Query:   326 RILRD-EGLCGIATEASYPV 344
             ++ +D    CGIAT ++YP+
Sbjct:   313 KLAKDWSNHCGIATYSTYPI 332


>UNIPROTKB|Q4QRC2
            symbol:Ctsql2 "Protein Ctsql2" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 HOVERGEN:HBG011513 EMBL:CH474032
            RGD:1303225 EMBL:BC097257 IPI:IPI00421946 RefSeq:NP_001002813.2
            UniGene:Rn.128678 SMR:Q4QRC2 MEROPS:C01.111
            Ensembl:ENSRNOT00000038758 GeneID:408201 KEGG:rno:408201 CTD:408201
            InParanoid:Q4QRC2 OMA:NDEGALM NextBio:696394 Genevestigator:Q4QRC2
            Uniprot:Q4QRC2
        Length = 343

 Score = 593 (213.8 bits), Expect = 1.1e-57, P = 1.1e-57
 Identities = 131/332 (39%), Positives = 187/332 (56%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRT 83
             VVSG S    S+  + ++W  ++ + Y  E E+ ++  ++++N++ IE  N+E   G  T
Sbjct:    14 VVSGASAFNLSLDVQWQEWKMKYEKLYSPE-EELLKRVVWEENVKKIELHNRENSLGKNT 72

Query:    84 YKLGTNEFSDLTNEEFRASYTGY-----NXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWR 138
             Y +  N F+DLT+EEF+   TG      N                 +     +P SIDWR
Sbjct:    73 YIMEINNFADLTDEEFKDMITGITLPINNTMKSLWKRALGSPFPNSWYWRDALPKSIDWR 132

Query:   139 EKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGG 196
             ++G VT ++ QG C SCWAF    A+EG      GKL  LS Q LVDCS    N GC GG
Sbjct:   133 KEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGG 192

Query:   197 LMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV 256
                 AF+Y+++N GL +EA YPY+ ++G C K   K A A I ++  LP+ DE  L+ A+
Sbjct:   193 TTYNAFQYVLQNGGLESEATYPYKGKEGLC-KYNPKNAYAKITRFVALPE-DEDVLMDAL 250

Query:   257 -TKQPVSVCVEASGQAFRFYKRGVLNA-ECGDNCDHGVAVVGFG-TAEEEDGAKYWLIKN 313
              TK PV+  +     + RFYK+G+ +  +C +  +H V VVG+G    E DG  YWLIKN
Sbjct:   251 ATKGPVAAGIHVVYSSLRFYKKGIYHEPKCNNRVNHAVLVVGYGFEGNETDGNNYWLIKN 310

Query:   314 SWGETWGESGYIRILRDEGL-CGIATEASYPV 344
             SWG+ WG  GY++I +D    CGIAT A YP+
Sbjct:   311 SWGKQWGLKGYMKIAKDRNNHCGIATFAQYPI 342


>UNIPROTKB|Q9GLE3
            symbol:CTSK "Cathepsin K" species:9823 "Sus scrofa"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0005576 "extracellular
            region" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005576 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 MEROPS:I29.007
            CTD:1513 KO:K01371 OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55
            OMA:LKVPPSH EMBL:AF292030 RefSeq:NP_999467.1 UniGene:Ssc.1020
            ProteinModelPortal:Q9GLE3 SMR:Q9GLE3 STRING:Q9GLE3
            Ensembl:ENSSSCT00000007283 GeneID:397569 KEGG:ssc:397569
            ArrayExpress:Q9GLE3 Uniprot:Q9GLE3
        Length = 330

 Score = 591 (213.1 bits), Expect = 1.7e-57, P = 1.7e-57
 Identities = 129/324 (39%), Positives = 187/324 (57%)

Query:    28 VSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRT 83
             V   +++   I++   E W   + + Y  ++++  R  I+++NL++I   N E   G  T
Sbjct:    12 VMSSALYPEEILDTQWELWKKTYRKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHT 71

Query:    84 YKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAV 143
             Y+L  N   D+T+EE     TG                   ++  T  P SID+R+KG V
Sbjct:    72 YELAMNHLGDMTSEEVVQKMTGLKVPPSHSRSNDTLYIP-DWEGRT--PDSIDYRKKGYV 128

Query:   144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFE 203
             T +KNQG CGSCWAFS+V A+EG  +   GKL+ LS Q LVDC ++N+GC GG M  AF+
Sbjct:   129 TPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQ 188

Query:   204 YIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVS 262
             Y+ +N+G+ +E  YPY  +   C       AA   G Y ++P+G+E AL +AV +  PVS
Sbjct:   189 YVQKNRGIDSEDAYPYVGQDENCMYNPTGKAAKCRG-YREIPEGNEKALKRAVARVGPVS 247

Query:   263 VCVEASGQAFRFYKRGVLNAE-CG-DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWG 320
             V ++AS  +F+FY +GV   E C  DN +H V  VG+G    + G K+W+IKNSWGE WG
Sbjct:   248 VAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGI---QKGKKHWIIKNSWGENWG 304

Query:   321 ESGYIRILRDEG-LCGIATEASYP 343
               GYI + R++   CGIA  AS+P
Sbjct:   305 NKGYILMARNKNNACGIANLASFP 328


>MGI|MGI:1349426
            symbol:Ctsj "cathepsin J" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=ISO]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0048471 "perinuclear region
            of cytoplasm" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:1349426 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF136272
            EMBL:AF158182 EMBL:AY034579 EMBL:AK005526 EMBL:AK131661
            EMBL:BC103769 IPI:IPI00126770 RefSeq:NP_036137.1 UniGene:Mm.31948
            ProteinModelPortal:Q9R014 SMR:Q9R014 MEROPS:C01.038 PRIDE:Q9R014
            Ensembl:ENSMUST00000071526 GeneID:26898 KEGG:mmu:26898
            UCSC:uc007qwa.1 CTD:26898 InParanoid:Q9R014 KO:K09599
            NextBio:304745 Bgee:Q9R014 CleanEx:MM_CTSJ Genevestigator:Q9R014
            GermOnline:ENSMUSG00000055298 Uniprot:Q9R014
        Length = 334

 Score = 591 (213.1 bits), Expect = 1.7e-57, P = 1.7e-57
 Identities = 125/326 (38%), Positives = 181/326 (55%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRT 83
             V SG   H+P +  + + W  ++ ++Y  + E+A+R  ++++N+  I+  NKE   G   
Sbjct:    14 VASGAQAHDPKLDAEWKDWKTKYAKSYSPK-EEALRRAVWEENMRMIKLHNKENSLGKNN 72

Query:    84 YKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAV 143
             + +  N+F D T+EEFR S                   +        +P   DWRE+G V
Sbjct:    73 FTMKMNKFGDQTSEEFRKSIDNIPIPAAMTDPHAQNHVSI------GLPDYKDWREEGYV 126

Query:   144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS--TDNNGCSGGLMDKA 201
             T ++NQG CGSCWAF+A  A+EG      G L  LS Q L+DCS    N GC  G   +A
Sbjct:   127 TPVRNQGKCGSCWAFAAAGAIEGQMFWKTGNLTPLSVQNLLDCSKTVGNKGCQSGTAHQA 186

Query:   202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
             FEY+++NKGL  EA YPY+ + G C  + E A+A  I  Y +LP  + +  +   +  PV
Sbjct:   187 FEYVLKNKGLEAEATYPYEGKDGPCRYRSENASA-NITDYVNLPPNELYLWVAVASIGPV 245

Query:   262 SVCVEASGQAFRFYKRGVL-NAECGDN-CDHGVAVVGFGT-AEEEDGAKYWLIKNSWGET 318
             S  ++AS  +FRFY  G+     C     +H V VVG+G+  + +DG  YWLIKNSWGE 
Sbjct:   246 SAAIDASHDSFRFYNGGIYYEPNCSSYFVNHAVLVVGYGSEGDVKDGNNYWLIKNSWGEE 305

Query:   319 WGESGYIRILRDEGL-CGIATEASYP 343
             WG +GY++I +D    CGIA+ ASYP
Sbjct:   306 WGMNGYMQIAKDHNNHCGIASLASYP 331


>DICTYBASE|DDB_G0278401
            symbol:cprH "cysteine proteinase 8" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0278401 EMBL:AAFI02000023
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 ProtClustDB:CLSZ2430780 RefSeq:XP_642342.1
            ProteinModelPortal:Q54Y60 MEROPS:C01.A62 EnsemblProtists:DDB0205428
            GeneID:8621547 KEGG:ddi:DDB_G0278401 InParanoid:Q54Y60 OMA:FANMENE
            Uniprot:Q54Y60
        Length = 337

 Score = 587 (211.7 bits), Expect = 4.6e-57, P = 4.6e-57
 Identities = 128/330 (38%), Positives = 185/330 (56%)

Query:    31 RSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNE 90
             + + E    +    WM  + ++Y    E   R  IFK N +YIE+ N +G+ T  LG N+
Sbjct:    19 QELSESQYRDAFTDWMISNQKSYSSS-EFITRYNIFKTNFDYIEEWNSKGSETV-LGLNK 76

Query:    91 FSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQG 150
              +D+TNEE+R+ Y G                   + N     +++DWR+KGAVTH+KNQ 
Sbjct:    77 MADITNEEYRSLYLG---KPFDASSLIGTKEEILFSN--KFSSTVDWRKKGAVTHVKNQQ 131

Query:   151 HCGSCWAFSAVAAVEGITQITGG---KLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYI 205
              C  CW+FSA  A EG  ++      +L+ LSEQ L+DCST   N GC+GG++  AFEYI
Sbjct:   132 SCSGCWSFSATGATEGAHKLANNGTNELVSLSEQNLIDCSTPFGNTGCNGGVITYAFEYI 191

Query:   206 IENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCV 265
             I N G+ TE  YP++   GTC + K + + ATI  Y ++  G E +L  AV   PV+  +
Sbjct:   192 ISNGGIDTEKSYPFEGTDGTC-RYKSENSGATISSYVNVTFGSESSLESAVNVNPVACSI 250

Query:   266 EASGQAFRFYKRGV-LNAECG-DNCDHGVAVVGFGT--------AEEEDGAKYWLIKNSW 315
             +AS  +F FYK G+     C   N DHGV VVG+GT        + E + + YW+ KNSW
Sbjct:   251 DASHSSFLFYKSGIYFEPACSRTNLDHGVLVVGYGTENSQSQDSSSEPNHSNYWIAKNSW 310

Query:   316 GETWGESGYIRILRD-EGLCGIATEASYPV 344
             G     +GYI + +D + +CGI+T AS+P+
Sbjct:   311 GI----NGYILMSKDRDNMCGISTLASFPI 336


>MGI|MGI:1922258
            symbol:4930486L24Rik "RIKEN cDNA 4930486L24 gene"
            species:10090 "Mus musculus" [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030054 "cell
            junction" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 MGI:MGI:1922258
            GO:GO:0005576 GO:GO:0030054 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 HSSP:P07711
            EMBL:AY146988 EMBL:AK145933 EMBL:BC061218 IPI:IPI00280732
            RefSeq:NP_835199.1 UniGene:Mm.19839 ProteinModelPortal:Q80UB0
            SMR:Q80UB0 MEROPS:C01.972 PRIDE:Q80UB0 Ensembl:ENSMUST00000091569
            GeneID:214639 KEGG:mmu:214639 UCSC:uc007qvs.1 InParanoid:Q80UB0
            OMA:RYHAENS OrthoDB:EOG4XWG0N NextBio:374408 Bgee:Q80UB0
            CleanEx:MM_4930486L24RIK Genevestigator:Q80UB0 Uniprot:Q80UB0
        Length = 333

 Score = 587 (211.7 bits), Expect = 4.6e-57, P = 4.6e-57
 Identities = 130/320 (40%), Positives = 179/320 (55%)

Query:    35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRTYKLGTNEF 91
             +PS+  +  +W  +HG+ Y    E+ +R  ++++N + IE  N    EG   + +  N F
Sbjct:    22 DPSLDVQWNEWRTKHGKAYNVN-EERLRRAVWEKNFKMIELHNWEYLEGKHDFTMTMNAF 80

Query:    92 SDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
              DLTN EF    TG+                F+      VP  +DWR  G VT +KNQG+
Sbjct:    81 GDLTNTEFVKMMTGFRRQKIKRMH------VFQDHQFLYVPKYVDWRMLGYVTPVKNQGY 134

Query:   152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDN--NGCSGGLMDKAFEYIIENK 209
             C S WAFSA  ++EG      G+L+ LSEQ L+DC   N  + CSGG M  AF+Y+ +N 
Sbjct:   135 CASSWAFSATGSLEGQMFKKTGRLVPLSEQNLLDCMGSNVTHDCSGGFMQNAFQYVKDNG 194

Query:   210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVSVCVEAS 268
             GLATE  YPY      C    E +AA  +  +  +P G E AL++AV K  P+SV V+AS
Sbjct:   195 GLATEESYPYIGPGRKCRYHAENSAA-NVRDFVQIP-GREEALMKAVAKVGPISVAVDAS 252

Query:   269 GQAFRFYKRGVL-NAECGD-NCDHGVAVVGFG-TAEEEDGAKYWLIKNSWGETWGESGYI 325
               +F+FY  G+    +C   + +H V VVG+G   EE DG  YWL+KNSWGE WG  GYI
Sbjct:   253 HDSFQFYDSGIYYEPQCKRVHLNHAVLVVGYGFEGEESDGNSYWLVKNSWGEEWGMKGYI 312

Query:   326 RILRD-EGLCGIATEASYPV 344
             +I +D    CGIAT A+YP+
Sbjct:   313 KIAKDWNNHCGIATLATYPI 332


>ZFIN|ZDB-GENE-050522-559
            symbol:ctssb.1 "cathepsin S, b.1" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-050522-559 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 MEROPS:C01.034
            EMBL:BC095694 IPI:IPI00607338 UniGene:Dr.75553
            ProteinModelPortal:Q502H6 SMR:Q502H6 InParanoid:Q502H6
            ArrayExpress:Q502H6 Uniprot:Q502H6
        Length = 330

 Score = 585 (211.0 bits), Expect = 7.5e-57, P = 7.5e-57
 Identities = 130/321 (40%), Positives = 185/321 (57%)

Query:    34 HEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTN 89
             H  + +++H E W   +G+ Y  E+E+  R  ++++NL+ I   N E   G  +Y L  N
Sbjct:    18 HFNTNLDQHWELWKKTYGKIYTTEVEEFGRRQLWERNLQLITVHNLEASMGMHSYDLSMN 77

Query:    90 EFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQ 149
                DLT EE   +                        +   VP S+DWREKG V+ +K Q
Sbjct:    78 HMGDLTTEEILQTLA----LTHVPSGFKRQIANIVGSSGDAVPDSLDWREKGYVSSVKMQ 133

Query:   150 GHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIE 207
             G CGSCWAFS+V A+EG  + T GKL++LS Q LVDCS+   N GC+GG M  AF+Y+I+
Sbjct:   134 GACGSCWAFSSVGALEGQLKKTTGKLVDLSPQNLVDCSSKYGNKGCNGGFMSDAFQYVID 193

Query:   208 NKGLATEADYPYQQEQGTCD-KQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVSVCV 265
             N G+A+++ YPY+  Q  C     ++AA  T  KY  + +GDE+AL QAV    P+SV +
Sbjct:   194 NGGIASDSAYPYRGVQQQCSYSSSQRAANCT--KYYFVRQGDENALKQAVASVGPISVAI 251

Query:   266 EASGQAFRFYKRGVLN-AECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGY 324
             +A+   F  Y  GV N   C    +H V VVG+GT   +D   +WL+KNSWG  +G+ GY
Sbjct:   252 DATRPQFVLYHSGVYNDPTCSKRVNHAVLVVGYGTLSGQD---HWLVKNSWGTRFGDGGY 308

Query:   325 IRILRDEG-LCGIATEASYPV 344
             IR+ R++  +CGIA+ A YPV
Sbjct:   309 IRMARNKNNMCGIASYACYPV 329


>UNIPROTKB|G1SQF0
            symbol:CTSH "Uncharacterized protein" species:9986
            "Oryctolagus cuniculus" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 RefSeq:XP_002721635.1 UniGene:Ocu.7137
            Ensembl:ENSOCUT00000006138 GeneID:100101597 Uniprot:G1SQF0
        Length = 333

 Score = 584 (210.6 bits), Expect = 9.6e-57, P = 9.6e-57
 Identities = 127/318 (39%), Positives = 182/318 (57%)

Query:    39 VEK-H-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTN 96
             +EK H + WM+QH + Y  E E   RL  F +N   I  A+  GN T+++G N+FSD++ 
Sbjct:    28 LEKFHFKSWMSQHHKKYSAE-EYPRRLQTFVRNWRKIN-AHNNGNHTFQMGLNQFSDMSF 85

Query:    97 EEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGA-VTHIKNQGHCGSC 155
              E +  Y                  T  Y      P+S+DWR+KG  V+ +KNQG CGSC
Sbjct:    86 AEIKHKYLWTEPQNCSATKSNYLRGTGPY------PSSVDWRKKGNFVSPVKNQGACGSC 139

Query:   156 WAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLAT 213
             W FS   A+E    I GGK++ L+EQQLVDC+ +  N+GC GGL  +AFEYI+ NKG+  
Sbjct:   140 WTFSTTGALESAVAIAGGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQAFEYILYNKGIMG 199

Query:   214 EADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAF 272
             E  YPY+  +G C  Q +KA A  +    ++   DE A+++AV    PVS   E + + F
Sbjct:   200 EDSYPYRAMEGRCKFQPQKAIAF-VKDVANITLNDEEAMVEAVALYNPVSFAFEVT-EDF 257

Query:   273 RFYKRGVLNA-ECG---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRIL 328
               Y++G+ ++  C    D  +H V  VG+G   EE+G  YW++KNSWG  WG +GY  I 
Sbjct:   258 MQYRKGIYSSTSCHKTPDKVNHAVLAVGYG---EENGVPYWIVKNSWGSHWGMNGYFYIE 314

Query:   329 RDEGLCGIATEASYPVAM 346
             R + +CG+A  ASYP+ +
Sbjct:   315 RGKNMCGLAACASYPIPL 332


>UNIPROTKB|D3ZZR3
            symbol:D3ZZR3 "Uncharacterized protein" species:10116
            "Rattus norvegicus" [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0016020 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0002250 GeneTree:ENSGT00560000076577 GO:GO:0097067
            OrthoDB:EOG4JM7Q2 IPI:IPI00210228 PRIDE:D3ZZR3
            Ensembl:ENSRNOT00000028732 Uniprot:D3ZZR3
        Length = 331

 Score = 584 (210.6 bits), Expect = 9.6e-57, P = 9.6e-57
 Identities = 135/328 (41%), Positives = 187/328 (57%)

Query:    29 SGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTY 84
             +G +   P  ++ H + W   H + YKD+ E+ +R  I+++NL++I   N E   G  +Y
Sbjct:    13 NGATAERP--LDHHWDLWKKTHEKEYKDQNEEDVRRLIWEKNLKFIMLHNLEHSMGMHSY 70

Query:    85 KLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWRE--KGA 142
              +G N   D+  E       G                +   QN+   P  + W+E  KG 
Sbjct:    71 SVGMNHMGDMVAETIIGEM-GSERLPRKRKALGLIPSSVN-QNL---PAGVKWKERTKGC 125

Query:   143 VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD----NNGCSGGLM 198
               ++  QG CGSCWAFSAV A+EG  ++  GKL+ LS Q LVDCST+    N GC GG M
Sbjct:   126 WKNLVFQGSCGSCWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFM 185

Query:   199 DKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-T 257
              +AF+YII+N G+ +EA YPY+     C     K  AAT  +Y +LP GDE AL +AV T
Sbjct:   186 TEAFQYIIDNGGIDSEASYPYKAMDEKCHYDP-KNRAATCSRYIELPFGDEEALKEAVAT 244

Query:   258 KQPVSVCVEASGQAFRFYKRGVLN-AECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWG 316
             K PVSV ++AS  +F  Y+ GV +   C +N +HGV VVG+GT    DG  YWL+KNSWG
Sbjct:   245 KGPVSVGIDASHSSFFLYQSGVYDDPSCTENVNHGVLVVGYGTL---DGKDYWLVKNSWG 301

Query:   317 ETWGESGYIRILRD-EGLCGIATEASYP 343
               +G+ GYIR+ R+ +  CGIA+  SYP
Sbjct:   302 LHFGDQGYIRMARNNKNHCGIASYCSYP 329


>UNIPROTKB|Q5E968
            symbol:CTSK "Cathepsin K" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:BT021052
            EMBL:BC109853 IPI:IPI00709374 RefSeq:NP_001029607.1
            UniGene:Bt.23218 ProteinModelPortal:Q5E968 SMR:Q5E968 STRING:Q5E968
            MEROPS:I29.007 PRIDE:Q5E968 Ensembl:ENSBTAT00000028016
            GeneID:513038 KEGG:bta:513038 CTD:1513 InParanoid:Q5E968 KO:K01371
            OrthoDB:EOG4SJ5FC NextBio:20870669 PANTHER:PTHR12411:SF55
            Uniprot:Q5E968
        Length = 329

 Score = 583 (210.3 bits), Expect = 1.2e-56, P = 1.2e-56
 Identities = 128/324 (39%), Positives = 185/324 (57%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRT 83
             VVS     E  +  + E W   + + Y  + ++  R  I+++NL++I   N E   G  T
Sbjct:    11 VVSFALYPEEILDTQWELWKKTYRKQYNSKGDEISRRLIWEKNLKHISIHNLEASLGVHT 70

Query:    84 YKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAV 143
             Y+L  N   D+T+EE     TG                   ++     P S+D+R+KG V
Sbjct:    71 YELAMNHLGDMTSEEVVQKMTGLKVPASRSRSNDTLYIP-DWEG--RAPDSVDYRKKGYV 127

Query:   144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFE 203
             T +KNQG CGSCWAFS+V A+EG  +   GKL+ LS Q LVDC ++N+GC GG M  AF+
Sbjct:   128 TPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQ 187

Query:   204 YIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVS 262
             Y+ +N+G+ +E  YPY  +   C       AA   G Y ++P+G+E AL +AV +  P+S
Sbjct:   188 YVQKNRGIDSEDAYPYVGQDENCMYNPTGKAAKCRG-YREIPEGNEKALKRAVARVGPIS 246

Query:   263 VCVEASGQAFRFYKRGVLNAE-CG-DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWG 320
             V ++AS  +F+FY++GV   E C  DN +H V  VG+G    + G K+W+IKNSWGE WG
Sbjct:   247 VAIDASLTSFQFYRKGVYYDENCNSDNLNHAVLAVGYGI---QKGNKHWIIKNSWGENWG 303

Query:   321 ESGYIRILRDEG-LCGIATEASYP 343
               GYI + R++   CGIA  AS+P
Sbjct:   304 NKGYILMARNKNNACGIANLASFP 327


>UNIPROTKB|F7B939
            symbol:CTSH "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458 CTD:1512
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 EMBL:ACFV01158341
            EMBL:ACFV01158342 EMBL:ACFV01158343 RefSeq:XP_002753411.1
            Ensembl:ENSCJAT00000004397 GeneID:100413104 Uniprot:F7B939
        Length = 336

 Score = 582 (209.9 bits), Expect = 1.6e-56, P = 1.6e-56
 Identities = 136/339 (40%), Positives = 183/339 (53%)

Query:    23 CASQVVSG---RSMHEPSI--VEK-H-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEK 75
             CA   + G   R   E S+  +EK H + WMA+H +TY  E E   RL  F  N   I  
Sbjct:     9 CAGVCLLGAPARGAAELSVNSLEKFHFKSWMAKHHKTYSREEEYHQRLQTFASNWRKIN- 67

Query:    76 ANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSI 135
             A+  GN T+K+  N+FSD++  E +  Y                  T  Y      P S+
Sbjct:    68 AHNNGNHTFKMAVNQFSDMSFAEIKRKYLWSEPQNCSATKSNYLRGTGPY------PPSV 121

Query:   136 DWREKGA-VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNG 192
             DWR+KG  V+ +KNQG CGSCW FS   A+E    I  GK++ L+EQQLVDC+ D  N+G
Sbjct:   122 DWRKKGHFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHG 181

Query:   193 CSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHAL 252
             C GGL  +AFEYI+ N G+  E  YPYQ +   C  Q  KA    +    ++   DE A+
Sbjct:   182 CQGGLPSQAFEYILYNNGIMGEDTYPYQGKDSDCKFQPGKAIGF-VKDVANITIYDEDAM 240

Query:   253 LQAVTK-QPVSVCVEASGQAFRFYKRGVLNA-ECG---DNCDHGVAVVGFGTAEEEDGAK 307
             ++AV    PVS   E + Q F  YKRG+ ++  C    D  +H V  VG+G   EE+G  
Sbjct:   241 VEAVALYNPVSFAFEVT-QDFMMYKRGIYSSTSCHKTPDKVNHAVLAVGYG---EENGIP 296

Query:   308 YWLIKNSWGETWGESGYIRILRDEGLCGIATEASYPVAM 346
             YW++KNSWG  WG +GY  I R + +CG+A  ASYPV +
Sbjct:   297 YWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASYPVPL 335


>UNIPROTKB|F7BRD4
            symbol:CTSH "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001656 "metanephros development"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0001656
            GeneTree:ENSGT00660000095458 EMBL:ACFV01158341 EMBL:ACFV01158342
            EMBL:ACFV01158343 Ensembl:ENSCJAT00000004396 Uniprot:F7BRD4
        Length = 336

 Score = 582 (209.9 bits), Expect = 1.6e-56, P = 1.6e-56
 Identities = 132/329 (40%), Positives = 179/329 (54%)

Query:    28 VSGRSMHEPSIVEK-H-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYK 85
             VS +   +   +EK H + WMA+H +TY  E E   RL  F  N   I  A+  GN T+K
Sbjct:    19 VSKKKKKKMLALEKFHFKSWMAKHHKTYSREEEYHQRLQTFASNWRKIN-AHNNGNHTFK 77

Query:    86 LGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGA-VT 144
             +  N+FSD++  E +  Y                  T  Y      P S+DWR+KG  V+
Sbjct:    78 MAVNQFSDMSFAEIKRKYLWSEPQNCSATKSNYLRGTGPY------PPSVDWRKKGHFVS 131

Query:   145 HIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAF 202
              +KNQG CGSCW FS   A+E    I  GK++ L+EQQLVDC+ D  N+GC GGL  +AF
Sbjct:   132 PVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAF 191

Query:   203 EYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPV 261
             EYI+ N G+  E  YPYQ +   C  Q  KA    +    ++   DE A+++AV    PV
Sbjct:   192 EYILYNNGIMGEDTYPYQGKDSDCKFQPGKAIGF-VKDVANITIYDEDAMVEAVALYNPV 250

Query:   262 SVCVEASGQAFRFYKRGVLNA-ECG---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGE 317
             S   E + Q F  YKRG+ ++  C    D  +H V  VG+G   EE+G  YW++KNSWG 
Sbjct:   251 SFAFEVT-QDFMMYKRGIYSSTSCHKTPDKVNHAVLAVGYG---EENGIPYWIVKNSWGP 306

Query:   318 TWGESGYIRILRDEGLCGIATEASYPVAM 346
              WG +GY  I R + +CG+A  ASYPV +
Sbjct:   307 QWGMNGYFLIERGKNMCGLAACASYPVPL 335


>UNIPROTKB|G1K2A7
            symbol:CTSK "Cathepsin K" species:9615 "Canis lupus
            familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 PANTHER:PTHR12411:SF55 OMA:LKVPPSH
            EMBL:AAEX03011051 Ensembl:ENSCAFT00000019202 Uniprot:G1K2A7
        Length = 333

 Score = 581 (209.6 bits), Expect = 2.0e-56, P = 2.0e-56
 Identities = 123/306 (40%), Positives = 180/306 (58%)

Query:    45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRA 101
             W   + + Y  ++++  R  I+++NL++I   N E   G  TY+L  N   D+T+EE   
Sbjct:    33 WKKTYRKQYNSKVDELSRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEVVQ 92

Query:   102 SYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAV 161
               TG                   +++    P S+D+R+KG VT +KNQG CGSCWAFS+V
Sbjct:    93 KMTGLKVPPSHSRSNDTLYIP-DWES--RAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSV 149

Query:   162 AAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQ 221
              A+EG  +   GKL+ LS Q LVDC ++N+GC GG M  AF+Y+ +N+G+ +E  YPY  
Sbjct:   150 GALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVG 209

Query:   222 EQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVSVCVEASGQAFRFYKRGVL 280
             +  +C       AA   G Y ++P+G+E AL +AV +  P+SV ++AS  +F+FY +GV 
Sbjct:   210 QDESCMYNPTGKAAKCRG-YREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVY 268

Query:   281 NAE-CG-DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEG-LCGIA 337
               E C  DN +H V  VG+G    + G K+W+IKNSWGE WG  GYI + R++   CGIA
Sbjct:   269 YDENCNSDNLNHAVLAVGYGI---QKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIA 325

Query:   338 TEASYP 343
               AS+P
Sbjct:   326 NLASFP 331


>UNIPROTKB|Q3ZKN1
            symbol:CTSK "Cathepsin K" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:AY738221
            RefSeq:NP_001029168.1 UniGene:Cfa.588 HSSP:P43235
            ProteinModelPortal:Q3ZKN1 SMR:Q3ZKN1 STRING:Q3ZKN1 GeneID:608843
            KEGG:cfa:608843 InParanoid:Q3ZKN1 NextBio:20894470 Uniprot:Q3ZKN1
        Length = 330

 Score = 581 (209.6 bits), Expect = 2.0e-56, P = 2.0e-56
 Identities = 123/306 (40%), Positives = 180/306 (58%)

Query:    45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRA 101
             W   + + Y  ++++  R  I+++NL++I   N E   G  TY+L  N   D+T+EE   
Sbjct:    30 WKKTYRKQYNSKVDELSRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEVVQ 89

Query:   102 SYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAV 161
               TG                   +++    P S+D+R+KG VT +KNQG CGSCWAFS+V
Sbjct:    90 KMTGLKVPPSHSRSNDTLYIP-DWES--RAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSV 146

Query:   162 AAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQ 221
              A+EG  +   GKL+ LS Q LVDC ++N+GC GG M  AF+Y+ +N+G+ +E  YPY  
Sbjct:   147 GALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVG 206

Query:   222 EQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVSVCVEASGQAFRFYKRGVL 280
             +  +C       AA   G Y ++P+G+E AL +AV +  P+SV ++AS  +F+FY +GV 
Sbjct:   207 QDESCMYNPTGKAAKCRG-YREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVY 265

Query:   281 NAE-CG-DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEG-LCGIA 337
               E C  DN +H V  VG+G    + G K+W+IKNSWGE WG  GYI + R++   CGIA
Sbjct:   266 YDENCNSDNLNHAVLAVGYGI---QKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIA 322

Query:   338 TEASYP 343
               AS+P
Sbjct:   323 NLASFP 328


>RGD|2447
            symbol:Ctsh "cathepsin H" species:10116 "Rattus norvegicus"
            [GO:0001520 "outer dense fiber" evidence=IDA] [GO:0001656
            "metanephros development" evidence=IEP] [GO:0001669 "acrosomal
            vesicle" evidence=IDA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISO;ISS] [GO:0002250 "adaptive immune response"
            evidence=ISO] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISO;ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISO] [GO:0004177 "aminopeptidase activity" evidence=ISO;IDA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISO;ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISO;ISS]
            [GO:0005615 "extracellular space" evidence=ISO;ISS;IDA] [GO:0005764
            "lysosome" evidence=ISO;ISS;IDA] [GO:0005829 "cytosol"
            evidence=ISO;ISS] [GO:0006508 "proteolysis" evidence=IEP;ISO]
            [GO:0007283 "spermatogenesis" evidence=IEP] [GO:0008233 "peptidase
            activity" evidence=ISO] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISO] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISO;ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISO;ISS] [GO:0010634 "positive regulation
            of epithelial cell migration" evidence=ISO;ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISO;ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISO;ISS] [GO:0010952
            "positive regulation of peptidase activity" evidence=ISO;ISS]
            [GO:0016505 "apoptotic protease activator activity" evidence=ISO;ISS]
            [GO:0030108 "HLA-A specific activating MHC class I receptor activity"
            evidence=ISO;ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISO;ISS] [GO:0030984 "kininogen binding" evidence=IPI]
            [GO:0031638 "zymogen activation" evidence=ISO;ISS] [GO:0031648
            "protein destabilization" evidence=ISO;ISS] [GO:0032403 "protein
            complex binding" evidence=IPI] [GO:0032526 "response to retinoic
            acid" evidence=ISO;ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISO;ISS] [GO:0035085 "cilium axoneme" evidence=IDA]
            [GO:0043066 "negative regulation of apoptotic process"
            evidence=ISO;ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISO;ISS] [GO:0043621 "protein self-association"
            evidence=IDA] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISO;ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISO;ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISO;ISS] [GO:0070371 "ERK1 and
            ERK2 cascade" evidence=ISO;ISS] [GO:0097067 "cellular response to
            thyroid hormone stimulus" evidence=ISO;IEP] [GO:0097208 "alveolar
            lamellar body" evidence=ISO;ISS;IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:2447 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 GO:GO:0001669 eggNOG:COG4870
            HOGENOM:HOG000230774 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0007283
            GO:GO:0045766 GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0043621 GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 KO:K01366
            GO:GO:0016505 GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            HOVERGEN:HBG011513 GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT OrthoDB:EOG4W9J43
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 MEROPS:I29.003 EMBL:Y00708 EMBL:BC085352
            EMBL:M38135 IPI:IPI00212809 PIR:S00211 RefSeq:NP_037071.1
            UniGene:Rn.1997 ProteinModelPortal:P00786 SMR:P00786 STRING:P00786
            PRIDE:P00786 Ensembl:ENSRNOT00000019285 GeneID:25425 KEGG:rno:25425
            UCSC:RGD:2447 InParanoid:P00786 BindingDB:P00786 NextBio:606599
            Genevestigator:P00786 GermOnline:ENSRNOG00000014064 GO:GO:0035086
            GO:GO:0001520 Uniprot:P00786
        Length = 333

 Score = 577 (208.2 bits), Expect = 5.3e-56, P = 5.3e-56
 Identities = 127/316 (40%), Positives = 177/316 (56%)

Query:    39 VEK-H-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTN 96
             +EK H   WM QH +TY    E + RL +F  N   I+ A+ + N T+K+G N+FSD++ 
Sbjct:    28 IEKFHFTSWMKQHQKTYSSR-EYSHRLQVFANNWRKIQ-AHNQRNHTFKMGLNQFSDMSF 85

Query:    97 EEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKG-AVTHIKNQGHCGSC 155
              E +  Y                  T  Y      P+S+DWR+KG  V+ +KNQG CGSC
Sbjct:    86 AEIKHKYLWSEPQNCSATKSNYLRGTGPY------PSSMDWRKKGNVVSPVKNQGACGSC 139

Query:   156 WAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLAT 213
             W FS   A+E    I  GK++ L+EQQLVDC+ +  N+GC GGL  +AFEYI+ NKG+  
Sbjct:   140 WTFSTTGALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMG 199

Query:   214 EADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAF 272
             E  YPY  + G C    EKA A  +    ++   DE A+++AV    PVS   E + + F
Sbjct:   200 EDSYPYIGKNGQCKFNPEKAVAF-VKNVVNITLNDEAAMVEAVALYNPVSFAFEVT-EDF 257

Query:   273 RFYKRGVLNAE-CG---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRIL 328
               YK GV ++  C    D  +H V  VG+G   E++G  YW++KNSWG  WG +GY  I 
Sbjct:   258 MMYKSGVYSSNSCHKTPDKVNHAVLAVGYG---EQNGLLYWIVKNSWGSNWGNNGYFLIE 314

Query:   329 RDEGLCGIATEASYPV 344
             R + +CG+A  ASYP+
Sbjct:   315 RGKNMCGLAACASYPI 330


>RGD|69241
            symbol:Ctsj "cathepsin J" species:10116 "Rattus norvegicus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0048471 "perinuclear region of cytoplasm"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:L14776
            RGD:69241 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246
            SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 HOVERGEN:HBG011513 MEROPS:C01.038 CTD:26898 KO:K09599
            EMBL:AF310623 EMBL:BC097263 IPI:IPI00205027 PIR:I58002
            RefSeq:NP_058817.1 UniGene:Rn.34875 ProteinModelPortal:Q63088
            SMR:Q63088 PRIDE:Q63088 GeneID:29174 KEGG:rno:29174 NextBio:608244
            Genevestigator:Q63088 Uniprot:Q63088
        Length = 334

 Score = 576 (207.8 bits), Expect = 6.8e-56, P = 6.8e-56
 Identities = 123/326 (37%), Positives = 183/326 (56%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRT 83
             V SG    +P++  + + W  ++ ++Y   +E+ ++  ++++NL+ I+  NKE   G   
Sbjct:    14 VASGAPARDPNLDAEWQDWKTKYAKSYSP-VEEELKRAVWEENLKMIQLHNKENGLGKNG 72

Query:    84 YKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAV 143
             + +  N F+D T EEFR S +                 + + Q    +P   DWR++G V
Sbjct:    73 FTMEMNAFADTTGEEFRKSLSDI------LIPAAVTNPSAQKQVSIGLPNFKDWRKEGYV 126

Query:   144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKA 201
             T ++NQG CGSCWAF+AV A+EG      G L  LS Q L+DCS    NNGC  G   +A
Sbjct:   127 TPVRNQGKCGSCWAFAAVGAIEGQMFSKTGNLTPLSVQNLLDCSKSEGNNGCRWGTAHQA 186

Query:   202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
             F Y+++NKGL  EA YPY+ + G C    E A+A   G + +LP  + +  +   +  PV
Sbjct:   187 FNYVLKNKGLEAEATYPYEGKDGPCRYHSENASANITG-FVNLPPNELYLWVAVASIGPV 245

Query:   262 SVCVEASGQAFRFYKRGVLNA-ECGDNC-DHGVAVVGFG-TAEEEDGAKYWLIKNSWGET 318
             S  ++AS  +FRFY  GV +   C     +H V VVG+G    E DG  YWLIKNSWGE 
Sbjct:   246 SAAIDASHDSFRFYSGGVYHEPNCSSYVVNHAVLVVGYGFEGNETDGNNYWLIKNSWGEE 305

Query:   319 WGESGYIRILRDEGL-CGIATEASYP 343
             WG +G+++I +D    CGIA++AS+P
Sbjct:   306 WGINGFMKIAKDRNNHCGIASQASFP 331


>FB|FBgn0260462
            symbol:CG12163 species:7227 "Drosophila melanogaster"
            [GO:0035071 "salivary gland cell autophagic cell death"
            evidence=IEP] [GO:0048102 "autophagic cell death" evidence=IEP]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0004869 "cysteine-type
            endopeptidase inhibitor activity" evidence=IEA] [GO:0045169
            "fusome" evidence=IDA] [GO:0035220 "wing disc development"
            evidence=IGI] [GO:0022416 "chaeta development" evidence=IGI]
            InterPro:IPR000010 InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00043 SMART:SM00645
            InterPro:IPR000169 EMBL:AE014297 GO:GO:0004869 eggNOG:COG4870
            KO:K01373 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0022416 GO:GO:0035220 GO:GO:0035071
            GO:GO:0045169 GeneTree:ENSGT00660000095458 EMBL:AY121614
            EMBL:BT003231 RefSeq:NP_649521.1 RefSeq:NP_730901.1
            RefSeq:NP_730902.2 UniGene:Dm.7315 ProteinModelPortal:Q9VN93
            SMR:Q9VN93 DIP:DIP-17491N IntAct:Q9VN93 MINT:MINT-763966
            STRING:Q9VN93 MEROPS:C01.A27 PaxDb:Q9VN93
            EnsemblMetazoa:FBtr0078823 GeneID:40628 KEGG:dme:Dmel_CG12163
            UCSC:CG12163-RA FlyBase:FBgn0260462 InParanoid:Q9VN93 OMA:GPRWGEQ
            OrthoDB:EOG4CC2G9 PhylomeDB:Q9VN93 GenomeRNAi:40628 NextBio:819744
            Bgee:Q9VN93 GermOnline:CG12163 Uniprot:Q9VN93
        Length = 614

 Score = 575 (207.5 bits), Expect = 8.6e-56, P = 8.6e-56
 Identities = 119/309 (38%), Positives = 177/309 (57%)

Query:    44 QWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASY 103
             ++  + GR Y    E+ MRL IF+QNL+ IE+ N     + K G  EF+D+T+ E++   
Sbjct:   310 KFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSEYKER- 368

Query:   104 TGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAA 163
             TG                   Y    ++P   DWR+K AVT +KNQG CGSCWAFS    
Sbjct:   369 TGLWQRDEAKATGGSAAVVPAYHG--ELPKEFDWRQKDAVTQVKNQGSCGSCWAFSVTGN 426

Query:   164 VEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQ 223
             +EG+  +  G+L E SEQ+L+DC T ++ C+GGLMD A++ I +  GL  EA+YPY+ ++
Sbjct:   427 IEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAIKDIGGLEYEAEYPYKAKK 486

Query:   224 GTCDKQKEKAAAATIGKYEDLPKGDEHALLQ-AVTKQPVSVCVEASGQAFRFYKRGVLN- 281
               C   +  +     G + DLPKG+E A+ +  +   P+S+ + A+  A +FY+ GV + 
Sbjct:   487 NQCHFNRTLSHVQVAG-FVDLPKGNETAMQEWLLANGPISIGINAN--AMQFYRGGVSHP 543

Query:   282 --AECGD-NCDHGVAVVGFGTAEEEDGAK---YWLIKNSWGETWGESGYIRILRDEGLCG 335
               A C   N DHGV VVG+G ++  +  K   YW++KNSWG  WGE GY R+ R +  CG
Sbjct:   544 WKALCSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYRVYRGDNTCG 603

Query:   336 IATEASYPV 344
             ++  A+  V
Sbjct:   604 VSEMATSAV 612


>TAIR|locus:2078312
            symbol:AT3G45310 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005773 EMBL:CP002686
            GenomeReviews:BA000014_GR eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AL132953
            EMBL:AY091771 IPI:IPI00540369 PIR:T47471 RefSeq:NP_566880.1
            UniGene:At.25239 ProteinModelPortal:Q8RWQ9 SMR:Q8RWQ9
            MEROPS:C01.162 PaxDb:Q8RWQ9 PRIDE:Q8RWQ9 EnsemblPlants:AT3G45310.1
            GeneID:823669 KEGG:ath:AT3G45310 GeneFarm:5032 TAIR:At3g45310
            InParanoid:Q8RWQ9 KO:K01366 OMA:AFEVVHE PhylomeDB:Q8RWQ9
            ProtClustDB:CLSN2689015 Genevestigator:Q8RWQ9 Uniprot:Q8RWQ9
        Length = 358

 Score = 575 (207.5 bits), Expect = 8.6e-56, P = 8.6e-56
 Identities = 134/324 (41%), Positives = 180/324 (55%)

Query:    28 VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLG 87
             + G+S H    V    ++  ++G+ Y+   E  +R ++FK+NL+ I   NK+G  +YKL 
Sbjct:    49 ILGQSRH----VLSFSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKG-LSYKLS 103

Query:    88 TNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIK 147
              N+F+DLT +EF+    G                + K    T VP + DWRE G V+ +K
Sbjct:   104 LNQFADLTWQEFQRYKLG-----AAQNCSATLKGSHKITEAT-VPDTKDWREDGIVSPVK 157

Query:   148 NQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS-TDNN-GCSGGLMDKAFEYI 205
              QGHCGSCW FS   A+E       GK I LSEQQLVDC+ T NN GC GGL  +AFEYI
Sbjct:   158 EQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYI 217

Query:   206 IENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVC 264
               N GL TE  YPY  + G C K   K     +    ++  G E  L  AV   +PVSV 
Sbjct:   218 KYNGGLDTEEAYPYTGKDGGC-KFSAKNIGVQVRDSVNITLGAEDELKHAVGLVRPVSVA 276

Query:   265 VEASGQAFRFYKRGVLNAE-CGD---NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWG 320
              E   + FRFYK+GV  +  CG+   + +H V  VG+G    ED   YWLIKNSWG  WG
Sbjct:   277 FEVVHE-FRFYKKGVFTSNTCGNTPMDVNHAVLAVGYGV---EDDVPYWLIKNSWGGEWG 332

Query:   321 ESGYIRILRDEGLCGIATEASYPV 344
             ++GY ++   + +CG+AT +SYPV
Sbjct:   333 DNGYFKMEMGKNMCGVATCSSYPV 356


>UNIPROTKB|O46427
            symbol:CTSH "Pro-cathepsin H" species:9823 "Sus scrofa"
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0032526 "response to retinoic acid" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0043129
            "surfactant homeostasis" evidence=ISS] [GO:0010815 "bradykinin
            catabolic process" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0005829 "cytosol"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0030335 "positive regulation of cell
            migration" evidence=ISS] [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0016505 "apoptotic protease activator
            activity" evidence=ISS] [GO:0004252 "serine-type endopeptidase
            activity" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=ISS] [GO:0031638 "zymogen activation"
            evidence=ISS] [GO:0030108 "HLA-A specific activating MHC class I
            receptor activity" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0010628 "positive regulation of gene
            expression" evidence=ISS] [GO:0070324 "thyroid hormone binding"
            evidence=ISS] [GO:0006508 "proteolysis" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0060448
            "dichotomous subdivision of terminal units involved in lung
            branching" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] [GO:0005764 "lysosome" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0004177 "aminopeptidase
            activity" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=ISS] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 MEROPS:C01.040 CTD:1512 OMA:STSCHKT
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 EMBL:AF001169
            RefSeq:NP_999094.1 UniGene:Ssc.3593 PDB:1NB3 PDB:1NB5 PDB:8PCH
            PDBsum:1NB3 PDBsum:1NB5 PDBsum:8PCH ProteinModelPortal:O46427
            SMR:O46427 Ensembl:ENSSSCT00000001983 GeneID:396969 KEGG:ssc:396969
            EvolutionaryTrace:O46427 ArrayExpress:O46427 Uniprot:O46427
        Length = 335

 Score = 571 (206.1 bits), Expect = 2.3e-55, P = 2.3e-55
 Identities = 131/327 (40%), Positives = 179/327 (54%)

Query:    30 GRSMHEPSIVEK-H-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLG 87
             G S    S  EK H + WM QH + Y  E E   RL +F  N   I  A+  GN T+KLG
Sbjct:    21 GASNLAVSSFEKLHFKSWMVQHQKKYSLE-EYHHRLQVFVSNWRKIN-AHNAGNHTFKLG 78

Query:    88 TNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGA-VTHI 146
              N+FSD++ +E R  Y                  T  Y      P S+DWR+KG  V+ +
Sbjct:    79 LNQFSDMSFDEIRHKYLWSEPQNCSATKGNYLRGTGPY------PPSMDWRKKGNFVSPV 132

Query:   147 KNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEY 204
             KNQG CGSCW FS   A+E    I  GK++ L+EQQLVDC+ +  N+GC GGL  +AFEY
Sbjct:   133 KNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEY 192

Query:   205 IIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSV 263
             I  NKG+  E  YPY+ +   C  Q +KA A  +    ++   DE A+++AV    PVS 
Sbjct:   193 IRYNKGIMGEDTYPYKGQDDHCKFQPDKAIAF-VKDVANITMNDEEAMVEAVALYNPVSF 251

Query:   264 CVEASGQAFRFYKRGVLNA-ECG---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETW 319
               E +   F  Y++G+ ++  C    D  +H V  VG+G   EE+G  YW++KNSWG  W
Sbjct:   252 AFEVTND-FLMYRKGIYSSTSCHKTPDKVNHAVLAVGYG---EENGIPYWIVKNSWGPQW 307

Query:   320 GESGYIRILRDEGLCGIATEASYPVAM 346
             G +GY  I R + +CG+A  ASYP+ +
Sbjct:   308 GMNGYFLIERGKNMCGLAACASYPIPL 334


>RGD|1562210
            symbol:MGC114246 "similar to cathepsin R" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1562210 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 EMBL:CH474032 MEROPS:C01.042 KO:K09601
            OMA:ASHESFK OrthoDB:EOG4ZCT6D EMBL:BC091563 IPI:IPI00555186
            RefSeq:NP_001017509.1 UniGene:Rn.198321 SMR:Q5BJA0
            Ensembl:ENSRNOT00000061470 GeneID:498688 KEGG:rno:498688
            UCSC:RGD:1562210 InParanoid:Q5BJA0 NextBio:700535
            Genevestigator:Q5BJA0 Uniprot:Q5BJA0
        Length = 334

 Score = 570 (205.7 bits), Expect = 2.9e-55, P = 2.9e-55
 Identities = 122/326 (37%), Positives = 183/326 (56%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRT 83
             V SG  + +PS+  + ++W  ++ ++Y  E E+ +R  ++++NL+ I+  N E   G   
Sbjct:    14 VASGAPILDPSLDAEWQEWKKKYDKSYSLE-EEELRRAVWEENLKMIKLHNGENGLGKNG 72

Query:    84 YKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAV 143
             + +  NEF D T EEFR     +                 K    +  P  +DWR+KG V
Sbjct:    73 FTMEINEFGDTTGEEFRKMMVEFPVQTHREGKSI-----MKRAAGSIFPKFVDWRKKGYV 127

Query:   144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKA 201
             T ++ QG+C +CWAFS   A+E  T    GKLI LS Q LVDCS    NNGC GG    A
Sbjct:   128 TPVRRQGNCNACWAFSVTGAIEAQTIWQSGKLIPLSVQNLVDCSKPQGNNGCLGGDTYNA 187

Query:   202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
             F+Y++ N GL +EA YPY+ + G C +   K ++A I  +  LP+ ++  ++   T  P+
Sbjct:   188 FQYVLHNGGLQSEATYPYEGKDGPC-RYNPKNSSAEITGFVSLPESEDILMVAVATIGPI 246

Query:   262 SVCVEASGQAFRFYKRGVLNA-ECGDNC-DHGVAVVGFG-TAEEEDGAKYWLIKNSWGET 318
             S  ++AS ++F+FYK+G+ +   C  N   HGV VVG+G    +  G  YWLIKNSWG+ 
Sbjct:   247 SAGIDASHESFKFYKKGIYHEPNCSSNSVTHGVLVVGYGFKGNDTGGDHYWLIKNSWGKQ 306

Query:   319 WGESGYIRILRDEGL-CGIATEASYP 343
             WG  GY++I +D+   C IA+ A YP
Sbjct:   307 WGIRGYMKITKDKNNHCAIASYAHYP 332


>UNIPROTKB|G1RBY1
            symbol:CTSH "Uncharacterized protein" species:61853
            "Nomascus leucogenys" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 EMBL:ADFV01087552 RefSeq:XP_003275518.1
            Ensembl:ENSNLET00000011249 GeneID:100584322 Uniprot:G1RBY1
        Length = 335

 Score = 569 (205.4 bits), Expect = 3.7e-55, P = 3.7e-55
 Identities = 129/329 (39%), Positives = 182/329 (55%)

Query:    28 VSGRSMHEPSIVEK-H-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYK 85
             V G +    + +EK H + WM++H +TY  E E   RL +F  N   I  A+  GN T+K
Sbjct:    19 VCGAAELSVNSLEKFHFKSWMSKHHKTYSTE-EYHHRLQMFASNWRKIN-AHNNGNHTFK 76

Query:    86 LGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGA-VT 144
             +  N+FSD++  E +  Y                  T  Y      P S+DWR+KG  V+
Sbjct:    77 MALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPY------PPSMDWRKKGNFVS 130

Query:   145 HIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAF 202
              +KNQG CGSCW FS   A+E    I  GK++ L+EQQLVDC+ D  N+GC GGL  +AF
Sbjct:   131 PVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAF 190

Query:   203 EYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPV 261
             EYI+ NKG+  E  YPYQ + G C K +   A   +    ++   DE A+++AV    PV
Sbjct:   191 EYILYNKGIMGEDTYPYQGKDGYC-KFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPV 249

Query:   262 SVCVEASGQAFRFYKRGVLNA-ECG---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGE 317
             S   E + Q F  Y+RG+ ++  C    D  +H V  VG+G   E++G  YW++KNSWG 
Sbjct:   250 SFAFEVT-QDFMMYRRGIYSSTSCHKTPDKVNHAVLAVGYG---EKNGIPYWIVKNSWGP 305

Query:   318 TWGESGYIRILRDEGLCGIATEASYPVAM 346
              WG +GY  I R + +CG+A  ASYP+ +
Sbjct:   306 QWGMNGYFLIERGKNMCGLAACASYPIPL 334


>UNIPROTKB|G1M0X4
            symbol:CTSH "Uncharacterized protein" species:9646
            "Ailuropoda melanoleuca" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 EMBL:ACTA01057330 EMBL:ACTA01065330
            Ensembl:ENSAMET00000013529 Uniprot:G1M0X4
        Length = 337

 Score = 568 (205.0 bits), Expect = 4.8e-55, P = 4.8e-55
 Identities = 125/312 (40%), Positives = 170/312 (54%)

Query:    43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
             + WM QH + Y  E E   RL  F  N   I  A+  GN T+K+G N+FSD++  E +  
Sbjct:    38 KSWMVQHQKKYSSE-EYQHRLRTFVGNWRKIN-AHNAGNHTFKMGLNQFSDMSFAEIKRK 95

Query:   103 YTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGA-VTHIKNQGHCGSCWAFSAV 161
             Y                  T  Y      P  +DWR+KG  V+ +KNQG CGSCW FS  
Sbjct:    96 YLWSEPQNCSATKGNYLRGTGPY------PPFVDWRKKGKFVSPVKNQGGCGSCWTFSTT 149

Query:   162 AAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPY 219
              A+E    I  GKL+ L+EQQLVDC+ D  N+GC GGL  +AFEYI  N+G+  E  YPY
Sbjct:   150 GALESAIAIKTGKLLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYIRYNRGIMGEDSYPY 209

Query:   220 QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAFRFYKRG 278
             + + G C  Q  KA A  +    ++   DE A+++AV    PVS   E +G  F  Y++G
Sbjct:   210 KGQDGDCKFQPSKAIAF-VKDVANITINDEQAMVEAVALFNPVSFAFEVTGD-FMMYRKG 267

Query:   279 VLNA-ECG---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLC 334
             V ++  C    D  +H V  VG+G   E++G  YW++KNSWG  WG  GY  I R + +C
Sbjct:   268 VYSSTSCHKTPDKVNHAVLAVGYG---EQNGVPYWIVKNSWGPQWGMHGYFLIERGKNMC 324

Query:   335 GIATEASYPVAM 346
             G+A  ASYP+ +
Sbjct:   325 GLAACASYPIPL 336


>UNIPROTKB|Q3T0I2
            symbol:CTSH "Pro-cathepsin H" species:9913 "Bos taurus"
            [GO:0031638 "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISS] [GO:0010815 "bradykinin
            catabolic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0010813 "neuropeptide catabolic
            process" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0045766 "positive regulation of angiogenesis" evidence=ISS]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=ISS] [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0033619 "membrane protein proteolysis" evidence=ISS]
            [GO:0043066 "negative regulation of apoptotic process"
            evidence=ISS] [GO:0004252 "serine-type endopeptidase activity"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0016505 "apoptotic protease activator activity"
            evidence=ISS] [GO:0010952 "positive regulation of peptidase
            activity" evidence=ISS] [GO:0030108 "HLA-A specific activating MHC
            class I receptor activity" evidence=ISS] [GO:0002764 "immune
            response-regulating signaling pathway" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0070324 "thyroid
            hormone binding" evidence=ISS] [GO:0006508 "proteolysis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0097208
            "alveolar lamellar body" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004175
            "endopeptidase activity" evidence=ISS] [GO:0032526 "response to
            retinoic acid" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 EMBL:BC102386 IPI:IPI00693034
            RefSeq:NP_001029557.1 UniGene:Bt.52393 ProteinModelPortal:Q3T0I2
            SMR:Q3T0I2 STRING:Q3T0I2 MEROPS:C01.040 PRIDE:Q3T0I2
            Ensembl:ENSBTAT00000014593 GeneID:510524 KEGG:bta:510524 CTD:1512
            InParanoid:Q3T0I2 OMA:STSCHKT OrthoDB:EOG4W9J43 NextBio:20869490
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 Uniprot:Q3T0I2
        Length = 335

 Score = 567 (204.7 bits), Expect = 6.1e-55, P = 6.1e-55
 Identities = 129/318 (40%), Positives = 172/318 (54%)

Query:    39 VEK-HEQ-WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTN 96
             +EK H Q WM QH + Y  E E   RL  F  NL  I   N   N T+K+G N+FSD++ 
Sbjct:    30 LEKFHFQSWMVQHQKKYSSE-EYYHRLQAFASNLREINAHNAR-NHTFKMGLNQFSDMSF 87

Query:    97 EEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGA-VTHIKNQGHCGSC 155
             +E +  Y                  T  Y      P S+DWR+KG  VT +KNQG CGSC
Sbjct:    88 DELKRKYLWSEPQNCSATKSNYLRGTGPY------PPSMDWRKKGNFVTPVKNQGSCGSC 141

Query:   156 WAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLAT 213
             W FS   A+E    I  GKL  L+EQQLVDC+ +  N+GC GGL  +AFEYI  NKG+  
Sbjct:   142 WTFSTTGALESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMG 201

Query:   214 EADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVT-KQPVSVCVEASGQAF 272
             E  YPY+ + G C  Q  KA A  +    ++   DE A+++AV    PVS   E +   F
Sbjct:   202 EDTYPYRGQDGDCKYQPSKAIAF-VKDVANITLNDEEAMVEAVALHNPVSFAFEVTAD-F 259

Query:   273 RFYKRGVLNA-ECG---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRIL 328
               Y++G+ ++  C    D  +H V  VG+G   EE G  YW++KNSWG  WG  GY  I 
Sbjct:   260 MMYRKGIYSSTSCHKTPDKVNHAVLAVGYG---EEKGIPYWIVKNSWGPNWGMKGYFLIE 316

Query:   329 RDEGLCGIATEASYPVAM 346
             R + +CG+A  AS+P+ +
Sbjct:   317 RGKNMCGLAACASFPIPL 334


>UNIPROTKB|F6R7P5
            symbol:CTSH "Uncharacterized protein" species:9544 "Macaca
            mulatta" [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=ISS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0004252
            "serine-type endopeptidase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005829 "cytosol" evidence=ISS] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0010815 "bradykinin catabolic
            process" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0016505 "apoptotic protease
            activator activity" evidence=ISS] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=ISS] [GO:0030335
            "positive regulation of cell migration" evidence=ISS] [GO:0031638
            "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0032526 "response to retinoic
            acid" evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            GO:GO:0043066 GO:GO:0005615 GO:GO:0008284 GO:GO:0070371
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458 CTD:1512
            OMA:STSCHKT GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 RefSeq:XP_001108862.1
            UniGene:Mmu.3000 Ensembl:ENSMMUT00000014095 GeneID:711437
            KEGG:mcc:711437 NextBio:19969972 Uniprot:F6R7P5
        Length = 335

 Score = 567 (204.7 bits), Expect = 6.1e-55, P = 6.1e-55
 Identities = 129/329 (39%), Positives = 180/329 (54%)

Query:    28 VSGRSMHEPSIVEK-H-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYK 85
             V G +    + +EK H + WM++H +TY  E E   R+  F  N   I  A+  GN T+K
Sbjct:    19 VCGAAELSVNSLEKFHFKSWMSKHHKTYSTE-EYHHRMQTFASNWRKIN-AHNNGNHTFK 76

Query:    86 LGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGA-VT 144
             +  N+FSD++  E +  Y                  T  Y      P S+DWR+KG  V+
Sbjct:    77 MALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPY------PPSMDWRKKGNFVS 130

Query:   145 HIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAF 202
              +KNQG CGSCW FS   A+E    I  GK++ L+EQQLVDC+ D  N+GC GGL  +AF
Sbjct:   131 PVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAF 190

Query:   203 EYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPV 261
             EYI+ NKG+  E  YPYQ + G C K +   A   +    ++   DE A+++AV    PV
Sbjct:   191 EYILYNKGIMGEDTYPYQGKDGDC-KFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPV 249

Query:   262 SVCVEASGQAFRFYKRGVLNA-ECG---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGE 317
             S   E + Q F  YK G+ ++  C    D  +H V  VG+G   EE+G  YW++KNSWG 
Sbjct:   250 SFAFEVT-QDFMIYKTGIYSSTSCHKTPDKVNHAVLAVGYG---EENGIPYWIVKNSWGP 305

Query:   318 TWGESGYIRILRDEGLCGIATEASYPVAM 346
              WG +GY  I R + +CG+A  ASYP+ +
Sbjct:   306 QWGMNGYFLIERGKNMCGLAACASYPIPL 334


>UNIPROTKB|E9PSK9
            symbol:Ctsql2 "Protein Ctsql2" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            IPI:IPI00562656 Ensembl:ENSRNOT00000045847 RGD:1303225
            ArrayExpress:E9PSK9 Uniprot:E9PSK9
        Length = 342

 Score = 567 (204.7 bits), Expect = 6.1e-55, P = 6.1e-55
 Identities = 129/332 (38%), Positives = 183/332 (55%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRT 83
             VVSG S    S+  + ++W  ++ + Y  E E+ ++  ++++N++ IE  N+E   G  T
Sbjct:    14 VVSGASAFNLSLDVQWQEWKMKYEKLYSPE-EELLKRVVWEENVKKIELHNRENSLGKNT 72

Query:    84 YKLGTNEFSDLTNEEFRASYTGY-----NXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWR 138
             Y +  N F+DLT+EEF+   TG      N                 +     +P SIDWR
Sbjct:    73 YIMEINNFADLTDEEFKDMITGITLPINNTMKSLWKRALGSPFPNSWYWRDALPKSIDWR 132

Query:   139 EKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGG 196
             ++G VT ++ QG C SCWAF    A+EG      GKL  LS Q LVDCS    N GC GG
Sbjct:   133 KEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGG 192

Query:   197 LMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV 256
                 AF+Y+++N GL +EA YPY+ ++G C K   K A A I ++  LP+ DE  L+ A+
Sbjct:   193 TTYNAFQYVLQNGGLESEATYPYKGKEGLC-KYNPKNAYAKITRFVALPE-DEDVLMDAL 250

Query:   257 -TKQPVSVCVEASGQAFRFYKRGVLNA-ECGDNCDHGVAVVGFG-TAEEEDGAKYWLIKN 313
              TK PV+  +      F F   G+ +  +C +  +H V VVG+G    E DG  YWLIKN
Sbjct:   251 ATKGPVAAGIHVVYSYFHFVS-GIYHEPKCNNRVNHAVLVVGYGFEGNETDGNNYWLIKN 309

Query:   314 SWGETWGESGYIRILRDEGL-CGIATEASYPV 344
             SWG+ WG  GY++I +D    CGIAT A YP+
Sbjct:   310 SWGKQWGLKGYMKIAKDRNNHCGIATFAQYPI 341


>UNIPROTKB|P09668
            symbol:CTSH "Pro-cathepsin H" species:9606 "Homo sapiens"
            [GO:0001520 "outer dense fiber" evidence=IEA] [GO:0001669
            "acrosomal vesicle" evidence=IEA] [GO:0007283 "spermatogenesis"
            evidence=IEA] [GO:0030984 "kininogen binding" evidence=IEA]
            [GO:0032403 "protein complex binding" evidence=IEA] [GO:0043621
            "protein self-association" evidence=IEA] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0031648 "protein destabilization"
            evidence=IMP] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=ISS] [GO:0032526 "response to retinoic acid"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0030108 "HLA-A
            specific activating MHC class I receptor activity" evidence=IDA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEP] [GO:0010813 "neuropeptide catabolic process"
            evidence=IDA] [GO:0010815 "bradykinin catabolic process"
            evidence=IDA] [GO:0030335 "positive regulation of cell migration"
            evidence=IDA] [GO:0070371 "ERK1 and ERK2 cascade" evidence=IDA]
            [GO:0010628 "positive regulation of gene expression" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IDA;TAS] [GO:0031638 "zymogen
            activation" evidence=IDA] [GO:0016505 "apoptotic protease activator
            activity" evidence=IDA] [GO:0010952 "positive regulation of
            peptidase activity" evidence=IDA] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=ISS] [GO:0004252 "serine-type endopeptidase
            activity" evidence=ISS] [GO:0043066 "negative regulation of
            apoptotic process" evidence=ISS] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0033619 "membrane protein proteolysis"
            evidence=IDA] [GO:0004175 "endopeptidase activity" evidence=IDA]
            [GO:0004177 "aminopeptidase activity" evidence=IDA] [GO:0005764
            "lysosome" evidence=IDA] [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0002250 "adaptive immune response" evidence=IEP]
            [GO:0019882 "antigen processing and presentation" evidence=TAS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0070324 "thyroid hormone binding" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0008284
            "positive regulation of cell proliferation" evidence=ISS]
            [GO:0045766 "positive regulation of angiogenesis" evidence=ISS]
            [GO:0008233 "peptidase activity" evidence=IDA] [GO:0097208
            "alveolar lamellar body" evidence=IDA] [GO:0043129 "surfactant
            homeostasis" evidence=IDA] [GO:0004197 "cysteine-type endopeptidase
            activity" evidence=IDA;TAS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 Reactome:REACT_6900 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913 MEROPS:C01.040 CTD:1512
            OMA:STSCHKT OrthoDB:EOG4W9J43 GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 EMBL:X16832 EMBL:AF426247 EMBL:AK314698 EMBL:AC011944
            EMBL:BC002479 EMBL:X07549 IPI:IPI00297487 PIR:S12486
            RefSeq:NP_004381.2 UniGene:Hs.148641 PDB:1BZN PDBsum:1BZN
            ProteinModelPortal:P09668 SMR:P09668 IntAct:P09668 STRING:P09668
            PhosphoSite:P09668 DMDM:288558851 PaxDb:P09668 PRIDE:P09668
            DNASU:1512 Ensembl:ENST00000220166 GeneID:1512 KEGG:hsa:1512
            UCSC:uc021srk.1 GeneCards:GC15M079213 H-InvDB:HIX0012481
            HGNC:HGNC:2535 HPA:CAB000458 HPA:HPA003524 MIM:116820
            neXtProt:NX_P09668 PharmGKB:PA27033 InParanoid:P09668
            PhylomeDB:P09668 BRENDA:3.4.22.16 ChEMBL:CHEMBL2225 GenomeRNAi:1512
            NextBio:6261 ArrayExpress:P09668 Bgee:P09668 CleanEx:HS_CTSH
            Genevestigator:P09668 GermOnline:ENSG00000103811 GO:GO:0019882
            Uniprot:P09668
        Length = 335

 Score = 565 (203.9 bits), Expect = 9.9e-55, P = 9.9e-55
 Identities = 127/318 (39%), Positives = 176/318 (55%)

Query:    39 VEK-H-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTN 96
             +EK H + WM++H +TY  E E   RL  F  N   I  A+  GN T+K+  N+FSD++ 
Sbjct:    30 LEKFHFKSWMSKHRKTYSTE-EYHHRLQTFASNWRKIN-AHNNGNHTFKMALNQFSDMSF 87

Query:    97 EEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGA-VTHIKNQGHCGSC 155
              E +  Y                  T  Y      P S+DWR+KG  V+ +KNQG CGSC
Sbjct:    88 AEIKHKYLWSEPQNCSATKSNYLRGTGPY------PPSVDWRKKGNFVSPVKNQGACGSC 141

Query:   156 WAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLAT 213
             W FS   A+E    I  GK++ L+EQQLVDC+ D  N+GC GGL  +AFEYI+ NKG+  
Sbjct:   142 WTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMG 201

Query:   214 EADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAF 272
             E  YPYQ + G C  Q  KA    +    ++   DE A+++AV    PVS   E + Q F
Sbjct:   202 EDTYPYQGKDGYCKFQPGKAIGF-VKDVANITIYDEEAMVEAVALYNPVSFAFEVT-QDF 259

Query:   273 RFYKRGVLNA-ECG---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRIL 328
               Y+ G+ ++  C    D  +H V  VG+G   E++G  YW++KNSWG  WG +GY  I 
Sbjct:   260 MMYRTGIYSSTSCHKTPDKVNHAVLAVGYG---EKNGIPYWIVKNSWGPQWGMNGYFLIE 316

Query:   329 RDEGLCGIATEASYPVAM 346
             R + +CG+A  ASYP+ +
Sbjct:   317 RGKNMCGLAACASYPIPL 334


>MGI|MGI:107285
            symbol:Ctsh "cathepsin H" species:10090 "Mus musculus"
            [GO:0001520 "outer dense fiber" evidence=ISO] [GO:0001669
            "acrosomal vesicle" evidence=ISO] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=IGI] [GO:0002764 "immune response-regulating
            signaling pathway" evidence=ISO] [GO:0004175 "endopeptidase
            activity" evidence=ISO;IMP] [GO:0004177 "aminopeptidase activity"
            evidence=ISO] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISO;IDA] [GO:0004252 "serine-type endopeptidase activity"
            evidence=IMP] [GO:0005615 "extracellular space" evidence=ISO]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0005829 "cytosol"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=ISO;IMP]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0008284
            "positive regulation of cell proliferation" evidence=IMP]
            [GO:0010628 "positive regulation of gene expression" evidence=ISO]
            [GO:0010634 "positive regulation of epithelial cell migration"
            evidence=IMP] [GO:0010813 "neuropeptide catabolic process"
            evidence=ISO] [GO:0010815 "bradykinin catabolic process"
            evidence=ISO] [GO:0010952 "positive regulation of peptidase
            activity" evidence=IGI;ISO] [GO:0016505 "apoptotic protease
            activator activity" evidence=IGI;ISO] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0030108 "HLA-A specific activating MHC
            class I receptor activity" evidence=ISO] [GO:0030335 "positive
            regulation of cell migration" evidence=ISO] [GO:0030984 "kininogen
            binding" evidence=ISO] [GO:0031638 "zymogen activation"
            evidence=ISO;IMP] [GO:0031648 "protein destabilization"
            evidence=ISO;IMP] [GO:0032403 "protein complex binding"
            evidence=ISO] [GO:0032526 "response to retinoic acid" evidence=IDA]
            [GO:0033619 "membrane protein proteolysis" evidence=ISO;IMP]
            [GO:0035085 "cilium axoneme" evidence=ISO] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IMP] [GO:0043129
            "surfactant homeostasis" evidence=ISO] [GO:0043621 "protein
            self-association" evidence=ISO] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IMP] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=IMP]
            [GO:0070324 "thyroid hormone binding" evidence=ISO] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISO] [GO:0097208 "alveolar
            lamellar body" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:107285 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 EMBL:CH466560 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            HOVERGEN:HBG011513 GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT OrthoDB:EOG4W9J43
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 BRENDA:3.4.22.16
            EMBL:U06119 EMBL:AK149949 EMBL:AK150583 EMBL:AK157376 EMBL:AK160026
            EMBL:Y18464 IPI:IPI00118987 RefSeq:NP_031827.2 UniGene:Mm.2277
            ProteinModelPortal:P49935 SMR:P49935 STRING:P49935 MEROPS:I29.003
            PhosphoSite:P49935 PaxDb:P49935 PRIDE:P49935
            Ensembl:ENSMUST00000034915 GeneID:13036 KEGG:mmu:13036
            InParanoid:Q3UCD6 ChEMBL:CHEMBL1949491 NextBio:282920 Bgee:P49935
            CleanEx:MM_CTSH Genevestigator:P49935 GermOnline:ENSMUSG00000032359
            Uniprot:P49935
        Length = 333

 Score = 564 (203.6 bits), Expect = 1.3e-54, P = 1.3e-54
 Identities = 124/316 (39%), Positives = 179/316 (56%)

Query:    39 VEK-H-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTN 96
             +EK H + WM QH +TY   +E   RL +F  N   I+ A+ + N T+K+  N+FSD++ 
Sbjct:    28 IEKFHFKSWMKQHQKTYSS-VEYNHRLQMFANNWRKIQ-AHNQRNHTFKMALNQFSDMSF 85

Query:    97 EEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKG-AVTHIKNQGHCGSC 155
              E +  +                  T  Y      P+S+DWR+KG  V+ +KNQG CGSC
Sbjct:    86 AEIKHKFLWSEPQNCSATKSNYLRGTGPY------PSSMDWRKKGNVVSPVKNQGACGSC 139

Query:   156 WAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKGLAT 213
             W FS   A+E    I  GK++ L+EQQLVDC+   +N+GC GGL  +AFEYI+ NKG+  
Sbjct:   140 WTFSTTGALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYILYNKGIME 199

Query:   214 EADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAF 272
             E  YPY  +  +C    +KA A  +    ++   DE A+++AV    PVS   E + + F
Sbjct:   200 EDSYPYIGKDSSCRFNPQKAVAF-VKNVVNITLNDEAAMVEAVALYNPVSFAFEVT-EDF 257

Query:   273 RFYKRGVLNAE-CG---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRIL 328
               YK GV +++ C    D  +H V  VG+G   E++G  YW++KNSWG  WGE+GY  I 
Sbjct:   258 LMYKSGVYSSKSCHKTPDKVNHAVLAVGYG---EQNGLLYWIVKNSWGSQWGENGYFLIE 314

Query:   329 RDEGLCGIATEASYPV 344
             R + +CG+A  ASYP+
Sbjct:   315 RGKNMCGLAACASYPI 330


>UNIPROTKB|H9KYW5
            symbol:CTSS "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0002250 "adaptive immune response" evidence=IEA]
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 OMA:YEPACTQ EMBL:AADN02010496
            Ensembl:ENSGALT00000001122 Uniprot:H9KYW5
        Length = 245

 Score = 563 (203.2 bits), Expect = 1.6e-54, P = 1.6e-54
 Identities = 112/227 (49%), Positives = 144/227 (63%)

Query:   122 TFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQ 181
             T  Y+     P ++DWREKG VT +KNQG CG+CWAFSAV A+E   ++  GKL+ LS Q
Sbjct:    21 TSTYRRRGGAPDAMDWREKGCVTEVKNQGACGACWAFSAVGALEAQVKLKTGKLVSLSAQ 80

Query:   182 QLVDCST--DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIG 239
              LVDCS    N GC GG M +AF+YII+N G+ +E  YPY  + GTC +      AAT  
Sbjct:    81 NLVDCSMMYGNKGCGGGFMTRAFQYIIDNNGIDSEESYPYMAQNGTC-QYNVSTRAATCS 139

Query:   240 KYEDLPKGDEHALLQAVTKQ-PVSVCVEASGQAFRFYKRGVLN-AECGDNCDHGVAVVGF 297
             KY +LP  DE AL  AV    PVSV ++A+   F  Y+ GV +   C    +HGV VVG+
Sbjct:   140 KYVELPYADEAALKDAVANVGPVSVAIDATQPTFFLYRSGVYDDPRCTQEVNHGVLVVGY 199

Query:   298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGL-CGIATEASYP 343
             GT  E+D   +WL+KNSWGE +G+ GYIR+ R+    CGIA+ ASYP
Sbjct:   200 GTLNEKD---FWLVKNSWGERFGDGGYIRMSRNHANHCGIASYASYP 243


>UNIPROTKB|G3R9A7
            symbol:CTSH "Uncharacterized protein" species:9595 "Gorilla
            gorilla gorilla" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 OMA:STSCHKT GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 RefSeq:XP_004056662.1 Ensembl:ENSGGOT00000012331
            GeneID:101144312 Uniprot:G3R9A7
        Length = 335

 Score = 563 (203.2 bits), Expect = 1.6e-54, P = 1.6e-54
 Identities = 124/310 (40%), Positives = 171/310 (55%)

Query:    45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYT 104
             WM++H +TY  E E   RL  F  N   I  A+  GN T+K+  N+FSD++  E +  Y 
Sbjct:    38 WMSKHRKTYSTE-EYHHRLQTFASNWRKIN-AHNNGNHTFKMALNQFSDMSFAEIKHKYL 95

Query:   105 GYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGA-VTHIKNQGHCGSCWAFSAVAA 163
                              T  Y      P S+DWR+KG  V+ +KNQG CGSCW FS   A
Sbjct:    96 WSEPQNCSATKSNYLRGTGPY------PPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGA 149

Query:   164 VEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQ 221
             +E    I  GK++ L+EQQLVDC+ D  N+GC GGL  +AFEYI+ NKG+  E  YPYQ 
Sbjct:   150 LESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQG 209

Query:   222 EQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGVL 280
             + G C  Q  KA    +    ++   DE A+++AV    PVS   E + Q F  Y+ G+ 
Sbjct:   210 KDGYCKFQPGKAIGF-VKDVANITIYDEEAMVEAVALYNPVSFAFEVT-QDFMMYRTGIY 267

Query:   281 NA-ECG---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGI 336
             ++  C    D  +H V  VG+G   E++G  YW++KNSWG  WG +GY  I R + +CG+
Sbjct:   268 SSTSCHKTPDKVNHAVLAVGYG---EKNGIPYWIVKNSWGPKWGMNGYFLIERGKNMCGL 324

Query:   337 ATEASYPVAM 346
             A  ASYP+ +
Sbjct:   325 AACASYPIPL 334


>UNIPROTKB|F1NEC8
            symbol:CTSL2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:AADN02067812 IPI:IPI00820956 Ensembl:ENSGALT00000037988
            ArrayExpress:F1NEC8 Uniprot:F1NEC8
        Length = 218

 Score = 561 (202.5 bits), Expect = 2.6e-54, P = 2.6e-54
 Identities = 111/220 (50%), Positives = 148/220 (67%)

Query:   132 PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-- 189
             P S+DWREKG VT +K+QG CGSCWAFS   A+EG      GKL+ LSEQ LVDCS    
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEG 61

Query:   190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQ-QEQGTCDKQKEKAAAATIGKYEDLPKGD 248
             N GC+GGLMD+AF+Y+ +N G+ +E  YPY  ++   C  + E  AA   G + D+P+G 
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTG-FVDIPQGH 120

Query:   249 EHALLQAVTKQ-PVSVCVEASGQAFRFYKRGVL-NAECG-DNCDHGVAVVGFGTAEEEDG 305
             E AL++AV    PVSV ++A   +F+FY+ G+    +C  ++ DHGV VVG+G    EDG
Sbjct:   121 ERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGF---EDG 177

Query:   306 AKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
              KYW++KNSWGE WG+ GYI + +D +  CGIAT ASYP+
Sbjct:   178 KKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPL 217


>UNIPROTKB|P09648
            symbol:CTSL1 "Cathepsin L1" species:9031 "Gallus gallus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            OrthoDB:EOG48PMKF MEROPS:C01.032 IPI:IPI00602255 PIR:S00081
            UniGene:Gga.523 ProteinModelPortal:P09648 SMR:P09648 Uniprot:P09648
        Length = 218

 Score = 560 (202.2 bits), Expect = 3.4e-54, P = 3.4e-54
 Identities = 111/220 (50%), Positives = 148/220 (67%)

Query:   132 PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-- 189
             P S+DWREKG VT +K+QG CGSCWAFS   A+EG    T GKL+ LSEQ LVDCS    
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRPEG 61

Query:   190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQ-QEQGTCDKQKEKAAAATIGKYEDLPKGD 248
             N GC+GGLMD+AF+Y+ +N G+ +E  YPY  ++   C  + E  AA   G + D+P+G 
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTG-FVDIPQGH 120

Query:   249 EHALLQAVTKQ-PVSVCVEASGQAFRFYKRGVL-NAECG-DNCDHGVAVVGFGTAEEEDG 305
             E AL++AV    PVSV ++A   +F+FY+ G+    +C  ++ DHGV VVG+G    E G
Sbjct:   121 ERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGF---EGG 177

Query:   306 AKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
              KYW++KNSWGE WG+ GYI + +D +  CGIAT ASYP+
Sbjct:   178 KKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPL 217


>UNIPROTKB|G3SSC1
            symbol:CTSH "Uncharacterized protein" species:9785
            "Loxodonta africana" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 RefSeq:XP_003413898.1
            Ensembl:ENSLAFT00000003415 GeneID:100662496 Uniprot:G3SSC1
        Length = 335

 Score = 559 (201.8 bits), Expect = 4.3e-54, P = 4.3e-54
 Identities = 129/320 (40%), Positives = 170/320 (53%)

Query:    37 SIVEK-HEQ-WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
             S  EK H Q WMAQH + Y  E E   R   F  N   I   N   N T+K+  N+FSD+
Sbjct:    28 SSYEKFHFQSWMAQHQKKYSSE-EYHQRQQTFVSNWRKINAHNAR-NHTFKMALNQFSDM 85

Query:    95 TNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGA-VTHIKNQGHCG 153
             T  E +  Y                  T  Y      P  +DWR+KG  V+ +KNQG CG
Sbjct:    86 TFAEIKQKYLWSEPQNCSATKGNYLRGTGPY------PPFVDWRKKGHFVSPVKNQGACG 139

Query:   154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGL 211
             SCW FS   A+E    I GGKL+ L+EQQLVDC+ D  N+GC GGL  +AFEYI+ NKG+
Sbjct:   140 SCWTFSTTGALESAIAIAGGKLLSLAEQQLVDCAKDFNNHGCQGGLPSQAFEYILYNKGI 199

Query:   212 ATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQ 270
               E  YPY+ +   C  Q +KA A  +    ++   DE A+++AV    PVS   E +  
Sbjct:   200 MGEDTYPYKGQDDVCKFQPKKAIAF-VKDVANITLNDEEAMVEAVALYNPVSFAFEVTDD 258

Query:   271 AFRFYKRGVLNA-ECG---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
              F  Y +G+ ++  C    D  +H V  VG+G   EE G  YW++KNSWG  WG  GY  
Sbjct:   259 -FMKYSKGIYSSTSCHKTPDKVNHAVLAVGYG---EEKGIPYWIVKNSWGPYWGMDGYFL 314

Query:   327 ILRDEGLCGIATEASYPVAM 346
             I R + +CG+A  ASYP+ +
Sbjct:   315 IERGKNMCGLAACASYPIPL 334


>RGD|631421
            symbol:Ctsq "cathepsin Q" species:10116 "Rattus norvegicus"
            [GO:0005764 "lysosome" evidence=NAS] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 RGD:631421 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 UniGene:Rn.34875 EMBL:AF187323 IPI:IPI00214897
            PIR:JC7183 RefSeq:NP_640355.1 UniGene:Rn.35820
            ProteinModelPortal:Q9QZE3 SMR:Q9QZE3 STRING:Q9QZE3 MEROPS:C01.039
            PRIDE:Q9QZE3 Ensembl:ENSRNOT00000024208 GeneID:246147
            KEGG:rno:246147 UCSC:RGD:631421 CTD:104002 InParanoid:Q9QZE3
            OMA:ESEDVLM OrthoDB:EOG4HHP48 NextBio:623425 Genevestigator:Q9QZE3
            GermOnline:ENSRNOG00000017946 Uniprot:Q9QZE3
        Length = 343

 Score = 558 (201.5 bits), Expect = 5.5e-54, P = 5.5e-54
 Identities = 122/331 (36%), Positives = 184/331 (55%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRT 83
             VV G S  + S+  + ++W  ++ + Y  E E+ ++  ++++N++ IE  N+E   G  T
Sbjct:    14 VVPGASALDLSLDVQWQEWKIKYEKLYSPE-EEVLKRVVWEENVKKIELHNRENSLGKNT 72

Query:    84 YKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKY----QNVTD-VPTSIDWR 138
             Y +  N+F+D+T+EEF+    G+                  +     N  D +P  +DWR
Sbjct:    73 YTMEINDFADMTDEEFKDMIIGFQLPVHNTEKRLWKRALGSFFPNSWNWRDALPKFVDWR 132

Query:   139 EKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGG 196
              +G VT ++ QG C SCWAF    A+EG      GKLI LS Q L+DCS    N GC  G
Sbjct:   133 NEGYVTRVRKQGGCSSCWAFPVTGAIEGQMFKKTGKLIPLSVQNLIDCSKPQGNRGCLWG 192

Query:   197 LMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV 256
                 AF+Y++ N GL  EA YPY++++G C +   K ++A I  +  LP+ ++  L+ AV
Sbjct:   193 NTYNAFQYVLHNGGLEAEATYPYERKEGVC-RYNPKNSSAKITGFVVLPESED-VLMDAV 250

Query:   257 -TKQPVSVCVEASGQAFRFYKRGVLNA-ECGDNCDHGVAVVGFG-TAEEEDGAKYWLIKN 313
              TK P++  V     +FRFY++GV +  +C    +H V VVG+G    E DG  YWLIKN
Sbjct:   251 ATKGPIATGVHVISSSFRFYQKGVYHEPKCSSYVNHAVLVVGYGFEGNETDGNNYWLIKN 310

Query:   314 SWGETWGESGYIRILRDEGL-CGIATEASYP 343
             SWG+ WG  GY++I +D    C IA+ A YP
Sbjct:   311 SWGKRWGLRGYMKIAKDRNNHCAIASLAQYP 341


>UNIPROTKB|F7BJD8
            symbol:CTSH "Uncharacterized protein" species:9796 "Equus
            caballus" [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=ISS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0004252
            "serine-type endopeptidase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005829 "cytosol" evidence=ISS] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0010815 "bradykinin catabolic
            process" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0016505 "apoptotic protease
            activator activity" evidence=ISS] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=ISS] [GO:0030335
            "positive regulation of cell migration" evidence=ISS] [GO:0031638
            "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0032526 "response to retinoic
            acid" evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            GO:GO:0043066 GO:GO:0005615 GO:GO:0008284 GO:GO:0070371
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458
            OMA:STSCHKT GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129
            Ensembl:ENSECAT00000013967 Uniprot:F7BJD8
        Length = 305

 Score = 553 (199.7 bits), Expect = 1.9e-53, P = 1.9e-53
 Identities = 122/312 (39%), Positives = 169/312 (54%)

Query:    43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
             + WM QH + Y  E E   RL  F  N   I  A+  GN T+++G N+FS +   E +  
Sbjct:     6 KSWMVQHQKKYSSE-EYHHRLQTFVSNWRKIN-AHNTGNHTFRMGLNQFSAMNFAELKHK 63

Query:   103 YTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGA-VTHIKNQGHCGSCWAFSAV 161
             Y                     Y      P S+DWR+KG  V+ +KNQG CGSCW FS  
Sbjct:    64 YLWSEPQNCSATKGNYLRGAGPY------PPSVDWRKKGNFVSPVKNQGGCGSCWTFSTT 117

Query:   162 AAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPY 219
              A+E    I  GKL+ L+EQQLVDC+ +  N+GC GGL  +AFEYI  NKG+  E  YPY
Sbjct:   118 GALESAVAIASGKLLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPY 177

Query:   220 QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAFRFYKRG 278
             + + G C  Q  KA A  +    ++   DE A+++AV    PVS   E + + F  Y++G
Sbjct:   178 KGQDGDCKFQPNKAIAF-VKDVANITLNDEKAMVEAVALYNPVSFAFEVT-EDFMMYRKG 235

Query:   279 VLNA-ECG---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLC 334
             + ++  C    D  +H V  VG+G   EE+G  YW++KNSWG  WG +GY  I R + +C
Sbjct:   236 IYSSTSCHKTPDKVNHAVLAVGYG---EENGIPYWIVKNSWGPHWGMNGYFLIERGKNMC 292

Query:   335 GIATEASYPVAM 346
             G+A  ASYP+ +
Sbjct:   293 GLAACASYPIPL 304


>DICTYBASE|DDB_G0281605
            symbol:cfaD "peptidase C1A family protein"
            species:44689 "Dictyostelium discoideum" [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IDA] [GO:0006508 "proteolysis" evidence=IDA] [GO:0031410
            "cytoplasmic vesicle" evidence=IDA] [GO:0031288 "sorocarp
            morphogenesis" evidence=IMP] [GO:0008285 "negative regulation of
            cell proliferation" evidence=IGI;IDA] [GO:0005576 "extracellular
            region" evidence=IEA;IDA] [GO:0005515 "protein binding"
            evidence=IPI] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0281605
            GO:GO:0008285 GO:GO:0005615 GenomeReviews:CM000152_GR
            eggNOG:COG4870 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0031410 EMBL:AAFI02000042
            GO:GO:0031288 RefSeq:XP_640530.1 HSSP:P07711
            ProteinModelPortal:Q54TR1 STRING:Q54TR1 PRIDE:Q54TR1
            EnsemblProtists:DDB0229857 GeneID:8623140 KEGG:ddi:DDB_G0281605
            InParanoid:Q54TR1 OMA:PSAHEHE ProtClustDB:CLSZ2430523
            Uniprot:Q54TR1
        Length = 531

 Score = 552 (199.4 bits), Expect = 2.4e-53, P = 2.4e-53
 Identities = 112/310 (36%), Positives = 176/310 (56%)

Query:    43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
             +++ AQ+ + Y  + E   R   FK   + I   N + + +YKLG N ++DL+N+EF   
Sbjct:   226 KEYKAQYNKEYSSQDEHDERFINFKAARKIIATHNAKES-SYKLGMNHYADLSNKEFNTL 284

Query:   103 YTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVA 162
                                    +++  +P+++DWR +  VT +K+QG CGSCW F +  
Sbjct:   285 VKP----KVARPSVTGADSVHDDESLRSIPSTVDWRNQNCVTPVKDQGICGSCWTFGSTG 340

Query:   163 AVEGITQITGGKLIELSEQQLVDCS--TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQ 220
             ++EG   +T G+L+ LSEQQLVDC+  T + GC GG    AF+Y++E   LATE++YPY 
Sbjct:   341 SLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSLATESNYPYL 400

Query:   221 QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV 279
              + G C  +    +  +I  Y ++  G E AL  A+ T  PV++ ++AS   FR+Y  GV
Sbjct:   401 MQNGLCRDRTVTPSGVSITGYVNVTSGSESALQNAIATTGPVAIAIDASVDDFRYYMSGV 460

Query:   280 LN-AEC--G-DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR-DEGLC 334
              N   C  G D+ DH V  +G+GT + +D   Y+L+KNSW   WG  GY+ + R D  LC
Sbjct:   461 YNNPACKNGLDDLDHEVLAIGYGTYQGQD---YFLVKNSWSTNWGMDGYVYMARNDNNLC 517

Query:   335 GIATEASYPV 344
             G++++A+YP+
Sbjct:   518 GVSSQATYPI 527


>UNIPROTKB|J9P7C5
            symbol:J9P7C5 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 EMBL:AAEX03010953
            Ensembl:ENSCAFT00000012925 Uniprot:J9P7C5
        Length = 321

 Score = 551 (199.0 bits), Expect = 3.0e-53, P = 3.0e-53
 Identities = 124/316 (39%), Positives = 180/316 (56%)

Query:    36 PSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFS 92
             P + +++ QW A H R Y    E+  R  ++++N++ IE  N+E   G   + +  N F 
Sbjct:    19 PKLDQRY-QWKAMHRRLYGMN-EEGWRRAVWEKNMKMIELHNREYSQGKHGFTMAMNAFG 76

Query:    93 DLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHC 152
             D+TNEEFR    G+                F+     ++P S+DWREKG VT +KNQG C
Sbjct:    77 DMTNEEFRQVINGFQNQKHKKGK------VFQEPLFAEIPKSVDWREKGYVTPVKNQGQC 130

Query:   153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLA 212
             GSCWAFSA  A EG      G L+ LSEQ L      N GC+GGLMD AF+Y+ +N+ L 
Sbjct:   131 GSCWAFSATGAFEGQMFWKTGNLVPLSEQNLAQ---GNEGCNGGLMDNAFQYVKDNRCLD 187

Query:   213 TEADYPYQ-QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQ 270
             +E  YPY  ++  TC+ + E +AA   G + DLP+  E AL++A+ T   ++V ++A  Q
Sbjct:   188 SEESYPYLGRDTDTCNYKPECSAAHDSG-FVDLPQR-EKALMKAMATLGSITVAIDAGHQ 245

Query:   271 AFRFYKRGV-LNAECGD-NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRIL 328
              F+FYK  +  + +C   + DHGV VVG+G  E  D    W++KNSW   WG + Y+++ 
Sbjct:   246 YFQFYKSSIYFDPDCSSKDLDHGVLVVGYGF-EGTDSNNKWIVKNSWSPEWGWNSYVKMA 304

Query:   329 RDEGL-CGIATEASYP 343
             + +   CGI T ASYP
Sbjct:   305 KGQNNHCGI-TAASYP 319


>UNIPROTKB|F6X9C1
            symbol:CTSH "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00660000095458
            OMA:STSCHKT Ensembl:ENSCAFT00000036196 EMBL:AAEX03002388
            Uniprot:F6X9C1
        Length = 305

 Score = 546 (197.3 bits), Expect = 1.0e-52, P = 1.0e-52
 Identities = 120/312 (38%), Positives = 167/312 (53%)

Query:    43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
             + W  QH + Y  E E   RL  F  N   I  A+  GN T+K+G N+FSD+   E +  
Sbjct:     6 KSWAVQHQKKYSSE-EYLQRLQTFVGNWRKIN-AHNAGNHTFKMGLNQFSDMNFAEIKHK 63

Query:   103 YTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGA-VTHIKNQGHCGSCWAFSAV 161
             Y                  T  Y      P  +DWR+KG  V+ +KNQG CGSCW FS  
Sbjct:    64 YLWSEPQNCSATKGNYLRGTGPY------PPFVDWRKKGKFVSPVKNQGSCGSCWTFSTT 117

Query:   162 AAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPY 219
              A+E    I  GKL+ L+EQQLVDC+ +  N+GC GG   +AFEYI  NKG+  E  YPY
Sbjct:   118 GALESAIAIKSGKLLSLAEQQLVDCAQNFNNHGCQGGAPLQAFEYIRYNKGIMGEDSYPY 177

Query:   220 QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAFRFYKRG 278
             + + G C  Q  KA A  +    ++   DE A+++AV    PVS   E +   F  Y++G
Sbjct:   178 KGQDGDCKYQPSKAIAF-VKDVANITINDEQAMVEAVALYNPVSFAFEVTSD-FMMYRKG 235

Query:   279 VLNA-ECG---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLC 334
             + ++  C    D  +H V  VG+G   E++G  YW++KNSWG  WG +GY  + R + +C
Sbjct:   236 IYSSTSCHKTPDKVNHAVLAVGYG---EQNGIPYWIVKNSWGPQWGMNGYFLMERGKNMC 292

Query:   335 GIATEASYPVAM 346
             G+A  ASYP+ +
Sbjct:   293 GLAACASYPIPL 304


>ZFIN|ZDB-GENE-040426-1583
            symbol:ctssa "cathepsin S, a" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-040426-1583
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            GeneTree:ENSGT00660000095458 EMBL:CR548627 IPI:IPI00491948
            UniGene:Dr.81560 SMR:Q1L8W8 Ensembl:ENSDART00000053638 OMA:RNTREER
            OrthoDB:EOG480HX9 Uniprot:Q1L8W8
        Length = 328

 Score = 546 (197.3 bits), Expect = 1.0e-52, P = 1.0e-52
 Identities = 114/307 (37%), Positives = 174/307 (56%)

Query:    45 WMAQHGRTYKDELEKAMRLTIFKQNLEYI---EKANKEGNRTYKLGTNEFSDLTNEEFRA 101
             W +QH +TY++  E+ +R +++KQNL+ I    +A   G  +Y LG N+ SD+T +E   
Sbjct:    30 WKSQHNKTYRNTREERLRRSVWKQNLQDILLHNEAAAVGLHSYTLGLNQLSDMTADEVN- 88

Query:   102 SYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAV 161
                  +              TF   ++  +P  ++W E G V+ ++NQG CGSCWAFSAV
Sbjct:    89 -----DMNGLLEEDFPDVNATFSPPSLQTLPQRVNWTEHGMVSPVQNQGPCGSCWAFSAV 143

Query:   162 AAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPY 219
              ++E   +     L+ LS Q L+DCS    N GC GG + +AF Y+I+N+G+ +   YPY
Sbjct:   144 GSLEAQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRAFLYVIQNRGIDSSTFYPY 203

Query:   220 QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAFRFYKRG 278
             + ++G C       A    G +  +P+ +E AL  AV    PVSV + A   +F  Y+ G
Sbjct:   204 EHKEGVCRYSVSGRAGYCTG-FRIVPRHNEAALQSAVANIGPVSVGINAKLLSFHRYRSG 262

Query:   279 VLN-AECGDNC-DHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGI 336
             + N  +C     +H V VVG+G+   E+G  YWL+KNSWG  WGE+GYIR+ R++ +CGI
Sbjct:   263 IYNDPKCSSALINHAVLVVGYGS---ENGQDYWLVKNSWGTAWGENGYIRMARNKNMCGI 319

Query:   337 ATEASYP 343
             ++   YP
Sbjct:   320 SSFGIYP 326


>UNIPROTKB|Q24940
            symbol:Cat-1 "Cathepsin L-like proteinase" species:6192
            "Fasciola hepatica" [GO:0004175 "endopeptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005576 "extracellular region" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005576 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0004197 EMBL:L33771 PIR:S43991 PDB:2O6X
            PDBsum:2O6X ProteinModelPortal:Q24940 SMR:Q24940 MEROPS:C01.033
            EvolutionaryTrace:Q24940 Uniprot:Q24940
        Length = 326

 Score = 544 (196.6 bits), Expect = 1.7e-52, P = 1.7e-52
 Identities = 121/310 (39%), Positives = 170/310 (54%)

Query:    44 QWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
             QW   + + Y    ++  R  I+++N+++I++ N     G  TY LG N+F+D+T EEF+
Sbjct:    23 QWKRMYNKEYNGADDQHRR-NIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFK 81

Query:   101 ASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSA 160
             A Y                   ++  N   VP  IDWRE G VT +K+QG+CGSCWAFS 
Sbjct:    82 AKYL---TEMSRASDILSHGVPYEANNRA-VPDKIDWRESGYVTEVKDQGNCGSCWAFST 137

Query:   161 VAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYP 218
                +EG         I  SEQQLVDCS    NNGCSGGLM+ A++Y+ +  GL TE+ YP
Sbjct:   138 TGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYL-KQFGLETESSYP 196

Query:   219 YQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKR 277
             Y   +G C   K+   A   G Y  +  G E  L   V  ++P +V V+     F  Y+ 
Sbjct:   197 YTAVEGQCRYNKQLGVAKVTGYYT-VHSGSEVELKNLVGARRPAAVAVDVESD-FMMYRS 254

Query:   278 GVLNAE-CGD-NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEG-LC 334
             G+  ++ C     +H V  VG+GT   + G  YW++KNSWG  WGE GYIR+ R+ G +C
Sbjct:   255 GIYQSQTCSPLRVNHAVLAVGYGT---QGGTDYWIVKNSWGTYWGERGYIRMARNRGNMC 311

Query:   335 GIATEASYPV 344
             GIA+ AS P+
Sbjct:   312 GIASLASLPM 321


>ZFIN|ZDB-GENE-050208-336
            symbol:ctskl "cathepsin K, like" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-050208-336 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:BX465190
            GeneTree:ENSGT00660000095458 IPI:IPI00491185 RefSeq:XP_695425.1
            UniGene:Dr.110795 Ensembl:ENSDART00000062749 GeneID:567046
            KEGG:dre:567046 CTD:567046 NextBio:20888499 Bgee:F1QCP8
            Uniprot:F1QCP8
        Length = 349

 Score = 544 (196.6 bits), Expect = 1.7e-52, P = 1.7e-52
 Identities = 122/311 (39%), Positives = 173/311 (55%)

Query:    45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRA 101
             W  +H  +Y +E E   R TI++ N++ I K N +   G   +K+  N++ DLT+ E++ 
Sbjct:    44 WKKKHEISYDEESEDVHRKTIWETNMQKIWKNNNDFSFGLSMFKMAMNKYGDLTSVEYKR 103

Query:   102 SYTGYNXXXXXXXXXXXXXXTFKYQNVTDVP-TSIDWREKGAVTHIKNQGHCGSCWAFSA 160
                G                     N   +  T+ID+R KG VT +K+QG+CGSCW+FS 
Sbjct:   104 -LLGSKIKGTGNRKGKITSAQMLRLNAKRLGVTNIDYRAKGYVTEVKDQGYCGSCWSFST 162

Query:   161 VAAVEGITQITGGKLIELSEQQLVDCSTDNN--GCSGGLMDKAFEYIIENKGLATEADYP 218
               A+EG      G+L+ LSEQQLVDCS      GCSG  M  A++Y+I N  L +   YP
Sbjct:   163 TGAIEGQMYKHTGRLVSLSEQQLVDCSRSYGTYGCSGAWMANAYDYVINN-ALESSDTYP 221

Query:   219 YQQ-EQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYK 276
             Y   +   C  +K  A A  I  Y  +P G+E AL  AV T  PVSV ++A   +F FY 
Sbjct:   222 YTSVDTQPCFYEKNLAMAG-ISDYRFVPAGNEQALADAVATVGPVSVAIDADNPSFLFYS 280

Query:   277 RGVLN-AECG-DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGL 333
              G+   + C  +N +H V VVG+G+   E+G  YW+IKNSWG  WGE GY+R++R+ +  
Sbjct:   281 SGIYKESNCNPNNLNHAVLVVGYGS---EEGTDYWIIKNSWGTGWGEGGYMRMIRNGKNT 337

Query:   334 CGIATEASYPV 344
             CGIA+ A YP+
Sbjct:   338 CGIASYALYPI 348


>DICTYBASE|DDB_G0279187
            symbol:cprG "cysteine proteinase 7" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0279187 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000030 ProtClustDB:CLSZ2846820 MEROPS:C01.081
            EMBL:U72746 RefSeq:XP_641720.2 ProteinModelPortal:Q94504 SMR:Q94504
            PRIDE:Q94504 EnsemblProtists:DDB0215005 GeneID:8621915
            KEGG:ddi:DDB_G0279187 OMA:INTETEK Uniprot:Q94504
        Length = 460

 Score = 542 (195.9 bits), Expect = 2.7e-52, P = 2.7e-52
 Identities = 116/268 (43%), Positives = 146/268 (54%)

Query:    45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYT 104
             WM  H R Y  E E   R  IFK N++Y+ + N +G+ T  LG N F+D++NEE+RA+Y 
Sbjct:    33 WMIAHQRHYSSE-EFNGRYNIFKANMDYVNEWNTKGSETV-LGLNVFADISNEEYRATYL 90

Query:   105 GYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAV 164
             G                      + D    +DWR +GAVT IKNQG CG CW+FS   A 
Sbjct:    91 GTPFDASSLEMTES-------DKIFDASAQVDWRTQGAVTPIKNQGQCGGCWSFSTTGAT 143

Query:   165 EGITQITGGK--LIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQ 220
             EG   +  GK  L+ LSEQ L+DCS    NNGC GGLM  AFEYII NKG+ TE+ YPY 
Sbjct:   144 EGAQYLANGKKNLVSLSEQNLIDCSGSYGNNGCEGGLMTLAFEYIINNKGIDTESSYPYT 203

Query:   221 QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVL 280
              E G   K   K  AA +  Y ++  G E  L   VT+ P SV ++AS Q+F+ Y  G+ 
Sbjct:   204 AEDGKKCKFNPKNVAAQLSSYVNVTSGSESDLAAKVTQGPTSVAIDASNQSFQLYVSGIY 263

Query:   281 NAE-CGDN-CDHGVAVVGFGTAEEEDGA 306
             N   C     DHGV  VGFGT     G+
Sbjct:   264 NEPACSSTQLDHGVLAVGFGTGSGSSGS 291

 Score = 122 (48.0 bits), Expect = 0.00012, P = 0.00012
 Identities = 33/86 (38%), Positives = 44/86 (51%)

Query:   262 SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGE 321
             SV   ASG A          +  G N + GV    + TA +     YW++KNSWG +WG 
Sbjct:   385 SVSGSASGSA----SGSASGSSSGSNSNGGV----YPTAGD-----YWIVKNSWGTSWGM 431

Query:   322 SGYIRILR-DEGLCGIATEASYPVAM 346
              GYI + + +   CGIAT AS P A+
Sbjct:   432 DGYILMTKGNNNQCGIATMASRPTAV 457


>UNIPROTKB|P83654
            symbol:P83654 "Ervatamin-C" species:52861 "Tabernaemontana
            divaricata" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0006508 "proteolysis" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005576 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197 PDB:1O0E PDB:2PNS
            PDBsum:1O0E PDBsum:2PNS MEROPS:C01.116 EvolutionaryTrace:P83654
            Uniprot:P83654
        Length = 208

 Score = 541 (195.5 bits), Expect = 3.5e-52, P = 3.5e-52
 Identities = 106/215 (49%), Positives = 135/215 (62%)

Query:   131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDN 190
             +P  IDWR+KGAVT +KNQG CGSCWAFS V+ VE I QI  G LI LSEQ+LVDC   N
Sbjct:     1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKKN 60

Query:   191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
             +GC GG    A++YII N G+ T+A+YPY+  QG C    +     +I  Y  +P  +E 
Sbjct:    61 HGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPCQAASK---VVSIDGYNGVPFCNEX 117

Query:   251 ALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWL 310
             AL QAV  QP +V ++AS   F+ Y  G+ +  CG   +HGV +VG+        A YW+
Sbjct:   118 ALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQ-------ANYWI 170

Query:   311 IKNSWGETWGESGYIRILRDEG--LCGIATEASYP 343
             ++NSWG  WGE GYIR+LR  G  LCGIA    YP
Sbjct:   171 VRNSWGRYWGEKGYIRMLRVGGCGLCGIARLPYYP 205


>UNIPROTKB|Q10991
            symbol:CTSL "Cathepsin L1" species:9940 "Ovis aries"
            [GO:0005515 "protein binding" evidence=IPI] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            MEROPS:C01.032 ProteinModelPortal:Q10991 SMR:Q10991 Uniprot:Q10991
        Length = 217

 Score = 541 (195.5 bits), Expect = 3.5e-52, P = 3.5e-52
 Identities = 109/219 (49%), Positives = 145/219 (66%)

Query:   131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
             VP S+DW +KG VT +KNQG CGSCWAFSA  A+EG      GKL+ LSEQ LVD S   
Sbjct:     1 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQ 60

Query:   190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
              N GC+GGLMD AF+YI EN GL +E  YPY+    +C+ + E +AA   G + D+P+  
Sbjct:    61 GNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKDTG-FVDIPQR- 118

Query:   249 EHALLQAV-TKQPVSVCVEASGQAFRFYKRGVL-NAECGD-NCDHGVAVVGFGTAEEEDG 305
             E AL++AV T  P+SV ++A   +F+FYK G+  + +C   + DHGV VVG+G   E   
Sbjct:   119 EKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGF--EGTN 176

Query:   306 AKYWLIKNSWGETWGESGYIRILRDEGL-CGIATEASYP 343
              K+W++KNSWG  WG  GY+++ +D+   CGIAT ASYP
Sbjct:   177 NKFWIVKNSWGPEWGNKGYVKMAKDQNNHCGIATAASYP 215


>TAIR|locus:2175088
            symbol:ALP "aleurain-like protease" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0005829 "cytosol"
            evidence=RCA] [GO:0006096 "glycolysis" evidence=RCA] [GO:0006816
            "calcium ion transport" evidence=RCA] [GO:0006833 "water transport"
            evidence=RCA] [GO:0006972 "hyperosmotic response" evidence=RCA]
            [GO:0007030 "Golgi organization" evidence=RCA] [GO:0009266
            "response to temperature stimulus" evidence=RCA] [GO:0009651
            "response to salt stress" evidence=RCA] [GO:0009750 "response to
            fructose stimulus" evidence=RCA] [GO:0042744 "hydrogen peroxide
            catabolic process" evidence=RCA] [GO:0046686 "response to cadmium
            ion" evidence=RCA] [GO:0007568 "aging" evidence=IEP]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002688 GO:GO:0005773
            GO:GO:0007568 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AB011483 KO:K01366
            ProtClustDB:CLSN2689015 UniGene:At.25414 IPI:IPI00846287
            RefSeq:NP_001078774.1 ProteinModelPortal:A8MQZ1 SMR:A8MQZ1
            STRING:A8MQZ1 PRIDE:A8MQZ1 EnsemblPlants:AT5G60360.3 GeneID:836158
            KEGG:ath:AT5G60360 OMA:CGSTPMD Genevestigator:A8MQZ1 Uniprot:A8MQZ1
        Length = 361

 Score = 539 (194.8 bits), Expect = 5.6e-52, P = 5.6e-52
 Identities = 128/321 (39%), Positives = 178/321 (55%)

Query:    25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTY 84
             SQ++ G+S H  S      ++  ++G+ Y++  E  +R +IFK+NL+ I   NK+G  +Y
Sbjct:    47 SQIL-GQSRHVLSFA----RFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKG-LSY 100

Query:    85 KLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQN-VTD--VPTSIDWREKG 141
             KLG N+F+DLT +EF+ +  G                T K  + VT+  +P + DWRE G
Sbjct:   101 KLGVNQFADLTWQEFQRTKLG---------AAQNCSATLKGSHKVTEAALPETKDWREDG 151

Query:   142 AVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMD 199
              V+ +K+QG CGSCW FS   A+E       GK I LSEQQLVDC+   +N GC+GGL  
Sbjct:   152 IVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPS 211

Query:   200 KAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TK 258
             +AFEYI  N GL TE  YPY  +  TC    E      +    ++  G E  L  AV   
Sbjct:   212 QAFEYIKSNGGLDTEKAYPYTGKDETCKFSAENVGVQVLNSV-NITLGAEDELKHAVGLV 270

Query:   259 QPVSVCVEASGQAFRFYKRGVL-NAECGD---NCDHGVAVVGFGTAEEEDGAKYWLIKNS 314
             +PVS+  E    +FR YK GV  ++ CG    + +H V  VG+G    EDG  YWLIKNS
Sbjct:   271 RPVSIAFEVI-HSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGV---EDGVPYWLIKNS 326

Query:   315 WGETWGESGYIRILRDEGLCG 335
             WG  WG+ GY ++   + +CG
Sbjct:   327 WGADWGDKGYFKMEMGKNMCG 347


>DICTYBASE|DDB_G0272298
            symbol:DDB_G0272298 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0272298 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246
            SMART:SM00848 EMBL:AAFI02000008 KO:K01365 RefSeq:XP_645281.1
            ProteinModelPortal:Q559Q3 MEROPS:C01.A53 EnsemblProtists:DDB0203746
            GeneID:8618447 KEGG:ddi:DDB_G0272298 InParanoid:Q559Q3 OMA:PANINWR
            Uniprot:Q559Q3
        Length = 305

 Score = 534 (193.0 bits), Expect = 1.9e-51, P = 1.9e-51
 Identities = 118/308 (38%), Positives = 173/308 (56%)

Query:    46 MAQHGRTYKDELEKAMRLTIFKQNLEYI-EKANKEGNRTYKLGTNEFSDLTNEEFRASYT 104
             M ++ + YK+  E   R  IF+ N  +I    NK G    ++  NE+SDLT +EF   + 
Sbjct:     1 MVKYNKHYKNNKEYLKRFDIFQDNYNFILNHRNKNGENI-EMDLNEYSDLTQKEFADKFF 59

Query:   105 GYNXXXXXXXXXXXXXXT-FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAA 163
                              T FK+     +P S DWR+ GAV  +KNQG C SCW+FSA+ A
Sbjct:    60 EKLVPEPRSGPINDIKATPFKHNVNATIPKSFDWRDHGAVGKVKNQGSCASCWSFSALGA 119

Query:   164 VEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQ 221
             +EG   I  G+L++LSEQ LVDC+T     GC  G M  AF+YII + G+  E+ YPY  
Sbjct:   120 LEGHYYIKYGELLDLSEQNLVDCATPFGPKGCKTGWMHDAFKYIISSGGVNLESQYPYTG 179

Query:   222 EQGTCD-KQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVSVCVEASGQAFRFYKRGV 279
             +   C   Q EK A   +  +  +PK DE AL++A+    PV+V ++ S + F+    G+
Sbjct:   180 KDEVCKFNQSEKEAK--VSGFVMIPKFDESALMEAIALYGPVAVPIDTSTKEFQHLSGGI 237

Query:   280 LNAECGD--NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGI 336
               ++  D  N  H V  +G+GT  +E+G  Y+L+KNSWG++WG +G+ ++ R  +G CGI
Sbjct:   238 YYSDSCDPWNTIHAVLAIGYGT--DENGVDYFLMKNSWGKSWGTNGFFKVKRGVKGKCGI 295

Query:   337 ATEASYPV 344
              T ASYP+
Sbjct:   296 VTAASYPI 303


>UNIPROTKB|P83443
            symbol:P83443 "Macrodontain-1" species:203992 "Pseudananas
            sagenarius" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0006508 "proteolysis" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197
            ProteinModelPortal:P83443 SMR:P83443 MEROPS:C01.028 Uniprot:P83443
        Length = 213

 Score = 531 (192.0 bits), Expect = 4.0e-51, P = 4.0e-51
 Identities = 95/217 (43%), Positives = 144/217 (66%)

Query:   131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDN 190
             VP SIDWR+ GAV  +KNQG CG CWAF+A+A VEGI +I  G L+ LSEQ+++DC+  +
Sbjct:     2 VPQSIDWRDYGAVNEVKNQGPCGGCWAFAAIATVEGIYKIRKGNLVYLSEQEVLDCAV-S 60

Query:   191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
              GC GG +++A+++II N G+ T+ +YPY+  QGTC+      +A   G Y  + + DE 
Sbjct:    61 YGCKGGWVNRAYDFIISNNGVTTDENYPYRAYQGTCNANYFPNSAYITG-YSYVRRNDES 119

Query:   251 ALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWL 310
              ++ AV+ QP++  ++ASG  F++YK GV +  CG + +H + ++G+G     D   YW+
Sbjct:   120 HMMYAVSNQPIAALIDASGDNFQYYKGGVYSGPCGFSLNHAITIIGYG----RDS--YWI 173

Query:   311 IKNSWGETWGESGYIRILRDE----GLCGIATEASYP 343
             ++NSWG +WG+ GY+RI RD     G+CGIA    +P
Sbjct:   174 VRNSWGSSWGQGGYVRIRRDVSHSGGVCGIAMSPLFP 210


>MGI|MGI:1861723
            symbol:Ctsr "cathepsin R" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=ISA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=ISA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0030163 "protein
            catabolic process" evidence=ISA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1861723 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0030163
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF245399
            EMBL:AY014778 EMBL:AK014432 EMBL:AK005429 IPI:IPI00120321
            RefSeq:NP_064680.1 UniGene:Mm.315715 ProteinModelPortal:Q9JIA9
            SMR:Q9JIA9 MEROPS:C01.042 PRIDE:Q9JIA9 Ensembl:ENSMUST00000021889
            GeneID:56835 KEGG:mmu:56835 CTD:56835 InParanoid:Q9JIA9 KO:K09601
            OMA:ASHESFK OrthoDB:EOG4ZCT6D NextBio:313379 Bgee:Q9JIA9
            CleanEx:MM_CTSR Genevestigator:Q9JIA9 GermOnline:ENSMUSG00000055679
            Uniprot:Q9JIA9
        Length = 334

 Score = 529 (191.3 bits), Expect = 6.5e-51, P = 6.5e-51
 Identities = 117/326 (35%), Positives = 179/326 (54%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRT 83
             V SG  + + S+  + + W  ++ ++Y  + EK  R+ ++++ L+ I+  N+E   G   
Sbjct:    14 VASGVPVLDSSLDAEWQDWKIKYNKSYSLKEEKLKRV-VWEEKLKMIKLHNRENSLGKNG 72

Query:    84 YKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAV 143
             + +  NEF D T+EEFR      +                K +  + +P  +DWR+KG V
Sbjct:    73 FTMKMNEFGDQTDEEFRKMMIEISVWTHREGKSI-----MKREAGSILPKFVDWRKKGYV 127

Query:   144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKA 201
             T ++ QG C +CWAF+   A+E       GKL  LS Q LVDCS    NNGC GG    A
Sbjct:   128 TPVRRQGDCDACWAFAVTGAIEAQAIWQTGKLTPLSVQNLVDCSKPQGNNGCLGGDTYNA 187

Query:   202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
             F+Y++ N GL +EA YPY+ + G C +   K + A I  +  LP+ ++  +    T  P+
Sbjct:   188 FQYVLHNGGLESEATYPYEGKDGPC-RYNPKNSKAEITGFVSLPQSEDILMAAVATIGPI 246

Query:   262 SVCVEASGQAFRFYKRGVLNA-ECG-DNCDHGVAVVGFG-TAEEEDGAKYWLIKNSWGET 318
             +  ++AS ++F+ YK G+ +   C  D   HGV VVG+G    E DG  YWLIKNSWG+ 
Sbjct:   247 TAGIDASHESFKNYKGGIYHEPNCSSDTVTHGVLVVGYGFKGIETDGNHYWLIKNSWGKR 306

Query:   319 WGESGYIRILRDEGL-CGIATEASYP 343
             WG  GY+++ +D+   CGIA+ A YP
Sbjct:   307 WGIRGYMKLAKDKNNHCGIASYAHYP 332


>RGD|1588248
            symbol:Cts8 "cathepsin 8" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1588248 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 IPI:IPI00765053
            RefSeq:NP_001121688.1 UniGene:Rn.220599 Ensembl:ENSRNOT00000061486
            GeneID:680718 KEGG:rno:680718 UCSC:RGD:1588248 CTD:56094
            OMA:DSEWQEW OrthoDB:EOG4JT07C NextBio:719350 Uniprot:D3ZP54
        Length = 333

 Score = 529 (191.3 bits), Expect = 6.5e-51, P = 6.5e-51
 Identities = 118/319 (36%), Positives = 177/319 (55%)

Query:    35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGN---RTYKLGTNEF 91
             +PS+  + ++W  ++ + Y  E E+  +  ++++N++ +++ N E +   + + +  N F
Sbjct:    22 DPSLDSEWQEWKTKYEKNYSLE-EEGQKRAVWEENMKVVKQHNIEYDQEKKNFTMELNAF 80

Query:    92 SDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
             +D+T EEFR   T                  F+Y     +P  +DWR +G VT +KNQG 
Sbjct:    81 ADMTGEEFRKMMTNIPVQNLRKKKSIHQPI-FRY-----LPKFVDWRRRGYVTSVKNQGT 134

Query:   152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENK 209
             C SCWAFS   A+EG      G+L+ LS Q LVDCS    N+GC  G    A +Y+  N 
Sbjct:   135 CNSCWAFSVAGAIEGQMFRKTGRLVSLSPQNLVDCSRPEGNHGCHMGSTLYALKYVWSNG 194

Query:   210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEAS 268
             GL  E+ YPY+ ++G C     ++AA   G +  + + +E AL+ AV T  P+SV ++AS
Sbjct:   195 GLEAESTYPYEGKEGPCRYLPRRSAARVTG-FSTVARSEE-ALMHAVATIGPISVGIDAS 252

Query:   269 GQAFRFYKRGVL-NAECGDN-CDHGVAVVGFG-TAEEEDGAKYWLIKNSWGETWGESGYI 325
               +FRFY+RG+     C  N  +H V VVG+G    E DG KYWLIKNS G  WG +GY+
Sbjct:   253 HVSFRFYRRGIYYEPRCSSNRINHSVLVVGYGYEGRESDGRKYWLIKNSHGVGWGMNGYM 312

Query:   326 RILRD-EGLCGIATEASYP 343
             ++ R     CGIAT   YP
Sbjct:   313 KLARGWNNHCGIATYGFYP 331


>UNIPROTKB|G3V9F8
            symbol:Ctsm "RCG24133" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015645 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 EMBL:CH474032
            PANTHER:PTHR12411:SF58 Ensembl:ENSRNOT00000045830 RGD:631420
            Uniprot:G3V9F8
        Length = 333

 Score = 527 (190.6 bits), Expect = 1.1e-50, P = 1.1e-50
 Identities = 119/329 (36%), Positives = 187/329 (56%)

Query:    26 QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNR 82
             +++S     +P +  + ++W  ++ +TY  E E+  +  ++++N++ I+  N E   G  
Sbjct:    13 RLISSSPAPDPVLDAEWQKWKIKYEKTYSLE-EEGQKRAVWEENMKKIKLHNGENGLGKH 71

Query:    83 TYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGA 142
              + +  N F D+T EEFR                     + + +   +VP  I+WR++G 
Sbjct:    72 GFTMEMNAFGDMTIEEFR------KLMIEIPIPTVKKENSVQKRQAVNVPNFINWRKRGY 125

Query:   143 VTHIKNQGHCGSCWAFSAVAAVEG-ITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMD 199
             VT ++ QG C  CWAFS   A+EG + Q TG +LI LS Q LVDCS    N GC  G   
Sbjct:   126 VTPVRRQGRCNVCWAFSVAGAIEGQMFQKTG-QLIPLSVQNLVDCSRPQGNLGCYLGNTY 184

Query:   200 KAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TK 258
              A +Y+ EN GL +EA YPY++++G+C    + + A+ I  +E +PK +E AL+ AV T 
Sbjct:   185 LALQYVKENGGLESEATYPYEEKEGSCRYHPDNSTAS-ITDFEFVPK-NEDALMNAVATL 242

Query:   259 QPVSVCVEASGQAFRFYKRGVLNA-ECGDNC-DHGVAVVGFG-TAEEEDGAKYWLIKNSW 315
              P+SV ++A  ++F FY+ G+ +   C  +   H + +VG+G   EE DG KYW++KNS 
Sbjct:   243 GPISVAIDARHESFLFYRNGIYHEPNCSSSVVTHAMLLVGYGFVGEESDGRKYWILKNSM 302

Query:   316 GETWGESGYIRILRDEGL-CGIATEASYP 343
             G  WG  GY++I +D+G  CGIAT A YP
Sbjct:   303 GNKWGNRGYMKIAKDQGNHCGIATYALYP 331


>UNIPROTKB|E9PTT3
            symbol:Ctsr "Protein Ctsr" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            IPI:IPI00627092 Ensembl:ENSRNOT00000024115 RGD:631422
            Uniprot:E9PTT3
        Length = 334

 Score = 526 (190.2 bits), Expect = 1.3e-50, P = 1.3e-50
 Identities = 123/328 (37%), Positives = 180/328 (54%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRT 83
             V  G    +PS+  +      ++ ++Y  E E+  R  ++++N++ I+  N+E   G   
Sbjct:    14 VGXGALAFDPSLDAEWHDXKTEYEKSYTME-EEGHRRAVWEENMKMIKLHNRENSLGKNG 72

Query:    84 YKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDV-PTSIDWREKGA 142
             + +  NEF DLT EEFR                       + ++V +V P  +DWR+KG 
Sbjct:    73 FIMEMNEFGDLTAEEFRKMMVNIPIRSHRKGKI------IRKRDVGNVLPKFVDWRKKGY 126

Query:   143 VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC--STDNNGCSGGLMDK 200
             VT ++NQ  C SCWAF+   A+EG      G+L  LS Q LVDC  S  N GC  G    
Sbjct:   127 VTRVQNQKFCNSCWAFAVTGAIEGQMFNKTGQLTPLSVQNLVDCTKSQGNEGCQWGDPHI 186

Query:   201 AFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQ 259
             A+EY++ N GL  EA YPY+ ++G C +   K + A I  +  LP+ ++  L++AV T  
Sbjct:   187 AYEYVLNNGGLEAEATYPYKGKEGVC-RYNPKHSKAEITGFVSLPESED-ILMEAVATIG 244

Query:   260 PVSVCVEASGQAFRFYKRGVLNA-ECGDNC-DHGVAVVGFG-TAEEEDGAKYWLIKNSWG 316
             P+SV V+AS  +F FYK+G+ +   C +N  +H V VVG+G    E DG  YWLIKNSWG
Sbjct:   245 PISVAVDASFNSFGFYKKGLYDEPNCSNNTVNHSVLVVGYGFEGNETDGNSYWLIKNSWG 304

Query:   317 ETWGESGYIRILRDEG-LCGIATEASYP 343
               WG  GY++I +D+   C IA+ A YP
Sbjct:   305 RKWGLRGYMKIPKDQNNFCAIASYAHYP 332


>DICTYBASE|DDB_G0278721
            symbol:cprD "cysteine proteinase 4" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0278721 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000024 EMBL:L36204 RefSeq:XP_641963.1
            ProteinModelPortal:P54639 SMR:P54639 MEROPS:C01.A57 PRIDE:P54639
            EnsemblProtists:DDB0214999 GeneID:8621695 KEGG:ddi:DDB_G0278721
            OMA:NAFADIT ProtClustDB:CLSZ2846820 Uniprot:P54639
        Length = 442

 Score = 521 (188.5 bits), Expect = 4.6e-50, P = 4.6e-50
 Identities = 117/285 (41%), Positives = 151/285 (52%)

Query:    45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYT 104
             WM  H RTY  E E   R  IFK N++Y+ + N +G  T  LG N F+D+TN+E+R +Y 
Sbjct:    33 WMQAHQRTYSSE-EFNARYQIFKSNMDYVHQWNSKGGETV-LGLNVFADITNQEYRTTYL 90

Query:   105 GYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAV 164
             G                 F     T  PT +DWR +GAVT IKNQG CG CW+FS   + 
Sbjct:    91 G-TPFDGSALIGTEEEKIFS----TPAPT-VDWRAQGAVTPIKNQGQCGGCWSFSTTGST 144

Query:   165 EGITQITGGK---LIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPY 219
             EG   I  G    L+ LSEQ L+DCS    NNGC GGLM  AFEYII NKG+ TE+ YPY
Sbjct:   145 EGAHFIASGTKKDLVSLSEQNLIDCSKSYGNNGCEGGLMTLAFEYIINNKGIDTESSYPY 204

Query:   220 QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGV 279
               E G   K K     A I  Y+++  G E +L  A    PVSV ++AS ++F+ Y+ G+
Sbjct:   205 TAEDGKECKFKTSNIGAQIVSYQNVTSGSEASLQSASNNAPVSVAIDASNESFQLYESGI 264

Query:   280 L-NAECGDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGES 322
                  C     DHGV VVG+G+             +S   T G++
Sbjct:   265 YYEPACSPTQLDHGVLVVGYGSGSSSSSGSSSGKSSSSSSTGGKT 309

 Score = 140 (54.3 bits), Expect = 1.1e-06, P = 1.1e-06
 Identities = 41/114 (35%), Positives = 52/114 (45%)

Query:   233 AAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGV 292
             AA++T G       G +    Q+   Q  S    ASGQA          +  G     G 
Sbjct:   338 AASSTSGSQSGSQSGSQSG--QSTGSQ--SGQTSASGQA------SASGSGSGSGSGSGS 387

Query:   293 AVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGL-CGIATEASYPVA 345
                G G  E   G  YW++KNSWG +WG  GYI + +D    CGIAT AS+P A
Sbjct:   388 GS-GSGAVEASSG-NYWIVKNSWGTSWGMDGYIFMSKDRNNNCGIATMASFPTA 439


>MGI|MGI:1927229
            symbol:Ctsm "cathepsin M" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008152 "metabolic process" evidence=ISS] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015645 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1927229 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF202528
            EMBL:AY014777 EMBL:AY057446 EMBL:AK005550 EMBL:AK005428
            IPI:IPI00131133 RefSeq:NP_071721.2 UniGene:Mm.279933
            ProteinModelPortal:Q9JL96 SMR:Q9JL96 STRING:Q9JL96 MEROPS:C01.023
            PRIDE:Q9JL96 DNASU:64139 Ensembl:ENSMUST00000099451 GeneID:64139
            KEGG:mmu:64139 UCSC:uc007qwj.1 CTD:64139 InParanoid:Q9JL96
            KO:K09600 OrthoDB:EOG4TTGKR NextBio:319931 Bgee:Q9JL96
            CleanEx:MM_CTSM Genevestigator:Q9JL96 GermOnline:ENSMUSG00000074484
            GermOnline:ENSMUSG00000074871 PANTHER:PTHR12411:SF58 Uniprot:Q9JL96
        Length = 333

 Score = 521 (188.5 bits), Expect = 4.6e-50, P = 4.6e-50
 Identities = 119/319 (37%), Positives = 176/319 (55%)

Query:    35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
             +P +  + ++W  ++G+ Y  E E+  +  +++ N++ I+  N E   G   + +  N F
Sbjct:    22 DPILDVEWQKWKIKYGKAYSLE-EEGQKRAVWEDNMKKIKLHNGENGLGKHGFTMEMNAF 80

Query:    92 SDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
              D+T EEFR                     + + +   ++P  I+W+++G VT ++ QG 
Sbjct:    81 GDMTLEEFR------KVMIEIPVPTVKKGKSVQKRLSVNLPKFINWKKRGYVTPVQTQGR 134

Query:   152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENK 209
             C SCWAFS   A+EG      G+LI LS Q LVDCS    N GC  G    A  Y++EN 
Sbjct:   135 CNSCWAFSVTGAIEGQMFRKTGQLIPLSVQNLVDCSRPQGNWGCYLGNTYLALHYVMENG 194

Query:   210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEAS 268
             GL +EA YPY+++ G+C    E + A   G +E +PK +E AL+ AV    P+SV ++A 
Sbjct:   195 GLESEATYPYEEKDGSCRYSPENSTANITG-FEFVPK-NEDALMNAVASIGPISVAIDAR 252

Query:   269 GQAFRFYKRGVLNAECGDNC--DHGVAVVGFG-TAEEEDGAKYWLIKNSWGETWGESGYI 325
               +F FYKRG+       +C   H + +VG+G T  E DG KYWL+KNS G  WG  GY+
Sbjct:   253 HASFLFYKRGIYYEPNCSSCVVTHSMLLVGYGFTGRESDGRKYWLVKNSMGTQWGNKGYM 312

Query:   326 RILRDEGL-CGIATEASYP 343
             +I RD+G  CGIAT A YP
Sbjct:   313 KISRDKGNHCGIATYALYP 331


>RGD|1309226
            symbol:Cts7 "cathepsin 7" species:10116 "Rattus norvegicus"
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005768 "endosome" evidence=IEA] [GO:0005794 "Golgi apparatus"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0007067
            "mitosis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IEA] [GO:0051301 "cell division" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 RGD:1309226 GO:GO:0005634
            GO:GO:0005794 GO:GO:0048471 GO:GO:0005615 GO:GO:0051301
            GO:GO:0007067 GO:GO:0005768 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 MEROPS:C01.016 CTD:56092
            GeneTree:ENSGT00560000076577 OrthoDB:EOG44QT2S EMBL:CH474032
            IPI:IPI00870531 RefSeq:NP_001099569.1 UniGene:Rn.218615
            Ensembl:ENSRNOT00000043686 GeneID:290970 KEGG:rno:290970
            UCSC:RGD:1309226 OMA:VESFNAN Uniprot:D3ZZ07
        Length = 331

 Score = 519 (187.8 bits), Expect = 7.4e-50, P = 7.4e-50
 Identities = 114/318 (35%), Positives = 178/318 (55%)

Query:    37 SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGN---RTYKLGTNEFSD 93
             S+  + E+W   + +TY  E EK  R  ++++N++ I+    +       + +  NEF D
Sbjct:    24 SLDAEWEEWKRNNAKTYSPEEEKQRR-AVWEENVKMIKWHTMQNGLWMNNFTIEMNEFGD 82

Query:    94 LTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
             +T EE R                       + +NV  +P ++DWR+ G V  +++QG CG
Sbjct:    83 MTGEEMRMM-------TDSSALTLRNGKHIQKRNVK-IPKTLDWRDTGCVAPVRSQGGCG 134

Query:   154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGL 211
             +CWAFS  A++E       GKLI LS Q L+DC+    NN CSGG    AF+Y+  N GL
Sbjct:   135 ACWAFSVAASIESQLFKKTGKLIPLSVQNLIDCTVTYGNNDCSGGKPYTAFQYVKNNGGL 194

Query:   212 ATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQA-VTKQPVSVCVEASGQ 270
               EA YPY+ +   C  + E++    I ++  +P+ +E AL+QA VT  P++V ++ S  
Sbjct:   195 EAEATYPYEAKLRHCRYRPERSVVK-IARFFVVPRNEE-ALMQALVTYGPIAVAIDGSHA 252

Query:   271 AFRFYKRGVLNA-ECG-DNCDHGVAVVGFG-TAEEEDGAKYWLIKNSWGETWGESGYIRI 327
             +F+ Y+ G+ +  +C  D  DHG+ +VG+G    E +  KYWL+KNS GE WGE GY+++
Sbjct:   253 SFKRYRGGIYHEPKCRRDTLDHGLLLVGYGYEGHESENRKYWLLKNSHGEQWGERGYMKL 312

Query:   328 LRDEG-LCGIATEASYPV 344
              RD+   CGIA+ A YP+
Sbjct:   313 PRDQNNYCGIASYAMYPL 330


>WB|WBGene00007055
            symbol:tag-196 species:6239 "Caenorhabditis elegans"
            [GO:0004869 "cysteine-type endopeptidase inhibitor activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000010
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00031 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00043 SMART:SM00645 InterPro:IPR000169
            GO:GO:0004869 eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 EMBL:FO080488 PIR:T31871
            RefSeq:NP_505215.2 HSSP:Q9UBX1 ProteinModelPortal:O16454 SMR:O16454
            DIP:DIP-27400N IntAct:O16454 MINT:MINT-1044990 MEROPS:C01.A50
            PaxDb:O16454 EnsemblMetazoa:F41E6.6.1 EnsemblMetazoa:F41E6.6.2
            EnsemblMetazoa:F41E6.6.3 GeneID:179240 KEGG:cel:CELE_F41E6.6
            UCSC:F41E6.6.1 CTD:179240 WormBase:F41E6.6 InParanoid:O16454
            OMA:GGGLMTN NextBio:904514 Uniprot:O16454
        Length = 477

 Score = 518 (187.4 bits), Expect = 9.5e-50, P = 9.5e-50
 Identities = 116/303 (38%), Positives = 163/303 (53%)

Query:    48 QHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYN 107
             +H + Y ++ E   R  +FK+N + I +  K    T   G  +FSD+T  EF+     Y 
Sbjct:   180 RHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFKKIMLPYQ 239

Query:   108 XXXXXXXXXXXXXXTFKYQNVT----DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAA 163
                            F+  +VT    D+P S DWREKGAVT +KNQG+CGSCWAFS    
Sbjct:   240 WEQPVYPMEQA---NFEKHDVTINEEDLPESFDWREKGAVTQVKNQGNCGSCWAFSTTGN 296

Query:   164 VEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQ 223
             VEG   I   KL+ LSEQ+LVDC + + GC+GGL   A++ II   GL  E  YPY    
Sbjct:   297 VEGAWFIAKNKLVSLSEQELVDCDSMDQGCNGGLPSNAYKEIIRMGGLEPEDAYPYDGRG 356

Query:   224 GTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAE 283
              TC   ++  A    G  E LP  +       VTK P+S+ + A+    +FY+ GV++  
Sbjct:   357 ETCHLVRKDIAVYINGSVE-LPHDEVEMQKWLVTKGPISIGLNAN--TLQFYRHGVVHPF 413

Query:   284 ---CGD-NCDHGVAVVGFGTAEEEDGAK-YWLIKNSWGETWGESGYIRILRDEGLCGIAT 338
                C     +HGV +VG+G    +DG K YW++KNSWG  WGE+GY ++ R + +CG+  
Sbjct:   414 KIFCEPFMLNHGVLIVGYG----KDGRKPYWIVKNSWGPNWGEAGYFKLYRGKNVCGVQE 469

Query:   339 EAS 341
              A+
Sbjct:   470 MAT 472


>FB|FBgn0034229
            symbol:CG4847 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0032504
            "multicellular organism reproduction" evidence=IEP] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0005615 "extracellular space"
            evidence=ISM;IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:AE013599 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 GO:GO:0032504 GeneTree:ENSGT00560000076599
            KO:K01371 EMBL:BT099507 RefSeq:NP_725686.1 UniGene:Dm.4677
            SMR:A1ZAU4 IntAct:A1ZAU4 MEROPS:C01.A28 EnsemblMetazoa:FBtr0086935
            GeneID:36973 KEGG:dme:Dmel_CG4847 UCSC:CG4847-RB
            FlyBase:FBgn0034229 InParanoid:A1ZAU4 OMA:GGFQEYA OrthoDB:EOG4J9KFC
            ChiTaRS:CG4847 GenomeRNAi:36973 NextBio:801302 Uniprot:A1ZAU4
        Length = 420

 Score = 514 (186.0 bits), Expect = 2.5e-49, P = 2.5e-49
 Identities = 120/321 (37%), Positives = 164/321 (51%)

Query:    37 SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK---EGNRTYKLGTNEFSD 93
             S V+    +++Q G+TY    ++A+    F      +E  N    +G  T+K   N F+D
Sbjct:   107 SNVQDFGDFLSQSGKTYLSAADRALHEGAFASTKNLVEAGNAAFAQGVHTFKQAVNAFAD 166

Query:    94 LTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVT--DVPTSIDWREKGAVTHIKNQGH 151
             LT+ EF +  TG                + K  N+    +P + DWRE G VT +K QG 
Sbjct:   167 LTHSEFLSQLTGLKRSPEAKARAAA---SLKLVNLPAKPIPDAFDWREHGGVTPVKFQGT 223

Query:   152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDN----NGCSGGLMDKAFEYIIE 207
             CGSCWAF+   A+EG T    G L  LSEQ LVDC        NGC GG  + AF +I E
Sbjct:   224 CGSCWAFATTGAIEGHTFRKTGSLPNLSEQNLVDCGPVEDFGLNGCDGGFQEAAFCFIDE 283

Query:   208 -NKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCV 265
               KG++ E  YPY   +GTC     K+ A T+  +  +P  DE  L + V T  PV+  V
Sbjct:   284 VQKGVSQEGAYPYIDNKGTCKYDGSKSGA-TLQGFAAIPPKDEEQLKKVVATLGPVACSV 342

Query:   266 EASGQAFRFYKRGVLNA-ECGDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESG 323
                 +  + Y  G+ N  EC     +H + VVG+G+   E G  YW++KNSW +TWGE G
Sbjct:   343 NGL-ETLKNYAGGIYNDDECNKGEPNHSILVVGYGS---EKGQDYWIVKNSWDDTWGEKG 398

Query:   324 YIRILRDEGLCGIATEASYPV 344
             Y R+ R +  C IA E SYPV
Sbjct:   399 YFRLPRGKNYCFIAEECSYPV 419


>UNIPROTKB|Q90686
            symbol:CTSK "Cathepsin K" species:9031 "Gallus gallus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            CTD:1513 KO:K01371 PANTHER:PTHR12411:SF55 EMBL:U37691
            IPI:IPI00575213 RefSeq:NP_990302.1 UniGene:Gga.51509
            ProteinModelPortal:Q90686 SMR:Q90686 MEROPS:C01.036 GeneID:395818
            KEGG:gga:395818 NextBio:20815886 Uniprot:Q90686
        Length = 334

 Score = 512 (185.3 bits), Expect = 4.1e-49, P = 4.1e-49
 Identities = 109/274 (39%), Positives = 158/274 (57%)

Query:    74 EKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPT 133
             ++  + G  +++L  N   D+T+EE   + TG                 +     +  P 
Sbjct:    66 QRGARLGKHSFQLAMNYLGDMTSEEVVRTMTGLRVPRSRPRPNGTL---YVPDWSSRAPA 122

Query:   134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGC 193
             ++DWR KG VT +K+QG CGSCWAFS+V A+EG  +   GKL+ LS Q LV C ++NNGC
Sbjct:   123 AVDWRRKGYVTPVKDQGQCGSCWAFSSVGALEGQLKRRTGKLLSLSPQNLVYCVSNNNGC 182

Query:   194 SGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALL 253
              GG M  AFEY+  N+G+ +E  YPY  +  +C       AA   G Y ++P+ +E AL 
Sbjct:   183 GGGYMTNAFEYVRLNRGIDSEDAYPYIGQDESCMYSPTGKAAKCRG-YREIPEDNEKALK 241

Query:   254 QAVTK-QPVSVCVEASGQAFRFYKRGVL-NAECG-DNCDHGVAVVGFGTAEEEDGAKYWL 310
             +AV +  PVSV ++AS  +F+FY RGV  +  C  +N +H V  VG+G    + G K+W+
Sbjct:   242 RAVARIGPVSVGIDASLPSFQFYSRGVYYDTGCNPENINHAVLAVGYGA---QKGTKHWI 298

Query:   311 IKNSWGETWGESGYIRILRD-EGLCGIATEASYP 343
             IKNSWG  WG  GY+ + R+ +  CGIA  AS+P
Sbjct:   299 IKNSWGTEWGNKGYVLLARNMKQTCGIANLASFP 332


>MGI|MGI:1860262
            symbol:Cts7 "cathepsin 7" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005576
            "extracellular region" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005768 "endosome" evidence=IEA]
            [GO:0005794 "Golgi apparatus" evidence=IEA] [GO:0006508
            "proteolysis" evidence=ISA] [GO:0007049 "cell cycle" evidence=IEA]
            [GO:0007067 "mitosis" evidence=IEA] [GO:0008152 "metabolic process"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=ISA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0051301 "cell
            division" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:1860262 GO:GO:0005634 GO:GO:0005794 GO:GO:0048471
            GO:GO:0005615 GO:GO:0051301 GO:GO:0007067 GO:GO:0005768
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GO:GO:0008233 EMBL:CH466546
            EMBL:AY014779 EMBL:CT030645 EMBL:BC064740 EMBL:AF250837
            IPI:IPI00131132 RefSeq:NP_062412.1 UniGene:Mm.3692 HSSP:O60911
            ProteinModelPortal:Q91ZF2 SMR:Q91ZF2 STRING:Q91ZF2 MEROPS:C01.016
            PRIDE:Q91ZF2 Ensembl:ENSMUST00000021892 GeneID:56092 KEGG:mmu:56092
            UCSC:uc007qwi.1 CTD:56092 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 InParanoid:Q91ZF2 OMA:ERRVIWE OrthoDB:EOG44QT2S
            NextBio:311908 Bgee:Q91ZF2 Genevestigator:Q91ZF2 Uniprot:Q91ZF2
        Length = 331

 Score = 511 (184.9 bits), Expect = 5.2e-49, P = 5.2e-49
 Identities = 113/311 (36%), Positives = 172/311 (55%)

Query:    43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGN---RTYKLGTNEFSDLTNEEF 99
             E+W   + RTY  E EK  R  +++ N+++I++   E       + +  NEF D+T EE 
Sbjct:    30 EEWKRSNDRTYSPEEEKQRR-AVWEGNVKWIKQHIMENGLWMNNFTIEMNEFGDMTGEEM 88

Query:   100 RASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFS 159
             +                       + +N   +P ++DWR++G VT ++ QG CG+CWAFS
Sbjct:    89 KM-------LTESSSYPLRNGKHIQKRN-PKIPPTLDWRKEGYVTPVRRQGSCGACWAFS 140

Query:   160 AVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADY 217
               A +EG      GKLI LS Q L+DCS      GC GG    AF+Y+  N GL  EA Y
Sbjct:   141 VTACIEGQLFKKTGKLIPLSVQNLMDCSVSYGTKGCDGGRPYDAFQYVKNNGGLEAEATY 200

Query:   218 PYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQA-VTKQPVSVCVEASGQAFRFYK 276
             PY+ +   C  + E++    + ++  +P+ +E ALLQA VT  P++V ++ S  +F  Y+
Sbjct:   201 PYEAKAKHCRYRPERSVVK-VNRFFVVPRNEE-ALLQALVTHGPIAVAIDGSHASFHSYR 258

Query:   277 RGVLNA-ECG-DNCDHGVAVVGFG-TAEEEDGAKYWLIKNSWGETWGESGYIRILRDEG- 332
              G+ +  +C  D  DHG+ +VG+G    E +  KYWL+KNS GE WGE+GY+++ R +  
Sbjct:   259 GGIYHEPKCRKDTLDHGLLLVGYGYEGHESENRKYWLLKNSHGERWGENGYMKLPRGQNN 318

Query:   333 LCGIATEASYP 343
              CGIA+ A YP
Sbjct:   319 YCGIASYAMYP 329


>UNIPROTKB|F1P3U9
            symbol:CTSH "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005829 "cytosol" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=IEA] [GO:0010628 "positive regulation of gene expression"
            evidence=IEA] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=IEA] [GO:0010813 "neuropeptide catabolic
            process" evidence=IEA] [GO:0010815 "bradykinin catabolic process"
            evidence=IEA] [GO:0016505 "apoptotic protease activator activity"
            evidence=IEA] [GO:0030108 "HLA-A specific activating MHC class I
            receptor activity" evidence=IEA] [GO:0031638 "zymogen activation"
            evidence=IEA] [GO:0031648 "protein destabilization" evidence=IEA]
            [GO:0032526 "response to retinoic acid" evidence=IEA] [GO:0033619
            "membrane protein proteolysis" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0043129
            "surfactant homeostasis" evidence=IEA] [GO:0045766 "positive
            regulation of angiogenesis" evidence=IEA] [GO:0060448 "dichotomous
            subdivision of terminal units involved in lung branching"
            evidence=IEA] [GO:0070324 "thyroid hormone binding" evidence=IEA]
            [GO:0070371 "ERK1 and ERK2 cascade" evidence=IEA] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA]
            [GO:0097208 "alveolar lamellar body" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066
            GO:GO:0005615 GO:GO:0008284 GO:GO:0070371 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0032526 GO:GO:0010628
            GO:GO:0070324 GO:GO:0016505 GO:GO:0010634 GO:GO:0004197
            GO:GO:0042599 GO:GO:0031648 GO:GO:0097067 GO:GO:0031638
            GO:GO:0001913 GeneTree:ENSGT00660000095458 OMA:STSCHKT
            GO:GO:0030108 GO:GO:0010815 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 EMBL:AADN02038832 EMBL:AADN02038831 IPI:IPI00594147
            Ensembl:ENSGALT00000013440 Uniprot:F1P3U9
        Length = 261

 Score = 509 (184.2 bits), Expect = 8.5e-49, P = 8.5e-49
 Identities = 113/271 (41%), Positives = 153/271 (56%)

Query:    84 YKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGA- 142
             + +  N+FSD+T  EF+  Y                   F  ++    P ++DWR+KG  
Sbjct:     1 FLVALNQFSDMTFAEFKKLYLW-----SEPQNCSATRGNF-LRSDGPCPEAVDWRKKGNF 54

Query:   143 VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDK 200
             VT +KNQG CGSCW FS    +E    I  GKL+ L+EQ LVDC+   +N+GCSGGL  +
Sbjct:    55 VTPVKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQLLVDCAQAFNNHGCSGGLPSQ 114

Query:   201 AFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ- 259
             AFEYI+ NKGL  E  YPY+ + GTC  Q +KA A  +    ++ + DE  +++AV K  
Sbjct:   115 AFEYILYNKGLMGEDAYPYRAQNGTCKFQPDKAIAF-VKDVINITQYDEAGMVEAVGKHN 173

Query:   260 PVSVCVEASGQAFRFYKRGVL-NAECG---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSW 315
             PVS   E +   F  Y++GV  N  C    D  +H V  VG+G   EEDG  YW++KNSW
Sbjct:   174 PVSFAFEVTSD-FMHYRKGVYSNPRCEHTPDKVNHAVLAVGYG---EEDGRPYWIVKNSW 229

Query:   316 GETWGESGYIRILRDEGLCGIATEASYPVAM 346
             G  WG  GY  I R + +CG+A  ASYPV +
Sbjct:   230 GPLWGMDGYFLIERGKNMCGLAACASYPVPL 260


>TAIR|locus:2120222
            symbol:RD19 "RESPONSIVE TO DEHYDRATION 19" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009269 "response to desiccation" evidence=IEP] [GO:0006970
            "response to osmotic stress" evidence=IGI] [GO:0005634 "nucleus"
            evidence=IDA] [GO:0005773 "vacuole" evidence=IDA] [GO:0042742
            "defense response to bacterium" evidence=IMP] [GO:0006096
            "glycolysis" evidence=RCA] [GO:0006833 "water transport"
            evidence=RCA] [GO:0006972 "hyperosmotic response" evidence=RCA]
            [GO:0007030 "Golgi organization" evidence=RCA] [GO:0009266
            "response to temperature stimulus" evidence=RCA] [GO:0009651
            "response to salt stress" evidence=IEP;RCA] [GO:0046686 "response
            to cadmium ion" evidence=RCA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISS] [GO:0009414 "response to
            water deprivation" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005634 GO:GO:0005773 EMBL:CP002687
            GenomeReviews:CT486007_GR GO:GO:0009651 GO:GO:0042742
            eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            ProtClustDB:CLSN2688311 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AL035679 EMBL:AL161594 GO:GO:0004197
            MEROPS:C01.022 EMBL:D13042 EMBL:AY080598 EMBL:AY133844
            IPI:IPI00544363 PIR:JN0718 RefSeq:NP_568052.1 UniGene:At.2850
            UniGene:At.74924 ProteinModelPortal:P43296 SMR:P43296 STRING:P43296
            PaxDb:P43296 PRIDE:P43296 EnsemblPlants:AT4G39090.1 GeneID:830064
            KEGG:ath:AT4G39090 TAIR:At4g39090 InParanoid:P43296 OMA:EDFDWRD
            PhylomeDB:P43296 Genevestigator:P43296 GermOnline:AT4G39090
            Uniprot:P43296
        Length = 368

 Score = 504 (182.5 bits), Expect = 2.9e-48, P = 2.9e-48
 Identities = 121/335 (36%), Positives = 170/335 (50%)

Query:    26 QVVSGRSMHEPSIV--EKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK-EGN 81
             QVV G    EP ++  E H   +  + G+ Y    E   R ++FK NL    +  K + +
Sbjct:    35 QVVGGA---EPQVLTSEDHFSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPS 91

Query:    82 RTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKG 141
              T+  G  +FSDLT  EFR  + G                    +N+   P   DWR+ G
Sbjct:    92 ATH--GVTQFSDLTRSEFRKKHLGVRSGFKLPKDANKAPI-LPTENL---PEDFDWRDHG 145

Query:   142 AVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD---------NNG 192
             AVT +KNQG CGSCW+FSA  A+EG   +  GKL+ LSEQQLVDC  +         ++G
Sbjct:   146 AVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSG 205

Query:   193 CSGGLMDKAFEYIIENKGLATEADYPYQQEQG-TCDKQKEKAAAATIGKYEDLPKGDEHA 251
             C+GGLM+ AFEY ++  GL  E DYPY  + G TC   K K  A+ +  +  +   +E  
Sbjct:   206 CNGGLMNSAFEYTLKTGGLMKEEDYPYTGKDGKTCKLDKSKIVAS-VSNFSVISIDEEQI 264

Query:   252 LLQAVTKQPVSVCVEASGQAFRFYKRGVLNAE-CGDNCDHGVAVVGFGTAEEEDGA---- 306
                 V   P++V + A     + Y  GV     C    +HGV +VG+G A          
Sbjct:   265 AANLVKNGPLAVAINAG--YMQTYIGGVSCPYICTRRLNHGVLLVGYGAAGYAPARFKEK 322

Query:   307 KYWLIKNSWGETWGESGYIRILRDEGLCGIATEAS 341
              YW+IKNSWGETWGE+G+ +I +   +CG+ +  S
Sbjct:   323 PYWIIKNSWGETWGENGFYKICKGRNICGVDSMVS 357


>TAIR|locus:2082687
            symbol:AT3G54940 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002686 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HSSP:P53634
            OMA:GGGLMTN EMBL:AY070063 IPI:IPI00528988 RefSeq:NP_567010.5
            UniGene:At.28412 ProteinModelPortal:Q8VYS0 SMR:Q8VYS0 PRIDE:Q8VYS0
            EnsemblPlants:AT3G54940.2 GeneID:824659 KEGG:ath:AT3G54940
            TAIR:At3g54940 PhylomeDB:Q8VYS0 ProtClustDB:CLSN2718801
            ArrayExpress:Q8VYS0 Genevestigator:Q8VYS0 Uniprot:Q8VYS0
        Length = 367

 Score = 501 (181.4 bits), Expect = 6.0e-48, P = 6.0e-48
 Identities = 121/337 (35%), Positives = 168/337 (49%)

Query:    26 QVVSGRSMHEPSIVEKHEQ-----WMAQHGRTYKDELEKAMRLTIFKQN-LEYIEKANKE 79
             QV +      P+++  H +     +M+ +G+ Y    E   RL IF +N L+  E    +
Sbjct:    30 QVTADNRRIRPNLLGTHTESKFRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAAEHQMMD 89

Query:    80 GNRTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWRE 139
              +  +  G  +FSDLT EEF+  YTG                      V  +P   DWRE
Sbjct:    90 PSAVH--GVTQFSDLTEEEFKRMYTGVADVGGSRGGTVGAEAPMV--EVDGLPEDFDWRE 145

Query:   140 KGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC--STD-------N 190
             KG VT +KNQG CGSCWAFS   A EG   ++ GKL+ LSEQQLVDC  + D       +
Sbjct:   146 KGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQACDPKDKKACD 205

Query:   191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
             NGC GGLM  A+EY++E  GL  E  YPY  ++G C    EK A   +  +  +P  +  
Sbjct:   206 NGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCKFDPEKVAVRVLN-FTTIPLDENQ 264

Query:   251 ALLQAVTKQPVSVCVEASGQAFRFYKRGV-LNAECGD-NCDHGVAVVGFGTAEEE----D 304
                  V   P++V + A     + Y  GV     C   N +HGV +VG+G+         
Sbjct:   265 IAANLVRHGPLAVGLNAV--FMQTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSILRLS 322

Query:   305 GAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEAS 341
                YW+IKNSWG+ WGE+GY ++ R   +CGI +  S
Sbjct:   323 NKPYWIIKNSWGKKWGENGYYKLCRGHDICGINSMVS 359


>FB|FBgn0250848
            symbol:26-29-p "26-29kD-proteinase" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005811
            "lipid particle" evidence=IDA] [GO:0005875 "microtubule associated
            complex" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005875 EMBL:AE014296 GO:GO:0005811 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 MEROPS:I29.003 HSSP:O65039
            EMBL:AY122222 EMBL:AB011376 RefSeq:NP_620470.1 UniGene:Dm.3049
            SMR:Q9V3U6 MINT:MINT-890485 STRING:Q9V3U6
            EnsemblMetazoa:FBtr0075766 GeneID:39547 KEGG:dme:Dmel_CG8947
            UCSC:CG8947-RA CTD:39547 FlyBase:FBgn0250848 InParanoid:Q9V3U6
            OMA:IHSKNRA OrthoDB:EOG4BVQ8T GenomeRNAi:39547 NextBio:814210
            Uniprot:Q9V3U6
        Length = 549

 Score = 499 (180.7 bits), Expect = 9.8e-48, P = 9.8e-48
 Identities = 115/327 (35%), Positives = 165/327 (50%)

Query:    26 QVVSGRSMHEPSIVEK-HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTY 84
             + +SG   H    V+K    +  +HG  Y  + E   R  IF+QNL YI   N+    TY
Sbjct:   232 EFISGTDEH----VDKAFHHFKRKHGVAYHSDTEHEHRKNIFRQNLRYIHSKNR-AKLTY 286

Query:    85 KLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVT 144
              L  N  +D T EE +A   GY                 KY++  ++P   DWR  GAVT
Sbjct:   287 TLAVNHLADKTEEELKAR-RGYKSSGIYNTGKPFPYDVPKYKD--EIPDQYDWRLYGAVT 343

Query:   145 HIKNQGHCGSCWAFSAVAAVEGITQI-TGGKLIELSEQQLVDCST--DNNGCSGGLMDKA 201
              +K+Q  CGSCW+F  +  +EG   +  GG L+ LS+Q L+DCS    NNGC GG   + 
Sbjct:   344 PVKDQSVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDCSWAYGNNGCDGGEDFRV 403

Query:   202 FEYIIENKGLATEADY-PYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ- 259
             +++++++ G+ TE +Y PY  + G C        A   G + ++   D +A   A+ K  
Sbjct:   404 YQWMLQSGGVPTEEEYGPYLGQDGYCHVNNVTLVAPIKG-FVNVTSNDPNAFKLALLKHG 462

Query:   260 PVSVCVEASGQAFRFYKRGVL-NAECG---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSW 315
             P+SV ++AS + F FY  GV     C    D  DH V  VG+G+   ED   YWL+KNSW
Sbjct:   463 PLSVAIDASPKTFSFYSHGVYYEPTCKNDVDGLDHAVLAVGYGSINGED---YWLVKNSW 519

Query:   316 GETWGESGYIRILRDEGLCGIATEASY 342
                WG  GYI +   +  CG+ T  +Y
Sbjct:   520 STYWGNDGYILMSAKKNNCGVMTMPTY 546


>DICTYBASE|DDB_G0290957
            symbol:cprA "cysteine proteinase 1" species:44689
            "Dictyostelium discoideum" [GO:0006972 "hyperosmotic response"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0290957
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GenomeReviews:CM000154_GR GO:GO:0005764
            GO:GO:0006972 EMBL:AAFI02000174 KO:K01376 EMBL:X02407 PIR:A22827
            RefSeq:XP_635417.1 ProteinModelPortal:P04988 MEROPS:C01.022
            GlycoSuiteDB:P04988 SWISS-2DPAGE:P04988 EnsemblProtists:DDB0201647
            GeneID:8627918 KEGG:ddi:DDB_G0290957 OMA:KISNFTM
            ProtClustDB:CLSZ2429603 Uniprot:P04988
        Length = 343

 Score = 494 (179.0 bits), Expect = 3.3e-47, P = 3.3e-47
 Identities = 110/314 (35%), Positives = 157/314 (50%)

Query:    44 QWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK---EGNRTYKLGTNEFSDLTNEEFR 100
             ++  +  + Y  E E   R  IFK NL  IE+ N          K G N+F+DL+++EF+
Sbjct:    31 EFQDKFNKKYSHE-EYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFK 89

Query:   101 ASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSA 160
               Y   N                  + +  +PT+ DWR +GAVT +KNQG CGSCW+FS 
Sbjct:    90 NYYL--NNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFST 147

Query:   161 VAAVEGITQITGGKLIELSEQQLVDCSTD----------NNGCSGGLMDKAFEYIIENKG 210
                VEG   I+  KL+ LSEQ LVDC  +          + GC+GGL   A+ YII+N G
Sbjct:   148 TGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEQACDEGCNGGLQPNAYNYIIKNGG 207

Query:   211 LATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQ 270
             + TE+ YPY  E GT          A I  +  +PK +       V+  P+++  +A   
Sbjct:   208 IQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAV-- 265

Query:   271 AFRFYKRGVLNAECGDNC-DHGVAVVGFGTAEE--EDGAKYWLIKNSWGETWGESGYIRI 327
              ++FY  GV +  C  N  DHG+ +VG+            YW++KNSWG  WGE GYI +
Sbjct:   266 EWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYL 325

Query:   328 LRDEGLCGIATEAS 341
              R +  CG++   S
Sbjct:   326 RRGKNTCGVSNFVS 339


>GENEDB_PFALCIPARUM|PF11_0165
            symbol:PF11_0165 "falcipain 2 precursor"
            species:5833 "Plasmodium falciparum" [GO:0020020 "food vacuole"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014186 HSSP:O65039 GO:GO:0020020
            RefSeq:XP_001347836.1 ProteinModelPortal:Q8I6U4 SMR:Q8I6U4
            IntAct:Q8I6U4 MINT:MINT-1559493 MEROPS:C01.046
            EnsemblProtists:PF11_0165:mRNA GeneID:810712 KEGG:pfa:PF11_0165
            EuPathDB:PlasmoDB:PF3D7_1115700 HOGENOM:HOG000065857 OMA:NESLHAN
            ProtClustDB:PTZ00021 BindingDB:Q8I6U4 ChEMBL:CHEMBL3470
            Uniprot:Q8I6U4
        Length = 484

 Score = 490 (177.5 bits), Expect = 8.8e-47, P = 8.8e-47
 Identities = 122/334 (36%), Positives = 170/334 (50%)

Query:    33 MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
             M+    + +   ++  + + Y    E   R  +F QN   +   N   N  YK   N F+
Sbjct:   156 MNNAEHINQFYMFIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKELNRFA 215

Query:    93 DLTNEEFRASYTGYNXXX-XXXXXXXXXXXTF-----KYQ-NVTDVPTSIDWREKGAVTH 145
             DLT  EF+  Y                    +     KY+ N      + DWR    VT 
Sbjct:   216 DLTYHEFKNKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYKGNENFDHAAYDWRLHSGVTP 275

Query:   146 IKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYI 205
             +K+Q +CGSCWAFS++ +VE    I   KLI LSEQ+LVDCS  N GC+GGL++ AFE +
Sbjct:   276 VKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFKNYGCNGGLINNAFEDM 335

Query:   206 IENKGLATEADYPYQQEQ-GTC--DKQKEKAAAATIGKYEDLPKGDEHALLQAVT-KQPV 261
             IE  G+ T+ DYPY  +    C  D+  EK     I  Y  +P   ++ L +A+    P+
Sbjct:   336 IELGGICTDDDYPYVSDAPNLCNIDRCTEKYG---IKNYLSVP---DNKLKEALRFLGPI 389

Query:   262 SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAE-----EEDGAK--YWLIKNS 314
             S+ V  S   F FYK G+ + ECGD  +H V +VGFG  E      + G K  Y++IKNS
Sbjct:   390 SISVAVSDD-FAFYKEGIFDGECGDQLNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNS 448

Query:   315 WGETWGESGYIRILRDE-GL---CGIATEASYPV 344
             WG+ WGE G+I I  DE GL   CG+ T+A  P+
Sbjct:   449 WGQQWGERGFINIETDESGLMRKCGLGTDAFIPL 482


>UNIPROTKB|Q8I6U4
            symbol:PF11_0165 "Falcipain-2A" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            EMBL:AE014186 HSSP:O65039 GO:GO:0020020 RefSeq:XP_001347836.1
            ProteinModelPortal:Q8I6U4 SMR:Q8I6U4 IntAct:Q8I6U4
            MINT:MINT-1559493 MEROPS:C01.046 EnsemblProtists:PF11_0165:mRNA
            GeneID:810712 KEGG:pfa:PF11_0165 EuPathDB:PlasmoDB:PF3D7_1115700
            HOGENOM:HOG000065857 OMA:NESLHAN ProtClustDB:PTZ00021
            BindingDB:Q8I6U4 ChEMBL:CHEMBL3470 Uniprot:Q8I6U4
        Length = 484

 Score = 490 (177.5 bits), Expect = 8.8e-47, P = 8.8e-47
 Identities = 122/334 (36%), Positives = 170/334 (50%)

Query:    33 MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
             M+    + +   ++  + + Y    E   R  +F QN   +   N   N  YK   N F+
Sbjct:   156 MNNAEHINQFYMFIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKELNRFA 215

Query:    93 DLTNEEFRASYTGYNXXX-XXXXXXXXXXXTF-----KYQ-NVTDVPTSIDWREKGAVTH 145
             DLT  EF+  Y                    +     KY+ N      + DWR    VT 
Sbjct:   216 DLTYHEFKNKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYKGNENFDHAAYDWRLHSGVTP 275

Query:   146 IKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYI 205
             +K+Q +CGSCWAFS++ +VE    I   KLI LSEQ+LVDCS  N GC+GGL++ AFE +
Sbjct:   276 VKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFKNYGCNGGLINNAFEDM 335

Query:   206 IENKGLATEADYPYQQEQ-GTC--DKQKEKAAAATIGKYEDLPKGDEHALLQAVT-KQPV 261
             IE  G+ T+ DYPY  +    C  D+  EK     I  Y  +P   ++ L +A+    P+
Sbjct:   336 IELGGICTDDDYPYVSDAPNLCNIDRCTEKYG---IKNYLSVP---DNKLKEALRFLGPI 389

Query:   262 SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAE-----EEDGAK--YWLIKNS 314
             S+ V  S   F FYK G+ + ECGD  +H V +VGFG  E      + G K  Y++IKNS
Sbjct:   390 SISVAVSDD-FAFYKEGIFDGECGDQLNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNS 448

Query:   315 WGETWGESGYIRILRDE-GL---CGIATEASYPV 344
             WG+ WGE G+I I  DE GL   CG+ T+A  P+
Sbjct:   449 WGQQWGERGFINIETDESGLMRKCGLGTDAFIPL 482


>ZFIN|ZDB-GENE-030131-9831
            symbol:ctsf "cathepsin F" species:7955 "Danio
            rerio" [GO:0004869 "cysteine-type endopeptidase inhibitor activity"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000010 InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00031 Pfam:PF00112 PRINTS:PR00705 SMART:SM00043
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-030131-9831
            GO:GO:0004869 eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 HOVERGEN:HBG011513 CTD:8722 OrthoDB:EOG4CC41T
            MEROPS:I25.006 EMBL:BC124243 IPI:IPI00503226 RefSeq:NP_001071036.1
            UniGene:Dr.81265 ProteinModelPortal:Q08CH0 SMR:Q08CH0 GeneID:565588
            KEGG:dre:565588 InParanoid:Q08CH0 NextBio:20885952
            ArrayExpress:Q08CH0 Uniprot:Q08CH0
        Length = 473

 Score = 486 (176.1 bits), Expect = 2.3e-46, P = 2.3e-46
 Identities = 118/320 (36%), Positives = 162/320 (50%)

Query:    24 ASQVVSGRSMHEP-SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE--KANKEG 80
             A  +   + M E   ++   + +M  + RTY  + E   RL IF+QN++  +  ++ ++G
Sbjct:   156 AVPLTHSKPMKESVELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQG 215

Query:    81 NRTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREK 140
             +  Y  G  +FSDLT +EFR  Y   N                        P + DWR+ 
Sbjct:   216 SAEY--GITKFSDLTEDEFRMMYL--NPMLSQWSLKKEMKPAIPAS--APAPDTWDWRDH 269

Query:   141 GAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDK 200
             GAV+ +KNQG CGSCWAFS    +EG      G+L+ LSEQ+LVDC   +  C GGL   
Sbjct:   270 GAVSPVKNQGMCGSCWAFSVTGNIEGQWFKKTGQLLSLSEQELVDCDKLDQACGGGLPSN 329

Query:   201 AFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQP 260
             A+E I    GL TE DY Y   + +CD    K AA  I    +LPK ++          P
Sbjct:   330 AYEAIENLGGLETETDYSYTGHKQSCDFSTGKVAAY-INSSVELPKDEKEIAAFLAENGP 388

Query:   261 VSVCVEASGQAFRFYKRGV---LNAECGD-NCDHGVAVVGFGTAEEEDGAKYWLIKNSWG 316
             VS  + A   A +FY++GV   L   C     DH V +VGFG   + +G  +W IKNSWG
Sbjct:   389 VSAALNAF--AMQFYRKGVSHPLKIFCNPWMIDHAVLLVGFG---QRNGVPFWAIKNSWG 443

Query:   317 ETWGESGYIRILRDEGLCGI 336
             E +GE GY  + R  GLCGI
Sbjct:   444 EDYGEQGYYYLYRGSGLCGI 463


>TAIR|locus:2130180
            symbol:AT4G16190 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0042744 "hydrogen peroxide catabolic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005773
            EMBL:CP002687 HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 EMBL:Z97340 EMBL:AL161543 UniGene:At.25555
            EMBL:AY039556 EMBL:AY129473 EMBL:AY136316 EMBL:BT000733
            EMBL:AK226366 IPI:IPI00543588 PIR:D71428 RefSeq:NP_567489.1
            HSSP:P25779 ProteinModelPortal:Q9SUL1 SMR:Q9SUL1 STRING:Q9SUL1
            MEROPS:C01.A06 PRIDE:Q9SUL1 EnsemblPlants:AT4G16190.1 GeneID:827311
            KEGG:ath:AT4G16190 TAIR:At4g16190 InParanoid:Q9SUL1 OMA:NACGINK
            PhylomeDB:Q9SUL1 ProtClustDB:CLSN2917559 Genevestigator:Q9SUL1
            Uniprot:Q9SUL1
        Length = 373

 Score = 485 (175.8 bits), Expect = 3.0e-46, P = 3.0e-46
 Identities = 117/337 (34%), Positives = 171/337 (50%)

Query:    26 QVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTY 84
             QVV   +  +    E H   + +++ +TY  ++E   R  +FK NL    + N+  + + 
Sbjct:    38 QVVPEENDEQLLNAEHHFTLFKSKYEKTYATQVEHDHRFRVFKANLRRARR-NQLLDPSA 96

Query:    85 KLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVT 144
               G  +FSDLT +EFR  + G                T      +D+PT  DWRE+GAVT
Sbjct:    97 VHGVTQFSDLTPKEFRRKFLGLKRRGFRLPTDTQ---TAPILPTSDLPTEFDWREQGAVT 153

Query:   145 HIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD---------NNGCSG 195
              +KNQG CGSCW+FSA+ A+EG   +   +L+ LSEQQLVDC  +         ++GCSG
Sbjct:   154 PVKNQGMCGSCWSFSAIGALEGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSG 213

Query:   196 GLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQA 255
             GLM+ AFEY ++  GL  E DYPY     T  K  +    A++  +  +   ++      
Sbjct:   214 GLMNNAFEYALKAGGLMKEEDYPYTGRDHTACKFDKSKIVASVSNFSVVSSDEDQIAANL 273

Query:   256 VTKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEED----GAKYWL 310
             V   P+++ + A     + Y  GV     C  + DHGV +VGFG++           YW+
Sbjct:   274 VQHGPLAIAINAMWM--QTYIGGVSCPYVCSKSQDHGVLLVGFGSSGYAPIRLKEKPYWI 331

Query:   311 IKNSWGETWGESGYIRILRD-EGLCGIATEASYPVAM 346
             IKNSWG  WGE GY +I R    +CG+ T  S   A+
Sbjct:   332 IKNSWGAMWGEHGYYKICRGPHNMCGMDTMVSTVAAV 368


>GENEDB_PFALCIPARUM|PF11_0161
            symbol:PF11_0161 "falcipain-2 precursor,
            putative" species:5833 "Plasmodium falciparum" [GO:0020020 "food
            vacuole" evidence=TAS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014186 HSSP:O65039 OMA:NNVEHIN GO:GO:0020020
            MEROPS:C01.046 HOGENOM:HOG000065857 ProtClustDB:PTZ00021
            RefSeq:XP_001347832.1 ProteinModelPortal:Q8I6U5 SMR:Q8I6U5
            IntAct:Q8I6U5 MINT:MINT-1546851 EnsemblProtists:PF11_0161:mRNA
            GeneID:810708 KEGG:pfa:PF11_0161 EuPathDB:PlasmoDB:PF3D7_1115300
            Uniprot:Q8I6U5
        Length = 482

 Score = 484 (175.4 bits), Expect = 3.8e-46, P = 3.8e-46
 Identities = 121/334 (36%), Positives = 173/334 (51%)

Query:    33 MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
             M+    + +   ++  + + Y    E   R  +F QN   ++  N      YK   N F+
Sbjct:   154 MNNVEHINQFYTFIKTNNKQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSLYKKELNRFA 213

Query:    93 DLTNEEFRASY-TGYNXXXXXXXXXXXXXXTF-----KYQ-NVTDVPTSIDWREKGAVTH 145
             DLT  EF++ Y T  +               +     KY+ N      + DWR    VT 
Sbjct:   214 DLTYHEFKSKYLTLRSSKPLKNSKYLLDQINYDAVIKKYKGNENFDHAAYDWRLHSGVTP 273

Query:   146 IKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYI 205
             +K+Q +CGSCWAFS++ +VE    I   KLI LSEQ+LVDCS  N GC+GGL++ AFE +
Sbjct:   274 VKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFKNYGCNGGLINNAFEDM 333

Query:   206 IENKGLATEADYPYQQEQ-GTC--DKQKEKAAAATIGKYEDLPKGDEHALLQAVT-KQPV 261
             IE  G+ T+ DYPY  +    C  D+  EK     I  Y  +P   ++ L +A+    P+
Sbjct:   334 IELGGICTDDDYPYVSDAPNLCNIDRCTEKYG---IKNYLSVP---DNKLKEALRFLGPI 387

Query:   262 SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAE-----EEDGAK--YWLIKNS 314
             S+ +  S   F FYK G+ + ECGD  +H V +VGFG  E      + G K  Y++IKNS
Sbjct:   388 SISIAVSDD-FPFYKEGIFDGECGDELNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNS 446

Query:   315 WGETWGESGYIRILRDE-GL---CGIATEASYPV 344
             WG+ WGE G+I I  DE GL   CG+ T+A  P+
Sbjct:   447 WGQQWGERGFINIETDESGLMRKCGLGTDAFIPL 480


>UNIPROTKB|Q8I6U5
            symbol:PF11_0161 "Falcipain-2B" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            EMBL:AE014186 HSSP:O65039 OMA:NNVEHIN GO:GO:0020020 MEROPS:C01.046
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347832.1
            ProteinModelPortal:Q8I6U5 SMR:Q8I6U5 IntAct:Q8I6U5
            MINT:MINT-1546851 EnsemblProtists:PF11_0161:mRNA GeneID:810708
            KEGG:pfa:PF11_0161 EuPathDB:PlasmoDB:PF3D7_1115300 Uniprot:Q8I6U5
        Length = 482

 Score = 484 (175.4 bits), Expect = 3.8e-46, P = 3.8e-46
 Identities = 121/334 (36%), Positives = 173/334 (51%)

Query:    33 MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
             M+    + +   ++  + + Y    E   R  +F QN   ++  N      YK   N F+
Sbjct:   154 MNNVEHINQFYTFIKTNNKQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSLYKKELNRFA 213

Query:    93 DLTNEEFRASY-TGYNXXXXXXXXXXXXXXTF-----KYQ-NVTDVPTSIDWREKGAVTH 145
             DLT  EF++ Y T  +               +     KY+ N      + DWR    VT 
Sbjct:   214 DLTYHEFKSKYLTLRSSKPLKNSKYLLDQINYDAVIKKYKGNENFDHAAYDWRLHSGVTP 273

Query:   146 IKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYI 205
             +K+Q +CGSCWAFS++ +VE    I   KLI LSEQ+LVDCS  N GC+GGL++ AFE +
Sbjct:   274 VKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFKNYGCNGGLINNAFEDM 333

Query:   206 IENKGLATEADYPYQQEQ-GTC--DKQKEKAAAATIGKYEDLPKGDEHALLQAVT-KQPV 261
             IE  G+ T+ DYPY  +    C  D+  EK     I  Y  +P   ++ L +A+    P+
Sbjct:   334 IELGGICTDDDYPYVSDAPNLCNIDRCTEKYG---IKNYLSVP---DNKLKEALRFLGPI 387

Query:   262 SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAE-----EEDGAK--YWLIKNS 314
             S+ +  S   F FYK G+ + ECGD  +H V +VGFG  E      + G K  Y++IKNS
Sbjct:   388 SISIAVSDD-FPFYKEGIFDGECGDELNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNS 446

Query:   315 WGETWGESGYIRILRDE-GL---CGIATEASYPV 344
             WG+ WGE G+I I  DE GL   CG+ T+A  P+
Sbjct:   447 WGQQWGERGFINIETDESGLMRKCGLGTDAFIPL 480


>DICTYBASE|DDB_G0291191
            symbol:DDB_G0291191 "cysteine protease" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0291191
            GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AAFI02000175 MEROPS:C01.022
            ProtClustDB:CLSZ2429603 RefSeq:XP_635374.1
            ProteinModelPortal:Q54F16 PRIDE:Q54F16 EnsemblProtists:DDB0252831
            GeneID:8628022 KEGG:ddi:DDB_G0291191 OMA:NETQIAS Uniprot:Q54F16
        Length = 352

 Score = 482 (174.7 bits), Expect = 6.2e-46, P = 6.2e-46
 Identities = 110/319 (34%), Positives = 158/319 (49%)

Query:    48 QHGRTYKDELEKAMRLTIFKQNLEYIEKANKE----GNRTYKLGTNEFSDLTNEEFRASY 103
             ++ + Y  E E  ++   FK NL  I+  NK+    G+ T K G N+F+DL+ EEF+  Y
Sbjct:    33 KYNKIYSAE-EYLVKFETFKSNLLNIDALNKQATTIGSDT-KFGVNKFADLSKEEFKKYY 90

Query:   104 TGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGA---------VTHIKNQGHCGS 154
                +                    ++  P + DWR  G          VT +KNQG CGS
Sbjct:    91 L--SSKEARLTDDLPMLPNLSDDIISATPAAFDWRNTGGSTKFPQGTPVTAVKNQGQCGS 148

Query:   155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDC----------STDNNGCSGGLMDKAFEY 204
             CW+FS    VEG   ++ G L+ LSEQ LVDC          +  N GC GGL   A+ Y
Sbjct:   149 CWSFSTTGNVEGQHYLSTGTLVGLSEQNLVDCDHTCMTYENENVCNAGCDGGLQPNAYNY 208

Query:   205 IIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVC 264
             II+N G+ TEA YPY    G C K       A I  +  +P+ +           P+++ 
Sbjct:   209 IIKNGGIQTEATYPYTAVDGEC-KFNSAQVGAKISSFTMVPQNETQIASYLFNNGPLAIA 267

Query:   265 VEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK--YWLIKNSWGETWGES 322
              +A  + ++FY  GV +  CG   DHG+ +VG+G  +   G    YW+IKNSWG  WGE+
Sbjct:   268 ADA--EEWQFYMGGVFDFPCGQTLDHGILIVGYGAQDTIVGKNTPYWIIKNSWGADWGEA 325

Query:   323 GYIRILRDEGLCGIATEAS 341
             GY+++ R+   CG+A   S
Sbjct:   326 GYLKVERNTDKCGVANFVS 344


>GENEDB_PFALCIPARUM|PF11_0162
            symbol:PF11_0162 "falcipain-3" species:5833
            "Plasmodium falciparum" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 HSSP:P43235 EMBL:AE014186 GO:GO:0020020
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347833.1
            ProteinModelPortal:Q8IIL0 SMR:Q8IIL0 MEROPS:C01.063
            EnsemblProtists:PF11_0162:mRNA GeneID:810709 KEGG:pfa:PF11_0162
            EuPathDB:PlasmoDB:PF3D7_1115400 OMA:ENDEDYW ChEMBL:CHEMBL1250373
            Uniprot:Q8IIL0
        Length = 492

 Score = 480 (174.0 bits), Expect = 1.0e-45, P = 1.0e-45
 Identities = 117/321 (36%), Positives = 160/321 (49%)

Query:    45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYT 104
             ++ ++ + Y+   E   R  IF +N   IE  NK+ N  YK G N+F DL+ EEFR+ Y 
Sbjct:   174 FLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFGDLSPEEFRSKYL 233

Query:   105 GYNXXXXXXXXXXXXXXTFKYQNV------TDVPT---SIDWREKGAVTHIKNQGHCGSC 155
                                 Y++V       D      + DWR  G VT +K+Q  CGSC
Sbjct:   234 NLKTHGPFKTLSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGGVTPVKDQALCGSC 293

Query:   156 WAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEA 215
             WAFS+V +VE    I    L   SEQ+LVDCS  NNGC GG +  AF+ +I+  GL ++ 
Sbjct:   294 WAFSSVGSVESQYAIRKKALFLFSEQELVDCSVKNNGCYGGYITNAFDDMIDLGGLCSQD 353

Query:   216 DYPYQQE-QGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRF 274
             DYPY      TC+  K      TI  Y  +P       L+ +   P+S+ + AS   F F
Sbjct:   354 DYPYVSNLPETCNL-KRCNERYTIKSYVSIPDDKFKEALRYLG--PISISIAASDD-FAF 409

Query:   275 YKRGVLNAECGDNCDHGVAVVGFGTAE--EEDGAK-----YWLIKNSWGETWGESGYIRI 327
             Y+ G  + ECG   +H V +VG+G  +   ED  +     Y++IKNSWG  WGE GYI +
Sbjct:   410 YRGGFYDGECGAAPNHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWGEGGYINL 469

Query:   328 LRDEG----LCGIATEASYPV 344
               DE      C I TEA  P+
Sbjct:   470 ETDENGYKKTCSIGTEAYVPL 490


>UNIPROTKB|Q8IIL0
            symbol:PF11_0162 "Falcipain-3" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 HSSP:P43235 EMBL:AE014186 GO:GO:0020020
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347833.1
            ProteinModelPortal:Q8IIL0 SMR:Q8IIL0 MEROPS:C01.063
            EnsemblProtists:PF11_0162:mRNA GeneID:810709 KEGG:pfa:PF11_0162
            EuPathDB:PlasmoDB:PF3D7_1115400 OMA:ENDEDYW ChEMBL:CHEMBL1250373
            Uniprot:Q8IIL0
        Length = 492

 Score = 480 (174.0 bits), Expect = 1.0e-45, P = 1.0e-45
 Identities = 117/321 (36%), Positives = 160/321 (49%)

Query:    45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYT 104
             ++ ++ + Y+   E   R  IF +N   IE  NK+ N  YK G N+F DL+ EEFR+ Y 
Sbjct:   174 FLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFGDLSPEEFRSKYL 233

Query:   105 GYNXXXXXXXXXXXXXXTFKYQNV------TDVPT---SIDWREKGAVTHIKNQGHCGSC 155
                                 Y++V       D      + DWR  G VT +K+Q  CGSC
Sbjct:   234 NLKTHGPFKTLSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGGVTPVKDQALCGSC 293

Query:   156 WAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEA 215
             WAFS+V +VE    I    L   SEQ+LVDCS  NNGC GG +  AF+ +I+  GL ++ 
Sbjct:   294 WAFSSVGSVESQYAIRKKALFLFSEQELVDCSVKNNGCYGGYITNAFDDMIDLGGLCSQD 353

Query:   216 DYPYQQE-QGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRF 274
             DYPY      TC+  K      TI  Y  +P       L+ +   P+S+ + AS   F F
Sbjct:   354 DYPYVSNLPETCNL-KRCNERYTIKSYVSIPDDKFKEALRYLG--PISISIAASDD-FAF 409

Query:   275 YKRGVLNAECGDNCDHGVAVVGFGTAE--EEDGAK-----YWLIKNSWGETWGESGYIRI 327
             Y+ G  + ECG   +H V +VG+G  +   ED  +     Y++IKNSWG  WGE GYI +
Sbjct:   410 YRGGFYDGECGAAPNHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWGEGGYINL 469

Query:   328 LRDEG----LCGIATEASYPV 344
               DE      C I TEA  P+
Sbjct:   470 ETDENGYKKTCSIGTEAYVPL 490


>MGI|MGI:1861434
            symbol:Ctsf "cathepsin F" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=ISS]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISS] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:1861434 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513 MEROPS:C01.018
            CTD:8722 OMA:LAPPEWD OrthoDB:EOG4CC41T EMBL:AF136280 EMBL:AF217224
            EMBL:AJ131851 EMBL:AK075862 EMBL:BC058758 IPI:IPI00126769
            RefSeq:NP_063914.1 UniGene:Mm.29561 ProteinModelPortal:Q9R013
            SMR:Q9R013 STRING:Q9R013 PhosphoSite:Q9R013 PaxDb:Q9R013
            PRIDE:Q9R013 Ensembl:ENSMUST00000119694 GeneID:56464 KEGG:mmu:56464
            UCSC:uc008gbc.1 GeneTree:ENSGT00660000095458 InParanoid:Q9R013
            NextBio:312722 Bgee:Q9R013 CleanEx:MM_CTSF Genevestigator:Q9R013
            GermOnline:ENSMUSG00000006458 Uniprot:Q9R013
        Length = 462

 Score = 479 (173.7 bits), Expect = 1.3e-45, P = 1.3e-45
 Identities = 113/307 (36%), Positives = 150/307 (48%)

Query:    43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
             + +M  + RTY+   E   RLT+F +N+   +K       T + G  +FSDLT EEF   
Sbjct:   166 KDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHTI 225

Query:   103 YTGYNXXXXXXXXXXXXXXTFKYQNVTDV-PTSIDWREKGAVTHIKNQGHCGSCWAFSAV 161
             Y   N                  +++ D+ P   DWR+KGAVT +KNQG CGSCWAFS  
Sbjct:   226 YL--NPLLQKESGRKMSPA----KSINDLAPPEWDWRKKGAVTEVKNQGMCGSCWAFSVT 279

Query:   162 AAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQ 221
               VEG   +  G L+ LSEQ+L+DC   +  C GGL   A+  I    GL TE DY YQ 
Sbjct:   280 GNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGLPSNAYAAIKNLGGLETEDDYGYQG 339

Query:   222 EQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLN 281
                TC+   + A    I    +L + +         K P+SV + A G   +FY+ G+ +
Sbjct:   340 HVQTCNFSAQMAKVY-INDSVELSRNENKIAAWLAQKGPISVAINAFGM--QFYRHGIAH 396

Query:   282 AE---CGD-NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIA 337
                  C     DH V +VG+G         YW IKNSWG  WGE GY  + R  G CG+ 
Sbjct:   397 PFRPLCSPWFIDHAVLLVGYGN---RSNIPYWAIKNSWGSDWGEEGYYYLYRGSGACGVN 453

Query:   338 TEASYPV 344
             T AS  V
Sbjct:   454 TMASSAV 460


>UNIPROTKB|Q0VCU3
            symbol:CTSF "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            HOVERGEN:HBG011513 MEROPS:C01.018 CTD:8722 OMA:LAPPEWD
            OrthoDB:EOG4CC41T GeneTree:ENSGT00660000095458 EMBL:DAAA02063594
            EMBL:BC120003 IPI:IPI00717812 RefSeq:NP_001068884.1 UniGene:Bt.7264
            SMR:Q0VCU3 Ensembl:ENSBTAT00000014587 GeneID:509715 KEGG:bta:509715
            InParanoid:Q0VCU3 NextBio:20869091 Uniprot:Q0VCU3
        Length = 460

 Score = 478 (173.3 bits), Expect = 1.6e-45, P = 1.6e-45
 Identities = 114/307 (37%), Positives = 150/307 (48%)

Query:    43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
             + ++  + RTY  + E + R+++F  N+   +K       T + G  +FSDLT EEFR  
Sbjct:   164 KDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTARYGVTKFSDLTEEEFRTI 223

Query:   103 YTGYNXXXXXXXXXXXXXXTFKYQNVTDVPT-SIDWREKGAVTHIKNQGHCGSCWAFSAV 161
             Y   N                  Q VTDVP    DWR KGAVT++K+QG CGSCWAFS  
Sbjct:   224 YL--NPLLKDAPGRNMRPA----QPVTDVPPPQWDWRNKGAVTNVKDQGMCGSCWAFSVT 277

Query:   162 AAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQ 221
               VEG   +  G L+ LSEQ+L+DC   +  C GGL   A+  I    GL TE DY Y+ 
Sbjct:   278 GNVEGQWFLKRGTLLSLSEQELLDCDKTDKACLGGLPSNAYSAIRTLGGLETEDDYSYRG 337

Query:   222 EQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGV-- 279
                TC    EKA    I    +L K ++          PVS+ + A G   +FY+ G+  
Sbjct:   338 RLQTCSFSAEKAKVY-INDSVELSKNEQKLAAWLAKNGPVSIAINAFGM--QFYRHGISH 394

Query:   280 -LNAECGD-NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIA 337
              L   C     DH V +VG+G         +W IKNSWG  WGE GY  + R  G CG+ 
Sbjct:   395 PLRPLCSPWLIDHAVLLVGYGN---RSAIPFWAIKNSWGTDWGEEGYYYLHRGSGACGVN 451

Query:   338 TEASYPV 344
               AS  V
Sbjct:   452 IMASSAV 458


>TAIR|locus:2050145
            symbol:AT2G21430 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002685 GenomeReviews:CT485783_GR
            EMBL:AC006841 EMBL:X74359 IPI:IPI00519637 PIR:B84601
            RefSeq:NP_565512.1 UniGene:At.14069 ProteinModelPortal:P43295
            SMR:P43295 MEROPS:C01.A04 PRIDE:P43295 EnsemblPlants:AT2G21430.1
            GeneID:816682 KEGG:ath:AT2G21430 TAIR:At2g21430 eggNOG:COG4870
            HOGENOM:HOG000230774 InParanoid:P43295 KO:K01373 OMA:GSIEEHY
            PhylomeDB:P43295 ProtClustDB:CLSN2688311 Genevestigator:P43295
            GermOnline:AT2G21430 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 Uniprot:P43295
        Length = 361

 Score = 476 (172.6 bits), Expect = 2.7e-45, P = 2.7e-45
 Identities = 108/320 (33%), Positives = 164/320 (51%)

Query:    35 EPSIVEKHEQWMA---QHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEF 91
             EP ++   + +     + G+ Y    E   R ++FK NL    +  K  + + + G  +F
Sbjct:    38 EPKVLSSEDHFTLFKKKFGKVYGSIEEHYYRFSVFKANLLRAMRHQKM-DPSARHGVTQF 96

Query:    92 SDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
             SDLT  EFR  + G                    QN+   P   DWR++GAVT +KNQG 
Sbjct:    97 SDLTRSEFRRKHLGVKGGFKLPKDANQAPI-LPTQNL---PEEFDWRDRGAVTPVKNQGS 152

Query:   152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD---------NNGCSGGLMDKAF 202
             CGSCW+FS   A+EG   +  GKL+ LSEQQLVDC  +         ++GC+GGLM+ AF
Sbjct:   153 CGSCWSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAF 212

Query:   203 EYIIENKGLATEADYPYQ-QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
             EY ++  GL  E DYPY   + G+C   + K  A+ +  +  +   ++      +   P+
Sbjct:   213 EYTLKTGGLMREKDYPYTGTDGGSCKLDRSKIVAS-VSNFSVVSINEDQIAANLIKNGPL 271

Query:   262 SVCVEASGQAFRFYKRGVLNAE-CGDNCDHGVAVVGFGTAEEEDGA----KYWLIKNSWG 316
             +V + A+    + Y  GV     C    +HGV +VG+G+A           YW+IKNSWG
Sbjct:   272 AVAINAA--YMQTYIGGVSCPYICSRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWG 329

Query:   317 ETWGESGYIRILRDEGLCGI 336
             E+WGE+G+ +I +   +CG+
Sbjct:   330 ESWGENGFYKICKGRNICGV 349


>UNIPROTKB|Q9UBX1
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=TAS] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_6900 GO:GO:0019886 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0043202
            GO:GO:0004197 HOVERGEN:HBG011513 EMBL:AJ007331 EMBL:AF088886
            EMBL:AF132894 EMBL:AF136279 EMBL:AF071748 EMBL:AF071749
            EMBL:AK313657 EMBL:BC011682 EMBL:BC036451 EMBL:AL137742
            IPI:IPI00002816 RefSeq:NP_003784.2 UniGene:Hs.11590 PDB:1D5U
            PDB:1M6D PDBsum:1D5U PDBsum:1M6D ProteinModelPortal:Q9UBX1
            SMR:Q9UBX1 STRING:Q9UBX1 MEROPS:C01.018 PhosphoSite:Q9UBX1
            DMDM:12643325 PaxDb:Q9UBX1 PeptideAtlas:Q9UBX1 PRIDE:Q9UBX1
            DNASU:8722 Ensembl:ENST00000310325 GeneID:8722 KEGG:hsa:8722
            UCSC:uc001oip.3 CTD:8722 GeneCards:GC11M066332 HGNC:HGNC:2531
            HPA:CAB002141 MIM:603539 neXtProt:NX_Q9UBX1 PharmGKB:PA27031
            InParanoid:Q9UBX1 OMA:LAPPEWD OrthoDB:EOG4CC41T PhylomeDB:Q9UBX1
            BindingDB:Q9UBX1 ChEMBL:CHEMBL2517 ChiTaRS:CTSF
            EvolutionaryTrace:Q9UBX1 GenomeRNAi:8722 NextBio:32715
            ArrayExpress:Q9UBX1 Bgee:Q9UBX1 CleanEx:HS_CTSF
            Genevestigator:Q9UBX1 GermOnline:ENSG00000174080 Uniprot:Q9UBX1
        Length = 484

 Score = 475 (172.3 bits), Expect = 3.4e-45, P = 3.4e-45
 Identities = 112/307 (36%), Positives = 153/307 (49%)

Query:    43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
             + ++  + RTY+ + E   RL++F  N+   +K       T + G  +FSDLT EEFR  
Sbjct:   188 KNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTI 247

Query:   103 YTGYNXXXXXXXXXXXXXXTFKYQNVTDV-PTSIDWREKGAVTHIKNQGHCGSCWAFSAV 161
             Y   N                  ++V D+ P   DWR KGAVT +K+QG CGSCWAFS  
Sbjct:   248 YL--NTLLRKEPGNKMKQA----KSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVT 301

Query:   162 AAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQ 221
               VEG   +  G L+ LSEQ+L+DC   +  C GGL   A+  I    GL TE DY YQ 
Sbjct:   302 GNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQG 361

Query:   222 EQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGV-- 279
                +C+   EKA    I    +L + ++        + P+SV + A G   +FY+ G+  
Sbjct:   362 HMQSCNFSAEKAKVY-INDSVELSQNEQKLAAWLAKRGPISVAINAFGM--QFYRHGISR 418

Query:   280 -LNAECGD-NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIA 337
              L   C     DH V +VG+G   +     +W IKNSWG  WGE GY  + R  G CG+ 
Sbjct:   419 PLRPLCSPWLIDHAVLLVGYGNRSD---VPFWAIKNSWGTDWGEKGYYYLHRGSGACGVN 475

Query:   338 TEASYPV 344
             T AS  V
Sbjct:   476 TMASSAV 482


>DICTYBASE|DDB_G0272742
            symbol:DDB_G0272742 species:44689 "Dictyostelium
            discoideum" [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0272742 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639 EMBL:AAFI02000008
            eggNOG:NOG331187 RefSeq:XP_644986.1 ProteinModelPortal:Q7KWP5
            PRIDE:Q7KWP5 EnsemblProtists:DDB0168242 GeneID:8618663
            KEGG:ddi:DDB_G0272742 InParanoid:Q7KWP5 OMA:ATESAHF Uniprot:Q7KWP5
        Length = 345

 Score = 474 (171.9 bits), Expect = 4.4e-45, P = 4.4e-45
 Identities = 113/317 (35%), Positives = 167/317 (52%)

Query:    45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASY- 103
             WM  + RTY    E   R   FK NL++I + N +G++T  L  NEF+D++NEE+R +Y 
Sbjct:    32 WMTSNQRTYASS-EFTNRYNTFKSNLDFINQWNSKGSKTV-LALNEFADISNEEYRKNYL 89

Query:   104 ---TGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQ-GHCGSCWAFS 159
                   N                   +     + IDWR+KGAV  +K+Q G CGS W  +
Sbjct:    90 RNDNNINKLSSLLINDKEDKEIKSSSSSGSGSSGIDWRKKGAVPSVKSQIGGCGS-WPIT 148

Query:   160 AVAAVEGITQITGGK--LIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADY 217
             AV A E    +   K   I LS Q L+DCS  N  C  G +++AF+YIIEN G+ +E  Y
Sbjct:   149 AVGATESAHFLANPKDPFISLSMQNLIDCSNLNKQCYQGTVNEAFQYIIENGGIDSEESY 208

Query:   218 PYQQ-EQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
              +   E G C K     + A I  YE +  G E +L  AV+ +PV+  ++AS  +F+FY 
Sbjct:   209 KFSGGEPGKC-KYNSSNSVAKITSYEKVKSGSESSLESAVSLKPVAAYIDASLSSFQFYS 267

Query:   277 RGVL-NAECGD-NCDHGVAVVGFG--TAEEEDGAK----YWLIKNSWGETWGESGYIRIL 328
              G+     C   + +H + +VGF   +    D  K    YW+++NS+G+ WGE+GYI + 
Sbjct:   268 SGIYYEPSCNSTDLNHSILIVGFSDFSTTPTDSLKHSSNYWIVQNSFGKNWGENGYIFMS 327

Query:   329 RD-EGLCGIATEASYPV 344
             +D +  CGI+  ASY +
Sbjct:   328 KDRDDNCGISKMASYVI 344


>UNIPROTKB|E2RR02
            symbol:CTSF "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            OMA:LAPPEWD GeneTree:ENSGT00660000095458 EMBL:AAEX03011628
            Ensembl:ENSCAFT00000019742 Uniprot:E2RR02
        Length = 460

 Score = 472 (171.2 bits), Expect = 7.1e-45, P = 7.1e-45
 Identities = 111/308 (36%), Positives = 153/308 (49%)

Query:    43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
             ++++  + RTY+ + E   R+++F  N+   +K       T + G  +FSDLT EEFR  
Sbjct:   163 KEFVTTYNRTYETKEEAEWRMSVFSNNMVRAQKIQALDRGTAQYGITKFSDLTEEEFRTI 222

Query:   103 YTGYNXXXXXXXXXXXXXXTFKYQNVTD--VPTSIDWREKGAVTHIKNQGHCGSCWAFSA 160
             Y   N                  ++++D   P   DWR KGAVT +K+QG CGSCWAFS 
Sbjct:   223 YL--NPLLRENRGKKMRLA----KSISDHAPPPEWDWRSKGAVTKVKDQGMCGSCWAFSV 276

Query:   161 VAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQ 220
                VEG   +  G L+ LSEQ+L+DC   +  C GGL   A+  I+   GL TE DY YQ
Sbjct:   277 TGNVEGQWFLKEGTLLSLSEQELLDCDKVDKACLGGLPSNAYSAIMTLGGLETEDDYSYQ 336

Query:   221 QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGV- 279
                  C    +KA    I    +L + ++        K P+SV + A G   +FY+ G+ 
Sbjct:   337 GHLQACSFSAKKARVY-INDSMELSQNEQKLAAWLAKKGPISVAINAFGM--QFYRHGIS 393

Query:   280 --LNAECGD-NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGI 336
               L   C     DH V +VG+G      G  +W IKNSWG  WGE GY  + R  G CG+
Sbjct:   394 HPLRPLCSPWLIDHAVLLVGYGN---RSGIPFWAIKNSWGTDWGEEGYYYLHRGSGACGV 450

Query:   337 ATEASYPV 344
              T AS  V
Sbjct:   451 NTMASSAV 458


>FB|FBgn0033874
            symbol:CG6347 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE013599 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 HSSP:P53634 EMBL:AY069609
            RefSeq:NP_610906.1 UniGene:Dm.608 SMR:Q7K0S6 MEROPS:C01.A29
            EnsemblMetazoa:FBtr0087637 GeneID:36531 KEGG:dme:Dmel_CG6347
            UCSC:CG6347-RA FlyBase:FBgn0033874 InParanoid:Q7K0S6 OMA:FEYIRDH
            OrthoDB:EOG4FQZ74 GenomeRNAi:36531 NextBio:799046 Uniprot:Q7K0S6
        Length = 352

 Score = 471 (170.9 bits), Expect = 9.1e-45, P = 9.1e-45
 Identities = 115/324 (35%), Positives = 167/324 (51%)

Query:    39 VEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLT 95
             V+  + ++ Q G+ Y DE E+  R +IF   +  I  +NK    G   ++LG N  +D+T
Sbjct:    35 VQNFDDFLRQTGKVYSDE-ERVYRESIFAAKMSLITLSNKNADNGVSGFRLGVNTLADMT 93

Query:    96 NEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVT--DVPTSIDWREKGAVTHIKNQG-HC 152
              +E  A+  G                    +N    ++P   DWREKG VT    QG  C
Sbjct:    94 RKEI-ATLLGSKISEFGERYTNGHINFVTARNPASANLPEMFDWREKGGVTPPGFQGVGC 152

Query:   153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKG 210
             G+CW+F+   A+EG      G L  LS+Q LVDC+ D  N GC GG  +  FEYI ++ G
Sbjct:   153 GACWSFATTGALEGHLFRRTGVLASLSQQNLVDCADDYGNMGCDGGFQEYGFEYIRDH-G 211

Query:   211 LATEADYPYQQEQGTCDKQKEKA------AAATIGKYEDLPKGDEHALLQAV-TKQPVSV 263
             +     YPY Q +  C +Q E A      +   I  Y  +  GDE  + + + T  P++ 
Sbjct:   212 VTLANKYPYTQTEMQC-RQNETAGRPPRESLVKIRDYATITPGDEEKMKEVIATLGPLAC 270

Query:   264 CVEASGQAFRFYKRGVL-NAECGDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGE 321
              + A   +F  Y  G+  + EC     +H V VVG+GT   E+G  YW+IKNS+ + WGE
Sbjct:   271 SMNADTISFEQYSGGIYEDEECNQGELNHSVTVVGYGT---ENGRDYWIIKNSYSQNWGE 327

Query:   322 SGYIRILRDEG-LCGIATEASYPV 344
              G++RILR+ G  CGIA+E SYP+
Sbjct:   328 GGFMRILRNAGGFCGIASECSYPI 351


>UNIPROTKB|F1RU48
            symbol:CTSF "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            OMA:LAPPEWD GeneTree:ENSGT00660000095458 EMBL:CU928034
            EMBL:FP565364 Ensembl:ENSSSCT00000014140 Ensembl:ENSSSCT00000014154
            Uniprot:F1RU48
        Length = 460

 Score = 471 (170.9 bits), Expect = 9.1e-45, P = 9.1e-45
 Identities = 111/307 (36%), Positives = 152/307 (49%)

Query:    43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
             ++++  + RTY  + E   R+++F  N+   +K       T + G  +FSDLT EEFR  
Sbjct:   164 KEFVTTYNRTYDTKEEARWRMSVFANNMVRAQKIQALDTGTARYGVTKFSDLTEEEFRTI 223

Query:   103 YTGYNXXXXXXXXXXXXXXTFKYQNVTDVPT-SIDWREKGAVTHIKNQGHCGSCWAFSAV 161
             Y   N                  ++V+ +P    DWR+KGAVT +K+QG CGSCWAFS  
Sbjct:   224 YL--NPLLQEEPGRKMRLA----KSVSSLPPPEWDWRKKGAVTKVKDQGMCGSCWAFSVT 277

Query:   162 AAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQ 221
               VEG   +  G L+ LSEQ+L+DC   + GC GGL   A+  I    GL TE DY Y+ 
Sbjct:   278 GNVEGQWFLKQGTLLSLSEQELLDCDKVDKGCMGGLPSNAYSAIKTLGGLETEEDYSYRG 337

Query:   222 EQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGV-- 279
                TC    EKA    I    +L + ++        K P+SV + A G   +FY+ G+  
Sbjct:   338 HLQTCSFNAEKAKVY-INDSVELSQNEQKLAAWLAEKGPISVAINAFGM--QFYRHGISH 394

Query:   280 -LNAECGD-NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIA 337
              L   C     DH V +VG+G         +W IKNSWG  WGE GY  + R  G CG+ 
Sbjct:   395 PLRPLCSPWLIDHAVLLVGYGN---RSATPFWAIKNSWGTDWGEEGYYYLYRGSGACGVN 451

Query:   338 TEASYPV 344
               AS  V
Sbjct:   452 IMASSAV 458


>RGD|1308181
            symbol:Ctsf "cathepsin F" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1308181 eggNOG:COG4870 HOGENOM:HOG000230774
            KO:K01373 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 MEROPS:C01.018
            CTD:8722 OMA:LAPPEWD OrthoDB:EOG4CC41T GeneTree:ENSGT00660000095458
            EMBL:CH473953 EMBL:BC099780 EMBL:EU253481 IPI:IPI00201100
            RefSeq:NP_001029282.1 UniGene:Rn.25087 SMR:Q499S6
            Ensembl:ENSRNOT00000026718 GeneID:361704 KEGG:rno:361704
            UCSC:RGD:1308181 InParanoid:Q499S6 NextBio:677325
            Genevestigator:Q499S6 Uniprot:Q499S6
        Length = 462

 Score = 470 (170.5 bits), Expect = 1.2e-44, P = 1.2e-44
 Identities = 111/307 (36%), Positives = 149/307 (48%)

Query:    43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
             + +M  + RTY+   E   RLT+F +N+   +K       T + G  +FSDLT EEF   
Sbjct:   166 KDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHTI 225

Query:   103 YTGYNXXXXXXXXXXXXXXTFKYQNVTDV-PTSIDWREKGAVTHIKNQGHCGSCWAFSAV 161
             Y   N                  +++ D+ P   DWR+KGAVT +K+QG CGSCWAFS  
Sbjct:   226 YL--NPLLQKESGGKMSLA----KSINDLAPPEWDWRKKGAVTEVKDQGMCGSCWAFSVT 279

Query:   162 AAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQ 221
               VEG   +  G L+ LSEQ+L+DC   +  C GGL   A+  I    GL TE DY YQ 
Sbjct:   280 GNVEGQWFLNRGTLLSLSEQELLDCDKMDKACMGGLPSNAYTAIKNLGGLETEDDYGYQG 339

Query:   222 EQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLN 281
                 C+   + A    I    +L + +         K P+SV + A G   +FY+ G+ +
Sbjct:   340 HVQACNFSTQMAKVY-INDSVELSRDENKIAAWLAQKGPISVAINAFGM--QFYRHGIAH 396

Query:   282 AE---CGD-NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIA 337
                  C     DH V +VG+G         YW IKNSWG  WGE GY  + R  G CG+ 
Sbjct:   397 PFRPLCSPWFIDHAVLLVGYGN---RSNIPYWAIKNSWGRDWGEEGYYYLYRGSGACGVN 453

Query:   338 TEASYPV 344
             T AS  V
Sbjct:   454 TMASSAV 460


>WB|WBGene00019986
            symbol:R09F10.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            EMBL:FO081137 HSSP:P53634 PIR:D89588 RefSeq:NP_509408.1
            ProteinModelPortal:Q23030 SMR:Q23030 STRING:Q23030 MEROPS:C01.A44
            PaxDb:Q23030 EnsemblMetazoa:R09F10.1 GeneID:181087
            KEGG:cel:CELE_R09F10.1 UCSC:R09F10.1 CTD:181087 WormBase:R09F10.1
            InParanoid:Q23030 OMA:EYPYSAL NextBio:912346 Uniprot:Q23030
        Length = 383

 Score = 467 (169.5 bits), Expect = 2.4e-44, P = 2.4e-44
 Identities = 115/314 (36%), Positives = 157/314 (50%)

Query:    41 KHEQ----WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTN 96
             KHEQ    ++ +  R Y    E   R  IF +N+   E A +E N    L  NEF+D T+
Sbjct:    77 KHEQMFNDFILKFDRKYTSVEEFEYRYQIFLRNVIEFE-AEEERNLGLDLDVNEFTDWTD 135

Query:    97 EEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCW 156
             EE +      N              ++    V   P SIDWRE+G +T IKNQG CGSCW
Sbjct:   136 EELQ-KMVQENKYTKYDFDTPKFEGSYLETGVIR-PASIDWREQGKLTPIKNQGQCGSCW 193

Query:   157 AFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEAD 216
             AF+ VA+VE    I  GKL+ LSEQ++VDC   NNGCSGG    A +++ EN GL +E +
Sbjct:   194 AFATVASVEAQNAIKKGKLVSLSEQEMVDCDGRNNGCSGGYRPYAMKFVKEN-GLESEKE 252

Query:   217 YPYQQ-EQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFY 275
             YPY   +   C   KE      I  +  L   +E       TK PV+  +    +A   Y
Sbjct:   253 YPYSALKHDQCFL-KENDTRVFIDDFRMLSNNEEDIANWVGTKGPVTFGMNVV-KAMYSY 310

Query:   276 KRGVLNAECGDNCD-----HGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
             + G+ N    D  +     H + ++G+G  E E    YW++KNSWG +WG SGY R+ R 
Sbjct:   311 RSGIFNPSVEDCTEKSMGAHALTIIGYG-GEGESA--YWIVKNSWGTSWGASGYFRLARG 367

Query:   331 EGLCGIATEASYPV 344
                CG+A     P+
Sbjct:   368 VNSCGLANTVVAPI 381


>DICTYBASE|DDB_G0274385
            symbol:DDB_G0274385 "Cysteine proteinase 1,
            mitochondrial" species:44689 "Dictyostelium discoideum" [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0274385 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 EMBL:AAFI02000012 RefSeq:XP_644301.1
            ProteinModelPortal:Q86KD4 EnsemblProtists:DDB0167535 GeneID:8619729
            KEGG:ddi:DDB_G0274385 InParanoid:Q86KD4 OMA:SICVDAS Uniprot:Q86KD4
        Length = 358

 Score = 466 (169.1 bits), Expect = 3.1e-44, P = 3.1e-44
 Identities = 110/322 (34%), Positives = 158/322 (49%)

Query:    35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
             + S+ +    W  +H + YKD +E   R + FK+N++   + N       K  +N FSDL
Sbjct:    37 DSSMRDTFNHWAKKHSKIYKDSIEMENRFSNFKENMKKNIELNSMHAGKAKFESNGFSDL 96

Query:    95 TNEEF-----RASYTGY-----NXXXXXXXXXXXXXXTFKYQNVTDVPT--SIDWREKGA 142
             + EEF       ++ G      N               +K     D+    SIDWR+KG 
Sbjct:    97 SEEEFSNFHLNKAFKGKPSHLRNSIKPQPTPHHSLINGYKEMENGDLNELYSIDWRKKGL 156

Query:   143 VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAF 202
             VT +K+QG CGSC+ FSAV  +E      G K I LSEQQ VDC   +  C GG     +
Sbjct:   157 VTPVKDQGQCGSCYIFSAVEQIETAWIKAGNKPILLSEQQAVDCDPYDGQCGGGDPYTVY 216

Query:   203 EYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PV 261
             EY  +  G++T A YPY    GTC       A   +  +     GDE+ L++ +    PV
Sbjct:   217 EYFSQVGGVSTNAQYPYTATDGTCVNMSR--AVPVVSYHYVTQGGDENTLIKTIVNDGPV 274

Query:   262 SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGT--AEEEDGAKYWLIKNSWGETW 319
             S+CV+AS   ++ Y  G++   CG N DH V VVG      +  +  +Y++I+NSWG  W
Sbjct:   275 SICVDAS--TWQSYSGGIITTGCGKNIDHCVQVVGLEVDKTDPSNPVQYYIIRNSWGTDW 332

Query:   320 GESGYIRILRDEGLCGIATEAS 341
             G  GYI +     LCGI  E++
Sbjct:   333 GIDGYIYVATGSDLCGITYEST 354


>DICTYBASE|DDB_G0282991
            symbol:DDB_G0282991 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0282991 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            EMBL:AAFI02000049 eggNOG:NOG331187 RefSeq:XP_639299.1
            ProteinModelPortal:Q54RQ2 EnsemblProtists:DDB0185304 GeneID:8623870
            KEGG:ddi:DDB_G0282991 InParanoid:Q54RQ2 OMA:PENGNEY Uniprot:Q54RQ2
        Length = 339

 Score = 465 (168.7 bits), Expect = 3.9e-44, P = 3.9e-44
 Identities = 110/309 (35%), Positives = 173/309 (55%)

Query:    44 QWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASY 103
             +W  ++ + Y ++ E  MR   FK+N EY+++ N++   T  L  N F+DL+  E+  +Y
Sbjct:    29 EWTNKYNKIYSNK-EFYMRFNNFKKNKEYVDQWNEKQLETI-LELNFFADLSRNEYINNY 86

Query:   104 -TGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHC-GSCWAFSAV 161
                +                 K  N  +   SIDWR   AVT +KNQG C G+ ++FSA+
Sbjct:    87 LASFIDISNIEQKNTKYEGNLK-NNFNNSIKSIDWRNFDAVTPVKNQGLCSGAGYSFSAI 145

Query:   162 AAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPY 219
               +E    I   +LI LSEQ ++DC+TD  NNGC GGL   AF+YII+ KG+ +E +YPY
Sbjct:   146 GVIESSHFIKNKELITLSEQNIIDCTTDMGNNGCMGGLALIAFDYIIKQKGIDSEFNYPY 205

Query:   220 Q-------QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
             +       + +G C +     + A+I  Y ++ + +E+ L Q++ K PVSV ++AS  +F
Sbjct:   206 EGYLIEPYEGRGRC-RYNSFYSKASISSYIEIERFNENELTQSLIKSPVSVMIDASQLSF 264

Query:   273 RFYKRGVL-NAECGDNC-DHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
               YK GV  +  C     +HG+  +GFG   E +G +Y+++KNS+G  WG  GYI + R+
Sbjct:   265 MLYKSGVYKDPSCSSTILNHGILNIGFGVTPE-NGNEYYILKNSFGSKWGMKGYIYLSRN 323

Query:   331 -EGLCGIAT 338
                 CGI++
Sbjct:   324 FNNHCGISS 332


>RGD|1564827
            symbol:RGD1564827 "similar to cathepsin M" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 IPI:IPI00192321
            Ensembl:ENSRNOT00000023990 ArrayExpress:D3ZY04 Uniprot:D3ZY04
        Length = 338

 Score = 433 (157.5 bits), Expect = 1.6e-43, Sum P(2) = 1.6e-43
 Identities = 88/201 (43%), Positives = 118/201 (58%)

Query:   149 QGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYII 206
             QG C SCWAF  V A+EG      GKL  LS Q LVDCS    N GC GG    AF+Y++
Sbjct:   139 QGRCNSCWAFPVVGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVL 198

Query:   207 ENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVE 266
             +N GL +EA YPY+ ++G C      +A  T       P+ +E  L+ AV  +PV+  + 
Sbjct:   199 QNGGLESEATYPYEGKEGLCRYNPNSSAKITXICAP--PQKNEDVLMDAVATKPVAAGIH 256

Query:   267 ASGQAFRFYKRGVLNA-ECGDNCDHGVAVVGFG-TAEEEDGAKYWLIKNSWGETWGESGY 324
                 + RFYK+G+ +  +C +  +H V VVG+G    E DG  YWLI+NSWGE WG +GY
Sbjct:   257 VVHSSLRFYKKGIYHEPKCNNYVNHAVLVVGYGFEGNETDGNNYWLIQNSWGERWGLNGY 316

Query:   325 IRILRDEGL-CGIATEASYPV 344
             ++I +D    CGIAT A YP+
Sbjct:   317 MKIAKDRNNHCGIATFAQYPI 337

 Score = 43 (20.2 bits), Expect = 1.6e-43, Sum P(2) = 1.6e-43
 Identities = 9/38 (23%), Positives = 22/38 (57%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKD-ELEKAMRL 63
             VVSG S  + S+  + ++W  ++ + Y    ++K +++
Sbjct:    14 VVSGASAFDLSLDVQWQEWKMKYEKLYSPVRIQKTVQM 51


>ZFIN|ZDB-GENE-050417-107
            symbol:zgc:110239 "zgc:110239" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-050417-107
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 MEROPS:I29.003 OrthoDB:EOG412M56 EMBL:BC092817
            IPI:IPI00503987 RefSeq:NP_001017633.1 UniGene:Dr.39081
            ProteinModelPortal:Q568K7 GeneID:550326 KEGG:dre:550326
            HOGENOM:HOG000007373 HOVERGEN:HBG105018 InParanoid:Q568K7
            NextBio:20879584 ArrayExpress:Q568K7 Uniprot:Q568K7
        Length = 546

 Score = 458 (166.3 bits), Expect = 2.2e-43, P = 2.2e-43
 Identities = 107/300 (35%), Positives = 153/300 (51%)

Query:    51 RTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNXXX 110
             R Y +E+E   R   F  N+ Y+   N+ G  ++ L  N  +D + +E  +   G     
Sbjct:   252 RQYDNEMEHEEREHNFVHNIRYVHSMNRAG-LSFSLSVNHLADRSQKEL-SMMRGCQRTH 309

Query:   111 XXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQI 170
                          +  ++   P S+DWR  GAVT +K+Q  CGSCW+F+    +EG   +
Sbjct:   310 KVHRKAQPFPSEIR--SIA-TPNSVDWRLYGAVTPVKDQAVCGSCWSFATTGTLEGALFL 366

Query:   171 TGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADY-PYQQEQGTCD 227
               G+L  LS+Q LVDC+    NNGC GG   +AFE+I+++ G++T   Y  Y    G C 
Sbjct:   367 KTGQLTSLSQQMLVDCTWGFGNNGCDGGEEWRAFEWIMKHGGISTAESYGAYMGMNGLCH 426

Query:   228 KQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGVL-NAEC- 284
               K    A   G Y ++  GD  AL  A+ K  PV+V ++A+ ++F FY  GV    EC 
Sbjct:   427 YDKSSMVAQLTG-YTNVTSGDILALKAAIFKFGPVAVSIDAAHRSFAFYSNGVYYEPECK 485

Query:   285 -GDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEASY 342
              G N  DH V  VG+G    E    YWL+KNSW   WG  GYI +   +  CG+AT+A Y
Sbjct:   486 NGINDLDHAVLAVGYGIMNNES---YWLVKNSWSSYWGNDGYILMSMKDNNCGVATDAIY 542


>UNIPROTKB|F1NHB8
            symbol:F1NHB8 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 EMBL:AADN02044011
            IPI:IPI00586027 Ensembl:ENSGALT00000021873 OMA:SELDHAV
            Uniprot:F1NHB8
        Length = 329

 Score = 453 (164.5 bits), Expect = 7.3e-43, P = 7.3e-43
 Identities = 107/303 (35%), Positives = 153/303 (50%)

Query:    50 GRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNXX 109
             G+ Y  E E   R   F  N+ ++   N+    +Y L  N  +D T +E  A+  G    
Sbjct:    34 GKRYSSEEEHEHRKRTFIHNMRFVHSKNRAA-LSYSLALNHLADRTPQEM-AALRGRRRS 91

Query:   110 XXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQ 169
                            Y ++  +P S+DWR  GAVT +K+Q  CGSCW+F+   A+EG   
Sbjct:    92 GDPKSGQPFSMQL--YASLV-LPESLDWRLYGAVTPVKDQAVCGSCWSFATTGAMEGALF 148

Query:   170 ITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADY-PYQQEQGTC 226
             +  G L  LS+Q L+DCS    N  C GG   +A+E+I ++ G+A+   Y PY  + G C
Sbjct:   149 LKTGVLTPLSQQVLIDCSWGFGNYACDGGEEWRAYEWIKKHGGIASTESYGPYLGQNGYC 208

Query:   227 DKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVSVCVEASGQAFRFYKRGVLNA-EC 284
                + +  A   G Y  +  G+  AL  A+ K  PV+V ++AS ++F FY  GV     C
Sbjct:   209 HYNQSELVAPLAG-YVTVESGNAEALKAALFKHGPVAVNIDASHKSFTFYANGVYEEPHC 267

Query:   285 GDNC---DHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEAS 341
             G+     DH V  VG+G      G  YWLIKNSW   WG  GYI +   +  CG+AT AS
Sbjct:   268 GNETSELDHAVLAVGYGVLH---GKSYWLIKNSWSTYWGNDGYILMAMKDNNCGVATAAS 324

Query:   342 YPV 344
             +P+
Sbjct:   325 FPI 327


>WB|WBGene00012747
            symbol:Y40H7A.10 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000230773 EMBL:AL033510
            HSSP:P80067 MEROPS:C01.A48 PIR:T26792 RefSeq:NP_502836.1
            ProteinModelPortal:Q9XWA4 SMR:Q9XWA4 STRING:Q9XWA4
            EnsemblMetazoa:Y40H7A.10 GeneID:189809 KEGG:cel:CELE_Y40H7A.10
            UCSC:Y40H7A.10 CTD:189809 WormBase:Y40H7A.10 eggNOG:NOG286423
            InParanoid:Q9XWA4 OMA:NGPMIVC NextBio:943702 Uniprot:Q9XWA4
        Length = 343

 Score = 448 (162.8 bits), Expect = 2.5e-42, P = 2.5e-42
 Identities = 108/322 (33%), Positives = 162/322 (50%)

Query:    26 QVVSGRSMHEPSI--VEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE--GN 81
             Q++    +  P +      + ++ ++ R Y +E E   R TIF +NL+ +E+ NKE  G 
Sbjct:    33 QILQRHHIPTPDVKYTNAFQNFLVKYLREYPNEYEIVKRFTIFSRNLDLVERYNKEDAGK 92

Query:    82 RTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKG 141
              TY+L  N+FSDLT EE++  Y                    K     ++P S+DWR   
Sbjct:    93 VTYEL--NDFSDLTEEEWK-KYLMTPKPDHSEKSLKPKTLIDK----KNLPNSVDWRNVN 145

Query:   142 AVTH---IKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLM 198
                H   IK QG CGSCWAF+  AA+E    I+GG L  LS QQL+DC+  ++ C GG  
Sbjct:   146 GTNHVTGIKYQGPCGSCWAFATAAAIESAVSISGGGLQSLSSQQLLDCTVVSDKCGGGEP 205

Query:   199 DKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK 258
              +A +Y  ++ G+ T  +YPY      C  ++     A I  +      DE A + A+  
Sbjct:   206 VEALKYA-QSHGITTAHNYPYYFWTTKC--RETVPTVARISSWMKAESEDEMAQIVALNG 262

Query:   259 QPVSVCVEASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGE 317
              P+ VC   +    RFY  G+  + +CG    H + V+G+G         YW++KN++ +
Sbjct:   263 -PMIVCANFATNKNRFYHSGIAEDPDCGTEPTHALIVIGYGP-------DYWILKNTYSK 314

Query:   318 TWGESGYIRILRDEGLCGIATE 339
              WGE GY+R+ RD   CGI TE
Sbjct:   315 VWGEKGYMRVKRDVNWCGINTE 336


>FB|FBgn0032228
            symbol:CG5367 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE014134 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 HSSP:P80067
            RefSeq:NP_609387.1 UniGene:Dm.26782 ProteinModelPortal:Q9VKY4
            SMR:Q9VKY4 MEROPS:C01.A30 EnsemblMetazoa:FBtr0080055 GeneID:34401
            KEGG:dme:Dmel_CG5367 UCSC:CG5367-RA FlyBase:FBgn0032228
            InParanoid:Q9VKY4 OMA:QIVDCSV OrthoDB:EOG4THT8X PhylomeDB:Q9VKY4
            GenomeRNAi:34401 NextBio:788324 ArrayExpress:Q9VKY4 Bgee:Q9VKY4
            Uniprot:Q9VKY4
        Length = 338

 Score = 442 (160.7 bits), Expect = 1.1e-41, P = 1.1e-41
 Identities = 99/310 (31%), Positives = 158/310 (50%)

Query:    43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEF 99
             E++   + R Y    ++      F++N + IE+ N   KEG  +++L  N F+D++ + +
Sbjct:    37 EKFKNNNNRKYLRTYDEMRSYKAFEENFKVIEEHNQNYKEGQTSFRLKPNIFADMSTDGY 96

Query:   100 RASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFS 159
                +                        + +VP S+DWR KG +T   NQ  CGSC+AFS
Sbjct:    97 LKGFLRLLKSNIEDSADNMAEIVGS-PLMANVPESLDWRSKGFITPPYNQLSCGSCYAFS 155

Query:   160 AVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADY 217
                ++ G      GK++ LS+QQ+VDCS    N GC GG +     Y+    G+  + DY
Sbjct:   156 IAESIMGQVFKRTGKILSLSKQQIVDCSVSHGNQGCVGGSLRNTLSYLQSTGGIMRDQDY 215

Query:   218 PYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAFRFYK 276
             PY   +G C    +  +   +  +  LP  DE A+  AVT   PV++ + AS + F+ Y 
Sbjct:   216 PYVARKGKCQFVPD-LSVVNVTSWAILPVRDEQAIQAAVTHIGPVAISINASPKTFQLYS 274

Query:   277 RGVLNAE-CGD-NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLC 334
              G+ +   C   + +H + V+GFG    +D   YW++KN WG+ WGE+GYIRI +   +C
Sbjct:   275 DGIYDDPLCSSASVNHAMVVIGFG----KD---YWILKNWWGQNWGENGYIRIRKGVNMC 327

Query:   335 GIATEASYPV 344
             GIA  A+Y +
Sbjct:   328 GIANYAAYAI 337


>UNIPROTKB|P56202
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 GO:GO:0006955 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AF013611
            EMBL:AF015954 EMBL:AF055903 EMBL:AP001201 EMBL:BC048255
            IPI:IPI00328978 RefSeq:NP_001326.2 UniGene:Hs.416848
            ProteinModelPortal:P56202 SMR:P56202 STRING:P56202 MEROPS:C01.037
            PhosphoSite:P56202 DMDM:259016196 PaxDb:P56202 PRIDE:P56202
            Ensembl:ENST00000307886 GeneID:1521 KEGG:hsa:1521 UCSC:uc001ogc.1
            CTD:1521 GeneCards:GC11P065647 HGNC:HGNC:2546 HPA:CAB016345
            MIM:602364 neXtProt:NX_P56202 PharmGKB:PA27042 eggNOG:NOG288820
            HOVERGEN:HBG100117 InParanoid:P56202 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 PhylomeDB:P56202 GenomeRNAi:1521 NextBio:6295
            ArrayExpress:P56202 Bgee:P56202 CleanEx:HS_CTSW
            Genevestigator:P56202 GermOnline:ENSG00000172543 Uniprot:P56202
        Length = 376

 Score = 343 (125.8 bits), Expect = 2.0e-41, Sum P(2) = 2.0e-41
 Identities = 88/274 (32%), Positives = 139/274 (50%)

Query:    40 EKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEF 99
             E  + +  Q  R+Y    E A RL IF  NL   ++  +E   T + G   FSDLT EEF
Sbjct:    40 EAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEF 99

Query:   100 RASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWRE-KGAVTHIKNQGHCGSCWAF 158
                Y GY               + + +    VP S DWR+   A++ IK+Q +C  CWA 
Sbjct:   100 GQLY-GYRRAAGGVPSMGREIRSEEPEE--SVPFSCDWRKVASAISPIKDQKNCNCCWAM 156

Query:   159 SAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYP 218
             +A   +E + +I+    +++S Q+L+DC    +GC GG +  AF  ++ N GLA+E DYP
Sbjct:   157 AAAGNIETLWRISFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNNSGLASEKDYP 216

Query:   219 YQQEQGT--CDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFY 275
             +Q +     C  +K +  A  I  +  L + +EH + Q + T  P++V +    +  + Y
Sbjct:   217 FQGKVRAHRCHPKKYQKVA-WIQDFIML-QNNEHRIAQYLATYGPITVTINM--KPLQLY 272

Query:   276 KRGVLNAE---CGDNC-DHGVAVVGFGTAEEEDG 305
             ++GV+ A    C     DH V +VGFG+ + E+G
Sbjct:   273 RKGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEG 306

 Score = 113 (44.8 bits), Expect = 2.0e-41, Sum P(2) = 2.0e-41
 Identities = 17/29 (58%), Positives = 20/29 (68%)

Query:   308 YWLIKNSWGETWGESGYIRILRDEGLCGI 336
             YW++KNSWG  WGE GY R+ R    CGI
Sbjct:   326 YWILKNSWGAQWGEKGYFRLHRGSNTCGI 354


>UNIPROTKB|F1RU23
            symbol:CTSW "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 KO:K08569 EMBL:CU928325
            RefSeq:XP_003122571.1 UniGene:Ssc.28940 Ensembl:ENSSSCT00000014177
            GeneID:100525853 KEGG:ssc:100525853 OMA:CWAMAAV Uniprot:F1RU23
        Length = 367

 Score = 436 (158.5 bits), Expect = 4.6e-41, P = 4.6e-41
 Identities = 110/313 (35%), Positives = 157/313 (50%)

Query:    48 QHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYN 107
             Q+ R+Y +  E A RL IF QNL   ++  +E   T + G   FSDLT EEF   + G++
Sbjct:    48 QYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDLTEEEFGQLH-GHH 106

Query:   108 XXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREK-GAVTHIKNQGHCGSCWAFSAVAAVEG 166
                           +   ++   VP S DWR+K G ++ IK+Q  C  CWA +AV  VE 
Sbjct:   107 WGAGKAPSMGIKVGS--EESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDNVEA 164

Query:   167 ITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGT- 225
                I   + ++LS QQ++DC    NGC+GG +  AF  ++   GLA+E DYPY+    T 
Sbjct:   165 QWAIKYHQAVQLSVQQVLDCDRCGNGCNGGFVWDAFLTVLNTSGLASEQDYPYKGTVKTH 224

Query:   226 -CDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAEC 284
              C   K+    A I  +  L   ++       T+ P++V + A     + YKRGV+ A  
Sbjct:   225 RC-LAKQHRKVAWIQDFLMLQFCEQSIARYLATEGPITVTINAG--LLQQYKRGVIRATP 281

Query:   285 GDNCD-----HGVAVVGFGTAEEEDGAK--------YWLIKNSWGETWGESGYIRILRDE 331
                CD     H V +VGFG ++  +G +        YW++KNSWG  WGE GY R+ R  
Sbjct:   282 A-TCDPHLVNHSVLLVGFGKSKSVEGRRPRPGHSIPYWILKNSWGPDWGEEGYFRLHRGS 340

Query:   332 GLCGIATEASYPV 344
               CGI     YPV
Sbjct:   341 NTCGIT---KYPV 350


>DICTYBASE|DDB_G0281079
            symbol:DDB_G0281079 species:44689 "Dictyostelium
            discoideum" [GO:0030246 "carbohydrate binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR008965 InterPro:IPR013128 InterPro:IPR019028
            Pfam:PF00112 Pfam:PF09478 PRINTS:PR00705 SMART:SM00645
            SMART:SM01063 InterPro:IPR000169 dictyBase:DDB_G0281079
            GO:GO:0030246 EMBL:AAFI02000040 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 SUPFAM:SSF49384 RefSeq:XP_640804.1
            ProteinModelPortal:Q54UH2 EnsemblProtists:DDB0204000 GeneID:8622858
            KEGG:ddi:DDB_G0281079 InParanoid:Q54UH2 OMA:ALESHYY
            ProtClustDB:CLSZ2430562 Uniprot:Q54UH2
        Length = 664

 Score = 356 (130.4 bits), Expect = 6.4e-41, Sum P(2) = 6.4e-41
 Identities = 79/191 (41%), Positives = 108/191 (56%)

Query:   132 PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN 191
             P SIDWR  G V+ +KNQG CGSC+AFS V A+E        ++++LSEQ LVDC+  N 
Sbjct:   471 PISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALESHYYRKNNRMLDLSEQNLVDCTASNK 530

Query:   192 ----GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
                 GCSGG M   + YI EN G+  E+ YPY+ + G C +     A + I K+  + + 
Sbjct:   531 YRNGGCSGGWMHNCYSYIQENGGINQESTYPYEGKFGQC-RYNSGDAQSRISKFVMIKQH 589

Query:   248 DEHALLQAVTKQ-PVSVCVEASGQAFRFYKRGVLNAECGDNCD-----HGVAVVGFGTAE 301
             DE  L   V    PVSV  +AS + F +Y RG+  +   DNC+     H V VVG+   +
Sbjct:   590 DEEDLADTVASVGPVSVAYDASTREFMYYSRGIYYS---DNCNKYRTTHAVVVVGY---D 643

Query:   302 EEDGAKYWLIK 312
              E+G  YW+IK
Sbjct:   644 NENGVDYWIIK 654

 Score = 109 (43.4 bits), Expect = 6.4e-41, Sum P(2) = 6.4e-41
 Identities = 23/62 (37%), Positives = 37/62 (59%)

Query:    44 QWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLTNEEFRAS 102
             QW  Q  RTY+ + +  ++   FK +  +IE+  +E  N T +LG  +FSD+T++EF   
Sbjct:   163 QWSNQFNRTYRAD-QFLLKYEAFKDSSRFIEQYKRENQNSTMELGLTQFSDMTHDEFLNV 221

Query:   103 YT 104
             YT
Sbjct:   222 YT 223


>GENEDB_PFALCIPARUM|PF14_0553
            symbol:PF14_0553 "cysteine proteinase
            falcipain-1" species:5833 "Plasmodium falciparum" [GO:0042540
            "hemoglobin catabolic process" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014187 KO:K01376 HSSP:P07688 GO:GO:0042540
            RefSeq:XP_001348727.1 ProteinModelPortal:Q8I6V0 PRIDE:Q8I6V0
            EnsemblProtists:PF14_0553:mRNA GeneID:812135 KEGG:pfa:PF14_0553
            EuPathDB:PlasmoDB:PF3D7_1458000 HOGENOM:HOG000065906
            ProtClustDB:CLSZ2457715 ChEMBL:CHEMBL1250371 Uniprot:Q8I6V0
        Length = 569

 Score = 271 (100.5 bits), Expect = 7.7e-41, Sum P(3) = 7.7e-41
 Identities = 63/176 (35%), Positives = 98/176 (55%)

Query:   131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDN 190
             VP  +D+REKG V   K+QG CGSCWAF++V  +E +       ++  SEQ++VDCS DN
Sbjct:   333 VPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDN 392

Query:   191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGT-C--DKQKEKAAAATIGKYEDLPKG 247
              GC GG    +F Y+++N+ L    +Y Y+ +    C   + K K + ++IG  +     
Sbjct:   393 FGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVK----- 446

Query:   248 DEHALLQAVTKQ-PVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEE 302
              E+ L+ A+ +  P+SV V  +   F  Y  GV N  C +  +H V +VG+G  E+
Sbjct:   447 -ENQLILALNEVGPLSVNVGVNND-FVAYSEGVYNGTCSEELNHSVLLVGYGQVEK 500

 Score = 119 (46.9 bits), Expect = 7.7e-41, Sum P(3) = 7.7e-41
 Identities = 20/41 (48%), Positives = 27/41 (65%)

Query:   308 YWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPV 344
             YW+IKNSW + WGE+G++R+ R    D   CGI  E  YP+
Sbjct:   528 YWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPI 568

 Score = 93 (37.8 bits), Expect = 7.7e-41, Sum P(3) = 7.7e-41
 Identities = 22/61 (36%), Positives = 33/61 (54%)

Query:    41 KHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK-EGNRTYKLGTNEFSDLTNEEF 99
             K  ++M +H + YK+  E+  +  IFK N   I+  NK   N  YK   N+FSD + EE 
Sbjct:   224 KFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEEL 283

Query:   100 R 100
             +
Sbjct:   284 K 284

 Score = 41 (19.5 bits), Expect = 2.0e-35, Sum P(3) = 2.0e-35
 Identities = 12/47 (25%), Positives = 26/47 (55%)

Query:    54 KDELEKAMRLTI--FKQNLEYI--EKANKEGNRTYKLGTNEFSDLTN 96
             K+E+E  +R+ +  +K+  + I  E +N+E    Y L +  +++  N
Sbjct:    76 KEEIE-LLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNN 121


>UNIPROTKB|Q8I6V0
            symbol:PF14_0553 "Cysteine proteinase falcipain-1"
            species:36329 "Plasmodium falciparum 3D7" [GO:0042540 "hemoglobin
            catabolic process" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014187 KO:K01376 HSSP:P07688 GO:GO:0042540
            RefSeq:XP_001348727.1 ProteinModelPortal:Q8I6V0 PRIDE:Q8I6V0
            EnsemblProtists:PF14_0553:mRNA GeneID:812135 KEGG:pfa:PF14_0553
            EuPathDB:PlasmoDB:PF3D7_1458000 HOGENOM:HOG000065906
            ProtClustDB:CLSZ2457715 ChEMBL:CHEMBL1250371 Uniprot:Q8I6V0
        Length = 569

 Score = 271 (100.5 bits), Expect = 7.7e-41, Sum P(3) = 7.7e-41
 Identities = 63/176 (35%), Positives = 98/176 (55%)

Query:   131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDN 190
             VP  +D+REKG V   K+QG CGSCWAF++V  +E +       ++  SEQ++VDCS DN
Sbjct:   333 VPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDN 392

Query:   191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGT-C--DKQKEKAAAATIGKYEDLPKG 247
              GC GG    +F Y+++N+ L    +Y Y+ +    C   + K K + ++IG  +     
Sbjct:   393 FGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVK----- 446

Query:   248 DEHALLQAVTKQ-PVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEE 302
              E+ L+ A+ +  P+SV V  +   F  Y  GV N  C +  +H V +VG+G  E+
Sbjct:   447 -ENQLILALNEVGPLSVNVGVNND-FVAYSEGVYNGTCSEELNHSVLLVGYGQVEK 500

 Score = 119 (46.9 bits), Expect = 7.7e-41, Sum P(3) = 7.7e-41
 Identities = 20/41 (48%), Positives = 27/41 (65%)

Query:   308 YWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPV 344
             YW+IKNSW + WGE+G++R+ R    D   CGI  E  YP+
Sbjct:   528 YWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPI 568

 Score = 93 (37.8 bits), Expect = 7.7e-41, Sum P(3) = 7.7e-41
 Identities = 22/61 (36%), Positives = 33/61 (54%)

Query:    41 KHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK-EGNRTYKLGTNEFSDLTNEEF 99
             K  ++M +H + YK+  E+  +  IFK N   I+  NK   N  YK   N+FSD + EE 
Sbjct:   224 KFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEEL 283

Query:   100 R 100
             +
Sbjct:   284 K 284

 Score = 41 (19.5 bits), Expect = 2.0e-35, Sum P(3) = 2.0e-35
 Identities = 12/47 (25%), Positives = 26/47 (55%)

Query:    54 KDELEKAMRLTI--FKQNLEYI--EKANKEGNRTYKLGTNEFSDLTN 96
             K+E+E  +R+ +  +K+  + I  E +N+E    Y L +  +++  N
Sbjct:    76 KEEIE-LLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNN 121


>DICTYBASE|DDB_G0281077
            symbol:DDB_G0281077 species:44689 "Dictyostelium
            discoideum" [GO:0030246 "carbohydrate binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR008965 InterPro:IPR013128 InterPro:IPR019028
            Pfam:PF00112 Pfam:PF09478 PRINTS:PR00705 SMART:SM00645
            SMART:SM01063 InterPro:IPR000169 dictyBase:DDB_G0281077
            GO:GO:0030246 EMBL:AAFI02000040 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 SUPFAM:SSF49384 ProtClustDB:CLSZ2430562
            RefSeq:XP_640803.1 ProteinModelPortal:Q54UH3
            EnsemblProtists:DDB0203998 GeneID:8622857 KEGG:ddi:DDB_G0281077
            InParanoid:Q54UH3 OMA:LINDFNF Uniprot:Q54UH3
        Length = 662

 Score = 355 (130.0 bits), Expect = 8.1e-41, Sum P(2) = 8.1e-41
 Identities = 80/189 (42%), Positives = 107/189 (56%)

Query:   132 PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-N 190
             P SIDWR  G V+ +KNQG CGSC+AFS V A+E        +++ LSEQ LVDC+ +  
Sbjct:   472 PISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALEAHYYRKNNRMLNLSEQNLVDCTRNYG 531

Query:   191 NG-CSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDE 249
             NG CSGG M   F YI EN G+  ++ YPY+   G C +     A + I  Y  + + DE
Sbjct:   532 NGECSGGWMHNCFRYIKENGGINLQSTYPYEGRVGLC-RYNSGDAQSRISNYVMIKQHDE 590

Query:   250 HALLQAVTKQ-PVSVCVEASGQAFRFYKRGVLNAECGDNCD-----HGVAVVGFGTAEEE 303
               L  AV    PVSV  +AS + F +Y  G+ N+   D+CD     H V VVG+G    E
Sbjct:   591 EDLANAVASVGPVSVAYDASTREFMYYSSGIYNS---DSCDKYRTTHAVVVVGYGI---E 644

Query:   304 DGAKYWLIK 312
             +G  +W+IK
Sbjct:   645 NGVDFWIIK 653

 Score = 109 (43.4 bits), Expect = 8.1e-41, Sum P(2) = 8.1e-41
 Identities = 23/62 (37%), Positives = 37/62 (59%)

Query:    44 QWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLTNEEFRAS 102
             QW  Q  RTY+ + +  ++   FK +  +IE+  +E  N T +LG  +FSD+T++EF   
Sbjct:   164 QWSNQFNRTYRAD-QFLLKYEAFKDSSRFIEQYKRENQNSTMELGLTQFSDMTHDEFLNI 222

Query:   103 YT 104
             YT
Sbjct:   223 YT 224


>UNIPROTKB|E2RPX3
            symbol:CTSW "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 CTD:1521 KO:K08569 OMA:GRCGDGC
            EMBL:AAEX03011632 RefSeq:XP_540846.2 Ensembl:ENSCAFT00000020910
            GeneID:483725 KEGG:cfa:483725 Uniprot:E2RPX3
        Length = 374

 Score = 323 (118.8 bits), Expect = 1.1e-40, Sum P(2) = 1.1e-40
 Identities = 86/266 (32%), Positives = 131/266 (49%)

Query:    48 QHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYN 107
             Q+ R+Y +  E A RL IF  NL   ++   E   T + G   FSDLT EEF   Y G+ 
Sbjct:    48 QYNRSYSNPEEYARRLDIFAHNLAQAQQLEDEDLGTAEFGVTPFSDLTEEEFGQFY-GHQ 106

Query:   108 XXXXXXXXXXXXXXTFKYQNVTDVPTSIDWRE-KGAVTHIKNQGHCGSCWAFSAVAAVEG 166
                           + ++     VP + DWR+  G ++ IK QG+C  CWA +A   +E 
Sbjct:   107 RMAGEAPSVGRKVESEEWGE--PVPPTCDWRKLPGIISPIKQQGNCRCCWAMAAAGNIEA 164

Query:   167 ITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPY--QQEQG 224
             +  I   + +E+S Q+L+DC    +GC GG    AF  ++ N GLA+  DYP+    +  
Sbjct:   165 LWGIRYHQPVEVSVQELLDCGRCGDGCKGGFTWDAFITVLNNSGLASAKDYPFLGNTKPH 224

Query:   225 TCDKQKEKAAAATIGKYEDLPKGDEHALL-QAVTKQPVSVCVEASGQAFRFYKRGVLNAE 283
              C  +K K  A  I  +  L +G+E A+     TK P++V +    +  + Y++GV+ A 
Sbjct:   225 RCLAKKYKKVA-WIQDFIML-QGNEQAIAWYLATKGPITVTINM--KLLQHYQKGVIQAT 280

Query:   284 ---CG-DNCDHGVAVVGFGTAEEEDG 305
                C     DH V +VGFG ++   G
Sbjct:   281 HTTCDPQRVDHSVLLVGFGKSKSVAG 306

 Score = 126 (49.4 bits), Expect = 1.1e-40, Sum P(2) = 1.1e-40
 Identities = 29/71 (40%), Positives = 36/71 (50%)

Query:   289 DHGVAVVGFGTAEE------EDGAK---------YWLIKNSWGETWGESGYIRILRDEGL 333
             DH V +VGFG ++       E G+          YW++KNSWG  WGE GY R+ R    
Sbjct:   290 DHSVLLVGFGKSKSVAGKQAEGGSSRPRPHHPIPYWILKNSWGAEWGEEGYFRLHRGNNT 349

Query:   334 CGIATEASYPV 344
             CGI     YPV
Sbjct:   350 CGIT---KYPV 357


>FB|FBgn0037396
            symbol:CG11459 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE014297 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 KO:K01365 HSSP:P07711 EMBL:AY060710
            RefSeq:NP_649608.1 UniGene:Dm.3894 SMR:Q9VNK6 MEROPS:C01.A31
            EnsemblMetazoa:FBtr0078623 GeneID:40741 KEGG:dme:Dmel_CG11459
            UCSC:CG11459-RA FlyBase:FBgn0037396 InParanoid:Q9VNK6 OMA:NYDEREL
            OrthoDB:EOG4MGQPX ChiTaRS:CG11459 GenomeRNAi:40741 NextBio:820359
            Uniprot:Q9VNK6
        Length = 336

 Score = 432 (157.1 bits), Expect = 1.2e-40, P = 1.2e-40
 Identities = 104/312 (33%), Positives = 162/312 (51%)

Query:    43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK---EGNRTYKLGTNEFSDLTNEEF 99
             +Q+ A++ + Y++  +K  R  +++Q +  +E  N+   +G   +K+G N+FSD T++  
Sbjct:    31 DQYKAKYNKQYRNR-DKYHR-ALYEQRVLAVESHNQLYLQGKVAFKMGLNKFSD-TDQRI 87

Query:   100 RASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQG-HCGSCWAF 158
               +Y   +              T  Y+    +   IDWR+ G ++ + +QG  C SCWAF
Sbjct:    88 LFNYRS-SIPAPLETSTNALTETVNYKRYDQITEGIDWRQYGYISPVGDQGTECLSCWAF 146

Query:   159 SAVAAVEGITQITGGKLIELSEQQLVDCST-DNNGCSGGLMDKAFEYIIENKGLATEADY 217
             S    +E       G L+ LS + LVDC    NNGCSGG +  AF Y  ++ G+AT+  Y
Sbjct:   147 STSGVLEAHMAKKYGNLVPLSPKHLVDCVPYPNNGCSGGWVSVAFNYTRDH-GIATKESY 205

Query:   218 PYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAFRFYK 276
             PY+   G C   K   +A T+  Y  L   DE  L + V    PV+V ++   + F  Y 
Sbjct:   206 PYEPVSGEC-LWKSDRSAGTLSGYVTLGNYDERELAEVVYNIGPVAVSIDHLHEEFDQYS 264

Query:   277 RGVLNAE-CGD---NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-E 331
              GVL+   C     +  H V +VGFGT  +     YW+IKNS+G  WGESGY+++ R+  
Sbjct:   265 GGVLSIPACRSKRQDLTHSVLLVGFGTHRK--WGDYWIIKNSYGTDWGESGYLKLARNAN 322

Query:   332 GLCGIATEASYP 343
              +CG+A+   YP
Sbjct:   323 NMCGVASLPQYP 334


>UNIPROTKB|F1NT07
            symbol:LOC100857883 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 EMBL:AADN02044012
            EMBL:AADN02044013 EMBL:AADN02044014 IPI:IPI00577314
            Ensembl:ENSGALT00000000192 OMA:IYKHGPV Uniprot:F1NT07
        Length = 317

 Score = 432 (157.1 bits), Expect = 1.2e-40, P = 1.2e-40
 Identities = 107/323 (33%), Positives = 159/323 (49%)

Query:    34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
             H P        +  + GR Y    E   R  IF  ++ ++   N+    +Y L  N  +D
Sbjct:     4 HRPWAHAAFHHYRRRLGRPYGSAREMEHRQRIFAHHMRFVHSKNRAA-LSYSLALNHLAD 62

Query:    94 LTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDV--PTSIDWREKGAVTHIKNQGH 151
              T +E  A+  G                 F  ++ T +  P S+DWR  GAVT +K+Q  
Sbjct:    63 RTPQEM-AALRGRRRSGDPNHGL-----PFPAEHYTGIILPESLDWRMYGAVTPVKDQAV 116

Query:   152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS--TDNNGCSGGLMDKAFEYIIENK 209
             CGSCW+F+   A+EG   +  G L  LS+Q L+DCS    N  C GG   +A  +I ++ 
Sbjct:   117 CGSCWSFATTGAMEGALFLKTGVLTPLSQQVLIDCSWGKGNYACDGGEEWRAKGWIKKHG 176

Query:   210 GLA-TEA--DYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVSVCV 265
             G+A TE+   +P   + G C   + +  A   G Y ++  G+  A+  A+ K  PV+V +
Sbjct:   177 GIASTESPPSFPLVLQNGLCHYNQSEMLAKITG-YVNVTSGNITAVKTAIYKHGPVAVSI 235

Query:   266 EASGQAFRFYKRGVL-NAECGDN---CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGE 321
             +AS + F FY  G+    +C +     DH V  VG+G  +   G  YWLIKNSW   WG 
Sbjct:   236 DASHKTFSFYSNGIYYEPKCANKPGQLDHAVLAVGYGVLQ---GETYWLIKNSWSTYWGN 292

Query:   322 SGYIRILRDEGLCGIATEASYPV 344
              GYI +   +  CG+ATEA+YP+
Sbjct:   293 DGYILMAMKDNNCGVATEATYPI 315


>RGD|1309354
            symbol:Ctsw "cathepsin W" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1309354 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 MEROPS:C01.037
            CTD:1521 eggNOG:NOG288820 HOVERGEN:HBG100117 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 EMBL:CH473953 EMBL:BC093401 IPI:IPI00371471
            RefSeq:NP_001019413.1 UniGene:Rn.34406 Ensembl:ENSRNOT00000037404
            GeneID:293676 KEGG:rno:293676 UCSC:RGD:1309354 InParanoid:Q561Q9
            NextBio:636716 Genevestigator:Q561Q9 Uniprot:Q561Q9
        Length = 371

 Score = 325 (119.5 bits), Expect = 1.4e-40, Sum P(2) = 1.4e-40
 Identities = 83/274 (30%), Positives = 135/274 (49%)

Query:    40 EKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEF 99
             E  + +  Q  R+Y +  E   RL IF  NL   ++  +E   T + G   FSDLT EEF
Sbjct:    38 EVFKLFQIQFNRSYSNPAEYTRRLGIFAHNLAQAQRLQEEDLGTAEFGQTPFSDLTEEEF 97

Query:   100 RASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWRE-KGAVTHIKNQGHCGSCWAF 158
                Y G+               + ++     VP + DWR+ K  ++ IKNQG+C  CWA 
Sbjct:    98 GQLY-GHQRAPERILNMAKKVKSERWGE--SVPPTCDWRKVKNIISSIKNQGNCRCCWAI 154

Query:   159 SAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYP 218
             +A   ++ + +I   + +++S Q+L+DC    NGC+GG +  A+  ++ N GLA+E DYP
Sbjct:   155 AAADNIQTLWRIKTQQFVDVSVQELLDCDRCGNGCNGGFVWDAYITVLNNSGLASEEDYP 214

Query:   219 YQ--QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
             +Q  Q+   C   K +  A  I  +  L   ++          P++V +    +  ++Y+
Sbjct:   215 FQGHQKPHRCLADKYRKVA-WIQDFTMLSSNEQVIAGYLAIHGPITVTINM--KLLQYYQ 271

Query:   277 RGVLNAECGDNCD-----HGVAVVGFGTAEEEDG 305
             +GV+ A     CD     H V +VGFG  +E+ G
Sbjct:   272 KGVIKAT-PSTCDPHLVNHSVLLVGFG--KEKGG 302

 Score = 123 (48.4 bits), Expect = 1.4e-40, Sum P(2) = 1.4e-40
 Identities = 20/37 (54%), Positives = 24/37 (64%)

Query:   308 YWLIKNSWGETWGESGYIRILRDEGLCGIATEASYPV 344
             YW++KNSWG  WGE GY R+ R    CGIA    YP+
Sbjct:   321 YWILKNSWGAEWGEKGYFRLYRGNNTCGIA---KYPI 354


>MGI|MGI:1338045
            symbol:Ctsw "cathepsin W" species:10090 "Mus musculus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 MGI:MGI:1338045 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 MEROPS:C01.037
            CTD:1521 eggNOG:NOG288820 HOVERGEN:HBG100117 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 EMBL:AF014941 EMBL:AC122861 IPI:IPI00111727
            RefSeq:NP_034115.2 UniGene:Mm.113590 ProteinModelPortal:P56203
            SMR:P56203 PhosphoSite:P56203 PRIDE:P56203 DNASU:13041
            Ensembl:ENSMUST00000025844 GeneID:13041 KEGG:mmu:13041
            InParanoid:P56203 NextBio:282936 Bgee:P56203 CleanEx:MM_CTSW
            Genevestigator:P56203 GermOnline:ENSMUSG00000024910 Uniprot:P56203
        Length = 371

 Score = 320 (117.7 bits), Expect = 4.6e-40, Sum P(2) = 4.6e-40
 Identities = 80/270 (29%), Positives = 131/270 (48%)

Query:    40 EKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEF 99
             E  + +  +  R+Y +  E   RL+IF  NL   ++  +E   T + G   FSDLT EEF
Sbjct:    38 EVFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLGTAEFGETPFSDLTEEEF 97

Query:   100 RASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWRE-KGAVTHIKNQGHCGSCWAF 158
                Y G                +  +     VP + DWR+ K  ++ +KNQG C  CWA 
Sbjct:    98 GQLY-GQERSPERTPNMTKKVESNTWGE--SVPRTCDWRKAKNIISSVKNQGSCKCCWAM 154

Query:   159 SAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYP 218
             +A   ++ + +I   + +++S Q+L+DC    NGC+GG +  A+  ++ N GLA+E DYP
Sbjct:   155 AAADNIQALWRIKHQQFVDVSVQELLDCERCGNGCNGGFVWDAYLTVLNNSGLASEKDYP 214

Query:   219 YQQEQGT--CDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
             +Q ++    C  +K K  A  I  +  L   ++          P++V +    +  + Y+
Sbjct:   215 FQGDRKPHRCLAKKYKKVA-WIQDFTMLSNNEQAIAHYLAVHGPITVTINM--KLLQHYQ 271

Query:   277 RGVLNA---ECGDN-CDHGVAVVGFGTAEE 302
             +GV+ A    C     DH V +VGFG  +E
Sbjct:   272 KGVIKATPSSCDPRQVDHSVLLVGFGKEKE 301

 Score = 123 (48.4 bits), Expect = 4.6e-40, Sum P(2) = 4.6e-40
 Identities = 27/69 (39%), Positives = 35/69 (50%)

Query:   289 DHGVAVVGFGTAEE--EDG------------AKYWLIKNSWGETWGESGYIRILRDEGLC 334
             DH V +VGFG  +E  + G            + YW++KNSWG  WGE GY R+ R    C
Sbjct:   288 DHSVLLVGFGKEKEGMQTGTVLSHSRKRRHSSPYWILKNSWGAHWGEKGYFRLYRGNNTC 347

Query:   335 GIATEASYP 343
             G+     YP
Sbjct:   348 GVT---KYP 353


>UNIPROTKB|F1MHV4
            symbol:CTSW "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 OMA:GRCGDGC EMBL:DAAA02063574
            IPI:IPI00716321 Ensembl:ENSBTAT00000027681 Uniprot:F1MHV4
        Length = 375

 Score = 316 (116.3 bits), Expect = 5.1e-39, Sum P(2) = 5.1e-39
 Identities = 81/265 (30%), Positives = 126/265 (47%)

Query:    48 QHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYN 107
             Q+ R+Y +  E A RL IF QNL   ++  +E   T + G  +FSDLT EEF   Y    
Sbjct:    48 QYNRSYPNPAEYARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDLTEEEFVQLYGSQV 107

Query:   108 XXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGI 167
                            +        P + DWR+ G ++ +++Q +C  CWA +A   +E +
Sbjct:   108 AGEALGVSRKVGSEEWGESE----PQTCDWRKVGTISPVRDQRNCNCCWAMAAAGNIEAL 163

Query:   168 TQITGGKLIELSEQ-QLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGT- 225
               I     +E+S Q +L+DC    NGC GG +  AF  ++ N GLA+E DYP+     T 
Sbjct:   164 WAIKFRHFVEVSVQPELLDCDRCGNGCRGGFVWDAFLTVLNNSGLASEKDYPFNGSGKTH 223

Query:   226 -CDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAE- 283
              C  +K K  A  I  +  L   ++       T+ P++V +  +    + Y++GV+ A  
Sbjct:   224 RCLAKKYKKVA-WIQDFIILQACEQSMARHLATEGPITVTINMT--LLQQYQKGVIKATP 280

Query:   284 --CGDN-CDHGVAVVGFGTAEEEDG 305
               C     DH V +VGFG  +  +G
Sbjct:   281 TTCDPTQVDHSVLLVGFGKTKLVEG 305

 Score = 117 (46.2 bits), Expect = 5.1e-39, Sum P(2) = 5.1e-39
 Identities = 22/49 (44%), Positives = 27/49 (55%)

Query:   297 FGT-AEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEASYPV 344
             FG+ A       YW++KNSWG  WGE GY R+ R    CGI     +PV
Sbjct:   313 FGSHARPRRSMAYWILKNSWGPQWGEEGYFRLHRGSNTCGIT---KFPV 358


>ZFIN|ZDB-GENE-080724-8
            symbol:ctso "cathepsin O" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            ZFIN:ZDB-GENE-080724-8 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 CTD:1519 KO:K01374 EMBL:CR931784
            IPI:IPI00513613 RefSeq:XP_695717.3 UniGene:Dr.88386
            Ensembl:ENSDART00000074786 GeneID:567333 KEGG:dre:567333
            NextBio:20888622 Uniprot:E7FA09
        Length = 334

 Score = 415 (151.1 bits), Expect = 7.8e-39, P = 7.8e-39
 Identities = 90/299 (30%), Positives = 151/299 (50%)

Query:    48 QHGRTYKDELEKAM--RLTIFKQNLE---YIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
             QH  T++ ++   +  R   ++ +L+   ++  A  + N++ + G N+FS L+ ++F+  
Sbjct:    37 QHSDTFQQDVNNELYQRWINYQSSLQRQAFLNSALGKSNQSAQYGVNQFSYLSQKQFKEQ 96

Query:   103 YTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVA 162
             Y                    K  N    P   DWR+ G V  + NQG CG CWAFS V 
Sbjct:    97 YLTARAEAAPKFDQSKSEIKVKANN----PPRFDWRDHGVVGPVHNQGSCGGCWAFSIVE 152

Query:   163 AVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENK-GLATEADYPYQQ 221
             A+E ++   G KL +LS QQ++DCS  N GC+GG   +A  ++ ++K  L +EA+YP++ 
Sbjct:   153 AIESVSAKGGEKLQQLSVQQVIDCSYQNQGCNGGSPVEALYWLTQSKLKLVSEAEYPFKG 212

Query:   222 EQGTCDKQKEKAAAATIGKYEDLP-KGDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV 279
               G C    +  A   +  Y      G E  ++ A+    P+ V V+A   +++ Y  G+
Sbjct:   213 ADGVCQFFPQAHAGVAVRNYSAYDFSGQEEVMMSALVDFGPLVVIVDAI--SWQDYLGGI 270

Query:   280 LNAECGDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIA 337
             +   C  +  +H V + G+ T  E     YW+++NSWG +WG+ GY  I     +CG+A
Sbjct:   271 IQHHCSSHKANHAVLITGYDTTGE---VPYWIVRNSWGTSWGDDGYAYIKIGNDVCGVA 326


>WB|WBGene00013076
            symbol:Y51A2D.8 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            EMBL:AL021497 HSSP:P53634 HOGENOM:HOG000019851 PIR:T27079
            RefSeq:NP_507627.1 ProteinModelPortal:Q9XXQ7 SMR:Q9XXQ7
            MEROPS:C01.A49 EnsemblMetazoa:Y51A2D.8 GeneID:180208
            KEGG:cel:CELE_Y51A2D.8 UCSC:Y51A2D.8 CTD:180208 WormBase:Y51A2D.8
            eggNOG:NOG307864 InParanoid:Q9XXQ7 OMA:VAVYFKV NextBio:908434
            Uniprot:Q9XXQ7
        Length = 386

 Score = 329 (120.9 bits), Expect = 9.3e-38, Sum P(2) = 9.3e-38
 Identities = 74/204 (36%), Positives = 109/204 (53%)

Query:   146 IKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEY 204
             IK+QG C  CW F+  A VE +     GK   LS+Q++ DC T+   GC GG +    +Y
Sbjct:   167 IKDQGQCACCWGFAVTALVETVYAAHSGKFKSLSDQEVCDCGTEGTPGCKGGSLTLGVQY 226

Query:   205 IIENKGLATEADYPYQQE---QGT-CD-KQKEKAAAATIGKYEDL-PKGDEHALLQAVT- 257
             + +  GL+ + DYPY Q    QG  C  ++ ++   A    +  + P+  E  ++Q +T 
Sbjct:   227 V-KKYGLSGDEDYPYDQNRANQGRRCRLRETDRIVPARAFNFAVINPRRAEEQIIQVLTE 285

Query:   258 -KQPVSVCVEASGQAFRFYKRGVL-NAECGDNCD-HGVAVVGFGTAEEEDGAK--YWLIK 312
              K PV+V  +  G  F+ YK GV+   +C      H  A+VG+ T E+  G    YW+IK
Sbjct:   286 WKVPVAVYFKV-GDQFKEYKEGVIIEDDCRRATQWHAGAIVGYDTVEDSRGRSHDYWIIK 344

Query:   313 NSWGETWGESGYIRILRDEGLCGI 336
             NSWG  W ESGY+R++R    C I
Sbjct:   345 NSWGGDWAESGYVRVVRGRDWCSI 368

 Score = 92 (37.4 bits), Expect = 9.3e-38, Sum P(2) = 9.3e-38
 Identities = 21/69 (30%), Positives = 34/69 (49%)

Query:    34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRT-Y--KLGTNE 90
             H   + +  E +  ++ R YKDE E   R   F ++   ++K N +     Y  + G N+
Sbjct:    35 HPEKLYKAFEDFKKKYNRKYKDESENQQRFNNFVKSYNNVDKLNAKSKAAGYDTQFGINK 94

Query:    91 FSDLTNEEF 99
             FSDL+  EF
Sbjct:    95 FSDLSTAEF 103


>UNIPROTKB|E9PI30
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            EMBL:AP001201 HGNC:HGNC:2546 IPI:IPI00984532
            ProteinModelPortal:E9PI30 SMR:E9PI30 Ensembl:ENST00000528419
            ArrayExpress:E9PI30 Bgee:E9PI30 Uniprot:E9PI30
        Length = 364

 Score = 343 (125.8 bits), Expect = 1.5e-37, Sum P(2) = 1.5e-37
 Identities = 88/274 (32%), Positives = 139/274 (50%)

Query:    40 EKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEF 99
             E  + +  Q  R+Y    E A RL IF  NL   ++  +E   T + G   FSDLT EEF
Sbjct:    40 EAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEF 99

Query:   100 RASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWRE-KGAVTHIKNQGHCGSCWAF 158
                Y GY               + + +    VP S DWR+   A++ IK+Q +C  CWA 
Sbjct:   100 GQLY-GYRRAAGGVPSMGREIRSEEPEE--SVPFSCDWRKVASAISPIKDQKNCNCCWAM 156

Query:   159 SAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYP 218
             +A   +E + +I+    +++S Q+L+DC    +GC GG +  AF  ++ N GLA+E DYP
Sbjct:   157 AAAGNIETLWRISFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNNSGLASEKDYP 216

Query:   219 YQQEQGT--CDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFY 275
             +Q +     C  +K +  A  I  +  L + +EH + Q + T  P++V +    +  + Y
Sbjct:   217 FQGKVRAHRCHPKKYQKVA-WIQDFIML-QNNEHRIAQYLATYGPITVTINM--KPLQLY 272

Query:   276 KRGVLNAE---CGDNC-DHGVAVVGFGTAEEEDG 305
             ++GV+ A    C     DH V +VGFG+ + E+G
Sbjct:   273 RKGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEG 306

 Score = 76 (31.8 bits), Expect = 1.5e-37, Sum P(2) = 1.5e-37
 Identities = 14/30 (46%), Positives = 18/30 (60%)

Query:   308 YWLIKNSWGETWGES-GYIRILRDEGLCGI 336
             YW++KNSWG  WGE    I   R +G  G+
Sbjct:   326 YWILKNSWGAQWGEKVSVIYWGRGQGRTGL 355


>WB|WBGene00011102
            symbol:R07E3.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            EMBL:Z49207 HSSP:P53634 PIR:T24030 RefSeq:NP_001041280.1
            ProteinModelPortal:Q21810 SMR:Q21810 STRING:Q21810 MEROPS:C01.A43
            PaxDb:Q21810 EnsemblMetazoa:R07E3.1a GeneID:181242
            KEGG:cel:CELE_R07E3.1 UCSC:R07E3.1a CTD:181242 WormBase:R07E3.1a
            HOGENOM:HOG000021028 InParanoid:Q21810 OMA:ACKNEVI NextBio:913066
            ArrayExpress:Q21810 Uniprot:Q21810
        Length = 402

 Score = 392 (143.0 bits), Expect = 2.1e-36, P = 2.1e-36
 Identities = 106/320 (33%), Positives = 147/320 (45%)

Query:    37 SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEK---ANKEGNRTYKLGTNEFSD 93
             +I +++  +  +  ++Y    E   RL  +    E I      N+ G+  Y  G N+ SD
Sbjct:    85 NIAKEYIAYTEKFDKSYATSQESLKRLNAYYNTDENIANWNIQNEHGSAEY--GHNDMSD 142

Query:    94 LTNEEF------RASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIK 147
              T+EEF      ++ Y   +                K ++ +  P   DWR+K  +T +K
Sbjct:   143 WTDEEFEKTLLPKSFYKRLHKEAEFIEPIPESLTAKKGESSSPFPDFFDWRDKNVITPVK 202

Query:   148 NQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIE 207
              QG CGSCWAF++ A VE    I  G+   LSEQ L+DC   +N C GG  DKAF YI  
Sbjct:   203 AQGQCGSCWAFASTATVEAAWAIAHGEKRNLSEQTLLDCDLVDNACDGGDEDKAFRYIHR 262

Query:   208 NKGLATEADYPY-QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQ-AVTKQPVSVCV 265
             N GLA   D PY    Q  C              Y      DE +++   V   PV++ +
Sbjct:   263 N-GLANAVDLPYVAHRQNGCAVNDHWNTTRIKAAY--FLHHDEDSIINWLVNFGPVNIGM 319

Query:   266 EASGQAFRFYKRGVLNAE---CGDNCD--HGVAVVGFGTAEEEDGAKYWLIKNSWGETWG 320
              A  Q  R YK GV       C +     H + + G+GT++   G KYW++KNSWG TWG
Sbjct:   320 -AVIQPMRAYKGGVFTPSEYACKNEVIGLHALLITGYGTSKT--GEKYWIVKNSWGNTWG 376

Query:   321 -ESGYIRILRDEGLCGIATE 339
              E GYI   R    CGI  E
Sbjct:   377 VEHGYIYFARGINACGIEDE 396


>UNIPROTKB|E1BPI9
            symbol:CTSO "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 CTD:1519 KO:K01374 OMA:SNVCGIA
            EMBL:DAAA02044933 IPI:IPI01004081 RefSeq:XP_002694471.2
            RefSeq:XP_874012.4 Ensembl:ENSBTAT00000014691 GeneID:616804
            KEGG:bta:616804 Uniprot:E1BPI9
        Length = 313

 Score = 386 (140.9 bits), Expect = 9.2e-36, P = 9.2e-36
 Identities = 93/284 (32%), Positives = 142/284 (50%)

Query:    66 FKQNLE---YIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXT 122
             F+++L    Y+       N T   G N+FS L  EEF+A Y                  T
Sbjct:    36 FRESLNRQRYLNSLFPYENSTAVYGINQFSYLFPEEFKAIYL--RSSPSRFPRFPAEEYT 93

Query:   123 FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
                 N++ +P   DWR+K  VT ++NQ  CG CWAFS V AVE +  I G  L  LS QQ
Sbjct:    94 -SISNLS-LPLRFDWRDKHVVTQVRNQKTCGGCWAFSVVGAVESVCAIKGQPLEVLSVQQ 151

Query:   183 LVDCSTDNNGCSGGLMDKAFEYI--IENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK 240
             ++DCS  N GC+GG    A  ++  ++ K L  +++YP+Q + G C    +  + ++I  
Sbjct:   152 VIDCSYSNYGCNGGSPLSALYWLNKLQVK-LVRDSEYPFQAQNGLCRYFSDSHSGSSIKG 210

Query:   241 YEDLP-KGDEHALLQAVTKQ-PVSVCVEASGQAFRFYKRGVLNAECGDN-CDHGVAVVGF 297
             Y      G E  + +A+    P+ V V+A   +++ Y  G++   C     +H V V GF
Sbjct:   211 YSAYDFSGQEDKMAEALLALGPLIVVVDA--MSWQDYLGGIIQHHCSSGEANHAVLVTGF 268

Query:   298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEAS 341
                ++     YW+++NSWG +WG  GY+R+     +CGIA   S
Sbjct:   269 ---DKTGSIPYWIVRNSWGTSWGIDGYVRVKMGGNVCGIADSVS 309


>UNIPROTKB|F1PGK4
            symbol:CTSO "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 OMA:SNVCGIA
            EMBL:AAEX03010073 Ensembl:ENSCAFT00000013638 Uniprot:F1PGK4
        Length = 316

 Score = 382 (139.5 bits), Expect = 2.4e-35, P = 2.4e-35
 Identities = 89/283 (31%), Positives = 140/283 (49%)

Query:    66 FKQNLE---YIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXT 122
             F+++L    Y+       N +   G N+FS L+ EEF+A Y                  T
Sbjct:    39 FRESLNRHRYLNSVFPRENSSAVYGINQFSYLSPEEFKAIYL--RSKPSRSPRYPAEVRT 96

Query:   123 FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
                +NV+ +P   DWR+K  VT ++NQ  CG CWAFS V AVE    I G  L ++S QQ
Sbjct:    97 -SIRNVS-LPLRFDWRDKRVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGKPLADISVQQ 154

Query:   183 LVDCSTDNNGCSGGLMDKAFEYIIENK-GLATEADYPYQQEQGTCDKQKEKAAAATIGKY 241
             ++DCS +N GCSGG    A  ++ + +  L  +++YP++ + G C    +  +  +I  Y
Sbjct:   155 VIDCSYNNYGCSGGSTLNALNWLNKTQVKLVRDSEYPFKAQNGLCHYFSDSYSGFSIRGY 214

Query:   242 EDLPKGDEHALLQAV--TKQPVSVCVEASGQAFRFYKRGVLNAECGDN-CDHGVAVVGFG 298
                   D+   +  V  T  P+ V V+A   +++ Y  G++   C     +H V + GF 
Sbjct:   215 SAYDFSDQEDEMAKVLLTFGPLVVVVDAV--SWQDYLGGIIQHHCSSGEANHAVLITGF- 271

Query:   299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEAS 341
               ++     YW+++NSWG +WG  GY  +     +CGIA   S
Sbjct:   272 --DKIGSTPYWIVRNSWGSSWGVDGYAHVKMGGNICGIADSVS 312


>UNIPROTKB|Q5T8F0
            symbol:CTSL1 "Cathepsin L1 light chain" species:9606 "Homo
            sapiens" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            EMBL:AL160279 UniGene:Hs.731507 UniGene:Hs.731952 HGNC:HGNC:2537
            ChiTaRS:CTSL1 IPI:IPI00640540 SMR:Q5T8F0 Ensembl:ENST00000342020
            ChEMBL:CHEMBL1293261 Uniprot:Q5T8F0
        Length = 225

 Score = 380 (138.8 bits), Expect = 4.0e-35, P = 4.0e-35
 Identities = 81/199 (40%), Positives = 112/199 (56%)

Query:    27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRT 83
             + S     + S+  +  +W A H R Y    E+  R  ++++N++ IE  N   +EG  +
Sbjct:    14 IASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKMIELHNQEYREGKHS 72

Query:    84 YKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAV 143
             + +  N F D+T+EEFR    G+                F+     + P S+DWREKG V
Sbjct:    73 FTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGK------VFQEPLFYEAPRSVDWREKGYV 126

Query:   144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS--TDNNGCSGGLMDKA 201
             T +KNQG CGSCWAFSA  A+EG      G+LI LSEQ LVDCS    N GC+GGLMD A
Sbjct:   127 TPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYA 186

Query:   202 FEYIIENKGLATEADYPYQ 220
             F+Y+ +N GL +E  YPY+
Sbjct:   187 FQYVQDNGGLDSEESYPYE 205


>UNIPROTKB|P43234
            symbol:CTSO "Cathepsin O" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 Reactome:REACT_6900
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0004197
            CleanEx:HS_CTSO EMBL:X77383 EMBL:BC049206 IPI:IPI00017257
            PIR:A55090 RefSeq:NP_001325.1 UniGene:Hs.75262
            ProteinModelPortal:P43234 SMR:P43234 IntAct:P43234 STRING:P43234
            MEROPS:C01.035 PhosphoSite:P43234 DMDM:1168795 PRIDE:P43234
            DNASU:1519 Ensembl:ENST00000433477 GeneID:1519 KEGG:hsa:1519
            UCSC:uc003ipg.3 CTD:1519 GeneCards:GC04M156845 HGNC:HGNC:2542
            HPA:HPA002041 MIM:600550 neXtProt:NX_P43234 PharmGKB:PA27040
            HOVERGEN:HBG105050 InParanoid:P43234 KO:K01374 OMA:SNVCGIA
            OrthoDB:EOG4V6ZH1 PhylomeDB:P43234 BindingDB:P43234
            ChEMBL:CHEMBL3035 GenomeRNAi:1519 NextBio:6287 Bgee:P43234
            Genevestigator:P43234 GermOnline:ENSG00000151792 Uniprot:P43234
        Length = 321

 Score = 376 (137.4 bits), Expect = 1.1e-34, P = 1.1e-34
 Identities = 90/283 (31%), Positives = 138/283 (48%)

Query:    66 FKQNLE---YIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXT 122
             F+++L    Y+       N T   G N+FS L  EEF+A Y                   
Sbjct:    44 FRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYL---RSKPSKFPRYSAEVH 100

Query:   123 FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
                 NV+ +P   DWR+K  VT ++NQ  CG CWAFS V AVE    I G  L +LS QQ
Sbjct:   101 MSIPNVS-LPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQ 159

Query:   183 LVDCSTDNNGCSGGLMDKAFEYIIENK-GLATEADYPYQQEQGTCDKQKEKAAAATIGKY 241
             ++DCS +N GC+GG    A  ++ + +  L  +++YP++ + G C       +  +I  Y
Sbjct:   160 VIDCSYNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGY 219

Query:   242 EDLPKGD-EHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLNAECGDN-CDHGVAVVGFG 298
                   D E  + +A+ T  P+ V V+A   +++ Y  G++   C     +H V + GF 
Sbjct:   220 SAYDFSDQEDEMAKALLTFGPLVVIVDAV--SWQDYLGGIIQHHCSSGEANHAVLITGF- 276

Query:   299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEAS 341
               ++     YW+++NSWG +WG  GY  +     +CGIA   S
Sbjct:   277 --DKTGSTPYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVS 317


>MGI|MGI:2139628
            symbol:Ctso "cathepsin O" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:2139628 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 GeneTree:ENSGT00560000076599 MEROPS:C01.035 CTD:1519
            HOVERGEN:HBG105050 KO:K01374 OMA:SNVCGIA OrthoDB:EOG4V6ZH1
            EMBL:AK034490 EMBL:AK049470 EMBL:AK165930 EMBL:AK166103
            EMBL:BC044664 IPI:IPI00453524 RefSeq:NP_808330.1 UniGene:Mm.254642
            ProteinModelPortal:Q8BM88 SMR:Q8BM88 STRING:Q8BM88
            PhosphoSite:Q8BM88 PRIDE:Q8BM88 Ensembl:ENSMUST00000029649
            GeneID:229445 KEGG:mmu:229445 UCSC:uc008pon.1 InParanoid:Q8BM88
            NextBio:379433 Bgee:Q8BM88 CleanEx:MM_CTSO Genevestigator:Q8BM88
            GermOnline:ENSMUSG00000028015 Uniprot:Q8BM88
        Length = 312

 Score = 368 (134.6 bits), Expect = 7.4e-34, P = 7.4e-34
 Identities = 88/282 (31%), Positives = 137/282 (48%)

Query:    60 AMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXX 119
             A+R ++ +    Y+     E N T   G N+FS L  EEF+A Y G              
Sbjct:    35 ALRESLHRHR--YLNSFPHE-NSTAFYGVNQFSYLFPEEFKALYLGSKYAWAPRYPAEGQ 91

Query:   120 XXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELS 179
                    NV+ +P   DWR+K  V  ++NQ  CG CWAFS V+A+E    I G  L  LS
Sbjct:    92 R---PIPNVS-LPLRFDWRDKHVVNPVRNQEMCGGCWAFSVVSAIESARAIQGKSLDYLS 147

Query:   180 EQQLVDCSTDNNGCSGGLMDKAFEYIIENK-GLATEADYPYQQEQGTCDKQKEKAAAATI 238
              QQ++DCS +N+GC GG    A  ++ E +  L  ++ YP++   G C    +  A  ++
Sbjct:   148 VQQVIDCSFNNSGCLGGSPLCALRWLNETQLKLVADSQYPFKAVNGQCRHFPQSQAGVSV 207

Query:   239 GKYEDLP-KGDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGVLNAECGDN-CDHGVAVV 295
               +     +G E  + +A+    P+ V V+A   +++ Y  G++   C     +H V + 
Sbjct:   208 KDFSAYNFRGQEDEMARALLSFGPLVVIVDA--MSWQDYLGGIIQHHCSSGEANHAVLIT 265

Query:   296 GFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIA 337
             GF   +      YW+++NSWG +WG  GY  +     +CGIA
Sbjct:   266 GF---DRTGNTPYWMVRNSWGSSWGVEGYAHVKMGGNVCGIA 304


>UNIPROTKB|F1P0K2
            symbol:CTSO "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            OMA:SNVCGIA EMBL:AADN02016534 IPI:IPI00651180
            Ensembl:ENSGALT00000015270 Uniprot:F1P0K2
        Length = 320

 Score = 359 (131.4 bits), Expect = 6.7e-33, P = 6.7e-33
 Identities = 85/270 (31%), Positives = 129/270 (47%)

Query:    76 ANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSI 135
             +N  G+  Y  G N+FS L  EEF+A Y                    K +    +P   
Sbjct:    60 SNDNGSAFY--GKNQFSHLFPEEFKAIYL-----RSIPYKLPRYIKVPKGEE-KPLPKKF 111

Query:   136 DWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSG 195
             DWR+K  +  ++NQ  CG CWAFS V  +E    I G  L ELS QQ++DCS  N GCSG
Sbjct:   112 DWRDKKVIAEVRNQQTCGGCWAFSVVGGIESAYAIKGHNLEELSVQQVIDCSYSNYGCSG 171

Query:   196 GLMDKAFEYIIENK-GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP-KGDEHALL 253
             G    A  ++ + K  L  +++Y ++ + G C          +I  +      G E  ++
Sbjct:   172 GSTITALSWLNQTKVKLVRDSEYTFKAQTGLCHYFPHSDFGVSITGFAAYDFSGQEEEMM 231

Query:   254 QAVTKQ-PVSVCVEASGQAFRFYKRGVLNAECGDN-CDHGVAVVGFGTAEEEDGAKYWLI 311
             + +    P++V V+A   +++ Y  G++   C     +H V + GF T        YW++
Sbjct:   232 RVLVDWGPLAVTVDAV--SWQDYLGGIIQYHCSSGKANHAVLITGFDTTGI---IPYWIV 286

Query:   312 KNSWGETWGESGYIRILRDEGLCGIATEAS 341
             +NSWG TWG  GY+R+     +CGIA   S
Sbjct:   287 QNSWGRTWGIDGYVRVKIGSNVCGIADTVS 316


>WB|WBGene00019314
            symbol:K02E7.10 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 SMART:SM00645 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 HSSP:P07711
            EMBL:FO080411 PIR:T32392 RefSeq:NP_493904.1 UniGene:Cel.14828
            ProteinModelPortal:O17255 SMR:O17255 EnsemblMetazoa:K02E7.10
            GeneID:186889 KEGG:cel:CELE_K02E7.10 UCSC:K02E7.10 CTD:186889
            WormBase:K02E7.10 eggNOG:NOG331187 HOGENOM:HOG000114005
            InParanoid:O17255 OMA:GNANEAR NextBio:933344 Uniprot:O17255
        Length = 299

 Score = 359 (131.4 bits), Expect = 6.7e-33, P = 6.7e-33
 Identities = 79/218 (36%), Positives = 120/218 (55%)

Query:   135 IDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGI-TQITGGKLIELSEQQLVDCSTDNNGC 193
             +DWREKG V  +K+QG C + +AF+A+AA+E +  +   GKL+  SEQQ++DC+   N C
Sbjct:    84 LDWREKGIVGPVKDQGKCNASYAFAAIAAIESMYAKANNGKLLSFSEQQIIDCANFTNPC 143

Query:   194 SGGLMDKAFEYIIENKGLATEADYPY--QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
                L +      ++  G+ TEADYPY  ++  G C+    K        Y D+   +E A
Sbjct:   144 QENLENVLSNRFLKENGVGTEADYPYVGKENVGKCEYDSSKMKLRPT--YIDVYPNEEWA 201

Query:   252 LLQAVTKQPVSVCVEASGQAFRFYKRGVLNA---ECGD-NCDHGVAVVGFGTAEEEDGA- 306
                 +T          S  +F  YK G+ N    ECG+ N    +A+VG+G    +DGA 
Sbjct:   202 RAH-ITTFGTGYFRMRSPPSFFHYKTGIYNPTKEECGNANEARSLAIVGYG----KDGAE 256

Query:   307 KYWLIKNSWGETWGESGYIRILRDEGLCGIATEASYPV 344
             KYW++K S+G +WGE GY+++ R+   CG+A   S P+
Sbjct:   257 KYWIVKGSFGTSWGEHGYMKLARNVNACGMAESISIPI 294


>WB|WBGene00008231
            symbol:tag-329 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            eggNOG:NOG288820 EMBL:Z70750 HSSP:P53634 HOGENOM:HOG000019851
            PIR:T20110 RefSeq:NP_505458.1 ProteinModelPortal:Q18740 SMR:Q18740
            MEROPS:C01.A36 EnsemblMetazoa:C50F4.3 GeneID:183677
            KEGG:cel:CELE_C50F4.3 UCSC:C50F4.3 CTD:183677 WormBase:C50F4.3
            InParanoid:Q18740 OMA:WIFRNSW NextBio:921986 Uniprot:Q18740
        Length = 374

 Score = 350 (128.3 bits), Expect = 6.0e-32, P = 6.0e-32
 Identities = 94/323 (29%), Positives = 144/323 (44%)

Query:    38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRT---YKLGTNEFSDL 94
             + ++ E ++ ++ R YKDE+EK  R   F      + K NK   +     K G N+FSDL
Sbjct:    43 LYKEFEDFIVKYKRNYKDEIEKKFRFQQFVATHNRVGKMNKAAKKAGHDTKYGINKFSDL 102

Query:    95 TNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTD-VPTSIDWREKGAVTH-----IKN 148
             + +E    Y+ +                 + +   + +P + D R K    H     IK 
Sbjct:   103 SKKEIHGMYSKFGPPKNNTNVPKFNLKNLRVKRQMEGLPKTFDLRNKKVGGHYIIGPIKT 162

Query:   149 QGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIE 207
             Q  C  CW F+A A  E    +   K + LSEQ++ DC+  +  GC+GG      EYI E
Sbjct:   163 QDSCACCWGFAATAVAEAALTVHLKKAMNLSEQEVCDCAPKHGPGCNGGDPVDGLEYIKE 222

Query:   208 NKGLATEADYPYQQEQGT----CDKQK--EKAAAATIGKYEDLPKGDEHALLQAV--TKQ 259
               GL    +YP+   + T    C+ +K   +     +  Y   P   E+ +   +     
Sbjct:   223 -MGLTGGKEYPFNVNRSTQLGRCESEKYDRELNPLELDYYAIDPFNAEYQMTHHLYLLNL 281

Query:   260 PVSVCVEASGQAFRFYKRGVLN-AECGDNCD---HGVAVVGFGTAEEEDG--AKYWLIKN 313
             P+SV    +G +   Y  G+L  A+C D      H  A+VG+GT +   G    YW+ +N
Sbjct:   282 PISVAFR-TGASLSSYLSGILELADCDDEKGGHWHSGAIVGYGTTKNSAGRTVDYWIFRN 340

Query:   314 SWGETWGESGYIRILRDEGLCGI 336
             SW   WG+ GY RI+R E  C I
Sbjct:   341 SWWTDWGDDGYARIVRGEDWCSI 363


>UNIPROTKB|H0YD65
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 EMBL:AP002748
            HGNC:HGNC:2531 ChiTaRS:CTSF Ensembl:ENST00000524994 Uniprot:H0YD65
        Length = 283

 Score = 348 (127.6 bits), Expect = 9.8e-32, P = 9.8e-32
 Identities = 84/238 (35%), Positives = 119/238 (50%)

Query:    43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
             + ++  + RTY+ + E   RL++F  N+   +K       T + G  +FSDLT EEFR  
Sbjct:    37 KNFVITYNRTYESK-EARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTI 95

Query:   103 YTGYNXXXXXXXXXXXXXXTFKYQNVTDV-PTSIDWREKGAVTHIKNQGHCGSCWAFSAV 161
             Y   N                  ++V D+ P   DWR KGAVT +K+QG CGSCWAFS  
Sbjct:    96 YL--NTLLRKEPGNKMKQA----KSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVT 149

Query:   162 AAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQ 221
               VEG   +  G L+ LSEQ+L+DC   +  C GGL   A+  I    GL TE DY YQ 
Sbjct:   150 GNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQG 209

Query:   222 EQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGV 279
                +C+   EKA    I    +L + ++        + P+SV + A G   +FY+ G+
Sbjct:   210 HMQSCNFSAEKAKVY-INDSVELSQNEQKLAAWLAKRGPISVAINAFGM--QFYRHGI 264


>UNIPROTKB|Q5QP40
            symbol:CTSK "Cathepsin K" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015644 Pfam:PF00112
            InterPro:IPR000169 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 EMBL:AL355860 HOVERGEN:HBG011513
            PANTHER:PTHR12411:SF55 EMBL:AL356292 UniGene:Hs.632466
            HGNC:HGNC:2536 IPI:IPI00514633 SMR:Q5QP40 STRING:Q5QP40
            Ensembl:ENST00000443913 Uniprot:Q5QP40
        Length = 258

 Score = 346 (126.9 bits), Expect = 1.6e-31, P = 1.6e-31
 Identities = 71/187 (37%), Positives = 109/187 (58%)

Query:    32 SMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLG 87
             +++   I++ H E W   H + Y +++++  R  I+++NL+YI   N E   G  TY+L 
Sbjct:    74 ALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELA 133

Query:    88 TNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIK 147
              N   D+T+EE     TG                  +++     P S+D+R+KG VT +K
Sbjct:   134 MNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIP-EWEG--RAPDSVDYRKKGYVTPVK 190

Query:   148 NQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIE 207
             NQG CGSCWAFS+V A+EG  +   GKL+ LS Q LVDC ++N+GC GG M  AF+Y+ +
Sbjct:   191 NQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQK 250

Query:   208 NKGLATE 214
             N+G+ +E
Sbjct:   251 NRGIDSE 257


>DICTYBASE|DDB_G0276111
            symbol:DDB_G0276111 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0276111 Pfam:PF00188
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            PROSITE:PS00139 EMBL:AAFI02000014 Gene3D:3.40.33.10
            InterPro:IPR001283 InterPro:IPR014044 PRINTS:PR00837 SMART:SM00198
            SUPFAM:SSF55797 ProtClustDB:CLSZ2429919 RefSeq:XP_643261.1
            ProteinModelPortal:Q75JH0 EnsemblProtists:DDB0169514 GeneID:8620304
            KEGG:ddi:DDB_G0276111 InParanoid:Q75JH0 OMA:GFVTSIK Uniprot:Q75JH0
        Length = 415

 Score = 303 (111.7 bits), Expect = 5.7e-27, P = 5.7e-27
 Identities = 75/215 (34%), Positives = 113/215 (52%)

Query:   131 VPTS----IDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKL---IELSEQQL 183
             +PTS    +DW+  G VT IKNQG CG C++F+  AA+E    I        I+LSEQ  
Sbjct:   205 LPTSSTGDVDWKSLGFVTSIKNQGQCGGCYSFATCAALESAYLIKNNLPNTDIDLSEQNF 264

Query:   184 VDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYED 243
             V C   N GC GG      + + ++ G+  E  YPY+   G+C    +         Y +
Sbjct:   265 VSCV--NYGCGGGNGQSCLDKL-KSTGIMYETSYPYKAVTGSCPNVIQSPQPFKWTGYSN 321

Query:   244 LPKGDEHALLQAVTKQPV--SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAE 301
             + +G++ A L A+   P+  S+ V+ SG  F+ YK G+ +       +H + +VG+ +A+
Sbjct:   322 I-QGNKEAFLNALKSGPIYASLYVD-SG--FQLYKSGIYSCSQSSTPNHAITIVGYSSAD 377

Query:   302 EEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGI 336
                    +LIKNSWG  +GESGYIR+   EG C +
Sbjct:   378 NS-----YLIKNSWGTIYGESGYIRL--KEGSCNL 405


>WB|WBGene00022189
            symbol:Y71H2AR.2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0004181
            "metallocarboxypeptidase activity" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] [GO:0008340 "determination of adult lifespan"
            evidence=IMP] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0008340 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 HSSP:P07711
            eggNOG:NOG331187 HOGENOM:HOG000114005 EMBL:FO081570
            RefSeq:NP_497627.1 UniGene:Cel.28419 ProteinModelPortal:Q9BL26
            SMR:Q9BL26 EnsemblMetazoa:Y71H2AR.2 GeneID:190615
            KEGG:cel:CELE_Y71H2AR.2 UCSC:Y71H2AR.2 CTD:190615
            WormBase:Y71H2AR.2 InParanoid:Q9BL26 OMA:CAMATTI NextBio:946382
            Uniprot:Q9BL26
        Length = 345

 Score = 295 (108.9 bits), Expect = 4.0e-26, P = 4.0e-26
 Identities = 74/214 (34%), Positives = 110/214 (51%)

Query:   135 IDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGI-TQITGGKLIELSEQQLVDCSTDN-NG 192
             +DWREKG V  +K+QG C +  AF+  +++E +  + T G L+  SEQQL+DC+     G
Sbjct:    86 LDWREKGIVGPVKDQGKCNASHAFAITSSIESMYAKATNGTLLSFSEQQLIDCNDQGYKG 145

Query:   193 CSGGLMDKAFEYIIENKGLATEADYPY---QQEQGTCDKQKEKAAAATIGKYEDLPKGDE 249
             C       A  Y+  + G+ TEADYPY     E+ T D  K K       K   + +G+E
Sbjct:   146 CEEQFAMNAIGYLATH-GIETEADYPYVDKTNEKCTFDSTKSKIHL----KKGVVAEGNE 200

Query:   250 HALLQAVTKQ-PVSVCVEASGQAFRFYKRGVLNA---ECGDNCD-HGVAVVGFGTAEEED 304
                   VT   P    + A    +  YK G+ N    EC    +   + +VG+G   E+ 
Sbjct:   201 VLGKVYVTNYGPAFFTMRAPPSLYD-YKIGIYNPSIEECTSTHEIRSMVIVGYGIEGEQ- 258

Query:   305 GAKYWLIKNSWGETWGESGYIRILRDEGLCGIAT 338
               KYW++K S+G +WGE GY+++ RD   C +AT
Sbjct:   259 --KYWIVKGSFGTSWGEQGYMKLARDVNACAMAT 290


>WB|WBGene00044760
            symbol:Y71H2AM.25 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0004181
            "metallocarboxypeptidase activity" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            GeneTree:ENSGT00560000076599 EMBL:FO081822 eggNOG:NOG331187
            HOGENOM:HOG000114005 RefSeq:NP_001040887.1
            ProteinModelPortal:Q2AAB9 SMR:Q2AAB9 EnsemblMetazoa:Y71H2AM.25
            GeneID:4363054 KEGG:cel:CELE_Y71H2AM.25 UCSC:Y71H2AM.25 CTD:4363054
            WormBase:Y71H2AM.25 InParanoid:Q2AAB9 NextBio:959635 Uniprot:Q2AAB9
        Length = 299

 Score = 290 (107.1 bits), Expect = 1.4e-25, P = 1.4e-25
 Identities = 75/230 (32%), Positives = 112/230 (48%)

Query:   122 TFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGI-TQITGGKLIELSE 180
             T KY  +      +DWR+KG V  +K+QG C +  AF+  +++E +  + T G L+  SE
Sbjct:    74 TPKY-TIQTTEEFLDWRDKGIVGPVKDQGKCNASHAFAISSSIESMYAKATNGSLLSFSE 132

Query:   181 QQLVDCSTDN-NGCSGGLMDKAFEYIIENKGLATEADYPYQ-QEQGTCDKQKEKAAAATI 238
             QQL+DC      GC       A  Y I + G+ TEADYPY  +E G C     K+     
Sbjct:   133 QQLIDCDDHGFKGCEEQPAINAVSYFIFH-GIETEADYPYAGKENGKCTFDSTKSKIQL- 190

Query:   239 GKYEDLPKGDEHALLQAVTKQ-PVSVCVEASGQAFRFYKRGVLNA---ECGDNCD-HGVA 293
              K  +    +E    + VT   P    + A    +  YK G+ N    EC    +   + 
Sbjct:   191 -KDAEFVVSNETQGKELVTNYGPAFFTMRAPPSLYD-YKIGIYNPSIEECTSTHEIRSMV 248

Query:   294 VVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEASYP 343
             +VG+G    E   KYW++K S+G +WGE GY+++ RD   C +A   + P
Sbjct:   249 IVGYGI---EGVQKYWIVKGSFGTSWGEQGYMKLARDVNACAMADFITVP 295


>WB|WBGene00013764
            symbol:Y113G7B.15 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 GeneTree:ENSGT00560000076599
            EMBL:AL110477 HOGENOM:HOG000019851 RefSeq:NP_507904.2
            ProteinModelPortal:Q9U2X1 SMR:Q9U2X1 DIP:DIP-25339N IntAct:Q9U2X1
            MINT:MINT-1058673 STRING:Q9U2X1 MEROPS:C01.A47
            EnsemblMetazoa:Y113G7B.15 GeneID:190976 KEGG:cel:CELE_Y113G7B.15
            UCSC:Y113G7B.15 CTD:190976 WormBase:Y113G7B.15 eggNOG:NOG302449
            OMA:AEEDIME Uniprot:Q9U2X1
        Length = 362

 Score = 288 (106.4 bits), Expect = 2.2e-25, P = 2.2e-25
 Identities = 73/231 (31%), Positives = 112/231 (48%)

Query:   130 DVPTSIDWRE---KGA--VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLV 184
             D+P   D R+    G+  V  +K+Q  CG CWAF+  A  E    +       LS+Q++ 
Sbjct:   130 DIPDYFDLRDIYVDGSPVVGPVKDQEQCGCCWAFATTAITEAANTLYSKSFTSLSDQEIC 189

Query:   185 DC--STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQ----GTC--DKQKEKAAAA 236
             DC  S D  GC GG      + ++  +G +++ DYPY++ +    G C  D++       
Sbjct:   190 DCADSGDTPGCVGGDPRNGLK-MVHLRGQSSDGDYPYEEYRANTTGNCVGDEKSTVIQPE 248

Query:   237 TIGKYE-DLPKGDEHALLQAVTKQ-PVSVCVEASGQAFRFYKRGVLNAE-CGDNCD---H 290
             T+  Y  D    +E  +        P +V     G+ F +Y  GVL +E C        H
Sbjct:   249 TLNVYRFDQDYAEEDIMENLYLNHIPTAVYFRV-GENFEWYTSGVLQSEDCYQMTPAEWH 307

Query:   291 GVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEAS 341
              VA+VG+GT++  DG  YWL++NSW   WG  GY++I R    C I + A+
Sbjct:   308 SVAIVGYGTSD--DGVPYWLVRNSWNSDWGLHGYVKIRRGVNWCLIESHAA 356

 Score = 183 (69.5 bits), Expect = 7.1e-12, P = 7.1e-12
 Identities = 55/210 (26%), Positives = 90/210 (42%)

Query:    32 SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE----KANKEGNRTYKLG 87
             + H   ++     +   H + Y+   EK  RL  F +N + I+    KA +EG R    G
Sbjct:    20 TQHSQEVLSHFNNFTMHHKKHYRTPAEKDRRLAHFAKNHQKIQELNAKARREG-RNVTFG 78

Query:    88 TNEFSDLTNEEFRASYTGY---NXXXXXXXXXXXXXXTFKYQNVT------DVPTSIDWR 138
              N+F+D   +E  A  +     N              +  + N        D+P   D R
Sbjct:    79 WNKFADKNRQELSARNSKIHPKNHTDLPIYKPRHPRGSRNHHNKRSKRQSGDIPDYFDLR 138

Query:   139 E---KGA--VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC--STDNN 191
             +    G+  V  +K+Q  CG CWAF+  A  E    +       LS+Q++ DC  S D  
Sbjct:   139 DIYVDGSPVVGPVKDQEQCGCCWAFATTAITEAANTLYSKSFTSLSDQEICDCADSGDTP 198

Query:   192 GCSGGLMDKAFEYIIENKGLATEADYPYQQ 221
             GC GG      + ++  +G +++ DYPY++
Sbjct:   199 GCVGGDPRNGLK-MVHLRGQSSDGDYPYEE 227


>UNIPROTKB|E9PKT6
            symbol:CTSH "Cathepsin H" species:9606 "Homo sapiens"
            [GO:0001520 "outer dense fiber" evidence=IEA] [GO:0001656
            "metanephros development" evidence=IEA] [GO:0001669 "acrosomal
            vesicle" evidence=IEA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0007283 "spermatogenesis" evidence=IEA] [GO:0008284 "positive
            regulation of cell proliferation" evidence=IEA] [GO:0010634
            "positive regulation of epithelial cell migration" evidence=IEA]
            [GO:0016505 "apoptotic protease activator activity" evidence=IEA]
            [GO:0030984 "kininogen binding" evidence=IEA] [GO:0031638 "zymogen
            activation" evidence=IEA] [GO:0031648 "protein destabilization"
            evidence=IEA] [GO:0032403 "protein complex binding" evidence=IEA]
            [GO:0032526 "response to retinoic acid" evidence=IEA] [GO:0033619
            "membrane protein proteolysis" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0043621 "protein
            self-association" evidence=IEA] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IEA] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            InterPro:IPR000169 GO:GO:0043066 GO:GO:0008284 PANTHER:PTHR12411
            PROSITE:PS00139 GO:GO:0045766 GO:GO:0004252 GO:GO:0032526
            GO:GO:0016505 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 GO:GO:0060448 GO:GO:0033619
            EMBL:AC011944 HGNC:HGNC:2535 IPI:IPI00375426
            ProteinModelPortal:E9PKT6 SMR:E9PKT6 PRIDE:E9PKT6
            Ensembl:ENST00000528741 ArrayExpress:E9PKT6 Bgee:E9PKT6
            Uniprot:E9PKT6
        Length = 134

 Score = 288 (106.4 bits), Expect = 2.2e-25, P = 2.2e-25
 Identities = 60/138 (43%), Positives = 78/138 (56%)

Query:    86 LGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGA-VT 144
             +  N+FSD++  E +  Y                  T  Y      P S+DWR+KG  V+
Sbjct:     1 MALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPY------PPSVDWRKKGNFVS 54

Query:   145 HIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAF 202
              +KNQG CGSCW FS   A+E    I  GK++ L+EQQLVDC+ D  N+GC GGL  +AF
Sbjct:    55 PVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAF 114

Query:   203 EYIIENKGLATEADYPYQ 220
             EYI+ NKG+  E  YPYQ
Sbjct:   115 EYILYNKGIMGEDTYPYQ 132


>UNIPROTKB|F1RWA9
            symbol:CTSO "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 EMBL:CU855637
            Ensembl:ENSSSCT00000009707 OMA:WAFSIVG Uniprot:F1RWA9
        Length = 194

 Score = 288 (106.4 bits), Expect = 2.2e-25, P = 2.2e-25
 Identities = 63/194 (32%), Positives = 100/194 (51%)

Query:   152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENK-G 210
             CG CWAFS V+AVE    I G  L  LS QQ++DCS +N GC+GG    A  ++ + +  
Sbjct:     2 CGGCWAFSVVSAVESAYAIKGQPLEVLSVQQVIDCSYNNYGCNGGSTLNALYWLNKTQVK 61

Query:   211 LATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP-KGDEHALLQAV-TKQPVSVCVEAS 268
             + ++++YP++ + G C       +  +I  Y      G E  + + + T  P+ V V+A 
Sbjct:    62 VVSDSEYPFKAQNGLCHYFSCSHSGVSIKDYSAYDFSGQEDEMAKTLLTLGPLIVIVDAV 121

Query:   269 GQAFRFYKRGVLNAECGDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI 327
               +++ Y  G++   C     +H V V GF   ++     YW+++NSWG  WG  GY  +
Sbjct:   122 --SWQDYLGGIIQHHCSSGEANHAVLVTGF---DKTGSTPYWIVRNSWGSAWGIDGYALV 176

Query:   328 LRDEGLCGIATEAS 341
                  +CGIA   S
Sbjct:   177 KMGGNICGIADSVS 190


>WB|WBGene00008861
            symbol:F15D4.4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 SMART:SM00848 PROSITE:PS00640
            GeneTree:ENSGT00560000076599 EMBL:Z80344 HSSP:P53634
            eggNOG:NOG310593 PIR:T20981 ProteinModelPortal:Q93512 SMR:Q93512
            MEROPS:C01.A45 EnsemblMetazoa:F15D4.4 KEGG:cel:CELE_F15D4.4
            UCSC:F15D4.4 CTD:184530 WormBase:F15D4.4 InParanoid:Q93512
            OMA:ITMEQNI NextBio:925068 Uniprot:Q93512
        Length = 608

 Score = 289 (106.8 bits), Expect = 1.1e-24, P = 1.1e-24
 Identities = 86/296 (29%), Positives = 130/296 (43%)

Query:    58 EKAMRLTIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXX 114
             E   R  ++ +  + +++ N   + G  +YK+ TN+FS   + E  A  T  N       
Sbjct:   150 EGLKRFNVYSKVKKEVDEHNIMYELGMSSYKMSTNQFSVALDGEV-APLT-LNLDALTPT 207

Query:   115 XXXXXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGK 174
                    T   +   D   ++DWR    +  I +Q  CG CWAFS ++ +E    I G  
Sbjct:   208 ATVIPA-TISSRKKRDTEPTVDWRP--FLKPILDQSTCGGCWAFSMISMIESFFAIQGYN 264

Query:   175 LIELSEQQLVDCSTD--------NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTC 226
                LS QQL+ C T         N GC GG    A  Y+ E       +  P+  E  +C
Sbjct:   265 TSSLSVQQLLTCDTKVDSTYGLANVGCKGGYFQIAGSYL-EVSAARDASLIPFDLEDTSC 323

Query:   227 DKQKEKAAAATIGKYED-LPKGD---------EHALLQAVTKQPVSVCVEASGQAFRFYK 276
             D         TI  ++D    G+         E  +   V K P++V + A    ++ Y 
Sbjct:   324 DSSFFPPVVPTILLFDDGYISGNFTAAQLITMEQNIEDKVRKGPIAVGMAAGPDIYK-YS 382

Query:   277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEG 332
              GV + +CG   +H V +VGF     +D   YW+I+NSWG +WGE+GY R+ R  G
Sbjct:   383 EGVYDGDCGTIINHAVVIVGF----TDD---YWIIRNSWGASWGEAGYFRVKRTPG 431


>UNIPROTKB|P53634
            symbol:CTSC "Dipeptidyl peptidase 1" species:9606 "Homo
            sapiens" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0005794
            "Golgi apparatus" evidence=IEA] [GO:0007568 "aging" evidence=IEA]
            [GO:0010033 "response to organic substance" evidence=IEA]
            [GO:0031404 "chloride ion binding" evidence=IEA] [GO:0042802
            "identical protein binding" evidence=IEA] [GO:0043621 "protein
            self-association" evidence=IEA] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0006508 "proteolysis" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0006955
            "immune response" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005783 GO:GO:0005794 Reactome:REACT_6900
            GO:GO:0006955 GO:GO:0007568 GO:GO:0010033 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004252 GO:GO:0005764 GO:GO:0031404 CTD:1075
            HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ InterPro:IPR014882
            Pfam:PF08773 MEROPS:C01.070 EMBL:X87212 EMBL:U79415 EMBL:AF234263
            EMBL:AF234264 EMBL:AF254757 EMBL:AF525032 EMBL:AF525033
            EMBL:AK292117 EMBL:AK311923 EMBL:AK223038 EMBL:BX537913
            EMBL:AC011088 EMBL:CH471185 EMBL:BC054028 EMBL:BC100891
            EMBL:BC100892 EMBL:BC100893 EMBL:BC100894 EMBL:BC109386
            EMBL:BC110071 EMBL:BC113850 EMBL:BC113897 IPI:IPI00022810
            IPI:IPI00171323 IPI:IPI00872258 PIR:S23941 PIR:S66504
            RefSeq:NP_001107645.1 RefSeq:NP_001805.3 RefSeq:NP_680475.1
            UniGene:Hs.128065 PDB:1K3B PDB:2DJF PDB:2DJG PDB:3PDF PDBsum:1K3B
            PDBsum:2DJF PDBsum:2DJG PDBsum:3PDF ProteinModelPortal:P53634
            SMR:P53634 IntAct:P53634 MINT:MINT-4655964 STRING:P53634
            PhosphoSite:P53634 DMDM:1705632 PaxDb:P53634 PRIDE:P53634
            DNASU:1075 Ensembl:ENST00000227266 Ensembl:ENST00000524463
            Ensembl:ENST00000529974 GeneID:1075 KEGG:hsa:1075 UCSC:uc001pck.4
            UCSC:uc001pcm.4 GeneCards:GC11M088026 HGNC:HGNC:2528 HPA:CAB025364
            MIM:170650 MIM:245000 MIM:245010 MIM:602365 neXtProt:NX_P53634
            Orphanet:2342 Orphanet:678 PharmGKB:PA27028 HOGENOM:HOG000127503
            InParanoid:P53634 OMA:YDDFLHY PhylomeDB:P53634
            BioCyc:MetaCyc:HS03265-MONOMER SABIO-RK:P53634 BindingDB:P53634
            ChEMBL:CHEMBL2252 EvolutionaryTrace:P53634 GenomeRNAi:1075
            NextBio:4488 PMAP-CutDB:P53634 ArrayExpress:P53634 Bgee:P53634
            Genevestigator:P53634 GermOnline:ENSG00000109861 GO:GO:0001913
            Uniprot:P53634
        Length = 463

 Score = 283 (104.7 bits), Expect = 1.8e-24, P = 1.8e-24
 Identities = 82/238 (34%), Positives = 115/238 (48%)

Query:   126 QNVTDVPTSIDWRE-KGA--VTHIKNQGHCGSCWAFSAVAAVEG-ITQITGGKLIE-LSE 180
             Q +  +PTS DWR   G   V+ ++NQ  CGSC++F+++  +E  I  +T       LS 
Sbjct:   226 QKILHLPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSP 285

Query:   181 QQLVDCSTDNNGCSGGLMDKAFEYIIENK-----GLATEADYPYQQEQGTCDKQKEKAAA 235
             Q++V CS    GC GG     F Y+I  K     GL  EA +PY      C K KE    
Sbjct:   286 QEVVSCSQYAQGCEGG-----FPYLIAGKYAQDFGLVEEACFPYTGTDSPC-KMKEDCFR 339

Query:   236 ATIGKYEDLPK---GDEHAL--LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECG----- 285
                 +Y  +     G   AL  L+ V   P++V  E     F  YK+G+ +   G     
Sbjct:   340 YYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD-FLHYKKGIYH-HTGLRDPF 397

Query:   286 ---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEA 340
                +  +H V +VG+GT +   G  YW++KNSWG  WGE+GY RI R    C I + A
Sbjct:   398 NPFELTNHAVLLVGYGT-DSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESIA 454


>UNIPROTKB|F1N455
            symbol:CTSC "Dipeptidyl peptidase 1 exclusion domain chain"
            species:9913 "Bos taurus" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 IPI:IPI00697314 UniGene:Bt.49573
            InterPro:IPR014882 Pfam:PF08773 OMA:YDDFLHY GO:GO:0001913
            EMBL:DAAA02062487 EMBL:DAAA02062488 Ensembl:ENSBTAT00000014735
            Uniprot:F1N455
        Length = 463

 Score = 278 (102.9 bits), Expect = 6.6e-24, P = 6.6e-24
 Identities = 81/238 (34%), Positives = 117/238 (49%)

Query:   126 QNVTDVPTSIDWRE-KGA--VTHIKNQGHCGSCWAFSAVAAVEG-ITQITGGKLIE-LSE 180
             + +  +PTS DWR   G   VT ++NQG CGSC++F+++  +E  I  +T       LS 
Sbjct:   226 KKILHLPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSP 285

Query:   181 QQLVDCSTDNNGCSGGLMDKAFEYIIENK-----GLATEADYPYQQEQGTCDKQKEKAAA 235
             Q++V CS    GC GG     F Y+I  K     GL  E  +PY      C + KE    
Sbjct:   286 QEVVSCSQYAQGCEGG-----FPYLIAGKYAQDFGLVEEDCFPYTGTDSPC-RLKEGCFR 339

Query:   236 ATIGKYEDLPK---GDEHAL--LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECG----- 285
                 +Y  +     G   AL  L+ V + P++V  E     F  Y++GV +   G     
Sbjct:   340 YYSSEYHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDD-FLHYRKGVYH-HTGLRDPF 397

Query:   286 ---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEA 340
                +  +H V +VG+GT +   G  YW++KNSWG +WGE+GY RI R    C I + A
Sbjct:   398 NPFELTNHAVLLVGYGT-DAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIA 454


>UNIPROTKB|Q3ZCJ8
            symbol:CTSC "Dipeptidyl peptidase 1" species:9913 "Bos
            taurus" [GO:0031638 "zymogen activation" evidence=IDA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 EMBL:BC102115 IPI:IPI00697314 RefSeq:NP_001028789.1
            UniGene:Bt.49573 ProteinModelPortal:Q3ZCJ8 SMR:Q3ZCJ8 STRING:Q3ZCJ8
            PRIDE:Q3ZCJ8 GeneID:352958 KEGG:bta:352958 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 InParanoid:Q3ZCJ8 KO:K01275
            OrthoDB:EOG4H19VZ BindingDB:Q3ZCJ8 ChEMBL:CHEMBL1075050
            NextBio:20812686 GO:GO:0031638 InterPro:IPR014882 Pfam:PF08773
            Uniprot:Q3ZCJ8
        Length = 463

 Score = 278 (102.9 bits), Expect = 6.6e-24, P = 6.6e-24
 Identities = 81/238 (34%), Positives = 117/238 (49%)

Query:   126 QNVTDVPTSIDWRE-KGA--VTHIKNQGHCGSCWAFSAVAAVEG-ITQITGGKLIE-LSE 180
             + +  +PTS DWR   G   VT ++NQG CGSC++F+++  +E  I  +T       LS 
Sbjct:   226 KKILHLPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSP 285

Query:   181 QQLVDCSTDNNGCSGGLMDKAFEYIIENK-----GLATEADYPYQQEQGTCDKQKEKAAA 235
             Q++V CS    GC GG     F Y+I  K     GL  E  +PY      C + KE    
Sbjct:   286 QEVVSCSQYAQGCEGG-----FPYLIAGKYAQDFGLVEEDCFPYTGTDSPC-RLKEGCFR 339

Query:   236 ATIGKYEDLPK---GDEHAL--LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECG----- 285
                 +Y  +     G   AL  L+ V + P++V  E     F  Y++GV +   G     
Sbjct:   340 YYSSEYHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYDD-FLHYRKGVYH-HTGLRDPF 397

Query:   286 ---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEA 340
                +  +H V +VG+GT +   G  YW++KNSWG +WGE+GY RI R    C I + A
Sbjct:   398 NPFELTNHAVLLVGYGT-DAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIESIA 454


>DICTYBASE|DDB_G0286015
            symbol:gmsA species:44689 "Dictyostelium discoideum"
            [GO:0005615 "extracellular space" evidence=IDA] [GO:0019953 "sexual
            reproduction" evidence=IEP] [GO:0009897 "external side of plasma
            membrane" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA;ISS] [GO:0000747 "conjugation with cellular
            fusion" evidence=IMP] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005576 "extracellular
            region" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0286015 Pfam:PF00188 GO:GO:0005615
            GenomeReviews:CM000153_GR GO:GO:0009897 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AAFI02000085 GO:GO:0000747
            Gene3D:3.40.33.10 InterPro:IPR001283 InterPro:IPR014044
            SMART:SM00198 SUPFAM:SSF55797 HSSP:P07688 RefSeq:XP_637893.1
            ProteinModelPortal:Q54ME1 MEROPS:C01.A52 EnsemblProtists:DDB0191145
            GeneID:8625403 KEGG:ddi:DDB_G0286015 InParanoid:Q54ME1 OMA:PGIAYEK
            ProtClustDB:CLSZ2429919 Uniprot:Q54ME1
        Length = 448

 Score = 277 (102.6 bits), Expect = 7.1e-24, P = 7.1e-24
 Identities = 81/224 (36%), Positives = 110/224 (49%)

Query:   129 TDVPTS---IDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGG----KLIELSEQ 181
             T  PTS   +DW      T I++QG CGSCWAF++ AA+E    I  G      ++LS Q
Sbjct:   235 TPAPTSTLTVDWTSYQ--TPIRDQGQCGSCWAFASSAALESRYLIKYGTAQKSTLQLSNQ 292

Query:   182 QLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKY 241
               V+C    +GC+GG     F +  +  G+A E D PY+   GT        A      Y
Sbjct:   293 NAVNCIA--SGCNGGWSGNYFNFF-KTPGIAYEKDDPYKAVTGTSCITTSSVARFKYTNY 349

Query:   242 EDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECG-DNCDHGVAVVGFGTA 300
                 K  + ALL  + K PV++ V     AF+ YK G+ N+       +H V +VG+  A
Sbjct:   350 GYTEK-TKAALLAELKKGPVTIAVYVDS-AFQNYKSGIYNSATKYTGINHLVLLVGYDQA 407

Query:   301 EEEDGAKYWLIKNSWGETWGESGYIRILR-DEGLCGIATEASYP 343
                D  K   IKNSWG  WGESGY+RI   ++ L   A  + YP
Sbjct:   408 T--DAYK---IKNSWGSWWGESGYMRITASNDNLAIFAYNSYYP 446


>UNIPROTKB|O97578
            symbol:CTSC "Dipeptidyl peptidase 1" species:9615 "Canis
            lupus familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 EMBL:AF060171 RefSeq:NP_001182763.1
            UniGene:Cfa.28653 ProteinModelPortal:O97578 SMR:O97578
            MEROPS:C01.070 PRIDE:O97578 GeneID:403458 KEGG:cfa:403458
            InParanoid:O97578 NextBio:20816976 Uniprot:O97578
        Length = 435

 Score = 273 (101.2 bits), Expect = 1.7e-23, P = 1.7e-23
 Identities = 80/236 (33%), Positives = 114/236 (48%)

Query:   125 YQNVTDVPTSIDWRE-KGA--VTHIKNQGHCGSCWAFSAVAAVEG-ITQITGGKLIE-LS 179
             ++ ++ +PTS DWR  +G   V+ ++NQ  CGSC+AF++ A +E  I  +T       LS
Sbjct:   198 HEEISRLPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILS 257

Query:   180 EQQLVDCSTDNNGCSGGLMDKAFEYIIENK-----GLATEADYPYQQEQGTCDKQKEKAA 234
              Q++V CS    GC GG     F Y+I  K     GL  EA +PY      C        
Sbjct:   258 PQEIVSCSQYAQGCEGG-----FPYLIAGKYAQDFGLVEEACFPYAGSDSPCKPNDCFRY 312

Query:   235 AATIGKYEDLPKGD-EHAL--LQAVTKQPVSVCVEASGQAFRFYKRGVL-NAECGD--N- 287
              ++   Y     G    AL  L+ V   P++V  E     F  Y++G+  +    D  N 
Sbjct:   313 YSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFH-YQKGIYYHTGLRDPFNP 371

Query:   288 ---CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEA 340
                 +H V +VG+GT +   G  YW++KNSWG  WGE GY RI R    C I + A
Sbjct:   372 FELTNHAVLLVGYGT-DSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIA 426


>UNIPROTKB|F1STR1
            symbol:CTSC "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0004252
            "serine-type endopeptidase activity" evidence=IEA] [GO:0001913 "T
            cell mediated cytotoxicity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 KO:K01275 InterPro:IPR014882
            Pfam:PF08773 OMA:YDDFLHY GO:GO:0001913 EMBL:CU855751
            RefSeq:XP_003129789.1 UniGene:Ssc.6155 Ensembl:ENSSSCT00000016280
            GeneID:100522387 KEGG:ssc:100522387 Uniprot:F1STR1
        Length = 463

 Score = 274 (101.5 bits), Expect = 1.9e-23, P = 1.9e-23
 Identities = 89/299 (29%), Positives = 132/299 (44%)

Query:    65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFK 124
             ++K N ++++  N            E+  LT +E      GYN                 
Sbjct:   167 LYKYNHDFVKAINGIQKSWTATAYMEYETLTLKEMTQRGGGYNQRLPRPKPAPITAEI-- 224

Query:   125 YQNVTDVPTSIDWRE-KGA--VTHIKNQGHCGSCWAFSAVAAVEG-ITQITGGKLIE-LS 179
              +    +P S DWR  +G   VT ++NQ  CGSC++F+++  +E  I  +T       LS
Sbjct:   225 QEKSLHLPASWDWRNVRGTNFVTPVRNQASCGSCYSFASMGMMEARIRILTNNTQTPILS 284

Query:   180 EQQLVDCSTDNNGCSGGLMDKAFEYIIENK-----GLATEADYPYQQEQGTCDKQKEKAA 234
              Q++V CS    GC+GG     F Y+I  K     GL  EA +PY      C   KE   
Sbjct:   285 PQEVVSCSQYAQGCAGG-----FPYLIAGKYAQDFGLVEEACFPYTGTDSPCTV-KEGCF 338

Query:   235 AATIGKYEDLPK---GDEHAL--LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECG---- 285
                  +Y  +     G   AL  L+ V   P++V  E     F  Y++G+ +   G    
Sbjct:   339 RYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDD-FLHYRKGIYH-HTGLRDP 396

Query:   286 ----DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEA 340
                 +  +H V +VG+GT +   G  YW++KNSWG +WGE GY RI R    C I + A
Sbjct:   397 FNPFELTNHAVLLVGYGT-DLASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESIA 454


>UNIPROTKB|F1NWG2
            symbol:CTSC "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 InterPro:IPR014882 Pfam:PF08773
            OMA:YDDFLHY GO:GO:0001913 EMBL:AADN02004805 IPI:IPI00577371
            Ensembl:ENSGALT00000027869 Uniprot:F1NWG2
        Length = 463

 Score = 272 (100.8 bits), Expect = 3.1e-23, P = 3.1e-23
 Identities = 74/232 (31%), Positives = 111/232 (47%)

Query:   126 QNVTDVPTSIDWREKGAVTHI---KNQGHCGSCWAFSAVAAVEGITQITGGKLIE--LSE 180
             + V+ +P S DWR    V ++   +NQ  CGSC+AF+++  +E   +I      +   S 
Sbjct:   226 KKVSGLPESWDWRNVNGVNYVSPVRNQASCGSCYAFASMGMLEARIRILTNNTQKPVFSP 285

Query:   181 QQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK 240
             QQ+V CS  + GC GG         +++ G+  E  +PY  +   C   K         +
Sbjct:   286 QQVVSCSQYSQGCDGGFPYLIAGKYVQDFGVVEEDCFPYTAKDTPC-LFKRSCYHYYTSE 344

Query:   241 YEDLPK--GD-EHAL--LQAVTKQPVSVCVEASGQAFRFYKRGV-----LNAECG--DNC 288
             Y  +    G    AL  L+ V   P++V  E     F FYK G+     L  E    +  
Sbjct:   345 YHYVGGFYGACNEALMKLELVLSGPMAVAFEVYND-FMFYKEGIYHHTGLKDEFNPFELT 403

Query:   289 DHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEA 340
             +H V +VG+G  + E G K+W++KNSWG +WGE GY RI R    C I + A
Sbjct:   404 NHAVLLVGYGK-DPESGEKFWIVKNSWGTSWGEDGYFRIRRGTDECAIESIA 454


>UNIPROTKB|F1PSK8
            symbol:F1PSK8 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 InterPro:IPR014882
            Pfam:PF08773 EMBL:AAEX03012741 Ensembl:ENSCAFT00000007054
            Uniprot:F1PSK8
        Length = 405

 Score = 264 (98.0 bits), Expect = 1.1e-22, P = 1.1e-22
 Identities = 80/237 (33%), Positives = 114/237 (48%)

Query:   125 YQNVTDVPTSIDWRE-KGA--VTHIKNQG-HCGSCWAFSAVAAVEG-ITQITGGKLIE-L 178
             ++ ++ +PTS DWR  +G   V+ ++NQ   CGSC+AF++ A +E  I  +T       L
Sbjct:   167 HEEISRLPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNNTQTPIL 226

Query:   179 SEQQLVDCSTDNNGCSGGLMDKAFEYIIENK-----GLATEADYPYQQEQGTCDKQKEKA 233
             S Q++V CS    GC GG     F Y+I  K     GL  EA +PY      C       
Sbjct:   227 SPQEIVSCSQYAQGCEGG-----FPYLIAGKYAQDFGLVEEACFPYAGSDSPCKPNDCFR 281

Query:   234 AAATIGKYEDLPKGD-EHAL--LQAVTKQPVSVCVEASGQAFRFYKRGVL-NAECGD--N 287
               ++   Y     G    AL  L+ V   P++V  E     F  Y++G+  +    D  N
Sbjct:   282 YYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFH-YQKGIYYHTGLRDPFN 340

Query:   288 ----CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEA 340
                  +H V +VG+GT +   G  YW++KNSWG  WGE GY RI R    C I + A
Sbjct:   341 PFELTNHAVLLVGYGT-DSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIA 396


>UNIPROTKB|J9P219
            symbol:J9P219 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 InterPro:IPR014882
            Pfam:PF08773 OMA:YDDFLHY EMBL:AAEX03012741
            Ensembl:ENSCAFT00000050015 Uniprot:J9P219
        Length = 406

 Score = 264 (98.0 bits), Expect = 1.1e-22, P = 1.1e-22
 Identities = 80/237 (33%), Positives = 114/237 (48%)

Query:   125 YQNVTDVPTSIDWRE-KGA--VTHIKNQG-HCGSCWAFSAVAAVEG-ITQITGGKLIE-L 178
             ++ ++ +PTS DWR  +G   V+ ++NQ   CGSC+AF++ A +E  I  +T       L
Sbjct:   168 HEEISRLPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNNTQTPIL 227

Query:   179 SEQQLVDCSTDNNGCSGGLMDKAFEYIIENK-----GLATEADYPYQQEQGTCDKQKEKA 233
             S Q++V CS    GC GG     F Y+I  K     GL  EA +PY      C       
Sbjct:   228 SPQEIVSCSQYAQGCEGG-----FPYLIAGKYAQDFGLVEEACFPYAGSDSPCKPNDCFR 282

Query:   234 AAATIGKYEDLPKGD-EHAL--LQAVTKQPVSVCVEASGQAFRFYKRGVL-NAECGD--N 287
               ++   Y     G    AL  L+ V   P++V  E     F  Y++G+  +    D  N
Sbjct:   283 YYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFH-YQKGIYYHTGLRDPFN 341

Query:   288 ----CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEA 340
                  +H V +VG+GT +   G  YW++KNSWG  WGE GY RI R    C I + A
Sbjct:   342 PFELTNHAVLLVGYGT-DSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIA 397


>MGI|MGI:109553
            symbol:Ctsc "cathepsin C" species:10090 "Mus musculus"
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IGI]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IMP]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0005783 "endoplasmic
            reticulum" evidence=ISO] [GO:0005794 "Golgi apparatus"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=ISO;IMP]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0010033
            "response to organic substance" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0031404 "chloride ion
            binding" evidence=ISO] [GO:0042802 "identical protein binding"
            evidence=ISO] [GO:0043621 "protein self-association" evidence=ISO]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 MGI:MGI:109553 GO:GO:0005783
            GO:GO:0005794 GO:GO:0007568 GO:GO:0010033 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004252 GO:GO:0005764 GO:GO:0031404 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 MEROPS:C01.070 OMA:YDDFLHY
            GO:GO:0001913 EMBL:U89269 EMBL:U74683 EMBL:BC067063 IPI:IPI00130015
            RefSeq:NP_034112.3 UniGene:Mm.322945 ProteinModelPortal:P97821
            SMR:P97821 STRING:P97821 PhosphoSite:P97821 PaxDb:P97821
            PRIDE:P97821 Ensembl:ENSMUST00000032779 GeneID:13032 KEGG:mmu:13032
            InParanoid:P97821 BindingDB:P97821 ChEMBL:CHEMBL3454 ChiTaRS:CTSC
            NextBio:282904 Bgee:P97821 CleanEx:MM_CTSC Genevestigator:P97821
            Uniprot:P97821
        Length = 462

 Score = 266 (98.7 bits), Expect = 1.5e-22, P = 1.5e-22
 Identities = 79/237 (33%), Positives = 114/237 (48%)

Query:   126 QNVTDVPTSIDWRE-KGA--VTHIKNQGHCGSCWAFSAVAAVEG-ITQITGGKLIE-LSE 180
             Q + ++P S DWR  +G   V+ ++NQ  CGSC++F+++  +E  I  +T       LS 
Sbjct:   225 QQILNLPESWDWRNVQGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNNSQTPILSP 284

Query:   181 QQLVDCSTDNNGCSGGLMDKAFEYIIENK-----GLATEADYPYQQEQGTCDKQKEKAAA 235
             Q++V CS    GC GG     F Y+I  K     G+  E+ +PY  +   C K +E    
Sbjct:   285 QEVVSCSPYAQGCDGG-----FPYLIAGKYAQDFGVVEESCFPYTAKDSPC-KPRENCLR 338

Query:   236 ATIGKYEDLPK---GDEHAL--LQAVTKQPVSVCVEASGQAFRFYKRGVLN-AECGD--N 287
                  Y  +     G   AL  L+ V   P++V  E     F  Y  G+ +     D  N
Sbjct:   339 YYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDD-FLHYHSGIYHHTGLSDPFN 397

Query:   288 ----CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEA 340
                  +H V +VG+G  +   G +YW+IKNSWG  WGESGY RI R    C I + A
Sbjct:   398 PFELTNHAVLLVGYGR-DPVTGIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESIA 453


>RGD|2445
            symbol:Ctsc "cathepsin C" species:10116 "Rattus norvegicus"
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA;ISO]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA;ISO]
            [GO:0005764 "lysosome" evidence=IDA;TAS] [GO:0005783 "endoplasmic
            reticulum" evidence=IDA] [GO:0005794 "Golgi apparatus" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IEP;ISO;TAS] [GO:0007568 "aging"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISO] [GO:0010033 "response to organic substance"
            evidence=IDA] [GO:0031404 "chloride ion binding" evidence=IDA]
            [GO:0042802 "identical protein binding" evidence=IDA] [GO:0043621
            "protein self-association" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:2445 GO:GO:0005783 GO:GO:0005794 GO:GO:0007568
            GO:GO:0010033 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0004252 GO:GO:0005764 GO:GO:0043621
            GO:GO:0042802 GO:GO:0031404 GO:GO:0004197
            GeneTree:ENSGT00560000076599 CTD:1075 HOGENOM:HOG000068022
            HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ InterPro:IPR014882
            Pfam:PF08773 MEROPS:C01.070 OMA:YDDFLHY GO:GO:0001913 EMBL:D90404
            IPI:IPI00193765 PIR:A41158 RefSeq:NP_058793.1 UniGene:Rn.203177
            PDB:1JQP PDBsum:1JQP ProteinModelPortal:P80067 SMR:P80067
            STRING:P80067 PhosphoSite:P80067 PRIDE:P80067
            Ensembl:ENSRNOT00000022342 GeneID:25423 KEGG:rno:25423
            InParanoid:P80067 SABIO-RK:P80067 EvolutionaryTrace:P80067
            NextBio:606591 ArrayExpress:P80067 Genevestigator:P80067
            GermOnline:ENSRNOG00000016496 Uniprot:P80067
        Length = 462

 Score = 261 (96.9 bits), Expect = 5.3e-22, P = 5.3e-22
 Identities = 79/237 (33%), Positives = 111/237 (46%)

Query:   126 QNVTDVPTSIDWRE-KGA--VTHIKNQGHCGSCWAFSAVAAVEG-ITQITGGKLIE-LSE 180
             Q +  +P S DWR  +G   V+ ++NQ  CGSC++F+++  +E  I  +T       LS 
Sbjct:   225 QQILSLPESWDWRNVRGINFVSPVRNQESCGSCYSFASLGMLEARIRILTNNSQTPILSP 284

Query:   181 QQLVDCSTDNNGCSGGLMDKAFEYIIENK-----GLATEADYPYQQEQGTCDKQKEKAAA 235
             Q++V CS    GC GG     F Y+I  K     G+  E  +PY      C K KE    
Sbjct:   285 QEVVSCSPYAQGCDGG-----FPYLIAGKYAQDFGVVEENCFPYTATDAPC-KPKENCLR 338

Query:   236 ATIGKYEDLPK---GDEHAL--LQAVTKQPVSVCVEASGQAFRFYKRGVLN-AECGD--N 287
                 +Y  +     G   AL  L+ V   P++V  E     F  Y  G+ +     D  N
Sbjct:   339 YYSSEYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDD-FLHYHSGIYHHTGLSDPFN 397

Query:   288 ----CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEA 340
                  +H V +VG+G  +   G  YW++KNSWG  WGESGY RI R    C I + A
Sbjct:   398 PFELTNHAVLLVGYGK-DPVTGLDYWIVKNSWGSQWGESGYFRIRRGTDECAIESIA 453


>UNIPROTKB|J9NSE7
            symbol:CTSC "Dipeptidyl peptidase 1" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 InterPro:IPR014882 Pfam:PF08773
            EMBL:AAEX03017125 Ensembl:ENSCAFT00000014269 OMA:INGQICH
            Uniprot:J9NSE7
        Length = 458

 Score = 258 (95.9 bits), Expect = 1.1e-21, P = 1.1e-21
 Identities = 77/236 (32%), Positives = 112/236 (47%)

Query:   125 YQNVTDVPTSIDWRE-KGA--VTHIKNQGHCGSCWAFSAVAAVEG-ITQITGGKLIE-LS 179
             ++ ++ +PTS DWR  +G   V+ ++NQ  CGSC+AF++   +E  I  +T       LS
Sbjct:   221 HEEISRLPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTVMLEARIRILTNNTQTPILS 280

Query:   180 EQQLVDCSTDNNGCSGGLMDKAFEYIIENK-----GLATEADYPYQQEQGTCDKQKEKAA 234
              Q++V CS    GC GG     F Y+I  K     GL  EA + Y      C        
Sbjct:   281 PQEIVSCSQYAQGCEGG-----FPYLIAGKYAQDFGLVDEACFSYAGSDSPCKPNDCFHY 335

Query:   235 AATIGKYEDLPKGD-EHAL--LQAVTKQPVSVCVEASGQAFRFYKRGVL-NAECGD--N- 287
              ++   Y     G    AL  L+ V   P++V  E     F  Y++G+  +    D  N 
Sbjct:   336 YSSEYHYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFH-YQKGIYYHTGLRDPINP 394

Query:   288 ---CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEA 340
                 +H V +VG+GT +   G  YW++KNSWG  WGE GY +I R    C I + A
Sbjct:   395 FELTNHAVLLVGYGT-DSASGMDYWIVKNSWGSRWGEDGYFQICRGTDECAIESIA 449


>ZFIN|ZDB-GENE-030619-9
            symbol:ctsc "cathepsin C" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030619-9 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 MEROPS:C01.070 HSSP:P43235
            EMBL:BC064286 IPI:IPI00486570 RefSeq:NP_999887.1 UniGene:Dr.32463
            ProteinModelPortal:Q6P2V1 SMR:Q6P2V1 PRIDE:Q6P2V1 GeneID:368704
            KEGG:dre:368704 InParanoid:Q6P2V1 NextBio:20813127
            ArrayExpress:Q6P2V1 Bgee:Q6P2V1 Uniprot:Q6P2V1
        Length = 455

 Score = 257 (95.5 bits), Expect = 1.4e-21, P = 1.4e-21
 Identities = 71/226 (31%), Positives = 108/226 (47%)

Query:   131 VPTSIDWRE-KGA--VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIE--LSEQQLVD 185
             +P   DWR   G   V+ ++NQ  CGSC++F+ +  +E   +I      +   S QQ+V 
Sbjct:   224 LPQHWDWRNVNGVNFVSPVRNQAQCGSCYSFATMGMLEARVRIQTNNTQQPVFSPQQVVS 283

Query:   186 CSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKE--KAAAATIGKYED 243
             CS  + GC GG      +YI ++ G+  E  +PY      C+   +  K  A+       
Sbjct:   284 CSQYSQGCDGGFPYLIGKYI-QDFGIVEEDCFPYTGSDSPCNLPAKCTKYYASDYHYVGG 342

Query:   244 LPKG-DEHAL-LQAVTKQPVSVCVEASGQAFRFYKRGVLN---AECGDN----CDHGVAV 294
                G  E A+ L+ V   P+ V +E     F  YK G+ +       +N     +H V +
Sbjct:   343 FYGGCSESAMMLELVKNGPMGVALEVYPD-FMNYKEGIYHHTGLRDANNPFELTNHAVLL 401

Query:   295 VGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEA 340
             VG+G   +  G KYW++KNSWG  WGE+G+ RI R    C I + A
Sbjct:   402 VGYGQCHKT-GEKYWIVKNSWGSGWGENGFFRIRRGTDECAIESIA 446


>WB|WBGene00000788
            symbol:cpz-1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] [GO:0009792 "embryo development ending in
            birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0010171 "body morphogenesis" evidence=IMP]
            [GO:0018996 "molting cycle, collagen and cuticulin-based cuticle"
            evidence=IMP] [GO:0031012 "extracellular matrix" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0009792 GO:GO:0040007 GO:GO:0002119
            GO:GO:0018996 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0010171 GO:GO:0031012
            GeneTree:ENSGT00560000076599 KO:K08568 OMA:QCGTCTE EMBL:FO081275
            EMBL:BK001409 PIR:T29872 RefSeq:NP_491023.2 HSSP:Q9UBR2
            ProteinModelPortal:G5EGP8 SMR:G5EGP8 IntAct:G5EGP8 MEROPS:C01.A38
            EnsemblMetazoa:F32B5.8 GeneID:171829 KEGG:cel:CELE_F32B5.8
            CTD:171829 WormBase:F32B5.8 NextBio:872879 Uniprot:G5EGP8
        Length = 306

 Score = 246 (91.7 bits), Expect = 6.3e-21, P = 6.3e-21
 Identities = 65/223 (29%), Positives = 103/223 (46%)

Query:   130 DVPTSIDWREKGAVTHI---KNQG---HCGSCWAFSAVAAVE---GITQITGGKLIELSE 180
             D+P + DWR+   + +    +NQ    +CGSCWAF A +A+     I +        LS 
Sbjct:    64 DLPKTWDWRDANGINYASADRNQHIPQYCGSCWAFGATSALADRINIKRKNAWPQAYLSV 123

Query:   181 QQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAA----- 235
             Q+++DCS       GG     ++Y  E+ G+  E    YQ   G CD      +      
Sbjct:   124 QEVIDCSGAGTCVMGGEPGGVYKYAHEH-GIPHETCNNYQARDGKCDPYNRCGSCWPGEC 182

Query:   236 ATIGKYEDLPKGDEHALLQAVTKQPVSV-------CVEASGQAFRFYKRGVLNAECGDNC 288
              +I  Y  L K  E+  +    K    +       C  A+ +AF  Y  G+      ++ 
Sbjct:   183 FSIKNYT-LYKVSEYGTVHGYEKMKAEIYHKGPIACGIAATKAFETYAGGIYKEVTDEDI 241

Query:   289 DHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE 331
             DH ++V G+G  + E G +YW+ +NSWGE WGE G+ +I+  +
Sbjct:   242 DHIISVHGWGV-DHESGVEYWIGRNSWGEPWGEHGWFKIVTSQ 283


>DICTYBASE|DDB_G0288221
            symbol:DDB_G0288221 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0288221 Pfam:PF00188 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 PANTHER:PTHR12411 EMBL:AAFI02000109 Gene3D:3.40.33.10
            InterPro:IPR001283 InterPro:IPR014044 SMART:SM00198 SUPFAM:SSF55797
            MEROPS:C01.A52 ProtClustDB:CLSZ2429919 RefSeq:XP_636852.1
            ProteinModelPortal:Q54J84 EnsemblProtists:DDB0187839 GeneID:8626520
            KEGG:ddi:DDB_G0288221 InParanoid:Q54J84 Uniprot:Q54J84
        Length = 395

 Score = 242 (90.2 bits), Expect = 3.3e-20, P = 3.3e-20
 Identities = 67/215 (31%), Positives = 104/215 (48%)

Query:   134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGG----KLIELSEQQLVDCSTD 189
             S+DW +    T +++QG C SCW F ++AA+E    I  G      + LS Q  ++C T 
Sbjct:   191 SVDWSDYQ--TPVRDQGECKSCWVFGSLAALESRYLIKNGVSEKSTLHLSAQNAMNCIT- 247

Query:   190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQQ-EQGTCDKQKEKAAAATIGKYEDLPKGD 248
              +GC  G     F+Y  E+ G+A E DYPY       C     K   +  G Y+ + +  
Sbjct:   248 -SGCESGWPANVFDYF-ESSGIAFEKDYPYDAIGSDNCTSSSNKFEYS--G-YDSV-ENT 301

Query:   249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNA-ECGDNCDHGVAVVGFGTAEEEDGAK 307
             + +L+Q +   P+++ +  S  AF+ Y  G+ ++ E   + +H V +VG+    +     
Sbjct:   302 KDSLIQELKNGPITIALY-SDTAFQSYAGGIYDSVEEYKDVNHIVLLVGYDKPTDS---- 356

Query:   308 YWLIKNSWGETWGESGYIRILRDEGLCGIATEASY 342
              W IKNS G  WGE GY RI       GI    S+
Sbjct:   357 -WKIKNSLGTKWGELGYARITASNDKLGILLYNSF 390


>UNIPROTKB|E2QV47
            symbol:CTSH "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0097208 "alveolar lamellar body"
            evidence=IEA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0070371 "ERK1 and ERK2 cascade"
            evidence=IEA] [GO:0070324 "thyroid hormone binding" evidence=IEA]
            [GO:0060448 "dichotomous subdivision of terminal units involved in
            lung branching" evidence=IEA] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IEA] [GO:0043129 "surfactant homeostasis"
            evidence=IEA] [GO:0043066 "negative regulation of apoptotic
            process" evidence=IEA] [GO:0033619 "membrane protein proteolysis"
            evidence=IEA] [GO:0032526 "response to retinoic acid" evidence=IEA]
            [GO:0031648 "protein destabilization" evidence=IEA] [GO:0031638
            "zymogen activation" evidence=IEA] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=IEA] [GO:0016505
            "apoptotic protease activator activity" evidence=IEA] [GO:0010815
            "bradykinin catabolic process" evidence=IEA] [GO:0010813
            "neuropeptide catabolic process" evidence=IEA] [GO:0010634
            "positive regulation of epithelial cell migration" evidence=IEA]
            [GO:0010628 "positive regulation of gene expression" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=IEA] [GO:0005829 "cytosol" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0004252 "serine-type endopeptidase activity"
            evidence=IEA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IEA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 GO:GO:0016505 GO:GO:0010634
            GO:GO:0004197 GO:GO:0042599 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 Ensembl:ENSCAFT00000036196 Uniprot:E2QV47
        Length = 136

 Score = 239 (89.2 bits), Expect = 3.5e-20, P = 3.5e-20
 Identities = 50/138 (36%), Positives = 77/138 (55%)

Query:   214 EADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAF 272
             E  YPY+ + G C  Q  KA A  +    ++   DE A+++AV    PVS   E +   F
Sbjct:     3 EDSYPYKGQDGDCKYQPSKAIAF-VKDVANITINDEQAMVEAVALYNPVSFAFEVTSD-F 60

Query:   273 RFYKRGVLNA-ECG---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRIL 328
               Y++G+ ++  C    D  +H V  VG+G   E++G  YW++KNSWG  WG +GY  + 
Sbjct:    61 MMYRKGIYSSTSCHKTPDKVNHAVLAVGYG---EQNGIPYWIVKNSWGPQWGMNGYFLME 117

Query:   329 RDEGLCGIATEASYPVAM 346
             R + +CG+A  ASYP+ +
Sbjct:   118 RGKNMCGLAACASYPIPL 135


>TAIR|locus:2133402
            symbol:AT4G01610 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0005829 "cytosol" evidence=IDA] [GO:0000902 "cell
            morphogenesis" evidence=RCA] [GO:0006635 "fatty acid
            beta-oxidation" evidence=RCA] [GO:0010162 "seed dormancy process"
            evidence=RCA] [GO:0016049 "cell growth" evidence=RCA] [GO:0048193
            "Golgi vesicle transport" evidence=RCA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0005773 EMBL:CP002687
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 eggNOG:NOG315657
            HOGENOM:HOG000241341 KO:K01363 PANTHER:PTHR12411:SF16 OMA:DAIPDHF
            HSSP:P07858 ProtClustDB:CLSN2687619 EMBL:AF370193 EMBL:AY065167
            EMBL:AY114015 EMBL:AY086034 EMBL:AF083797 EMBL:BT001190
            EMBL:AK175280 EMBL:AK175481 EMBL:AK175539 EMBL:AK176165
            EMBL:AK176244 EMBL:AK176281 EMBL:AK176330 EMBL:AK176416
            EMBL:AK176433 EMBL:AK176487 EMBL:AK221398 EMBL:AK230235
            IPI:IPI00530811 RefSeq:NP_567215.1 UniGene:At.24471
            ProteinModelPortal:Q94K85 SMR:Q94K85 STRING:Q94K85 MEROPS:C01.144
            PaxDb:Q94K85 PRIDE:Q94K85 EnsemblPlants:AT4G01610.1 GeneID:826792
            KEGG:ath:AT4G01610 TAIR:At4g01610 InParanoid:Q94K85
            PhylomeDB:Q94K85 Genevestigator:Q94K85 Uniprot:Q94K85
        Length = 359

 Score = 153 (58.9 bits), Expect = 5.7e-20, Sum P(2) = 5.7e-20
 Identities = 48/160 (30%), Positives = 69/160 (43%)

Query:    71 EYIEKANKEGNRTYKLGTNE-FSDLTNEEFRASYTGYNXXXXXXXXXXXXXX---TFKYQ 126
             E ++K N+  N  +K   N+ FS+ T  EF+    G                   + K  
Sbjct:    46 EIVKKVNENPNAGWKAAINDRFSNATVAEFKR-LLGVKPTPKKHFLGVPIVSHDPSLKLP 104

Query:   127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
                D  T+  W +  ++ +I +QGHCGSCWAF AV ++     I  G  I LS   L+ C
Sbjct:   105 KAFDARTA--WPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLAC 162

Query:   187 S--TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQG 224
                   +GC GG    A++Y     G+ TE   PY    G
Sbjct:   163 CGFRCGDGCDGGYPIAAWQYF-SYSGVVTEECDPYFDNTG 201

 Score = 149 (57.5 bits), Expect = 5.7e-20, Sum P(2) = 5.7e-20
 Identities = 38/102 (37%), Positives = 48/102 (47%)

Query:   246 KGDEHALLQAVTKQ-PVSVCVEASGQAFRFYKRGVLNAECGDNCD-HGVAVVGFGTAEEE 303
             K +   ++  V K  PV V      + F  YK GV     G N   H V ++G+GT+ E 
Sbjct:   241 KSNPQDIMAEVYKNGPVEVSFTVY-EDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSE- 298

Query:   304 DGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEASYPVA 345
              G  YWL+ N W   WG+ GY  I R    CGI  E   PVA
Sbjct:   299 -GEDYWLMANQWNRGWGDDGYFMIRRGTNECGIEDE---PVA 336


>DICTYBASE|DDB_G0288563
            symbol:DDB_G0288563 species:44689 "Dictyostelium
            discoideum" [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0288563
            GO:GO:0005615 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197
            EMBL:AAFI02000117 PANTHER:PTHR12411:SF16 RefSeq:XP_636643.1
            MEROPS:C01.A58 PRIDE:Q54IS1 EnsemblProtists:DDB0187993
            GeneID:8626689 KEGG:ddi:DDB_G0288563 InParanoid:Q54IS1 OMA:AWEYMEL
            Uniprot:Q54IS1
        Length = 314

 Score = 236 (88.1 bits), Expect = 7.2e-20, P = 7.2e-20
 Identities = 70/226 (30%), Positives = 106/226 (46%)

Query:   131 VPTSIDWREK--GAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIE---LSEQQLVD 185
             +PTS D R +    +  I NQ  CGSCWAFS+   +     I          LS Q LV 
Sbjct:    88 IPTSFDSRVQWPDCIHPILNQEQCGSCWAFSSSEVLSDRLCIASNNKTNPGALSPQTLVA 147

Query:   186 CST-DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDK-QKEKAAAATIGKYED 243
             C    N+GCSGG+   A+EY+ E KGL T++  PY    GT    Q+  + +     Y  
Sbjct:   148 CDVYGNDGCSGGIPQLAWEYM-ELKGLPTDSCVPYTAGNGTVYSCQRSCSDSEDYSLYRA 206

Query:   244 LPKGDEH-ALLQAVTKQ-----PVSVCVEASGQAFRFYKRGVLNAECGDNC--DHGVAVV 295
              P   +  + +Q + +      P+   +E   + F  Y  GV     G +    H + +V
Sbjct:   207 KPFTLKTCSSVQCIQENILAYGPIVGTMEVY-EDFMSYSSGVYVMTPGSSLLGGHAIKIV 265

Query:   296 GFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEAS 341
             G+G  ++     YW++ NSWG  WG+ G+  I  +   C I+++AS
Sbjct:   266 GWGF-DQTSQLNYWIVANSWGADWGQQGFFFISMET--CSISSDAS 308


>ZFIN|ZDB-GENE-070323-1
            symbol:ctsbb "cathepsin B, b" species:7955 "Danio
            rerio" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-070323-1 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197
            GeneTree:ENSGT00560000076599 PANTHER:PTHR12411:SF16 OMA:CCGFLCG
            EMBL:CU207296 EMBL:CABZ01037785 IPI:IPI00877452
            Ensembl:ENSDART00000097263 Bgee:F1QZT5 Uniprot:F1QZT5
        Length = 326

 Score = 162 (62.1 bits), Expect = 9.7e-20, Sum P(2) = 9.7e-20
 Identities = 34/101 (33%), Positives = 50/101 (49%)

Query:   240 KYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCD-HGVAVVGFG 298
             K  ++P   +  + +  T  PV        + F  YK GV     G     H V ++G+G
Sbjct:   222 KVYNVPSDQQQIMTELYTNGPVEAAFTVY-EDFPLYKSGVYQHLTGSALGGHAVKILGWG 280

Query:   299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATE 339
                EE+G  +WL+ NSW   WG++GY +ILR    CGI +E
Sbjct:   281 ---EENGTPFWLVANSWNSDWGDNGYFKILRGHDECGIESE 318

 Score = 134 (52.2 bits), Expect = 9.7e-20, Sum P(2) = 9.7e-20
 Identities = 36/103 (34%), Positives = 53/103 (51%)

Query:   122 TFKYQNVTDVPTSIDWREKG----AVTHIKNQGHCGSCWAFSAVAAV-EGITQITGGKLI 176
             T K+     +P S D R++      +  I++QG CGSCWAF AV ++ + I   + GK  
Sbjct:    66 TVKHSTNVKLPDSFDLRDQWPNCKTLNQIRDQGSCGSCWAFGAVESISDRICIHSKGKQS 125

Query:   177 -ELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIENKGLATEADY 217
              E+S + L+ C      GCSGG   +A++Y     GL T   Y
Sbjct:   126 PEISAEDLLSCCDQCGFGCSGGFPAEAWDYW-RRSGLVTGGLY 167


>WB|WBGene00013072
            symbol:Y51A2D.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            EMBL:AL021497 RefSeq:NP_001256811.1 ProteinModelPortal:O62484
            SMR:O62484 MEROPS:C01.A37 EnsemblMetazoa:Y51A2D.1 GeneID:180204
            KEGG:cel:CELE_Y51A2D.1 UCSC:Y51A2D.1 CTD:180204 WormBase:Y51A2D.1a
            HOGENOM:HOG000019851 NextBio:908416 Uniprot:O62484
        Length = 314

 Score = 159 (61.0 bits), Expect = 1.4e-19, Sum P(2) = 1.4e-19
 Identities = 38/103 (36%), Positives = 59/103 (57%)

Query:   245 PKGDEHALLQAVT--KQPVSVCVEASGQAFRFYKRGVLNAECGDNCD------HGVAVVG 296
             P+  E  +++ +   K PV+V   A+G AF  YK GVL  E   +CD      H  A+VG
Sbjct:   201 PENAESEIIEILNTWKTPVAVYF-AAGTAFLQYKSGVLVTE---DCDLAGTVWHAGAIVG 256

Query:   297 FGTAEEEDGA--KYWLIKNSWGET-WGESGYIRILRDEGLCGI 336
             +G   +  G   ++W++KNSWG + WG  GY++++R +  CGI
Sbjct:   257 YGEENDLRGRSQRFWIMKNSWGVSGWGTGGYVKLIRGKNWCGI 299

 Score = 135 (52.6 bits), Expect = 1.4e-19, Sum P(2) = 1.4e-19
 Identities = 40/155 (25%), Positives = 63/155 (40%)

Query:    34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK---EGNRTYKLGTNE 90
             H   + ++  ++  +  RTYK E E  +RL  F ++   + + NK   +  R      N+
Sbjct:    36 HPEKVYQEFVEFKKKFSRTYKSEAENQLRLQNFVKSRNNVVRLNKNAQKAGRNSNFAVNQ 95

Query:    91 FSDLTNEEFRASYTGY------NXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWREKGA-- 142
             FSDLT  E     + +      N                K QN ++   + D R +    
Sbjct:    96 FSDLTTSELHQRLSRFPPNLTENSVFHKNFKKLLGKTRTKRQN-SEFARNFDLRSQKVNG 154

Query:   143 ---VTHIKNQGHCGSCWAFSAVAAVEGITQITGGK 174
                V  IKNQG C  CW F+  A +E I  +  G+
Sbjct:   155 RYIVGPIKNQGQCACCWGFAVTAMLETIYAVNVGR 189


>MGI|MGI:88561
            symbol:Ctsb "cathepsin B" species:10090 "Mus musculus"
            [GO:0004175 "endopeptidase activity" evidence=ISO] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISO] [GO:0005576
            "extracellular region" evidence=ISO] [GO:0005615 "extracellular
            space" evidence=ISO] [GO:0005737 "cytoplasm" evidence=ISO]
            [GO:0005739 "mitochondrion" evidence=ISO;IDA] [GO:0005764
            "lysosome" evidence=ISO;IDA] [GO:0005901 "caveola" evidence=ISO]
            [GO:0006508 "proteolysis" evidence=ISO] [GO:0008233 "peptidase
            activity" evidence=ISO] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISO] [GO:0009897 "external side of plasma
            membrane" evidence=ISO] [GO:0009986 "cell surface" evidence=ISO]
            [GO:0016324 "apical plasma membrane" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0030984 "kininogen binding"
            evidence=ISO] [GO:0032403 "protein complex binding" evidence=ISO]
            [GO:0042277 "peptide binding" evidence=ISO] [GO:0042383
            "sarcolemma" evidence=ISO] [GO:0043621 "protein self-association"
            evidence=ISO] [GO:0048471 "perinuclear region of cytoplasm"
            evidence=ISO] [GO:0050790 "regulation of catalytic activity"
            evidence=IEA] [GO:0060548 "negative regulation of cell death"
            evidence=ISO] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 MGI:MGI:88561
            GO:GO:0005739 GO:GO:0042470 GO:GO:0048471 GO:GO:0005615
            GO:GO:0009612 GO:GO:0009611 GO:GO:0009897 GO:GO:0045471
            GO:GO:0016324 GO:GO:0009749 GO:GO:0006914 GO:GO:0043434
            eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0042383 GO:GO:0014070
            GO:GO:0042277 GO:GO:0060548 GO:GO:0005901 GO:GO:0014075
            GO:GO:0004197 GO:GO:0070670 GO:GO:0007519 MEROPS:C01.060 CTD:1508
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 KO:K01363 OMA:GYPSGAW OrthoDB:EOG4K6G4C
            BRENDA:3.4.22.1 GO:GO:0097067 PANTHER:PTHR12411:SF16 ChiTaRS:CTSB
            EMBL:M65270 EMBL:M65263 EMBL:M65264 EMBL:M65265 EMBL:M65266
            EMBL:M65267 EMBL:M65268 EMBL:M65269 EMBL:M14222 EMBL:X54966
            EMBL:S69034 EMBL:AK083393 EMBL:AK147192 EMBL:AK149884 EMBL:AK151790
            EMBL:AK167361 EMBL:BC006656 IPI:IPI00113517 PIR:A38458
            RefSeq:NP_031824.1 UniGene:Mm.236553 UniGene:Mm.489070
            ProteinModelPortal:P10605 SMR:P10605 IntAct:P10605 STRING:P10605
            PhosphoSite:P10605 SWISS-2DPAGE:P10605 PaxDb:P10605 PRIDE:P10605
            Ensembl:ENSMUST00000006235 GeneID:13030 KEGG:mmu:13030
            UCSC:uc007uhh.1 InParanoid:P10605 BioCyc:MetaCyc:MONOMER-14810
            BindingDB:P10605 ChEMBL:CHEMBL5187 NextBio:282900 Bgee:P10605
            CleanEx:MM_CTSB Genevestigator:P10605 GermOnline:ENSMUSG00000021939
            Uniprot:P10605
        Length = 339

 Score = 155 (59.6 bits), Expect = 2.1e-19, Sum P(2) = 2.1e-19
 Identities = 30/69 (43%), Positives = 40/69 (57%)

Query:   272 FRFYKRGVLNAECGDNCD-HGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
             F  YK GV   E GD    H + ++G+G    E+G  YWL  NSW   WG++G+ +ILR 
Sbjct:   259 FLTYKSGVYKHEAGDMMGGHAIRILGWGV---ENGVPYWLAANSWNLDWGDNGFFKILRG 315

Query:   331 EGLCGIATE 339
             E  CGI +E
Sbjct:   316 ENHCGIESE 324

 Score = 140 (54.3 bits), Expect = 2.1e-19, Sum P(2) = 2.1e-19
 Identities = 35/96 (36%), Positives = 55/96 (57%)

Query:   130 DVPTSIDWREKGA----VTHIKNQGHCGSCWAFSAVAAVEGITQI-TGGKL-IELSEQQL 183
             D+P + D RE+ +    +  I++QG CGSCWAF AV A+   T I T G++ +E+S + L
Sbjct:    79 DLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDL 138

Query:   184 VDCS--TDNNGCSGGLMDKAFEYIIENKGLATEADY 217
             + C      +GC+GG    A+ +  + KGL +   Y
Sbjct:   139 LTCCGIQCGDGCNGGYPSGAWSFWTK-KGLVSGGVY 173


>FB|FBgn0033873
            symbol:CG6337 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 EMBL:AE013599
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 HSSP:P80067 EMBL:AY084123
            RefSeq:NP_610905.1 UniGene:Dm.5230 SMR:Q7JYA0 IntAct:Q7JYA0
            EnsemblMetazoa:FBtr0087646 GeneID:36530 KEGG:dme:Dmel_CG6337
            UCSC:CG6337-RA FlyBase:FBgn0033873 eggNOG:NOG310593
            InParanoid:Q7JYA0 OMA:NRTTYRE OrthoDB:EOG4MCVFZ GenomeRNAi:36530
            NextBio:799041 Uniprot:Q7JYA0
        Length = 340

 Score = 233 (87.1 bits), Expect = 3.2e-19, P = 3.2e-19
 Identities = 83/299 (27%), Positives = 124/299 (41%)

Query:    65 IFKQNLEYIEKANKEGNRT-YKLGTNEFSDLTNEEFRASYT-GYNXXXXXXXXX-XXXXX 121
             I+ +N      A  + NRT Y+   N+FSD+   +F A      N               
Sbjct:    53 IYNRNQVAQHNAQADRNRTTYREAVNQFSDIRLIQFAALLPKAVNTVTSAASDPPASQAA 112

Query:   122 TFKYQNVTDVPTSIDWREKGAVTHIKNQG-HCGSCWAFSAVAAVEGITQI-TGGKL-IEL 178
             +  +  +TD          G    +++QG +C S WA++   AVE +  + T   L   L
Sbjct:   113 SASFDIITDF---------GLTVAVEDQGVNCSSSWAYATAKAVEIMNAVQTANPLPSSL 163

Query:   179 SEQQLVDCSTDNNGCSGGLMDKAFEYIIE--NKGLATEADYPYQQE---QGTCDKQKEKA 233
             S QQL+DC+    GCS      A  Y+ +  +  L  E DYP        G C      +
Sbjct:   164 SAQQLLDCAGMGTGCSTQTPLAALNYLTQLTDAYLYPEVDYPNNNSLKTPGMCQPPSSVS 223

Query:   234 AAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRF--YKRGVLNAEC----GDN 287
                 +  Y  +   D+ A+++ V+     V VE +   F F  Y  GV   E        
Sbjct:   224 VGVKLAGYSTVADNDDAAVMRYVSNG-FPVIVEYNPATFGFMQYSSGVYVQETRALTNPK 282

Query:   288 CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEASYPVAM 346
                 + VVG+   + +    YW   NS+G+TWGE GYIRI+R      IA  A +P A+
Sbjct:   283 SSQFLVVVGYDH-DVDSNLDYWRCLNSFGDTWGEEGYIRIVRRSNQ-PIAKNAVFPSAL 339


>TAIR|locus:505006093
            symbol:AT1G02305 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0005829 "cytosol" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684 GO:GO:0005773
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 HOGENOM:HOG000241341
            KO:K01363 PANTHER:PTHR12411:SF16 OMA:CCGFLCG UniGene:At.23486
            UniGene:At.42610 UniGene:At.43952 EMBL:AY039887 EMBL:AF428337
            EMBL:BT002227 IPI:IPI00524601 RefSeq:NP_563648.1 HSSP:P07858
            ProteinModelPortal:Q93VC9 SMR:Q93VC9 IntAct:Q93VC9 STRING:Q93VC9
            MEROPS:C01.049 PRIDE:Q93VC9 ProMEX:Q93VC9 EnsemblPlants:AT1G02305.1
            GeneID:839538 KEGG:ath:AT1G02305 TAIR:At1g02305 InParanoid:Q93VC9
            PhylomeDB:Q93VC9 ProtClustDB:CLSN2687619 Genevestigator:Q93VC9
            Uniprot:Q93VC9
        Length = 362

 Score = 151 (58.2 bits), Expect = 4.5e-19, Sum P(2) = 4.5e-19
 Identities = 31/78 (39%), Positives = 41/78 (52%)

Query:   260 PVSVCVEASGQAFRFYKRGVLNAECGDNCD-HGVAVVGFGTAEEEDGAKYWLIKNSWGET 318
             PV V      + F  YK GV     G N   H V ++G+GT++  DG  YWL+ N W  +
Sbjct:   259 PVEVAFTVY-EDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSD--DGEDYWLLANQWNRS 315

Query:   319 WGESGYIRILRDEGLCGI 336
             WG+ GY +I R    CGI
Sbjct:   316 WGDDGYFKIRRGTNECGI 333

 Score = 143 (55.4 bits), Expect = 4.5e-19, Sum P(2) = 4.5e-19
 Identities = 48/176 (27%), Positives = 74/176 (42%)

Query:    55 DELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNE-FSDLTNEEFRASYTGYNXXXXXX 113
             + L K    +   QN E +++ N+  N  +K   N+ F++ T  EF+    G        
Sbjct:    34 ENLSKQKLTSWILQN-EIVKEVNENPNAGWKASFNDRFANATVAEFKR-LLGVKPTPKTE 91

Query:   114 XXXX---XXXXTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQI 170
                        + K     D  T+  W +  ++  I +QGHCGSCWAF AV ++     I
Sbjct:    92 FLGVPIVSHDISLKLPKEFDARTA--WSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCI 149

Query:   171 TGGKLIELSEQQLVDCS--TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQG 224
                  + LS   L+ C       GC+GG    A+ Y  ++ G+ TE   PY    G
Sbjct:   150 KYNMNVSLSVNDLLACCGFLCGQGCNGGYPIAAWRYF-KHHGVVTEECDPYFDNTG 204


>RGD|621509
            symbol:Ctsb "cathepsin B" species:10116 "Rattus norvegicus"
            [GO:0004175 "endopeptidase activity" evidence=IMP;IDA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA;ISO;IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005576
            "extracellular region" evidence=IDA] [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0005730 "nucleolus" evidence=IEA;ISO]
            [GO:0005737 "cytoplasm" evidence=IDA] [GO:0005739 "mitochondrion"
            evidence=IEA;ISO;IDA] [GO:0005764 "lysosome" evidence=IEA;ISO;IDA]
            [GO:0006508 "proteolysis" evidence=IEA;IEP;ISO;IMP;IDA;TAS]
            [GO:0006914 "autophagy" evidence=IEP] [GO:0006950 "response to
            stress" evidence=IEP] [GO:0007283 "spermatogenesis" evidence=IEP]
            [GO:0007519 "skeletal muscle tissue development" evidence=IEP]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0009611
            "response to wounding" evidence=IEP] [GO:0009612 "response to
            mechanical stimulus" evidence=IEP] [GO:0009749 "response to glucose
            stimulus" evidence=IEP] [GO:0009897 "external side of plasma
            membrane" evidence=IDA] [GO:0009986 "cell surface" evidence=IDA]
            [GO:0014070 "response to organic cyclic compound" evidence=IEP]
            [GO:0014075 "response to amine stimulus" evidence=IEP] [GO:0016324
            "apical plasma membrane" evidence=IDA] [GO:0030984 "kininogen
            binding" evidence=IPI] [GO:0032403 "protein complex binding"
            evidence=IPI] [GO:0034097 "response to cytokine stimulus"
            evidence=IEP] [GO:0042277 "peptide binding" evidence=IDA]
            [GO:0042383 "sarcolemma" evidence=IDA] [GO:0042470 "melanosome"
            evidence=IEA] [GO:0043231 "intracellular membrane-bounded
            organelle" evidence=ISO] [GO:0043434 "response to peptide hormone
            stimulus" evidence=IEP] [GO:0043621 "protein self-association"
            evidence=IDA] [GO:0045471 "response to ethanol" evidence=IEP]
            [GO:0048471 "perinuclear region of cytoplasm" evidence=ISO;IDA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0060548 "negative regulation of cell death" evidence=IMP]
            [GO:0070670 "response to interleukin-4" evidence=IEP] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA;ISO]
            [GO:0005901 "caveola" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:621509 GO:GO:0005739 GO:GO:0042470
            GO:GO:0005615 GO:GO:0009612 GO:GO:0009611 GO:GO:0009897
            GO:GO:0045471 GO:GO:0016324 GO:GO:0009749 GO:GO:0006914
            GO:GO:0043434 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0007283
            GO:GO:0005764 GO:GO:0042383 GO:GO:0043621 GO:GO:0014070
            GO:GO:0042277 GO:GO:0060548 GO:GO:0014075 GO:GO:0004197
            GO:GO:0070670 GO:GO:0007519 MEROPS:C01.060 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 GO:GO:0097067
            PANTHER:PTHR12411:SF16 EMBL:X82396 EMBL:M11305 IPI:IPI00212811
            PIR:S51041 UniGene:Rn.100909 PDB:1CPJ PDB:1CTE PDB:1MIR PDB:1THE
            PDBsum:1CPJ PDBsum:1CTE PDBsum:1MIR PDBsum:1THE
            ProteinModelPortal:P00787 SMR:P00787 STRING:P00787 PRIDE:P00787
            UCSC:RGD:621509 InParanoid:P00787 SABIO-RK:P00787 BindingDB:P00787
            ChEMBL:CHEMBL2602 EvolutionaryTrace:P00787 ArrayExpress:P00787
            Genevestigator:P00787 GermOnline:ENSRNOG00000010331 Uniprot:P00787
        Length = 339

 Score = 158 (60.7 bits), Expect = 6.1e-19, Sum P(2) = 6.1e-19
 Identities = 30/69 (43%), Positives = 41/69 (59%)

Query:   272 FRFYKRGVLNAECGDNCD-HGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
             F  YK GV   E GD    H + ++G+G    E+G  YWL+ NSW   WG++G+ +ILR 
Sbjct:   259 FLTYKSGVYKHEAGDVMGGHAIRILGWGI---ENGVPYWLVANSWNVDWGDNGFFKILRG 315

Query:   331 EGLCGIATE 339
             E  CGI +E
Sbjct:   316 ENHCGIESE 324

 Score = 132 (51.5 bits), Expect = 6.1e-19, Sum P(2) = 6.1e-19
 Identities = 34/96 (35%), Positives = 54/96 (56%)

Query:   130 DVPTSIDWREKGA----VTHIKNQGHCGSCWAFSAVAAV-EGITQITGGKL-IELSEQQL 183
             ++P S D RE+ +    +  I++QG CGSCWAF AV A+ + I   T G++ +E+S + L
Sbjct:    79 NLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDL 138

Query:   184 VDCS--TDNNGCSGGLMDKAFEYIIENKGLATEADY 217
             + C      +GC+GG    A+ +    KGL +   Y
Sbjct:   139 LTCCGIQCGDGCNGGYPSGAWNFWTR-KGLVSGGVY 173


>UNIPROTKB|Q6IN22
            symbol:Ctsb "Cathepsin B" species:10116 "Rattus norvegicus"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 RGD:621509 GO:GO:0005739
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 CTD:1508
            GeneTree:ENSGT00560000076599 HOVERGEN:HBG003480 KO:K01363
            OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16 UniGene:Rn.100909
            EMBL:CH474023 HSSP:P00785 EMBL:BC072490 IPI:IPI00562653
            RefSeq:NP_072119.2 SMR:Q6IN22 IntAct:Q6IN22 STRING:Q6IN22
            Ensembl:ENSRNOT00000014177 GeneID:64529 KEGG:rno:64529
            InParanoid:Q6IN22 NextBio:613362 Genevestigator:Q6IN22
            Uniprot:Q6IN22
        Length = 339

 Score = 158 (60.7 bits), Expect = 6.1e-19, Sum P(2) = 6.1e-19
 Identities = 30/69 (43%), Positives = 41/69 (59%)

Query:   272 FRFYKRGVLNAECGDNCD-HGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
             F  YK GV   E GD    H + ++G+G    E+G  YWL+ NSW   WG++G+ +ILR 
Sbjct:   259 FLTYKSGVYKHEAGDVMGGHAIRILGWGI---ENGVPYWLVANSWNVDWGDNGFFKILRG 315

Query:   331 EGLCGIATE 339
             E  CGI +E
Sbjct:   316 ENHCGIESE 324

 Score = 132 (51.5 bits), Expect = 6.1e-19, Sum P(2) = 6.1e-19
 Identities = 34/96 (35%), Positives = 54/96 (56%)

Query:   130 DVPTSIDWREKGA----VTHIKNQGHCGSCWAFSAVAAV-EGITQITGGKL-IELSEQQL 183
             ++P S D RE+ +    +  I++QG CGSCWAF AV A+ + I   T G++ +E+S + L
Sbjct:    79 NLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDL 138

Query:   184 VDCS--TDNNGCSGGLMDKAFEYIIENKGLATEADY 217
             + C      +GC+GG    A+ +    KGL +   Y
Sbjct:   139 LTCCGIQCGDGCNGGYPSGAWNFWTR-KGLVSGGVY 173


>UNIPROTKB|F1N9D7
            symbol:CTSB "Cathepsin B" species:9031 "Gallus gallus"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005730 "nucleolus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005739 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0005764
            GO:GO:0004197 GeneTree:ENSGT00560000076599 OMA:GYPSGAW
            GO:GO:0097067 PANTHER:PTHR12411:SF16 IPI:IPI00573387
            EMBL:AADN02018292 Ensembl:ENSGALT00000026896
            Ensembl:ENSGALT00000036723 Uniprot:F1N9D7
        Length = 340

 Score = 151 (58.2 bits), Expect = 2.3e-18, Sum P(2) = 2.3e-18
 Identities = 31/97 (31%), Positives = 49/97 (50%)

Query:   244 LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCD-HGVAVVGFGTAEE 302
             +P+ ++  + +     PV        + F  YK GV     G+    H + ++G+G    
Sbjct:   233 VPRSEKEIMAEIYKNGPVEGAFIVY-EDFLMYKSGVYQHVSGEQVGGHAIRILGWGV--- 288

Query:   303 EDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATE 339
             E+G  YWL  NSW   WG++G+ +ILR E  CGI +E
Sbjct:   289 ENGTPYWLAANSWNTDWGDNGFFKILRGEDHCGIESE 325

 Score = 135 (52.6 bits), Expect = 2.3e-18, Sum P(2) = 2.3e-18
 Identities = 34/96 (35%), Positives = 51/96 (53%)

Query:   130 DVPTSID----WREKGAVTHIKNQGHCGSCWAFSAVAAV-EGITQITGGKL-IELSEQQL 183
             D+P + D    W     ++ I++QG CGSCWAF AV A+ + I   T  K+ +E+S + L
Sbjct:    79 DLPDTFDSRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDL 138

Query:   184 VDCS--TDNNGCSGGLMDKAFEYIIENKGLATEADY 217
             + C       GC+GG    A+ Y  E +GL +   Y
Sbjct:   139 LSCCGFECGMGCNGGYPSGAWRYWTE-RGLVSGGLY 173


>UNIPROTKB|E2R6Q7
            symbol:CTSB "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730 "nucleolus"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790
            GO:GO:0005764 GO:GO:0004197 CTD:1508 GeneTree:ENSGT00560000076599
            KO:K01363 OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16
            EMBL:AAEX03014318 RefSeq:XP_543203.3 Ensembl:ENSCAFT00000012692
            GeneID:486077 KEGG:cfa:486077 NextBio:20859923 Uniprot:E2R6Q7
        Length = 339

 Score = 151 (58.2 bits), Expect = 4.6e-18, Sum P(2) = 4.6e-18
 Identities = 33/94 (35%), Positives = 47/94 (50%)

Query:   248 DEHALLQAVTKQ-PVSVCVEASGQAFRFYKRGVLNAECGDNCD-HGVAVVGFGTAEEEDG 305
             +E  ++  + K  PV          F  YK GV     G+    H V ++G+G    EDG
Sbjct:   235 NEKEIMAEIYKNGPVEAAFTVYSD-FLLYKSGVYQHVTGEMMGGHAVRILGWGV---EDG 290

Query:   306 AKYWLIKNSWGETWGESGYIRILRDEGLCGIATE 339
               YWL+ NSW   WG++G+ +ILR    CGI +E
Sbjct:   291 TPYWLVGNSWNTDWGDNGFFKILRGRDHCGIESE 324

 Score = 132 (51.5 bits), Expect = 4.6e-18, Sum P(2) = 4.6e-18
 Identities = 33/95 (34%), Positives = 54/95 (56%)

Query:   131 VPTSIDWREKG----AVTHIKNQGHCGSCWAFSAVAAV-EGITQITGGKL-IELSEQQLV 184
             +P S D RE+      +  I++QG CGSCWAF AV A+ + I   T G + +E+S + ++
Sbjct:    80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEVSAEDML 139

Query:   185 DCSTDN--NGCSGGLMDKAFEYIIENKGLATEADY 217
              C  D   +GC+GG   +A+ +  + +GL +   Y
Sbjct:   140 TCCGDQCGDGCNGGFPAEAWNFWTK-QGLVSGGLY 173


>FB|FBgn0030521
            symbol:CtsB1 "Cathepsin B1" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS] [GO:0035071 "salivary gland cell autophagic cell
            death" evidence=IEP] [GO:0048102 "autophagic cell death"
            evidence=IEP] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:AE014298 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0035071
            GO:GO:0004197 MEROPS:C01.060 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 KO:K01363 PANTHER:PTHR12411:SF16
            HSSP:P07688 EMBL:AY060640 RefSeq:NP_572920.1 UniGene:Dm.3926
            SMR:Q9VY87 IntAct:Q9VY87 MINT:MINT-932864 STRING:Q9VY87
            EnsemblMetazoa:FBtr0073838 GeneID:32341 KEGG:dme:Dmel_CG10992
            UCSC:CG10992-RA FlyBase:FBgn0030521 InParanoid:Q9VY87 OMA:TEGHIRR
            OrthoDB:EOG48W9HM ChiTaRS:CG10992 GenomeRNAi:32341 NextBio:778020
            Uniprot:Q9VY87
        Length = 340

 Score = 144 (55.7 bits), Expect = 4.8e-18, Sum P(2) = 4.8e-18
 Identities = 48/162 (29%), Positives = 68/162 (41%)

Query:    71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFKYQN-VT 129
             E+IE    +  +T+ +G N  + +T    R     +                  Y N V 
Sbjct:    27 EFIEVVRSKA-KTWTVGRNFDASVTEGHIRRLMGVHPDAHKFALPDKREVLGDLYVNSVD 85

Query:   130 DVPTSID----WREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQI-TGGKL-IELSEQQL 183
             ++P   D    W     +  I++QG CGSCWAF AV A+     I +GGK+    S   L
Sbjct:    86 ELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDL 145

Query:   184 VDCS-TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQG 224
             V C  T   GC+GG    A+ Y    KG+ +    PY   QG
Sbjct:   146 VSCCHTCGFGCNGGFPGAAWSYWTR-KGIVSGG--PYGSNQG 184

 Score = 140 (54.3 bits), Expect = 4.8e-18, Sum P(2) = 4.8e-18
 Identities = 29/68 (42%), Positives = 37/68 (54%)

Query:   275 YKRGVLNAECGDNCD-HGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGL 333
             YK GV   E G     H + ++G+G   EE    YWLI NSW   WG+ G+ RILR +  
Sbjct:   268 YKDGVYQHEHGKELGGHAIRILGWGVWGEEK-IPYWLIGNSWNTDWGDHGFFRILRGQDH 326

Query:   334 CGIATEAS 341
             CGI +  S
Sbjct:   327 CGIESSIS 334


>ZFIN|ZDB-GENE-040426-2650
            symbol:ctsba "cathepsin B, a" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0031101 "fin regeneration"
            evidence=IEP] [GO:0008233 "peptidase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 ZFIN:ZDB-GENE-040426-2650 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790
            GO:GO:0004197 GO:GO:0031101 MEROPS:C01.060 HOVERGEN:HBG003480
            PANTHER:PTHR12411:SF16 HSSP:P07688 EMBL:BC044517 IPI:IPI00485996
            UniGene:Dr.3374 ProteinModelPortal:Q803E4 SMR:Q803E4 STRING:Q803E4
            PRIDE:Q803E4 InParanoid:Q803E4 ArrayExpress:Q803E4 Bgee:Q803E4
            Uniprot:Q803E4
        Length = 330

 Score = 154 (59.3 bits), Expect = 7.2e-18, Sum P(2) = 7.2e-18
 Identities = 33/97 (34%), Positives = 46/97 (47%)

Query:   244 LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCD-HGVAVVGFGTAEE 302
             +P      + +     PV        + F  YK GV     G     H + ++G+G   E
Sbjct:   231 VPSNQNGIMAELFKNGPVEAAFTVY-EDFLLYKSGVYQHMSGSALGGHAIKILGWG---E 286

Query:   303 EDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATE 339
             E+G  YWL  NSW   WG++GY +ILR E  CGI +E
Sbjct:   287 ENGVPYWLAANSWNTDWGDNGYFKILRGEDHCGIESE 323

 Score = 126 (49.4 bits), Expect = 7.2e-18, Sum P(2) = 7.2e-18
 Identities = 41/139 (29%), Positives = 62/139 (44%)

Query:   124 KYQNVTDVPTSIDWREKG----AVTHIKNQGHCGSCWAFSAVAAVEGITQI-TGGKL-IE 177
             +Y     +P + D RE+      +  I++QG CGSCWAF A  A+     I +  K+ +E
Sbjct:    72 QYTEGLKLPKNFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIQSNAKVSVE 131

Query:   178 LSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADY-------PYQQEQGTCDKQ 229
             +S Q L+ C      GC+GG    A+++   + GL T   Y       PY  E   C+  
Sbjct:   132 ISSQDLLTCCDSCGMGCNGGYPSAAWDFWTTD-GLVTGGLYNSHIGCRPYTIEP--CEHH 188

Query:   230 KEKAAAATIGKYEDLPKGD 248
                +     G+  D P  D
Sbjct:   189 VNGSRPPCTGEGGDTPNCD 207


>FB|FBgn0034709
            symbol:Swim "Secreted Wg-interacting molecule" species:7227
            "Drosophila melanogaster" [GO:0004197 "cysteine-type endopeptidase
            activity" evidence=ISS] [GO:0006955 "immune response" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044 "scavenger
            receptor activity" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0042600 "chorion" evidence=IDA]
            [GO:0035593 "positive regulation of Wnt receptor signaling pathway
            by establishment of Wnt protein localization to extracellular
            region" evidence=IMP] [GO:0030177 "positive regulation of Wnt
            receptor signaling pathway" evidence=IDA] [GO:0005615
            "extracellular space" evidence=IDA] [GO:0017147 "Wnt-protein
            binding" evidence=IDA] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR013128 Pfam:PF00112 PROSITE:PS50958 SMART:SM00201
            SMART:SM00645 EMBL:AE013599 GO:GO:0005615 GO:GO:0006955
            GO:GO:0030247 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GO:GO:0017147 GO:GO:0005044
            GeneTree:ENSGT00560000076599 GO:GO:0042600 eggNOG:NOG310046
            OMA:DNCNRCT HSSP:P80067 EMBL:AY113377 RefSeq:NP_611652.2
            RefSeq:NP_726176.1 UniGene:Dm.732 SMR:Q7JWQ7 IntAct:Q7JWQ7
            EnsemblMetazoa:FBtr0071784 EnsemblMetazoa:FBtr0071785 GeneID:37537
            KEGG:dme:Dmel_CG3074 UCSC:CG3074-RA FlyBase:FBgn0034709
            HOGENOM:HOG000264150 InParanoid:Q7JWQ7 OrthoDB:EOG48CZ9P
            GenomeRNAi:37537 NextBio:804155 GO:GO:0035593 Uniprot:Q7JWQ7
        Length = 431

 Score = 230 (86.0 bits), Expect = 7.8e-18, P = 7.8e-18
 Identities = 64/207 (30%), Positives = 96/207 (46%)

Query:   159 SAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYP 218
             ++VA+     Q  G + ++LS Q ++ C+    GC GG +D A+ Y+   KG+  E  YP
Sbjct:   219 TSVASDRFAIQSKGKENVQLSAQNILSCTRRQQGCEGGHLDAAWRYL-HKKGVVDENCYP 277

Query:   219 YQQEQGTCDK------------QK----EKAAAATIGKYEDLPKGDEHALLQAVTKQPVS 262
             Y Q + TC              QK    ++ +  T+G    L + +   + +     PV 
Sbjct:   278 YTQHRDTCKIRHNSRSLRANGCQKPVNVDRDSLYTVGPAYSLNR-EADIMAEIFHSGPVQ 336

Query:   263 VCVEASGQAFRF----YKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGET 318
               +  +   F +    Y+    N +      H V +VG+G  EE +G KYW+  NSWG  
Sbjct:   337 ATMRVNRDFFAYSGGVYRETAANRKAPTGF-HSVKLVGWG--EEHNGEKYWIAANSWGSW 393

Query:   319 WGESGYIRILRDEGLCGIATE--ASYP 343
             WGE GY RILR    CGI     AS+P
Sbjct:   394 WGEHGYFRILRGSNECGIEEYVLASWP 420

 Score = 166 (63.5 bits), Expect = 1.1e-09, P = 1.1e-09
 Identities = 36/108 (33%), Positives = 61/108 (56%)

Query:   124 KYQNVTD-VPTSIDWREKGA--VTHIKNQGHCGSCWAFS--AVAAVEGITQITGGKLIEL 178
             + +N TD +P+S +  +K +  ++ + +QG CG+ W  S  +VA+     Q  G + ++L
Sbjct:   179 RLKNPTDGLPSSFNALDKWSSYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQL 238

Query:   179 SEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTC 226
             S Q ++ C+    GC GG +D A+ Y+   KG+  E  YPY Q + TC
Sbjct:   239 SAQNILSCTRRQQGCEGGHLDAAWRYL-HKKGVVDENCYPYTQHRDTC 285


>UNIPROTKB|P43233
            symbol:CTSB "Cathepsin B" species:9031 "Gallus gallus"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 MEROPS:C01.060 CTD:1508
            HOGENOM:HOG000241341 HOVERGEN:HBG003480 KO:K01363 OrthoDB:EOG4K6G4C
            PANTHER:PTHR12411:SF16 EMBL:U18083 IPI:IPI00573387 PIR:S58770
            RefSeq:NP_990702.1 UniGene:Gga.3854 ProteinModelPortal:P43233
            SMR:P43233 STRING:P43233 PRIDE:P43233 GeneID:396329 KEGG:gga:396329
            InParanoid:P43233 NextBio:20816377 Uniprot:P43233
        Length = 340

 Score = 146 (56.5 bits), Expect = 9.3e-18, Sum P(2) = 9.3e-18
 Identities = 31/97 (31%), Positives = 48/97 (49%)

Query:   244 LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCD-HGVAVVGFGTAEE 302
             +P+ ++  + +     PV        + F  YK GV     G+    H + ++G+G    
Sbjct:   233 VPRSEKEIMAEIYKNGPVEGAFIVY-EDFLMYKSGVYQHVSGEQVGGHAIRILGWGV--- 288

Query:   303 EDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATE 339
             E+G  YWL  NSW   WG +G+ +ILR E  CGI +E
Sbjct:   289 ENGTPYWLAANSWNTDWGITGFFKILRGEDHCGIESE 325

 Score = 135 (52.6 bits), Expect = 9.3e-18, Sum P(2) = 9.3e-18
 Identities = 34/96 (35%), Positives = 51/96 (53%)

Query:   130 DVPTSID----WREKGAVTHIKNQGHCGSCWAFSAVAAV-EGITQITGGKL-IELSEQQL 183
             D+P + D    W     ++ I++QG CGSCWAF AV A+ + I   T  K+ +E+S + L
Sbjct:    79 DLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDL 138

Query:   184 VDCS--TDNNGCSGGLMDKAFEYIIENKGLATEADY 217
             + C       GC+GG    A+ Y  E +GL +   Y
Sbjct:   139 LSCCGFECGMGCNGGYPSGAWRYWTE-RGLVSGGLY 173


>UNIPROTKB|A1E295
            symbol:CTSB "Cathepsin B" species:9823 "Sus scrofa"
            [GO:0005615 "extracellular space" evidence=IDA] [GO:0042470
            "melanosome" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730
            "nucleolus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0042470
            GO:GO:0005615 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 MEROPS:C01.060 CTD:1508
            GeneTree:ENSGT00560000076599 HOVERGEN:HBG003480 KO:K01363
            OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16 EMBL:EF095956
            RefSeq:NP_001090927.1 UniGene:Ssc.53773 ProteinModelPortal:A1E295
            SMR:A1E295 PRIDE:A1E295 Ensembl:ENSSSCT00000026923 GeneID:100037961
            KEGG:ssc:100037961 Uniprot:A1E295
        Length = 335

 Score = 146 (56.5 bits), Expect = 1.1e-17, Sum P(2) = 1.1e-17
 Identities = 28/69 (40%), Positives = 40/69 (57%)

Query:   272 FRFYKRGVLNAECGDNCD-HGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
             F  YK GV     GD    H + ++G+G    E+G  YWL+ NSW   WG++G+ +ILR 
Sbjct:   259 FLQYKSGVYQHVTGDLMGGHAIRILGWGV---ENGTPYWLVGNSWNTDWGDNGFFKILRG 315

Query:   331 EGLCGIATE 339
             +  CGI +E
Sbjct:   316 QDHCGIESE 324

 Score = 134 (52.2 bits), Expect = 1.1e-17, Sum P(2) = 1.1e-17
 Identities = 33/95 (34%), Positives = 54/95 (56%)

Query:   131 VPTSIDWREKG----AVTHIKNQGHCGSCWAFSAVAAV-EGITQITGGKL-IELSEQQLV 184
             +P S D RE+      +  I++QG CGSCWAF AV A+ + I   + G++ +E+S + ++
Sbjct:    80 LPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDML 139

Query:   185 DCSTDN--NGCSGGLMDKAFEYIIENKGLATEADY 217
              C  D   +GC+GG    A+ +  + KGL +   Y
Sbjct:   140 TCCGDECGDGCNGGFPSGAWNFWTK-KGLVSGGLY 173


>DICTYBASE|DDB_G0286055
            symbol:DDB_G0286055 "peptidase C1A family protein"
            species:44689 "Dictyostelium discoideum" [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 dictyBase:DDB_G0286055 Pfam:PF00188 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411 EMBL:AAFI02000085
            Gene3D:3.40.33.10 InterPro:IPR001283 InterPro:IPR014044
            PRINTS:PR00837 SMART:SM00198 SUPFAM:SSF55797
            ProtClustDB:CLSZ2429919 RefSeq:XP_637918.1
            ProteinModelPortal:Q54MB6 EnsemblProtists:DDB0186794 GeneID:8625429
            KEGG:ddi:DDB_G0286055 InParanoid:Q54MB6 OMA:GENGFAR Uniprot:Q54MB6
        Length = 435

 Score = 227 (85.0 bits), Expect = 2.7e-17, P = 2.7e-17
 Identities = 74/238 (31%), Positives = 111/238 (46%)

Query:   131 VPT--SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC-- 186
             VPT  S DWR+ G V   K+  +C S WAF+A    E  + +      + S QQL+DC  
Sbjct:   206 VPTDGSFDWRDNGVVGFPKDSSNCASGWAFTAAGIFESRSAMRTRHRYDYSAQQLIDCIN 265

Query:   187 ---------STDN-NGCS--GGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAA 234
                      S  N   CS   G ++KA  Y  +  GL   + YPY           + + 
Sbjct:   266 VCIIIFSNFSIGNYTKCSRFSGELNKALMYA-QAYGLQATSTYPYVGASSIGCSYNQSSI 324

Query:   235 AATIGKYEDLPKGDEHALLQAVTKQ-PVSVCVEASGQAFRFYKRGVLNAECGD------N 287
             A   G  E    G + ++++   KQ PV V +  + + F +Y  G+   EC +      N
Sbjct:   325 AVEGGDVEYSQVGRD-SIVEKCRKQGPVGVGIYVTNE-FLYYAGGIF--ECNNTLIDNAN 380

Query:   288 CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGL-CGIATEASYPV 344
              +H V +VG+    E+D   Y++IKN++G TWGE+G+ RI  D    C IA   +Y +
Sbjct:   381 INHNVLLVGYN---EKDN--YYIIKNNFGRTWGENGFARITADVNKDCLIAKNPAYSI 433


>WB|WBGene00021072
            symbol:W07B8.4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:FO081739 PIR:T31728 RefSeq:NP_503382.1
            HSSP:P53634 ProteinModelPortal:O16288 SMR:O16288 STRING:O16288
            MEROPS:C01.A39 PaxDb:O16288 EnsemblMetazoa:W07B8.4 GeneID:178611
            KEGG:cel:CELE_W07B8.4 UCSC:W07B8.4 CTD:178611 WormBase:W07B8.4
            InParanoid:O16288 OMA:ESQYGCK NextBio:901836 Uniprot:O16288
        Length = 335

 Score = 142 (55.0 bits), Expect = 3.3e-17, Sum P(2) = 3.3e-17
 Identities = 29/70 (41%), Positives = 36/70 (51%)

Query:   272 FRFYKRGVLNAECGDNCD-HGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
             F  YK G+     G     H V ++G+G    ++G  YWL  NSW   WGE GY RILR 
Sbjct:   258 FYLYKTGIYTHVAGGELGGHAVKMLGWGV---DNGTPYWLAANSWNTVWGEKGYFRILRG 314

Query:   331 EGLCGIATEA 340
                CGI + A
Sbjct:   315 VDECGIESAA 324

 Score = 134 (52.2 bits), Expect = 3.3e-17, Sum P(2) = 3.3e-17
 Identities = 33/105 (31%), Positives = 52/105 (49%)

Query:   126 QNVTDVPTSID----WREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIE--LS 179
             +    +P S D    W +  +V +I++Q HCGSCWA +A  A+   T I     +   LS
Sbjct:    68 ETADSIPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVNTLLS 127

Query:   180 EQQLVDCSTDN----NGCSGGLMDKAFEYIIENKGLATEADYPYQ 220
              + ++ C T      +GC GG   +A+ Y ++N GL T   +  Q
Sbjct:   128 AEDILTCCTGKFNCGDGCEGGYPIQAWRYWVKN-GLVTGGSFESQ 171


>UNIPROTKB|P07858
            symbol:CTSB "Cathepsin B" species:9606 "Homo sapiens"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0005739 "mitochondrion" evidence=IEA] [GO:0042470 "melanosome"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042981 "regulation of apoptotic process" evidence=TAS]
            [GO:0006508 "proteolysis" evidence=IDA] [GO:0005764 "lysosome"
            evidence=IDA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEP] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IDA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IDA] [GO:0005622 "intracellular" evidence=TAS]
            [GO:0036021 "endolysosome lumen" evidence=TAS] [GO:0045087 "innate
            immune response" evidence=TAS] [GO:0008233 "peptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0005730 "nucleolus" evidence=IDA] [GO:0043231 "intracellular
            membrane-bounded organelle" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_118779 GO:GO:0005739
            GO:GO:0042470 GO:GO:0048471 Reactome:REACT_6900 GO:GO:0005615
            GO:GO:0009612 GO:GO:0009611 GO:GO:0005730 GO:GO:0042981
            GO:GO:0009897 GO:GO:0045471 GO:GO:0016324 GO:GO:0009749
            GO:GO:0006914 GO:GO:0043434 eggNOG:COG4870 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087
            GO:GO:0050790 GO:GO:0042383 GO:GO:0014070 GO:GO:0042277
            GO:GO:0060548 GO:GO:0005901 GO:GO:0014075 GO:GO:0004197
            GO:GO:0070670 EMBL:CH471157 GO:GO:0007519 MEROPS:C01.060 CTD:1508
            HOGENOM:HOG000241341 HOVERGEN:HBG003480 KO:K01363 OMA:GYPSGAW
            OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 GO:GO:0097067
            PANTHER:PTHR12411:SF16 EMBL:M14221 EMBL:L16510 EMBL:AK092070
            EMBL:AK075393 EMBL:BC010240 EMBL:BC095408 EMBL:M13230
            IPI:IPI00295741 PIR:A26498 RefSeq:NP_001899.1 RefSeq:NP_680090.1
            RefSeq:NP_680091.1 RefSeq:NP_680092.1 RefSeq:NP_680093.1
            UniGene:Hs.520898 PDB:1CSB PDB:1GMY PDB:1HUC PDB:1PBH PDB:2IPP
            PDB:2PBH PDB:3AI8 PDB:3CBJ PDB:3CBK PDB:3K9M PDB:3PBH PDBsum:1CSB
            PDBsum:1GMY PDBsum:1HUC PDBsum:1PBH PDBsum:2IPP PDBsum:2PBH
            PDBsum:3AI8 PDBsum:3CBJ PDBsum:3CBK PDBsum:3K9M PDBsum:3PBH
            ProteinModelPortal:P07858 SMR:P07858 DIP:DIP-42785N IntAct:P07858
            MINT:MINT-1397666 STRING:P07858 PhosphoSite:P07858 DMDM:68067549
            SWISS-2DPAGE:P07858 UCD-2DPAGE:P07858 PaxDb:P07858
            PeptideAtlas:P07858 PRIDE:P07858 DNASU:1508 Ensembl:ENST00000345125
            Ensembl:ENST00000353047 Ensembl:ENST00000434271
            Ensembl:ENST00000453527 Ensembl:ENST00000530640
            Ensembl:ENST00000531089 Ensembl:ENST00000533455
            Ensembl:ENST00000534510 GeneID:1508 KEGG:hsa:1508 UCSC:uc003wum.3
            GeneCards:GC08M011700 H-InvDB:HIX0007320 HGNC:HGNC:2527
            HPA:CAB000457 HPA:HPA018156 MIM:116810 neXtProt:NX_P07858
            PharmGKB:PA27027 InParanoid:P07858 PhylomeDB:P07858
            BindingDB:P07858 ChEMBL:CHEMBL4072 ChiTaRS:CTSB
            EvolutionaryTrace:P07858 GenomeRNAi:1508 NextBio:6235
            PMAP-CutDB:P07858 ArrayExpress:P07858 Bgee:P07858 CleanEx:HS_CTSB
            Genevestigator:P07858 GermOnline:ENSG00000164733 GO:GO:0036021
            Uniprot:P07858
        Length = 339

 Score = 147 (56.8 bits), Expect = 3.6e-17, Sum P(2) = 3.6e-17
 Identities = 27/69 (39%), Positives = 40/69 (57%)

Query:   272 FRFYKRGVLNAECGDNCD-HGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
             F  YK GV     G+    H + ++G+G    E+G  YWL+ NSW   WG++G+ +ILR 
Sbjct:   259 FLLYKSGVYQHVTGEMMGGHAIRILGWGV---ENGTPYWLVANSWNTDWGDNGFFKILRG 315

Query:   331 EGLCGIATE 339
             +  CGI +E
Sbjct:   316 QDHCGIESE 324

 Score = 128 (50.1 bits), Expect = 3.6e-17, Sum P(2) = 3.6e-17
 Identities = 34/95 (35%), Positives = 52/95 (54%)

Query:   131 VPTSIDWREKG----AVTHIKNQGHCGSCWAFSAVAAV-EGITQITGGKL-IELSEQQLV 184
             +P S D RE+      +  I++QG CGSCWAF AV A+ + I   T   + +E+S + L+
Sbjct:    80 LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139

Query:   185 DC--STDNNGCSGGLMDKAFEYIIENKGLATEADY 217
              C  S   +GC+GG   +A+ +    KGL +   Y
Sbjct:   140 TCCGSMCGDGCNGGYPAEAWNFWTR-KGLVSGGLY 173


>WB|WBGene00000785
            symbol:cpr-5 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39896 EMBL:L39927 EMBL:FO081739
            PIR:T37277 RefSeq:NP_503383.1 UniGene:Cel.19730
            ProteinModelPortal:P43509 SMR:P43509 DIP:DIP-25329N IntAct:P43509
            MINT:MINT-1051285 STRING:P43509 MEROPS:C01.A35 PaxDb:P43509
            EnsemblMetazoa:W07B8.5 GeneID:178612 KEGG:cel:CELE_W07B8.5
            UCSC:W07B8.5.1 CTD:178612 WormBase:W07B8.5 InParanoid:P43509
            OMA:DAIPDHF NextBio:901840 Uniprot:P43509
        Length = 344

 Score = 140 (54.3 bits), Expect = 6.6e-17, Sum P(2) = 6.6e-17
 Identities = 32/93 (34%), Positives = 44/93 (47%)

Query:   249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCD-HGVAVVGFGTAEEEDGAK 307
             E    + +T  P+ V      + F  Y  GV     G +   H V ++G+G    ++G  
Sbjct:   245 EQIQTEILTNGPIEVAFTVY-EDFYQYTTGVYVHTAGASLGGHAVKILGWGV---DNGTP 300

Query:   308 YWLIKNSWGETWGESGYIRILRDEGLCGIATEA 340
             YWL+ NSW   WGE GY RI+R    CGI   A
Sbjct:   301 YWLVANSWNVAWGEKGYFRIIRGLNECGIEHSA 333

 Score = 134 (52.2 bits), Expect = 6.6e-17, Sum P(2) = 6.6e-17
 Identities = 34/104 (32%), Positives = 55/104 (52%)

Query:   128 VTD-VPTSIDWREKG----AVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIE--LSE 180
             V+D +P   D R++     ++ +I++Q  CGSCWAF+A  A+   T I     +   LS 
Sbjct:    78 VSDAIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSS 137

Query:   181 QQLVDCSTD----NNGCSGGLMDKAFEYIIENKGLATEADYPYQ 220
             + L+ C T      NGC GG   +A+++ +++ GL T   Y  Q
Sbjct:   138 EDLLSCCTGMFSCGNGCEGGYPIQAWKWWVKH-GLVTGGSYETQ 180

 Score = 43 (20.2 bits), Expect = 1.7e-07, Sum P(2) = 1.7e-07
 Identities = 19/69 (27%), Positives = 29/69 (42%)

Query:   137 WREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQ--QLVD-CSTDNNGC 193
             W + G VT    +   G C  +S     E +  +      E +E   + VD C++ NN  
Sbjct:   166 WVKHGLVTGGSYETQFG-CKPYSIAPCGETVNGVKWPACPEDTEPTPKCVDSCTSKNNYA 224

Query:   194 SGGLMDKAF 202
             +  L DK F
Sbjct:   225 TPYLQDKHF 233


>UNIPROTKB|P07688
            symbol:CTSB "Cathepsin B" species:9913 "Bos taurus"
            [GO:0042470 "melanosome" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730
            "nucleolus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0042470
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 EMBL:L06075 EMBL:M64620
            EMBL:U16336 EMBL:U16337 EMBL:U16338 EMBL:U16339 EMBL:U16341
            EMBL:U16342 EMBL:U16343 EMBL:BC102997 IPI:IPI00692061 PIR:S38328
            RefSeq:NP_776456.1 UniGene:Bt.393 PDB:1ITO PDB:1QDQ PDB:1SP4
            PDB:2DC6 PDB:2DC7 PDB:2DC8 PDB:2DC9 PDB:2DCA PDB:2DCB PDB:2DCC
            PDB:2DCD PDBsum:1ITO PDBsum:1QDQ PDBsum:1SP4 PDBsum:2DC6
            PDBsum:2DC7 PDBsum:2DC8 PDBsum:2DC9 PDBsum:2DCA PDBsum:2DCB
            PDBsum:2DCC PDBsum:2DCD ProteinModelPortal:P07688 SMR:P07688
            STRING:P07688 MEROPS:C01.060 PRIDE:P07688
            Ensembl:ENSBTAT00000036795 GeneID:281105 KEGG:bta:281105 CTD:1508
            eggNOG:NOG315657 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 InParanoid:P07688 KO:K01363 OMA:GYPSGAW
            OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 BindingDB:P07688
            ChEMBL:CHEMBL2323 EvolutionaryTrace:P07688 NextBio:20805177
            ArrayExpress:P07688 GO:GO:0097067 PANTHER:PTHR12411:SF16
            Uniprot:P07688
        Length = 335

 Score = 145 (56.1 bits), Expect = 7.7e-17, Sum P(2) = 7.7e-17
 Identities = 27/69 (39%), Positives = 40/69 (57%)

Query:   272 FRFYKRGVLNAECGDNCD-HGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
             F  YK GV     G+    H + ++G+G    E+G  YWL+ NSW   WG++G+ +ILR 
Sbjct:   259 FLLYKSGVYQHVSGEIMGGHAIRILGWGV---ENGTPYWLVGNSWNTDWGDNGFFKILRG 315

Query:   331 EGLCGIATE 339
             +  CGI +E
Sbjct:   316 QDHCGIESE 324

 Score = 127 (49.8 bits), Expect = 7.7e-17, Sum P(2) = 7.7e-17
 Identities = 32/95 (33%), Positives = 53/95 (55%)

Query:   131 VPTSIDWREKG----AVTHIKNQGHCGSCWAFSAVAAV-EGITQITGGKL-IELSEQQLV 184
             +P S D RE+      +  I++QG CGSCWAF AV A+ + I   + G++ +E+S + ++
Sbjct:    80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 139

Query:   185 DCSTDN--NGCSGGLMDKAFEYIIENKGLATEADY 217
              C      +GC+GG    A+ +  + KGL +   Y
Sbjct:   140 TCCGGECGDGCNGGFPSGAWNFWTK-KGLVSGGLY 173


>WB|WBGene00000784
            symbol:cpr-4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39895 EMBL:L39926 EMBL:FO081381
            PIR:T37280 RefSeq:NP_504682.1 UniGene:Cel.5404
            ProteinModelPortal:P43508 SMR:P43508 DIP:DIP-25376N
            MINT:MINT-1069892 STRING:P43508 MEROPS:C01.A34 PaxDb:P43508
            EnsemblMetazoa:F44C4.3 GeneID:179053 KEGG:cel:CELE_F44C4.3
            UCSC:F44C4.3 CTD:179053 WormBase:F44C4.3 InParanoid:P43508
            OMA:CCGFLCG NextBio:903704 Uniprot:P43508
        Length = 335

 Score = 145 (56.1 bits), Expect = 9.8e-17, Sum P(2) = 9.8e-17
 Identities = 28/66 (42%), Positives = 37/66 (56%)

Query:   272 FRFYKRGVLNAECGDNCD-HGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
             F  YK GV     G     H + ++G+GT   ++G  YWL+ NSW   WGE+GY RI+R 
Sbjct:   262 FYQYKTGVYVHTTGQELGGHAIRILGWGT---DNGTPYWLVANSWNVNWGENGYFRIIRG 318

Query:   331 EGLCGI 336
                CGI
Sbjct:   319 TNECGI 324

 Score = 126 (49.4 bits), Expect = 9.8e-17, Sum P(2) = 9.8e-17
 Identities = 32/101 (31%), Positives = 50/101 (49%)

Query:   127 NVTDVPTSID----WREKGAVTHIKNQGHCGSCWAFSAV-AAVEGITQITGGKLIEL--S 179
             N   +P + D    W    ++ +I++Q  CGSCWAF+A  AA +     + G +  L  +
Sbjct:    77 NEDTIPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSA 136

Query:   180 EQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQ 220
             E  L  CS    GC GG    A++Y++++ G  T   Y  Q
Sbjct:   137 EDVLSCCSNCGYGCEGGYPINAWKYLVKS-GFCTGGSYEAQ 176


>UNIPROTKB|E1BTI7
            symbol:TINAG "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0005044 "scavenger receptor activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0006955 "immune
            response" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0030247 "polysaccharide binding"
            evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
            [GO:0007155 "cell adhesion" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GO:GO:0007155 GO:GO:0005604 GO:GO:0005044
            GeneTree:ENSGT00560000076599 CTD:27283 OMA:WGQLTSS
            EMBL:AADN02002720 EMBL:AADN02002721 IPI:IPI00581566
            RefSeq:XP_419905.3 UniGene:Gga.11215 Ensembl:ENSGALT00000026295
            GeneID:421888 KEGG:gga:421888 Uniprot:E1BTI7
        Length = 467

 Score = 145 (56.1 bits), Expect = 1.4e-16, Sum P(2) = 1.4e-16
 Identities = 35/128 (27%), Positives = 61/128 (47%)

Query:   215 ADYPYQQEQGTCDKQKEKAAAA-TIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFR 273
             ++Y      G C    E +      G +  +   +   + + + K PV   ++   + F 
Sbjct:   332 SEYGKNHTNGPCPNALEDSNRLYRCGSHYRVSSKETDIMEEIMAKGPVQAIMKVY-EDFF 390

Query:   274 FYKRGVL--NAECGDNCD-HGVAVVGFGTAEEEDGAK--YWLIKNSWGETWGESGYIRIL 328
              YK G+   + + G     H V ++G+G+   ++G K  +W+  NSWG+ WGE+GY RIL
Sbjct:   391 LYKEGIYRHSYKAGSKWKTHSVKLLGWGSLPGKNGQKQKFWIAANSWGKYWGENGYFRIL 450

Query:   329 RDEGLCGI 336
             R +  C I
Sbjct:   451 RGQNECDI 458

 Score = 130 (50.8 bits), Expect = 1.4e-16, Sum P(2) = 1.4e-16
 Identities = 30/74 (40%), Positives = 46/74 (62%)

Query:   148 NQGHCGSCWAFS-AVAAVEGITQITGGKLIE-LSEQQLVDCSTDNN-GCSGGLMDKAFEY 204
             +Q +CG+ WAFS A  A + IT  + G++ + LS Q L+ C T N  GC+GG +D A+ Y
Sbjct:   241 DQRNCGASWAFSTASVAADRITIHSDGQITDNLSVQNLISCDTGNQRGCNGGSIDGAWRY 300

Query:   205 IIENKGLATEADYP 218
             +  + G+ + A YP
Sbjct:   301 LTTH-GVVSYACYP 313


>WB|WBGene00009158
            symbol:F26E4.3 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005576
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005044
            GeneTree:ENSGT00560000076599 HSSP:P07711 EMBL:Z81070
            eggNOG:NOG310046 HOGENOM:HOG000241342 OMA:DNCNRCT PIR:T21421
            RefSeq:NP_492593.2 ProteinModelPortal:P90850 SMR:P90850
            PaxDb:P90850 EnsemblMetazoa:F26E4.3.1 EnsemblMetazoa:F26E4.3.2
            GeneID:172827 KEGG:cel:CELE_F26E4.3 UCSC:F26E4.3.1 CTD:172827
            WormBase:F26E4.3 InParanoid:P90850 NextBio:877161 Uniprot:P90850
        Length = 452

 Score = 144 (55.7 bits), Expect = 1.6e-16, Sum P(2) = 1.6e-16
 Identities = 37/106 (34%), Positives = 58/106 (54%)

Query:   130 DVPTSIDWREK-GAVTH-IKNQGHCGSCWAFSAVA-AVEGITQITGGKLIE-LSEQQLVD 185
             ++P   D R+K G + H + +QG CGS W+ S  A + + +  I+ G++   LS QQL+ 
Sbjct:   183 ELPEHFDARDKWGPLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLLS 242

Query:   186 CSTDNN-GCSGGLMDKAFEYIIENKGLATEADYPY----QQEQGTC 226
             C+     GC GG +D+A+ YI    G+  +  YPY     +E G C
Sbjct:   243 CNQHRQKGCEGGYLDRAWWYI-RKLGVVGDHCYPYVSGQSREPGHC 287

 Score = 130 (50.8 bits), Expect = 1.6e-16, Sum P(2) = 1.6e-16
 Identities = 38/129 (29%), Positives = 54/129 (41%)

Query:   219 YQQEQGT-CDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRF--- 274
             Y   QG  C    + + A  +     +   +E    + +T  PV          F +   
Sbjct:   294 YTNRQGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGG 353

Query:   275 -YKRGVLNAECGDNCD----HGVAVVGFGTAEEEDGA--KYWLIKNSWGETWGESGYIRI 327
              Y+   L A+ G +      H V V+G+G  +   G   KYWL  NSWG  WGE GY ++
Sbjct:   354 VYQHSDLAAQKGASSVAEGYHSVRVLGWGV-DHSTGKPIKYWLCANSWGTQWGEDGYFKV 412

Query:   328 LRDEGLCGI 336
             LR E  C I
Sbjct:   413 LRGENHCEI 421


>UNIPROTKB|F1PIF2
            symbol:CTSZ "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0060441 "epithelial tube branching involved
            in lung morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0005783 GO:GO:0005615 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GO:GO:0060441 GeneTree:ENSGT00560000076599 OMA:QCGTCTE
            EMBL:AAEX03014054 Ensembl:ENSCAFT00000019357 Uniprot:F1PIF2
        Length = 261

 Score = 205 (77.2 bits), Expect = 1.7e-16, P = 1.7e-16
 Identities = 68/215 (31%), Positives = 105/215 (48%)

Query:   151 HCGSCWAFSAVAAVEGITQIT--GG-KLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIE 207
             +CGSCWA  + +A+     I   G      LS Q ++DC+   + C GG     + Y  E
Sbjct:    46 YCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVLDCANAGS-CEGGNDLPVWSYAHE 104

Query:   208 NKGLATEADYPYQ---QE-----Q-GTCDKQKEKAAAAT-----IGKYEDLPKGDEHALL 253
             + G+  E    YQ   QE     Q GTC + KE  A        +G Y  L  G E  + 
Sbjct:   105 H-GIPDETCNNYQAKDQECNKFNQCGTCTEFKECHAIQNYTLWRVGDYGSL-SGREKMMA 162

Query:   254 QAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNC--DHGVAVVGFGTAEEEDGAKYWLI 311
             +     P+S  + A+ +   +   G ++AE  +    +H ++VVG+G +   DG +YW++
Sbjct:   163 EIYANGPISCGIMATEKMVNY--TGGIHAEYQEQAYINHVISVVGWGVS---DGTEYWIV 217

Query:   312 KNSWGETWGESGYIRILRDEGLCGIATEASYPVAM 346
             +NSWGE WGE G++RI+      G    ASY +A+
Sbjct:   218 RNSWGEPWGERGWMRIVTSTYKDGKG--ASYNLAV 250

 Score = 115 (45.5 bits), Expect = 0.00025, P = 0.00025
 Identities = 35/114 (30%), Positives = 54/114 (47%)

Query:   124 KYQNVTDVPTSIDWREKGAVTHI---KNQG---HCGSCWAFSAVAAVEGITQIT--GG-K 174
             +Y + +D+P S DWR    V +    +NQ    +CGSCWA  + +A+     I   G   
Sbjct:    13 EYLSPSDLPKSWDWRNVNGVNYASATRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWP 72

Query:   175 LIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDK 228
                LS Q ++DC+   + C GG     + Y  E+ G+  E    YQ +   C+K
Sbjct:    73 STLLSVQHVLDCANAGS-CEGGNDLPVWSYAHEH-GIPDETCNNYQAKDQECNK 124


>DICTYBASE|DDB_G0283921
            symbol:ctsB "cathepsin B precursor" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0283921 GO:GO:0005615
            GenomeReviews:CM000153_GR GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:AAFI02000058
            eggNOG:NOG315657 PANTHER:PTHR12411:SF16 OMA:CSLSCQS
            RefSeq:XP_638805.1 HSSP:P07688 MEROPS:C01.A59
            EnsemblProtists:DDB0233997 GeneID:8624329 KEGG:ddi:DDB_G0283921
            Uniprot:Q54QD9
        Length = 311

 Score = 215 (80.7 bits), Expect = 1.9e-16, P = 1.9e-16
 Identities = 66/231 (28%), Positives = 101/231 (43%)

Query:   129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
             T      +W     ++ I+NQ  CGSCWAF A  +      I   + ++LS   +V C  
Sbjct:    81 TSFNAQTNWPNCTTISQIQNQARCGSCWAFGATESATDRLCIHNNENVQLSFMDMVTCDE 140

Query:   189 DNNGCSGGLMDKAFEYIIENKGLATEA-DY------PYQQ------EQGTCDKQKEKAAA 235
              +NGC GG    A+ ++ +   ++ E   Y      P QQ         +C K+ +  ++
Sbjct:   141 TDNGCEGGDAFSAWNWLRKQGAVSEECLPYTIPTCPPAQQPCLNFVNTPSCTKECQSNSS 200

Query:   236 ATIGKYED-LPK-----GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECG-DNC 288
                 + +  + K      DE  + + VT  PV  C     + F  YK GV     G D  
Sbjct:   201 LIYSQDKHKMAKIYSFDSDEAIMQEIVTNGPVEACFTVF-EDFLAYKSGVYVHTTGKDLG 259

Query:   289 DHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATE 339
              H V +VGFGT    +G  Y+   N W  +WG++G   I R  G CGI+ +
Sbjct:   260 GHCVKLVGFGTL---NGVDYYAANNQWTTSWGDNGTFLIKR--GDCGISDD 305

 Score = 142 (55.0 bits), Expect = 2.9e-07, P = 2.9e-07
 Identities = 30/93 (32%), Positives = 46/93 (49%)

Query:   131 VPTSID----WREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
             +PTS +    W     ++ I+NQ  CGSCWAF A  +      I   + ++LS   +V C
Sbjct:    79 IPTSFNAQTNWPNCTTISQIQNQARCGSCWAFGATESATDRLCIHNNENVQLSFMDMVTC 138

Query:   187 STDNNGCSGGLMDKAFEYIIENKGLATEADYPY 219
                +NGC GG    A+ ++   +G  +E   PY
Sbjct:   139 DETDNGCEGGDAFSAWNWL-RKQGAVSEECLPY 170


>UNIPROTKB|H0YDT2
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            EMBL:AP001201 HGNC:HGNC:2546 Ensembl:ENST00000526034 Bgee:H0YDT2
            Uniprot:H0YDT2
        Length = 211

 Score = 176 (67.0 bits), Expect = 2.7e-16, Sum P(2) = 2.7e-16
 Identities = 45/143 (31%), Positives = 68/143 (47%)

Query:    40 EKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEF 99
             E  + +  Q  R+Y    E A RL IF  NL   ++  +E   T + G   FSDLT EEF
Sbjct:    39 EAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEF 98

Query:   100 RASYTGYNXXXXXXXXXXXXXXTFKYQNVTDVPTSIDWRE-KGAVTHIKNQGHCGSCWAF 158
                Y GY               + + +    VP S DWR+   A++ IK+Q +C  CWA 
Sbjct:    99 GQLY-GYRRAAGGVPSMGREIRSEEPEE--SVPFSCDWRKVASAISPIKDQKNCNCCWAM 155

Query:   159 SAVAAVEGITQITGGKLIELSEQ 181
             +A   +E + +I+    +++S Q
Sbjct:   156 AAAGNIETLWRISFWDFVDVSVQ 178

 Score = 47 (21.6 bits), Expect = 2.7e-16, Sum P(2) = 2.7e-16
 Identities = 8/11 (72%), Positives = 10/11 (90%)

Query:   210 GLATEADYPYQ 220
             GLA+E DYP+Q
Sbjct:   180 GLASEKDYPFQ 190


>WB|WBGene00000786
            symbol:cpr-6 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            PANTHER:PTHR12411:SF16 EMBL:L39894 EMBL:L39939 EMBL:FO080666
            PIR:T37274 RefSeq:NP_741818.1 UniGene:Cel.18138
            ProteinModelPortal:P43510 SMR:P43510 DIP:DIP-25139N
            MINT:MINT-1074025 STRING:P43510 MEROPS:C01.A51 PaxDb:P43510
            PRIDE:P43510 EnsemblMetazoa:C25B8.3a GeneID:180931
            KEGG:cel:CELE_C25B8.3 UCSC:C25B8.3a CTD:180931 WormBase:C25B8.3a
            InParanoid:P43510 OMA:KAKWGLM NextBio:911608 ArrayExpress:P43510
            Uniprot:P43510
        Length = 379

 Score = 133 (51.9 bits), Expect = 1.7e-15, Sum P(2) = 1.7e-15
 Identities = 33/93 (35%), Positives = 45/93 (48%)

Query:   246 KGDEHALL-QAVTKQPVSVCVEASGQAFRFYKRGVLNAECGD-NCDHGVAVVGFGTAEEE 303
             K D  A+  + +T  P+ +  E   + F  Y  GV     G     H V ++G+G    +
Sbjct:   260 KDDVEAIQKELMTHGPLEIAFEVY-EDFLNYDGGVYVHTGGKLGGGHAVKLIGWGI---D 315

Query:   304 DGAKYWLIKNSWGETWGESGYIRILRDEGLCGI 336
             DG  YW + NSW   WGE G+ RILR    CGI
Sbjct:   316 DGIPYWTVANSWNTDWGEDGFFRILRGVDECGI 348

 Score = 130 (50.8 bits), Expect = 1.7e-15, Sum P(2) = 1.7e-15
 Identities = 33/95 (34%), Positives = 53/95 (55%)

Query:   130 DVPTSID----WREKGAVTHIKNQGHCGSCWAFSAVAAV-EGITQITGGKL-IELSEQQL 183
             D+P S D    W +  ++  I++Q  CGSCWAF AV A+ + I   + G+L + LS   L
Sbjct:   104 DIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDL 163

Query:   184 VDCSTDNN-GCSGGLMDKAFEYIIENKGLATEADY 217
             + C      GC+GG    A+ Y +++ G+ T ++Y
Sbjct:   164 LSCCKSCGFGCNGGDPLAAWRYWVKD-GIVTGSNY 197


>UNIPROTKB|P05689
            symbol:CTSZ "Cathepsin Z" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:BC122603
            EMBL:X01809 IPI:IPI00708474 PIR:A29172 RefSeq:NP_001071303.1
            UniGene:Bt.4902 ProteinModelPortal:P05689 SMR:P05689 MEROPS:C01.013
            PRIDE:P05689 GeneID:404187 KEGG:bta:404187 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 InParanoid:P05689 KO:K08568
            OrthoDB:EOG42Z4QN BRENDA:3.4.18.1 NextBio:20817615 Uniprot:P05689
        Length = 304

 Score = 207 (77.9 bits), Expect = 2.2e-15, P = 2.2e-15
 Identities = 65/197 (32%), Positives = 94/197 (47%)

Query:   151 HCGSCWAFSAVAAVEGITQIT--GG-KLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIE 207
             +CGSCWA  + +A+     I   G      LS Q ++DC  D   C GG     +EY   
Sbjct:    89 YCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCG-DAGSCEGGNDLPVWEYA-H 146

Query:   208 NKGLATEADYPYQ---QE-----Q-GTCDKQKEKAAAAT-----IGKYEDLPKGDEHALL 253
               G+  E    YQ   QE     Q GTC + KE           +G Y  L  G E  + 
Sbjct:   147 RHGIPDETCNNYQAKDQECDKFNQCGTCTEFKECHVIKNYTLWKVGDYGSL-SGREKMMA 205

Query:   254 QAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNC--DHGVAVVGFGTAEEEDGAKYWLI 311
             +  T  P+S  + A+ +    Y  G+ + E  D    +H V+V G+G +   DG +YW++
Sbjct:   206 EIYTNGPISCGIMAT-EKMSNYTGGIYS-EYNDQAFINHIVSVAGWGVS---DGMEYWIV 260

Query:   312 KNSWGETWGESGYIRIL 328
             +NSWGE WGE G++RI+
Sbjct:   261 RNSWGEPWGEHGWMRIV 277

 Score = 127 (49.8 bits), Expect = 1.5e-05, P = 1.5e-05
 Identities = 37/114 (32%), Positives = 52/114 (45%)

Query:   124 KYQNVTDVPTSIDWREKGAVTHI---KNQG---HCGSCWAFSAVAAVEGITQIT--GG-K 174
             +Y + +D+P S DWR    V +    +NQ    +CGSCWA  + +A+     I   G   
Sbjct:    56 EYLSPSDLPKSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWP 115

Query:   175 LIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDK 228
                LS Q ++DC  D   C GG     +EY     G+  E    YQ +   CDK
Sbjct:   116 STLLSVQHVIDCG-DAGSCEGGNDLPVWEYA-HRHGIPDETCNNYQAKDQECDK 167


>WB|WBGene00010204
            symbol:F57F5.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0006898
            "receptor-mediated endocytosis" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0009792 GO:GO:0006898
            GO:GO:0040007 GO:GO:0002119 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0040011
            EMBL:Z75953 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            PANTHER:PTHR12411:SF16 RefSeq:NP_506011.2 ProteinModelPortal:Q20950
            SMR:Q20950 DIP:DIP-24447N IntAct:Q20950 MINT:MINT-211137
            STRING:Q20950 MEROPS:C01.A42 EnsemblMetazoa:F57F5.1 GeneID:179645
            KEGG:cel:CELE_F57F5.1 UCSC:F57F5.1 CTD:179645 WormBase:F57F5.1
            OMA:ADDINAC Uniprot:Q20950
        Length = 351

 Score = 144 (55.7 bits), Expect = 3.6e-15, Sum P(2) = 3.6e-15
 Identities = 32/84 (38%), Positives = 43/84 (51%)

Query:   254 QAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCD-HGVAVVGFGTAEEEDGAKYWLIK 312
             + +T  PV V      + F  Y  GV     G +   H V ++G+G    ++G  YWL  
Sbjct:   261 EIMTHGPVEVAFTVY-EDFEHYSGGVYVHTAGASLGGHAVKMLGWGV---DNGTPYWLCA 316

Query:   313 NSWGETWGESGYIRILRDEGLCGI 336
             NSW E WGE+GY RI+R    CGI
Sbjct:   317 NSWNEDWGENGYFRIIRGVNECGI 340

 Score = 113 (44.8 bits), Expect = 3.6e-15, Sum P(2) = 3.6e-15
 Identities = 29/95 (30%), Positives = 47/95 (49%)

Query:   131 VPTSID----WREKGAVTHIKNQGHCGSCWAFSAVAAV-EGITQITGGK-LIELSEQQLV 184
             VP S D    W    +++ I++Q  CGSCWA SA   + + I   +  K ++ +S   + 
Sbjct:    97 VPDSFDSRTAWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNAKTILSISADDIN 156

Query:   185 DCS--TDNNGCSGGLMDKAFEYIIENKGLATEADY 217
              C      NGC+GG   +A+ + ++ KG  T   Y
Sbjct:   157 ACCGMVCGNGCNGGYPIEAWRHYVK-KGYVTGGSY 190


>UNIPROTKB|Q9UBR2
            symbol:CTSZ "Cathepsin Z" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0060441 "epithelial tube
            branching involved in lung morphogenesis" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            Reactome:REACT_11123 Reactome:REACT_17015 InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 EMBL:CH471077 GO:GO:0005615 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AL109840 GO:GO:0060441 MEROPS:C01.013 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN
            BRENDA:3.4.18.1 EMBL:AF073890 EMBL:AF032906 EMBL:AF136273
            EMBL:AF136276 EMBL:AF136274 EMBL:AF136275 EMBL:AK314931
            EMBL:BC042168 EMBL:AF009923 IPI:IPI00002745 RefSeq:NP_001327.2
            UniGene:Hs.252549 PDB:1DEU PDB:1EF7 PDBsum:1DEU PDBsum:1EF7
            ProteinModelPortal:Q9UBR2 SMR:Q9UBR2 STRING:Q9UBR2 DMDM:12643324
            PaxDb:Q9UBR2 PeptideAtlas:Q9UBR2 PRIDE:Q9UBR2 DNASU:1522
            Ensembl:ENST00000217131 GeneID:1522 KEGG:hsa:1522 UCSC:uc002yai.2
            GeneCards:GC20M057570 HGNC:HGNC:2547 HPA:CAB025114 MIM:603169
            neXtProt:NX_Q9UBR2 PharmGKB:PA27043 InParanoid:Q9UBR2 OMA:QCGTCTE
            PhylomeDB:Q9UBR2 BindingDB:Q9UBR2 ChEMBL:CHEMBL4160 ChiTaRS:CTSZ
            EvolutionaryTrace:Q9UBR2 GenomeRNAi:1522 NextBio:6299 Bgee:Q9UBR2
            CleanEx:HS_CTSZ Genevestigator:Q9UBR2 GermOnline:ENSG00000101160
            Uniprot:Q9UBR2
        Length = 303

 Score = 205 (77.2 bits), Expect = 4.3e-15, P = 4.3e-15
 Identities = 64/197 (32%), Positives = 98/197 (49%)

Query:   151 HCGSCWAFSAVAAVEGITQIT--GG-KLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIE 207
             +CGSCWA ++ +A+     I   G      LS Q ++DC    + C GG     ++Y  +
Sbjct:    88 YCGSCWAHASTSAMADRINIKRKGAWPSTLLSVQNVIDCGNAGS-CEGGNDLSVWDYAHQ 146

Query:   208 NKGLATEADYPYQ---QE-----Q-GTCDKQKEKAAAAT-----IGKYEDLPKGDEHALL 253
             + G+  E    YQ   QE     Q GTC++ KE  A        +G Y  L  G E  + 
Sbjct:   147 H-GIPDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSL-SGREKMMA 204

Query:   254 QAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNC--DHGVAVVGFGTAEEEDGAKYWLI 311
             +     P+S  + A+ +    Y  G+  AE  D    +H V+V G+G +   DG +YW++
Sbjct:   205 EIYANGPISCGIMAT-ERLANYTGGIY-AEYQDTTYINHVVSVAGWGIS---DGTEYWIV 259

Query:   312 KNSWGETWGESGYIRIL 328
             +NSWGE WGE G++RI+
Sbjct:   260 RNSWGEPWGERGWLRIV 276

 Score = 120 (47.3 bits), Expect = 9.3e-05, P = 9.3e-05
 Identities = 35/114 (30%), Positives = 54/114 (47%)

Query:   124 KYQNVTDVPTSIDWREKGAVTHI---KNQG---HCGSCWAFSAVAAVEGITQIT--GG-K 174
             +Y +  D+P S DWR    V +    +NQ    +CGSCWA ++ +A+     I   G   
Sbjct:    55 EYLSPADLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWP 114

Query:   175 LIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDK 228
                LS Q ++DC    + C GG     ++Y  ++ G+  E    YQ +   CDK
Sbjct:   115 STLLSVQNVIDCGNAGS-CEGGNDLSVWDYAHQH-GIPDETCNNYQAKDQECDK 166


>WB|WBGene00000782
            symbol:cpr-2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 eggNOG:NOG315657 GeneTree:ENSGT00560000076599
            HOGENOM:HOG000241341 PANTHER:PTHR12411:SF16 EMBL:Z81531
            RefSeq:NP_507186.3 ProteinModelPortal:O45466 SMR:O45466
            MEROPS:C01.A40 PaxDb:O45466 EnsemblMetazoa:F36D3.9 GeneID:185355
            KEGG:cel:CELE_F36D3.9 CTD:185355 WormBase:F36D3.9 OMA:FDARLRW
            Uniprot:O45466
        Length = 326

 Score = 143 (55.4 bits), Expect = 4.3e-15, Sum P(2) = 4.3e-15
 Identities = 31/66 (46%), Positives = 35/66 (53%)

Query:   272 FRFYKRGVLNAECG-DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
             F  YK G+     G     H V ++G+GT   E G  YWL  NSWG  WGESG  RILR 
Sbjct:   254 FEKYKSGIYRHIAGRSKGGHAVKLIGWGT---ERGTPYWLAVNSWGSQWGESGTFRILRG 310

Query:   331 EGLCGI 336
                CGI
Sbjct:   311 VDECGI 316

 Score = 112 (44.5 bits), Expect = 4.3e-15, Sum P(2) = 4.3e-15
 Identities = 27/85 (31%), Positives = 42/85 (49%)

Query:   137 WREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQIT--GGKLIELSEQQLVDCS--TDNNG 192
             W +  ++  I+ Q +CGSCWAFS    +   T I   G +   +S   L+ C   +   G
Sbjct:    93 WPQCKSMKLIREQSNCGSCWAFSTAEVISDRTCIASNGTQQPIISPTDLLTCCGMSCGEG 152

Query:   193 CSGGLMDKAFEYIIENKGLATEADY 217
             C GG   +AF++    +G+ T  DY
Sbjct:   153 CDGGFPYRAFQWWAR-RGVVTGGDY 176


>UNIPROTKB|F1MW68
            symbol:CTSZ "Cathepsin Z" species:9913 "Bos taurus"
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0060441
            GeneTree:ENSGT00560000076599 IPI:IPI00708474 UniGene:Bt.4902
            OMA:QCGTCTE EMBL:DAAA02036315 PRIDE:F1MW68
            Ensembl:ENSBTAT00000025007 Uniprot:F1MW68
        Length = 304

 Score = 205 (77.2 bits), Expect = 4.5e-15, P = 4.5e-15
 Identities = 65/197 (32%), Positives = 94/197 (47%)

Query:   151 HCGSCWAFSAVAAVEGITQIT--GG-KLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIE 207
             +CGSCWA  + +A+     I   G      LS Q ++DC  D   C GG     +EY   
Sbjct:    89 YCGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVLDCG-DAGSCEGGNDLPVWEYA-H 146

Query:   208 NKGLATEADYPYQ---QE-----Q-GTCDKQKEKAAAAT-----IGKYEDLPKGDEHALL 253
               G+  E    YQ   QE     Q GTC + KE           +G Y  L  G E  + 
Sbjct:   147 RHGIPDETCNNYQAKDQECDKFNQCGTCTEFKECHVIKNYTLWKVGDYGSL-SGREKMMA 205

Query:   254 QAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNC--DHGVAVVGFGTAEEEDGAKYWLI 311
             +  T  P+S  + A+ +    Y  G+ + E  D    +H V+V G+G +   DG +YW++
Sbjct:   206 EIYTNGPISCGIMAT-EKMSNYTGGIYS-EYNDQAFINHIVSVAGWGVS---DGMEYWIV 260

Query:   312 KNSWGETWGESGYIRIL 328
             +NSWGE WGE G++RI+
Sbjct:   261 RNSWGEPWGEHGWMRIV 277

 Score = 125 (49.1 bits), Expect = 2.5e-05, P = 2.5e-05
 Identities = 37/114 (32%), Positives = 52/114 (45%)

Query:   124 KYQNVTDVPTSIDWREKGAVTHI---KNQG---HCGSCWAFSAVAAVEGITQIT--GG-K 174
             +Y + +D+P S DWR    V +    +NQ    +CGSCWA  + +A+     I   G   
Sbjct:    56 EYLSPSDLPKSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWP 115

Query:   175 LIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDK 228
                LS Q ++DC  D   C GG     +EY     G+  E    YQ +   CDK
Sbjct:   116 STLLSVQHVLDCG-DAGSCEGGNDLPVWEYA-HRHGIPDETCNNYQAKDQECDK 167


>MGI|MGI:1891190
            symbol:Ctsz "cathepsin Z" species:10090 "Mus musculus"
            [GO:0005615 "extracellular space" evidence=ISO] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1891190 GO:GO:0005783 GO:GO:0005615 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0060441 GeneTree:ENSGT00560000076599 MEROPS:C01.013 CTD:1522
            HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN OMA:QCGTCTE
            ChiTaRS:CTSZ EMBL:AJ242663 EMBL:AF136277 EMBL:AF136278
            EMBL:BC008619 IPI:IPI00986833 RefSeq:NP_071720.1 UniGene:Mm.156919
            ProteinModelPortal:Q9WUU7 SMR:Q9WUU7 IntAct:Q9WUU7 STRING:Q9WUU7
            PaxDb:Q9WUU7 PRIDE:Q9WUU7 Ensembl:ENSMUST00000016400 GeneID:64138
            KEGG:mmu:64138 InParanoid:Q9WUU7 NextBio:319927 Bgee:Q9WUU7
            CleanEx:MM_CTSZ Genevestigator:Q9WUU7 GermOnline:ENSMUSG00000016256
            Uniprot:Q9WUU7
        Length = 306

 Score = 205 (77.2 bits), Expect = 4.8e-15, P = 4.8e-15
 Identities = 67/215 (31%), Positives = 103/215 (47%)

Query:   151 HCGSCWAFSAVAAVEGITQIT--GG-KLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIE 207
             +CGSCWA  + +A+     I   G    I LS Q ++DC    + C GG     +EY  +
Sbjct:    90 YCGSCWAHGSTSAMADRINIKRKGAWPSILLSVQNVIDCGNAGS-CEGGNDLPVWEYAHK 148

Query:   208 NKGLATEADYPYQ-QEQ--------GTCDKQKEKAAAAT-----IGKYEDLPKGDEHALL 253
             + G+  E    YQ ++Q        GTC + KE           +G Y  L  G E  + 
Sbjct:   149 H-GIPDETCNNYQAKDQDCDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSL-SGREKMMA 206

Query:   254 QAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNC--DHGVAVVGFGTAEEEDGAKYWLI 311
             +     P+S  + A+ +    Y  G+  AE  D    +H ++V G+G +   DG +YW++
Sbjct:   207 EIYANGPISCGIMAT-EMMSNYTGGIY-AEHQDQAVINHIISVAGWGVSN--DGIEYWIV 262

Query:   312 KNSWGETWGESGYIRILRDEGLCGIATEASYPVAM 346
             +NSWGE WGE G++RI+      G  T  SY +A+
Sbjct:   263 RNSWGEPWGEKGWMRIVTSTYKGG--TGDSYNLAI 295

 Score = 123 (48.4 bits), Expect = 4.3e-05, P = 4.3e-05
 Identities = 36/114 (31%), Positives = 54/114 (47%)

Query:   124 KYQNVTDVPTSIDWREKGAVTHI---KNQG---HCGSCWAFSAVAAVEGITQIT--GG-K 174
             +Y +  D+P + DWR    V +    +NQ    +CGSCWA  + +A+     I   G   
Sbjct:    57 EYLSPADLPKNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWP 116

Query:   175 LIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDK 228
              I LS Q ++DC    + C GG     +EY  ++ G+  E    YQ +   CDK
Sbjct:   117 SILLSVQNVIDCGNAGS-CEGGNDLPVWEYAHKH-GIPDETCNNYQAKDQDCDK 168


>ZFIN|ZDB-GENE-060503-240 [details] [associations]
            symbol:tinagl1 "tubulointerstitial nephritis
            antigen-like 1" species:7955 "Danio rerio" [GO:0005044 "scavenger
            receptor activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0030414 "peptidase inhibitor activity" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0002040 "sprouting
            angiogenesis" evidence=IMP] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR008037 InterPro:IPR013128 Pfam:PF00112 Pfam:PF05375
            PROSITE:PS00524 PROSITE:PS50958 SMART:SM00201 SMART:SM00645
            ZFIN:ZDB-GENE-060503-240 GO:GO:0006955 GO:GO:0030247 GO:GO:0030414
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 GO:GO:0002040
            GO:GO:0005044 GeneTree:ENSGT00560000076599 GO:GO:0010466
            SUPFAM:SSF57283 HOVERGEN:HBG053961 MEROPS:C01.975 OMA:DNCNRCT
            EMBL:BX950864 IPI:IPI00609339 UniGene:Dr.103937
            Ensembl:ENSDART00000087096 Ensembl:ENSDART00000126228
            InParanoid:Q1LUC6 Uniprot:Q1LUC6
        Length = 471

 Score = 130 (50.8 bits), Expect = 1.3e-14, Sum P(2) = 1.3e-14
 Identities = 32/104 (30%), Positives = 56/104 (53%)

Query:   141 GAVTHIKNQGHCGSCWAFSAVA-AVEGIT-QITGGKLIELSEQQLVDCSTDN-NGCSGGL 197
             G +    +QG+C + WAFS  A A + I+ Q  G    +LS Q L+ C T + +GC+GG 
Sbjct:   212 GKIHEPLDQGNCNASWAFSTAAVASDRISIQSMGHMTPQLSPQNLISCDTRHQDGCAGGR 271

Query:   198 MDKAFEYIIENKGLATEADYPYQQ-EQGTCDKQKEKAAAATIGK 240
             +D A+ + +  +G+ T+  YP+   EQ   +  +    +  +G+
Sbjct:   272 IDGAW-WFMRRRGVVTQDCYPFSPPEQSAVEVARCMMQSRAVGR 314

 Score = 128 (50.1 bits), Expect = 1.3e-14, Sum P(2) = 1.3e-14
 Identities = 31/103 (30%), Positives = 48/103 (46%)

Query:   248 DEHALLQAVTKQ-PVSVCVEASGQAFRFYKRGVLN---------AECGDNCDHGVAVVGF 297
             +E+ +++ +    PV   +E   + F  YK G+           ++   +  H V + G+
Sbjct:   345 NENEIMKEIMDNGPVQAIMEVH-EDFFVYKSGIFRHTDVNYHKPSQYRKHATHSVRITGW 403

Query:   298 GTAEEEDGA--KYWLIKNSWGETWGESGYIRILRDEGLCGIAT 338
             G   +  G   KYW+  NSWG+ WGE GY RI R    C I T
Sbjct:   404 GEERDYSGRTRKYWIGANSWGKNWGEDGYFRIARGVNECDIET 446


>DICTYBASE|DDB_G0292462 [details] [associations]
            symbol:DDB_G0292462 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0292462 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            EMBL:AAFI02000190 RefSeq:XP_629634.1 MEROPS:C01.A56
            EnsemblProtists:DDB0184413 GeneID:8628698 KEGG:ddi:DDB_G0292462
            InParanoid:Q54D62 OMA:NTQVESH Uniprot:Q54D62
        Length = 323

 Score = 203 (76.5 bits), Expect = 1.4e-14, P = 1.4e-14
 Identities = 68/230 (29%), Positives = 99/230 (43%)

Query:   122 TFKYQNVTDVPTSIDWREK--GAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIE-- 177
             ++    +  +P S D R      ++ ++ Q  CGSCWA      +     I   K I+  
Sbjct:    37 SYSQNELDTIPASFDVRTNWGDCMSPVREQQSCGSCWAQVTSGILADRMCIESDKNIKML 96

Query:   178 LSEQQLVDCS----TD-----NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQG---- 224
             LS Q L+DC     +D     NNGC GG +  A   +I N+G+ ++    YQ  +     
Sbjct:    97 LSPQYLMDCDGSCVSDGVSGCNNGCKGGFVGLALTRLI-NEGIVSDECLSYQASKDSSCP 155

Query:   225 -TCDKQKEKAAAATIGKYED---LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVL 280
              TCD      +  TI K       P   + A  + +T  PV +        F+ +K  V 
Sbjct:   156 TTCD-DGSPISNTTIYKATSCRAFPTVQD-AQYEIMTNGPV-IATFMLYSDFKPHKWDVY 212

Query:   281 NAECGDNCD-HGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR 329
                     + H V VVG+GT    DG  YW+  NSWG  WG+ GY +I R
Sbjct:   213 IKSSNTQVESHAVRVVGWGTTS--DGVDYWIAANSWGTGWGDKGYFKIRR 260


>ZFIN|ZDB-GENE-041010-139 [details] [associations]
            symbol:ctsz "cathepsin Z" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0001525 "angiogenesis"
            evidence=IMP] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 ZFIN:ZDB-GENE-041010-139 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0001525
            CTD:1522 HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568
            OrthoDB:EOG42Z4QN UniGene:Dr.935 eggNOG:NOG275763 EMBL:BC083369
            IPI:IPI00483065 RefSeq:NP_001006043.1 ProteinModelPortal:Q5XJD4
            SMR:Q5XJD4 STRING:Q5XJD4 GeneID:450022 KEGG:dre:450022
            InParanoid:Q5XJD4 NextBio:20833005 ArrayExpress:Q5XJD4
            Uniprot:Q5XJD4
        Length = 301

 Score = 201 (75.8 bits), Expect = 1.4e-14, P = 1.4e-14
 Identities = 57/195 (29%), Positives = 91/195 (46%)

Query:   151 HCGSCWAFSAVAAVE---GITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIE 207
             +CGSCWA  + +A+     I +        LS Q ++DC  D   CSGG     +EY   
Sbjct:    80 YCGSCWAHGSTSALADRINIKRKAAWPSAYLSVQNVIDCG-DAGSCSGGDHSGVWEYA-H 137

Query:   208 NKGLATEADYPYQQEQGTCDKQKEKAAAATIG-----KYEDLPKGDEHALLQAVTKQ--- 259
             NKG+  E    YQ +   C    +     T G     K   L K  ++     + K    
Sbjct:   138 NKGIPDETCNNYQAKDQDCKPFNQCGTCTTFGVCNIVKNFTLWKVGDYGSASGLDKMKAE 197

Query:   260 -----PVSVCVEASGQAFRFYKRGVLNAECGDN-CDHGVAVVGFGTAEEEDGAKYWLIKN 313
                  P+S  + A+ +    Y  G+ +    +   +H V+V G+G   +E+G ++W+++N
Sbjct:   198 IYSGGPISCGIMATDK-LDAYTGGLYSEYVQEPYINHIVSVAGWGV--DENGVEFWVVRN 254

Query:   314 SWGETWGESGYIRIL 328
             SWGE WGE G++RI+
Sbjct:   255 SWGEPWGEKGWLRIV 269

 Score = 140 (54.3 bits), Expect = 4.6e-07, P = 4.6e-07
 Identities = 39/126 (30%), Positives = 57/126 (45%)

Query:   123 FKYQNVTDVPTSIDWRE-KGA--VTHIKNQG---HCGSCWAFSAVAAVE---GITQITGG 173
             ++  N+ ++P   DWR  KG   V+  +NQ    +CGSCWA  + +A+     I +    
Sbjct:    46 YESMNLKELPKEWDWRNIKGVNYVSTTRNQHIPQYCGSCWAHGSTSALADRINIKRKAAW 105

Query:   174 KLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKA 233
                 LS Q ++DC  D   CSGG     +EY   NKG+  E    YQ +   C    +  
Sbjct:   106 PSAYLSVQNVIDCG-DAGSCSGGDHSGVWEYA-HNKGIPDETCNNYQAKDQDCKPFNQCG 163

Query:   234 AAATIG 239
                T G
Sbjct:   164 TCTTFG 169


>WB|WBGene00000789 [details] [associations]
            symbol:cpz-2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GeneTree:ENSGT00560000076599 KO:K08568 EMBL:Z81103
            HSSP:P80067 PIR:T23720 RefSeq:NP_506318.1 ProteinModelPortal:P92005
            SMR:P92005 STRING:P92005 MEROPS:C01.A41 PaxDb:P92005
            EnsemblMetazoa:M04G12.2 GeneID:179818 KEGG:cel:CELE_M04G12.2
            UCSC:M04G12.2 CTD:179818 WormBase:M04G12.2 eggNOG:NOG275763
            InParanoid:P92005 OMA:VEYWIAR NextBio:906990 Uniprot:P92005
        Length = 467

 Score = 205 (77.2 bits), Expect = 3.2e-14, P = 3.2e-14
 Identities = 50/193 (25%), Positives = 90/193 (46%)

Query:   151 HCGSCWAFSAVAAVEGITQITG-GK--LIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIE 207
             +CGSCW F    A+     +   G+  + +LS Q+++DC+   N C GG +    E+  +
Sbjct:   247 YCGSCWVFGTTGALNDRFNVARKGRWPMTQLSPQEIIDCNGKGN-CQGGEIGNVLEHA-K 304

Query:   208 NKGLATEADYPYQQEQGTCDKQ--------KEKAAAATIGKYEDLPKGDEHA---LLQAV 256
              +GL  E    Y+   G C+           E  +     +Y     G       ++  +
Sbjct:   305 IQGLVEEGCNVYRATNGECNPYHRCGSCWPNECFSLTNYTRYYVKDYGQVQGRDKIMSEI 364

Query:   257 TKQPVSVCVEASGQAFRF-YKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSW 315
              K     C   + + F + Y +GV + +     +H +++ G+G   +E+G +YW+ +NSW
Sbjct:   365 KKGGPIACAIGATKKFEYEYVKGVYSEKSDLESNHIISLTGWGV--DENGVEYWIARNSW 422

Query:   316 GETWGESGYIRIL 328
             GE WGE G+ R++
Sbjct:   423 GEAWGELGWFRVV 435

 Score = 135 (52.6 bits), Expect = 4.2e-06, P = 4.2e-06
 Identities = 33/107 (30%), Positives = 52/107 (48%)

Query:   130 DVPTSIDWREKGAVTHI---KNQG---HCGSCWAFSAVAAVEGITQITG-GK--LIELSE 180
             D+PT  DWR    V +    +NQ    +CGSCW F    A+     +   G+  + +LS 
Sbjct:   220 DLPTGWDWRNVSGVNYCSPTRNQHIPVYCGSCWVFGTTGALNDRFNVARKGRWPMTQLSP 279

Query:   181 QQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCD 227
             Q+++DC+   N C GG +    E+  + +GL  E    Y+   G C+
Sbjct:   280 QEIIDCNGKGN-CQGGEIGNVLEHA-KIQGLVEEGCNVYRATNGECN 324


>RGD|1359482 [details] [associations]
            symbol:Tinag "tubulointerstitial nephritis antigen"
            species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0005604 "basement membrane"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=IEA] [GO:0006955
            "immune response" evidence=IEA] [GO:0007155 "cell adhesion"
            evidence=ISO] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR001212 InterPro:IPR013128
            Pfam:PF00112 Pfam:PF01033 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 RGD:1359482 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GO:GO:0007155 EMBL:CH473954 GO:GO:0005604
            GO:GO:0005044 MEROPS:C01.973 CTD:27283 eggNOG:NOG310046
            HOGENOM:HOG000241342 HOVERGEN:HBG053961 OMA:WGQLTSS
            OrthoDB:EOG47PX5P EMBL:BC081887 IPI:IPI00370427
            RefSeq:NP_001005549.1 UniGene:Rn.43851 STRING:Q66HF6
            Ensembl:ENSRNOT00000041567 GeneID:300846 KEGG:rno:300846
            UCSC:RGD:1359482 InParanoid:Q66HF6 NextBio:647630
            Genevestigator:Q66HF6 Uniprot:Q66HF6
        Length = 475

 Score = 127 (49.8 bits), Expect = 6.1e-14, Sum P(2) = 6.1e-14
 Identities = 30/81 (37%), Positives = 45/81 (55%)

Query:   148 NQGHCGSCWAFS--AVAAVEGITQITGGKLIELSEQQLVDCSTDN-NGCSGGLMDKAFEY 204
             +Q +C + WAFS  +VAA     Q  G     LS Q L+ C   N +GC+ G +D+A+ +
Sbjct:   235 DQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRAW-W 293

Query:   205 IIENKGLATEADYPYQQEQGT 225
              +  +GL + A YP  +EQ T
Sbjct:   294 FLRKRGLVSHACYPLFKEQST 314

 Score = 125 (49.1 bits), Expect = 6.1e-14, Sum P(2) = 6.1e-14
 Identities = 28/97 (28%), Positives = 46/97 (47%)

Query:   244 LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCD---------HGVAV 294
             +   +   + + +   PV   ++   + F +YK G+       N +         H V +
Sbjct:   356 ISSNETEIMREIIQNGPVQAIMQVH-EDFFYYKTGIYRHVVSTNEEPEKYRKLRTHAVKL 414

Query:   295 VGFGTAEEEDGAK--YWLIKNSWGETWGESGYIRILR 329
              G+GT     G K  +W+  NSWG++WGE+GY RILR
Sbjct:   415 TGWGTLRGAQGKKEKFWIAANSWGKSWGENGYFRILR 451


>TAIR|locus:2060420 [details] [associations]
            symbol:AT2G22160 "AT2G22160" species:3702 "Arabidopsis
            thaliana" [GO:0005575 "cellular_component" evidence=ND] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] EMBL:CP002685
            GenomeReviews:CT485783_GR InterPro:IPR013201 Pfam:PF08246
            SMART:SM00848 EMBL:AC007168 IPI:IPI00544896 PIR:F84609
            RefSeq:NP_179806.1 UniGene:At.66231 HSSP:P25774
            ProteinModelPortal:Q9SIE8 SMR:Q9SIE8 EnsemblPlants:AT2G22160.1
            GeneID:816750 KEGG:ath:AT2G22160 TAIR:At2g22160 eggNOG:NOG297278
            InParanoid:Q9SIE8 OMA:HRCITLA PhylomeDB:Q9SIE8 ArrayExpress:Q9SIE8
            Genevestigator:Q9SIE8 Uniprot:Q9SIE8
        Length = 105

 Score = 183 (69.5 bits), Expect = 6.3e-14, P = 6.3e-14
 Identities = 38/86 (44%), Positives = 55/86 (63%)

Query:    65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNXXXXXXXXXXXXXXTFK 124
             +FK+N EYI K NKE  + YKL  N+F++LT+ EF  ++T ++               F 
Sbjct:    17 VFKKNAEYIVKTNKE-RKPYKLKLNKFANLTDVEFVNAHTCFDMSDHKKILDSK---PFF 72

Query:   125 YQNVTDVPTSIDWREKGAVTHIKNQG 150
             Y+N+T  P S+DWREKGAVT++K+QG
Sbjct:    73 YENMTQAPDSLDWREKGAVTNVKDQG 98


>RGD|708479 [details] [associations]
            symbol:Ctsz "cathepsin Z" species:10116 "Rattus norvegicus"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=TAS]
            [GO:0005615 "extracellular space" evidence=IEA;ISO] [GO:0005783
            "endoplasmic reticulum" evidence=IEA;ISO] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0060441 "epithelial tube branching involved in
            lung morphogenesis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:708479 GO:GO:0005576 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004197 MEROPS:C01.013 CTD:1522 HOVERGEN:HBG004456 KO:K08568
            EMBL:AB023781 EMBL:BC091110 IPI:IPI00207663 RefSeq:NP_899159.1
            UniGene:Rn.1475 ProteinModelPortal:Q9R1T3 SMR:Q9R1T3 PRIDE:Q9R1T3
            GeneID:252929 KEGG:rno:252929 BindingDB:Q9R1T3 NextBio:624097
            Genevestigator:Q9R1T3 Uniprot:Q9R1T3
        Length = 306

 Score = 194 (73.4 bits), Expect = 1.5e-13, P = 1.5e-13
 Identities = 63/214 (29%), Positives = 100/214 (46%)

Query:   151 HCGSCWAFSAVAAVEGITQIT--GG-KLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIE 207
             +CGSCWA  + +A+     I   G      LS Q ++DC    + C GG     +EY  +
Sbjct:    90 YCGSCWAHGSTSALADRINIKRKGAWPSTLLSVQNVIDCGNAGS-CEGGNDLPVWEYAHK 148

Query:   208 NKGLATEADYPYQ-QEQ--------GTCDKQKEKAAAAT-----IGKYEDLPKGDEHALL 253
             + G+  E    YQ ++Q        GTC + KE           +G Y  L  G E  + 
Sbjct:   149 H-GIPDETCNNYQAKDQECDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSL-SGREKMMA 206

Query:   254 QAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNC-DHGVAVVGFGTAEEEDGAKYWLIK 312
             +     P+S  + A+ +    Y  G+          +H ++V G+G +   DG +YW+++
Sbjct:   207 EIYANGPISCGIMAT-ERMSNYTGGIYTEYQNQAIINHIISVAGWGVSN--DGIEYWIVR 263

Query:   313 NSWGETWGESGYIRILRDEGLCGIATEASYPVAM 346
             NSWGE WGE G++RI+      G  T +SY +A+
Sbjct:   264 NSWGEPWGERGWMRIVTSTYKGG--TGSSYNLAI 295

 Score = 118 (46.6 bits), Expect = 0.00016, P = 0.00016
 Identities = 35/114 (30%), Positives = 53/114 (46%)

Query:   124 KYQNVTDVPTSIDWREKGAVTHI---KNQG---HCGSCWAFSAVAAVEGITQIT--GG-K 174
             +Y +  D+P + DWR    V +    +NQ    +CGSCWA  + +A+     I   G   
Sbjct:    57 EYLSPADLPKNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSALADRINIKRKGAWP 116

Query:   175 LIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDK 228
                LS Q ++DC    + C GG     +EY  ++ G+  E    YQ +   CDK
Sbjct:   117 STLLSVQNVIDCGNAGS-CEGGNDLPVWEYAHKH-GIPDETCNNYQAKDQECDK 168


>UNIPROTKB|I3L9E7 [details] [associations]
            symbol:LOC100153159 "Uncharacterized protein" species:9823
            "Sus scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GeneTree:ENSGT00560000076599 OMA:WGQLTSS
            Ensembl:ENSSSCT00000031207 Uniprot:I3L9E7
        Length = 358

 Score = 123 (48.4 bits), Expect = 1.8e-13, Sum P(2) = 1.8e-13
 Identities = 21/42 (50%), Positives = 29/42 (69%)

Query:   290 HGVAVVGFGTAEEEDGAK--YWLIKNSWGETWGESGYIRILR 329
             H V + G+GT +   G K  +W+  NSWG++WGE+GY RILR
Sbjct:   293 HAVKLTGWGTLKGAQGRKEKFWIAANSWGKSWGENGYFRILR 334

 Score = 121 (47.7 bits), Expect = 1.8e-13, Sum P(2) = 1.8e-13
 Identities = 29/79 (36%), Positives = 44/79 (55%)

Query:   148 NQGHCGSCWAFS--AVAAVEGITQITGGKLIELSEQQLVDCSTDN-NGCSGGLMDKAFEY 204
             +Q +C + WAFS  +VAA     Q  G     LS Q L+ C   N +GC+ G +D+A+ Y
Sbjct:   118 DQKNCAASWAFSTASVAADRIAIQSEGRYTANLSPQNLISCCAKNRHGCNSGSIDRAWWY 177

Query:   205 IIENKGLATEADYPYQQEQ 223
             +   +GL + A YP  ++Q
Sbjct:   178 L-RKRGLVSHACYPLFKDQ 195


>DICTYBASE|DDB_G0283401 [details] [associations]
            symbol:ctsZ "cathepsin Z precursor" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            dictyBase:DDB_G0283401 GO:GO:0005615 GenomeReviews:CM000153_GR
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 EMBL:AAFI02000055 KO:K08568 OMA:QCGTCTE
            eggNOG:NOG275763 RefSeq:XP_639036.1 ProteinModelPortal:Q54R55
            IntAct:Q54R55 MEROPS:C01.A60 PRIDE:Q54R55
            EnsemblProtists:DDB0233836 GeneID:8624061 KEGG:ddi:DDB_G0283401
            InParanoid:Q54R55 Uniprot:Q54R55
        Length = 296

 Score = 192 (72.6 bits), Expect = 2.0e-13, P = 2.0e-13
 Identities = 55/215 (25%), Positives = 102/215 (47%)

Query:   151 HCGSCWAFSAVAAVEGITQITGGKL---IELSEQQLVDCSTDNNGCSGGLMDKAFEYIIE 207
             +CG CWAF++ +++    +I        + ++ Q L+DC+     C GG    AF +I E
Sbjct:    84 YCGGCWAFASTSSISDRIKIQRKAAFPDVNVAPQHLIDCNGGGT-CDGGDPGDAFAFINE 142

Query:   208 NKGLATEADYPYQQEQ---------GTCDKQKEKAAAA-----TIGKYEDLPKGDEHALL 253
             N G+  E   PYQ +           TC+      A       T+ +Y  + +G +  + 
Sbjct:   143 N-GIVDETCKPYQAKNLPDECSPACKTCNPDGTCQAIPVHTNITVTEYGSV-RGAKDMMA 200

Query:   254 QAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNC-DHGVAVVGFGTAEEEDGAKYWLIK 312
             +   + P++  ++A+ +    Y  G+      D   +H ++V+G+G    +D   YW+++
Sbjct:   201 EIYARGPIACSIDATSK-LEAYTSGIFKEFKLDPLPNHIISVIGWGV---QDSTPYWIVR 256

Query:   313 NSWGETWGESGYIRILRD---EGLCGIATEASYPV 344
             NSWG  +GE G+  I++    E L GI  + ++ V
Sbjct:   257 NSWGSYYGEGGFFNIVQGSLFENL-GIELDCNWAV 290

 Score = 134 (52.2 bits), Expect = 2.2e-06, P = 2.2e-06
 Identities = 33/100 (33%), Positives = 51/100 (51%)

Query:   130 DVPTSIDWREKGAVTHI---KNQG---HCGSCWAFSAVAAVEGITQITGGKL---IELSE 180
             +VP S DWR    V ++   +NQ    +CG CWAF++ +++    +I        + ++ 
Sbjct:    57 EVPQSWDWRNVSGVNYLTMNRNQHIPQYCGGCWAFASTSSISDRIKIQRKAAFPDVNVAP 116

Query:   181 QQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQ 220
             Q L+DC+     C GG    AF +I EN G+  E   PYQ
Sbjct:   117 QHLIDCNGGGT-CDGGDPGDAFAFINEN-GIVDETCKPYQ 154


>WB|WBGene00000783 [details] [associations]
            symbol:cpr-3 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0009792 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39890 EMBL:L39925 EMBL:Z81119
            EMBL:Z82057 PIR:T37282 RefSeq:NP_506790.1 UniGene:Cel.23503
            ProteinModelPortal:P43507 SMR:P43507 MEROPS:C01.A33
            EnsemblMetazoa:T10H4.12 GeneID:180033 KEGG:cel:CELE_T10H4.12
            UCSC:T10H4.12 CTD:180033 WormBase:T10H4.12 eggNOG:NOG240190
            InParanoid:P43507 OMA:PVEASYK NextBio:907824 Uniprot:P43507
        Length = 370

 Score = 129 (50.5 bits), Expect = 2.2e-13, Sum P(2) = 2.2e-13
 Identities = 31/76 (40%), Positives = 41/76 (53%)

Query:   265 VEASGQA---FRFYKRGVLNAECGDNCD-HGVAVVGFGTAEEEDGAKYWLIKNSWGETWG 320
             VEAS +    F  YK GV +   G     H V ++G+G    E+G  YWLI NSWG ++G
Sbjct:   255 VEASYKVYEDFYHYKSGVYHYTSGKLVGGHAVKIIGWGV---ENGVDYWLIANSWGTSFG 311

Query:   321 ESGYIRILRDEGLCGI 336
             E G+ +I R    C I
Sbjct:   312 EKGFFKIRRGTNECQI 327

 Score = 114 (45.2 bits), Expect = 2.2e-13, Sum P(2) = 2.2e-13
 Identities = 30/95 (31%), Positives = 44/95 (46%)

Query:   131 VPTSIDWREK----GAVTHIKNQGHCGSCWAFSAVAAVEG--ITQITGGKLIELSEQQLV 184
             +P + D REK      +  I+NQ  CGSCWAF A   +      Q  G +   +S + ++
Sbjct:    92 LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 151

Query:   185 DC--STDNNGCSGGLMDKAFEYIIENKGLATEADY 217
              C  +T   GC GG   +A  +   + G  T  DY
Sbjct:   152 SCCGTTCGYGCKGGYSIEALRFWASS-GAVTGGDY 185


>UNIPROTKB|Q9UJW2 [details] [associations]
            symbol:TINAG "Tubulointerstitial nephritis antigen"
            species:9606 "Homo sapiens" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0007155 "cell adhesion"
            evidence=IDA] [GO:0005604 "basement membrane" evidence=IDA]
            [GO:0000166 "nucleotide binding" evidence=TAS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR001212 InterPro:IPR013128
            Pfam:PF00112 Pfam:PF01033 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0006955 EMBL:CH471081
            GO:GO:0000166 GO:GO:0030247 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0007155 GO:GO:0005604
            GO:GO:0004197 GO:GO:0005044 EMBL:AL359380 MEROPS:C01.973 CTD:27283
            eggNOG:NOG310046 HOGENOM:HOG000241342 HOVERGEN:HBG053961
            OMA:WGQLTSS EMBL:AB022277 EMBL:AF195116 EMBL:AF195117 EMBL:AK312918
            EMBL:AL589946 IPI:IPI00099386 IPI:IPI00478705 PIR:JC7189
            RefSeq:NP_055279.3 UniGene:Hs.127011 ProteinModelPortal:Q9UJW2
            SMR:Q9UJW2 IntAct:Q9UJW2 STRING:Q9UJW2 PhosphoSite:Q9UJW2
            DMDM:212276468 PRIDE:Q9UJW2 DNASU:27283 Ensembl:ENST00000259782
            GeneID:27283 KEGG:hsa:27283 UCSC:uc003pcj.2 GeneCards:GC06P054220
            H-InvDB:HIX0025004 HGNC:HGNC:14599 HPA:HPA035427 MIM:606749
            neXtProt:NX_Q9UJW2 PharmGKB:PA37905 InParanoid:Q9UJW2
            PhylomeDB:Q9UJW2 GenomeRNAi:27283 NextBio:50212 ArrayExpress:Q9UJW2
            Bgee:Q9UJW2 CleanEx:HS_TINAG Genevestigator:Q9UJW2
            GermOnline:ENSG00000137251 Uniprot:Q9UJW2
        Length = 476

 Score = 122 (48.0 bits), Expect = 6.0e-13, Sum P(2) = 6.0e-13
 Identities = 21/42 (50%), Positives = 28/42 (66%)

Query:   290 HGVAVVGFGTAEEEDGAK--YWLIKNSWGETWGESGYIRILR 329
             H V + G+GT     G K  +W+  NSWG++WGE+GY RILR
Sbjct:   411 HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILR 452

 Score = 121 (47.7 bits), Expect = 6.0e-13, Sum P(2) = 6.0e-13
 Identities = 29/79 (36%), Positives = 44/79 (55%)

Query:   148 NQGHCGSCWAFS--AVAAVEGITQITGGKLIELSEQQLVDCSTDN-NGCSGGLMDKAFEY 204
             +Q +C + WAFS  +VAA     Q  G     LS Q L+ C   N +GC+ G +D+A+ Y
Sbjct:   236 DQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRAWWY 295

Query:   205 IIENKGLATEADYPYQQEQ 223
             +   +GL + A YP  ++Q
Sbjct:   296 L-RKRGLVSHACYPLFKDQ 313


>UNIPROTKB|E2QXH3 [details] [associations]
            symbol:TINAGL1 "Uncharacterized protein" species:9615
            "Canis lupus familiaris" [GO:0043236 "laminin binding"
            evidence=IEA] [GO:0031012 "extracellular matrix" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006955 "immune response" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044 "scavenger
            receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005737
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639 GO:GO:0031012
            GO:GO:0005044 GeneTree:ENSGT00560000076599 CTD:64129 OMA:DNCNRCT
            EMBL:AAEX03001668 RefSeq:XP_535330.3 Ensembl:ENSCAFT00000035659
            GeneID:478155 KEGG:cfa:478155 NextBio:20853523 Uniprot:E2QXH3
        Length = 467

 Score = 124 (48.7 bits), Expect = 8.7e-13, Sum P(2) = 8.7e-13
 Identities = 36/119 (30%), Positives = 55/119 (46%)

Query:   131 VPTSIDWREK--GAVTHIKNQGHCGSCWAFSAVAAVEGITQITG-GKLIE-LSEQQLVDC 186
             +PT+ +  EK    +    +QG+C   WAFS  A       I   G +   LS Q L+ C
Sbjct:   203 LPTAFEAAEKWPNLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSC 262

Query:   187 STDNN-GCSGGLMDKAFEYIIENKGLATEADYPY----QQEQGTCDKQKEKAAAATIGK 240
              T N  GC GG +D A+ + +  +G+ ++  YP+    Q E G   +    + A   GK
Sbjct:   263 DTHNQQGCRGGRLDGAW-WFLRRRGVVSDHCYPFVGREQDEAGPAPRCMMHSRAMGRGK 320

 Score = 117 (46.2 bits), Expect = 8.7e-13, Sum P(2) = 8.7e-13
 Identities = 22/49 (44%), Positives = 26/49 (53%)

Query:   290 HGVAVVGFGTAEEEDGA--KYWLIKNSWGETWGESGYIRILRDEGLCGI 336
             H V + G+G     DG   KYW   NSWG  WGE G+ RI+R    C I
Sbjct:   400 HSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDI 448


>MGI|MGI:2137617 [details] [associations]
            symbol:Tinagl1 "tubulointerstitial nephritis antigen-like 1"
            species:10090 "Mus musculus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            [GO:0043236 "laminin binding" evidence=IPI] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 MGI:MGI:2137617
            GO:GO:0005737 GO:GO:0005576 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GO:GO:0031012 CleanEx:MM_ARG1 GO:GO:0005044
            GeneTree:ENSGT00560000076599 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OMA:DNCNRCT
            OrthoDB:EOG4BG8W0 EMBL:AB047402 EMBL:AB050626 EMBL:BC005738
            EMBL:BC018539 IPI:IPI00115458 RefSeq:NP_001161805.1
            RefSeq:NP_075965.2 UniGene:Mm.15801 ProteinModelPortal:Q99JR5
            SMR:Q99JR5 STRING:Q99JR5 PhosphoSite:Q99JR5 PaxDb:Q99JR5
            PRIDE:Q99JR5 Ensembl:ENSMUST00000030560 Ensembl:ENSMUST00000105998
            Ensembl:ENSMUST00000105999 GeneID:94242 KEGG:mmu:94242
            InParanoid:Q99JR5 NextBio:352247 Bgee:Q99JR5 Genevestigator:Q99JR5
            GermOnline:ENSMUSG00000028776 Uniprot:Q99JR5
        Length = 466

 Score = 122 (48.0 bits), Expect = 9.1e-13, Sum P(2) = 9.1e-13
 Identities = 32/103 (31%), Positives = 48/103 (46%)

Query:   248 DEHALLQAVTKQ-PVSVCVEASGQAFRFYKRGVLN---------AECGDNCDHGVAVVGF 297
             DE  +++ + +  PV   +E   + F  Y+RG+ +          +   +  H V + G+
Sbjct:   348 DEKEIMKELMENGPVQALMEVH-EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGW 406

Query:   298 GTAEEEDGA--KYWLIKNSWGETWGESGYIRILRDEGLCGIAT 338
             G     DG   KYW   NSWG  WGE G+ RI+R    C I T
Sbjct:   407 GEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIET 449

 Score = 119 (46.9 bits), Expect = 9.1e-13, Sum P(2) = 9.1e-13
 Identities = 31/99 (31%), Positives = 50/99 (50%)

Query:   131 VPTSIDWREK--GAVTHIKNQGHCGSCWAFSAVAAVEGITQITG-GKLIE-LSEQQLVDC 186
             +PT+ +  EK    +    +QG+C   WAFS  A       I   G +   LS Q L+ C
Sbjct:   202 LPTAFEASEKWPNLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSC 261

Query:   187 STDNN-GCSGGLMDKAFEYIIENKGLATEADYPYQ-QEQ 223
              T +  GC GG +D A+ + +  +G+ ++  YP+  +EQ
Sbjct:   262 DTHHQQGCRGGRLDGAW-WFLRRRGVVSDNCYPFSGREQ 299


>UNIPROTKB|Q3SZI1 [details] [associations]
            symbol:TINAG "Tubulointerstitial nephritis antigen"
            species:9913 "Bos taurus" [GO:0005604 "basement membrane"
            evidence=IEA] [GO:0007155 "cell adhesion" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044
            "scavenger receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 Pfam:PF01033
            PROSITE:PS00524 PROSITE:PS50958 SMART:SM00201 SMART:SM00645
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0007155
            GO:GO:0005604 GO:GO:0005044 GeneTree:ENSGT00560000076599
            EMBL:BC102843 IPI:IPI00689615 RefSeq:NP_001030279.1
            UniGene:Bt.29080 ProteinModelPortal:Q3SZI1 MEROPS:C01.973
            PRIDE:Q3SZI1 Ensembl:ENSBTAT00000016790 GeneID:512517
            KEGG:bta:512517 CTD:27283 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 InParanoid:Q3SZI1 OMA:WGQLTSS OrthoDB:EOG47PX5P
            NextBio:20870427 Uniprot:Q3SZI1
        Length = 476

 Score = 123 (48.4 bits), Expect = 1.5e-12, Sum P(2) = 1.5e-12
 Identities = 30/94 (31%), Positives = 46/94 (48%)

Query:   248 DEHALLQAVTKQ-PVSVCVEASGQAFRFYKRGVLNAECGDNCD---------HGVAVVGF 297
             +E  +++ + +  PV   ++     F  YK G+       N D         H V + G+
Sbjct:   360 NETEIMREIMQNGPVQAIMQVHEDFFN-YKTGIYRHITSTNEDSEKYRKFRTHAVKLTGW 418

Query:   298 GTAEEEDGAK--YWLIKNSWGETWGESGYIRILR 329
             GT     G K  +W+  NSWG++WGE+GY RILR
Sbjct:   419 GTLRGAQGQKEKFWIAANSWGKSWGENGYFRILR 452

 Score = 116 (45.9 bits), Expect = 1.5e-12, Sum P(2) = 1.5e-12
 Identities = 28/79 (35%), Positives = 44/79 (55%)

Query:   148 NQGHCGSCWAFS--AVAAVEGITQITGGKLIELSEQQLVDC-STDNNGCSGGLMDKAFEY 204
             +Q +C + WAFS  +VAA     Q  G     LS Q L+ C +   +GC+ G +D+A+ Y
Sbjct:   236 DQKNCAASWAFSTASVAADRIAIQSQGRYTANLSPQNLISCCAKKRHGCNSGSVDRAWWY 295

Query:   205 IIENKGLATEADYPYQQEQ 223
             +   +GL + A YP  ++Q
Sbjct:   296 L-RKRGLVSHACYPLFKDQ 313


>RGD|70956 [details] [associations]
            symbol:Tinagl1 "tubulointerstitial nephritis antigen-like 1"
            species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA;ISO] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            [GO:0031012 "extracellular matrix" evidence=IEA;ISO] [GO:0043236
            "laminin binding" evidence=IEA;ISO] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 RGD:70956 GO:GO:0005737
            GO:GO:0005576 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0031012 GO:GO:0005044 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OrthoDB:EOG4BG8W0
            EMBL:AB050717 IPI:IPI00190428 RefSeq:NP_446034.1 UniGene:Rn.1256
            ProteinModelPortal:Q9EQT5 PRIDE:Q9EQT5 GeneID:94174 KEGG:rno:94174
            UCSC:RGD:70956 InParanoid:Q9EQT5 NextBio:617830 ArrayExpress:Q9EQT5
            Genevestigator:Q9EQT5 GermOnline:ENSRNOG00000013179 Uniprot:Q9EQT5
        Length = 467

 Score = 120 (47.3 bits), Expect = 1.6e-12, Sum P(2) = 1.6e-12
 Identities = 32/103 (31%), Positives = 48/103 (46%)

Query:   248 DEHALLQAVTKQ-PVSVCVEASGQAFRFYKRGVLN---------AECGDNCDHGVAVVGF 297
             DE  +++ + +  PV   +E   + F  Y+RG+ +          +   +  H V + G+
Sbjct:   349 DEKEIMKELMENGPVQALMEVH-EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGW 407

Query:   298 GTAEEEDGA--KYWLIKNSWGETWGESGYIRILRDEGLCGIAT 338
             G     DG   KYW   NSWG  WGE G+ RI+R    C I T
Sbjct:   408 GEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGINECDIET 450

 Score = 119 (46.9 bits), Expect = 1.6e-12, Sum P(2) = 1.6e-12
 Identities = 31/99 (31%), Positives = 50/99 (50%)

Query:   131 VPTSIDWREK--GAVTHIKNQGHCGSCWAFSAVAAVEGITQITG-GKLIE-LSEQQLVDC 186
             +PT+ +  EK    +    +QG+C   WAFS  A       I   G +   LS Q L+ C
Sbjct:   202 LPTAFEASEKWPNLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSC 261

Query:   187 STDNN-GCSGGLMDKAFEYIIENKGLATEADYPYQ-QEQ 223
              T +  GC GG +D A+ + +  +G+ ++  YP+  +EQ
Sbjct:   262 DTHHQKGCRGGRLDGAW-WFLRRRGVVSDNCYPFSGREQ 299


>UNIPROTKB|Q9EQT5
            symbol:Tinagl1 "Tubulointerstitial nephritis antigen-like"
            species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 RGD:70956 GO:GO:0005737
            GO:GO:0005576 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0031012 GO:GO:0005044 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OrthoDB:EOG4BG8W0
            EMBL:AB050717 IPI:IPI00190428 RefSeq:NP_446034.1 UniGene:Rn.1256
            ProteinModelPortal:Q9EQT5 PRIDE:Q9EQT5 GeneID:94174 KEGG:rno:94174
            UCSC:RGD:70956 InParanoid:Q9EQT5 NextBio:617830 ArrayExpress:Q9EQT5
            Genevestigator:Q9EQT5 GermOnline:ENSRNOG00000013179 Uniprot:Q9EQT5
        Length = 467

 Score = 120 (47.3 bits), Expect = 1.6e-12, Sum P(2) = 1.6e-12
 Identities = 32/103 (31%), Positives = 48/103 (46%)

Query:   248 DEHALLQAVTKQ-PVSVCVEASGQAFRFYKRGVLN---------AECGDNCDHGVAVVGF 297
             DE  +++ + +  PV   +E   + F  Y+RG+ +          +   +  H V + G+
Sbjct:   349 DEKEIMKELMENGPVQALMEVH-EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGW 407

Query:   298 GTAEEEDGA--KYWLIKNSWGETWGESGYIRILRDEGLCGIAT 338
             G     DG   KYW   NSWG  WGE G+ RI+R    C I T
Sbjct:   408 GEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGINECDIET 450

 Score = 119 (46.9 bits), Expect = 1.6e-12, Sum P(2) = 1.6e-12
 Identities = 31/99 (31%), Positives = 50/99 (50%)

Query:   131 VPTSIDWREK--GAVTHIKNQGHCGSCWAFSAVAAVEGITQITG-GKLIE-LSEQQLVDC 186
             +PT+ +  EK    +    +QG+C   WAFS  A       I   G +   LS Q L+ C
Sbjct:   202 LPTAFEASEKWPNLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSC 261

Query:   187 STDNN-GCSGGLMDKAFEYIIENKGLATEADYPYQ-QEQ 223
              T +  GC GG +D A+ + +  +G+ ++  YP+  +EQ
Sbjct:   262 DTHHQKGCRGGRLDGAW-WFLRRRGVVSDNCYPFSGREQ 299

WARNING:  HSPs involving 50 database sequences were not reported due to the
          limiting value of parameter B = 250.


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.314   0.131   0.394    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      346       319   0.00084  116 3  11 23  0.45    34
                                                     33  0.45    37
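
Note on the bit scores reported with each HSP: they appear to follow the
standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2. The
sketch below assumes the second "As Used" row (Q=9,R=2; Lambda = 0.244,
K = 0.0300) holds the gapped parameters applied to these alignments. That
assumption is not stated in the report, but it reproduces the values printed
above. The function name bit_score is illustrative only.

```python
import math


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Assumed gapped parameters, taken from the Q=9,R=2 row of this report.
LAMBDA = 0.244
K = 0.0300

# Raw scores of the HSPs shown above.
for raw in (123, 120, 119, 116):
    print(f"Score = {raw} ({bit_score(raw, LAMBDA, K):.1f} bits)")
```

Using the ungapped BLOSUM62 row instead (Lambda = 0.314, K = 0.131) gives
noticeably larger bit values than those printed, which is why the gapped row
is assumed here.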


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  300
  No. of states in DFA:  621 (66 KB)
  Total size of DFA:  250 KB (2133 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  25.89u 0.12s 26.01t   Elapsed:  00:00:01
  Total cpu time:  25.94u 0.12s 26.06t   Elapsed:  00:00:01
  Start:  Tue May 21 04:34:57 2013   End:  Tue May 21 04:34:58 2013
WARNINGS ISSUED:  2
