BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>041120
MQHRLFIAIYTNLHLKIAIDMRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEE
RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIS
TYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINK
LKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQ
TDKTKHHAVTITGYEAIPARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVK
NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKRC
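
As a quick sanity check on the query record above, the following minimal Python sketch parses the FASTA text and counts residues. The function name `read_fasta` and the variable `QUERY_FASTA` are illustrative assumptions, not part of the BLAST service; the sequence is copied from the record above.

```python
def read_fasta(text):
    """Parse FASTA-formatted text into a dict mapping header -> sequence."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new record starts; drop the leading '>' from the header.
            header = line[1:]
            records[header] = []
        elif header is not None:
            # Sequence lines are concatenated without the line breaks.
            records[header].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


# Query record copied from the "Query Sequence" section (hypothetical variable name).
QUERY_FASTA = """>041120
MQHRLFIAIYTNLHLKIAIDMRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEE
RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIS
TYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINK
LKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQ
TDKTKHHAVTITGYEAIPARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVK
NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKRC
"""

records = read_fasta(QUERY_FASTA)
query = records["041120"]
print(len(query))
```

The printed length can be compared with the "(340 letters)" figure that the raw report gives for this query.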

High Scoring Gene Products

Symbol, full name Information P value
CEP1
cysteine endopeptidase 1
protein from Arabidopsis thaliana 1.1e-74
XCP2
xylem cysteine peptidase 2
protein from Arabidopsis thaliana 2.3e-70
XBCP3
xylem bark cysteine peptidase 3
protein from Arabidopsis thaliana 9.8e-70
AT2G27420 protein from Arabidopsis thaliana 2.0e-67
AT3G49340 protein from Arabidopsis thaliana 6.8e-65
AT1G06260 protein from Arabidopsis thaliana 1.1e-64
AT1G29080 protein from Arabidopsis thaliana 3.3e-63
AT1G29090 protein from Arabidopsis thaliana 1.4e-60
CTSL2
Uncharacterized protein
protein from Gallus gallus 7.8e-60
CTSS
Cathepsin S
protein from Canis lupus familiaris 5.4e-59
cprC
cysteine proteinase 3
gene from Dictyostelium discoideum 6.9e-59
cprB
cysteine proteinase 2
gene from Dictyostelium discoideum 1.4e-58
CTSS
Cathepsin S
protein from Canis lupus familiaris 1.8e-58
CTSK
Cathepsin K
protein from Sus scrofa 2.3e-58
CTSL1
Cathepsin L1
protein from Canis lupus familiaris 2.9e-58
CTSK
Cathepsin K
protein from Bos taurus 6.1e-58
CTSK
Cathepsin K
protein from Canis lupus familiaris 6.1e-58
CTSK
Cathepsin K
protein from Canis lupus familiaris 6.1e-58
CTSK
Cathepsin K
protein from Homo sapiens 6.1e-58
CTSL1
Cathepsin L1
protein from Canis lupus familiaris 1.3e-57
CTSL1
Cathepsin L1
protein from Sus scrofa 1.6e-57
Cp1
Cysteine proteinase-1
protein from Drosophila melanogaster 1.6e-57
CTSL1
Cathepsin L1
protein from Bos taurus 2.0e-57
Ctsk
cathepsin K
gene from Rattus norvegicus 2.6e-57
CTSS
Uncharacterized protein
protein from Sus scrofa 2.6e-57
CTSS
Cathepsin S
protein from Bos taurus 6.9e-57
AT3G43960 protein from Arabidopsis thaliana 1.1e-56
Ctsk
cathepsin K
protein from Mus musculus 1.4e-56
CTSL2
Cathepsin L2
protein from Bos taurus 1.8e-56
Ctsll3
cathepsin L-like 3
gene from Rattus norvegicus 2.9e-56
Cys
Crustapain
protein from Pandalus borealis 4.8e-56
ctsl1a
cathepsin L, 1 a
gene_product from Danio rerio 6.1e-56
CTSL1
Cathepsin L1
protein from Homo sapiens 7.8e-56
CTSL2
Cathepsin L2
protein from Homo sapiens 9.9e-56
RGD1308751
similar to Cathepsin L precursor (Major excreted protein) (MEP)
gene from Rattus norvegicus 1.6e-55
Ctss
cathepsin S
protein from Mus musculus 1.6e-55
wu:fb37b09 gene_product from Danio rerio 2.6e-55
ctssb.2
cathepsin S, b.2
gene_product from Danio rerio 4.2e-55
CTSL1
CTSL1 protein
protein from Bos taurus 1.1e-54
ctsll
cathepsin L, like
gene_product from Danio rerio 2.3e-54
cprD
cysteine proteinase 4
gene from Dictyostelium discoideum 2.6e-54
zgc:174153 gene_product from Danio rerio 2.9e-54
ctsl1b
cathepsin L, 1 b
gene_product from Danio rerio 2.9e-54
zgc:174855 gene_product from Danio rerio 3.7e-54
cprG
cysteine proteinase 7
gene from Dictyostelium discoideum 5.4e-54
cfaD
peptidase C1A family protein
gene from Dictyostelium discoideum 1.3e-53
cpl-1 gene from Caenorhabditis elegans 1.3e-53
CTSH
Pro-cathepsin H
protein from Bos taurus 2.6e-53
Ctsh
cathepsin H
gene from Rattus norvegicus 2.6e-53
CTSK
Cathepsin K
protein from Gallus gallus 2.6e-53
cprF
cysteine proteinase 6
gene from Dictyostelium discoideum 4.7e-53
CTSH
Pro-cathepsin H
protein from Sus scrofa 5.4e-53
ctssb.1
cathepsin S, b.1
gene_product from Danio rerio 5.4e-53
ctsk
cathepsin K
gene_product from Danio rerio 6.9e-53
CTSL2
Uncharacterized protein
protein from Gallus gallus 1.8e-52
Ctss
cathepsin S
gene from Rattus norvegicus 2.3e-52
ctsl.1
cathepsin L.1
gene_product from Danio rerio 2.3e-52
CTSH
Uncharacterized protein
protein from Callithrix jacchus 2.9e-52
XCP1
xylem cysteine peptidase 1
protein from Arabidopsis thaliana 4.4e-52
Ctsh
cathepsin H
protein from Mus musculus 4.8e-52
cprE
cysteine proteinase 5
gene from Dictyostelium discoideum 6.1e-52
ALP
aleurain-like protease
protein from Arabidopsis thaliana 6.1e-52
CTSH
Uncharacterized protein
protein from Macaca mulatta 9.9e-52
CTSH
Uncharacterized protein
protein from Callithrix jacchus 9.9e-52
CTSH
Uncharacterized protein
protein from Oryctolagus cuniculus 1.6e-51
CTSH
Uncharacterized protein
protein from Nomascus leucogenys 2.0e-51
RD21B
responsive to dehydration 21B
protein from Arabidopsis thaliana 3.1e-51
RD19
RESPONSIVE TO DEHYDRATION 19
protein from Arabidopsis thaliana 4.2e-51
CTSH
Pro-cathepsin H
protein from Homo sapiens 4.2e-51
CTSH
Uncharacterized protein
protein from Gorilla gorilla gorilla 4.2e-51
CTSL1
Cathepsin L1
protein from Gallus gallus 4.2e-51
R09F10.1 gene from Caenorhabditis elegans 8.7e-51
Ctsj
cathepsin J
protein from Mus musculus 8.7e-51
D3ZZR3
Uncharacterized protein
protein from Rattus norvegicus 1.1e-50
ctsh
cathepsin H
gene_product from Danio rerio 1.8e-50
LOC100662496
Uncharacterized protein
protein from Loxodonta africana 1.8e-50
CTSH
Uncharacterized protein
protein from Ailuropoda melanoleuca 1.8e-50
4930486L24Rik
RIKEN cDNA 4930486L24 gene
protein from Mus musculus 2.3e-50
Ctsj
cathepsin J
gene from Rattus norvegicus 6.0e-50
AT3G45310 protein from Arabidopsis thaliana 1.3e-49
zgc:110239 gene_product from Danio rerio 2.0e-49
LOC420160
Uncharacterized protein
protein from Gallus gallus 2.0e-49
CTSH
Uncharacterized protein
protein from Canis lupus familiaris 2.6e-49
Cat-1
Cathepsin L-like proteinase
protein from Fasciola hepatica 3.3e-49
SAG12
senescence-associated gene 12
protein from Arabidopsis thaliana 4.1e-49
AT2G21430 protein from Arabidopsis thaliana 4.2e-49
Testin
testin gene
gene from Rattus norvegicus 4.2e-49
tag-196 gene from Caenorhabditis elegans 1.1e-48
CTSH
Uncharacterized protein
protein from Equus caballus 1.4e-48
RD21A
responsive to dehydration 21A
protein from Arabidopsis thaliana 1.8e-48
CTSS
Uncharacterized protein
protein from Gallus gallus 2.9e-48
cprA
cysteine proteinase 1
gene from Dictyostelium discoideum 6.0e-48
AT4G16190 protein from Arabidopsis thaliana 2.0e-47
AT3G19390 protein from Arabidopsis thaliana 6.9e-47
MGC114246
similar to cathepsin R
gene from Rattus norvegicus 1.1e-46
CTSF
Uncharacterized protein
protein from Sus scrofa 1.4e-46
AT3G19400 protein from Arabidopsis thaliana 1.8e-46
cprH
cysteine proteinase 8
gene from Dictyostelium discoideum 2.3e-46

The BLAST search returned 2 gene products which did not match your query constraints. Please see the full BLAST report below for the details.

Raw BLAST Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  041120
        (340 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2157712 - symbol:CEP1 "cysteine endopeptidase ...   517  1.1e-74   2
TAIR|locus:2030427 - symbol:XCP2 "xylem cysteine peptidas...   500  2.3e-70   2
TAIR|locus:2024362 - symbol:XBCP3 "xylem bark cysteine pe...   481  9.8e-70   2
TAIR|locus:2038588 - symbol:AT2G27420 species:3702 "Arabi...   480  2.0e-67   2
TAIR|locus:2082881 - symbol:AT3G49340 species:3702 "Arabi...   452  6.8e-65   2
TAIR|locus:2038515 - symbol:AT1G06260 species:3702 "Arabi...   659  1.1e-64   1
TAIR|locus:2029934 - symbol:AT1G29080 species:3702 "Arabi...   450  3.3e-63   2
TAIR|locus:2029924 - symbol:AT1G29090 species:3702 "Arabi...   418  1.4e-60   2
UNIPROTKB|F1NYJ1 - symbol:CTSL2 "Uncharacterized protein"...   437  7.8e-60   2
UNIPROTKB|F1PAK0 - symbol:CTSS "Cathepsin S" species:9615...   408  5.4e-59   2
DICTYBASE|DDB_G0283867 - symbol:cprC "cysteine proteinase...   413  6.9e-59   2
DICTYBASE|DDB_G0279799 - symbol:cprB "cysteine proteinase...   431  1.4e-58   3
UNIPROTKB|Q8HY81 - symbol:CTSS "Cathepsin S" species:9615...   403  1.8e-58   2
UNIPROTKB|Q9GLE3 - symbol:CTSK "Cathepsin K" species:9823...   413  2.3e-58   2
UNIPROTKB|Q9GL24 - symbol:CTSL1 "Cathepsin L1" species:96...   425  2.9e-58   2
UNIPROTKB|Q5E968 - symbol:CTSK "Cathepsin K" species:9913...   415  6.1e-58   2
UNIPROTKB|G1K2A7 - symbol:CTSK "Cathepsin K" species:9615...   410  6.1e-58   2
UNIPROTKB|Q3ZKN1 - symbol:CTSK "Cathepsin K" species:9615...   410  6.1e-58   2
UNIPROTKB|P43235 - symbol:CTSK "Cathepsin K" species:9606...   410  6.1e-58   2
UNIPROTKB|F1PMM9 - symbol:CTSL1 "Cathepsin L1" species:96...   411  1.3e-57   2
UNIPROTKB|Q28944 - symbol:CTSL1 "Cathepsin L1" species:98...   425  1.6e-57   2
FB|FBgn0013770 - symbol:Cp1 "Cysteine proteinase-1" speci...   379  1.6e-57   2
UNIPROTKB|P25975 - symbol:CTSL1 "Cathepsin L1" species:99...   423  2.0e-57   2
RGD|61810 - symbol:Ctsk "cathepsin K" species:10116 "Ratt...   396  2.6e-57   2
UNIPROTKB|F1SS93 - symbol:CTSS "Uncharacterized protein" ...   395  2.6e-57   2
UNIPROTKB|P25326 - symbol:CTSS "Cathepsin S" species:9913...   399  6.9e-57   2
TAIR|locus:2097104 - symbol:AT3G43960 species:3702 "Arabi...   432  1.1e-56   2
MGI|MGI:107823 - symbol:Ctsk "cathepsin K" species:10090 ...   396  1.4e-56   2
UNIPROTKB|Q5E998 - symbol:CTSL2 "Cathepsin L2" species:99...   414  1.8e-56   2
RGD|1560071 - symbol:Ctsll3 "cathepsin L-like 3" species:...   399  2.9e-56   2
UNIPROTKB|Q86GF7 - symbol:Cys "Crustapain" species:6703 "...   413  4.8e-56   2
ZFIN|ZDB-GENE-030131-106 - symbol:ctsl1a "cathepsin L, 1 ...   407  6.1e-56   2
UNIPROTKB|P07711 - symbol:CTSL1 "Cathepsin L1" species:96...   411  7.8e-56   2
UNIPROTKB|O60911 - symbol:CTSL2 "Cathepsin L2" species:96...   411  9.9e-56   2
RGD|1308751 - symbol:RGD1308751 "similar to Cathepsin L p...   406  1.6e-55   2
MGI|MGI:107341 - symbol:Ctss "cathepsin S" species:10090 ...   387  1.6e-55   2
ZFIN|ZDB-GENE-030131-572 - symbol:wu:fb37b09 "wu:fb37b09"...   409  2.6e-55   2
ZFIN|ZDB-GENE-050626-55 - symbol:ctssb.2 "cathepsin S, b....   379  4.2e-55   2
UNIPROTKB|A4IFS7 - symbol:CTSL1 "CTSL1 protein" species:9...   409  1.1e-54   2
ZFIN|ZDB-GENE-041010-76 - symbol:ctsll "cathepsin L, like...   400  2.3e-54   2
DICTYBASE|DDB_G0278721 - symbol:cprD "cysteine proteinase...   391  2.6e-54   3
ZFIN|ZDB-GENE-080215-7 - symbol:zgc:174153 "zgc:174153" s...   410  2.9e-54   2
ZFIN|ZDB-GENE-980526-285 - symbol:ctsl1b "cathepsin L, 1 ...   410  2.9e-54   2
ZFIN|ZDB-GENE-071004-74 - symbol:zgc:174855 "zgc:174855" ...   402  3.7e-54   2
DICTYBASE|DDB_G0279187 - symbol:cprG "cysteine proteinase...   396  5.4e-54   3
DICTYBASE|DDB_G0281605 - symbol:cfaD "peptidase C1A famil...   396  1.3e-53   2
WB|WBGene00000776 - symbol:cpl-1 species:6239 "Caenorhabd...   349  1.3e-53   2
UNIPROTKB|Q3T0I2 - symbol:CTSH "Pro-cathepsin H" species:...   404  2.6e-53   2
RGD|2447 - symbol:Ctsh "cathepsin H" species:10116 "Rattu...   396  2.6e-53   2
UNIPROTKB|Q90686 - symbol:CTSK "Cathepsin K" species:9031...   372  2.6e-53   2
DICTYBASE|DDB_G0279185 - symbol:cprF "cysteine proteinase...   397  4.7e-53   3
UNIPROTKB|O46427 - symbol:CTSH "Pro-cathepsin H" species:...   399  5.4e-53   2
ZFIN|ZDB-GENE-050522-559 - symbol:ctssb.1 "cathepsin S, b...   357  5.4e-53   2
ZFIN|ZDB-GENE-001205-4 - symbol:ctsk "cathepsin K" specie...   372  6.9e-53   2
UNIPROTKB|F1NEC8 - symbol:CTSL2 "Uncharacterized protein"...   357  1.8e-52   2
RGD|621513 - symbol:Ctss "cathepsin S" species:10116 "Rat...   352  2.3e-52   2
ZFIN|ZDB-GENE-040718-61 - symbol:ctsl.1 "cathepsin L.1" s...   346  2.3e-52   2
UNIPROTKB|F7B939 - symbol:CTSH "Uncharacterized protein" ...   387  2.9e-52   2
TAIR|locus:2122113 - symbol:XCP1 "xylem cysteine peptidas...   540  4.4e-52   1
MGI|MGI:107285 - symbol:Ctsh "cathepsin H" species:10090 ...   382  4.8e-52   2
DICTYBASE|DDB_G0272815 - symbol:cprE "cysteine proteinase...   421  6.1e-52   2
TAIR|locus:2175088 - symbol:ALP "aleurain-like protease" ...   382  6.1e-52   2
UNIPROTKB|F6R7P5 - symbol:CTSH "Uncharacterized protein" ...   385  9.9e-52   2
UNIPROTKB|F7BRD4 - symbol:CTSH "Uncharacterized protein" ...   382  9.9e-52   2
UNIPROTKB|G1SQF0 - symbol:CTSH "Uncharacterized protein" ...   385  1.6e-51   2
UNIPROTKB|G1RBY1 - symbol:CTSH "Uncharacterized protein" ...   384  2.0e-51   2
TAIR|locus:2167821 - symbol:RD21B "esponsive to dehydrati...   532  3.1e-51   1
TAIR|locus:2120222 - symbol:RD19 "RESPONSIVE TO DEHYDRATI...   406  4.2e-51   2
UNIPROTKB|P09668 - symbol:CTSH "Pro-cathepsin H" species:...   383  4.2e-51   2
UNIPROTKB|G3R9A7 - symbol:CTSH "Uncharacterized protein" ...   383  4.2e-51   2
UNIPROTKB|P09648 - symbol:CTSL1 "Cathepsin L1" species:90...   345  4.2e-51   2
WB|WBGene00019986 - symbol:R09F10.1 species:6239 "Caenorh...   389  8.7e-51   2
MGI|MGI:1349426 - symbol:Ctsj "cathepsin J" species:10090...   378  8.7e-51   2
UNIPROTKB|D3ZZR3 - symbol:D3ZZR3 "Uncharacterized protein...   337  1.1e-50   2
ZFIN|ZDB-GENE-030131-3539 - symbol:ctsh "cathepsin H" spe...   382  1.8e-50   2
UNIPROTKB|G3SSC1 - symbol:CTSH "Uncharacterized protein" ...   376  1.8e-50   2
UNIPROTKB|G1M0X4 - symbol:CTSH "Uncharacterized protein" ...   373  1.8e-50   2
MGI|MGI:1922258 - symbol:4930486L24Rik "RIKEN cDNA 493048...   361  2.3e-50   2
RGD|69241 - symbol:Ctsj "cathepsin J" species:10116 "Ratt...   362  6.0e-50   2
TAIR|locus:2078312 - symbol:AT3G45310 species:3702 "Arabi...   363  1.3e-49   2
ZFIN|ZDB-GENE-050417-107 - symbol:zgc:110239 "zgc:110239"...   379  2.0e-49   2
UNIPROTKB|F1NZ37 - symbol:LOC420160 "Uncharacterized prot...   362  2.0e-49   2
UNIPROTKB|F6X9C1 - symbol:CTSH "Uncharacterized protein" ...   360  2.6e-49   2
UNIPROTKB|Q24940 - symbol:Cat-1 "Cathepsin L-like protein...   335  3.3e-49   2
TAIR|locus:2152445 - symbol:SAG12 "senescence-associated ...   512  4.1e-49   1
TAIR|locus:2050145 - symbol:AT2G21430 species:3702 "Arabi...   381  4.2e-49   2
RGD|708447 - symbol:Testin "testin gene" species:10116 "R...   355  4.2e-49   2
WB|WBGene00007055 - symbol:tag-196 species:6239 "Caenorha...   344  1.1e-48   2
UNIPROTKB|F7BJD8 - symbol:CTSH "Uncharacterized protein" ...   356  1.4e-48   2
TAIR|locus:2825832 - symbol:RD21A "responsive to dehydrat...   506  1.8e-48   1
UNIPROTKB|H9KYW5 - symbol:CTSS "Uncharacterized protein" ...   324  2.9e-48   2
DICTYBASE|DDB_G0290957 - symbol:cprA "cysteine proteinase...   368  6.0e-48   2
TAIR|locus:2130180 - symbol:AT4G16190 species:3702 "Arabi...   384  2.0e-47   2
UNIPROTKB|Q4QRC2 - symbol:Ctsql2 "Protein Ctsql2" species...   338  3.3e-47   2
UNIPROTKB|E9PSK9 - symbol:Ctsql2 "Protein Ctsql2" species...   338  6.8e-47   2
TAIR|locus:2090614 - symbol:AT3G19390 species:3702 "Arabi...   491  6.9e-47   1
RGD|1562210 - symbol:MGC114246 "similar to cathepsin R" s...   351  1.1e-46   2
UNIPROTKB|F1RU48 - symbol:CTSF "Uncharacterized protein" ...   353  1.4e-46   2
TAIR|locus:2090629 - symbol:AT3G19400 species:3702 "Arabi...   487  1.8e-46   1
DICTYBASE|DDB_G0278401 - symbol:cprH "cysteine proteinase...   382  2.3e-46   2

WARNING:  Descriptions of 200 database sequences were not reported due to the
          limiting value of parameter V = 100.
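
The one-line descriptions above have a regular shape (database ID, symbol, truncated description, then Score, P(N) and N), so they can be read with a small parser. The sketch below is a hedged illustration, not an official parser for this report format: the regular expression, the function name `parse_hit_line`, and the reading of the "Threshold: 0.001" parameter as an upper bound on P(N) are all assumptions.

```python
import re

# Assumed layout: "<db_id> - symbol:<symbol> <description...>  <score>  <P(N)>  <N>"
HIT_LINE = re.compile(
    r"^(?P<db_id>\S+) - symbol:(?P<symbol>\S+)\s+(?P<desc>.*?)\s+"
    r"(?P<score>\d+)\s+(?P<p>\d+(?:\.\d*)?(?:[eE][+-]?\d+)?)\s+(?P<n>\d+)\s*$"
)


def parse_hit_line(line):
    """Return a dict for one description line, or None if it does not match."""
    m = HIT_LINE.match(line.strip())
    if m is None:
        return None
    return {
        "db_id": m.group("db_id"),
        "symbol": m.group("symbol"),
        "score": int(m.group("score")),
        "p": float(m.group("p")),
        "n": int(m.group("n")),
    }


# Three lines copied from the table above.
SAMPLE = [
    'TAIR|locus:2157712 - symbol:CEP1 "cysteine endopeptidase ...   517  1.1e-74   2',
    'TAIR|locus:2038515 - symbol:AT1G06260 species:3702 "Arabi...   659  1.1e-64   1',
    'TAIR|locus:2122113 - symbol:XCP1 "xylem cysteine peptidas...   540  4.4e-52   1',
]

hits = [h for h in (parse_hit_line(s) for s in SAMPLE) if h is not None]

THRESHOLD = 0.001  # assumption: the "Threshold" parameter bounds P(N)
kept = [h for h in hits if h["p"] <= THRESHOLD]

by_p = [h["symbol"] for h in sorted(kept, key=lambda h: h["p"])]
by_score = [h["symbol"] for h in sorted(kept, key=lambda h: h["score"], reverse=True)]
print(by_p)
print(by_score)
```

Sorting the same three hits by P(N) and by Score gives different orders, which is consistent with the table being ranked by smallest sum probability rather than by the single highest-scoring segment.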


>TAIR|locus:2157712
            symbol:CEP1 "cysteine endopeptidase 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005783 EMBL:CP002688
            GenomeReviews:BA000015_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 PROSITE:PS00014
            EMBL:AB024031 MEROPS:I29.003 EMBL:HM367092 EMBL:AY091087
            IPI:IPI00516991 RefSeq:NP_568722.1 UniGene:At.7918 HSSP:O65039
            ProteinModelPortal:Q9FGR9 SMR:Q9FGR9 PaxDb:Q9FGR9 PRIDE:Q9FGR9
            EnsemblPlants:AT5G50260.1 GeneID:835091 KEGG:ath:AT5G50260
            TAIR:At5g50260 HOGENOM:HOG000230773 InParanoid:Q9FGR9 KO:K16292
            OMA:WHSKKYH PhylomeDB:Q9FGR9 ProtClustDB:CLSN2689970
            Genevestigator:Q9FGR9 Uniprot:Q9FGR9
        Length = 361

 Score = 517 (187.1 bits), Expect = 1.1e-74, Sum P(2) = 1.1e-74
 Identities = 113/244 (46%), Positives = 153/244 (62%)

Query:    24 MLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRF 83
             M R  VL+L +L VL    G        + +  S+ E +E W   ++     E++  +RF
Sbjct:     1 MKRFIVLALCMLMVLETTKGLDFHNKDVESE-NSLWELYERWRSHHTVARSLEEK-AKRF 58

Query:    84 GIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYN-----EPR-WPSV 137
              ++  NV++I   N ++ S+KL  NKF D+++EEF  TY G N  ++     E +   S 
Sbjct:    59 NVFKHNVKHIHETNKKDKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKSF 118

Query:   138 QYLG---LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQEL 194
              Y     LP SVDWRK GAVTPVK+QGQCGSCWAFS V AVEGIN+++T KL SLSEQEL
Sbjct:   119 MYANVNTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQEL 178

Query:   195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
             VDCD N +NQGCNGG M+ AFEFI + GG+T+E  YPY+  ++ C T+K     V+I G+
Sbjct:   179 VDCDTN-QNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGH 237

Query:   255 EAIP 258
             E +P
Sbjct:   238 EDVP 241

 Score = 255 (94.8 bits), Expect = 1.1e-74, Sum P(2) = 1.1e-74
 Identities = 48/77 (62%), Positives = 53/77 (68%)

Query:   263 FQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
             FQ YS GVF   CG +LNHGV VVGYG    G KYW+VKNSWG  WGE GYIRM R    
Sbjct:   268 FQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRH 327

Query:   322 SNIGICGILMQASYPVK 338
                G+CGI M+ASYP+K
Sbjct:   328 KE-GLCGIAMEASYPLK 343
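
Each aligned segment in a record such as the one above is introduced by a "Score = ..." line and an "Identities = ..." line. The following sketch parses those two lines and recomputes the percentages from the raw counts. The regular expressions and the name `parse_hsp_header` are illustrative assumptions; the observation that the reported percentages equal the truncated (not rounded) ratio is checked only against the two segments of this first hit.

```python
import re

SCORE_RE = re.compile(
    r"Score = (?P<score>\d+) \((?P<bits>[\d.]+) bits\), "
    r"Expect = (?P<expect>[\d.eE+-]+), "
    r"(?:Sum P\((?P<n>\d+)\)|P) = (?P<p>[\d.eE+-]+)"
)
IDENT_RE = re.compile(
    r"Identities = (?P<ident>\d+)/(?P<length>\d+) \((?P<ident_pct>\d+)%\), "
    r"Positives = (?P<pos>\d+)/(?P<pos_len>\d+) \((?P<pos_pct>\d+)%\)"
)


def parse_hsp_header(score_line, ident_line):
    """Parse the two header lines of one high-scoring segment pair."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(ident_line)
    if s is None or i is None:
        raise ValueError("unrecognised segment header")
    return {
        "score": int(s.group("score")),
        "bits": float(s.group("bits")),
        "expect": float(s.group("expect")),
        # A plain "P = ..." line carries no segment count; treat it as 1.
        "n": int(s.group("n")) if s.group("n") else 1,
        "p": float(s.group("p")),
        "identities": int(i.group("ident")),
        "positives": int(i.group("pos")),
        "length": int(i.group("length")),
        "ident_pct": int(i.group("ident_pct")),
        "pos_pct": int(i.group("pos_pct")),
    }


# Header lines copied from the two segments of the first hit.
hsp1 = parse_hsp_header(
    " Score = 517 (187.1 bits), Expect = 1.1e-74, Sum P(2) = 1.1e-74",
    " Identities = 113/244 (46%), Positives = 153/244 (62%)",
)
hsp2 = parse_hsp_header(
    " Score = 255 (94.8 bits), Expect = 1.1e-74, Sum P(2) = 1.1e-74",
    " Identities = 48/77 (62%), Positives = 53/77 (68%)",
)

for hsp in (hsp1, hsp2):
    # Integer division truncates, which reproduces the reported values here.
    recomputed_ident = 100 * hsp["identities"] // hsp["length"]
    recomputed_pos = 100 * hsp["positives"] // hsp["length"]
    print(recomputed_ident == hsp["ident_pct"], recomputed_pos == hsp["pos_pct"])
```

Both segments share the same Sum P(2) value because that probability is computed for the group of segments rather than for each one separately.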


>TAIR|locus:2030427
            symbol:XCP2 "xylem cysteine peptidase 2" species:3702
            "Arabidopsis thaliana" [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009507 "chloroplast" evidence=ISM] [GO:0008233 "peptidase
            activity" evidence=ISS] [GO:0005618 "cell wall" evidence=IDA]
            [GO:0010623 "developmental programmed cell death" evidence=IMP]
            [GO:0010075 "regulation of meristem growth" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0005886 GO:GO:0005618 GO:GO:0005773
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AC069251 EMBL:AC007369 GO:GO:0010623
            OMA:YKEIPEG HOGENOM:HOG000230773 KO:K16290 EMBL:AF191028
            EMBL:BT004822 IPI:IPI00526722 PIR:A86341 RefSeq:NP_564126.1
            UniGene:At.21316 ProteinModelPortal:Q9LM66 SMR:Q9LM66 IntAct:Q9LM66
            STRING:Q9LM66 MEROPS:C01.120 PaxDb:Q9LM66 PRIDE:Q9LM66
            ProMEX:Q9LM66 EnsemblPlants:AT1G20850.1 GeneID:838677
            KEGG:ath:AT1G20850 GeneFarm:5034 TAIR:At1g20850 InParanoid:Q9LM66
            PhylomeDB:Q9LM66 ProtClustDB:CLSN2917031 Genevestigator:Q9LM66
            GermOnline:AT1G20850 Uniprot:Q9LM66
        Length = 356

 Score = 500 (181.1 bits), Expect = 2.3e-70, Sum P(2) = 2.3e-70
 Identities = 102/219 (46%), Positives = 137/219 (62%)

Query:    53 YDPQSME------ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLT 106
             Y P+ +E      E FENW+  + + Y + +E   RF ++  N+++ID  N +  S+ L 
Sbjct:    36 YSPEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKSYWLG 95

Query:   107 DNKFADLSNEEFISTYLGYNKPY---NEPR-WPSVQYL---GLPASVDWRKEGAVTPVKD 159
              N+FADLS+EEF   YLG        +E R +    Y     +P SVDWRK+GAV  VK+
Sbjct:    96 LNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKN 155

Query:   160 QGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFIT 219
             QG CGSCWAFS VAAVEGINK+ TG L +LSEQEL+DCD  + N GCNGG M+ AFE+I 
Sbjct:   156 QGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDT-TYNNGCNGGLMDYAFEYIV 214

Query:   220 KIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             K GG+  E+DYPY  +   C+  K +   VTI G++ +P
Sbjct:   215 KNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVP 253

 Score = 231 (86.4 bits), Expect = 2.3e-70, Sum P(2) = 2.3e-70
 Identities = 43/76 (56%), Positives = 49/76 (64%)

Query:   263 FQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSS 322
             FQ YS GVFD  CG  L+HGV  VGYG   G  Y +VKNSWG  WGE GYIR+ RN+   
Sbjct:   280 FQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYIIVKNSWGPKWGEKGYIRLKRNTGKP 339

Query:   323 NIGICGILMQASYPVK 338
               G+CGI   AS+P K
Sbjct:   340 E-GLCGINKMASFPTK 354


>TAIR|locus:2024362
            symbol:XBCP3 "xylem bark cysteine peptidase 3"
            species:3702 "Arabidopsis thaliana" [GO:0005576 "extracellular
            region" evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0005783 "endoplasmic
            reticulum" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005783 EMBL:CP002684 GO:GO:0005773 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 MEROPS:I29.003
            HOGENOM:HOG000230773 InterPro:IPR000118 Pfam:PF00396 SMART:SM00277
            UniGene:At.10233 OMA:CEIESAV EMBL:BT026490 EMBL:AK226753
            IPI:IPI00536687 RefSeq:NP_563855.1 ProteinModelPortal:Q0WVJ5
            SMR:Q0WVJ5 PRIDE:Q0WVJ5 EnsemblPlants:AT1G09850.1 GeneID:837517
            KEGG:ath:AT1G09850 TAIR:At1g09850 InParanoid:Q0WVJ5
            PhylomeDB:Q0WVJ5 ProtClustDB:CLSN2687747 Genevestigator:Q0WVJ5
            Uniprot:Q0WVJ5
        Length = 437

 Score = 481 (174.4 bits), Expect = 9.8e-70, Sum P(2) = 9.8e-70
 Identities = 97/206 (47%), Positives = 134/206 (65%)

Query:    58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSNE 116
             + E F++W +++ + YGSE+E Q+R  I+  N  ++   N   N ++ L+ N FADL++ 
Sbjct:    28 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 87

Query:   117 EFISTYLGYNKPYNEPRWPSV-QYLG----LPASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
             EF ++ LG +         S  Q LG    +P SVDWRK+GAVT VKDQG CG+CW+FSA
Sbjct:    88 EFKASRLGLSVSAPSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWSFSA 147

Query:   172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
               A+EGIN++ TG L+SLSEQEL+DCD  S N GCNGG M+ AFEF+ K  G+ TE DYP
Sbjct:   148 TGAMEGINQIVTGDLISLSEQELIDCD-KSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206

Query:   232 YRGKNDRCQTDKTKHHAVTITGYEAI 257
             Y+ ++  C+ DK K   VTI  Y  +
Sbjct:   207 YQERDGTCKKDKLKQKVVTIDSYAGV 232

 Score = 244 (91.0 bits), Expect = 9.8e-70, Sum P(2) = 9.8e-70
 Identities = 43/77 (55%), Positives = 55/77 (71%)

Query:   262 AFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
             AFQLYS G+F   C   L+H V +VGYG  +G  YW+VKNSWG SWG  G++ M RN+ +
Sbjct:   259 AFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTEN 318

Query:   322 SNIGICGILMQASYPVK 338
             S+ G+CGI M ASYP+K
Sbjct:   319 SD-GVCGINMLASYPIK 334


>TAIR|locus:2038588
            symbol:AT2G27420 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002685
            GenomeReviews:CT485783_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC006232
            MEROPS:I29.003 OMA:EEFRATH HOGENOM:HOG000230773 HSSP:P53634
            ProtClustDB:CLSN2688476 EMBL:AY064033 EMBL:AY096388 IPI:IPI00539752
            PIR:F84672 RefSeq:NP_565649.1 UniGene:At.27094
            ProteinModelPortal:Q9ZQH7 SMR:Q9ZQH7 PRIDE:Q9ZQH7
            EnsemblPlants:AT2G27420.1 GeneID:817287 KEGG:ath:AT2G27420
            TAIR:At2g27420 InParanoid:Q9ZQH7 PhylomeDB:Q9ZQH7
            ArrayExpress:Q9ZQH7 Genevestigator:Q9ZQH7 Uniprot:Q9ZQH7
        Length = 348

 Score = 480 (174.0 bits), Expect = 2.0e-67, Sum P(2) = 2.0e-67
 Identities = 104/246 (42%), Positives = 152/246 (61%)

Query:    29 VLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
             +L++FL +   +   A S G    ++  ++E+  E W+ +++R Y  E E + RF I+  
Sbjct:     8 ILTIFLSYRTSL---ATSRG--SLFEASAIEKH-EQWMARFNRVYSDETEKRNRFNIFKK 61

Query:    89 NVQYIDYINSQN-LSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSV---------Q 138
             N++++   N  N +++K+  N+F+DL++EEF +T+ G   P    R  ++         +
Sbjct:    62 NLEFVQNFNMNNKITYKVDINEFSDLTDEEFRATHTGLVVPEAITRISTLSSGKNTVPFR 121

Query:   139 YLGLP---ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
             Y  +     S+DWR+EGAVTPVK QG+CG CWAFSAVAAVEGI K+  G+LVSLSEQ+L+
Sbjct:   122 YGNVSDNGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQLL 181

Query:   196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKT---KHHAVTIT 252
             DCD    NQGC GG M KAFE+I K  G+TTED+YPY+     C +  T      A TI+
Sbjct:   182 DCD-RDYNQGCRGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATIS 240

Query:   253 GYEAIP 258
             GYE +P
Sbjct:   241 GYETVP 246

 Score = 223 (83.6 bits), Expect = 2.0e-67, Sum P(2) = 2.0e-67
 Identities = 45/103 (43%), Positives = 63/103 (61%)

Query:   236 NDRCQTDKTKHHAVTITGYEAIPARYAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGE 294
             N+           V++ G E   A  AF+ YS GVF+  CG  L+H VT+VGYG  + G 
Sbjct:   249 NEEALLQAVSQQPVSV-GIEGTGA--AFRHYSGGVFNGECGTDLHHAVTIVGYGMSEEGT 305

Query:   295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
             KYW+VKNSWG +WGE GY+R+ R+  +   G+CG+ + A YP+
Sbjct:   306 KYWVVKNSWGETWGENGYMRIKRDVDAPQ-GMCGLAILAFYPL 347


>TAIR|locus:2082881
            symbol:AT3G49340 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002686
            GenomeReviews:BA000014_GR EMBL:AC012329 EMBL:AL132956
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 MEROPS:I29.003 HOGENOM:HOG000230773 HSSP:P07711
            KO:K01376 IPI:IPI00520642 PIR:T45839 RefSeq:NP_566920.1
            UniGene:At.53854 ProteinModelPortal:Q9SG15 SMR:Q9SG15
            EnsemblPlants:AT3G49340.1 GeneID:824096 KEGG:ath:AT3G49340
            TAIR:At3g49340 InParanoid:Q9SG15 OMA:PQNDEEA PhylomeDB:Q9SG15
            ProtClustDB:CLSN2688476 Genevestigator:Q9SG15 Uniprot:Q9SG15
        Length = 341

 Score = 452 (164.2 bits), Expect = 6.8e-65, Sum P(2) = 6.8e-65
 Identities = 95/214 (44%), Positives = 139/214 (64%)

Query:    57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSN 115
             S  E+ E W+ +++R Y  + E   RF I+++N+++++ IN + N ++ L  N+F+DL++
Sbjct:    30 SAVEKHEQWMSRFNRVYSDDSEKTSRFEIFTNNLKFVESINMNTNKTYTLDVNEFSDLTD 89

Query:   116 EEFISTYLGYNKPYNEPRWP--------SVQY--LGLPA-SVDWRKEGAVTPVKDQGQCG 164
             EEF + Y G   P    R          S +Y  +G    S+DW +EGAVT VK Q QCG
Sbjct:    90 EEFKARYTGLVVPEGMTRISTTDSHETVSFRYENVGETGESMDWIQEGAVTSVKHQQQCG 149

Query:   165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
              CWAFSAVAAVEG+ K+  G+LVSLSEQ+L+DC  ++EN GC GG M KAF++I +  G+
Sbjct:   150 CCWAFSAVAAVEGMTKIANGELVSLSEQQLLDC--STENNGCGGGIMWKAFDYIKENQGI 207

Query:   225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             TTED+YPY+G    C+++     A TI+GYE +P
Sbjct:   208 TTEDNYPYQGAQQTCESNHLA--AATISGYETVP 239

 Score = 227 (85.0 bits), Expect = 6.8e-65, Sum P(2) = 6.8e-65
 Identities = 43/78 (55%), Positives = 53/78 (67%)

Query:   261 YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
             Y F  YS G+F+  CG QL H VT+VGYG  + G KYWL+KNSWG SWGE GY+R+ R+ 
Sbjct:   264 YEFIHYSGGIFNGECGTQLTHAVTIVGYGVSEEGIKYWLLKNSWGESWGENGYMRIMRDV 323

Query:   320 PSSNIGICGILMQASYPV 337
              S   G+CG+   A YPV
Sbjct:   324 DSPQ-GMCGLASLAYYPV 340


>TAIR|locus:2038515
            symbol:AT1G06260 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0048046 "apoplast"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0048046 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC025290
            MEROPS:I29.003 HSSP:O65039 HOGENOM:HOG000230773 OMA:METAFEF
            IPI:IPI00525965 PIR:D86198 RefSeq:NP_563764.1 UniGene:At.24617
            ProteinModelPortal:Q9LNC1 SMR:Q9LNC1 PaxDb:Q9LNC1 PRIDE:Q9LNC1
            EnsemblPlants:AT1G06260.1 GeneID:837137 KEGG:ath:AT1G06260
            TAIR:At1g06260 InParanoid:Q9LNC1 PhylomeDB:Q9LNC1
            ProtClustDB:CLSN2916975 Genevestigator:Q9LNC1 Uniprot:Q9LNC1
        Length = 343

 Score = 659 (237.0 bits), Expect = 1.1e-64, P = 1.1e-64
 Identities = 128/242 (52%), Positives = 169/242 (69%)

Query:    21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDP-QSMEERFENWLKQYSREYGSEDEW 79
             M  +LRN+ L+L +L    + A          YDP +++++RFE WLK +S+ YG  DEW
Sbjct:     1 MLNVLRNSNLTLAVLICFVLIASKLCSVDSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEW 60

Query:    80 QRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKP---YNEPRWPS 136
               RFGIY SNVQ IDYINS +L FKLTDN+FAD++N EF + +LG N      ++ + P 
Sbjct:    61 MLRFGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHKKQRPV 120

Query:   137 VQYLG-LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
                 G +P +VDWR +GAVTP+++QG+CG CWAFSAVAA+EGINK+KTG LVSLSEQ+L+
Sbjct:   121 CDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLI 180

Query:   196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE 255
             DCDV + N+GC+GG ME AFEFI   GG+ TE DYPY G    C  +K+K+  VTI GY+
Sbjct:   181 DCDVGTYNKGCSGGLMETAFEFIKTNGGLATETDYPYTGIEGTCDQEKSKNKVVTIQGYQ 240

Query:   256 AI 257
              +
Sbjct:   241 KV 242

 Score = 283 (104.7 bits), Expect = 7.6e-25, P = 7.6e-25
 Identities = 58/115 (50%), Positives = 70/115 (60%)

Query:   224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQLYSHGVFDEYCGHQLNHGV 283
             V T   Y    +N+           V++ G +A    + FQLYS GVF  YCG  LNHGV
Sbjct:   233 VVTIQGYQKVAQNEASLQIAAAQQPVSV-GIDA--GGFIFQLYSSGVFTNYCGTNLNHGV 289

Query:   284 TVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
             TVVGYG +  +KYW+VKNSWGT WGE GYIRM R   S + G CGI M ASYP++
Sbjct:   290 TVVGYGVEGDQKYWIVKNSWGTGWGEEGYIRMERGV-SEDTGKCGIAMMASYPLQ 343


>TAIR|locus:2029934
            symbol:AT1G29080 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002684 GenomeReviews:CT485782_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AC021043 MEROPS:I29.003 HOGENOM:HOG000230773
            HSSP:P53634 ProtClustDB:CLSN2688064 EMBL:DQ056468 IPI:IPI00521747
            PIR:C86413 RefSeq:NP_564320.1 UniGene:At.51814
            ProteinModelPortal:Q9LP39 SMR:Q9LP39 EnsemblPlants:AT1G29080.1
            GeneID:839783 KEGG:ath:AT1G29080 TAIR:At1g29080 InParanoid:Q9LP39
            OMA:KTWGENG PhylomeDB:Q9LP39 Genevestigator:Q9LP39 Uniprot:Q9LP39
        Length = 346

 Score = 450 (163.5 bits), Expect = 3.3e-63, Sum P(2) = 3.3e-63
 Identities = 92/218 (42%), Positives = 133/218 (61%)

Query:    53 YDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFA 111
             Y P S+ +  + W+ Q+SR Y  E E Q R  + + N+++I+  N+  N S+KL  N+F 
Sbjct:    30 YKPSSIVDYHQQWMIQFSRVYDDEFEKQLRLQVLTENLKFIESFNNMGNQSYKLGVNEFT 89

Query:   112 DLSNEEFISTYLGYN-----KPY---NE--PRWPSVQYLGLPASVDWRKEGAVTPVKDQG 161
             D + EEF++TY G        P+   NE  P W       L  + DWR EGAVTPVK QG
Sbjct:    90 DWTKEEFLATYTGLRGVNVTSPFEVVNETKPAWNWTVSDVLGTNKDWRNEGAVTPVKSQG 149

Query:   162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKI 221
             +CG CWAFSA+AAVEG+ K+  G L+SLSEQ+L+DC    +N GC GG    AF +I K 
Sbjct:   150 ECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDC-TREQNNGCKGGTFVNAFNYIIKH 208

Query:   222 GGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA 259
              G+++E++YPY+ K   C+++     A+ I G+E +P+
Sbjct:   209 RGISSENEYPYQVKEGPCRSNARP--AILIRGFENVPS 244

 Score = 213 (80.0 bits), Expect = 3.3e-63, Sum P(2) = 3.3e-63
 Identities = 48/105 (45%), Positives = 60/105 (57%)

Query:   236 NDRCQTDKTKHHAVTITGYEAIPARYA-FQLYSHGVFD-EYCGHQLNHGVTVVGYGED-H 292
             N+R   +      V +    AI A  A F  YS GV++   CG  +NH VT+VGYG    
Sbjct:   246 NERALLEAVSRQPVAV----AIDASEAGFVHYSGGVYNARNCGTSVNHAVTLVGYGTSPE 301

Query:   293 GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
             G KYWL KNSWG +WGE GYIR+ R+      G+CG+   ASYPV
Sbjct:   302 GMKYWLAKNSWGKTWGENGYIRIRRDVEWPQ-GMCGVAQYASYPV 345


>TAIR|locus:2029924
            symbol:AT1G29090 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002684 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            HOGENOM:HOG000230773 HSSP:P53634 ProtClustDB:CLSN2688064
            EMBL:BT004146 IPI:IPI00545702 RefSeq:NP_564321.2 UniGene:At.40814
            ProteinModelPortal:Q84W75 SMR:Q84W75 MEROPS:C01.A15
            EnsemblPlants:AT1G29090.1 GeneID:839784 KEGG:ath:AT1G29090
            TAIR:At1g29090 InParanoid:Q84W75 OMA:SIRGHED PhylomeDB:Q84W75
            ArrayExpress:Q84W75 Genevestigator:Q84W75 Uniprot:Q84W75
        Length = 355

 Score = 418 (152.2 bits), Expect = 1.4e-60, Sum P(2) = 1.4e-60
 Identities = 89/212 (41%), Positives = 128/212 (60%)

Query:    60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEF 118
             E  + W+ ++SR Y  E E Q RF ++  N+++I+  N + + ++KL  N+FAD + EEF
Sbjct:    45 EHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTREEF 104

Query:   119 ISTYLGYNK----PYNE------PRWP-SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
             I+T+ G       P +E      P W  +V  +    + DWR EGAVTPVK QGQCG CW
Sbjct:   105 IATHTGLKGVNGIPSSEFVDEMIPSWNWNVSDVAGRETKDWRYEGAVTPVKYQGQCGCCW 164

Query:   168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
             AFS+VAAVEG+ K+    LVSLSEQ+L+DCD   +N GCNGG M  AF +I K  G+ +E
Sbjct:   165 AFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDN-GCNGGIMSDAFSYIIKNRGIASE 223

Query:   228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA 259
               YPY+     C+ +     +  I G++ +P+
Sbjct:   224 ASYPYQAAEGTCRYNGKP--SAWIRGFQTVPS 253

 Score = 220 (82.5 bits), Expect = 1.4e-60, Sum P(2) = 1.4e-60
 Identities = 43/77 (55%), Positives = 50/77 (64%)

Query:   263 FQLYSHGVFDE-YCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
             F  YS GV+DE YCG  +NH VT VGYG    G KYWL KNSWG +WGE GYIR+ R+  
Sbjct:   279 FMHYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVA 338

Query:   321 SSNIGICGILMQASYPV 337
                 G+CG+   A YPV
Sbjct:   339 WPQ-GMCGVAQYAFYPV 354


>UNIPROTKB|F1NYJ1
            symbol:CTSL2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 IPI:IPI00602255
            OMA:DITHHEF EMBL:AADN02067812 Ensembl:ENSGALT00000020588
            ArrayExpress:F1NYJ1 Uniprot:F1NYJ1
        Length = 339

 Score = 437 (158.9 bits), Expect = 7.8e-60, Sum P(2) = 7.8e-60
 Identities = 92/216 (42%), Positives = 132/216 (61%)

Query:    52 KYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN-SQNL---SFKLTD 107
             + DP  ++  ++ W   +S++Y   +E  RR  ++  N++ I+  N   +L   S+KL  
Sbjct:    21 RVDPD-LDSHWQLWKSWHSKDYHEREESWRRV-VWEKNLKMIELHNLDHSLGKHSYKLGM 78

Query:   108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
             N+F D++ EEF     GY    +E ++   Q+L       P SVDWR++G VTPVKDQGQ
Sbjct:    79 NQFGDMTAEEFRQLMNGYKHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQ 138

Query:   163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
             CGSCWAFS   A+EG +  KTGKLVSLSEQ LVDC     NQGCNGG M++AF+++   G
Sbjct:   139 CGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNG 198

Query:   223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             G+ +E+ YPY  K+D     K +++A   TG+  IP
Sbjct:   199 GIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIP 234

 Score = 194 (73.4 bits), Expect = 7.8e-60, Sum P(2) = 7.8e-60
 Identities = 44/89 (49%), Positives = 57/89 (64%)

Query:   256 AIPARYA-FQLYSHGVFDEY-CGHQ-LNHGVTVVGYG---ED-HGEKYWLVKNSWGTSWG 308
             AI A ++ FQ Y  G++ E  C  + L+HGV VVGYG   ED  G+KYW+VKNSWG  WG
Sbjct:   254 AIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWG 313

Query:   309 EAGYIRMARNSPSSNIGICGILMQASYPV 337
             + GYI MA++  +     CGI   ASYP+
Sbjct:   314 DKGYIYMAKDRKNH----CGIATAASYPL 338


>UNIPROTKB|F1PAK0
            symbol:CTSS "Cathepsin S" species:9615 "Canis lupus
            familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:AAEX03011051 Ensembl:ENSCAFT00000019176 OMA:YEPACTQ
            Uniprot:F1PAK0
        Length = 339

 Score = 408 (148.7 bits), Expect = 5.4e-59, Sum P(2) = 5.4e-59
 Identities = 93/235 (39%), Positives = 134/235 (57%)

Query:    33 FLLWVLGI-PAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQ 91
             F+ W++G+ P  +++     K DP +++  +  W K YS++Y  E+E   R  I+  N++
Sbjct:     8 FMKWLVGLLPLCSYAVAQVHK-DP-TLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLK 65

Query:    92 YIDYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQYLGLPA 144
             ++   N ++     S+ L  N   D++ EE IS       P    R   + S     LP 
Sbjct:    66 FVMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISLMGSLRVPSQWQRNVTYRSNSNQKLPD 125

Query:   145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE-N 203
             SVDWR++G VT VK QG CG+CWAFSAV A+E   KLKTGKLVSLS Q LVDC      N
Sbjct:   126 SVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGN 185

Query:   204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             +GCNGG+M  AF++I    G+ +E  YPY+  N +C+ D +K  A T + Y  +P
Sbjct:   186 KGCNGGFMTTAFQYIIDNNGIDSEASYPYKAVNGKCRYD-SKKRAATCSKYTELP 239

 Score = 215 (80.7 bits), Expect = 5.4e-59, Sum P(2) = 5.4e-59
 Identities = 46/83 (55%), Positives = 55/83 (66%)

Query:   256 AIPA-RYAFQLYSHGVFDE-YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYI 313
             AI A  Y+F LY  GV+ E  C   +NHGV VVGYG  +G+ YWLVKNSWG ++G+ GYI
Sbjct:   259 AIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYI 318

Query:   314 RMARNSPSSNIGICGILMQASYP 336
             RMARNS +     CGI    SYP
Sbjct:   319 RMARNSGNH----CGIASYPSYP 337


>DICTYBASE|DDB_G0283867
            symbol:cprC "cysteine proteinase 3" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0283867 GenomeReviews:CM000153_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:AAFI02000057
            KO:K01365 EMBL:X03930 RefSeq:XP_638859.1 ProteinModelPortal:Q23894
            SMR:Q23894 MEROPS:C01.114 EnsemblProtists:DDB0220784 GeneID:8624257
            KEGG:ddi:DDB_G0283867 OMA:NNVEHIN Uniprot:Q23894
        Length = 337

 Score = 413 (150.4 bits), Expect = 6.9e-59, Sum P(2) = 6.9e-59
 Identities = 96/245 (39%), Positives = 141/245 (57%)

Query:    25 LRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFG 84
             +R ++  +F L VL I     S G    +  +  ++ F +W++  ++ Y +  E+  R+ 
Sbjct:     1 MRLSITLIFTLIVLSI--SFISAG--NVFSHKQYQDSFIDWMRSNNKAY-THKEFMPRYE 55

Query:    85 IYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGL-- 142
              +  N+ Y+   NS+     L  N+ ADLSNEE+   YLG  + + +      + LGL  
Sbjct:    56 EFKKNMDYVHNWNSKGSKTVLGLNQHADLSNEEYRLNYLG-TRAHIKLNGYHKRNLGLRL 114

Query:   143 -------PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
                    P +VDWR++ AVTPVKDQGQCGSC++FS   +VEG+  +KTGKLVSLSEQ ++
Sbjct:   115 NRPQFKQPLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNIL 174

Query:   196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK-NDRCQTDKTKHHAVTITGY 254
             DC  +  N+GCNGG M  AFE+I K  G+ +E+ YPY  K ND C+  +    A  IT Y
Sbjct:   175 DCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDECKFQEGSV-AAKITSY 233

Query:   255 EAIPA 259
             + I A
Sbjct:   234 KEIEA 238

 Score = 209 (78.6 bits), Expect = 6.9e-59, Sum P(2) = 6.9e-59
 Identities = 45/85 (52%), Positives = 57/85 (67%)

Query:   256 AIPARY-AFQLYSHGVFDE-YCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGY 312
             AI A + +FQLY+ GV+ E  C  + L+HGV  VG G D+GE Y++VKNSWG SWG  GY
Sbjct:   256 AIDASHNSFQLYTAGVYYEPACSSEDLDHGVLAVGMGTDNGEDYYIVKNSWGPSWGLNGY 315

Query:   313 IRMARNSPSSNIGICGILMQASYPV 337
             I MARN  ++    CGI   ASYP+
Sbjct:   316 IHMARNKDNN----CGISTMASYPI 336


>DICTYBASE|DDB_G0279799
            symbol:cprB "cysteine proteinase 2" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0279799 GenomeReviews:CM000152_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            MEROPS:I29.003 KO:K01365 EMBL:AAFI02000033 EMBL:M16039 EMBL:X03344
            PIR:A25439 RefSeq:XP_641494.1 ProteinModelPortal:P04989 SMR:P04989
            EnsemblProtists:DDB0214998 GeneID:8622234 KEGG:ddi:DDB_G0279799
            OMA:YVNITAG Uniprot:P04989
        Length = 376

 Score = 431 (156.8 bits), Expect = 1.4e-58, Sum P(3) = 1.4e-58
 Identities = 101/245 (41%), Positives = 138/245 (56%)

Query:    25 LRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFG 84
             +R  V  + L++V    A     G  +++        F  W  +++R+Y S  E+  R+ 
Sbjct:     1 MRLLVFLILLIFVNFSFANVRPNG--RRFSESQYRTAFTEWTLKFNRQYSSS-EFSNRYS 57

Query:    85 IYSSNVQYIDYINSQNLSFKLTD-NKFADLSNEEFISTYLG-------YNKPYNEPRWPS 136
             I+ SN+ Y+D  NS+  S  +   N FAD++NEE+  TYLG       YN  Y+     +
Sbjct:    58 IFKSNMDYVDNWNSKGDSQTVLGLNNFADITNEEYRKTYLGTRVNAHSYNG-YDGREVLN 116

Query:   137 VQYLGL-PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
             V+ L   P S+DWR + AVTP+KDQGQCGSCW+FS   + EG + LKT KLVSLSEQ LV
Sbjct:   117 VEDLQTNPKSIDWRTKNAVTPIKDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQNLV 176

Query:   196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN-DRCQTDKTKHHAVTITGY 254
             DC    EN GC+GG M  AF++I K  G+ TE  YPY  +    C  +K+   A TI GY
Sbjct:   177 DCSGPEENFGCDGGLMNNAFDYIIKNKGIDTESSYPYTAETGSTCLFNKSDIGA-TIKGY 235

Query:   255 EAIPA 259
               I A
Sbjct:   236 VNITA 240

 Score = 122 (48.0 bits), Expect = 1.4e-58, Sum P(3) = 1.4e-58
 Identities = 22/42 (52%), Positives = 30/42 (71%)

Query:   296 YWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
             YW+VKNSWGTSWG  GYI M+++  ++    CGI   +SYP+
Sbjct:   338 YWIVKNSWGTSWGIKGYILMSKDRKNN----CGIASVSSYPL 375

 Score = 78 (32.5 bits), Expect = 1.4e-58, Sum P(3) = 1.4e-58
 Identities = 19/37 (51%), Positives = 26/37 (70%)

Query:   256 AIPARY-AFQLYSHGVFDE-YCGH-QLNHGVTVVGYG 289
             AI A + +FQLY+ G++ E  C   +L+HGV VVGYG
Sbjct:   258 AIDASHNSFQLYTSGIYYEPKCSPTELDHGVLVVGYG 294


>UNIPROTKB|Q8HY81
            symbol:CTSS "Cathepsin S" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            CTD:1520 KO:K01368 OrthoDB:EOG4JM7Q2 EMBL:AY156692
            RefSeq:NP_001002938.2 UniGene:Cfa.1661 ProteinModelPortal:Q8HY81
            SMR:Q8HY81 STRING:Q8HY81 MEROPS:C01.034 GeneID:403400
            KEGG:cfa:403400 InParanoid:Q8HY81 NextBio:20816922 Uniprot:Q8HY81
        Length = 331

 Score = 403 (146.9 bits), Expect = 1.8e-58, Sum P(2) = 1.8e-58
 Identities = 92/232 (39%), Positives = 132/232 (56%)

Query:    36 WVLGI-PAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYID 94
             W++G+ P  +++     K DP +++  +  W K YS++Y  E+E   R  I+  N++++ 
Sbjct:     3 WLVGLLPLCSYAVAQVHK-DP-TLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVM 60

Query:    95 YINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQYLGLPASVD 147
               N ++     S+ L  N   D++ EE IS       P    R   + S     LP SVD
Sbjct:    61 LHNLEHSMGMHSYDLGMNHLGDMTGEEVISLMGSLRVPSQWQRNVTYRSNSNQKLPDSVD 120

Query:   148 WRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE-NQGC 206
             WR++G VT VK QG CG+CWAFSAV A+E   KLKTGKLVSLS Q LVDC      N+GC
Sbjct:   121 WREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGC 180

Query:   207 NGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             NGG+M  AF++I    G+ +E  YPY+  N +C+ D +K  A T + Y  +P
Sbjct:   181 NGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKCRYD-SKKRAATCSKYTELP 231

 Score = 215 (80.7 bits), Expect = 1.8e-58, Sum P(2) = 1.8e-58
 Identities = 46/83 (55%), Positives = 55/83 (66%)

Query:   256 AIPA-RYAFQLYSHGVFDE-YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYI 313
             AI A  Y+F LY  GV+ E  C   +NHGV VVGYG  +G+ YWLVKNSWG ++G+ GYI
Sbjct:   251 AIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYI 310

Query:   314 RMARNSPSSNIGICGILMQASYP 336
             RMARNS +     CGI    SYP
Sbjct:   311 RMARNSGNH----CGIASYPSYP 329


>UNIPROTKB|Q9GLE3
            symbol:CTSK "Cathepsin K" species:9823 "Sus scrofa"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0005576 "extracellular
            region" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005576 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 MEROPS:I29.007
            CTD:1513 KO:K01371 OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55
            OMA:LKVPPSH EMBL:AF292030 RefSeq:NP_999467.1 UniGene:Ssc.1020
            ProteinModelPortal:Q9GLE3 SMR:Q9GLE3 STRING:Q9GLE3
            Ensembl:ENSSSCT00000007283 GeneID:397569 KEGG:ssc:397569
            ArrayExpress:Q9GLE3 Uniprot:Q9GLE3
        Length = 330

 Score = 413 (150.4 bits), Expect = 2.3e-58, Sum P(2) = 2.3e-58
 Identities = 91/216 (42%), Positives = 131/216 (60%)

Query:    53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQ-NL---SFKLTD 107
             Y  + ++ ++E W K Y ++Y S+ DE  RR  I+  N+++I   N + +L   +++L  
Sbjct:    18 YPEEILDTQWELWKKTYRKQYNSKVDEISRRL-IWEKNLKHISIHNLEASLGVHTYELAM 76

Query:   108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYL----G-LPASVDWRKEGAVTPVKDQGQ 162
             N   D+++EE +    G   P +  R     Y+    G  P S+D+RK+G VTPVK+QGQ
Sbjct:    77 NHLGDMTSEEVVQKMTGLKVPPSHSRSNDTLYIPDWEGRTPDSIDYRKKGYVTPVKNQGQ 136

Query:   163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
             CGSCWAFS+V A+EG  K KTGKL++LS Q LVDC   SEN GC GGYM  AF+++ K  
Sbjct:   137 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 194

Query:   223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             G+ +ED YPY G+++ C  + T   A    GY  IP
Sbjct:   195 GIDSEDAYPYVGQDENCMYNPTGK-AAKCRGYREIP 229

 Score = 204 (76.9 bits), Expect = 2.3e-58, Sum P(2) = 2.3e-58
 Identities = 40/77 (51%), Positives = 50/77 (64%)

Query:   262 AFQLYSHGVF-DEYCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
             +FQ YS GV+ DE C    LNH V  VGYG   G+K+W++KNSWG +WG  GYI MARN 
Sbjct:   256 SFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGKKHWIIKNSWGENWGNKGYILMARNK 315

Query:   320 PSSNIGICGILMQASYP 336
              ++    CGI   AS+P
Sbjct:   316 NNA----CGIANLASFP 328


>UNIPROTKB|Q9GL24
            symbol:CTSL1 "Cathepsin L1" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 CTD:1515 KO:K01365
            OrthoDB:EOG48PMKF EMBL:AJ279008 RefSeq:NP_001239115.1
            UniGene:Cfa.3571 ProteinModelPortal:Q9GL24 SMR:Q9GL24
            MEROPS:C01.032 Ensembl:ENSCAFT00000001770
            Ensembl:ENSCAFT00000023837 GeneID:100684364 KEGG:cfa:100684364
            InParanoid:Q9GL24 OMA:FDQNLDT NextBio:20817211 Uniprot:Q9GL24
        Length = 333

 Score = 425 (154.7 bits), Expect = 2.9e-58, Sum P(2) = 2.9e-58
 Identities = 99/237 (41%), Positives = 139/237 (58%)

Query:    31 SLFLLWV-LGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
             SLFL  + LGI + A     P K+D QS+  ++  W   + R YG  +E  RR  ++  N
Sbjct:     4 SLFLTALCLGIASAA-----P-KFD-QSLNAQWYQWKATHRRLYGMNEEGWRR-AVWEKN 55

Query:    90 VQYIDYIN---SQNL-SFKLTDNKFADLSNEEFISTYLGY-NKPYNEPR-WPSVQYLGLP 143
             ++ I+  N   SQ    F +  N F D++NEEF     G+ N+ + + + +    +  +P
Sbjct:    56 MKMIELHNREYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFQNQKHKKGKMFQEPLFAEIP 115

Query:   144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN 203
              SVDWR++G VTPVK+QGQCGSCWAFSA  A+EG    KTGKLVSLSEQ LVDC     N
Sbjct:   116 KSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGN 175

Query:   204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR 260
             +GCNGG M+ AF ++   GG+ +E+ YPY G++      K +  A   TG+  +P R
Sbjct:   176 EGCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDLPQR 232

 Score = 191 (72.3 bits), Expect = 2.9e-58, Sum P(2) = 2.9e-58
 Identities = 47/108 (43%), Positives = 60/108 (55%)

Query:   237 DRCQTDKTKHHAVTITG--YEAIPARY-AFQLYSHGV-FDEYCGHQ-LNHGVTVVGYG-- 289
             D  Q +K    AV   G    AI A + +FQ Y  G+ FD  C  + L+HGV VVGYG  
Sbjct:   228 DLPQREKALMKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFE 287

Query:   290 -EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
               D   K+W+VKNSWG  WG  GY++MA++  +     CGI   ASYP
Sbjct:   288 GTDSNNKFWIVKNSWGPEWGWNGYVKMAKDQNNH----CGIATAASYP 331


>UNIPROTKB|Q5E968
            symbol:CTSK "Cathepsin K" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:BT021052
            EMBL:BC109853 IPI:IPI00709374 RefSeq:NP_001029607.1
            UniGene:Bt.23218 ProteinModelPortal:Q5E968 SMR:Q5E968 STRING:Q5E968
            MEROPS:I29.007 PRIDE:Q5E968 Ensembl:ENSBTAT00000028016
            GeneID:513038 KEGG:bta:513038 CTD:1513 InParanoid:Q5E968 KO:K01371
            OrthoDB:EOG4SJ5FC NextBio:20870669 PANTHER:PTHR12411:SF55
            Uniprot:Q5E968
        Length = 329

 Score = 415 (151.1 bits), Expect = 6.1e-58, Sum P(2) = 6.1e-58
 Identities = 92/216 (42%), Positives = 131/216 (60%)

Query:    53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQ-NL---SFKLTD 107
             Y  + ++ ++E W K Y ++Y S+ DE  RR  I+  N+++I   N + +L   +++L  
Sbjct:    17 YPEEILDTQWELWKKTYRKQYNSKGDEISRRL-IWEKNLKHISIHNLEASLGVHTYELAM 75

Query:   108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYL----G-LPASVDWRKEGAVTPVKDQGQ 162
             N   D+++EE +    G   P +  R     Y+    G  P SVD+RK+G VTPVK+QGQ
Sbjct:    76 NHLGDMTSEEVVQKMTGLKVPASRSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQ 135

Query:   163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
             CGSCWAFS+V A+EG  K KTGKL++LS Q LVDC   SEN GC GGYM  AF+++ K  
Sbjct:   136 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 193

Query:   223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             G+ +ED YPY G+++ C  + T   A    GY  IP
Sbjct:   194 GIDSEDAYPYVGQDENCMYNPTGK-AAKCRGYREIP 228

 Score = 198 (74.8 bits), Expect = 6.1e-58, Sum P(2) = 6.1e-58
 Identities = 39/77 (50%), Positives = 48/77 (62%)

Query:   262 AFQLYSHGVF-DEYCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
             +FQ Y  GV+ DE C    LNH V  VGYG   G K+W++KNSWG +WG  GYI MARN 
Sbjct:   255 SFQFYRKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNK 314

Query:   320 PSSNIGICGILMQASYP 336
              ++    CGI   AS+P
Sbjct:   315 NNA----CGIANLASFP 327


>UNIPROTKB|G1K2A7
            symbol:CTSK "Cathepsin K" species:9615 "Canis lupus
            familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 PANTHER:PTHR12411:SF55 OMA:LKVPPSH
            EMBL:AAEX03011051 Ensembl:ENSCAFT00000019202 Uniprot:G1K2A7
        Length = 333

 Score = 410 (149.4 bits), Expect = 6.1e-58, Sum P(2) = 6.1e-58
 Identities = 90/216 (41%), Positives = 130/216 (60%)

Query:    53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQ-NL---SFKLTD 107
             Y  + ++ +++ W K Y ++Y S+ DE  RR  I+  N+++I   N + +L   +++L  
Sbjct:    21 YPEEILDTQWDLWKKTYRKQYNSKVDELSRRL-IWEKNLKHISIHNLEASLGVHTYELAM 79

Query:   108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
             N   D+++EE +    G   P +  R     Y+       P SVD+RK+G VTPVK+QGQ
Sbjct:    80 NHLGDMTSEEVVQKMTGLKVPPSHSRSNDTLYIPDWESRAPDSVDYRKKGYVTPVKNQGQ 139

Query:   163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
             CGSCWAFS+V A+EG  K KTGKL++LS Q LVDC   SEN GC GGYM  AF+++ K  
Sbjct:   140 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 197

Query:   223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             G+ +ED YPY G+++ C  + T   A    GY  IP
Sbjct:   198 GIDSEDAYPYVGQDESCMYNPTGK-AAKCRGYREIP 232

 Score = 203 (76.5 bits), Expect = 6.1e-58, Sum P(2) = 6.1e-58
 Identities = 40/77 (51%), Positives = 49/77 (63%)

Query:   262 AFQLYSHGVF-DEYCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
             +FQ YS GV+ DE C    LNH V  VGYG   G K+W++KNSWG +WG  GYI MARN 
Sbjct:   259 SFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNK 318

Query:   320 PSSNIGICGILMQASYP 336
              ++    CGI   AS+P
Sbjct:   319 NNA----CGIANLASFP 331


>UNIPROTKB|Q3ZKN1
            symbol:CTSK "Cathepsin K" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:AY738221
            RefSeq:NP_001029168.1 UniGene:Cfa.588 HSSP:P43235
            ProteinModelPortal:Q3ZKN1 SMR:Q3ZKN1 STRING:Q3ZKN1 GeneID:608843
            KEGG:cfa:608843 InParanoid:Q3ZKN1 NextBio:20894470 Uniprot:Q3ZKN1
        Length = 330

 Score = 410 (149.4 bits), Expect = 6.1e-58, Sum P(2) = 6.1e-58
 Identities = 90/216 (41%), Positives = 130/216 (60%)

Query:    53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQ-NL---SFKLTD 107
             Y  + ++ +++ W K Y ++Y S+ DE  RR  I+  N+++I   N + +L   +++L  
Sbjct:    18 YPEEILDTQWDLWKKTYRKQYNSKVDELSRRL-IWEKNLKHISIHNLEASLGVHTYELAM 76

Query:   108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
             N   D+++EE +    G   P +  R     Y+       P SVD+RK+G VTPVK+QGQ
Sbjct:    77 NHLGDMTSEEVVQKMTGLKVPPSHSRSNDTLYIPDWESRAPDSVDYRKKGYVTPVKNQGQ 136

Query:   163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
             CGSCWAFS+V A+EG  K KTGKL++LS Q LVDC   SEN GC GGYM  AF+++ K  
Sbjct:   137 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 194

Query:   223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             G+ +ED YPY G+++ C  + T   A    GY  IP
Sbjct:   195 GIDSEDAYPYVGQDESCMYNPTGK-AAKCRGYREIP 229

 Score = 203 (76.5 bits), Expect = 6.1e-58, Sum P(2) = 6.1e-58
 Identities = 40/77 (51%), Positives = 49/77 (63%)

Query:   262 AFQLYSHGVF-DEYCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
             +FQ YS GV+ DE C    LNH V  VGYG   G K+W++KNSWG +WG  GYI MARN 
Sbjct:   256 SFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNK 315

Query:   320 PSSNIGICGILMQASYP 336
              ++    CGI   AS+P
Sbjct:   316 NNA----CGIANLASFP 328


>UNIPROTKB|P43235
            symbol:CTSK "Cathepsin K" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0001957
            "intramembranous ossification" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0045453 "bone resorption"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=TAS] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=TAS] [GO:0036021 "endolysosome lumen" evidence=TAS]
            [GO:0045087 "innate immune response" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            Reactome:REACT_118779 Reactome:REACT_6900 GO:GO:0005615
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087 GO:GO:0045453
            EMBL:CH471121 EMBL:AL355860 GO:GO:0004197 GO:GO:0001957
            HOVERGEN:HBG011513 GO:GO:0036021 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:U13665 EMBL:X82153
            EMBL:U20280 EMBL:S79895 EMBL:CR541675 EMBL:AL356292 EMBL:BC016058
            IPI:IPI00300599 PIR:JC2476 RefSeq:NP_000387.1 UniGene:Hs.632466
            PDB:1ATK PDB:1AU0 PDB:1AU2 PDB:1AU3 PDB:1AU4 PDB:1AYU PDB:1AYV
            PDB:1AYW PDB:1BGO PDB:1BY8 PDB:1MEM PDB:1NL6 PDB:1NLJ PDB:1Q6K
            PDB:1SNK PDB:1TU6 PDB:1U9V PDB:1U9W PDB:1U9X PDB:1VSN PDB:1YK7
            PDB:1YK8 PDB:1YT7 PDB:2ATO PDB:2AUX PDB:2AUZ PDB:2BDL PDB:2R6N
            PDB:3C9E PDB:3H7D PDB:3KW9 PDB:3KWB PDB:3KWZ PDB:3KX1 PDB:3O0U
            PDB:3O1G PDB:3OVZ PDB:4DMX PDB:4DMY PDB:7PCK PDBsum:1ATK
            PDBsum:1AU0 PDBsum:1AU2 PDBsum:1AU3 PDBsum:1AU4 PDBsum:1AYU
            PDBsum:1AYV PDBsum:1AYW PDBsum:1BGO PDBsum:1BY8 PDBsum:1MEM
            PDBsum:1NL6 PDBsum:1NLJ PDBsum:1Q6K PDBsum:1SNK PDBsum:1TU6
            PDBsum:1U9V PDBsum:1U9W PDBsum:1U9X PDBsum:1VSN PDBsum:1YK7
            PDBsum:1YK8 PDBsum:1YT7 PDBsum:2ATO PDBsum:2AUX PDBsum:2AUZ
            PDBsum:2BDL PDBsum:2R6N PDBsum:3C9E PDBsum:3H7D PDBsum:3KW9
            PDBsum:3KWB PDBsum:3KWZ PDBsum:3KX1 PDBsum:3O0U PDBsum:3O1G
            PDBsum:3OVZ PDBsum:4DMX PDBsum:4DMY PDBsum:7PCK
            ProteinModelPortal:P43235 SMR:P43235 DIP:DIP-39993N IntAct:P43235
            STRING:P43235 PhosphoSite:P43235 DMDM:1168793 PaxDb:P43235
            PRIDE:P43235 DNASU:1513 Ensembl:ENST00000271651 GeneID:1513
            KEGG:hsa:1513 UCSC:uc001evp.2 GeneCards:GC01M150768 HGNC:HGNC:2536
            MIM:265800 MIM:601105 neXtProt:NX_P43235 Orphanet:763
            PharmGKB:PA27034 InParanoid:P43235 OMA:LKVPPSH PhylomeDB:P43235
            BindingDB:P43235 ChEMBL:CHEMBL268 EvolutionaryTrace:P43235
            GenomeRNAi:1513 NextBio:6267 ArrayExpress:P43235 Bgee:P43235
            CleanEx:HS_CTSK CleanEx:HS_CTSO Genevestigator:P43235
            GermOnline:ENSG00000143387 Uniprot:P43235
        Length = 329

 Score = 410 (149.4 bits), Expect = 6.1e-58, Sum P(2) = 6.1e-58
 Identities = 91/216 (42%), Positives = 129/216 (59%)

Query:    53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQ-NL---SFKLTD 107
             Y  + ++  +E W K + ++Y ++ DE  RR  I+  N++YI   N + +L   +++L  
Sbjct:    17 YPEEILDTHWELWKKTHRKQYNNKVDEISRRL-IWEKNLKYISIHNLEASLGVHTYELAM 75

Query:   108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYL----G-LPASVDWRKEGAVTPVKDQGQ 162
             N   D+++EE +    G   P +  R     Y+    G  P SVD+RK+G VTPVK+QGQ
Sbjct:    76 NHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTPVKNQGQ 135

Query:   163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
             CGSCWAFS+V A+EG  K KTGKL++LS Q LVDC   SEN GC GGYM  AF+++ K  
Sbjct:   136 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 193

Query:   223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             G+ +ED YPY G+ + C  + T   A    GY  IP
Sbjct:   194 GIDSEDAYPYVGQEESCMYNPTGK-AAKCRGYREIP 228

 Score = 203 (76.5 bits), Expect = 6.1e-58, Sum P(2) = 6.1e-58
 Identities = 40/77 (51%), Positives = 49/77 (63%)

Query:   262 AFQLYSHGVF-DEYCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
             +FQ YS GV+ DE C    LNH V  VGYG   G K+W++KNSWG +WG  GYI MARN 
Sbjct:   255 SFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNK 314

Query:   320 PSSNIGICGILMQASYP 336
              ++    CGI   AS+P
Sbjct:   315 NNA----CGIANLASFP 327


>UNIPROTKB|F1PMM9 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9615 "Canis lupus
            familiaris" [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00639 GO:GO:0004197 GO:GO:0002250
            GeneTree:ENSGT00560000076577 GO:GO:0097067 EMBL:AAEX03000499
            Ensembl:ENSCAFT00000002029 OMA:EFKQVLN Uniprot:F1PMM9
        Length = 341

 Score = 411 (149.7 bits), Expect = 1.3e-57, Sum P(2) = 1.3e-57
 Identities = 92/232 (39%), Positives = 138/232 (59%)

Query:    31 SLFLLWV-LGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDE-WQRRFGIYSS 88
             SLFL  + LGI + A     PQ+    S++  +  W + + + Y  ++E W+R   ++  
Sbjct:    12 SLFLAALCLGIASAA-----PQQ--DHSLDAHWSQWKEAHGKLYDKDEEGWRRT--VWER 62

Query:    89 NVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGYN-KPYNEPR-WPSVQYLGL 142
             N++ I+  N +      SF L  N F D++NEEF      +  + + + + +P+  +  +
Sbjct:    63 NMEMIEQHNQEYSQGEHSFTLAMNAFGDMTNEEFKQVLNDFKIQKHKKGKVFPAPLFAEV 122

Query:   143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
             P+SVDWR++G VTPVKDQGQC  CWAFSA  A+EG    KTGKLVSLSEQ LVDC  +  
Sbjct:   123 PSSVDWREQGYVTPVKDQGQCLGCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWSQG 182

Query:   203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
             N+GCNGG ME AF+++   GG+ +E+ YPY  +N+ C+    K  A  +T +
Sbjct:   183 NRGCNGGLMEYAFQYVKDNGGLDSEESYPYLARNEPCKYRPEKS-AANVTAF 233

 Score = 199 (75.1 bits), Expect = 1.3e-57, Sum P(2) = 1.3e-57
 Identities = 40/82 (48%), Positives = 54/82 (65%)

Query:   262 AFQLYSHGVF-DEYCGHQL-NHGVTVVGYG----EDHGEKYWLVKNSWGTSWGEAGYIRM 315
             +FQ Y  G++ D  C ++L NHGV VVGYG    E   +KYW+VKNSWGT+WG  GY+ +
Sbjct:   263 SFQFYKKGIYYDPKCSNKLLNHGVLVVGYGFEGAESDNKKYWIVKNSWGTNWGMQGYMLL 322

Query:   316 ARNSPSSNIGICGILMQASYPV 337
             A++  +     CGI  +ASYPV
Sbjct:   323 AKDRDNH----CGIATRASYPV 340


>UNIPROTKB|Q28944 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9823 "Sus scrofa"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 KO:K01365 OrthoDB:EOG48PMKF MEROPS:C01.032
            CTD:1514 EMBL:D37917 EMBL:AJ315771 PIR:A58195 RefSeq:NP_999057.1
            UniGene:Ssc.54036 ProteinModelPortal:Q28944 SMR:Q28944
            STRING:Q28944 Ensembl:ENSSSCT00000012233 GeneID:396926
            KEGG:ssc:396926 OMA:DASETGK ArrayExpress:Q28944 Uniprot:Q28944
        Length = 334

 Score = 425 (154.7 bits), Expect = 1.6e-57, Sum P(2) = 1.6e-57
 Identities = 102/237 (43%), Positives = 138/237 (58%)

Query:    31 SLFLLWV-LGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
             SLFL  + LGI + A     P K D Q+++  +  W   + R YG  +E  RR  ++  N
Sbjct:     4 SLFLTALCLGIASAA-----P-KLD-QNLDADWYKWKATHGRLYGMNEEGWRR-AVWEKN 55

Query:    90 VQYIDYIN---SQNL-SFKLTDNKFADLSNEEFISTYLGY-NKPYNEPR-WPSVQYLGLP 143
             ++ I+  N   SQ    F +  N F D++NEEF     G+ N+ + + + +     L +P
Sbjct:    56 MKMIELHNQEYSQGKHGFSMAMNAFGDMTNEEFRQVMNGFQNQKHKKGKVFHESLVLEVP 115

Query:   144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN 203
              SVDWR++G VT VK+QGQCGSCWAFSA  A+EG    KTGKLVSLSEQ LVDC     N
Sbjct:   116 KSVDWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGN 175

Query:   204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR 260
             QGCNGG M+ AF+++   GG+ TE+ YPY G+     T K +  A   TG+  IP R
Sbjct:   176 QGCNGGLMDNAFQYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDTGFVDIPQR 232

 Score = 184 (69.8 bits), Expect = 1.6e-57, Sum P(2) = 1.6e-57
 Identities = 44/106 (41%), Positives = 60/106 (56%)

Query:   240 QTDKTKHHAVTITG--YEAIPARYA-FQLYSHGVF-DEYCGHQ-LNHGVTVVGYG----E 290
             Q +K    AV   G    AI A ++ FQ Y  G++ D  C  + L+HGV VVGYG    +
Sbjct:   231 QREKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTD 290

Query:   291 DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
              +  K+W+VKNSWG  WG  GY++MA++  +     CGI   ASYP
Sbjct:   291 SNSSKFWIVKNSWGPEWGWNGYVKMAKDQNNH----CGISTAASYP 332


>FB|FBgn0013770 [details] [associations]
            symbol:Cp1 "Cysteine proteinase-1" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS;NAS] [GO:0030163 "protein catabolic process"
            evidence=NAS] [GO:0005764 "lysosome" evidence=NAS] [GO:0048102
            "autophagic cell death" evidence=IEP] [GO:0035071 "salivary gland
            cell autophagic cell death" evidence=IEP] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0045169 "fusome" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:AE013599 GO:GO:0007586 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0035071 GO:GO:0045169 GeneTree:ENSGT00660000095458 KO:K01365
            EMBL:U75652 EMBL:AF012089 EMBL:BT016071 EMBL:D31970
            RefSeq:NP_523735.2 RefSeq:NP_725347.1 RefSeq:NP_725348.1
            UniGene:Dm.7400 ProteinModelPortal:Q95029 SMR:Q95029 IntAct:Q95029
            MINT:MINT-814156 STRING:Q95029 MEROPS:C01.092 PaxDb:Q95029
            EnsemblMetazoa:FBtr0087593 GeneID:36546 KEGG:dme:Dmel_CG6692
            CTD:36546 FlyBase:FBgn0013770 InParanoid:Q95029 OMA:ICHGADP
            OrthoDB:EOG46M91C PhylomeDB:Q95029 GenomeRNAi:36546 NextBio:799136
            Bgee:Q95029 GermOnline:CG6692 Uniprot:Q95029
        Length = 371

 Score = 379 (138.5 bits), Expect = 1.6e-57, Sum P(2) = 1.6e-57
 Identities = 90/216 (41%), Positives = 119/216 (55%)

Query:    58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADL 113
             MEE +  +  ++ + Y  E E + R  I++ N   I   N +     +SFKL  NK+ADL
Sbjct:    56 MEE-WHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 114

Query:   114 SNEEFISTYLGYNKPYN------EPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
              + EF     G+N   +      +  +  V ++      LP SVDWR +GAVT VKDQG 
Sbjct:   115 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 174

Query:   163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
             CGSCWAFS+  A+EG +  K+G LVSLSEQ LVDC     N GCNGG M+ AF +I   G
Sbjct:   175 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 234

Query:   223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             G+ TE  YPY   +D C  +K    A T  G+  IP
Sbjct:   235 GIDTEKSYPYEAIDDSCHFNKGTVGA-TDRGFTDIP 269

 Score = 230 (86.0 bits), Expect = 1.6e-57, Sum P(2) = 1.6e-57
 Identities = 55/112 (49%), Positives = 69/112 (61%)

Query:   233 RGKNDRCQTDKTKH-HAVTITG--YEAIPARY-AFQLYSHGVFDE-YCGHQ-LNHGVTVV 286
             RG  D  Q D+ K   AV   G    AI A + +FQ YS GV++E  C  Q L+HGV VV
Sbjct:   263 RGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVV 322

Query:   287 GYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
             G+G D  GE YWLVKNSWGT+WG+ G+I+M RN  +     CGI   +SYP+
Sbjct:   323 GFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRNKENQ----CGIASASSYPL 370


>UNIPROTKB|P25975 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 EMBL:X91755 EMBL:BC102312 EMBL:AB017648
            IPI:IPI00687440 PIR:S15845 RefSeq:NP_776457.1 UniGene:Bt.3987
            ProteinModelPortal:P25975 SMR:P25975 STRING:P25975
            Ensembl:ENSBTAT00000022710 Ensembl:ENSBTAT00000036427 GeneID:281108
            KEGG:bta:281108 CTD:1515 InParanoid:P25975 KO:K01365 OMA:EEFRATH
            OrthoDB:EOG48PMKF BindingDB:P25975 ChEMBL:CHEMBL2113
            NextBio:20805179 ArrayExpress:P25975 Uniprot:P25975
        Length = 334

 Score = 423 (154.0 bits), Expect = 2.0e-57, Sum P(2) = 2.0e-57
 Identities = 98/238 (41%), Positives = 137/238 (57%)

Query:    31 SLFL-LWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYG-SEDEWQRRFGIYSS 88
             S FL +  LG+ + A     P K DP +++  +  W   + R YG +E+EW+R   ++  
Sbjct:     4 SFFLTVLCLGVASAA-----P-KLDP-NLDAHWHQWKATHRRLYGMNEEEWRR--AVWEK 54

Query:    89 NVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGY-NKPYNEPR-WPSVQYLGL 142
             N + ID  N +       F++  N F D++NEEF     G+ N+ + + + +     + +
Sbjct:    55 NKKIIDLHNQEYSEGKHGFRMAMNAFGDMTNEEFRQVMNGFQNQKHKKGKLFHEPLLVDV 114

Query:   143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
             P SVDW K+G VTPVK+QGQCGSCWAFSA  A+EG    KTGKLVSLSEQ LVDC     
Sbjct:   115 PKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQG 174

Query:   203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR 260
             NQGCNGG M+ AF++I   GG+ +E+ YPY   +      K +  A   TG+  IP R
Sbjct:   175 NQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQR 232

 Score = 185 (70.2 bits), Expect = 2.0e-57, Sum P(2) = 2.0e-57
 Identities = 44/106 (41%), Positives = 60/106 (56%)

Query:   240 QTDKTKHHAVTITG--YEAIPARY-AFQLYSHGVF-DEYCGHQ-LNHGVTVVGYG----E 290
             Q +K    AV   G    AI A + +FQ Y  G++ D  C  + L+HGV VVGYG    +
Sbjct:   231 QREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTD 290

Query:   291 DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
              +  K+W+VKNSWG  WG  GY++MA++  +     CGI   ASYP
Sbjct:   291 SNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNH----CGIATAASYP 332


>RGD|61810 [details] [associations]
            symbol:Ctsk "cathepsin K" species:10116 "Rattus norvegicus"
           [GO:0001957 "intramembranous ossification" evidence=IEP] [GO:0005615
           "extracellular space" evidence=IDA] [GO:0005737 "cytoplasm"
           evidence=IDA] [GO:0005764 "lysosome" evidence=IDA] [GO:0006508
           "proteolysis" evidence=TAS] [GO:0008234 "cysteine-type peptidase
           activity" evidence=TAS] [GO:0045453 "bone resorption" evidence=IMP]
           InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
           Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
           RGD:61810 GO:GO:0005615 eggNOG:COG4870 HOGENOM:HOG000230774
           GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
           InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
           PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
           GO:GO:0045453 GO:GO:0001957 GeneTree:ENSGT00560000076577
           HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
           OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:AF010306 EMBL:BC078793
           IPI:IPI00206378 RefSeq:NP_113748.1 UniGene:Rn.5598
           ProteinModelPortal:O35186 SMR:O35186 STRING:O35186
           PhosphoSite:O35186 PRIDE:O35186 Ensembl:ENSRNOT00000028730
           GeneID:29175 KEGG:rno:29175 UCSC:RGD:61810 InParanoid:O35186
           OMA:YKEIPEG BindingDB:O35186 ChEMBL:CHEMBL3034 NextBio:608248
           Genevestigator:O35186 GermOnline:ENSRNOG00000021155 Uniprot:O35186
        Length = 329

 Score = 396 (144.5 bits), Expect = 2.6e-57, Sum P(2) = 2.6e-57
 Identities = 86/213 (40%), Positives = 132/213 (61%)

Query:    56 QSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQ-NL---SFKLTDNKF 110
             ++++ ++E W K + ++Y S+ DE  RR  I+  N++ I   N + +L   +++L  N  
Sbjct:    20 ETLDTQWELWKKTHGKQYNSKVDEISRRL-IWEKNLKKISVHNLEASLGAHTYELAMNHL 78

Query:   111 ADLSNEEFISTYLGYNKP----YNEPRWPSVQYLG-LPASVDWRKEGAVTPVKDQGQCGS 165
              D+++EE +    G   P    ++     + ++ G +P S+D+RK+G VTPVK+QGQCGS
Sbjct:    79 GDMTSEEVVQKMTGLRVPPSRSFSNDTLYTPEWEGRVPDSIDYRKKGYVTPVKNQGQCGS 138

Query:   166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
             CWAFS+  A+EG  K KTGKL++LS Q LVDC   SEN GC GGYM  AF+++ + GG+ 
Sbjct:   139 CWAFSSAGALEGQLKKKTGKLLALSPQNLVDCV--SENYGCGGGYMTTAFQYVQQNGGID 196

Query:   226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             +ED YPY G+++ C  + T   A    GY  IP
Sbjct:   197 SEDAYPYVGQDESCMYNATAK-AAKCRGYREIP 228

 Score = 211 (79.3 bits), Expect = 2.6e-57, Sum P(2) = 2.6e-57
 Identities = 40/77 (51%), Positives = 50/77 (64%)

Query:   262 AFQLYSHGVF-DEYCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
             +FQ YS GV+ DE C    +NH V VVGYG   G KYW++KNSWG SWG  GY+ +ARN 
Sbjct:   255 SFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQKGNKYWIIKNSWGESWGNKGYVLLARNK 314

Query:   320 PSSNIGICGILMQASYP 336
              ++    CGI   AS+P
Sbjct:   315 NNA----CGITNLASFP 327


>UNIPROTKB|F1SS93 [details] [associations]
            symbol:CTSS "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0002250 "adaptive immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0016020 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0002250
            GeneTree:ENSGT00560000076577 GO:GO:0097067 EMBL:CU463875
            Ensembl:ENSSSCT00000007284 OMA:CEIESAV Uniprot:F1SS93
        Length = 342

 Score = 395 (144.1 bits), Expect = 2.6e-57, Sum P(2) = 2.6e-57
 Identities = 89/239 (37%), Positives = 139/239 (58%)

Query:    28 AVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYS 87
             +++   L+WVL + + A ++ +    DP +++  ++ W K Y ++Y  ++E   R  I+ 
Sbjct:     9 SIIMKCLVWVLLLCSSAMAQLHR---DP-TLDRHWDLWKKTYGKQYKEKNEEVARRLIWE 64

Query:    88 SNVQYIDYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQYL 140
              N++ +   N ++     S+ L  N   D+++EE IS       P   PR   + S    
Sbjct:    65 KNLKTVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQ 124

Query:   141 GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVN 200
              LP S+DWR++G VT VK QG CGSCWAFSAV A+E   K+KTG+LVSLS Q LVDC   
Sbjct:   125 KLPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTE 184

Query:   201 S-ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
                N+GCNGG+M +AF++I    G+ +E  YPY+  + +C+ D +K+ A T + Y  +P
Sbjct:   185 KYRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYD-SKNRAATCSRYTELP 242

 Score = 212 (79.7 bits), Expect = 2.6e-57, Sum P(2) = 2.6e-57
 Identities = 44/83 (53%), Positives = 55/83 (66%)

Query:   256 AIPARYA-FQLYSHGVF-DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYI 313
             AI A+++ F  Y  GV+ D  C   +NHGV VVGYG  +G+ YWLVKNSWG ++G+ GYI
Sbjct:   262 AIDAKHSSFFFYRSGVYYDPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDGGYI 321

Query:   314 RMARNSPSSNIGICGILMQASYP 336
             RMARNS +     CGI    SYP
Sbjct:   322 RMARNSENH----CGIANYPSYP 340


>UNIPROTKB|P25326 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9913 "Bos taurus"
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0016020 "membrane" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0002250 "adaptive
            immune response" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0016020 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0002250 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            GO:GO:0097067 EMBL:BC102245 EMBL:M95211 EMBL:X62001 IPI:IPI00702008
            PIR:S15844 RefSeq:NP_001028787.1 UniGene:Bt.7938
            ProteinModelPortal:P25326 SMR:P25326 STRING:P25326 PRIDE:P25326
            Ensembl:ENSBTAT00000022774 GeneID:327711 KEGG:bta:327711 CTD:1520
            InParanoid:P25326 KO:K01368 OMA:KAMDQKC OrthoDB:EOG4JM7Q2
            NextBio:20810175 Uniprot:P25326
        Length = 331

 Score = 399 (145.5 bits), Expect = 6.9e-57, Sum P(2) = 6.9e-57
 Identities = 91/234 (38%), Positives = 135/234 (57%)

Query:    33 FLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQY 92
             +L+W L + + A +  +    DP +++  ++ W K Y ++Y  ++E   R  I+  N++ 
Sbjct:     3 WLVWALLLCSSAMAHVHR---DP-TLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKT 58

Query:    93 IDYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQYLGLPAS 145
             +   N ++     S++L  N   D+++EE IS       P   PR   + S     LP S
Sbjct:    59 VTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVPSQWPRNVTYKSDPNQKLPDS 118

Query:   146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE-NQ 204
             +DWR++G VT VK QG CGSCWAFSAV A+E   KLKTGKLVSLS Q LVDC      N+
Sbjct:   119 MDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNK 178

Query:   205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             GCNGG+M +AF++I    G+ +E  YPY+  + +CQ D  K+ A T + Y  +P
Sbjct:   179 GCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYD-VKNRAATCSRYIELP 231

 Score = 204 (76.9 bits), Expect = 6.9e-57, Sum P(2) = 6.9e-57
 Identities = 42/76 (55%), Positives = 49/76 (64%)

Query:   262 AFQLYSHGVF-DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
             +F LY  GV+ D  C   +NHGV VVGYG   G+ YWLVKNSWG  +G+ GYIRMARNS 
Sbjct:   258 SFFLYKTGVYYDPSCTQNVNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQGYIRMARNSG 317

Query:   321 SSNIGICGILMQASYP 336
             +     CGI    SYP
Sbjct:   318 NH----CGIANYPSYP 329


>TAIR|locus:2097104 [details] [associations]
            symbol:AT3G43960 species:3702 "Arabidopsis thaliana"
            [GO:0005886 "plasma membrane" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0031225 "anchored to
            membrane" evidence=TAS] [GO:0048767 "root hair elongation"
            evidence=IMP] [GO:0016132 "brassinosteroid biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:CP002686
            GenomeReviews:BA000014_GR GO:GO:0031225 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0048767 MEROPS:I29.003 HOGENOM:HOG000230773
            EMBL:AL163975 EMBL:AK118634 IPI:IPI00526842 PIR:T48950
            RefSeq:NP_566867.1 UniGene:At.43352 ProteinModelPortal:Q9LXW3
            SMR:Q9LXW3 STRING:Q9LXW3 PaxDb:Q9LXW3 PRIDE:Q9LXW3
            EnsemblPlants:AT3G43960.1 GeneID:823513 KEGG:ath:AT3G43960
            TAIR:At3g43960 eggNOG:NOG286334 InParanoid:Q9LXW3 KO:K01376
            OMA:MAISFRT PhylomeDB:Q9LXW3 ProtClustDB:CLSN2917367
            Genevestigator:Q9LXW3 GermOnline:AT3G43960 Uniprot:Q9LXW3
        Length = 376

 Score = 432 (157.1 bits), Expect = 1.1e-56, Sum P(2) = 1.1e-56
 Identities = 104/251 (41%), Positives = 151/251 (60%)

Query:    17 IAIDMRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSE 76
             +AI  R +   A+L+L +L ++ I  G  +    Q+ + + +   +E WL +  + Y   
Sbjct:     1 MAISFRTL---ALLTLSVL-LISISLGVVTATESQRNEGEVLT-MYEQWLVENGKNYNGL 55

Query:    77 DEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLG--YNKPYNEPR 133
              E +RRF I+  N++ I+  NS  N S++   NKF+DL+ +EF ++YLG    K      
Sbjct:    56 GEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQASYLGGKMEKKSLSDV 115

Query:   134 WPSVQYL-G--LPASVDWRKEGAVTP-VKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL 189
                 QY  G  LP  VDWR+ GAV P VK QG+CGSCWAF+A  AVEGIN++ TG+LVSL
Sbjct:   116 AERYQYKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSL 175

Query:   190 SEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDR-CQTDKTKH-H 247
             SEQEL+DCD  ++N GC GG    AFEFI + GG+ +++ Y Y G++   C+  + K   
Sbjct:   176 SEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVYGYTGEDTAACKAIEMKTTR 235

Query:   248 AVTITGYEAIP 258
              VTI G+E +P
Sbjct:   236 VVTINGHEVVP 246

 Score = 169 (64.5 bits), Expect = 1.1e-56, Sum P(2) = 1.1e-56
 Identities = 32/76 (42%), Positives = 44/76 (57%)

Query:   266 YSHGVFDEYCGHQL-NHGVTVVGYG--EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSS 322
             Y  GV+   C +   +H V +VGYG   D G+ YWL++NSWG  WGE GY+R+ RN    
Sbjct:   274 YKSGVYKGACSNLWGDHNVLIVGYGTSSDEGD-YWLIRNSWGPEWGEGGYLRLQRNFHEP 332

Query:   323 NIGICGILMQASYPVK 338
               G C + +   YP+K
Sbjct:   333 T-GKCAVAVAPVYPIK 347


>MGI|MGI:107823 [details] [associations]
            symbol:Ctsk "cathepsin K" species:10090 "Mus musculus"
            [GO:0005615 "extracellular space" evidence=ISO] [GO:0005737
            "cytoplasm" evidence=ISO] [GO:0005764 "lysosome" evidence=ISO]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0045453 "bone resorption" evidence=ISO]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:107823 GO:GO:0005615 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0045453
            GO:GO:0001957 HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 OMA:LKVPPSH EMBL:X94444
            EMBL:AJ006033 EMBL:BC046320 IPI:IPI00316575 PIR:S74227
            RefSeq:NP_031828.2 UniGene:Mm.272085 ProteinModelPortal:P55097
            SMR:P55097 MINT:MINT-3089515 STRING:P55097 PhosphoSite:P55097
            PRIDE:P55097 Ensembl:ENSMUST00000015664 GeneID:13038 KEGG:mmu:13038
            InParanoid:P55097 BioCyc:MetaCyc:MONOMER-14811 ChEMBL:CHEMBL1075277
            NextBio:282924 Bgee:P55097 CleanEx:MM_CTSK Genevestigator:P55097
            GermOnline:ENSMUSG00000028111 Uniprot:P55097
        Length = 329

 Score = 396 (144.5 bits), Expect = 1.4e-56, Sum P(2) = 1.4e-56
 Identities = 88/215 (40%), Positives = 132/215 (61%)

Query:    55 PQSM-EERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQ-NL---SFKLTDN 108
             P+ M + ++E W K + ++Y S+ DE  RR  I+  N++ I   N + +L   +++L  N
Sbjct:    18 PEEMLDTQWELWKKTHQKQYNSKVDEISRRL-IWEKNLKQISAHNLEASLGVHTYELAMN 76

Query:   109 KFADLSNEEFISTYLGYNKP----YNEPRWPSVQYLG-LPASVDWRKEGAVTPVKDQGQC 163
                D+++EE +    G   P    Y+     + ++ G +P S+D+RK+G VTPVK+QGQC
Sbjct:    77 HLGDMTSEEVVQKMTGLRIPPSRSYSNDTLYTPEWEGRVPDSIDYRKKGYVTPVKNQGQC 136

Query:   164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
             GSCWAFS+  A+EG  K KTGKL++LS Q LVDC   +EN GC GGYM  AF+++ + GG
Sbjct:   137 GSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCV--TENYGCGGGYMTTAFQYVQQNGG 194

Query:   224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             + +ED YPY G+++ C  + T   A    GY  IP
Sbjct:   195 IDSEDAYPYVGQDESCMYNATAK-AAKCRGYREIP 228

 Score = 204 (76.9 bits), Expect = 1.4e-56, Sum P(2) = 1.4e-56
 Identities = 39/77 (50%), Positives = 49/77 (63%)

Query:   262 AFQLYSHGVF-DEYCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
             +FQ YS GV+ DE C    +NH V VVGYG   G K+W++KNSWG SWG  GY  +ARN 
Sbjct:   255 SFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQKGSKHWIIKNSWGESWGNKGYALLARNK 314

Query:   320 PSSNIGICGILMQASYP 336
              ++    CGI   AS+P
Sbjct:   315 NNA----CGITNMASFP 327


>UNIPROTKB|Q5E998 [details] [associations]
            symbol:CTSL2 "Cathepsin L2" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            HOVERGEN:HBG011513 UniGene:Bt.3987 MEROPS:C01.032 EMBL:BT021022
            IPI:IPI00711962 ProteinModelPortal:Q5E998 SMR:Q5E998 STRING:Q5E998
            InParanoid:Q5E998 Uniprot:Q5E998
        Length = 334

 Score = 414 (150.8 bits), Expect = 1.8e-56, Sum P(2) = 1.8e-56
 Identities = 97/238 (40%), Positives = 136/238 (57%)

Query:    31 SLFL-LWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYG-SEDEWQRRFGIYSS 88
             S FL +  LG+ + A     P K DP +++  +  W   + R YG +E+EW+R   ++  
Sbjct:     4 SFFLTVLCLGVASAA-----P-KLDP-NLDAHWHQWKATHRRLYGMNEEEWRR--AVWEK 54

Query:    89 NVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGY-NKPYNEPR-WPSVQYLGL 142
             N + ID  N +       F++  N F D++NEEF     G+ N+ + + + +     + +
Sbjct:    55 NKKIIDLHNQEYSEGKHGFRMAMNAFGDMTNEEFRQVMNGFQNQKHKKGKLFHEPLLVDV 114

Query:   143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
             P SVDW K+G VTPVK+QGQCGSCWAFSA  A+EG    KTGKLVSLSEQ LVDC     
Sbjct:   115 PKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQG 174

Query:   203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR 260
             NQGCNGG M+ AF++I   G + +E+ YPY   +      K +  A   TG+  IP R
Sbjct:   175 NQGCNGGLMDNAFQYIKDNGCLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQR 232

 Score = 185 (70.2 bits), Expect = 1.8e-56, Sum P(2) = 1.8e-56
 Identities = 44/106 (41%), Positives = 60/106 (56%)

Query:   240 QTDKTKHHAVTITG--YEAIPARY-AFQLYSHGVF-DEYCGHQ-LNHGVTVVGYG----E 290
             Q +K    AV   G    AI A + +FQ Y  G++ D  C  + L+HGV VVGYG    +
Sbjct:   231 QREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTD 290

Query:   291 DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
              +  K+W+VKNSWG  WG  GY++MA++  +     CGI   ASYP
Sbjct:   291 SNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNH----CGIATAASYP 332


>RGD|1560071 [details] [associations]
            symbol:Ctsll3 "cathepsin L-like 3" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1560071 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:CH474032 IPI:IPI00560469 RefSeq:XP_001065834.2
            RefSeq:XP_573976.3 UniGene:Rn.104851 MEROPS:C01.107
            Ensembl:ENSRNOT00000061398 GeneID:498691 KEGG:rno:498691
            UCSC:RGD:1560071 CTD:70202 OMA:NCGIASD OrthoDB:EOG4HDSTZ
            NextBio:700548 Uniprot:D3ZJV2
        Length = 330

 Score = 399 (145.5 bits), Expect = 2.9e-56, Sum P(2) = 2.9e-56
 Identities = 87/214 (40%), Positives = 127/214 (59%)

Query:    53 YDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI-----DYINSQNLSFKLTD 107
             +DP S +  +E W  ++ + Y + +E Q+R  ++ +N++ I     DY+  ++  F L  
Sbjct:    21 HDP-SFDTVWEEWKTKHGKTYNTNEEGQKR-AVWENNMKMINLHNEDYLKGKH-GFSLEM 77

Query:   108 NKFADLSNEEFISTYLGYN--KPYNEPRWPSVQYLG-LPASVDWRKEGAVTPVKDQGQCG 164
             N F DL+N EF     G+   K      +P   +LG +P +VDWRK G VTPVK+QG CG
Sbjct:    78 NAFGDLTNTEFRELMTGFQGQKTKMMKVFPE-PFLGDVPKTVDWRKHGYVTPVKNQGPCG 136

Query:   165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
             SCWAFSAV ++EG    KTGKLV LSEQ LVDC  +  N+GC+GG  + AF+++   GG+
Sbjct:   137 SCWAFSAVGSLEGQVFRKTGKLVPLSEQNLVDCSWSHGNKGCDGGLPDFAFQYVKDNGGL 196

Query:   225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
              T   YPY   N  C+ +  K+ A  + G+ +IP
Sbjct:   197 DTSVSYPYEALNGTCRYNP-KYSAAKVVGFMSIP 229

 Score = 198 (74.8 bits), Expect = 2.9e-56, Sum P(2) = 2.9e-56
 Identities = 42/79 (53%), Positives = 52/79 (65%)

Query:   262 AFQLYSHGVFDEY-CGH-QLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARN 318
             +FQ Y  G++ E  C    LNH V VVGYGE+  G KYWLVKNSWG  WG  GYI+MA++
Sbjct:   255 SFQFYKGGMYYEPDCSSTNLNHAVLVVGYGEESDGRKYWLVKNSWGRDWGMDGYIKMAKD 314

Query:   319 SPSSNIGICGILMQASYPV 337
               ++N   CGI   ASYP+
Sbjct:   315 W-NNN---CGIASDASYPI 329
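
Note: each alignment block in this report is preceded by two summary lines ("Score = ..." and "Identities = ..."). The sketch below is not part of the BLAST output. It shows one possible way to read those two lines into numbers and recompute the percentages. The function name parse_hsp_summary and the dictionary keys are hypothetical, and the regular expressions assume the exact line layout printed in this report. The percentages printed here appear to be truncated rather than rounded (52/79 is about 65.8%, printed as 65%).

```python
import re

# Patterns for the two HSP summary lines, assuming the layout used in this report.
SCORE_RE = re.compile(
    r"Score = (\d+) \(([\d.]+) bits\), "
    r"Expect = ([\d.eE+-]+), Sum P\((\d+)\) = ([\d.eE+-]+)"
)
IDENT_RE = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), Positives = (\d+)/(\d+) \((\d+)%\)"
)


def parse_hsp_summary(score_line, ident_line):
    """Return the numeric fields of one HSP summary as a dict (hypothetical helper)."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(ident_line)
    if s is None or i is None:
        raise ValueError("unrecognized HSP summary lines")
    identities = int(i.group(1))
    length = int(i.group(2))
    positives = int(i.group(4))
    return {
        "score": int(s.group(1)),
        "bits": float(s.group(2)),
        "expect": float(s.group(3)),
        "sum_p_n": int(s.group(4)),      # the N in "Sum P(N)"
        "sum_p": float(s.group(5)),
        "identities": identities,
        "positives": positives,
        "length": length,                # aligned length, gaps included
        "pct_identity": 100.0 * identities / length,
        "pct_positive": 100.0 * positives / length,
    }


# The second HSP of the RGD|1560071 hit, copied from the report above.
hsp = parse_hsp_summary(
    " Score = 198 (74.8 bits), Expect = 2.9e-56, Sum P(2) = 2.9e-56",
    " Identities = 42/79 (53%), Positives = 52/79 (65%)",
)
print(hsp["pct_identity"], hsp["pct_positive"])
```

Keeping the raw counts alongside the recomputed percentages makes it easy to apply a different rounding rule or to filter hits by aligned length.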


>UNIPROTKB|Q86GF7
            symbol:Cys "Crustapain" species:6703 "Pandalus borealis"
            [GO:0005576 "extracellular region" evidence=IC] [GO:0007586
            "digestion" evidence=NAS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IDA] [GO:0030163 "protein catabolic process"
            evidence=NAS] [GO:0030574 "collagen catabolic process"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005576
            GO:GO:0007586 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0030163 GO:GO:0030574 EMBL:AB091669
            ProteinModelPortal:Q86GF7 SMR:Q86GF7 MEROPS:C01.030 Uniprot:Q86GF7
        Length = 323

 Score = 413 (150.4 bits), Expect = 4.8e-56, Sum P(2) = 4.8e-56
 Identities = 90/220 (40%), Positives = 132/220 (60%)

Query:    62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEE 117
             +EN+  ++ ++Y + +E   R  ++   +++I   N +     +++ L  N F+DL++EE
Sbjct:    20 WENFKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHEE 79

Query:   118 FISTYLGYNK---PYNE-PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
              ++T  G  +   P +  P+  S     + A VDWR +GAVTPVKDQGQCGSCWAFSAVA
Sbjct:    80 VLATKTGMTRRRHPLSVLPK--SAPTTPMAADVDWRNKGAVTPVKDQGQCGSCWAFSAVA 137

Query:   174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
             A+EG + LKTG LVSLSEQ LVDC  +  NQGCNGG+  +A+++I    G+ TE  YPY+
Sbjct:   138 ALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRGIDTESSYPYK 197

Query:   234 GKNDRCQTDKTKHHAVTITGYEAIPARYAFQLYSHGVFDE 273
               +D C+ D     A T++ Y   PA        H V +E
Sbjct:   198 AIDDNCRYDAGNIGA-TVSSYVE-PASGDESALQHAVQNE 235

 Score = 182 (69.1 bits), Expect = 4.8e-56, Sum P(2) = 4.8e-56
 Identities = 43/97 (44%), Positives = 55/97 (56%)

Query:   247 HAVTITGYEAI---PARYAFQLYSHGVFDE-YCGH-QLNHGVTVVGYGED-HGEKYWLVK 300
             HAV   G  ++     + +F  Y  GV+ E  C     NH VT VGYG D +G  YW+VK
Sbjct:   230 HAVQNEGPVSVCIDAGQSSFGSYGGGVYYEPNCDSWYANHAVTAVGYGTDANGGDYWIVK 289

Query:   301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
             NSWG  WGE+GYI+MARN  ++    C I   + YPV
Sbjct:   290 NSWGAWWGESGYIKMARNRDNN----CAIATYSVYPV 322


>ZFIN|ZDB-GENE-030131-106
            symbol:ctsl1a "cathepsin L, 1 a" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030131-106 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 HSSP:P43235
            KO:K01365 EMBL:BC066490 IPI:IPI00495935 RefSeq:NP_997749.1
            UniGene:Dr.104499 ProteinModelPortal:Q6NYR5 SMR:Q6NYR5
            MEROPS:C01.074 PRIDE:Q6NYR5 GeneID:321453 KEGG:dre:321453
            CTD:321453 InParanoid:Q6NYR5 NextBio:20807387 ArrayExpress:Q6NYR5
            Bgee:Q6NYR5 Uniprot:Q6NYR5
        Length = 337

 Score = 407 (148.3 bits), Expect = 6.1e-56, Sum P(2) = 6.1e-56
 Identities = 83/213 (38%), Positives = 131/213 (61%)

Query:    56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNKFA 111
             Q + + ++ W K +S++Y + +E  RR  I+  N++ I+  N ++     +++L  N F 
Sbjct:    23 QQLNDHWDQWKKWHSKKYHATEEGWRRV-IWEKNLKKIEMHNLEHSMGIHTYRLGMNHFG 81

Query:   112 DLSNEEFISTYLGY----NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
             D+++EEF     G+    ++ +    +    ++ +P  +DWR++G VTPVKDQG+CGSCW
Sbjct:    82 DMTHEEFRQVMNGFKHKKDRRFRGSLFMEPNFIEVPNKLDWREKGYVTPVKDQGECGSCW 141

Query:   168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
             AFS   A+EG    KTGKLVSLSEQ LVDC     N+GCNGG M++AF+++    G+ +E
Sbjct:   142 AFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDQNGLDSE 201

Query:   228 DDYPYRGKNDR-CQTDKTKHHAVTITGYEAIPA 259
             + YPY G +D+ C  D  K+ A   TG+  IP+
Sbjct:   202 ESYPYLGTDDQPCHFDP-KNSAANDTGFVDIPS 233

 Score = 187 (70.9 bits), Expect = 6.1e-56, Sum P(2) = 6.1e-56
 Identities = 41/89 (46%), Positives = 57/89 (64%)

Query:   256 AIPARY-AFQLYSHGVF-DEYCG-HQLNHGVTVVGYG---ED-HGEKYWLVKNSWGTSWG 308
             AI A + +FQ Y  G++ ++ C   +L+HGV  VGYG   ED  G+KYW+VKNSW  +WG
Sbjct:   252 AIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVDGKKYWIVKNSWSENWG 311

Query:   309 EAGYIRMARNSPSSNIGICGILMQASYPV 337
             + GYI MA++  +     CGI   ASYP+
Sbjct:   312 DKGYIYMAKDRHNH----CGIATAASYPL 336


>UNIPROTKB|P07711
            symbol:CTSL1 "Cathepsin L1" species:9606 "Homo sapiens"
            [GO:0005576 "extracellular region" evidence=NAS] [GO:0005764
            "lysosome" evidence=IDA;NAS] [GO:0006508 "proteolysis"
            evidence=IDA] [GO:0002250 "adaptive immune response" evidence=IEP]
            [GO:0019882 "antigen processing and presentation" evidence=TAS]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0036021 "endolysosome lumen" evidence=TAS] [GO:0043202
            "lysosomal lumen" evidence=TAS] [GO:0045087 "innate immune
            response" evidence=TAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042393 "histone binding" evidence=IDA] [GO:0005634 "nucleus"
            evidence=TAS] [GO:0071888 "macrophage apoptotic process"
            evidence=NAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            Reactome:REACT_118779 EMBL:X12451 GO:GO:0005634 Reactome:REACT_6900
            GO:GO:0005576 GO:GO:0019886 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087
            GO:GO:0042393 GO:GO:0004197 GO:GO:0002250 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0036021 KO:K01365 OrthoDB:EOG48PMKF EMBL:M20496
            EMBL:CR457053 EMBL:BX537395 EMBL:AL160279 EMBL:BC012612 EMBL:X05256
            IPI:IPI00012887 PIR:S01002 RefSeq:NP_001244900.1
            RefSeq:NP_001244901.1 RefSeq:NP_001903.1 RefSeq:NP_666023.1
            UniGene:Hs.731507 UniGene:Hs.731952 PDB:1CJL PDB:1CS8 PDB:1ICF
            PDB:1MHW PDB:2NQD PDB:2VHS PDB:2XU1 PDB:2XU3 PDB:2XU4 PDB:2XU5
            PDB:2YJ2 PDB:2YJ8 PDB:2YJ9 PDB:2YJB PDB:2YJC PDB:3BC3 PDB:3H89
            PDB:3H8B PDB:3H8C PDB:3HHA PDB:3HWN PDB:3IV2 PDB:3K24 PDB:3KSE
            PDB:3OF8 PDB:3OF9 PDBsum:1CJL PDBsum:1CS8 PDBsum:1ICF PDBsum:1MHW
            PDBsum:2NQD PDBsum:2VHS PDBsum:2XU1 PDBsum:2XU3 PDBsum:2XU4
            PDBsum:2XU5 PDBsum:2YJ2 PDBsum:2YJ8 PDBsum:2YJ9 PDBsum:2YJB
            PDBsum:2YJC PDBsum:3BC3 PDBsum:3H89 PDBsum:3H8B PDBsum:3H8C
            PDBsum:3HHA PDBsum:3HWN PDBsum:3IV2 PDBsum:3K24 PDBsum:3KSE
            PDBsum:3OF8 PDBsum:3OF9 ProteinModelPortal:P07711 SMR:P07711
            IntAct:P07711 STRING:P07711 MEROPS:I29.001 PhosphoSite:P07711
            DMDM:115741 PaxDb:P07711 PeptideAtlas:P07711 PRIDE:P07711
            DNASU:1514 Ensembl:ENST00000340342 Ensembl:ENST00000343150
            GeneID:1514 KEGG:hsa:1514 UCSC:uc004aph.3 CTD:1514
            GeneCards:GC09P090341 H-InvDB:HIX0058839 H-InvDB:HIX0170314
            HGNC:HGNC:2537 HPA:CAB000459 MIM:116880 neXtProt:NX_P07711
            PharmGKB:PA162382890 InParanoid:P07711 OMA:REPLFAQ PhylomeDB:P07711
            BRENDA:3.4.22.15 BindingDB:P07711 ChEMBL:CHEMBL3837 ChiTaRS:CTSL1
            DrugBank:DB00040 EvolutionaryTrace:P07711 GenomeRNAi:1514
            NextBio:6271 PMAP-CutDB:P07711 ArrayExpress:P07711 Bgee:P07711
            CleanEx:HS_CTSL1 Genevestigator:P07711 GermOnline:ENSG00000135047
            GO:GO:0071888 Uniprot:P07711
        Length = 333

 Score = 411 (149.7 bits), Expect = 7.8e-56, Sum P(2) = 7.8e-56
 Identities = 84/208 (40%), Positives = 122/208 (58%)

Query:    57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFAD 112
             S+E ++  W   ++R YG  +E  RR  ++  N++ I+  N +      SF +  N F D
Sbjct:    24 SLEAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYREGKHSFTMAMNAFGD 82

Query:   113 LSNEEFISTYLGYN--KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
             +++EEF     G+   KP     +    +   P SVDWR++G VTPVK+QGQCGSCWAFS
Sbjct:    83 MTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFS 142

Query:   171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
             A  A+EG    KTG+L+SLSEQ LVDC     N+GCNGG M+ AF+++   GG+ +E+ Y
Sbjct:   143 ATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESY 202

Query:   231 PYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             PY    + C+ +  K+     TG+  IP
Sbjct:   203 PYEATEESCKYNP-KYSVANDTGFVDIP 229

 Score = 182 (69.1 bits), Expect = 7.8e-56, Sum P(2) = 7.8e-56
 Identities = 40/88 (45%), Positives = 52/88 (59%)

Query:   256 AIPARY-AFQLYSHGV-FDEYCGHQ-LNHGVTVVGYG----EDHGEKYWLVKNSWGTSWG 308
             AI A + +F  Y  G+ F+  C  + ++HGV VVGYG    E    KYWLVKNSWG  WG
Sbjct:   248 AIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWG 307

Query:   309 EAGYIRMARNSPSSNIGICGILMQASYP 336
               GY++MA++  +     CGI   ASYP
Sbjct:   308 MGGYVKMAKDRRNH----CGIASAASYP 331


>UNIPROTKB|O60911
            symbol:CTSL2 "Cathepsin L2" species:9606 "Homo sapiens"
            [GO:0004177 "aminopeptidase activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA] [GO:0005902
            "microvillus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0007283 "spermatogenesis" evidence=IEA] [GO:0009267 "cellular
            response to starvation" evidence=IEA] [GO:0009749 "response to
            glucose stimulus" evidence=IEA] [GO:0009897 "external side of
            plasma membrane" evidence=IEA] [GO:0010259 "multicellular
            organismal aging" evidence=IEA] [GO:0021675 "nerve development"
            evidence=IEA] [GO:0030984 "kininogen binding" evidence=IEA]
            [GO:0032403 "protein complex binding" evidence=IEA] [GO:0034698
            "response to gonadotropin stimulus" evidence=IEA] [GO:0042277
            "peptide binding" evidence=IEA] [GO:0043005 "neuron projection"
            evidence=IEA] [GO:0043204 "perikaryon" evidence=IEA] [GO:0046697
            "decidualization" evidence=IEA] [GO:0048102 "autophagic cell death"
            evidence=IEA] [GO:0051384 "response to glucocorticoid stimulus"
            evidence=IEA] [GO:0060008 "Sertoli cell differentiation"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=TAS] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_118779 Reactome:REACT_6900
            GO:GO:0009897 GO:GO:0019886 GO:GO:0034698 GO:GO:0043204
            GO:GO:0009749 GO:GO:0030141 GO:GO:0051384 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0045177 GO:GO:0043005 GO:GO:0007283
            GO:GO:0004177 GO:GO:0042277 GO:GO:0009267 GO:GO:0021675
            GO:GO:0043202 GO:GO:0005902 GO:GO:0010259 GO:GO:0004197
            GO:GO:0048102 GO:GO:0046697 HOVERGEN:HBG011513 CTD:1515
            OrthoDB:EOG48PMKF OMA:FDQNLDT GO:GO:0060008 EMBL:Y14734
            EMBL:AB001928 EMBL:AF070448 EMBL:AB019534 EMBL:AY358641
            EMBL:AL445670 EMBL:BC023504 EMBL:BC110512 IPI:IPI00000013
            RefSeq:NP_001188504.1 RefSeq:NP_001324.2 UniGene:Hs.610096 PDB:1FH0
            PDB:3H6S PDB:3KFQ PDBsum:1FH0 PDBsum:3H6S PDBsum:3KFQ
            ProteinModelPortal:O60911 SMR:O60911 IntAct:O60911 STRING:O60911
            MEROPS:I29.010 PhosphoSite:O60911 PaxDb:O60911 PeptideAtlas:O60911
            PRIDE:O60911 Ensembl:ENST00000259470 Ensembl:ENST00000538255
            GeneID:1515 KEGG:hsa:1515 UCSC:uc004awt.3 GeneCards:GC09M099794
            HGNC:HGNC:2538 HPA:CAB017112 MIM:603308 neXtProt:NX_O60911
            PharmGKB:PA27036 InParanoid:O60911 KO:K01375 PhylomeDB:O60911
            BRENDA:3.4.22.43 SABIO-RK:O60911 BindingDB:O60911 ChEMBL:CHEMBL3272
            ChiTaRS:CTSL2 EvolutionaryTrace:O60911 GenomeRNAi:1515 NextBio:6277
            Bgee:O60911 CleanEx:HS_CTSL2 Genevestigator:O60911
            GermOnline:ENSG00000136943 Uniprot:O60911
        Length = 334

 Score = 411 (149.7 bits), Expect = 9.9e-56, Sum P(2) = 9.9e-56
 Identities = 88/213 (41%), Positives = 132/213 (61%)

Query:    52 KYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN---SQNL-SFKLTD 107
             K+D Q+++ ++  W   + R YG+ +E  RR  ++  N++ I+  N   SQ    F +  
Sbjct:    20 KFD-QNLDTKWYQWKATHRRLYGANEEGWRR-AVWEKNMKMIELHNGEYSQGKHGFTMAM 77

Query:   108 NKFADLSNEEFISTYLGY--NKPYNEPR-WPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
             N F D++NEEF    +G   N+ + + + +    +L LP SVDWRK+G VTPVK+Q QCG
Sbjct:    78 NAFGDMTNEEF-RQMMGCFRNQKFRKGKVFREPLFLDLPKSVDWRKKGYVTPVKNQKQCG 136

Query:   165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
             SCWAFSA  A+EG    KTGKLVSLSEQ LVDC     NQGCNGG+M +AF+++ + GG+
Sbjct:   137 SCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGL 196

Query:   225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI 257
              +E+ YPY   ++ C+  + ++     TG+  +
Sbjct:   197 DSEESYPYVAVDEICKY-RPENSVANDTGFTVV 228

 Score = 181 (68.8 bits), Expect = 9.9e-56, Sum P(2) = 9.9e-56
 Identities = 37/81 (45%), Positives = 49/81 (60%)

Query:   262 AFQLYSHGV-FDEYCGHQ-LNHGVTVVGYG----EDHGEKYWLVKNSWGTSWGEAGYIRM 315
             +FQ Y  G+ F+  C  + L+HGV VVGYG      +  KYWLVKNSWG  WG  GY+++
Sbjct:   256 SFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKI 315

Query:   316 ARNSPSSNIGICGILMQASYP 336
             A++  +     CGI   ASYP
Sbjct:   316 AKDKNNH----CGIATAASYP 332


>RGD|1308751
            symbol:RGD1308751 "similar to Cathepsin L precursor (Major
            excreted protein) (MEP)" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1308751 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:CH474032 IPI:IPI00365697 RefSeq:XP_001065885.2
            RefSeq:XP_225137.5 MEROPS:C01.069 Ensembl:ENSRNOT00000061391
            GeneID:290981 KEGG:rno:290981 UCSC:RGD:1308751 CTD:290981
            OMA:ESYAYEA OrthoDB:EOG42823G NextBio:631921 Uniprot:D3ZKC3
        Length = 330

 Score = 406 (148.0 bits), Expect = 1.6e-55, Sum P(2) = 1.6e-55
 Identities = 87/215 (40%), Positives = 131/215 (60%)

Query:    53 YDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI-----DYINSQNLSFKLTD 107
             +DP S +  +E W  ++ + Y + +E Q+R  ++ +N++ I     DY+  ++  F L  
Sbjct:    21 HDP-SFDTVWEEWKTKHGKTYNTNEEGQKR-AVWENNMKMINLHNEDYLKGKH-GFSLEM 77

Query:   108 NKFADLSNEEFISTYLGYNKPYNEPRWPSV---QYLG-LPASVDWRKEGAVTPVKDQGQC 163
             N F DL+N EF     G+      P+  ++    +LG +P S+DWR+ G VTPVK+QGQC
Sbjct:    78 NAFGDLTNTEFRELMTGFQSM--GPKETTIFREPFLGDIPKSLDWREHGYVTPVKNQGQC 135

Query:   164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
             GSCWAFSAV ++EG    KTGKLVSLSEQ LVDC  +  N GCNGG ME AF+++ +  G
Sbjct:   136 GSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDCSWSYGNLGCNGGLMEFAFQYVKENRG 195

Query:   224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             + T + Y Y  ++  C+ +  K+ A  +TG+  +P
Sbjct:   196 LDTGESYAYEAQDGLCRYNP-KYSAANVTGFVKVP 229

 Score = 184 (69.8 bits), Expect = 1.6e-55, Sum P(2) = 1.6e-55
 Identities = 39/78 (50%), Positives = 52/78 (66%)

Query:   262 AFQLYSHGVFDEY-CGH-QLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARN 318
             +F+ YS G++ E  C   +++H V VVGYGE+  G KYWLVKNSWG  WG  GYI+MA++
Sbjct:   255 SFRFYSGGMYYEPDCSSTEMDHAVLVVGYGEESDGGKYWLVKNSWGEDWGMDGYIKMAKD 314

Query:   319 SPSSNIGICGILMQASYP 336
               ++N   CGI   A YP
Sbjct:   315 Q-NNN---CGIATYAIYP 328


>MGI|MGI:107341
            symbol:Ctss "cathepsin S" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=ISO;IDA] [GO:0006508 "proteolysis"
            evidence=IDA] [GO:0008152 "metabolic process" evidence=ISO]
            [GO:0008233 "peptidase activity" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0009986 "cell
            surface" evidence=ISO] [GO:0016020 "membrane" evidence=IDA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0045453 "bone
            resorption" evidence=ISO] [GO:0051930 "regulation of sensory
            perception of pain" evidence=ISO] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:107341 GO:GO:0016020 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0008233 GO:GO:0031905 Reactome:REACT_102124
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 BRENDA:3.4.22.27
            ChiTaRS:CTSS EMBL:AF051732 EMBL:AF051727 EMBL:AF051728
            EMBL:AF051729 EMBL:AF051726 EMBL:AF051730 EMBL:AF051731
            EMBL:AF038546 EMBL:AJ002386 EMBL:AC092203 EMBL:Y18466 EMBL:AJ223208
            IPI:IPI00309520 UniGene:Mm.3619 PDB:1M0H PDBsum:1M0H
            ProteinModelPortal:O70370 SMR:O70370 STRING:O70370
            PhosphoSite:O70370 PaxDb:O70370 PRIDE:O70370
            Ensembl:ENSMUST00000116304 BindingDB:O70370 ChEMBL:CHEMBL4098
            NextBio:282932 Bgee:O70370 CleanEx:MM_CTSS Genevestigator:O70370
            GermOnline:ENSMUSG00000038642 Uniprot:O70370
        Length = 340

 Score = 387 (141.3 bits), Expect = 1.6e-55, Sum P(2) = 1.6e-55
 Identities = 88/234 (37%), Positives = 136/234 (58%)

Query:    36 WVLGIPAGAWSEGYPQ-KYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYID 94
             W+  +P    S    Q + DP +++  ++ W K + +EY  ++E + R  I+  N+++I 
Sbjct:    11 WLFWMPL-VCSVAMEQLQRDP-TLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIM 68

Query:    95 YIN---SQNL-SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQYLGLPASVD 147
               N   S  + ++++  N   D++NEE +        P   P+   + S     LP +VD
Sbjct:    69 IHNLEYSMGMHTYQVGMNDMGDMTNEEILCRMGALRIPRQSPKTVTFRSYSNRTLPDTVD 128

Query:   148 WRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE---NQ 204
             WR++G VT VK QG CG+CWAFSAV A+EG  KLKTGKL+SLS Q LVDC  N E   N+
Sbjct:   129 WREKGCVTEVKYQGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCS-NEEKYGNK 187

Query:   205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             GC GGYM +AF++I   GG+  +  YPY+  +++C  + +K+ A T + Y  +P
Sbjct:   188 GCGGGYMTEAFQYIIDNGGIEADASYPYKATDEKCHYN-SKNRAATCSRYIQLP 240

 Score = 203 (76.5 bits), Expect = 1.6e-55, Sum P(2) = 1.6e-55
 Identities = 40/76 (52%), Positives = 50/76 (65%)

Query:   262 AFQLYSHGVFDE-YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
             +F  Y  GV+D+  C   +NHGV VVGYG   G+ YWLVKNSWG ++G+ GYIRMARN+ 
Sbjct:   267 SFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNK 326

Query:   321 SSNIGICGILMQASYP 336
             +     CGI    SYP
Sbjct:   327 NH----CGIASYCSYP 338


>ZFIN|ZDB-GENE-030131-572
            symbol:wu:fb37b09 "wu:fb37b09" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030131-572 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:BX465840 IPI:IPI00866294 RefSeq:XP_001923796.1
            UniGene:Dr.25683 PRIDE:E9QBE2 Ensembl:ENSDART00000133962
            GeneID:321853 KEGG:dre:321853 NextBio:20807556 Uniprot:E9QBE2
        Length = 335

 Score = 409 (149.0 bits), Expect = 2.6e-55, Sum P(2) = 2.6e-55
 Identities = 84/210 (40%), Positives = 126/210 (60%)

Query:    58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADL 113
             +++ + +W  Q+ + Y  + E  RR  I+  N++ I+  N +    N +FK+  N+F D+
Sbjct:    24 LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSLGNHTFKMGMNQFGDM 82

Query:   114 SNEEFISTYLGY----NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAF 169
             +NEEF     GY    N+    P +   ++   P  VDWR+ G VTPVKDQ QCGSCW+F
Sbjct:    83 TNEEFRQAMNGYKHDPNRTSQGPLFMEPKFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSF 142

Query:   170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
             S+  A+EG    KTGKL+S+SEQ LVDC     NQGCNGG M++AF+++ +  G+ +E  
Sbjct:   143 SSTGALEGQLFRKTGKLISMSEQNLVDCSRPHGNQGCNGGLMDQAFQYVKENKGLDSEQS 202

Query:   230 YPYRGKNDR-CQTDKTKHHAVTITGYEAIP 258
             YPY  ++D  C+ D  + +   ITG+  IP
Sbjct:   203 YPYLARDDLPCRYDP-RFNVAKITGFVDIP 231

 Score = 179 (68.1 bits), Expect = 2.6e-55, Sum P(2) = 2.6e-55
 Identities = 39/88 (44%), Positives = 52/88 (59%)

Query:   256 AIPARY-AFQLYSHGVFDEY-CGHQLNHGVTVVGYG----EDHGEKYWLVKNSWGTSWGE 309
             AI A + + Q Y  G++ E  C  QL+H V VVGYG    +  G +YW+VKNSW   WG+
Sbjct:   251 AIDASHQSLQFYQSGIYYERACTSQLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGD 310

Query:   310 AGYIRMARNSPSSNIGICGILMQASYPV 337
              GYI MA++  +     CGI   ASYP+
Sbjct:   311 KGYIYMAKDKNNH----CGIATMASYPL 334


>ZFIN|ZDB-GENE-050626-55
            symbol:ctssb.2 "cathepsin S, b.2" species:7955
            "Danio rerio" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-050626-55
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            KO:K01368 EMBL:BC093339 IPI:IPI00507098 RefSeq:NP_001017661.1
            UniGene:Dr.132688 ProteinModelPortal:Q566T8 SMR:Q566T8
            GeneID:337572 KEGG:dre:337572 CTD:337572 InParanoid:Q566T8
            NextBio:20812306 ArrayExpress:Q566T8 Uniprot:Q566T8
        Length = 330

 Score = 379 (138.5 bits), Expect = 4.2e-55, Sum P(2) = 4.2e-55
 Identities = 81/211 (38%), Positives = 121/211 (57%)

Query:    56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN---SQNL-SFKLTDNKFA 111
             +++++ +E W K++ + Y  EDE   R  ++  N++ I   N   S  + S+ L  N  A
Sbjct:    21 KNLDQHWELWKKKHVKLYSCEDEEVGRRELWERNLELIAIHNLEASMGMHSYDLAINHMA 80

Query:   112 DLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSC 166
             D++ EE + T      P    R P+ +Y+      +P ++DWR +G VT VK+QG CGSC
Sbjct:    81 DMTTEEILQTLAVTRVPPGFKR-PTAEYVSSSFAVVPDTLDWRDKGYVTSVKNQGACGSC 139

Query:   167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
             WAFS+V A+EG     TGKLV LS Q LVDC     N GCNGGYM +AF+++   GG+ +
Sbjct:   140 WAFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGYMSQAFQYVIDNGGIDS 199

Query:   227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAI 257
             E  YPY+G    C+ D ++  A   T Y+ +
Sbjct:   200 ESSYPYQGTQGSCRYDPSQR-AANCTSYKFV 229

 Score = 207 (77.9 bits), Expect = 4.2e-55, Sum P(2) = 4.2e-55
 Identities = 43/84 (51%), Positives = 55/84 (65%)

Query:   256 AIPA-RYAFQLYSHGVFDE-YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYI 313
             AI A R  F  Y  GV+D+  C  ++NHGV  VGYG   G+ YWLVKNSWG  +G+ GYI
Sbjct:   250 AIDATRPQFIFYRSGVYDDPSCTQKVNHGVLAVGYGTLSGQDYWLVKNSWGAGFGDGGYI 309

Query:   314 RMARNSPSSNIGICGILMQASYPV 337
             R+ARN   +N+  CGI  +A YP+
Sbjct:   310 RIARNK--NNM--CGIASEACYPI 329


>UNIPROTKB|A4IFS7
            symbol:CTSL1 "CTSL1 protein" species:9913 "Bos taurus"
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197 GO:GO:0002250
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 GO:GO:0097067
            OrthoDB:EOG48PMKF MEROPS:C01.032 CTD:1514 EMBL:DAAA02023987
            EMBL:BC134741 IPI:IPI00708619 RefSeq:NP_001077155.1
            UniGene:Bt.23199 SMR:A4IFS7 Ensembl:ENSBTAT00000000962
            GeneID:515200 KEGG:bta:515200 InParanoid:A4IFS7 OMA:NDEQALM
            NextBio:20871707 Uniprot:A4IFS7
        Length = 333

 Score = 409 (149.0 bits), Expect = 1.1e-54, Sum P(2) = 1.1e-54
 Identities = 84/214 (39%), Positives = 131/214 (61%)

Query:    52 KYDPQSMEERFENWLKQYSREYG-SEDEWQRRFGIYSSNVQYIDYIN---SQNL-SFKLT 106
             K+D  S++ +++ W   + + Y  +E+ W++   ++  N++ I+  N   SQ   SF + 
Sbjct:    20 KFD-HSLDTQWKLWKAAHRKPYDLNEEGWRK--AVWKKNMKMIELHNQEYSQGKHSFSMA 76

Query:   107 DNKFADLSNEEFISTYLGYNKPYNEP--RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
              N F D++NEEF  T  G+ +  N+    +    +  +P SVDWR++G VTPVK+QG+CG
Sbjct:    77 MNAFGDMTNEEFRHTMNGFQRQKNKKGKEFHETIFASIPPSVDWREKGYVTPVKNQGKCG 136

Query:   165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
             SCWAFSA  A+EG    KTGKLVSLSEQ LVDC     N+GC+GG+++ AF+++  +GG+
Sbjct:   137 SCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQPEGNRGCHGGFIDNAFQYVLDVGGL 196

Query:   225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
              +E+ YPY G    C  +   + A   TG+  +P
Sbjct:   197 DSEESYPYTGLVGTCLYNPN-NSAANETGFVDLP 229

 Score = 173 (66.0 bits), Expect = 1.1e-54, Sum P(2) = 1.1e-54
 Identities = 37/81 (45%), Positives = 48/81 (59%)

Query:   262 AFQLYSHGVFDE-YCGHQ-LNHGVTVVGYG----EDHGEKYWLVKNSWGTSWGEAGYIRM 315
             +FQ Y  G++ E  C  + ++H V VVGYG    +    KYWLVKNSWG  WG  GYI+M
Sbjct:   255 SFQFYKSGIYYEPNCSSESVDHAVLVVGYGFEGADSDDNKYWLVKNSWGEHWGMNGYIKM 314

Query:   316 ARNSPSSNIGICGILMQASYP 336
             A++  +     CGI   ASYP
Sbjct:   315 AKDRNNH----CGIATMASYP 331


>ZFIN|ZDB-GENE-041010-76
            symbol:ctsll "cathepsin L, like" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-041010-76
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            OrthoDB:EOG48PMKF MEROPS:C01.032 EMBL:BX119902 IPI:IPI00616622
            UniGene:Dr.79994 SMR:A2BEM8 Ensembl:ENSDART00000144226
            InParanoid:A2BEM8 OMA:PRYSAAN Uniprot:A2BEM8
        Length = 337

 Score = 400 (145.9 bits), Expect = 2.3e-54, Sum P(2) = 2.3e-54
 Identities = 80/213 (37%), Positives = 128/213 (60%)

Query:    56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNKFA 111
             Q +++ +  W + + + Y  ++E  RR  ++  N++ I+  N ++     +F+L  N+F 
Sbjct:    23 QKLDDHWHLWKRWHEKSYHEKEEGWRRM-VWEKNLKKIELHNLEHSVGKHTFRLGMNQFG 81

Query:   112 DLSNEEFISTYLGYNKPYNEPRWPSV----QYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
             D++NEEF     GYN+  N     S+     +   P  +DWR++G VTP+KDQ +CGSCW
Sbjct:    82 DMTNEEFRQAMNGYNRDPNRKSKGSLFIEPSFFTAPQQIDWRQKGYVTPIKDQKRCGSCW 141

Query:   168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
             AFS+  A+EG    KTGKLVSLSEQ L+DC     N GC+GG M++AF+++    G+ +E
Sbjct:   142 AFSSTGALEGQVFRKTGKLVSLSEQNLMDCSRPQGNNGCDGGLMDQAFQYVQDNNGLDSE 201

Query:   228 DDYPYRGKNDR-CQTDKTKHHAVTITGYEAIPA 259
             + YPY   +D+ C  D  ++ A  +TG+  IP+
Sbjct:   202 ESYPYLATDDQPCHYDP-RYSAANVTGFVDIPS 233

 Score = 179 (68.1 bits), Expect = 2.3e-54, Sum P(2) = 2.3e-54
 Identities = 39/89 (43%), Positives = 55/89 (61%)

Query:   256 AIPARY-AFQLYSHGVF-DEYCG-HQLNHGVTVVGYGEDH----GEKYWLVKNSWGTSWG 308
             AI A + +FQ Y  G++ ++ C   +L+HGV VVGYG +     G +YW+VKNSW   WG
Sbjct:   252 AIDAGHESFQFYQSGIYYEKACSTEELDHGVLVVGYGYEGVDVAGRRYWIVKNSWTDRWG 311

Query:   309 EAGYIRMARNSPSSNIGICGILMQASYPV 337
             + GYI MA++  +     CGI   ASYP+
Sbjct:   312 DKGYIYMAKDLKNH----CGIATSASYPL 336


>DICTYBASE|DDB_G0278721
            symbol:cprD "cysteine proteinase 4" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0278721 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000024 EMBL:L36204 RefSeq:XP_641963.1
            ProteinModelPortal:P54639 SMR:P54639 MEROPS:C01.A57 PRIDE:P54639
            EnsemblProtists:DDB0214999 GeneID:8621695 KEGG:ddi:DDB_G0278721
            OMA:NAFADIT ProtClustDB:CLSZ2846820 Uniprot:P54639
        Length = 442

 Score = 391 (142.7 bits), Expect = 2.6e-54, Sum P(3) = 2.6e-54
 Identities = 83/215 (38%), Positives = 122/215 (56%)

Query:    51 QKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKF 110
             Q++        F NW++ + R Y SE E+  R+ I+ SN+ Y+   NS+     L  N F
Sbjct:    19 QQFSELQYRNAFTNWMQAHQRTYSSE-EFNARYQIFKSNMDYVHQWNSKGGETVLGLNVF 77

Query:   111 ADLSNEEFISTYLGYNKPYNEPRWPSVQ---YLGLPA-SVDWRKEGAVTPVKDQGQCGSC 166
             AD++N+E+ +TYLG   P++       +       PA +VDWR +GAVTP+K+QGQCG C
Sbjct:    78 ADITNQEYRTTYLG--TPFDGSALIGTEEEKIFSTPAPTVDWRAQGAVTPIKNQGQCGGC 135

Query:   167 WAFSAVAAVEGINKLKTGK---LVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
             W+FS   + EG + + +G    LVSLSEQ L+DC  +  N GC GG M  AFE+I    G
Sbjct:   136 WSFSTTGSTEGAHFIASGTKKDLVSLSEQNLIDCSKSYGNNGCEGGLMTLAFEYIINNKG 195

Query:   224 VTTEDDYPYRGKNDR-CQTDKTKHHAVTITGYEAI 257
             + TE  YPY  ++ + C+  KT +    I  Y+ +
Sbjct:   196 IDTESSYPYTAEDGKECKF-KTSNIGAQIVSYQNV 229

 Score = 122 (48.0 bits), Expect = 2.6e-54, Sum P(3) = 2.6e-54
 Identities = 25/50 (50%), Positives = 32/50 (64%)

Query:   287 GYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
             G  E     YW+VKNSWGTSWG  GYI M+++  ++N   CGI   AS+P
Sbjct:   392 GAVEASSGNYWIVKNSWGTSWGMDGYIFMSKDR-NNN---CGIATMASFP 437

 Score = 77 (32.2 bits), Expect = 2.6e-54, Sum P(3) = 2.6e-54
 Identities = 17/30 (56%), Positives = 21/30 (70%)

Query:   262 AFQLYSHGVFDE-YCGH-QLNHGVTVVGYG 289
             +FQLY  G++ E  C   QL+HGV VVGYG
Sbjct:   256 SFQLYESGIYYEPACSPTQLDHGVLVVGYG 285


>ZFIN|ZDB-GENE-080215-7
            symbol:zgc:174153 "zgc:174153" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-080215-7
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 EMBL:BX000534 EMBL:BX322603
            IPI:IPI00483644 Ensembl:ENSDART00000113654 OMA:ITLCISA Bgee:F1R8Y0
            Uniprot:F1R8Y0
        Length = 336

 Score = 410 (149.4 bits), Expect = 2.9e-54, Sum P(2) = 2.9e-54
 Identities = 84/211 (39%), Positives = 126/211 (59%)

Query:    58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADL 113
             +++ + +W  Q+ + Y  + E  RR  I+  N++ I+  N +    N +FK+  N+F D+
Sbjct:    24 LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSYGNHTFKMGMNQFGDM 82

Query:   114 SNEEFISTYLGYNKPYNE----PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAF 169
             +NEEF     GY    N+    P +    +   P  VDWR+ G VTPVKDQ QCGSCW+F
Sbjct:    83 TNEEFRQAMNGYKHDPNQTSQGPLFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSF 142

Query:   170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
             S+  A+EG    KTGKL+S+SEQ LVDC     NQGCNGG M++AF+++ +  G+ +E  
Sbjct:   143 SSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGLMDQAFQYVKENKGLDSEQS 202

Query:   230 YPYRGKNDR-CQTDKTKHHAVTITGYEAIPA 259
             YPY  ++D  C+ D  + +   ITG+  IP+
Sbjct:   203 YPYLARDDLPCRYDP-RFNVAKITGFVDIPS 232

 Score = 168 (64.2 bits), Expect = 2.9e-54, Sum P(2) = 2.9e-54
 Identities = 37/89 (41%), Positives = 53/89 (59%)

Query:   256 AIPARY-AFQLYSHGVFDEY-CGH-QLNHGVTVVGYG----EDHGEKYWLVKNSWGTSWG 308
             AI A + + Q Y  G++ E  C   +L+H V VVGYG    +  G +YW+VKNSW   WG
Sbjct:   251 AIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWG 310

Query:   309 EAGYIRMARNSPSSNIGICGILMQASYPV 337
             + GYI MA++  +     CG+  +ASYP+
Sbjct:   311 DKGYIYMAKDKNNH----CGVATKASYPL 335


>ZFIN|ZDB-GENE-980526-285
            symbol:ctsl1b "cathepsin L, 1 b" species:7955
            "Danio rerio" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005576
            "extracellular region" evidence=IDA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-980526-285 GO:GO:0005576 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:BX465840 IPI:IPI00498443 Ensembl:ENSDART00000145570
            Bgee:F1R7B3 Uniprot:F1R7B3
        Length = 352

 Score = 410 (149.4 bits), Expect = 2.9e-54, Sum P(2) = 2.9e-54
 Identities = 84/211 (39%), Positives = 126/211 (59%)

Query:    58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADL 113
             +++ + +W  Q+ + Y  + E  RR  I+  N++ I+  N +    N +FK+  N+F D+
Sbjct:    40 LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSYGNHTFKMGMNQFGDM 98

Query:   114 SNEEFISTYLGYNKPYNE----PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAF 169
             +NEEF     GY    N+    P +    +   P  VDWR+ G VTPVKDQ QCGSCW+F
Sbjct:    99 TNEEFRQAMNGYTHDPNQTSQGPLFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSF 158

Query:   170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
             S+  A+EG    KTGKL+S+SEQ LVDC     NQGCNGG M++AF+++ +  G+ +E  
Sbjct:   159 SSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGLMDQAFQYVKENKGLDSEQS 218

Query:   230 YPYRGKNDR-CQTDKTKHHAVTITGYEAIPA 259
             YPY  ++D  C+ D  + +   ITG+  IP+
Sbjct:   219 YPYLARDDLPCRYDP-RFNVAKITGFVDIPS 248

 Score = 168 (64.2 bits), Expect = 2.9e-54, Sum P(2) = 2.9e-54
 Identities = 37/89 (41%), Positives = 53/89 (59%)

Query:   256 AIPARY-AFQLYSHGVFDEY-CGH-QLNHGVTVVGYG----EDHGEKYWLVKNSWGTSWG 308
             AI A + + Q Y  G++ E  C   +L+H V VVGYG    +  G +YW+VKNSW   WG
Sbjct:   267 AIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWG 326

Query:   309 EAGYIRMARNSPSSNIGICGILMQASYPV 337
             + GYI MA++  +     CG+  +ASYP+
Sbjct:   327 DKGYIYMAKDKNNH----CGVATKASYPL 351


>ZFIN|ZDB-GENE-071004-74
            symbol:zgc:174855 "zgc:174855" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-071004-74
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 MEROPS:C01.032 EMBL:BX000534 EMBL:BC152282
            IPI:IPI00773140 RefSeq:NP_001096592.1 UniGene:Dr.104905 SMR:A7MCR6
            STRING:A7MCR6 Ensembl:ENSDART00000109968 GeneID:569326
            KEGG:dre:569326 NextBio:20889622 Uniprot:A7MCR6
        Length = 335

 Score = 402 (146.6 bits), Expect = 3.7e-54, Sum P(2) = 3.7e-54
 Identities = 83/210 (39%), Positives = 125/210 (59%)

Query:    58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADL 113
             +++ + +W  Q+ + Y  + E  RR  I+  N++ I+  N +    N +FK+  N+F D+
Sbjct:    24 LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSLGNHTFKMGMNQFGDM 82

Query:   114 SNEEFISTYLGYNKPYNEPRWPSV----QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAF 169
             +NEEF     GY +  N     ++     +   P  VDWR+ G VTPVKDQ QCGSCW+F
Sbjct:    83 TNEEFRQAMNGYKQDPNRTSKGALFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSF 142

Query:   170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
             S+  A+EG    KTGKL+S+SEQ LVDC     NQGCNGG M++AF+++ +  G+ +E  
Sbjct:   143 SSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQAFQYVKENKGLDSEQS 202

Query:   230 YPYRGKNDR-CQTDKTKHHAVTITGYEAIP 258
             YPY  ++D  C+ D  + +   ITG+  IP
Sbjct:   203 YPYLARDDLPCRYDP-RFNVAKITGFVDIP 231

 Score = 175 (66.7 bits), Expect = 3.7e-54, Sum P(2) = 3.7e-54
 Identities = 38/88 (43%), Positives = 52/88 (59%)

Query:   256 AIPARY-AFQLYSHGVFDEY-CGHQLNHGVTVVGYG----EDHGEKYWLVKNSWGTSWGE 309
             AI A + + Q Y  G++ E  C  +L+H V VVGYG    +  G +YW+VKNSW   WG+
Sbjct:   251 AIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGD 310

Query:   310 AGYIRMARNSPSSNIGICGILMQASYPV 337
              GYI MA++  +     CGI   ASYP+
Sbjct:   311 KGYIYMAKDKNNH----CGIATMASYPL 334


>DICTYBASE|DDB_G0279187
            symbol:cprG "cysteine proteinase 7" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0279187 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000030 ProtClustDB:CLSZ2846820 MEROPS:C01.081
            EMBL:U72746 RefSeq:XP_641720.2 ProteinModelPortal:Q94504 SMR:Q94504
            PRIDE:Q94504 EnsemblProtists:DDB0215005 GeneID:8621915
            KEGG:ddi:DDB_G0279187 OMA:INTETEK Uniprot:Q94504
        Length = 460

 Score = 396 (144.5 bits), Expect = 5.4e-54, Sum P(3) = 5.4e-54
 Identities = 84/199 (42%), Positives = 117/199 (58%)

Query:    62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
             F NW+  + R Y SE E+  R+ I+ +N+ Y++  N++     L  N FAD+SNEE+ +T
Sbjct:    30 FTNWMIAHQRHYSSE-EFNGRYNIFKANMDYVNEWNTKGSETVLGLNVFADISNEEYRAT 88

Query:   122 YLGYNKPYNEPRWP---SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGI 178
             YLG   P++        S +     A VDWR +GAVTP+K+QGQCG CW+FS   A EG 
Sbjct:    89 YLG--TPFDASSLEMTESDKIFDASAQVDWRTQGAVTPIKNQGQCGGCWSFSTTGATEGA 146

Query:   179 NKLKTGK--LVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN 236
               L  GK  LVSLSEQ L+DC  +  N GC GG M  AFE+I    G+ TE  YPY  ++
Sbjct:   147 QYLANGKKNLVSLSEQNLIDCSGSYGNNGCEGGLMTLAFEYIINNKGIDTESSYPYTAED 206

Query:   237 DR-CQTDKTKHHAVTITGY 254
              + C+ +  K+ A  ++ Y
Sbjct:   207 GKKCKFNP-KNVAAQLSSY 224

 Score = 115 (45.5 bits), Expect = 5.4e-54, Sum P(3) = 5.4e-54
 Identities = 22/41 (53%), Positives = 26/41 (63%)

Query:   296 YWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
             YW+VKNSWGTSWG  GYI M + + +     CGI   AS P
Sbjct:   418 YWIVKNSWGTSWGMDGYILMTKGNNNQ----CGIATMASRP 454

 Score = 76 (31.8 bits), Expect = 5.4e-54, Sum P(3) = 5.4e-54
 Identities = 19/41 (46%), Positives = 25/41 (60%)

Query:   256 AIPA-RYAFQLYSHGVFDE-YCGH-QLNHGVTVVGYGEDHG 293
             AI A   +FQLY  G+++E  C   QL+HGV  VG+G   G
Sbjct:   247 AIDASNQSFQLYVSGIYNEPACSSTQLDHGVLAVGFGTGSG 287


>DICTYBASE|DDB_G0281605
            symbol:cfaD "peptidase C1A family protein"
            species:44689 "Dictyostelium discoideum" [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IDA] [GO:0006508 "proteolysis" evidence=IDA] [GO:0031410
            "cytoplasmic vesicle" evidence=IDA] [GO:0031288 "sorocarp
            morphogenesis" evidence=IMP] [GO:0008285 "negative regulation of
            cell proliferation" evidence=IGI;IDA] [GO:0005576 "extracellular
            region" evidence=IEA;IDA] [GO:0005515 "protein binding"
            evidence=IPI] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0281605
            GO:GO:0008285 GO:GO:0005615 GenomeReviews:CM000152_GR
            eggNOG:COG4870 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0031410 EMBL:AAFI02000042
            GO:GO:0031288 RefSeq:XP_640530.1 HSSP:P07711
            ProteinModelPortal:Q54TR1 STRING:Q54TR1 PRIDE:Q54TR1
            EnsemblProtists:DDB0229857 GeneID:8623140 KEGG:ddi:DDB_G0281605
            InParanoid:Q54TR1 OMA:PSAHEHE ProtClustDB:CLSZ2430523
            Uniprot:Q54TR1
        Length = 531

 Score = 396 (144.5 bits), Expect = 1.3e-53, Sum P(2) = 1.3e-53
 Identities = 85/203 (41%), Positives = 125/203 (61%)

Query:    62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
             F+ +  QY++EY S+DE   RF  + +  + I   N++  S+KL  N +ADLSN+EF +T
Sbjct:   225 FKEYKAQYNKEYSSQDEHDERFINFKAARKIIATHNAKESSYKLGMNHYADLSNKEF-NT 283

Query:   122 YLGYNKPYNEPRWPSV---------QYL-GLPASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
              +   KP  +   PSV         + L  +P++VDWR +  VTPVKDQG CGSCW F +
Sbjct:   284 LV---KP--KVARPSVTGADSVHDDESLRSIPSTVDWRNQNCVTPVKDQGICGSCWTFGS 338

Query:   172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
               ++EG N +  G+LVSLSEQ+LVDC + + +QGC GG+   AF+++ +IG + TE +YP
Sbjct:   339 TGSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSLATESNYP 398

Query:   232 YRGKNDRCQTDKTKHHAVTITGY 254
             Y  +N  C+        V+ITGY
Sbjct:   399 YLMQNGLCRDRTVTPSGVSITGY 421

 Score = 176 (67.0 bits), Expect = 1.3e-53, Sum P(2) = 1.3e-53
 Identities = 40/104 (38%), Positives = 60/104 (57%)

Query:   241 TDKTKHHAVTITGYEAIPARYA---FQLYSHGVFDE-YCGH---QLNHGVTVVGYGEDHG 293
             ++    +A+  TG  AI    +   F+ Y  GV++   C +    L+H V  +GYG   G
Sbjct:   428 SESALQNAIATTGPVAIAIDASVDDFRYYMSGVYNNPACKNGLDDLDHEVLAIGYGTYQG 487

Query:   294 EKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
             + Y+LVKNSW T+WG  GY+ MARN   +N+  CG+  QA+YP+
Sbjct:   488 QDYFLVKNSWSTNWGMDGYVYMARND--NNL--CGVSSQATYPI 527


>WB|WBGene00000776
            symbol:cpl-1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0040010 "positive regulation
            of growth rate" evidence=IMP] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] [GO:0040011
            "locomotion" evidence=IMP] [GO:0070265 "necrotic cell death"
            evidence=IMP] [GO:0031983 "vesicle lumen" evidence=IDA] [GO:0042718
            "yolk granule" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0009792 GO:GO:0040010 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0040011
            GO:GO:0031983 GO:GO:0070265 GeneTree:ENSGT00660000095458 KO:K01365
            GO:GO:0042718 MEROPS:I29.009 EMBL:Z92812 GeneID:180111
            KEGG:cel:CELE_T03E6.7 CTD:180111 PIR:T24387 RefSeq:NP_001256718.1
            HSSP:P80067 ProteinModelPortal:O45734 SMR:O45734 DIP:DIP-26616N
            IntAct:O45734 MINT:MINT-211563 STRING:O45734 PaxDb:O45734
            EnsemblMetazoa:T03E6.7.1 EnsemblMetazoa:T03E6.7.2 UCSC:T03E6.7.1
            WormBase:T03E6.7a InParanoid:O45734 OMA:HIENHNR NextBio:908128
            Uniprot:O45734
        Length = 337

 Score = 349 (127.9 bits), Expect = 1.3e-53, Sum P(2) = 1.3e-53
 Identities = 82/213 (38%), Positives = 119/213 (55%)

Query:    56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNKFA 111
             +S  E+++++ + + +EY SE E Q     +  N+ +I+  N  +     +F++  N  A
Sbjct:    26 ESAIEKWDDYKEDFDKEY-SESEEQTYMEAFVKNMIHIENHNRDHRLGRKTFEMGLNHIA 84

Query:   112 DLSNEEFISTYLGYNKPYNEPRWP-SVQYLG-----LPASVDWRKEGAVTPVKDQGQCGS 165
             DL   ++     GY + + + R   S  +L      +P  VDWR    VT VK+QG CGS
Sbjct:    85 DLPFSQYRKLN-GYRRLFGDSRIKNSSSFLAPFNVQVPDEVDWRDTHLVTDVKNQGMCGS 143

Query:   166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
             CWAFSA  A+EG +  K G+LVSLSEQ LVDC     N GCNGG M++AFE+I    GV 
Sbjct:   144 CWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFEYIRDNHGVD 203

Query:   226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             TE+ YPY+G++ +C  +K K       GY   P
Sbjct:   204 TEESYPYKGRDMKCHFNK-KTVGADDKGYVDTP 235

 Score = 223 (83.6 bits), Expect = 1.3e-53, Sum P(2) = 1.3e-53
 Identities = 47/87 (54%), Positives = 60/87 (68%)

Query:   256 AIPARY-AFQLYSHGVF-DEYCG-HQLNHGVTVVGYGED--HGEKYWLVKNSWGTSWGEA 310
             AI A + +FQLY  GV+ DE C   +L+HGV +VGYG D  HG+ YW+VKNSWG  WGE 
Sbjct:   255 AIDAGHRSFQLYKKGVYYDEECSSEELDHGVLLVGYGTDPEHGD-YWIVKNSWGAGWGEK 313

Query:   311 GYIRMARNSPSSNIGICGILMQASYPV 337
             GYIR+ARN  +     CG+  +ASYP+
Sbjct:   314 GYIRIARNRNNH----CGVATKASYPL 336


>UNIPROTKB|Q3T0I2
            symbol:CTSH "Pro-cathepsin H" species:9913 "Bos taurus"
            [GO:0031638 "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISS] [GO:0010815 "bradykinin
            catabolic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0010813 "neuropeptide catabolic
            process" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0045766 "positive regulation of angiogenesis" evidence=ISS]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=ISS] [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0033619 "membrane protein proteolysis" evidence=ISS]
            [GO:0043066 "negative regulation of apoptotic process"
            evidence=ISS] [GO:0004252 "serine-type endopeptidase activity"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0016505 "apoptotic protease activator activity"
            evidence=ISS] [GO:0010952 "positive regulation of peptidase
            activity" evidence=ISS] [GO:0030108 "HLA-A specific activating MHC
            class I receptor activity" evidence=ISS] [GO:0002764 "immune
            response-regulating signaling pathway" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0070324 "thyroid
            hormone binding" evidence=ISS] [GO:0006508 "proteolysis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0097208
            "alveolar lamellar body" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004175
            "endopeptidase activity" evidence=ISS] [GO:0032526 "response to
            retinoic acid" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 EMBL:BC102386 IPI:IPI00693034
            RefSeq:NP_001029557.1 UniGene:Bt.52393 ProteinModelPortal:Q3T0I2
            SMR:Q3T0I2 STRING:Q3T0I2 MEROPS:C01.040 PRIDE:Q3T0I2
            Ensembl:ENSBTAT00000014593 GeneID:510524 KEGG:bta:510524 CTD:1512
            InParanoid:Q3T0I2 OMA:STSCHKT OrthoDB:EOG4W9J43 NextBio:20869490
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 Uniprot:Q3T0I2
        Length = 335

 Score = 404 (147.3 bits), Expect = 2.6e-53, Sum P(2) = 2.6e-53
 Identities = 90/223 (40%), Positives = 133/223 (59%)

Query:    35 LWVLGIP---AGAWSEGYPQ----KYDPQSMEE-RFENWLKQYSREYGSEDEWQRRFGIY 86
             +W + +P   AGAW  G P     +    S+E+  F++W+ Q+ ++Y SE E+  R   +
Sbjct:     1 MWAV-LPLLCAGAWLLGAPACGAAELAANSLEKFHFQSWMVQHQKKYSSE-EYYHRLQAF 58

Query:    87 SSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGL---P 143
             +SN++ I+  N++N +FK+  N+F+D+S +E    YL +++P N     S    G    P
Sbjct:    59 ASNLREINAHNARNHTFKMGLNQFSDMSFDELKRKYL-WSEPQNCSATKSNYLRGTGPYP 117

Query:   144 ASVDWRKEGA-VTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
              S+DWRK+G  VTPVK+QG CGSCW FS   A+E    + TGKL  L+EQ+LVDC  N  
Sbjct:   118 PSMDWRKKGNFVTPVKNQGSCGSCWTFSTTGALESAVAIATGKLPFLAEQQLVDCAQNFN 177

Query:   203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTK 245
             N GC GG   +AFE+I    G+  ED YPYRG++  C+   +K
Sbjct:   178 NHGCQGGLPSQAFEYIRYNKGIMGEDTYPYRGQDGDCKYQPSK 220

 Score = 165 (63.1 bits), Expect = 2.6e-53, Sum P(2) = 2.6e-53
 Identities = 31/79 (39%), Positives = 44/79 (55%)

Query:   263 FQLYSHGVFDEYCGHQ----LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
             F +Y  G++     H+    +NH V  VGYGE+ G  YW+VKNSWG +WG  GY  + R 
Sbjct:   259 FMMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPNWGMKGYFLIERG 318

Query:   319 SPSSNIGICGILMQASYPV 337
                    +CG+   AS+P+
Sbjct:   319 K-----NMCGLAACASFPI 332


>RGD|2447
            symbol:Ctsh "cathepsin H" species:10116 "Rattus norvegicus"
          [GO:0001520 "outer dense fiber" evidence=IDA] [GO:0001656
          "metanephros development" evidence=IEP] [GO:0001669 "acrosomal
          vesicle" evidence=IDA] [GO:0001913 "T cell mediated cytotoxicity"
          evidence=ISO;ISS] [GO:0002250 "adaptive immune response"
          evidence=ISO] [GO:0002764 "immune response-regulating signaling
          pathway" evidence=ISO;ISS] [GO:0004175 "endopeptidase activity"
          evidence=ISO] [GO:0004177 "aminopeptidase activity" evidence=ISO;IDA]
          [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISO;ISS]
          [GO:0004252 "serine-type endopeptidase activity" evidence=ISO;ISS]
          [GO:0005615 "extracellular space" evidence=ISO;ISS;IDA] [GO:0005764
          "lysosome" evidence=ISO;ISS;IDA] [GO:0005829 "cytosol"
          evidence=ISO;ISS] [GO:0006508 "proteolysis" evidence=IEP;ISO]
          [GO:0007283 "spermatogenesis" evidence=IEP] [GO:0008233 "peptidase
          activity" evidence=ISO] [GO:0008234 "cysteine-type peptidase
          activity" evidence=ISO] [GO:0008284 "positive regulation of cell
          proliferation" evidence=ISO;ISS] [GO:0010628 "positive regulation of
          gene expression" evidence=ISO;ISS] [GO:0010634 "positive regulation
          of epithelial cell migration" evidence=ISO;ISS] [GO:0010813
          "neuropeptide catabolic process" evidence=ISO;ISS] [GO:0010815
          "bradykinin catabolic process" evidence=ISO;ISS] [GO:0010952
          "positive regulation of peptidase activity" evidence=ISO;ISS]
          [GO:0016505 "apoptotic protease activator activity" evidence=ISO;ISS]
          [GO:0030108 "HLA-A specific activating MHC class I receptor activity"
          evidence=ISO;ISS] [GO:0030335 "positive regulation of cell migration"
          evidence=ISO;ISS] [GO:0030984 "kininogen binding" evidence=IPI]
          [GO:0031638 "zymogen activation" evidence=ISO;ISS] [GO:0031648
          "protein destabilization" evidence=ISO;ISS] [GO:0032403 "protein
          complex binding" evidence=IPI] [GO:0032526 "response to retinoic
          acid" evidence=ISO;ISS] [GO:0033619 "membrane protein proteolysis"
          evidence=ISO;ISS] [GO:0035085 "cilium axoneme" evidence=IDA]
          [GO:0043066 "negative regulation of apoptotic process"
          evidence=ISO;ISS] [GO:0043129 "surfactant homeostasis"
          evidence=ISO;ISS] [GO:0043621 "protein self-association"
          evidence=IDA] [GO:0045766 "positive regulation of angiogenesis"
          evidence=ISO;ISS] [GO:0060448 "dichotomous subdivision of terminal
          units involved in lung branching" evidence=ISO;ISS] [GO:0070324
          "thyroid hormone binding" evidence=ISO;ISS] [GO:0070371 "ERK1 and
          ERK2 cascade" evidence=ISO;ISS] [GO:0097067 "cellular response to
          thyroid hormone stimulus" evidence=ISO;IEP] [GO:0097208 "alveolar
          lamellar body" evidence=ISO;ISS;IDA] InterPro:IPR000668
          InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
          InterPro:IPR000169 RGD:2447 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
          GO:GO:0008284 GO:GO:0070371 GO:GO:0001669 eggNOG:COG4870
          HOGENOM:HOG000230774 InterPro:IPR025661 InterPro:IPR025660
          InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
          PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0007283
          GO:GO:0045766 GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
          GO:GO:0043621 GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 KO:K01366
          GO:GO:0016505 GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
          HOVERGEN:HBG011513 GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
          GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT OrthoDB:EOG4W9J43
          GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
          GO:GO:0010813 GO:GO:0043129 MEROPS:I29.003 EMBL:Y00708 EMBL:BC085352
          EMBL:M38135 IPI:IPI00212809 PIR:S00211 RefSeq:NP_037071.1
          UniGene:Rn.1997 ProteinModelPortal:P00786 SMR:P00786 STRING:P00786
          PRIDE:P00786 Ensembl:ENSRNOT00000019285 GeneID:25425 KEGG:rno:25425
          UCSC:RGD:2447 InParanoid:P00786 BindingDB:P00786 NextBio:606599
          Genevestigator:P00786 GermOnline:ENSRNOG00000014064 GO:GO:0035086
          GO:GO:0001520 Uniprot:P00786
        Length = 333

 Score = 396 (144.5 bits), Expect = 2.6e-53, Sum P(2) = 2.6e-53
 Identities = 84/211 (39%), Positives = 126/211 (59%)

Query:    42 AGAW--SEGYPQKYDPQSMEE-RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS 98
             AGAW  S G   +    ++E+  F +W+KQ+ + Y S  E+  R  ++++N + I   N 
Sbjct:    10 AGAWLLSAGATAELTVNAIEKFHFTSWMKQHQKTYSSR-EYSHRLQVFANNWRKIQAHNQ 68

Query:    99 QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGL---PASVDWRKEG-AV 154
             +N +FK+  N+F+D+S  E    YL +++P N     S    G    P+S+DWRK+G  V
Sbjct:    69 RNHTFKMGLNQFSDMSFAEIKHKYL-WSEPQNCSATKSNYLRGTGPYPSSMDWRKKGNVV 127

Query:   155 TPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKA 214
             +PVK+QG CGSCW FS   A+E    + +GK+++L+EQ+LVDC  N  N GC GG   +A
Sbjct:   128 SPVKNQGACGSCWTFSTTGALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQA 187

Query:   215 FEFITKIGGVTTEDDYPYRGKNDRCQTDKTK 245
             FE+I    G+  ED YPY GKN +C+ +  K
Sbjct:   188 FEYILYNKGIMGEDSYPYIGKNGQCKFNPEK 218

 Score = 173 (66.0 bits), Expect = 2.6e-53, Sum P(2) = 2.6e-53
 Identities = 33/79 (41%), Positives = 45/79 (56%)

Query:   263 FQLYSHGVFDEYCGHQ----LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
             F +Y  GV+     H+    +NH V  VGYGE +G  YW+VKNSWG++WG  GY  + R 
Sbjct:   257 FMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSNWGNNGYFLIERG 316

Query:   319 SPSSNIGICGILMQASYPV 337
                    +CG+   ASYP+
Sbjct:   317 K-----NMCGLAACASYPI 330


>UNIPROTKB|Q90686
            symbol:CTSK "Cathepsin K" species:9031 "Gallus gallus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            CTD:1513 KO:K01371 PANTHER:PTHR12411:SF55 EMBL:U37691
            IPI:IPI00575213 RefSeq:NP_990302.1 UniGene:Gga.51509
            ProteinModelPortal:Q90686 SMR:Q90686 MEROPS:C01.036 GeneID:395818
            KEGG:gga:395818 NextBio:20815886 Uniprot:Q90686
        Length = 334

 Score = 372 (136.0 bits), Expect = 2.6e-53, Sum P(2) = 2.6e-53
 Identities = 78/162 (48%), Positives = 99/162 (61%)

Query:   102 SFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTP 156
             SF+L  N   D+++EE + T  G   P + PR     Y+       PA+VDWR++G VTP
Sbjct:    75 SFQLAMNYLGDMTSEEVVRTMTGLRVPRSRPRPNGTLYVPDWSSRAPAAVDWRRKGYVTP 134

Query:   157 VKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFE 216
             VKDQGQCGSCWAFS+V A+EG  K +TGKL+SLS Q LV C  N  N GC GGYM  AFE
Sbjct:   135 VKDQGQCGSCWAFSSVGALEGQLKRRTGKLLSLSPQNLVYCVSN--NNGCGGGYMTNAFE 192

Query:   217 FITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             ++    G+ +ED YPY G+++ C    T   A    GY  IP
Sbjct:   193 YVRLNRGIDSEDAYPYIGQDESCMYSPTGK-AAKCRGYREIP 233

 Score = 197 (74.4 bits), Expect = 2.6e-53, Sum P(2) = 2.6e-53
 Identities = 37/77 (48%), Positives = 48/77 (62%)

Query:   262 AFQLYSHGVF-DEYCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
             +FQ YS GV+ D  C  + +NH V  VGYG   G K+W++KNSWGT WG  GY+ +ARN 
Sbjct:   260 SFQFYSRGVYYDTGCNPENINHAVLAVGYGAQKGTKHWIIKNSWGTEWGNKGYVLLARNM 319

Query:   320 PSSNIGICGILMQASYP 336
               +    CGI   AS+P
Sbjct:   320 KQT----CGIANLASFP 332


>DICTYBASE|DDB_G0279185
            symbol:cprF "cysteine proteinase 6" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0279185 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000030 HSSP:P07711 ProtClustDB:CLSZ2846820 EMBL:U72745
            RefSeq:XP_641725.1 ProteinModelPortal:Q94503 SMR:Q94503
            MEROPS:C01.081 PRIDE:Q94503 EnsemblProtists:DDB0215002
            GeneID:8621921 KEGG:ddi:DDB_G0279185 Uniprot:Q94503
        Length = 434

 Score = 397 (144.8 bits), Expect = 4.7e-53, Sum P(3) = 4.7e-53
 Identities = 81/186 (43%), Positives = 114/186 (61%)

Query:    62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
             F NW+  + R Y SE E+  RF I+ +N+ YI+  N++     L  N FAD++NEE+ +T
Sbjct:    30 FTNWMIAHQRHYSSE-EFNGRFNIFKANMDYINEWNTKGSETVLGLNVFADITNEEYRAT 88

Query:   122 YLGYNKPYNEPRW---PSVQYLG-LPA-SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVE 176
             YLG   P++       PS +  G + A SVDWR +GAVTP+K+QG+CG CW+FSA  A E
Sbjct:    89 YLG--TPFDASSLEMTPSEKVFGGVQANSVDWRAKGAVTPIKNQGECGGCWSFSATGATE 146

Query:   177 GINKLKTGK--LVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
             G   +  G   L S+SEQ+L+DC  +  N GC GG M  AFE+I   GG+ TE  YP+  
Sbjct:   147 GAQYIANGDSDLTSVSEQQLIDCSGSYGNNGCEGGLMTLAFEYIINNGGIDTESSYPFTA 206

Query:   235 KNDRCQ 240
               ++C+
Sbjct:   207 NTEKCK 212

 Score = 107 (42.7 bits), Expect = 4.7e-53, Sum P(3) = 4.7e-53
 Identities = 20/41 (48%), Positives = 25/41 (60%)

Query:   296 YWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
             YW+VKNSWG  WG  GYI M+++  +     CGI   AS P
Sbjct:   389 YWIVKNSWGLDWGINGYILMSKDKDNQ----CGIATMASIP 425

 Score = 74 (31.1 bits), Expect = 4.7e-53, Sum P(3) = 4.7e-53
 Identities = 15/30 (50%), Positives = 21/30 (70%)

Query:   262 AFQLYSHGVFDE-YCGH-QLNHGVTVVGYG 289
             +FQ YS G+++E  C   QL+HGV  VG+G
Sbjct:   255 SFQFYSSGIYNEPACSSTQLDHGVLAVGFG 284


>UNIPROTKB|O46427
            symbol:CTSH "Pro-cathepsin H" species:9823 "Sus scrofa"
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0032526 "response to retinoic acid" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0043129
            "surfactant homeostasis" evidence=ISS] [GO:0010815 "bradykinin
            catabolic process" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0005829 "cytosol"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0030335 "positive regulation of cell
            migration" evidence=ISS] [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0016505 "apoptotic protease activator
            activity" evidence=ISS] [GO:0004252 "serine-type endopeptidase
            activity" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=ISS] [GO:0031638 "zymogen activation"
            evidence=ISS] [GO:0030108 "HLA-A specific activating MHC class I
            receptor activity" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0010628 "positive regulation of gene
            expression" evidence=ISS] [GO:0070324 "thyroid hormone binding"
            evidence=ISS] [GO:0006508 "proteolysis" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0060448
            "dichotomous subdivision of terminal units involved in lung
            branching" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] [GO:0005764 "lysosome" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0004177 "aminopeptidase
            activity" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=ISS] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 MEROPS:C01.040 CTD:1512 OMA:STSCHKT
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 EMBL:AF001169
            RefSeq:NP_999094.1 UniGene:Ssc.3593 PDB:1NB3 PDB:1NB5 PDB:8PCH
            PDBsum:1NB3 PDBsum:1NB5 PDBsum:8PCH ProteinModelPortal:O46427
            SMR:O46427 Ensembl:ENSSSCT00000001983 GeneID:396969 KEGG:ssc:396969
            EvolutionaryTrace:O46427 ArrayExpress:O46427 Uniprot:O46427
        Length = 335

 Score = 399 (145.5 bits), Expect = 5.4e-53, Sum P(2) = 5.4e-53
 Identities = 91/224 (40%), Positives = 134/224 (59%)

Query:    28 AVLSLFLL--WVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGI 85
             AVLSL     W+LG PA   S      ++    +  F++W+ Q+ ++Y  E E+  R  +
Sbjct:     3 AVLSLLCAGAWLLGPPACGASNLAVSSFE----KLHFKSWMVQHQKKYSLE-EYHHRLQV 57

Query:    86 YSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYN--EPRWPSVQYLG-L 142
             + SN + I+  N+ N +FKL  N+F+D+S +E    YL +++P N    +   ++  G  
Sbjct:    58 FVSNWRKINAHNAGNHTFKLGLNQFSDMSFDEIRHKYL-WSEPQNCSATKGNYLRGTGPY 116

Query:   143 PASVDWRKEGA-VTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
             P S+DWRK+G  V+PVK+QG CGSCW FS   A+E    + TGK++SL+EQ+LVDC  N 
Sbjct:   117 PPSMDWRKKGNFVSPVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNF 176

Query:   202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRC--QTDK 243
              N GC GG   +AFE+I    G+  ED YPY+G++D C  Q DK
Sbjct:   177 NNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKGQDDHCKFQPDK 220

 Score = 167 (63.8 bits), Expect = 5.4e-53, Sum P(2) = 5.4e-53
 Identities = 32/79 (40%), Positives = 44/79 (55%)

Query:   263 FQLYSHGVFDEYCGHQ----LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
             F +Y  G++     H+    +NH V  VGYGE++G  YW+VKNSWG  WG  GY  + R 
Sbjct:   259 FLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERG 318

Query:   319 SPSSNIGICGILMQASYPV 337
                    +CG+   ASYP+
Sbjct:   319 K-----NMCGLAACASYPI 332


>ZFIN|ZDB-GENE-050522-559
            symbol:ctssb.1 "cathepsin S, b.1" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-050522-559 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 MEROPS:C01.034
            EMBL:BC095694 IPI:IPI00607338 UniGene:Dr.75553
            ProteinModelPortal:Q502H6 SMR:Q502H6 InParanoid:Q502H6
            ArrayExpress:Q502H6 Uniprot:Q502H6
        Length = 330

 Score = 357 (130.7 bits), Expect = 5.4e-53, Sum P(2) = 5.4e-53
 Identities = 82/207 (39%), Positives = 119/207 (57%)

Query:    57 SMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYIN---SQNL-SFKLTDNKFA 111
             ++++ +E W K Y + Y +E +E+ RR  ++  N+Q I   N   S  + S+ L+ N   
Sbjct:    22 NLDQHWELWKKTYGKIYTTEVEEFGRR-QLWERNLQLITVHNLEASMGMHSYDLSMNHMG 80

Query:   112 DLSNEEFISTYLGYNKPYNEPRWPS--VQYLG--LPASVDWRKEGAVTPVKDQGQCGSCW 167
             DL+ EE + T    + P    R  +  V   G  +P S+DWR++G V+ VK QG CGSCW
Sbjct:    81 DLTTEEILQTLALTHVPSGFKRQIANIVGSSGDAVPDSLDWREKGYVSSVKMQGACGSCW 140

Query:   168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
             AFS+V A+EG  K  TGKLV LS Q LVDC     N+GCNGG+M  AF+++   GG+ ++
Sbjct:   141 AFSSVGALEGQLKKTTGKLVDLSPQNLVDCSSKYGNKGCNGGFMSDAFQYVIDNGGIASD 200

Query:   228 DDYPYRGKNDRCQTDKTKHHAVTITGY 254
               YPYRG   +C    ++  A   T Y
Sbjct:   201 SAYPYRGVQQQCSYSSSQR-AANCTKY 226

 Score = 209 (78.6 bits), Expect = 5.4e-53, Sum P(2) = 5.4e-53
 Identities = 46/84 (54%), Positives = 55/84 (65%)

Query:   256 AIPA-RYAFQLYSHGVF-DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYI 313
             AI A R  F LY  GV+ D  C  ++NH V VVGYG   G+ +WLVKNSWGT +G+ GYI
Sbjct:   250 AIDATRPQFVLYHSGVYNDPTCSKRVNHAVLVVGYGTLSGQDHWLVKNSWGTRFGDGGYI 309

Query:   314 RMARNSPSSNIGICGILMQASYPV 337
             RMARN   +N+  CGI   A YPV
Sbjct:   310 RMARNK--NNM--CGIASYACYPV 329


>ZFIN|ZDB-GENE-001205-4
            symbol:ctsk "cathepsin K" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-001205-4 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            CTD:1513 KO:K01371 OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55
            EMBL:BC092901 IPI:IPI00512751 RefSeq:NP_001017778.1
            UniGene:Dr.76224 ProteinModelPortal:Q568D6 SMR:Q568D6 GeneID:550475
            KEGG:dre:550475 InParanoid:Q568D6 NextBio:20879718
            ArrayExpress:Q568D6 Uniprot:Q568D6
        Length = 333

 Score = 372 (136.0 bits), Expect = 6.9e-53, Sum P(2) = 6.9e-53
 Identities = 89/223 (39%), Positives = 124/223 (55%)

Query:    45 WSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSF 103
             W  G     D  S++E +E+W   + REY   +E   R  I+  N+ +I+  N +  L  
Sbjct:    14 WC-GLAHSLDNLSLDEAWESWKITHKREYNGLNEESIRRTIWEKNMLFIEAHNKEYELGI 72

Query:   104 KLTD---NKFADLSNEEFISTYLGYNKP-YNEPRWPSV--QYLG-LPASVDWRKEGAVTP 156
                D   N F D++ EE     +G   P Y +P    V    +G LP S+D+RK G VT 
Sbjct:    73 HTYDLGMNHFGDMTLEEVAEKVMGLQMPMYRDPANTFVPDDRVGKLPKSIDYRKLGYVTS 132

Query:   157 VKDQGQCGSCWAFSAVAAVEGINKLKT-GKLVSLSEQELVDCDVNSENQGCNGGYMEKAF 215
             VK+QG CGSCWAFS+V A+EG   +KT G+LV LS Q LVDC   +EN GC GGYM  AF
Sbjct:   133 VKNQGSCGSCWAFSSVGALEG-QLMKTKGQLVDLSPQNLVDCV--TENDGCGGGYMTNAF 189

Query:   216 EFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
              +++   G+ +E+ YPY G + +C  + T   A +  GY+ IP
Sbjct:   190 RYVSNNQGIDSEESYPYVGTDQQCAYN-TSGVAASCRGYKEIP 231

 Score = 193 (73.0 bits), Expect = 6.9e-53, Sum P(2) = 6.9e-53
 Identities = 42/105 (40%), Positives = 59/105 (56%)

Query:   236 NDRCQTDKTKHHAVTITGYEAIPARYAFQLYSHGVF-DEYCGHQ-LNHGVTVVGYGED-H 292
             N+R  T    +      G +A+ + + +  Y  GV+ D  C  + +NH V  VGYG    
Sbjct:   234 NERALTAAVANVGPVSVGIDAMQSTFLY--YKSGVYYDPNCNKEDVNHAVLAVGYGATPR 291

Query:   293 GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
             G+KYW+VKNSWG  WG+ GY+ MARN  ++    CGI   AS+PV
Sbjct:   292 GKKYWIVKNSWGEEWGKKGYVLMARNRNNA----CGIANLASFPV 332


>UNIPROTKB|F1NEC8
            symbol:CTSL2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:AADN02067812 IPI:IPI00820956 Ensembl:ENSGALT00000037988
            ArrayExpress:F1NEC8 Uniprot:F1NEC8
        Length = 218

 Score = 357 (130.7 bits), Expect = 1.8e-52, Sum P(2) = 1.8e-52
 Identities = 66/116 (56%), Positives = 83/116 (71%)

Query:   143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
             P SVDWR++G VTPVKDQGQCGSCWAFS   A+EG +  KTGKLVSLSEQ LVDC     
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEG 61

Query:   203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             NQGCNGG M++AF+++   GG+ +E+ YPY  K+D     K +++A   TG+  IP
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIP 117

 Score = 204 (76.9 bits), Expect = 1.8e-52, Sum P(2) = 1.8e-52
 Identities = 42/85 (49%), Positives = 56/85 (65%)

Query:   256 AIPARYA-FQLYSHGVFDEY-CGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGY 312
             AI A ++ FQ Y  G++ E  C  + L+HGV VVGYG + G+KYW+VKNSWG  WG+ GY
Sbjct:   137 AIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEDGKKYWIVKNSWGEKWGDKGY 196

Query:   313 IRMARNSPSSNIGICGILMQASYPV 337
             I MA++  +     CGI   ASYP+
Sbjct:   197 IYMAKDRKNH----CGIATAASYPL 217


>RGD|621513
            symbol:Ctss "cathepsin S" species:10116 "Rattus norvegicus"
            [GO:0001656 "metanephros development" evidence=IEP] [GO:0002250
            "adaptive immune response" evidence=ISO] [GO:0005764 "lysosome"
            evidence=IEA;ISO] [GO:0006508 "proteolysis" evidence=IEA;ISO]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0009986 "cell
            surface" evidence=IDA] [GO:0016020 "membrane" evidence=ISO]
            [GO:0043231 "intracellular membrane-bounded organelle"
            evidence=ISO] [GO:0045453 "bone resorption" evidence=IMP]
            [GO:0051930 "regulation of sensory perception of pain"
            evidence=IMP] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            RGD:621513 GO:GO:0009986 GO:GO:0051930 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0045453
            GO:GO:0001656 HOVERGEN:HBG011513 CTD:1520 KO:K01368 MEROPS:I29.004
            BRENDA:3.4.22.27 EMBL:L03201 IPI:IPI00210228 PIR:A45087
            RefSeq:NP_059016.1 UniGene:Rn.11347 ProteinModelPortal:Q02765
            PhosphoSite:Q02765 PRIDE:Q02765 GeneID:50654 KEGG:rno:50654
            UCSC:RGD:621513 ChEMBL:CHEMBL1075217 NextBio:610462
            Genevestigator:Q02765 Uniprot:Q02765
        Length = 330

 Score = 352 (129.0 bits), Expect = 2.3e-52, Sum P(2) = 2.3e-52
 Identities = 88/232 (37%), Positives = 127/232 (54%)

Query:    37 VLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYI 96
             VLG P      G   +  P +++  ++ W K   R    ++E   R  I+  N+++I   
Sbjct:     3 VLGAPGVLCDNGATAER-P-TLDHHWDLWKKTRMRRNTDQNEEDVRRLIWEKNLKFIMLH 60

Query:    97 NSQNL----SFKLTDNKFADLSNEEFISTYLG---YNKPYNEP-RWPSVQYLGLPASVDW 148
             N ++     S+ +  N   D++ EE I  Y+G     +P+N      S     LP SVDW
Sbjct:    61 NLEHSMGMHSYSVGMNHMGDMTPEEVIG-YMGSLRIPRPWNRSGTLKSSSNQTLPDSVDW 119

Query:   149 RKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE--NQGC 206
             R++G VT VK QG CGSCWAFSA  A+EG  KLKTGKLVSLS Q LVDC    +  N+GC
Sbjct:   120 REKGCVTNVKYQGSCGSCWAFSAEGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGC 179

Query:   207 NGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
              GG+M +AF++I     + +E  YPY+  +++C  D  K+ A T + Y  +P
Sbjct:   180 GGGFMTEAFQYIIDTS-IDSEASYPYKAMDEKCLYDP-KNRAATCSRYIELP 229

 Score = 208 (78.3 bits), Expect = 2.3e-52, Sum P(2) = 2.3e-52
 Identities = 41/76 (53%), Positives = 50/76 (65%)

Query:   262 AFQLYSHGVFDE-YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
             +F LY  GV+D+  C   +NHGV VVGYG   G+ YWLVKNSWG  +G+ GYIRMARN+ 
Sbjct:   257 SFFLYQSGVYDDPSCTENMNHGVLVVGYGTLDGKDYWLVKNSWGLHFGDQGYIRMARNNK 316

Query:   321 SSNIGICGILMQASYP 336
             +     CGI    SYP
Sbjct:   317 NH----CGIASYCSYP 328


>ZFIN|ZDB-GENE-040718-61
            symbol:ctsl.1 "cathepsin L.1" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-040718-61
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            GeneTree:ENSGT00660000095458 MEROPS:C01.092 EMBL:FP015965
            EMBL:BC075887 IPI:IPI00513499 RefSeq:NP_001002368.1
            UniGene:Dr.85174 SMR:Q6DHT0 Ensembl:ENSDART00000017756
            GeneID:436641 KEGG:dre:436641 CTD:436641 InParanoid:Q6DHT0
            OMA:GGQMENA OrthoDB:EOG41ZFB9 NextBio:20831086 Uniprot:Q6DHT0
        Length = 334

 Score = 346 (126.9 bits), Expect = 2.3e-52, Sum P(2) = 2.3e-52
 Identities = 83/211 (39%), Positives = 122/211 (57%)

Query:    57 SMEE-RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI---DYINSQNL-SFKLTDNKFA 111
             S+E+  F  W  ++ + Y S +E   R   + +N + +   + +  Q L S++L    FA
Sbjct:    20 SLEDMEFHAWKLKFGKSYRSAEEESHRQLTWLTNRKLVLVHNMMADQGLKSYRLGMTYFA 79

Query:   112 DLSNEEFIS-TYLGYNKPYN--EPRWPSVQY-----LGLPASVDWRKEGAVTPVKDQGQC 163
             D+SNEE+    + G     N  + R  S  +       +P +VDWR +G VT +KDQ QC
Sbjct:    80 DMSNEEYRQLVFRGCLGSMNNTKARGGSTFFRLRKAAVVPDTVDWRDKGYVTDIKDQKQC 139

Query:   164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
             GSCWAFSA  ++EG    KTGKLVSLSEQ+LVDC  +  N GC+GG M++AF++I    G
Sbjct:   140 GSCWAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGSYGNYGCDGGLMDQAFQYIEANKG 199

Query:   224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
             + TED YPY  ++  C+ + +   A + TGY
Sbjct:   200 LDTEDSYPYEAQDGECRFNPSTVGA-SCTGY 229

 Score = 214 (80.4 bits), Expect = 2.3e-52, Sum P(2) = 2.3e-52
 Identities = 44/85 (51%), Positives = 56/85 (65%)

Query:   256 AIPARYA-FQLYSHGVFDEY-CGH-QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGY 312
             AI A ++ FQLYS GV++E  C   +L+HGV  VGYG  +G+ YW+VKNSWG  WG  GY
Sbjct:   253 AIDAGHSSFQLYSSGVYNEPDCSSSELDHGVLAVGYGSSNGDDYWIVKNSWGLDWGVQGY 312

Query:   313 IRMARNSPSSNIGICGILMQASYPV 337
             I M+RN  +     CGI   ASYP+
Sbjct:   313 ILMSRNKSNQ----CGIATAASYPL 333


>UNIPROTKB|F7B939
            symbol:CTSH "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458 CTD:1512
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 EMBL:ACFV01158341
            EMBL:ACFV01158342 EMBL:ACFV01158343 RefSeq:XP_002753411.1
            Ensembl:ENSCJAT00000004397 GeneID:100413104 Uniprot:F7B939
        Length = 336

 Score = 387 (141.3 bits), Expect = 2.9e-52, Sum P(2) = 2.9e-52
 Identities = 83/209 (39%), Positives = 125/209 (59%)

Query:    37 VLGIPAGAWSEGYPQKYDPQSMEE-RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDY 95
             +LG PA   +E         S+E+  F++W+ ++ + Y  E+E+ +R   ++SN + I+ 
Sbjct:    14 LLGAPARGAAE-----LSVNSLEKFHFKSWMAKHHKTYSREEEYHQRLQTFASNWRKINA 68

Query:    96 INSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGL---PASVDWRKEG 152
              N+ N +FK+  N+F+D+S  E    YL +++P N     S    G    P SVDWRK+G
Sbjct:    69 HNNGNHTFKMAVNQFSDMSFAEIKRKYL-WSEPQNCSATKSNYLRGTGPYPPSVDWRKKG 127

Query:   153 A-VTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYM 211
               V+PVK+QG CGSCW FS   A+E    + TGK++SL+EQ+LVDC  +  N GC GG  
Sbjct:   128 HFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLP 187

Query:   212 EKAFEFITKIGGVTTEDDYPYRGKNDRCQ 240
              +AFE+I    G+  ED YPY+GK+  C+
Sbjct:   188 SQAFEYILYNNGIMGEDTYPYQGKDSDCK 216

 Score = 172 (65.6 bits), Expect = 2.9e-52, Sum P(2) = 2.9e-52
 Identities = 33/79 (41%), Positives = 44/79 (55%)

Query:   263 FQLYSHGVFDEYCGHQ----LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
             F +Y  G++     H+    +NH V  VGYGE++G  YW+VKNSWG  WG  GY  + R 
Sbjct:   260 FMMYKRGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERG 319

Query:   319 SPSSNIGICGILMQASYPV 337
                    +CG+   ASYPV
Sbjct:   320 K-----NMCGLAACASYPV 333


>TAIR|locus:2122113
            symbol:XCP1 "xylem cysteine peptidase 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0000325 "plant-type vacuole" evidence=IDA] [GO:0005634
            "nucleus" evidence=IDA] [GO:0010623 "developmental programmed cell
            death" evidence=IMP] [GO:0010413 "glucuronoxylan metabolic process"
            evidence=RCA] [GO:0045492 "xylan biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005886
            GO:GO:0005634 EMBL:CP002687 GenomeReviews:CT486007_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0000325
            EMBL:AL022604 EMBL:AL161587 GO:GO:0010623 MEROPS:I29.003
            HOGENOM:HOG000230773 EMBL:AF191027 EMBL:AK117394 EMBL:BT005179
            IPI:IPI00532220 PIR:T06122 RefSeq:NP_567983.1 UniGene:At.2280
            UniGene:At.67622 ProteinModelPortal:O65493 SMR:O65493 STRING:O65493
            PaxDb:O65493 PRIDE:O65493 EnsemblPlants:AT4G35350.1 GeneID:829688
            KEGG:ath:AT4G35350 GeneFarm:5033 TAIR:At4g35350 InParanoid:O65493
            KO:K16290 OMA:FEVFREN PhylomeDB:O65493 ProtClustDB:CLSN2689772
            Genevestigator:O65493 Uniprot:O65493
        Length = 355

 Score = 540 (195.1 bits), Expect = 4.4e-52, P = 4.4e-52
 Identities = 109/219 (49%), Positives = 143/219 (65%)

Query:    48 GYPQKY--DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKL 105
             GY  ++  +   + E FE+W+ ++S+ Y S +E   RF ++  N+ +ID  N++  S+ L
Sbjct:    35 GYTPEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWL 94

Query:   106 TDNKFADLSNEEFISTYLGYNKP-YNEPRWPSVQY-----LGLPASVDWRKEGAVTPVKD 159
               N+FADL++EEF   YLG  KP ++  R PS  +       LP SVDWRK+GAV PVKD
Sbjct:    95 GLNEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKD 154

Query:   160 QGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFIT 219
             QGQCGSCWAFS VAAVEGIN++ TG L SLSEQEL+DCD    N GCNGG M+ AF++I 
Sbjct:   155 QGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTF-NSGCNGGLMDYAFQYII 213

Query:   220 KIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
               GG+  EDDYPY  +   CQ  K     VTI+GYE +P
Sbjct:   214 STGGLHKEDDYPYLMEEGICQEQKEDVERVTISGYEDVP 252

 Score = 226 (84.6 bits), Expect = 3.9e-17, P = 3.9e-17
 Identities = 52/130 (40%), Positives = 68/130 (52%)

Query:   209 GYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQLYSH 268
             G  ++  E + ++  ++  +D P   +ND     K   H       EA  +   FQ Y  
Sbjct:   231 GICQEQKEDVERVT-ISGYEDVP---ENDDESLVKALAHQPVSVAIEA--SGRDFQFYKG 284

Query:   269 GVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICG 328
             GVF+  CG  L+HGV  VGYG   G  Y +VKNSWG  WGE G+IRM RN+     G+CG
Sbjct:   285 GVFNGKCGTDLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRNTGKPE-GLCG 343

Query:   329 ILMQASYPVK 338
             I   ASYP K
Sbjct:   344 INKMASYPTK 353


>MGI|MGI:107285
            symbol:Ctsh "cathepsin H" species:10090 "Mus musculus"
            [GO:0001520 "outer dense fiber" evidence=ISO] [GO:0001669
            "acrosomal vesicle" evidence=ISO] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=IGI] [GO:0002764 "immune response-regulating
            signaling pathway" evidence=ISO] [GO:0004175 "endopeptidase
            activity" evidence=ISO;IMP] [GO:0004177 "aminopeptidase activity"
            evidence=ISO] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISO;IDA] [GO:0004252 "serine-type endopeptidase activity"
            evidence=IMP] [GO:0005615 "extracellular space" evidence=ISO]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0005829 "cytosol"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=ISO;IMP]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0008284
            "positive regulation of cell proliferation" evidence=IMP]
            [GO:0010628 "positive regulation of gene expression" evidence=ISO]
            [GO:0010634 "positive regulation of epithelial cell migration"
            evidence=IMP] [GO:0010813 "neuropeptide catabolic process"
            evidence=ISO] [GO:0010815 "bradykinin catabolic process"
            evidence=ISO] [GO:0010952 "positive regulation of peptidase
            activity" evidence=IGI;ISO] [GO:0016505 "apoptotic protease
            activator activity" evidence=IGI;ISO] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0030108 "HLA-A specific activating MHC
            class I receptor activity" evidence=ISO] [GO:0030335 "positive
            regulation of cell migration" evidence=ISO] [GO:0030984 "kininogen
            binding" evidence=ISO] [GO:0031638 "zymogen activation"
            evidence=ISO;IMP] [GO:0031648 "protein destabilization"
            evidence=ISO;IMP] [GO:0032403 "protein complex binding"
            evidence=ISO] [GO:0032526 "response to retinoic acid" evidence=IDA]
            [GO:0033619 "membrane protein proteolysis" evidence=ISO;IMP]
            [GO:0035085 "cilium axoneme" evidence=ISO] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IMP] [GO:0043129
            "surfactant homeostasis" evidence=ISO] [GO:0043621 "protein
            self-association" evidence=ISO] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IMP] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=IMP]
            [GO:0070324 "thyroid hormone binding" evidence=ISO] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISO] [GO:0097208 "alveolar
            lamellar body" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:107285 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 EMBL:CH466560 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            HOVERGEN:HBG011513 GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT OrthoDB:EOG4W9J43
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 BRENDA:3.4.22.16
            EMBL:U06119 EMBL:AK149949 EMBL:AK150583 EMBL:AK157376 EMBL:AK160026
            EMBL:Y18464 IPI:IPI00118987 RefSeq:NP_031827.2 UniGene:Mm.2277
            ProteinModelPortal:P49935 SMR:P49935 STRING:P49935 MEROPS:I29.003
            PhosphoSite:P49935 PaxDb:P49935 PRIDE:P49935
            Ensembl:ENSMUST00000034915 GeneID:13036 KEGG:mmu:13036
            InParanoid:Q3UCD6 ChEMBL:CHEMBL1949491 NextBio:282920 Bgee:P49935
            CleanEx:MM_CTSH Genevestigator:P49935 GermOnline:ENSMUSG00000032359
            Uniprot:P49935
        Length = 333

 Score = 382 (139.5 bits), Expect = 4.8e-52, Sum P(2) = 4.8e-52
 Identities = 82/211 (38%), Positives = 125/211 (59%)

Query:    42 AGAW--SEGYPQKYDPQSMEE-RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS 98
             AGAW  S G   +    ++E+  F++W+KQ+ + Y S  E+  R  ++++N + I   N 
Sbjct:    10 AGAWLLSTGATAELTVNAIEKFHFKSWMKQHQKTYSSV-EYNHRLQMFANNWRKIQAHNQ 68

Query:    99 QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGL---PASVDWRKEG-AV 154
             +N +FK+  N+F+D+S  E    +L +++P N     S    G    P+S+DWRK+G  V
Sbjct:    69 RNHTFKMALNQFSDMSFAEIKHKFL-WSEPQNCSATKSNYLRGTGPYPSSMDWRKKGNVV 127

Query:   155 TPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKA 214
             +PVK+QG CGSCW FS   A+E    + +GK++SL+EQ+LVDC     N GC GG   +A
Sbjct:   128 SPVKNQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQA 187

Query:   215 FEFITKIGGVTTEDDYPYRGKNDRCQTDKTK 245
             FE+I    G+  ED YPY GK+  C+ +  K
Sbjct:   188 FEYILYNKGIMEEDSYPYIGKDSSCRFNPQK 218

 Score = 175 (66.7 bits), Expect = 4.8e-52, Sum P(2) = 4.8e-52
 Identities = 34/79 (43%), Positives = 45/79 (56%)

Query:   263 FQLYSHGVFDEYCGHQ----LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
             F +Y  GV+     H+    +NH V  VGYGE +G  YW+VKNSWG+ WGE GY  + R 
Sbjct:   257 FLMYKSGVYSSKSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSQWGENGYFLIERG 316

Query:   319 SPSSNIGICGILMQASYPV 337
                    +CG+   ASYP+
Sbjct:   317 K-----NMCGLAACASYPI 330


>DICTYBASE|DDB_G0272815
            symbol:cprE "cysteine proteinase 5" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0272815 GO:GO:0005615
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GenomeReviews:CM000151_GR GO:GO:0005764
            EMBL:AAFI02000008 MEROPS:I29.003 KO:K01376 EMBL:L36205
            RefSeq:XP_644977.1 ProteinModelPortal:P54640 SMR:P54640
            PRIDE:P54640 EnsemblProtists:DDB0185092 GeneID:8618654
            KEGG:ddi:DDB_G0272815 OMA:METAFEF ProtClustDB:CLSZ2430780
            Uniprot:P54640
        Length = 344

 Score = 421 (153.3 bits), Expect = 6.1e-52, Sum P(2) = 6.1e-52
 Identities = 84/212 (39%), Positives = 125/212 (58%)

Query:    51 QKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKF 110
             Q++        F +W+  + + Y SE E+  R+ I+ +N+ Y+   NS+     L  N F
Sbjct:    19 QQFSELQYRNAFTDWMITHQKSYTSE-EFGARYNIFKANMDYVQQWNSKGSETVLGLNNF 77

Query:   111 ADLSNEEFISTYLGYNKPYNE---PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
             AD++NEE+ +TYLG     +     +   V      AS DWR EGAVTPVK+QGQCG CW
Sbjct:    78 ADITNEEYRNTYLGTKFDASSLIGTQEEKVFTTSSAASKDWRSEGAVTPVKNQGQCGGCW 137

Query:   168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
             +FS   + EG +    G+LVSLSEQ L+DC  ++EN GC+GG M  AFE+I    G+ TE
Sbjct:   138 SFSTTGSTEGAHFQSKGELVSLSEQNLIDC--STENSGCDGGLMTYAFEYIINNNGIDTE 195

Query:   228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA 259
               YPY+ +N +C+  K+++   T++ Y+ + A
Sbjct:   196 SSYPYKAENGKCEY-KSENSGATLSSYKTVTA 226

 Score = 135 (52.6 bits), Expect = 6.1e-52, Sum P(2) = 6.1e-52
 Identities = 25/43 (58%), Positives = 31/43 (72%)

Query:   295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
             +YW+VKNSWGTSWG  GYI M+RN  ++    CGI   AS+PV
Sbjct:   305 EYWIVKNSWGTSWGIEGYILMSRNRDNN----CGIASSASFPV 343

 Score = 84 (34.6 bits), Expect = 1.4e-46, Sum P(2) = 1.4e-46
 Identities = 27/90 (30%), Positives = 44/90 (48%)

Query:   256 AIPARY-AFQLYSHGVFDE-YCGHQ-LNHGVTVVGYGEDHGEKYWLV--KNSWGTSWGEA 310
             AI A + +FQLY+ G++ E  C  + L+HGV  VGYG   G        ++S   S   +
Sbjct:   244 AIDASHQSFQLYTSGIYYEPECSSENLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSASSS 303

Query:   311 GYIRMARNSPSSNIGICGILMQASYPVKRC 340
                 + +NS  ++ GI G ++ +      C
Sbjct:   304 NEYWIVKNSWGTSWGIEGYILMSRNRDNNC 333

 Score = 39 (18.8 bits), Expect = 2.3e-06, Sum P(2) = 2.3e-06
 Identities = 8/25 (32%), Positives = 14/25 (56%)

Query:    77 DEWQRRFGIYSSNVQYIDYINSQNL 101
             D   + F +Y+S + Y    +S+NL
Sbjct:   246 DASHQSFQLYTSGIYYEPECSSENL 270


>TAIR|locus:2175088
            symbol:ALP "aleurain-like protease" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0005829 "cytosol"
            evidence=RCA] [GO:0006096 "glycolysis" evidence=RCA] [GO:0006816
            "calcium ion transport" evidence=RCA] [GO:0006833 "water transport"
            evidence=RCA] [GO:0006972 "hyperosmotic response" evidence=RCA]
            [GO:0007030 "Golgi organization" evidence=RCA] [GO:0009266
            "response to temperature stimulus" evidence=RCA] [GO:0009651
            "response to salt stress" evidence=RCA] [GO:0009750 "response to
            fructose stimulus" evidence=RCA] [GO:0042744 "hydrogen peroxide
            catabolic process" evidence=RCA] [GO:0046686 "response to cadmium
            ion" evidence=RCA] [GO:0007568 "aging" evidence=IEP]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002688 GO:GO:0005773
            GO:GO:0007568 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AB011483 KO:K01366
            ProtClustDB:CLSN2689015 UniGene:At.25414 IPI:IPI00846287
            RefSeq:NP_001078774.1 ProteinModelPortal:A8MQZ1 SMR:A8MQZ1
            STRING:A8MQZ1 PRIDE:A8MQZ1 EnsemblPlants:AT5G60360.3 GeneID:836158
            KEGG:ath:AT5G60360 OMA:CGSTPMD Genevestigator:A8MQZ1 Uniprot:A8MQZ1
        Length = 361

 Score = 382 (139.5 bits), Expect = 6.1e-52, Sum P(2) = 6.1e-52
 Identities = 79/181 (43%), Positives = 108/181 (59%)

Query:    62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
             F  +  +Y ++Y + +E + RF I+  N+  I   N + LS+KL  N+FADL+ +EF  T
Sbjct:    59 FARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRT 118

Query:   122 YLGYNKPYNEPRWPS--VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGIN 179
              LG  +  +     S  V    LP + DWR++G V+PVKDQG CGSCW FS   A+E   
Sbjct:   119 KLGAAQNCSATLKGSHKVTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAY 178

Query:   180 KLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRC 239
                 GK +SLSEQ+LVDC     N GCNGG   +AFE+I   GG+ TE  YPY GK++ C
Sbjct:   179 HQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKDETC 238

Query:   240 Q 240
             +
Sbjct:   239 K 239

 Score = 174 (66.3 bits), Expect = 6.1e-52, Sum P(2) = 6.1e-52
 Identities = 33/79 (41%), Positives = 45/79 (56%)

Query:   242 DKTKHHAVTITGYE-AIPARYAFQLYSHGVF-DEYCGH---QLNHGVTVVGYGEDHGEKY 296
             D+ KH    +     A    ++F+LY  GV+ D +CG     +NH V  VGYG + G  Y
Sbjct:   261 DELKHAVGLVRPVSIAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVEDGVPY 320

Query:   297 WLVKNSWGTSWGEAGYIRM 315
             WL+KNSWG  WG+ GY +M
Sbjct:   321 WLIKNSWGADWGDKGYFKM 339


>UNIPROTKB|F6R7P5
            symbol:CTSH "Uncharacterized protein" species:9544 "Macaca
            mulatta" [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=ISS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0004252
            "serine-type endopeptidase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005829 "cytosol" evidence=ISS] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0010815 "bradykinin catabolic
            process" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0016505 "apoptotic protease
            activator activity" evidence=ISS] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=ISS] [GO:0030335
            "positive regulation of cell migration" evidence=ISS] [GO:0031638
            "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0032526 "response to retinoic
            acid" evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            GO:GO:0043066 GO:GO:0005615 GO:GO:0008284 GO:GO:0070371
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458 CTD:1512
            OMA:STSCHKT GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 RefSeq:XP_001108862.1
            UniGene:Mmu.3000 Ensembl:ENSMMUT00000014095 GeneID:711437
            KEGG:mcc:711437 NextBio:19969972 Uniprot:F6R7P5
        Length = 335

 Score = 385 (140.6 bits), Expect = 9.9e-52, Sum P(2) = 9.9e-52
 Identities = 86/218 (39%), Positives = 129/218 (59%)

Query:    35 LWVLGIP---AGAWSEGYP----QKYDPQSMEE-RFENWLKQYSREYGSEDEWQRRFGIY 86
             +WV  +P   AGAW  G P     +    S+E+  F++W+ ++ + Y +E E+  R   +
Sbjct:     1 MWVT-LPLLCAGAWLLGAPVCGAAELSVNSLEKFHFKSWMSKHHKTYSTE-EYHHRMQTF 58

Query:    87 SSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGL---P 143
             +SN + I+  N+ N +FK+  N+F+D+S  E    YL +++P N     S    G    P
Sbjct:    59 ASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKHKYL-WSEPQNCSATKSNYLRGTGPYP 117

Query:   144 ASVDWRKEGA-VTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
              S+DWRK+G  V+PVK+QG CGSCW FS   A+E    + TGK++SL+EQ+LVDC  +  
Sbjct:   118 PSMDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFN 177

Query:   203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQ 240
             N GC GG   +AFE+I    G+  ED YPY+GK+  C+
Sbjct:   178 NHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGDCK 215

 Score = 169 (64.5 bits), Expect = 9.9e-52, Sum P(2) = 9.9e-52
 Identities = 32/79 (40%), Positives = 44/79 (55%)

Query:   263 FQLYSHGVFDEYCGHQ----LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
             F +Y  G++     H+    +NH V  VGYGE++G  YW+VKNSWG  WG  GY  + R 
Sbjct:   259 FMIYKTGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERG 318

Query:   319 SPSSNIGICGILMQASYPV 337
                    +CG+   ASYP+
Sbjct:   319 K-----NMCGLAACASYPI 332


>UNIPROTKB|F7BRD4
            symbol:CTSH "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001656 "metanephros development"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0001656
            GeneTree:ENSGT00660000095458 EMBL:ACFV01158341 EMBL:ACFV01158342
            EMBL:ACFV01158343 Ensembl:ENSCJAT00000004396 Uniprot:F7BRD4
        Length = 336

 Score = 382 (139.5 bits), Expect = 9.9e-52, Sum P(2) = 9.9e-52
 Identities = 76/183 (41%), Positives = 114/183 (62%)

Query:    62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
             F++W+ ++ + Y  E+E+ +R   ++SN + I+  N+ N +FK+  N+F+D+S  E    
Sbjct:    35 FKSWMAKHHKTYSREEEYHQRLQTFASNWRKINAHNNGNHTFKMAVNQFSDMSFAEIKRK 94

Query:   122 YLGYNKPYNEPRWPSVQYLGL---PASVDWRKEGA-VTPVKDQGQCGSCWAFSAVAAVEG 177
             YL +++P N     S    G    P SVDWRK+G  V+PVK+QG CGSCW FS   A+E 
Sbjct:    95 YL-WSEPQNCSATKSNYLRGTGPYPPSVDWRKKGHFVSPVKNQGACGSCWTFSTTGALES 153

Query:   178 INKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKND 237
                + TGK++SL+EQ+LVDC  +  N GC GG   +AFE+I    G+  ED YPY+GK+ 
Sbjct:   154 AIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNNGIMGEDTYPYQGKDS 213

Query:   238 RCQ 240
              C+
Sbjct:   214 DCK 216

 Score = 172 (65.6 bits), Expect = 9.9e-52, Sum P(2) = 9.9e-52
 Identities = 33/79 (41%), Positives = 44/79 (55%)

Query:   263 FQLYSHGVFDEYCGHQ----LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
             F +Y  G++     H+    +NH V  VGYGE++G  YW+VKNSWG  WG  GY  + R 
Sbjct:   260 FMMYKRGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERG 319

Query:   319 SPSSNIGICGILMQASYPV 337
                    +CG+   ASYPV
Sbjct:   320 K-----NMCGLAACASYPV 333


>UNIPROTKB|G1SQF0
            symbol:CTSH "Uncharacterized protein" species:9986
            "Oryctolagus cuniculus" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 RefSeq:XP_002721635.1 UniGene:Ocu.7137
            Ensembl:ENSOCUT00000006138 GeneID:100101597 Uniprot:G1SQF0
        Length = 333

 Score = 385 (140.6 bits), Expect = 1.6e-51, Sum P(2) = 1.6e-51
 Identities = 84/211 (39%), Positives = 122/211 (57%)

Query:    42 AGAWSEGYP--QKYDPQSMEE-RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS 98
             AGAW  G P    +   ++E+  F++W+ Q+ ++Y +E E+ RR   +  N + I+  N+
Sbjct:    10 AGAWLLGAPGADAFSANNLEKFHFKSWMSQHHKKYSAE-EYPRRLQTFVRNWRKINAHNN 68

Query:    99 QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGL---PASVDWRKEGA-V 154
              N +F++  N+F+D+S  E    YL + +P N     S    G    P+SVDWRK+G  V
Sbjct:    69 GNHTFQMGLNQFSDMSFAEIKHKYL-WTEPQNCSATKSNYLRGTGPYPSSVDWRKKGNFV 127

Query:   155 TPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKA 214
             +PVK+QG CGSCW FS   A+E    +  GK++SL+EQ+LVDC  N  N GC GG   +A
Sbjct:   128 SPVKNQGACGSCWTFSTTGALESAVAIAGGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQA 187

Query:   215 FEFITKIGGVTTEDDYPYRGKNDRCQTDKTK 245
             FE+I    G+  ED YPYR    RC+    K
Sbjct:   188 FEYILYNKGIMGEDSYPYRAMEGRCKFQPQK 218

 Score = 167 (63.8 bits), Expect = 1.6e-51, Sum P(2) = 1.6e-51
 Identities = 32/79 (40%), Positives = 44/79 (55%)

Query:   263 FQLYSHGVFDEYCGHQ----LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
             F  Y  G++     H+    +NH V  VGYGE++G  YW+VKNSWG+ WG  GY  + R 
Sbjct:   257 FMQYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGVPYWIVKNSWGSHWGMNGYFYIERG 316

Query:   319 SPSSNIGICGILMQASYPV 337
                    +CG+   ASYP+
Sbjct:   317 K-----NMCGLAACASYPI 330


>UNIPROTKB|G1RBY1
            symbol:CTSH "Uncharacterized protein" species:61853
            "Nomascus leucogenys" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 EMBL:ADFV01087552 RefSeq:XP_003275518.1
            Ensembl:ENSNLET00000011249 GeneID:100584322 Uniprot:G1RBY1
        Length = 335

 Score = 384 (140.2 bits), Expect = 2.0e-51, Sum P(2) = 2.0e-51
 Identities = 83/208 (39%), Positives = 125/208 (60%)

Query:    42 AGAWSEGYP----QKYDPQSMEE-RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYI 96
             AGAW  G P     +    S+E+  F++W+ ++ + Y +E E+  R  +++SN + I+  
Sbjct:    10 AGAWLLGAPVCGAAELSVNSLEKFHFKSWMSKHHKTYSTE-EYHHRLQMFASNWRKINAH 68

Query:    97 NSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGL---PASVDWRKEGA 153
             N+ N +FK+  N+F+D+S  E    YL +++P N     S    G    P S+DWRK+G 
Sbjct:    69 NNGNHTFKMALNQFSDMSFAEIKHKYL-WSEPQNCSATKSNYLRGTGPYPPSMDWRKKGN 127

Query:   154 -VTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYME 212
              V+PVK+QG CGSCW FS   A+E    + TGK++SL+EQ+LVDC  +  N GC GG   
Sbjct:   128 FVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPS 187

Query:   213 KAFEFITKIGGVTTEDDYPYRGKNDRCQ 240
             +AFE+I    G+  ED YPY+GK+  C+
Sbjct:   188 QAFEYILYNKGIMGEDTYPYQGKDGYCK 215

 Score = 167 (63.8 bits), Expect = 2.0e-51, Sum P(2) = 2.0e-51
 Identities = 32/79 (40%), Positives = 43/79 (54%)

Query:   263 FQLYSHGVFDEYCGHQ----LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
             F +Y  G++     H+    +NH V  VGYGE +G  YW+VKNSWG  WG  GY  + R 
Sbjct:   259 FMMYRRGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERG 318

Query:   319 SPSSNIGICGILMQASYPV 337
                    +CG+   ASYP+
Sbjct:   319 K-----NMCGLAACASYPI 332


>TAIR|locus:2167821
            symbol:RD21B "responsive to dehydration 21B" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS] [GO:0005773
            "vacuole" evidence=IDA] [GO:0009651 "response to salt stress"
            evidence=IEP] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005829 "cytosol" evidence=IDA] [GO:0052541 "plant-type cell
            wall cellulose metabolic process" evidence=RCA] [GO:0052546 "cell
            wall pectin metabolic process" evidence=RCA] [GO:0005783
            "endoplasmic reticulum" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005783 GO:GO:0005829 EMBL:CP002688
            GO:GO:0005773 GO:GO:0009651 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AB008267 HSSP:O65039
            HOGENOM:HOG000230773 KO:K01376 InterPro:IPR000118 Pfam:PF00396
            SMART:SM00277 ProtClustDB:CLSN2688498 EMBL:AY062608 EMBL:AY114661
            IPI:IPI00520971 RefSeq:NP_568620.1 UniGene:At.24130 SMR:Q9FMH8
            IntAct:Q9FMH8 STRING:Q9FMH8 MEROPS:C01.A12
            EnsemblPlants:AT5G43060.1 GeneID:834321 KEGG:ath:AT5G43060
            TAIR:At5g43060 InParanoid:Q9FMH8 OMA:ENSEASL Genevestigator:Q9FMH8
            Uniprot:Q9FMH8
        Length = 463

 Score = 532 (192.3 bits), Expect = 3.1e-51, P = 3.1e-51
 Identities = 135/306 (44%), Positives = 178/306 (58%)

Query:    58 MEERFENWLKQYSREYGSED----EWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
             +E  +E W+ ++ ++  +++    E  +RF I+  N+++ID  N++NLS+KL   +FADL
Sbjct:    46 VERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTKNLSYKLGLTRFADL 105

Query:   114 SNEEFISTYLGYNKPYNEPRWPSVQY---LG--LPASVDWRKEGAVTPVKDQGQCGSCWA 168
             +NEE+ S YLG  KP       S +Y   +G  LP SVDWRKEGAV  VKDQG CGSCWA
Sbjct:   106 TNEEYRSMYLGA-KPTKRVLKTSDRYQARVGDALPDSVDWRKEGAVADVKDQGSCGSCWA 164

Query:   169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
             FS + AVEGINK+ TG L+SLSEQELVDCD  S NQGCNGG M+ AFEFI K GG+ TE 
Sbjct:   165 FSTIGAVEGINKIVTGDLISLSEQELVDCDT-SYNQGCNGGLMDYAFEFIIKNGGIDTEA 223

Query:   229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQL---YSH---GVFDEYCGH--QL- 279
             DYPY+  + RC  ++     VTI  YE +P      L    +H    V  E  G   QL 
Sbjct:   224 DYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLY 283

Query:   280 NHGVT--VVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA---S 334
             + GV   + G   DHG    +V   +GT  G+  +I   RNS  +  G  G +  A    
Sbjct:   284 SSGVFDGLCGTELDHG----VVAVGYGTENGKDYWI--VRNSWGNRWGESGYIKMARNIE 337

Query:   335 YPVKRC 340
              P  +C
Sbjct:   338 APTGKC 343

 Score = 290 (107.1 bits), Expect = 2.8e-25, P = 2.8e-25
 Identities = 57/116 (49%), Positives = 71/116 (61%)

Query:   224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQLYSHGVFDEYCGHQLNHGV 283
             V T D Y    +N      K   H       EA     AFQLYS GVFD  CG +L+HGV
Sbjct:   243 VVTIDSYEDVPENSEASLKKALAHQPISVAIEA--GGRAFQLYSSGVFDGLCGTELDHGV 300

Query:   284 TVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
               VGYG ++G+ YW+V+NSWG  WGE+GYI+MARN  +   G CGI M+ASYP+K+
Sbjct:   301 VAVGYGTENGKDYWIVRNSWGNRWGESGYIKMARNIEAPT-GKCGIAMEASYPIKK 355


>TAIR|locus:2120222
            symbol:RD19 "RESPONSIVE TO DEHYDRATION 19" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009269 "response to desiccation" evidence=IEP] [GO:0006970
            "response to osmotic stress" evidence=IGI] [GO:0005634 "nucleus"
            evidence=IDA] [GO:0005773 "vacuole" evidence=IDA] [GO:0042742
            "defense response to bacterium" evidence=IMP] [GO:0006096
            "glycolysis" evidence=RCA] [GO:0006833 "water transport"
            evidence=RCA] [GO:0006972 "hyperosmotic response" evidence=RCA]
            [GO:0007030 "Golgi organization" evidence=RCA] [GO:0009266
            "response to temperature stimulus" evidence=RCA] [GO:0009651
            "response to salt stress" evidence=IEP;RCA] [GO:0046686 "response
            to cadmium ion" evidence=RCA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISS] [GO:0009414 "response to
            water deprivation" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005634 GO:GO:0005773 EMBL:CP002687
            GenomeReviews:CT486007_GR GO:GO:0009651 GO:GO:0042742
            eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            ProtClustDB:CLSN2688311 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AL035679 EMBL:AL161594 GO:GO:0004197
            MEROPS:C01.022 EMBL:D13042 EMBL:AY080598 EMBL:AY133844
            IPI:IPI00544363 PIR:JN0718 RefSeq:NP_568052.1 UniGene:At.2850
            UniGene:At.74924 ProteinModelPortal:P43296 SMR:P43296 STRING:P43296
            PaxDb:P43296 PRIDE:P43296 EnsemblPlants:AT4G39090.1 GeneID:830064
            KEGG:ath:AT4G39090 TAIR:At4g39090 InParanoid:P43296 OMA:EDFDWRD
            PhylomeDB:P43296 Genevestigator:P43296 GermOnline:AT4G39090
            Uniprot:P43296
        Length = 368

 Score = 406 (148.0 bits), Expect = 4.2e-51, Sum P(2) = 4.2e-51
 Identities = 90/220 (40%), Positives = 128/220 (58%)

Query:    54 DPQSM--EERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDN--K 109
             +PQ +  E+ F  + +++ + Y S +E   RF ++ +N++       Q L    T    +
Sbjct:    41 EPQVLTSEDHFSLFKRKFGKVYASNEEHDYRFSVFKANLRRAR--RHQKLDPSATHGVTQ 98

Query:   110 FADLSNEEFISTYLGYNKPYNEPR----WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGS 165
             F+DL+  EF   +LG    +  P+     P +    LP   DWR  GAVTPVK+QG CGS
Sbjct:    99 FSDLTRSEFRKKHLGVRSGFKLPKDANKAPILPTENLPEDFDWRDHGAVTPVKNQGSCGS 158

Query:   166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD-------VNSENQGCNGGYMEKAFEFI 218
             CW+FSA  A+EG N L TGKLVSLSEQ+LVDCD        +S + GCNGG M  AFE+ 
Sbjct:   159 CWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYT 218

Query:   219 TKIGGVTTEDDYPYRGKNDR-CQTDKTKHHAVTITGYEAI 257
              K GG+  E+DYPY GK+ + C+ DK+K  A +++ +  I
Sbjct:   219 LKTGGLMKEEDYPYTGKDGKTCKLDKSKIVA-SVSNFSVI 257

 Score = 142 (55.0 bits), Expect = 4.2e-51, Sum P(2) = 4.2e-51
 Identities = 35/82 (42%), Positives = 45/82 (54%)

Query:   256 AIPARYAFQLYSHGVFDEY-CGHQLNHGVTVVGYGED------HGEK-YWLVKNSWGTSW 307
             AI A Y  Q Y  GV   Y C  +LNHGV +VGYG          EK YW++KNSWG +W
Sbjct:   277 AINAGY-MQTYIGGVSCPYICTRRLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGETW 335

Query:   308 GEAGYIRMARNSPSSNIGICGI 329
             GE G+ ++ +        ICG+
Sbjct:   336 GENGFYKICKGR-----NICGV 352


>UNIPROTKB|P09668
            symbol:CTSH "Pro-cathepsin H" species:9606 "Homo sapiens"
            [GO:0001520 "outer dense fiber" evidence=IEA] [GO:0001669
            "acrosomal vesicle" evidence=IEA] [GO:0007283 "spermatogenesis"
            evidence=IEA] [GO:0030984 "kininogen binding" evidence=IEA]
            [GO:0032403 "protein complex binding" evidence=IEA] [GO:0043621
            "protein self-association" evidence=IEA] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0031648 "protein destabilization"
            evidence=IMP] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=ISS] [GO:0032526 "response to retinoic acid"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0030108 "HLA-A
            specific activating MHC class I receptor activity" evidence=IDA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEP] [GO:0010813 "neuropeptide catabolic process"
            evidence=IDA] [GO:0010815 "bradykinin catabolic process"
            evidence=IDA] [GO:0030335 "positive regulation of cell migration"
            evidence=IDA] [GO:0070371 "ERK1 and ERK2 cascade" evidence=IDA]
            [GO:0010628 "positive regulation of gene expression" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IDA;TAS] [GO:0031638 "zymogen
            activation" evidence=IDA] [GO:0016505 "apoptotic protease activator
            activity" evidence=IDA] [GO:0010952 "positive regulation of
            peptidase activity" evidence=IDA] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=ISS] [GO:0004252 "serine-type endopeptidase
            activity" evidence=ISS] [GO:0043066 "negative regulation of
            apoptotic process" evidence=ISS] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0033619 "membrane protein proteolysis"
            evidence=IDA] [GO:0004175 "endopeptidase activity" evidence=IDA]
            [GO:0004177 "aminopeptidase activity" evidence=IDA] [GO:0005764
            "lysosome" evidence=IDA] [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0002250 "adaptive immune response" evidence=IEP]
            [GO:0019882 "antigen processing and presentation" evidence=TAS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0070324 "thyroid hormone binding" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0008284
            "positive regulation of cell proliferation" evidence=ISS]
            [GO:0045766 "positive regulation of angiogenesis" evidence=ISS]
            [GO:0008233 "peptidase activity" evidence=IDA] [GO:0097208
            "alveolar lamellar body" evidence=IDA] [GO:0043129 "surfactant
            homeostasis" evidence=IDA] [GO:0004197 "cysteine-type endopeptidase
            activity" evidence=IDA;TAS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 Reactome:REACT_6900 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913 MEROPS:C01.040 CTD:1512
            OMA:STSCHKT OrthoDB:EOG4W9J43 GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 EMBL:X16832 EMBL:AF426247 EMBL:AK314698 EMBL:AC011944
            EMBL:BC002479 EMBL:X07549 IPI:IPI00297487 PIR:S12486
            RefSeq:NP_004381.2 UniGene:Hs.148641 PDB:1BZN PDBsum:1BZN
            ProteinModelPortal:P09668 SMR:P09668 IntAct:P09668 STRING:P09668
            PhosphoSite:P09668 DMDM:288558851 PaxDb:P09668 PRIDE:P09668
            DNASU:1512 Ensembl:ENST00000220166 GeneID:1512 KEGG:hsa:1512
            UCSC:uc021srk.1 GeneCards:GC15M079213 H-InvDB:HIX0012481
            HGNC:HGNC:2535 HPA:CAB000458 HPA:HPA003524 MIM:116820
            neXtProt:NX_P09668 PharmGKB:PA27033 InParanoid:P09668
            PhylomeDB:P09668 BRENDA:3.4.22.16 ChEMBL:CHEMBL2225 GenomeRNAi:1512
            NextBio:6261 ArrayExpress:P09668 Bgee:P09668 CleanEx:HS_CTSH
            Genevestigator:P09668 GermOnline:ENSG00000103811 GO:GO:0019882
            Uniprot:P09668
        Length = 335

 Score = 383 (139.9 bits), Expect = 4.2e-51, Sum P(2) = 4.2e-51
 Identities = 84/208 (40%), Positives = 124/208 (59%)

Query:    42 AGAWSEGYP----QKYDPQSMEE-RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYI 96
             AGAW  G P     +    S+E+  F++W+ ++ + Y +E E+  R   ++SN + I+  
Sbjct:    10 AGAWLLGVPVCGAAELCVNSLEKFHFKSWMSKHRKTYSTE-EYHHRLQTFASNWRKINAH 68

Query:    97 NSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGL---PASVDWRKEGA 153
             N+ N +FK+  N+F+D+S  E    YL +++P N     S    G    P SVDWRK+G 
Sbjct:    69 NNGNHTFKMALNQFSDMSFAEIKHKYL-WSEPQNCSATKSNYLRGTGPYPPSVDWRKKGN 127

Query:   154 -VTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYME 212
              V+PVK+QG CGSCW FS   A+E    + TGK++SL+EQ+LVDC  +  N GC GG   
Sbjct:   128 FVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPS 187

Query:   213 KAFEFITKIGGVTTEDDYPYRGKNDRCQ 240
             +AFE+I    G+  ED YPY+GK+  C+
Sbjct:   188 QAFEYILYNKGIMGEDTYPYQGKDGYCK 215

 Score = 165 (63.1 bits), Expect = 4.2e-51, Sum P(2) = 4.2e-51
 Identities = 32/79 (40%), Positives = 43/79 (54%)

Query:   263 FQLYSHGVFDEYCGHQ----LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
             F +Y  G++     H+    +NH V  VGYGE +G  YW+VKNSWG  WG  GY  + R 
Sbjct:   259 FMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERG 318

Query:   319 SPSSNIGICGILMQASYPV 337
                    +CG+   ASYP+
Sbjct:   319 K-----NMCGLAACASYPI 332


>UNIPROTKB|G3R9A7
            symbol:CTSH "Uncharacterized protein" species:9595 "Gorilla
            gorilla gorilla" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 OMA:STSCHKT GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 RefSeq:XP_004056662.1 Ensembl:ENSGGOT00000012331
            GeneID:101144312 Uniprot:G3R9A7
        Length = 335

 Score = 383 (139.9 bits), Expect = 4.2e-51, Sum P(2) = 4.2e-51
 Identities = 84/208 (40%), Positives = 123/208 (59%)

Query:    42 AGAWSEGYP----QKYDPQSMEE-RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYI 96
             AGAW  G P     +    S+E+  F +W+ ++ + Y +E E+  R   ++SN + I+  
Sbjct:    10 AGAWLLGVPVCGAAELSVNSLEKFYFRSWMSKHRKTYSTE-EYHHRLQTFASNWRKINAH 68

Query:    97 NSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGL---PASVDWRKEGA 153
             N+ N +FK+  N+F+D+S  E    YL +++P N     S    G    P SVDWRK+G 
Sbjct:    69 NNGNHTFKMALNQFSDMSFAEIKHKYL-WSEPQNCSATKSNYLRGTGPYPPSVDWRKKGN 127

Query:   154 -VTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYME 212
              V+PVK+QG CGSCW FS   A+E    + TGK++SL+EQ+LVDC  +  N GC GG   
Sbjct:   128 FVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPS 187

Query:   213 KAFEFITKIGGVTTEDDYPYRGKNDRCQ 240
             +AFE+I    G+  ED YPY+GK+  C+
Sbjct:   188 QAFEYILYNKGIMGEDTYPYQGKDGYCK 215

 Score = 165 (63.1 bits), Expect = 4.2e-51, Sum P(2) = 4.2e-51
 Identities = 32/79 (40%), Positives = 43/79 (54%)

Query:   263 FQLYSHGVFDEYCGHQ----LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
             F +Y  G++     H+    +NH V  VGYGE +G  YW+VKNSWG  WG  GY  + R 
Sbjct:   259 FMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPKWGMNGYFLIERG 318

Query:   319 SPSSNIGICGILMQASYPV 337
                    +CG+   ASYP+
Sbjct:   319 K-----NMCGLAACASYPI 332


>UNIPROTKB|P09648
            symbol:CTSL1 "Cathepsin L1" species:9031 "Gallus gallus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            OrthoDB:EOG48PMKF MEROPS:C01.032 IPI:IPI00602255 PIR:S00081
            UniGene:Gga.523 ProteinModelPortal:P09648 SMR:P09648 Uniprot:P09648
        Length = 218

 Score = 345 (126.5 bits), Expect = 4.2e-51, Sum P(2) = 4.2e-51
 Identities = 64/116 (55%), Positives = 81/116 (69%)

Query:   143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
             P SVDWR++G VTPVKDQGQCGSCWAFS   A+EG +    GKLVSLSEQ LVDC     
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRPEG 61

Query:   203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             NQGCNGG M++AF+++   GG+ +E+ YPY  K+D     K +++A   TG+  IP
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIP 117

 Score = 203 (76.5 bits), Expect = 4.2e-51, Sum P(2) = 4.2e-51
 Identities = 42/85 (49%), Positives = 56/85 (65%)

Query:   256 AIPARYA-FQLYSHGVFDEY-CGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGY 312
             AI A ++ FQ Y  G++ E  C  + L+HGV VVGYG + G+KYW+VKNSWG  WG+ GY
Sbjct:   137 AIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGGKKYWIVKNSWGEKWGDKGY 196

Query:   313 IRMARNSPSSNIGICGILMQASYPV 337
             I MA++  +     CGI   ASYP+
Sbjct:   197 IYMAKDRKNH----CGIATAASYPL 217


>WB|WBGene00019986
            symbol:R09F10.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            EMBL:FO081137 HSSP:P53634 PIR:D89588 RefSeq:NP_509408.1
            ProteinModelPortal:Q23030 SMR:Q23030 STRING:Q23030 MEROPS:C01.A44
            PaxDb:Q23030 EnsemblMetazoa:R09F10.1 GeneID:181087
            KEGG:cel:CELE_R09F10.1 UCSC:R09F10.1 CTD:181087 WormBase:R09F10.1
            InParanoid:Q23030 OMA:EYPYSAL NextBio:912346 Uniprot:Q23030
        Length = 383

 Score = 389 (142.0 bits), Expect = 8.7e-51, Sum P(2) = 8.7e-51
 Identities = 88/229 (38%), Positives = 137/229 (59%)

Query:    20 DMRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEW 79
             D+  +L      L LL +L + +    +    K +    E+ F +++ ++ R+Y S +E+
Sbjct:    40 DLSYVLTQLFSGLVLLTMLILLSFFVFQRLNHKMENLKHEQMFNDFILKFDRKYTSVEEF 99

Query:    80 QRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNK----PYNEPRWP 135
             + R+ I+  NV   +    +NL   L  N+F D ++EE +   +  NK     ++ P++ 
Sbjct:   100 EYRYQIFLRNVIEFEAEEERNLGLDLDVNEFTDWTDEE-LQKMVQENKYTKYDFDTPKFE 158

Query:   136 SVQYL--GL--PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSE 191
                YL  G+  PAS+DWR++G +TP+K+QGQCGSCWAF+ VA+VE  N +K GKLVSLSE
Sbjct:   159 G-SYLETGVIRPASIDWREQGKLTPIKNQGQCGSCWAFATVASVEAQNAIKKGKLVSLSE 217

Query:   192 QELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG-KNDRC 239
             QE+VDCD    N GC+GGY   A +F+ K  G+ +E +YPY   K+D+C
Sbjct:   218 QEMVDCD--GRNNGCSGGYRPYAMKFV-KENGLESEKEYPYSALKHDQC 263

 Score = 156 (60.0 bits), Expect = 8.7e-51, Sum P(2) = 8.7e-51
 Identities = 33/90 (36%), Positives = 48/90 (53%)

Query:   253 GYEAIPARYAFQLYSHGVFD---EYCGHQLN--HGVTVVGYGEDHGEKYWLVKNSWGTSW 307
             G   + A Y+   Y  G+F+   E C  +    H +T++GYG +    YW+VKNSWGTSW
Sbjct:   300 GMNVVKAMYS---YRSGIFNPSVEDCTEKSMGAHALTIIGYGGEGESAYWIVKNSWGTSW 356

Query:   308 GEAGYIRMARNSPSSNIGICGILMQASYPV 337
             G +GY R+AR      +  CG+      P+
Sbjct:   357 GASGYFRLARG-----VNSCGLANTVVAPI 381


>MGI|MGI:1349426
            symbol:Ctsj "cathepsin J" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=ISO]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0048471 "perinuclear region
            of cytoplasm" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:1349426 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF136272
            EMBL:AF158182 EMBL:AY034579 EMBL:AK005526 EMBL:AK131661
            EMBL:BC103769 IPI:IPI00126770 RefSeq:NP_036137.1 UniGene:Mm.31948
            ProteinModelPortal:Q9R014 SMR:Q9R014 MEROPS:C01.038 PRIDE:Q9R014
            Ensembl:ENSMUST00000071526 GeneID:26898 KEGG:mmu:26898
            UCSC:uc007qwa.1 CTD:26898 InParanoid:Q9R014 KO:K09599
            NextBio:304745 Bgee:Q9R014 CleanEx:MM_CTSJ Genevestigator:Q9R014
            GermOnline:ENSMUSG00000055298 Uniprot:Q9R014
        Length = 334

 Score = 378 (138.1 bits), Expect = 8.7e-51, Sum P(2) = 8.7e-51
 Identities = 83/233 (35%), Positives = 131/233 (56%)

Query:    32 LFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQ 91
             L L+   G+ +GA      Q +DP+ ++  +++W  +Y++ Y  ++E  RR  ++  N++
Sbjct:     6 LLLILCFGVASGA------QAHDPK-LDAEWKDWKTKYAKSYSPKEEALRR-AVWEENMR 57

Query:    92 YIDYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPSVQYLGLPAS 145
              I   N +N     +F +  NKF D ++EEF  +      P    +P   +   +GLP  
Sbjct:    58 MIKLHNKENSLGKNNFTMKMNKFGDQTSEEFRKSIDNIPIPAAMTDPHAQNHVSIGLPDY 117

Query:   146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQG 205
              DWR+EG VTPV++QG+CGSCWAF+A  A+EG    KTG L  LS Q L+DC     N+G
Sbjct:   118 KDWREEGYVTPVRNQGKCGSCWAFAAAGAIEGQMFWKTGNLTPLSVQNLLDCSKTVGNKG 177

Query:   206 CNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             C  G   +AFE++ K  G+  E  YPY GK+  C+  ++++ +  IT Y  +P
Sbjct:   178 CQSGTAHQAFEYVLKNKGLEAEATYPYEGKDGPCRY-RSENASANITDYVNLP 229

 Score = 167 (63.8 bits), Expect = 8.7e-51, Sum P(2) = 8.7e-51
 Identities = 37/88 (42%), Positives = 53/88 (60%)

Query:   256 AIPARY-AFQLYSHGVFDE-YCG-HQLNHGVTVVGYGED----HGEKYWLVKNSWGTSWG 308
             AI A + +F+ Y+ G++ E  C  + +NH V VVGYG +     G  YWL+KNSWG  WG
Sbjct:   248 AIDASHDSFRFYNGGIYYEPNCSSYFVNHAVLVVGYGSEGDVKDGNNYWLIKNSWGEEWG 307

Query:   309 EAGYIRMARNSPSSNIGICGILMQASYP 336
               GY+++A++  +     CGI   ASYP
Sbjct:   308 MNGYMQIAKDHNNH----CGIASLASYP 331


>UNIPROTKB|D3ZZR3
            symbol:D3ZZR3 "Uncharacterized protein" species:10116
            "Rattus norvegicus" [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0016020 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0002250 GeneTree:ENSGT00560000076577 GO:GO:0097067
            OrthoDB:EOG4JM7Q2 IPI:IPI00210228 PRIDE:D3ZZR3
            Ensembl:ENSRNOT00000028732 Uniprot:D3ZZR3
        Length = 331

 Score = 337 (123.7 bits), Expect = 1.1e-50, Sum P(2) = 1.1e-50
 Identities = 83/233 (35%), Positives = 120/233 (51%)

Query:    37 VLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYI 96
             VLG P      G   +   + ++  ++ W K + +EY  ++E   R  I+  N+++I   
Sbjct:     3 VLGAPGVLCDNGATAE---RPLDHHWDLWKKTHEKEYKDQNEEDVRRLIWEKNLKFIMLH 59

Query:    97 NSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPRW---PSVQYLGLPASVDW- 148
             N ++     S+ +  N   D+  E  I        P         PS     LPA V W 
Sbjct:    60 NLEHSMGMHSYSVGMNHMGDMVAETIIGEMGSERLPRKRKALGLIPSSVNQNLPAGVKWK 119

Query:   149 -RKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE--NQG 205
              R +G    +  QG CGSCWAFSAV A+EG  KLKTGKLVSLS Q LVDC    +  N+G
Sbjct:   120 ERTKGCWKNLVFQGSCGSCWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKG 179

Query:   206 CNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             C GG+M +AF++I   GG+ +E  YPY+  +++C  D  K+ A T + Y  +P
Sbjct:   180 CGGGFMTEAFQYIIDNGGIDSEASYPYKAMDEKCHYDP-KNRAATCSRYIELP 231

 Score = 207 (77.9 bits), Expect = 1.1e-50, Sum P(2) = 1.1e-50
 Identities = 41/76 (53%), Positives = 50/76 (65%)

Query:   262 AFQLYSHGVFDE-YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
             +F LY  GV+D+  C   +NHGV VVGYG   G+ YWLVKNSWG  +G+ GYIRMARN+ 
Sbjct:   258 SFFLYQSGVYDDPSCTENVNHGVLVVGYGTLDGKDYWLVKNSWGLHFGDQGYIRMARNNK 317

Query:   321 SSNIGICGILMQASYP 336
             +     CGI    SYP
Sbjct:   318 NH----CGIASYCSYP 329


>ZFIN|ZDB-GENE-030131-3539
            symbol:ctsh "cathepsin H" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-030131-3539
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 KO:K01366 HOVERGEN:HBG011513
            CTD:1512 OrthoDB:EOG4W9J43 MEROPS:I29.003 HSSP:P43235 EMBL:BC067615
            IPI:IPI00506892 RefSeq:NP_997853.1 UniGene:Dr.14176
            ProteinModelPortal:Q6NWF2 SMR:Q6NWF2 PRIDE:Q6NWF2 GeneID:324818
            KEGG:dre:324818 InParanoid:Q6NWF2 NextBio:20808976 Bgee:Q6NWF2
            Uniprot:Q6NWF2
        Length = 330

 Score = 382 (139.5 bits), Expect = 1.8e-50, Sum P(2) = 1.8e-50
 Identities = 77/186 (41%), Positives = 113/186 (60%)

Query:    59 EERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEF 118
             E  F++W+ QY+++Y   +E+ +R  I+  N + ID  N  N  F +  N+F+D++  EF
Sbjct:    27 EYHFKSWMSQYNKKY-EINEFYQRLQIFLENKKRIDQHNEGNHKFSMGLNQFSDMTFAEF 85

Query:   119 ISTYLGYNKPYN--EPRWPSVQYLGL-PASVDWRKEGA-VTPVKDQGQCGSCWAFSAVAA 174
               TYL   +P N    R   V   GL P ++DWR +G  +T VK+QG CGSCW FS    
Sbjct:    86 KKTYL-LTEPQNCSATRGNHVSSNGLYPDAIDWRTKGHYITDVKNQGPCGSCWTFSTTGC 144

Query:   175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
             +E +  + TGKL+ L+EQ+L+DC  + +N GCNGG    AFE+I    G+ TEDDYPY+ 
Sbjct:   145 LESVTAIATGKLLQLAEQQLIDCAGDFDNHGCNGGLPSHAFEYIMYNKGLMTEDDYPYQA 204

Query:   235 KNDRCQ 240
             K  +C+
Sbjct:   205 KGGQCR 210

 Score = 160 (61.4 bits), Expect = 1.8e-50, Sum P(2) = 1.8e-50
 Identities = 31/79 (39%), Positives = 43/79 (54%)

Query:   263 FQLYSHGVFDEYCGHQ----LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
             F  Y  G++     H     +NH V  VGY E++G  YW+VKNSWGT+WG  GY  + R 
Sbjct:   254 FMHYKDGIYTSTECHNTTDMVNHAVLAVGYAEENGTPYWIVKNSWGTNWGIKGYFYIERG 313

Query:   319 SPSSNIGICGILMQASYPV 337
                    +CG+   +SYP+
Sbjct:   314 K-----NMCGLAACSSYPI 327

 Score = 41 (19.5 bits), Expect = 1.2e-09, Sum P(2) = 1.2e-09
 Identities = 9/42 (21%), Positives = 19/42 (45%)

Query:    44 AWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGI 85
             +W   Y +KY+     +R + +L+   R     +E   +F +
Sbjct:    32 SWMSQYNKKYEINEFYQRLQIFLENKKR-IDQHNEGNHKFSM 72

 Score = 40 (19.1 bits), Expect = 1.5e-09, Sum P(2) = 1.5e-09
 Identities = 17/57 (29%), Positives = 24/57 (42%)

Query:    73 YGSEDEWQRRFGIYSSNVQYIDYINS--QNLSFKLTDNKFADLSNEEFISTYLGYNK 127
             Y  EDE+  +  +   N +Y   IN   Q L   L + K  D  NE      +G N+
Sbjct:    22 YTEEDEYHFKSWMSQYNKKY--EINEFYQRLQIFLENKKRIDQHNEGNHKFSMGLNQ 76


>UNIPROTKB|G3SSC1
            symbol:CTSH "Uncharacterized protein" species:9785
            "Loxodonta africana" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 RefSeq:XP_003413898.1
            Ensembl:ENSLAFT00000003415 GeneID:100662496 Uniprot:G3SSC1
        Length = 335

 Score = 376 (137.4 bits), Expect = 1.8e-50, Sum P(2) = 1.8e-50
 Identities = 86/224 (38%), Positives = 134/224 (59%)

Query:    35 LWVLGIP---AGAWSEGYPQKYDPQSME----ERF--ENWLKQYSREYGSEDEWQRRFGI 85
             +W + +P   AGAW  G P+  D  ++     E+F  ++W+ Q+ ++Y SE E+ +R   
Sbjct:     1 MWAV-LPLLCAGAWFLG-PRTCDATALSVSSYEKFHFQSWMAQHQKKYSSE-EYHQRQQT 57

Query:    86 YSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYN--EPRWPSVQYLG-L 142
             + SN + I+  N++N +FK+  N+F+D++  E    YL +++P N    +   ++  G  
Sbjct:    58 FVSNWRKINAHNARNHTFKMALNQFSDMTFAEIKQKYL-WSEPQNCSATKGNYLRGTGPY 116

Query:   143 PASVDWRKEGA-VTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
             P  VDWRK+G  V+PVK+QG CGSCW FS   A+E    +  GKL+SL+EQ+LVDC  + 
Sbjct:   117 PPFVDWRKKGHFVSPVKNQGACGSCWTFSTTGALESAIAIAGGKLLSLAEQQLVDCAKDF 176

Query:   202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTK 245
              N GC GG   +AFE+I    G+  ED YPY+G++D C+    K
Sbjct:   177 NNHGCQGGLPSQAFEYILYNKGIMGEDTYPYKGQDDVCKFQPKK 220

 Score = 166 (63.5 bits), Expect = 1.8e-50, Sum P(2) = 1.8e-50
 Identities = 33/79 (41%), Positives = 43/79 (54%)

Query:   263 FQLYSHGVFDEYCGHQ----LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
             F  YS G++     H+    +NH V  VGYGE+ G  YW+VKNSWG  WG  GY  + R 
Sbjct:   259 FMKYSKGIYSSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPYWGMDGYFLIERG 318

Query:   319 SPSSNIGICGILMQASYPV 337
                    +CG+   ASYP+
Sbjct:   319 K-----NMCGLAACASYPI 332


>UNIPROTKB|G1M0X4
            symbol:CTSH "Uncharacterized protein" species:9646
            "Ailuropoda melanoleuca" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 EMBL:ACTA01057330 EMBL:ACTA01065330
            Ensembl:ENSAMET00000013529 Uniprot:G1M0X4
        Length = 337

 Score = 373 (136.4 bits), Expect = 1.8e-50, Sum P(2) = 1.8e-50
 Identities = 78/188 (41%), Positives = 117/188 (62%)

Query:    62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
             F++W+ Q+ ++Y SE E+Q R   +  N + I+  N+ N +FK+  N+F+D+S  E    
Sbjct:    37 FKSWMVQHQKKYSSE-EYQHRLRTFVGNWRKINAHNAGNHTFKMGLNQFSDMSFAEIKRK 95

Query:   122 YLGYNKPYN--EPRWPSVQYLG-LPASVDWRKEGA-VTPVKDQGQCGSCWAFSAVAAVEG 177
             YL +++P N    +   ++  G  P  VDWRK+G  V+PVK+QG CGSCW FS   A+E 
Sbjct:    96 YL-WSEPQNCSATKGNYLRGTGPYPPFVDWRKKGKFVSPVKNQGGCGSCWTFSTTGALES 154

Query:   178 INKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKND 237
                +KTGKL+SL+EQ+LVDC  +  N GC GG   +AFE+I    G+  ED YPY+G++ 
Sbjct:   155 AIAIKTGKLLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYIRYNRGIMGEDSYPYKGQDG 214

Query:   238 RCQTDKTK 245
              C+   +K
Sbjct:   215 DCKFQPSK 222

 Score = 169 (64.5 bits), Expect = 1.8e-50, Sum P(2) = 1.8e-50
 Identities = 33/79 (41%), Positives = 43/79 (54%)

Query:   263 FQLYSHGVFDEYCGHQ----LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
             F +Y  GV+     H+    +NH V  VGYGE +G  YW+VKNSWG  WG  GY  + R 
Sbjct:   261 FMMYRKGVYSSTSCHKTPDKVNHAVLAVGYGEQNGVPYWIVKNSWGPQWGMHGYFLIERG 320

Query:   319 SPSSNIGICGILMQASYPV 337
                    +CG+   ASYP+
Sbjct:   321 K-----NMCGLAACASYPI 334

 Score = 48 (22.0 bits), Expect = 1.9e-11, Sum P(2) = 1.9e-11
 Identities = 13/54 (24%), Positives = 23/54 (42%)

Query:    26 RNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFE----NWLKQYSREYGS 75
             R + LS F L+   +   +W   + +KY  +  + R      NW K  +   G+
Sbjct:    21 RASFLSFFFLFTEKVHFKSWMVQHQKKYSSEEYQHRLRTFVGNWRKINAHNAGN 74


>MGI|MGI:1922258 [details] [associations]
            symbol:4930486L24Rik "RIKEN cDNA 4930486L24 gene"
            species:10090 "Mus musculus" [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030054 "cell
            junction" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 MGI:MGI:1922258
            GO:GO:0005576 GO:GO:0030054 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 HSSP:P07711
            EMBL:AY146988 EMBL:AK145933 EMBL:BC061218 IPI:IPI00280732
            RefSeq:NP_835199.1 UniGene:Mm.19839 ProteinModelPortal:Q80UB0
            SMR:Q80UB0 MEROPS:C01.972 PRIDE:Q80UB0 Ensembl:ENSMUST00000091569
            GeneID:214639 KEGG:mmu:214639 UCSC:uc007qvs.1 InParanoid:Q80UB0
            OMA:RYHAENS OrthoDB:EOG4XWG0N NextBio:374408 Bgee:Q80UB0
            CleanEx:MM_4930486L24RIK Genevestigator:Q80UB0 Uniprot:Q80UB0
        Length = 333

 Score = 361 (132.1 bits), Expect = 2.3e-50, Sum P(2) = 2.3e-50
 Identities = 77/213 (36%), Positives = 120/213 (56%)

Query:    54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS----FKLTDNK 109
             DP S++ ++  W  ++ + Y   +E  RR  ++  N + I+  N + L     F +T N 
Sbjct:    22 DP-SLDVQWNEWRTKHGKAYNVNEERLRR-AVWEKNFKMIELHNWEYLEGKHDFTMTMNA 79

Query:   110 FADLSNEEFISTYLGYNKPYNEPR--WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
             F DL+N EF+    G+ +   +    +   Q+L +P  VDWR  G VTPVK+QG C S W
Sbjct:    80 FGDLTNTEFVKMMTGFRRQKIKRMHVFQDHQFLYVPKYVDWRMLGYVTPVKNQGYCASSW 139

Query:   168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
             AFSA  ++EG    KTG+LV LSEQ L+DC  ++    C+GG+M+ AF+++   GG+ TE
Sbjct:   140 AFSATGSLEGQMFKKTGRLVPLSEQNLLDCMGSNVTHDCSGGFMQNAFQYVKDNGGLATE 199

Query:   228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR 260
             + YPY G   +C+    ++ A  +  +  IP R
Sbjct:   200 ESYPYIGPGRKCRYH-AENSAANVRDFVQIPGR 231

 Score = 180 (68.4 bits), Expect = 2.3e-50, Sum P(2) = 2.3e-50
 Identities = 40/89 (44%), Positives = 52/89 (58%)

Query:   256 AIPARY-AFQLYSHGVFDE-YCGH-QLNHGVTVVGYG----EDHGEKYWLVKNSWGTSWG 308
             A+ A + +FQ Y  G++ E  C    LNH V VVGYG    E  G  YWLVKNSWG  WG
Sbjct:   248 AVDASHDSFQFYDSGIYYEPQCKRVHLNHAVLVVGYGFEGEESDGNSYWLVKNSWGEEWG 307

Query:   309 EAGYIRMARNSPSSNIGICGILMQASYPV 337
               GYI++A++  +     CGI   A+YP+
Sbjct:   308 MKGYIKIAKDWNNH----CGIATLATYPI 332


>RGD|69241 [details] [associations]
            symbol:Ctsj "cathepsin J" species:10116 "Rattus norvegicus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0048471 "perinuclear region of cytoplasm"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:L14776
            RGD:69241 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246
            SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 HOVERGEN:HBG011513 MEROPS:C01.038 CTD:26898 KO:K09599
            EMBL:AF310623 EMBL:BC097263 IPI:IPI00205027 PIR:I58002
            RefSeq:NP_058817.1 UniGene:Rn.34875 ProteinModelPortal:Q63088
            SMR:Q63088 PRIDE:Q63088 GeneID:29174 KEGG:rno:29174 NextBio:608244
            Genevestigator:Q63088 Uniprot:Q63088
        Length = 334

 Score = 362 (132.5 bits), Expect = 6.0e-50, Sum P(2) = 6.0e-50
 Identities = 81/220 (36%), Positives = 122/220 (55%)

Query:    46 SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----L 101
             + G P + DP +++  +++W  +Y++ Y   +E  +R  ++  N++ I   N +N     
Sbjct:    15 ASGAPAR-DP-NLDAEWQDWKTKYAKSYSPVEEELKR-AVWEENLKMIQLHNKENGLGKN 71

Query:   102 SFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY---LGLPASVDWRKEGAVTPVK 158
              F +  N FAD + EEF  +      P      PS Q    +GLP   DWRKEG VTPV+
Sbjct:    72 GFTMEMNAFADTTGEEFRKSLSDILIPAAVTN-PSAQKQVSIGLPNFKDWRKEGYVTPVR 130

Query:   159 DQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFI 218
             +QG+CGSCWAF+AV A+EG    KTG L  LS Q L+DC  +  N GC  G   +AF ++
Sbjct:   131 NQGKCGSCWAFAAVGAIEGQMFSKTGNLTPLSVQNLLDCSKSEGNNGCRWGTAHQAFNYV 190

Query:   219 TKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
              K  G+  E  YPY GK+  C+   +++ +  ITG+  +P
Sbjct:   191 LKNKGLEAEATYPYEGKDGPCRYH-SENASANITGFVNLP 229

 Score = 175 (66.7 bits), Expect = 6.0e-50, Sum P(2) = 6.0e-50
 Identities = 39/88 (44%), Positives = 54/88 (61%)

Query:   256 AIPARY-AFQLYSHGVFDE-YCG-HQLNHGVTVVGYG----EDHGEKYWLVKNSWGTSWG 308
             AI A + +F+ YS GV+ E  C  + +NH V VVGYG    E  G  YWL+KNSWG  WG
Sbjct:   248 AIDASHDSFRFYSGGVYHEPNCSSYVVNHAVLVVGYGFEGNETDGNNYWLIKNSWGEEWG 307

Query:   309 EAGYIRMARNSPSSNIGICGILMQASYP 336
               G++++A++  +     CGI  QAS+P
Sbjct:   308 INGFMKIAKDRNNH----CGIASQASFP 331


>TAIR|locus:2078312 [details] [associations]
            symbol:AT3G45310 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005773 EMBL:CP002686
            GenomeReviews:BA000014_GR eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AL132953
            EMBL:AY091771 IPI:IPI00540369 PIR:T47471 RefSeq:NP_566880.1
            UniGene:At.25239 ProteinModelPortal:Q8RWQ9 SMR:Q8RWQ9
            MEROPS:C01.162 PaxDb:Q8RWQ9 PRIDE:Q8RWQ9 EnsemblPlants:AT3G45310.1
            GeneID:823669 KEGG:ath:AT3G45310 GeneFarm:5032 TAIR:At3g45310
            InParanoid:Q8RWQ9 KO:K01366 OMA:AFEVVHE PhylomeDB:Q8RWQ9
            ProtClustDB:CLSN2689015 Genevestigator:Q8RWQ9 Uniprot:Q8RWQ9
        Length = 358

 Score = 363 (132.8 bits), Expect = 1.3e-49, Sum P(2) = 1.3e-49
 Identities = 74/181 (40%), Positives = 108/181 (59%)

Query:    62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
             F  +  +Y ++Y S +E + RF ++  N+  I   N + LS+KL+ N+FADL+ +EF   
Sbjct:    59 FSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRY 118

Query:   122 YLGYNKPYNEPRWPS--VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGIN 179
              LG  +  +     S  +    +P + DWR++G V+PVK+QG CGSCW FS   A+E   
Sbjct:   119 KLGAAQNCSATLKGSHKITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAY 178

Query:   180 KLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRC 239
                 GK +SLSEQ+LVDC     N GC+GG   +AFE+I   GG+ TE+ YPY GK+  C
Sbjct:   179 HQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGGC 238

Query:   240 Q 240
             +
Sbjct:   239 K 239

 Score = 171 (65.3 bits), Expect = 1.3e-49, Sum P(2) = 1.3e-49
 Identities = 37/101 (36%), Positives = 50/101 (49%)

Query:   242 DKTKHHAVTITGYE-AIPARYAFQLYSHGVF-DEYCGH---QLNHGVTVVGYGEDHGEKY 296
             D+ KH    +     A    + F+ Y  GVF    CG+    +NH V  VGYG +    Y
Sbjct:   261 DELKHAVGLVRPVSVAFEVVHEFRFYKKGVFTSNTCGNTPMDVNHAVLAVGYGVEDDVPY 320

Query:   297 WLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
             WL+KNSWG  WG+ GY +M          +CG+   +SYPV
Sbjct:   321 WLIKNSWGGEWGDNGYFKMEMGK-----NMCGVATCSSYPV 356


>ZFIN|ZDB-GENE-050417-107 [details] [associations]
            symbol:zgc:110239 "zgc:110239" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-050417-107
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 MEROPS:I29.003 OrthoDB:EOG412M56 EMBL:BC092817
            IPI:IPI00503987 RefSeq:NP_001017633.1 UniGene:Dr.39081
            ProteinModelPortal:Q568K7 GeneID:550326 KEGG:dre:550326
            HOGENOM:HOG000007373 HOVERGEN:HBG105018 InParanoid:Q568K7
            NextBio:20879584 ArrayExpress:Q568K7 Uniprot:Q568K7
        Length = 546

 Score = 379 (138.5 bits), Expect = 2.0e-49, Sum P(2) = 2.0e-49
 Identities = 86/207 (41%), Positives = 122/207 (58%)

Query:    55 PQSMEER-FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
             P S   R F ++ ++++R+Y +E E + R   +  N++Y+  +N   LSF L+ N  AD 
Sbjct:   235 PVSHAHRMFGHYKEKFNRQYDNEMEHEEREHNFVHNIRYVHSMNRAGLSFSLSVNHLADR 294

Query:   114 SNEEFISTYLGYNKPYNEPR----WPS-VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
             S +E +S   G  + +   R    +PS ++ +  P SVDWR  GAVTPVKDQ  CGSCW+
Sbjct:   295 SQKE-LSMMRGCQRTHKVHRKAQPFPSEIRSIATPNSVDWRLYGAVTPVKDQAVCGSCWS 353

Query:   169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
             F+    +EG   LKTG+L SLS+Q LVDC     N GC+GG   +AFE+I K GG++T +
Sbjct:   354 FATTGTLEGALFLKTGQLTSLSQQMLVDCTWGFGNNGCDGGEEWRAFEWIMKHGGISTAE 413

Query:   229 DY-PYRGKNDRCQTDKTKHHAVTITGY 254
              Y  Y G N  C  DK+   A  +TGY
Sbjct:   414 SYGAYMGMNGLCHYDKSSMVA-QLTGY 439

 Score = 153 (58.9 bits), Expect = 2.0e-49, Sum P(2) = 2.0e-49
 Identities = 35/81 (43%), Positives = 45/81 (55%)

Query:   259 ARYAFQLYSHGVFDE-YCGHQLN---HGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIR 314
             A  +F  YS+GV+ E  C + +N   H V  VGYG  + E YWLVKNSW + WG  GYI 
Sbjct:   467 AHRSFAFYSNGVYYEPECKNGINDLDHAVLAVGYGIMNNESYWLVKNSWSSYWGNDGYIL 526

Query:   315 MARNSPSSNIGICGILMQASY 335
             M+     +N   CG+   A Y
Sbjct:   527 MSMKD--NN---CGVATDAIY 542


>UNIPROTKB|F1NZ37 [details] [associations]
            symbol:LOC420160 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 EMBL:AADN02062018
            IPI:IPI00587784 Ensembl:ENSGALT00000006765 OMA:CGVANQA
            Uniprot:F1NZ37
        Length = 340

 Score = 362 (132.5 bits), Expect = 2.0e-49, Sum P(2) = 2.0e-49
 Identities = 81/187 (43%), Positives = 110/187 (58%)

Query:    54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN---SQNL-SFKLTDNK 109
             DP  +EE +E W   Y++EY  E E  RR  ++ +N++ I+  N   SQ   +F+L  N 
Sbjct:    27 DPV-LEEAWERWKSLYAKEYPGEAELIRR-EVWENNLRRIEQHNWEESQGQHTFRLGMNH 84

Query:   110 FADLSNEEFISTYLGYNK-PYNEPR--WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
             + DL +EEF     G+    + EP   + +      PA VDWR  G VTPVK+QG CGSC
Sbjct:    85 YGDLMDEEFNQLLNGFAPVQHEEPALTFQASAAQKTPAEVDWRMRGYVTPVKNQGHCGSC 144

Query:   167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
             WAFSA  A+EG+    TGKL  LSEQ L+DC     N GC GGYM +AF+++   GG+ +
Sbjct:   145 WAFSATGALEGLVFNWTGKLAVLSEQNLIDCSWKLGNNGCQGGYMTRAFQYVHDNGGMNS 204

Query:   227 EDDYPYR 233
             E  YPY+
Sbjct:   205 EHIYPYQ 211

 Score = 170 (64.9 bits), Expect = 2.0e-49, Sum P(2) = 2.0e-49
 Identities = 32/82 (39%), Positives = 47/82 (57%)

Query:   261 YAFQLYSHGVFDE-YCGHQLNHGVTVVGYG-EDHGEK---YWLVKNSWGTSWGEAGYIRM 315
             + F  Y  G+F+  +C  ++NHG+  VGYG      K   YW++KNSW   WGE GYIR+
Sbjct:   262 FFFHFYKSGIFNSMFCSQKVNHGMLAVGYGISQEARKNVSYWILKNSWSEVWGEKGYIRL 321

Query:   316 ARNSPSSNIGICGILMQASYPV 337
              +   +     CG+  QAS+P+
Sbjct:   322 LKGVNNH----CGVANQASFPL 339


>UNIPROTKB|F6X9C1 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00660000095458
            OMA:STSCHKT Ensembl:ENSCAFT00000036196 EMBL:AAEX03002388
            Uniprot:F6X9C1
        Length = 305

 Score = 360 (131.8 bits), Expect = 2.6e-49, Sum P(2) = 2.6e-49
 Identities = 76/188 (40%), Positives = 116/188 (61%)

Query:    62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
             F++W  Q+ ++Y SE E+ +R   +  N + I+  N+ N +FK+  N+F+D++  E    
Sbjct:     5 FKSWAVQHQKKYSSE-EYLQRLQTFVGNWRKINAHNAGNHTFKMGLNQFSDMNFAEIKHK 63

Query:   122 YLGYNKPYN--EPRWPSVQYLG-LPASVDWRKEGA-VTPVKDQGQCGSCWAFSAVAAVEG 177
             YL +++P N    +   ++  G  P  VDWRK+G  V+PVK+QG CGSCW FS   A+E 
Sbjct:    64 YL-WSEPQNCSATKGNYLRGTGPYPPFVDWRKKGKFVSPVKNQGSCGSCWTFSTTGALES 122

Query:   178 INKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKND 237
                +K+GKL+SL+EQ+LVDC  N  N GC GG   +AFE+I    G+  ED YPY+G++ 
Sbjct:   123 AIAIKSGKLLSLAEQQLVDCAQNFNNHGCQGGAPLQAFEYIRYNKGIMGEDSYPYKGQDG 182

Query:   238 RCQTDKTK 245
              C+   +K
Sbjct:   183 DCKYQPSK 190

 Score = 171 (65.3 bits), Expect = 2.6e-49, Sum P(2) = 2.6e-49
 Identities = 33/79 (41%), Positives = 43/79 (54%)

Query:   263 FQLYSHGVFDEYCGHQ----LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
             F +Y  G++     H+    +NH V  VGYGE +G  YW+VKNSWG  WG  GY  M R 
Sbjct:   229 FMMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEQNGIPYWIVKNSWGPQWGMNGYFLMERG 288

Query:   319 SPSSNIGICGILMQASYPV 337
                    +CG+   ASYP+
Sbjct:   289 K-----NMCGLAACASYPI 302


>UNIPROTKB|Q24940 [details] [associations]
            symbol:Cat-1 "Cathepsin L-like proteinase" species:6192
            "Fasciola hepatica" [GO:0004175 "endopeptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005576 "extracellular region" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005576 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0004197 EMBL:L33771 PIR:S43991 PDB:2O6X
            PDBsum:2O6X ProteinModelPortal:Q24940 SMR:Q24940 MEROPS:C01.033
            EvolutionaryTrace:Q24940 Uniprot:Q24940
        Length = 326

 Score = 335 (123.0 bits), Expect = 3.3e-49, Sum P(2) = 3.3e-49
 Identities = 74/209 (35%), Positives = 113/209 (54%)

Query:    57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFAD 112
             S ++ +  W + Y++EY   D+  RR  I+  NV++I   N ++    +++ L  N+F D
Sbjct:    16 SNDDLWHQWKRMYNKEYNGADDQHRR-NIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTD 74

Query:   113 LSNEEFISTYLGYNKPYNEPRWPSVQY----LGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
             ++ EEF + YL      ++     V Y      +P  +DWR+ G VT VKDQG CGSCWA
Sbjct:    75 MTFEEFKAKYLTEMSRASDILSHGVPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCWA 134

Query:   169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
             FS    +EG         +S SEQ+LVDC     N GC+GG ME A++++ + G + TE 
Sbjct:   135 FSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQFG-LETES 193

Query:   229 DYPYRGKNDRCQTDKTKHHAVTITGYEAI 257
              YPY     +C+ +K    A  +TGY  +
Sbjct:   194 SYPYTAVEGQCRYNKQLGVA-KVTGYYTV 221

 Score = 195 (73.7 bits), Expect = 3.3e-49, Sum P(2) = 3.3e-49
 Identities = 39/84 (46%), Positives = 50/84 (59%)

Query:   256 AIPARYAFQLYSHGVFD-EYCGH-QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYI 313
             A+     F +Y  G++  + C   ++NH V  VGYG   G  YW+VKNSWGT WGE GYI
Sbjct:   242 AVDVESDFMMYRSGIYQSQTCSPLRVNHAVLAVGYGTQGGTDYWIVKNSWGTYWGERGYI 301

Query:   314 RMARNSPSSNIGICGILMQASYPV 337
             RMARN  +    +CGI   AS P+
Sbjct:   302 RMARNRGN----MCGIASLASLPM 321


>TAIR|locus:2152445 [details] [associations]
            symbol:SAG12 "senescence-associated gene 12" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
            [GO:0007568 "aging" evidence=IEP;TAS] [GO:0010150 "leaf senescence"
            evidence=IEP;TAS] [GO:0010282 "senescence-associated vacuole"
            evidence=IDA] [GO:0009817 "defense response to fungus, incompatible
            interaction" evidence=IEP] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:CP002688 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0010150 GO:GO:0009817 EMBL:AB016870
            HSSP:O65039 OMA:NDEQALM EMBL:AF370131 EMBL:AY040073 IPI:IPI00544181
            RefSeq:NP_568651.1 UniGene:At.75256 UniGene:At.7710
            ProteinModelPortal:Q9FJ47 SMR:Q9FJ47 IntAct:Q9FJ47 STRING:Q9FJ47
            MEROPS:C01.117 PRIDE:Q9FJ47 ProMEX:Q9FJ47 EnsemblPlants:AT5G45890.1
            GeneID:834629 KEGG:ath:AT5G45890 TAIR:At5g45890 InParanoid:Q9FJ47
            PhylomeDB:Q9FJ47 ProtClustDB:CLSN2917735 ArrayExpress:Q9FJ47
            Genevestigator:Q9FJ47 GO:GO:0010282 Uniprot:Q9FJ47
        Length = 346

 Score = 512 (185.3 bits), Expect = 4.1e-49, P = 4.1e-49
 Identities = 101/213 (47%), Positives = 141/213 (66%)

Query:    58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS--QNLSFKLTDNKFADLSN 115
             M++R   W+ ++ R Y    E   R+ ++ +NV+ I+++NS     +FKL  N+FADL+N
Sbjct:    34 MQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTN 93

Query:   116 EEFISTYLGY---------NKPYNEP-RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGS 165
             +EF S Y G+         ++    P R+ +V    LP SVDWRK+GAVTP+K+QG CG 
Sbjct:    94 DEFRSMYTGFKGVSALSSQSQTKMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGC 153

Query:   166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
             CWAFSAVAA+EG  ++K GKL+SLSEQ+LVDCD N  + GC GG M+ AFE I   GG+T
Sbjct:   154 CWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN--DFGCEGGLMDTAFEHIKATGGLT 211

Query:   226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             TE +YPY+G++  C + KT   A +ITGYE +P
Sbjct:   212 TESNYPYKGEDATCNSKKTNPKATSITGYEDVP 244

 Score = 245 (91.3 bits), Expect = 2.2e-20, P = 2.2e-20
 Identities = 50/118 (42%), Positives = 66/118 (55%)

Query:   220 KIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQLYSHGVFDEYCGHQL 279
             K   +T  +D P    ND     K   H     G E     + FQ YS GVF   C   L
Sbjct:   233 KATSITGYEDVPV---NDEQALMKAVAHQPVSVGIEG--GGFDFQFYSSGVFTGECTTYL 287

Query:   280 NHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
             +H VT +GYGE  +G KYW++KNSWGT WGE+GY+R+ ++      G+CG+ M+ASYP
Sbjct:   288 DHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGYMRIQKDVKDKQ-GLCGLAMKASYP 344


>TAIR|locus:2050145 [details] [associations]
            symbol:AT2G21430 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002685 GenomeReviews:CT485783_GR
            EMBL:AC006841 EMBL:X74359 IPI:IPI00519637 PIR:B84601
            RefSeq:NP_565512.1 UniGene:At.14069 ProteinModelPortal:P43295
            SMR:P43295 MEROPS:C01.A04 PRIDE:P43295 EnsemblPlants:AT2G21430.1
            GeneID:816682 KEGG:ath:AT2G21430 TAIR:At2g21430 eggNOG:COG4870
            HOGENOM:HOG000230774 InParanoid:P43295 KO:K01373 OMA:GSIEEHY
            PhylomeDB:P43295 ProtClustDB:CLSN2688311 Genevestigator:P43295
            GermOnline:AT2G21430 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 Uniprot:P43295
        Length = 361

 Score = 381 (139.2 bits), Expect = 4.2e-49, Sum P(2) = 4.2e-49
 Identities = 82/201 (40%), Positives = 115/201 (57%)

Query:    57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNE 116
             S E+ F  + K++ + YGS +E   RF ++ +N+         + S +    +F+DL+  
Sbjct:    43 SSEDHFTLFKKKFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQFSDLTRS 102

Query:   117 EFISTYLG----YNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
             EF   +LG    +  P +  + P +    LP   DWR  GAVTPVK+QG CGSCW+FS  
Sbjct:   103 EFRRKHLGVKGGFKLPKDANQAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTT 162

Query:   173 AAVEGINKLKTGKLVSLSEQELVDCDVNSE-------NQGCNGGYMEKAFEFITKIGGVT 225
              A+EG + L TGKLVSLSEQ+LVDCD   +       + GCNGG M  AFE+  K GG+ 
Sbjct:   163 GALEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLM 222

Query:   226 TEDDYPYRGKND-RCQTDKTK 245
              E DYPY G +   C+ D++K
Sbjct:   223 REKDYPYTGTDGGSCKLDRSK 243

 Score = 148 (57.2 bits), Expect = 4.2e-49, Sum P(2) = 4.2e-49
 Identities = 36/82 (43%), Positives = 45/82 (54%)

Query:   256 AIPARYAFQLYSHGVFDEY-CGHQLNHGVTVVGYGEDH------GEK-YWLVKNSWGTSW 307
             AI A Y  Q Y  GV   Y C  +LNHGV +VGYG          EK YW++KNSWG SW
Sbjct:   274 AINAAY-MQTYIGGVSCPYICSRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESW 332

Query:   308 GEAGYIRMARNSPSSNIGICGI 329
             GE G+ ++ +        ICG+
Sbjct:   333 GENGFYKICKGR-----NICGV 349


>RGD|708447 [details] [associations]
            symbol:Testin "testin gene" species:10116 "Rattus norvegicus"
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0030054 "cell junction" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 RGD:708447 GO:GO:0005576 GO:GO:0030054 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            MEROPS:C01.972 OMA:RYHAENS OrthoDB:EOG4XWG0N EMBL:U16858
            IPI:IPI00207173 PIR:I52525 PIR:PC1251 RefSeq:NP_775155.1
            UniGene:Rn.10029 ProteinModelPortal:P15242 SMR:P15242
            Ensembl:ENSRNOT00000024467 GeneID:286916 KEGG:rno:286916
            UCSC:RGD:708447 CTD:286916 InParanoid:P15242 NextBio:625036
            Genevestigator:P15242 GermOnline:ENSRNOG00000018028 Uniprot:P15242
        Length = 333

 Score = 355 (130.0 bits), Expect = 4.2e-49, Sum P(2) = 4.2e-49
 Identities = 74/211 (35%), Positives = 119/211 (56%)

Query:    54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS----FKLTDNK 109
             DP S++  +  W  ++ + Y   +E  +R  ++  N + I+  N + L     F +  N 
Sbjct:    22 DP-SLDVEWNEWRTKHGKTYNMNEERLKR-AVWEKNFKMIELHNWEYLEGRHDFTMAMNA 79

Query:   110 FADLSNEEFISTYLGYNKPYNEPR--WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
             F DL+N EF+    G+ +   +    +   Q+L +P  VDWR+ G VTPVK+QG C S W
Sbjct:    80 FGDLTNIEFVKMMTGFQRQKIKKTHIFQDHQFLYVPKRVDWRQLGYVTPVKNQGHCASSW 139

Query:   168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
             AFSA  ++EG    KT +L+ LSEQ L+DC  ++   GC+GG+M+ AF+++   GG+ TE
Sbjct:   140 AFSATGSLEGQMFRKTERLIPLSEQNLLDCMGSNVTHGCSGGFMQYAFQYVKDNGGLATE 199

Query:   228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             + YPYRG+   C+    ++ A  +  +  IP
Sbjct:   200 ESYPYRGQGRECRYH-AENSAANVRDFVQIP 229

 Score = 174 (66.3 bits), Expect = 4.2e-49, Sum P(2) = 4.2e-49
 Identities = 39/89 (43%), Positives = 53/89 (59%)

Query:   256 AIPARY-AFQLYSHGVFDE-YCGH-QLNHGVTVVGYG----EDHGEKYWLVKNSWGTSWG 308
             A+ A + +FQ Y  G++ E  C    LNH V VVGYG    E  G  +WLVKNSWG  WG
Sbjct:   248 AVDASHGSFQFYGSGIYYEPQCKRVHLNHAVLVVGYGFEGEESDGNSFWLVKNSWGEEWG 307

Query:   309 EAGYIRMARNSPSSNIGICGILMQASYPV 337
               GY+++A++   SN   CGI   ++YP+
Sbjct:   308 MKGYMKLAKDW--SNH--CGIATYSTYPI 332


>WB|WBGene00007055 [details] [associations]
            symbol:tag-196 species:6239 "Caenorhabditis elegans"
            [GO:0004869 "cysteine-type endopeptidase inhibitor activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000010
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00031 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00043 SMART:SM00645 InterPro:IPR000169
            GO:GO:0004869 eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 EMBL:FO080488 PIR:T31871
            RefSeq:NP_505215.2 HSSP:Q9UBX1 ProteinModelPortal:O16454 SMR:O16454
            DIP:DIP-27400N IntAct:O16454 MINT:MINT-1044990 MEROPS:C01.A50
            PaxDb:O16454 EnsemblMetazoa:F41E6.6.1 EnsemblMetazoa:F41E6.6.2
            EnsemblMetazoa:F41E6.6.3 GeneID:179240 KEGG:cel:CELE_F41E6.6
            UCSC:F41E6.6.1 CTD:179240 WormBase:F41E6.6 InParanoid:O16454
            OMA:GGGLMTN NextBio:904514 Uniprot:O16454
        Length = 477

 Score = 344 (126.2 bits), Expect = 1.1e-48, Sum P(2) = 1.1e-48
 Identities = 82/211 (38%), Positives = 120/211 (56%)

Query:    62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYI--NSQNLS-FKLTDNKFADLSNEEF 118
             F +++ ++ ++Y ++ E  +RF ++  N + I  +  N Q  + +  T  KF+D++  EF
Sbjct:   174 FLDFVDRHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVYGFT--KFSDMTTMEF 231

Query:   119 ISTYLGYNKPYNEPRWP-----------SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
                 L Y   + +P +P           ++    LP S DWR++GAVT VK+QG CGSCW
Sbjct:   232 KKIMLPYQ--WEQPVYPMEQANFEKHDVTINEEDLPESFDWREKGAVTQVKNQGNCGSCW 289

Query:   168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
             AFS    VEG   +   KLVSLSEQELVDCD  S +QGCNGG    A++ I ++GG+  E
Sbjct:   290 AFSTTGNVEGAWFIAKNKLVSLSEQELVDCD--SMDQGCNGGLPSNAYKEIIRMGGLEPE 347

Query:   228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             D YPY G+ + C   + K  AV I G   +P
Sbjct:   348 DAYPYDGRGETCHLVR-KDIAVYINGSVELP 377

 Score = 181 (68.8 bits), Expect = 1.1e-48, Sum P(2) = 1.1e-48
 Identities = 35/75 (46%), Positives = 46/75 (61%)

Query:   264 QLYSHGV---FDEYCG-HQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
             Q Y HGV   F  +C    LNHGV +VGYG+D  + YW+VKNSWG +WGEAGY ++ R  
Sbjct:   403 QFYRHGVVHPFKIFCEPFMLNHGVLIVGYGKDGRKPYWIVKNSWGPNWGEAGYFKLYRGK 462

Query:   320 PSSNIGICGILMQAS 334
                   +CG+   A+
Sbjct:   463 -----NVCGVQEMAT 472


>UNIPROTKB|F7BJD8 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9796 "Equus
            caballus" [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=ISS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0004252
            "serine-type endopeptidase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005829 "cytosol" evidence=ISS] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0010815 "bradykinin catabolic
            process" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0016505 "apoptotic protease
            activator activity" evidence=ISS] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=ISS] [GO:0030335
            "positive regulation of cell migration" evidence=ISS] [GO:0031638
            "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0032526 "response to retinoic
            acid" evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            GO:GO:0043066 GO:GO:0005615 GO:GO:0008284 GO:GO:0070371
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458
            OMA:STSCHKT GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129
            Ensembl:ENSECAT00000013967 Uniprot:F7BJD8
        Length = 305

 Score = 356 (130.4 bits), Expect = 1.4e-48, Sum P(2) = 1.4e-48
 Identities = 75/188 (39%), Positives = 115/188 (61%)

Query:    62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
             F++W+ Q+ ++Y SE E+  R   + SN + I+  N+ N +F++  N+F+ ++  E    
Sbjct:     5 FKSWMVQHQKKYSSE-EYHHRLQTFVSNWRKINAHNTGNHTFRMGLNQFSAMNFAELKHK 63

Query:   122 YLGYNKPYN--EPRWPSVQYLG-LPASVDWRKEGA-VTPVKDQGQCGSCWAFSAVAAVEG 177
             YL +++P N    +   ++  G  P SVDWRK+G  V+PVK+QG CGSCW FS   A+E 
Sbjct:    64 YL-WSEPQNCSATKGNYLRGAGPYPPSVDWRKKGNFVSPVKNQGGCGSCWTFSTTGALES 122

Query:   178 INKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKND 237
                + +GKL+SL+EQ+LVDC  N  N GC GG   +AFE+I    G+  ED YPY+G++ 
Sbjct:   123 AVAIASGKLLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKGQDG 182

Query:   238 RCQTDKTK 245
              C+    K
Sbjct:   183 DCKFQPNK 190

 Score = 168 (64.2 bits), Expect = 1.4e-48, Sum P(2) = 1.4e-48
 Identities = 32/79 (40%), Positives = 44/79 (55%)

Query:   263 FQLYSHGVFDEYCGHQ----LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
             F +Y  G++     H+    +NH V  VGYGE++G  YW+VKNSWG  WG  GY  + R 
Sbjct:   229 FMMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPHWGMNGYFLIERG 288

Query:   319 SPSSNIGICGILMQASYPV 337
                    +CG+   ASYP+
Sbjct:   289 K-----NMCGLAACASYPI 302


>TAIR|locus:2825832
            symbol:RD21A "responsive to dehydration 21A" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;IMP]
            [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISS;IDA;IMP] [GO:0009507 "chloroplast" evidence=IDA]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0008233 "peptidase
            activity" evidence=IDA] [GO:0048046 "apoplast" evidence=IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005829 "cytosol"
            evidence=RCA] [GO:0009506 "plasmodesma" evidence=IDA] [GO:0050832
            "defense response to fungus" evidence=IMP] [GO:0006096 "glycolysis"
            evidence=RCA] [GO:0006833 "water transport" evidence=RCA]
            [GO:0006972 "hyperosmotic response" evidence=RCA] [GO:0007030
            "Golgi organization" evidence=RCA] [GO:0009266 "response to
            temperature stimulus" evidence=RCA] [GO:0009651 "response to salt
            stress" evidence=RCA] [GO:0015996 "chlorophyll catabolic process"
            evidence=RCA] [GO:0042744 "hydrogen peroxide catabolic process"
            evidence=RCA] [GO:0046686 "response to cadmium ion" evidence=RCA]
            [GO:0009414 "response to water deprivation" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0009506 GO:GO:0009507 GO:GO:0005773
            GO:GO:0050832 GO:GO:0048046 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC083835
            HOGENOM:HOG000230773 KO:K01376 InterPro:IPR000118 Pfam:PF00396
            SMART:SM00277 UniGene:At.43549 EMBL:D13043 EMBL:AY072130
            EMBL:AY133781 IPI:IPI00530094 PIR:JN0719 RefSeq:NP_564497.1
            UniGene:At.47599 UniGene:At.71705 ProteinModelPortal:P43297
            SMR:P43297 IntAct:P43297 STRING:P43297 MEROPS:C01.064 PaxDb:P43297
            PRIDE:P43297 ProMEX:P43297 EnsemblPlants:AT1G47128.1 GeneID:841122
            KEGG:ath:AT1G47128 TAIR:At1g47128 InParanoid:P43297 OMA:EAWLVKH
            PhylomeDB:P43297 ProtClustDB:CLSN2688498 Genevestigator:P43297
            GermOnline:AT1G47128 Uniprot:P43297
        Length = 462

 Score = 506 (183.2 bits), Expect = 1.8e-48, P = 1.8e-48
 Identities = 100/204 (49%), Positives = 134/204 (65%)

Query:    62 FENWLKQYSREYGSED--EWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
             +E WL ++ +        E  RRF I+  N++++D  N +NLS++L   +FADL+N+E+ 
Sbjct:    50 YEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYR 109

Query:   120 STYLGYNKPYNEPRWPSVQY---LG--LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
             S YLG        R  S++Y   +G  LP S+DWRK+GAV  VKDQG CGSCWAFS + A
Sbjct:   110 SKYLGAKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGA 169

Query:   175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
             VEGIN++ TG L++LSEQELVDCD  S N+GCNGG M+ AFEFI K GG+ T+ DYPY+G
Sbjct:   170 VEGINQIVTGDLITLSEQELVDCDT-SYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKG 228

Query:   235 KNDRCQTDKTKHHAVTITGYEAIP 258
              +  C   +     VTI  YE +P
Sbjct:   229 VDGTCDQIRKNAKVVTIDSYEDVP 252

 Score = 280 (103.6 bits), Expect = 3.9e-24, P = 3.9e-24
 Identities = 62/145 (42%), Positives = 84/145 (57%)

Query:   195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP-YRGKNDRCQTDKTKHHAVTITG 253
             +D D +   +G +G       + I K   V T D Y      ++        H  ++I  
Sbjct:   218 IDTDKDYPYKGVDG-----TCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIA- 271

Query:   254 YEAIPARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYI 313
              EA     AFQLY  G+FD  CG QL+HGV  VGYG ++G+ YW+V+NSWG SWGE+GY+
Sbjct:   272 IEA--GGRAFQLYDSGIFDGSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYL 329

Query:   314 RMARNSPSSNIGICGILMQASYPVK 338
             RMARN  SS+ G CGI ++ SYP+K
Sbjct:   330 RMARNIASSS-GKCGIAIEPSYPIK 353


>UNIPROTKB|H9KYW5
            symbol:CTSS "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0002250 "adaptive immune response" evidence=IEA]
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 OMA:YEPACTQ EMBL:AADN02010496
            Ensembl:ENSGALT00000001122 Uniprot:H9KYW5
        Length = 245

 Score = 324 (119.1 bits), Expect = 2.9e-48, Sum P(2) = 2.9e-48
 Identities = 60/118 (50%), Positives = 81/118 (68%)

Query:   141 GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVN 200
             G P ++DWR++G VT VK+QG CG+CWAFSAV A+E   KLKTGKLVSLS Q LVDC + 
Sbjct:    29 GAPDAMDWREKGCVTEVKNQGACGACWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSMM 88

Query:   201 SENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
               N+GC GG+M +AF++I    G+ +E+ YPY  +N  CQ + +   A T + Y  +P
Sbjct:    89 YGNKGCGGGFMTRAFQYIIDNNGIDSEESYPYMAQNGTCQYNVSTR-AATCSKYVELP 145

 Score = 197 (74.4 bits), Expect = 2.9e-48, Sum P(2) = 2.9e-48
 Identities = 39/75 (52%), Positives = 50/75 (66%)

Query:   263 FQLYSHGVFDE-YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
             F LY  GV+D+  C  ++NHGV VVGYG  + + +WLVKNSWG  +G+ GYIRM+RN  +
Sbjct:   173 FFLYRSGVYDDPRCTQEVNHGVLVVGYGTLNEKDFWLVKNSWGERFGDGGYIRMSRNHAN 232

Query:   322 SNIGICGILMQASYP 336
                  CGI   ASYP
Sbjct:   233 H----CGIASYASYP 243


>DICTYBASE|DDB_G0290957
            symbol:cprA "cysteine proteinase 1" species:44689
            "Dictyostelium discoideum" [GO:0006972 "hyperosmotic response"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0290957
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GenomeReviews:CM000154_GR GO:GO:0005764
            GO:GO:0006972 EMBL:AAFI02000174 KO:K01376 EMBL:X02407 PIR:A22827
            RefSeq:XP_635417.1 ProteinModelPortal:P04988 MEROPS:C01.022
            GlycoSuiteDB:P04988 SWISS-2DPAGE:P04988 EnsemblProtists:DDB0201647
            GeneID:8627918 KEGG:ddi:DDB_G0290957 OMA:KISNFTM
            ProtClustDB:CLSZ2429603 Uniprot:P04988
        Length = 343

 Score = 368 (134.6 bits), Expect = 6.0e-48, Sum P(2) = 6.0e-48
 Identities = 93/249 (37%), Positives = 134/249 (53%)

Query:    30 LSLFLLWVLGI-PAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
             + + LL+VL +      S G P +   Q +E  F++   +++++Y S +E+  RF I+ S
Sbjct:     1 MKVILLFVLAVFTVFVSSRGIPLEEQSQFLE--FQD---KFNKKY-SHEEYLERFEIFKS 54

Query:    89 NVQYIDYIN----SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYL---- 140
             N+  I+ +N    +     K   NKFADLS++EF + YL   +       P   YL    
Sbjct:    55 NLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEF 114

Query:   141 --GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD 198
                +P + DWR  GAVTPVK+QGQCGSCW+FS    VEG + +   KLVSLSEQ LVDCD
Sbjct:   115 INSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCD 174

Query:   199 ---VNSE-----NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKND-RCQTDKTKHHAV 249
                +  E     ++GCNGG    A+ +I K GG+ TE  YPY  +   +C  +     A 
Sbjct:   175 HECMEYEGEQACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGA- 233

Query:   250 TITGYEAIP 258
              I+ +  IP
Sbjct:   234 KISNFTMIP 242

 Score = 150 (57.9 bits), Expect = 6.0e-48, Sum P(2) = 6.0e-48
 Identities = 33/83 (39%), Positives = 44/83 (53%)

Query:   249 VTITGYEAIPA-RYAFQLYSHGVFDEYCG-HQLNHGVTVVGYGEDH-----GEKYWLVKN 301
             +  TG  AI A    +Q Y  GVFD  C  + L+HG+ +VGY   +        YW+VKN
Sbjct:   252 IVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKN 311

Query:   302 SWGTSWGEAGYIRMARNSPSSNI 324
             SWG  WGE GYI + R   +  +
Sbjct:   312 SWGADWGEQGYIYLRRGKNTCGV 334


>TAIR|locus:2130180
            symbol:AT4G16190 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0042744 "hydrogen peroxide catabolic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005773
            EMBL:CP002687 HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 EMBL:Z97340 EMBL:AL161543 UniGene:At.25555
            EMBL:AY039556 EMBL:AY129473 EMBL:AY136316 EMBL:BT000733
            EMBL:AK226366 IPI:IPI00543588 PIR:D71428 RefSeq:NP_567489.1
            HSSP:P25779 ProteinModelPortal:Q9SUL1 SMR:Q9SUL1 STRING:Q9SUL1
            MEROPS:C01.A06 PRIDE:Q9SUL1 EnsemblPlants:AT4G16190.1 GeneID:827311
            KEGG:ath:AT4G16190 TAIR:At4g16190 InParanoid:Q9SUL1 OMA:NACGINK
            PhylomeDB:Q9SUL1 ProtClustDB:CLSN2917559 Genevestigator:Q9SUL1
            Uniprot:Q9SUL1
        Length = 373

 Score = 384 (140.2 bits), Expect = 2.0e-47, Sum P(2) = 2.0e-47
 Identities = 88/238 (36%), Positives = 135/238 (56%)

Query:    50 PQKYDPQSM--EERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTD 107
             P++ D Q +  E  F  +  +Y + Y ++ E   RF ++ +N++        + S     
Sbjct:    41 PEENDEQLLNAEHHFTLFKSKYEKTYATQVEHDHRFRVFKANLRRARRNQLLDPSAVHGV 100

Query:   108 NKFADLSNEEFISTYLGYNK-----PYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQ 162
              +F+DL+ +EF   +LG  +     P +    P +    LP   DWR++GAVTPVK+QG 
Sbjct:   101 TQFSDLTPKEFRRKFLGLKRRGFRLPTDTQTAPILPTSDLPTEFDWREQGAVTPVKNQGM 160

Query:   163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD-------VNSENQGCNGGYMEKAF 215
             CGSCW+FSA+ A+EG + L T +LVSLSEQ+LVDCD        NS + GC+GG M  AF
Sbjct:   161 CGSCWSFSAIGALEGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAF 220

Query:   216 EFITKIGGVTTEDDYPYRGKNDR-CQTDKTKHHAVTITGYEAIPA---RYAFQLYSHG 269
             E+  K GG+  E+DYPY G++   C+ DK+K  A +++ +  + +   + A  L  HG
Sbjct:   221 EYALKAGGLMKEEDYPYTGRDHTACKFDKSKIVA-SVSNFSVVSSDEDQIAANLVQHG 277

 Score = 129 (50.5 bits), Expect = 2.0e-47, Sum P(2) = 2.0e-47
 Identities = 32/79 (40%), Positives = 42/79 (53%)

Query:   256 AIPARYAFQLYSHGVFDEY-CGHQLNHGVTVVGYGEDH------GEK-YWLVKNSWGTSW 307
             AI A +  Q Y  GV   Y C    +HGV +VG+G          EK YW++KNSWG  W
Sbjct:   282 AINAMW-MQTYIGGVSCPYVCSKSQDHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMW 340

Query:   308 GEAGYIRMARNSPSSNIGI 326
             GE GY ++ R  P +  G+
Sbjct:   341 GEHGYYKICRG-PHNMCGM 358


>UNIPROTKB|Q4QRC2
            symbol:Ctsql2 "Protein Ctsql2" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 HOVERGEN:HBG011513 EMBL:CH474032
            RGD:1303225 EMBL:BC097257 IPI:IPI00421946 RefSeq:NP_001002813.2
            UniGene:Rn.128678 SMR:Q4QRC2 MEROPS:C01.111
            Ensembl:ENSRNOT00000038758 GeneID:408201 KEGG:rno:408201 CTD:408201
            InParanoid:Q4QRC2 OMA:NDEGALM NextBio:696394 Genevestigator:Q4QRC2
            Uniprot:Q4QRC2
        Length = 343

 Score = 338 (124.0 bits), Expect = 3.3e-47, Sum P(2) = 3.3e-47
 Identities = 86/245 (35%), Positives = 129/245 (52%)

Query:    31 SLFLLWV-LGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS- 88
             +LFL+ + LG+ +GA +          S++ +++ W  +Y + Y  E+E  +R     + 
Sbjct:     4 ALFLIILCLGVVSGASAFNL-------SLDVQWQEWKMKYEKLYSPEEELLKRVVWEENV 56

Query:    89 -NVQYIDYINSQNLSFKLTD-NKFADLSNEEFISTYLGYNKPYNEPR---W--------P 135
               ++  +  NS   +  + + N FADL++EEF     G   P N      W        P
Sbjct:    57 KKIELHNRENSLGKNTYIMEINNFADLTDEEFKDMITGITLPINNTMKSLWKRALGSPFP 116

Query:   136 SVQYL--GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
             +  Y    LP S+DWRKEG VT V++QG+C SCWAF    A+EG    KTGKL  LS Q 
Sbjct:   117 NSWYWRDALPKSIDWRKEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQN 176

Query:   194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITG 253
             LVDC     N+GC GG    AF+++ + GG+ +E  YPY+GK   C+ +    +A  IT 
Sbjct:   177 LVDCSKPQGNKGCRGGTTYNAFQYVLQNGGLESEATYPYKGKEGLCKYNPKNAYA-KITR 235

Query:   254 YEAIP 258
             + A+P
Sbjct:   236 FVALP 240

 Score = 173 (66.0 bits), Expect = 3.3e-47, Sum P(2) = 3.3e-47
 Identities = 33/81 (40%), Positives = 48/81 (59%)

Query:   262 AFQLYSHGVFDE-YCGHQLNHGVTVVGYG----EDHGEKYWLVKNSWGTSWGEAGYIRMA 316
             + + Y  G++ E  C +++NH V VVGYG    E  G  YWL+KNSWG  WG  GY+++A
Sbjct:   266 SLRFYKKGIYHEPKCNNRVNHAVLVVGYGFEGNETDGNNYWLIKNSWGKQWGLKGYMKIA 325

Query:   317 RNSPSSNIGICGILMQASYPV 337
             ++  +     CGI   A YP+
Sbjct:   326 KDRNNH----CGIATFAQYPI 342


>UNIPROTKB|E9PSK9
            symbol:Ctsql2 "Protein Ctsql2" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            IPI:IPI00562656 Ensembl:ENSRNOT00000045847 RGD:1303225
            ArrayExpress:E9PSK9 Uniprot:E9PSK9
        Length = 342

 Score = 338 (124.0 bits), Expect = 6.8e-47, Sum P(2) = 6.8e-47
 Identities = 86/245 (35%), Positives = 129/245 (52%)

Query:    31 SLFLLWV-LGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS- 88
             +LFL+ + LG+ +GA +          S++ +++ W  +Y + Y  E+E  +R     + 
Sbjct:     4 ALFLIILCLGVVSGASAFNL-------SLDVQWQEWKMKYEKLYSPEEELLKRVVWEENV 56

Query:    89 -NVQYIDYINSQNLSFKLTD-NKFADLSNEEFISTYLGYNKPYNEPR---W--------P 135
               ++  +  NS   +  + + N FADL++EEF     G   P N      W        P
Sbjct:    57 KKIELHNRENSLGKNTYIMEINNFADLTDEEFKDMITGITLPINNTMKSLWKRALGSPFP 116

Query:   136 SVQYL--GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
             +  Y    LP S+DWRKEG VT V++QG+C SCWAF    A+EG    KTGKL  LS Q 
Sbjct:   117 NSWYWRDALPKSIDWRKEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQN 176

Query:   194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITG 253
             LVDC     N+GC GG    AF+++ + GG+ +E  YPY+GK   C+ +    +A  IT 
Sbjct:   177 LVDCSKPQGNKGCRGGTTYNAFQYVLQNGGLESEATYPYKGKEGLCKYNPKNAYA-KITR 235

Query:   254 YEAIP 258
             + A+P
Sbjct:   236 FVALP 240

 Score = 170 (64.9 bits), Expect = 6.8e-47, Sum P(2) = 6.8e-47
 Identities = 33/82 (40%), Positives = 49/82 (59%)

Query:   261 YAFQLYSHGVFDE-YCGHQLNHGVTVVGYG----EDHGEKYWLVKNSWGTSWGEAGYIRM 315
             Y++  +  G++ E  C +++NH V VVGYG    E  G  YWL+KNSWG  WG  GY+++
Sbjct:   264 YSYFHFVSGIYHEPKCNNRVNHAVLVVGYGFEGNETDGNNYWLIKNSWGKQWGLKGYMKI 323

Query:   316 ARNSPSSNIGICGILMQASYPV 337
             A++  +     CGI   A YP+
Sbjct:   324 AKDRNNH----CGIATFAQYPI 341


>TAIR|locus:2090614
            symbol:AT3G19390 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA;ISS] [GO:0000041 "transition metal ion
            transport" evidence=RCA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:CP002686 GenomeReviews:BA000014_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AB025624
            MEROPS:I29.003 OMA:KAMDQKC HSSP:O65039 HOGENOM:HOG000230773
            InterPro:IPR000118 Pfam:PF00396 SMART:SM00277 EMBL:AY062725
            EMBL:AY093350 IPI:IPI00520189 RefSeq:NP_566633.1 UniGene:At.27473
            ProteinModelPortal:Q9LT78 SMR:Q9LT78 IntAct:Q9LT78 STRING:Q9LT78
            PaxDb:Q9LT78 PRIDE:Q9LT78 EnsemblPlants:AT3G19390.1 GeneID:821473
            KEGG:ath:AT3G19390 TAIR:At3g19390 InParanoid:Q9LT78
            PhylomeDB:Q9LT78 ProtClustDB:CLSN2917188 Genevestigator:Q9LT78
            Uniprot:Q9LT78
        Length = 452

 Score = 491 (177.9 bits), Expect = 6.9e-47, P = 6.9e-47
 Identities = 127/327 (38%), Positives = 178/327 (54%)

Query:    21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEER-FENWLKQYSREYGSEDEW 79
             M   +++  L+L +  VL I     S    +    ++   R +E WL +  + Y    E 
Sbjct:     1 MATSIKSITLALLIFSVLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEK 60

Query:    80 QRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQ 138
             +RRF I+  N+++++  +S  N ++++   +FADL+N+EF + YL              +
Sbjct:    61 ERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKMERTRVPVKGEK 120

Query:   139 YL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
             YL      LP ++DWR +GAV PVKDQG CGSCWAFSA+ AVEGIN++KTG+L+SLSEQE
Sbjct:   121 YLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQE 180

Query:   194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN-DRCQTDKTKHHAVTIT 252
             LVDCD  S N GC GG M+ AF+FI + GG+ TE+DYPY   + + C +DK     VTI 
Sbjct:   181 LVDCDT-SYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNSDKKNTRVVTID 239

Query:   253 GYEAIPA------RYAFQLYSHGVFDEYCGH--QL-NHGVTV--VGYGEDHGEKYWLVKN 301
             GYE +P       + A       V  E  G   QL   GV     G   DHG    +V  
Sbjct:   240 GYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSLDHG----VVAV 295

Query:   302 SWGTSWGEAGYIRMARNSPSSNIGICG 328
              +G+  G+  +I   RNS  SN G  G
Sbjct:   296 GYGSEGGQDYWI--VRNSWGSNWGESG 320

 Score = 259 (96.2 bits), Expect = 7.9e-22, P = 7.9e-22
 Identities = 52/115 (45%), Positives = 67/115 (58%)

Query:   224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQLYSHGVFDEYCGHQLNHGV 283
             V T D Y    +ND     K   +       EA     AFQLY+ GVF   CG  L+HGV
Sbjct:   235 VVTIDGYEDVPQNDEKSLKKALANQPISVAIEA--GGRAFQLYTSGVFTGTCGTSLDHGV 292

Query:   284 TVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
               VGYG + G+ YW+V+NSWG++WGE+GY ++ RN   S+ G CG+ M ASYP K
Sbjct:   293 VAVGYGSEGGQDYWIVRNSWGSNWGESGYFKLERNIKESS-GKCGVAMMASYPTK 346


>RGD|1562210
            symbol:MGC114246 "similar to cathepsin R" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1562210 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 EMBL:CH474032 MEROPS:C01.042 KO:K09601
            OMA:ASHESFK OrthoDB:EOG4ZCT6D EMBL:BC091563 IPI:IPI00555186
            RefSeq:NP_001017509.1 UniGene:Rn.198321 SMR:Q5BJA0
            Ensembl:ENSRNOT00000061470 GeneID:498688 KEGG:rno:498688
            UCSC:RGD:1562210 InParanoid:Q5BJA0 NextBio:700535
            Genevestigator:Q5BJA0 Uniprot:Q5BJA0
        Length = 334

 Score = 351 (128.6 bits), Expect = 1.1e-46, Sum P(2) = 1.1e-46
 Identities = 77/212 (36%), Positives = 118/212 (55%)

Query:    54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN-LS---FKLTDNK 109
             DP S++  ++ W K+Y + Y  E+E  RR  ++  N++ I   N +N L    F +  N+
Sbjct:    22 DP-SLDAEWQEWKKKYDKSYSLEEEELRR-AVWEENLKMIKLHNGENGLGKNGFTMEINE 79

Query:   110 FADLSNEEFISTYLGYN-KPYNEPRWPSVQYLG--LPASVDWRKEGAVTPVKDQGQCGSC 166
             F D + EEF    + +  + + E +    +  G   P  VDWRK+G VTPV+ QG C +C
Sbjct:    80 FGDTTGEEFRKMMVEFPVQTHREGKSIMKRAAGSIFPKFVDWRKKGYVTPVRRQGNCNAC 139

Query:   167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
             WAFS   A+E     ++GKL+ LS Q LVDC     N GC GG    AF+++   GG+ +
Sbjct:   140 WAFSVTGAIEAQTIWQSGKLIPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVLHNGGLQS 199

Query:   227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             E  YPY GK+  C+ +  K+ +  ITG+ ++P
Sbjct:   200 EATYPYEGKDGPCRYNP-KNSSAEITGFVSLP 230

 Score = 155 (59.6 bits), Expect = 1.1e-46, Sum P(2) = 1.1e-46
 Identities = 31/81 (38%), Positives = 46/81 (56%)

Query:   262 AFQLYSHGVFDE-YCG-HQLNHGVTVVGYG----EDHGEKYWLVKNSWGTSWGEAGYIRM 315
             +F+ Y  G++ E  C  + + HGV VVGYG    +  G+ YWL+KNSWG  WG  GY+++
Sbjct:   256 SFKFYKKGIYHEPNCSSNSVTHGVLVVGYGFKGNDTGGDHYWLIKNSWGKQWGIRGYMKI 315

Query:   316 ARNSPSSNIGICGILMQASYP 336
              ++  +     C I   A YP
Sbjct:   316 TKDKNNH----CAIASYAHYP 332


>UNIPROTKB|F1RU48
            symbol:CTSF "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            OMA:LAPPEWD GeneTree:ENSGT00660000095458 EMBL:CU928034
            EMBL:FP565364 Ensembl:ENSSSCT00000014140 Ensembl:ENSSSCT00000014154
            Uniprot:F1RU48
        Length = 460

 Score = 353 (129.3 bits), Expect = 1.4e-46, Sum P(2) = 1.4e-46
 Identities = 83/203 (40%), Positives = 118/203 (58%)

Query:    50 PQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNV---QYIDYINSQNLSFKLT 106
             PQ +  + M   F+ ++  Y+R Y +++E + R  ++++N+   Q I  +++    + +T
Sbjct:   152 PQDFSVK-MASIFKEFVTTYNRTYDTKEEARWRMSVFANNMVRAQKIQALDTGTARYGVT 210

Query:   107 DNKFADLSNEEFISTYLG---YNKPYNEPRWP-SVQYLGLPASVDWRKEGAVTPVKDQGQ 162
               KF+DL+ EEF + YL      +P  + R   SV  L  P   DWRK+GAVT VKDQG 
Sbjct:   211 --KFSDLTEEEFRTIYLNPLLQEEPGRKMRLAKSVSSLP-PPEWDWRKKGAVTKVKDQGM 267

Query:   163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
             CGSCWAFS    VEG   LK G L+SLSEQEL+DCD    ++GC GG    A+  I  +G
Sbjct:   268 CGSCWAFSVTGNVEGQWFLKQGTLLSLSEQELLDCD--KVDKGCMGGLPSNAYSAIKTLG 325

Query:   223 GVTTEDDYPYRGKNDRCQTDKTK 245
             G+ TE+DY YRG    C  +  K
Sbjct:   326 GLETEEDYSYRGHLQTCSFNAEK 348

 Score = 152 (58.6 bits), Expect = 1.4e-46, Sum P(2) = 1.4e-46
 Identities = 35/86 (40%), Positives = 44/86 (51%)

Query:   256 AIPARYAFQLYSHGV---FDEYCGHQL-NHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAG 311
             AI A +  Q Y HG+       C   L +H V +VGYG      +W +KNSWGT WGE G
Sbjct:   379 AINA-FGMQFYRHGISHPLRPLCSPWLIDHAVLLVGYGNRSATPFWAIKNSWGTDWGEEG 437

Query:   312 YIRMARNSPSSNIGICGILMQASYPV 337
             Y  + R S     G CG+ + AS  V
Sbjct:   438 YYYLYRGS-----GACGVNIMASSAV 458


>TAIR|locus:2090629
            symbol:AT3G19400 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0019344 "cysteine biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            EMBL:CP002686 GenomeReviews:BA000014_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AB025624
            MEROPS:I29.003 HOGENOM:HOG000230773 EMBL:AK118509 IPI:IPI00543468
            RefSeq:NP_566634.2 UniGene:At.38409 ProteinModelPortal:Q9LT77
            SMR:Q9LT77 PaxDb:Q9LT77 PRIDE:Q9LT77 EnsemblPlants:AT3G19400.1
            GeneID:821474 KEGG:ath:AT3G19400 TAIR:At3g19400 InParanoid:Q9LT77
            OMA:IGEHERR ProtClustDB:CLSN2679975 Genevestigator:Q9LT77
            Uniprot:Q9LT77
        Length = 362

 Score = 487 (176.5 bits), Expect = 1.8e-46, P = 1.8e-46
 Identities = 135/341 (39%), Positives = 184/341 (53%)

Query:    25 LRNAVLSLFLLWVLGIPA--GAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRR 82
             +R  V +L +L VL + +  G  +E   ++ + + +   +E WL +  + Y    E +RR
Sbjct:     6 IRVIVSALVILSVLLLSSSLGVATETEIERNETE-VRLMYEQWLVENRKNYNGLGEKERR 64

Query:    83 FGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYL- 140
             F I+  N++++D  NS  + +F++   +FADL+NEEF + YL       +    + +YL 
Sbjct:    65 FKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAIYLRKKMERTKDSVKTERYLY 124

Query:   141 --G--LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVD 196
               G  LP  VDWR  GAV  VKDQG CGSCWAFSAV AVEGIN++ TG+L+SLSEQELVD
Sbjct:   125 KEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQITTGELISLSEQELVD 184

Query:   197 CDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDR--CQTDKTKH-HAVTITG 253
             CD    N GC+GG M  AFEFI K GG+ T+ DYPY   ND   C  DK  +   VTI G
Sbjct:   185 CDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNA-NDLGLCNADKNNNTRVVTIDG 243

Query:   254 YEAIPA--------RYAFQLYSHGVFDEYCGHQL-NHGVTV--VGYGEDHGEKYWLVKNS 302
             YE +P           A Q  S  +       QL   GV     G   DHG    +V   
Sbjct:   244 YEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSGVMTGTCGISLDHG----VVVVG 299

Query:   303 WGTSWGEAGYIRMARNSPSSNIGICG-ILMQASY--PVKRC 340
             +G++ GE  +I   RNS   N G  G + +Q +   P  +C
Sbjct:   300 YGSTSGEDYWI--IRNSWGLNWGDSGYVKLQRNIDDPFGKC 338

 Score = 250 (93.1 bits), Expect = 2.4e-21, P = 2.4e-21
 Identities = 50/115 (43%), Positives = 63/115 (54%)

Query:   224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQLYSHGVFDEYCGHQLNHGV 283
             V T D Y    ++D     K   H       EA  +  AFQLY  GV    CG  L+HGV
Sbjct:   238 VVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEA--SSQAFQLYKSGVMTGTCGISLDHGV 295

Query:   284 TVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
              VVGYG   GE YW+++NSWG +WG++GY+++ RN      G CGI M  SYP K
Sbjct:   296 VVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRNIDDP-FGKCGIAMMPSYPTK 349


>DICTYBASE|DDB_G0278401
            symbol:cprH "cysteine proteinase 8" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0278401 EMBL:AAFI02000023
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 ProtClustDB:CLSZ2430780 RefSeq:XP_642342.1
            ProteinModelPortal:Q54Y60 MEROPS:C01.A62 EnsemblProtists:DDB0205428
            GeneID:8621547 KEGG:ddi:DDB_G0278401 InParanoid:Q54Y60 OMA:FANMENE
            Uniprot:Q54Y60
        Length = 337

 Score = 382 (139.5 bits), Expect = 2.3e-46, Sum P(2) = 2.3e-46
 Identities = 83/212 (39%), Positives = 119/212 (56%)

Query:    51 QKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKF 110
             Q+       + F +W+    + Y S  E+  R+ I+ +N  YI+  NS+     L  NK 
Sbjct:    19 QELSESQYRDAFTDWMISNQKSYSSS-EFITRYNIFKTNFDYIEEWNSKGSETVLGLNKM 77

Query:   111 ADLSNEEFISTYLGYNKPYNEPRW----PSVQYLG-LPASVDWRKEGAVTPVKDQGQCGS 165
             AD++NEE+ S YLG  KP++          + +     ++VDWRK+GAVT VK+Q  C  
Sbjct:    78 ADITNEEYRSLYLG--KPFDASSLIGTKEEILFSNKFSSTVDWRKKGAVTHVKNQQSCSG 135

Query:   166 CWAFSAVAAVEGINKLK---TGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
             CW+FSA  A EG +KL    T +LVSLSEQ L+DC     N GCNGG +  AFE+I   G
Sbjct:   136 CWSFSATGATEGAHKLANNGTNELVSLSEQNLIDCSTPFGNTGCNGGVITYAFEYIISNG 195

Query:   223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
             G+ TE  YP+ G +  C+  K+++   TI+ Y
Sbjct:   196 GIDTEKSYPFEGTDGTCRY-KSENSGATISSY 226

 Score = 121 (47.7 bits), Expect = 2.3e-46, Sum P(2) = 2.3e-46
 Identities = 31/89 (34%), Positives = 47/89 (52%)

Query:   262 AFQLYSHGV-FDEYCGH-QLNHGVTVVGYGEDHGEK-----------YWLVKNSWGTSWG 308
             +F  Y  G+ F+  C    L+HGV VVGYG ++ +            YW+ KNSWG +  
Sbjct:   256 SFLFYKSGIYFEPACSRTNLDHGVLVVGYGTENSQSQDSSSEPNHSNYWIAKNSWGIN-- 313

Query:   309 EAGYIRMARNSPSSNIGICGILMQASYPV 337
               GYI M+++  +    +CGI   AS+P+
Sbjct:   314 --GYILMSKDRDN----MCGISTLASFPI 336


>RGD|1588248
            symbol:Cts8 "cathepsin 8" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1588248 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 IPI:IPI00765053
            RefSeq:NP_001121688.1 UniGene:Rn.220599 Ensembl:ENSRNOT00000061486
            GeneID:680718 KEGG:rno:680718 UCSC:RGD:1588248 CTD:56094
            OMA:DSEWQEW OrthoDB:EOG4JT07C NextBio:719350 Uniprot:D3ZP54
        Length = 333

 Score = 341 (125.1 bits), Expect = 2.3e-46, Sum P(2) = 2.3e-46
 Identities = 77/215 (35%), Positives = 113/215 (52%)

Query:    51 QKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN----SQNLSFKLT 106
             Q  DP S++  ++ W  +Y + Y  E+E Q+R  ++  N++ +   N     +  +F + 
Sbjct:    19 QPSDP-SLDSEWQEWKTKYEKNYSLEEEGQKR-AVWEENMKVVKQHNIEYDQEKKNFTME 76

Query:   107 DNKFADLSNEEF--ISTYLGYN--KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQ 162
              N FAD++ EEF  + T +     +       P  +YL  P  VDWR+ G VT VK+QG 
Sbjct:    77 LNAFADMTGEEFRKMMTNIPVQNLRKKKSIHQPIFRYL--PKFVDWRRRGYVTSVKNQGT 134

Query:   163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
             C SCWAFS   A+EG    KTG+LVSLS Q LVDC     N GC+ G    A +++   G
Sbjct:   135 CNSCWAFSVAGAIEGQMFRKTGRLVSLSPQNLVDCSRPEGNHGCHMGSTLYALKYVWSNG 194

Query:   223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI 257
             G+  E  YPY GK   C+    +  A  +TG+  +
Sbjct:   195 GLEAESTYPYEGKEGPCRY-LPRRSAARVTGFSTV 228

 Score = 162 (62.1 bits), Expect = 2.3e-46, Sum P(2) = 2.3e-46
 Identities = 38/106 (35%), Positives = 57/106 (53%)

Query:   240 QTDKTKHHAVTITGYEAI---PARYAFQLYSHGVFDE-YCG-HQLNHGVTVVGYG----E 290
             ++++   HAV   G  ++    +  +F+ Y  G++ E  C  +++NH V VVGYG    E
Sbjct:   230 RSEEALMHAVATIGPISVGIDASHVSFRFYRRGIYYEPRCSSNRINHSVLVVGYGYEGRE 289

Query:   291 DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
               G KYWL+KNS G  WG  GY+++AR   +     CGI     YP
Sbjct:   290 SDGRKYWLIKNSHGVGWGMNGYMKLARGWNNH----CGIATYGFYP 331


>FB|FBgn0034229
            symbol:CG4847 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0032504
            "multicellular organism reproduction" evidence=IEP] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0005615 "extracellular space"
            evidence=ISM;IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:AE013599 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 GO:GO:0032504 GeneTree:ENSGT00560000076599
            KO:K01371 EMBL:BT099507 RefSeq:NP_725686.1 UniGene:Dm.4677
            SMR:A1ZAU4 IntAct:A1ZAU4 MEROPS:C01.A28 EnsemblMetazoa:FBtr0086935
            GeneID:36973 KEGG:dme:Dmel_CG4847 UCSC:CG4847-RB
            FlyBase:FBgn0034229 InParanoid:A1ZAU4 OMA:GGFQEYA OrthoDB:EOG4J9KFC
            ChiTaRS:CG4847 GenomeRNAi:36973 NextBio:801302 Uniprot:A1ZAU4
        Length = 420

 Score = 331 (121.6 bits), Expect = 2.3e-46, Sum P(2) = 2.3e-46
 Identities = 85/218 (38%), Positives = 120/218 (55%)

Query:    62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS---QNL-SFKLTDNKFADLSNEE 117
             F ++L Q  + Y S  +     G ++S    ++  N+   Q + +FK   N FADL++ E
Sbjct:   112 FGDFLSQSGKTYLSAADRALHEGAFASTKNLVEAGNAAFAQGVHTFKQAVNAFADLTHSE 171

Query:   118 FISTYLGYNK-PYNEPRWP-SVQYLGLPA-----SVDWRKEGAVTPVKDQGQCGSCWAFS 170
             F+S   G  + P  + R   S++ + LPA     + DWR+ G VTPVK QG CGSCWAF+
Sbjct:   172 FLSQLTGLKRSPEAKARAAASLKLVNLPAKPIPDAFDWREHGGVTPVKFQGTCGSCWAFA 231

Query:   171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN--QGCNGGYMEKAFEFITKIG-GVTTE 227
                A+EG    KTG L +LSEQ LVDC    +    GC+GG+ E AF FI ++  GV+ E
Sbjct:   232 TTGAIEGHTFRKTGSLPNLSEQNLVDCGPVEDFGLNGCDGGFQEAAFCFIDEVQKGVSQE 291

Query:   228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQL 265
               YPY      C+ D +K  A T+ G+ AIP +   QL
Sbjct:   292 GAYPYIDNKGTCKYDGSKSGA-TLQGFAAIPPKDEEQL 328

 Score = 172 (65.6 bits), Expect = 2.3e-46, Sum P(2) = 2.3e-46
 Identities = 36/91 (39%), Positives = 50/91 (54%)

Query:   248 AVTITGYEAIPARYAFQLYSHGVFDEYCGH-QLNHGVTVVGYGEDHGEKYWLVKNSWGTS 306
             A ++ G E +   YA  +Y+    D+ C   + NH + VVGYG + G+ YW+VKNSW  +
Sbjct:   339 ACSVNGLETLK-NYAGGIYN----DDECNKGEPNHSILVVGYGSEKGQDYWIVKNSWDDT 393

Query:   307 WGEAGYIRMARNSPSSNIGICGILMQASYPV 337
             WGE GY R+ R         C I  + SYPV
Sbjct:   394 WGEKGYFRLPRGK-----NYCFIAEECSYPV 419


>ZFIN|ZDB-GENE-050208-336
            symbol:ctskl "cathepsin K, like" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-050208-336 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:BX465190
            GeneTree:ENSGT00660000095458 IPI:IPI00491185 RefSeq:XP_695425.1
            UniGene:Dr.110795 Ensembl:ENSDART00000062749 GeneID:567046
            KEGG:dre:567046 CTD:567046 NextBio:20888499 Bgee:F1QCP8
            Uniprot:F1QCP8
        Length = 349

 Score = 289 (106.8 bits), Expect = 3.7e-46, Sum P(2) = 3.7e-46
 Identities = 79/228 (34%), Positives = 119/228 (52%)

Query:    50 PQKYDPQSMEE---RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSF--- 103
             P +   +S EE    +  W K++   Y  E E   R  I+ +N+Q I + N+ + SF   
Sbjct:    26 PVQVASESEEEAPTEWNLWKKKHEISYDEESEDVHRKTIWETNMQKI-WKNNNDFSFGLS 84

Query:   104 --KLTDNKFADLSNEEFISTYLGYN---KPYNEPRWPSVQYLGLPA------SVDWRKEG 152
               K+  NK+ DL++ E+    LG         + +  S Q L L A      ++D+R +G
Sbjct:    85 MFKMAMNKYGDLTSVEY-KRLLGSKIKGTGNRKGKITSAQMLRLNAKRLGVTNIDYRAKG 143

Query:   153 AVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYME 212
              VT VKDQG CGSCW+FS   A+EG     TG+LVSLSEQ+LVDC  +    GC+G +M 
Sbjct:   144 YVTEVKDQGYCGSCWSFSTTGAIEGQMYKHTGRLVSLSEQQLVDCSRSYGTYGCSGAWMA 203

Query:   213 KAFEFITKIGGVTTEDDYPYRGKNDR-CQTDKTKHHAVTITGYEAIPA 259
              A++++     + + D YPY   + + C  +K    A  I+ Y  +PA
Sbjct:   204 NAYDYVIN-NALESSDTYPYTSVDTQPCFYEKNLAMA-GISDYRFVPA 249

 Score = 212 (79.7 bits), Expect = 3.7e-46, Sum P(2) = 3.7e-46
 Identities = 39/78 (50%), Positives = 50/78 (64%)

Query:   262 AFQLYSHGVFDEY-CG-HQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
             +F  YS G++ E  C  + LNH V VVGYG + G  YW++KNSWGT WGE GY+RM RN 
Sbjct:   275 SFLFYSSGIYKESNCNPNNLNHAVLVVGYGSEEGTDYWIIKNSWGTGWGEGGYMRMIRNG 334

Query:   320 PSSNIGICGILMQASYPV 337
              ++    CGI   A YP+
Sbjct:   335 KNT----CGIASYALYPI 348


>UNIPROTKB|J9P7C5
            symbol:J9P7C5 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 EMBL:AAEX03010953
            Ensembl:ENSCAFT00000012925 Uniprot:J9P7C5
        Length = 321

 Score = 351 (128.6 bits), Expect = 7.6e-46, Sum P(2) = 7.6e-46
 Identities = 81/210 (38%), Positives = 121/210 (57%)

Query:    58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN---SQNL-SFKLTDNKFADL 113
             +++R++ W   + R YG  +E  RR  ++  N++ I+  N   SQ    F +  N F D+
Sbjct:    21 LDQRYQ-WKAMHRRLYGMNEEGWRR-AVWEKNMKMIELHNREYSQGKHGFTMAMNAFGDM 78

Query:   114 SNEEFISTYLGY-NKPYNEPR-WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
             +NEEF     G+ N+ + + + +    +  +P SVDWR++G VTPVK+QGQCGSCWAFSA
Sbjct:    79 TNEEFRQVINGFQNQKHKKGKVFQEPLFAEIPKSVDWREKGYVTPVKNQGQCGSCWAFSA 138

Query:   172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
               A EG    KTG LV LSEQ L        N+GCNGG M+ AF+++     + +E+ YP
Sbjct:   139 TGAFEGQMFWKTGNLVPLSEQNLAQ-----GNEGCNGGLMDNAFQYVKDNRCLDSEESYP 193

Query:   232 YRGKN-DRCQTDKTKHHAVTITGYEAIPAR 260
             Y G++ D C   K +  A   +G+  +P R
Sbjct:   194 YLGRDTDTCNY-KPECSAAHDSGFVDLPQR 222

 Score = 147 (56.8 bits), Expect = 7.6e-46, Sum P(2) = 7.6e-46
 Identities = 43/108 (39%), Positives = 54/108 (50%)

Query:   237 DRCQTDKTKHHAVTITGY--EAIPARYA-FQLYSHGV-FDEYCGHQ-LNHGVTVVGYG-- 289
             D  Q +K    A+   G    AI A +  FQ Y   + FD  C  + L+HGV VVGYG  
Sbjct:   218 DLPQREKALMKAMATLGSITVAIDAGHQYFQFYKSSIYFDPDCSSKDLDHGVLVVGYGFE 277

Query:   290 -EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
               D   K W+VKNSW   WG   Y++MA+   +     CGI   ASYP
Sbjct:   278 GTDSNNK-WIVKNSWSPEWGWNSYVKMAKGQNNH----CGITA-ASYP 319


>MGI|MGI:1860262
            symbol:Cts7 "cathepsin 7" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005576
            "extracellular region" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005768 "endosome" evidence=IEA]
            [GO:0005794 "Golgi apparatus" evidence=IEA] [GO:0006508
            "proteolysis" evidence=ISA] [GO:0007049 "cell cycle" evidence=IEA]
            [GO:0007067 "mitosis" evidence=IEA] [GO:0008152 "metabolic process"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=ISA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0051301 "cell
            division" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:1860262 GO:GO:0005634 GO:GO:0005794 GO:GO:0048471
            GO:GO:0005615 GO:GO:0051301 GO:GO:0007067 GO:GO:0005768
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GO:GO:0008233 EMBL:CH466546
            EMBL:AY014779 EMBL:CT030645 EMBL:BC064740 EMBL:AF250837
            IPI:IPI00131132 RefSeq:NP_062412.1 UniGene:Mm.3692 HSSP:O60911
            ProteinModelPortal:Q91ZF2 SMR:Q91ZF2 STRING:Q91ZF2 MEROPS:C01.016
            PRIDE:Q91ZF2 Ensembl:ENSMUST00000021892 GeneID:56092 KEGG:mmu:56092
            UCSC:uc007qwi.1 CTD:56092 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 InParanoid:Q91ZF2 OMA:ERRVIWE OrthoDB:EOG44QT2S
            NextBio:311908 Bgee:Q91ZF2 Genevestigator:Q91ZF2 Uniprot:Q91ZF2
        Length = 331

 Score = 348 (127.6 bits), Expect = 7.6e-46, Sum P(2) = 7.6e-46
 Identities = 72/206 (34%), Positives = 112/206 (54%)

Query:    57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI-DYINSQNL---SFKLTDNKFAD 112
             +++  +E W +   R Y  E+E QRR  ++  NV++I  +I    L   +F +  N+F D
Sbjct:    24 NLDAEWEEWKRSNDRTYSPEEEKQRR-AVWEGNVKWIKQHIMENGLWMNNFTIEMNEFGD 82

Query:   113 LSNEEFISTYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
             ++ EE        + P    +    +   +P ++DWRKEG VTPV+ QG CG+CWAFS  
Sbjct:    83 MTGEEMKMLTESSSYPLRNGKHIQKRNPKIPPTLDWRKEGYVTPVRRQGSCGACWAFSVT 142

Query:   173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
             A +EG    KTGKL+ LS Q L+DC V+   +GC+GG    AF+++   GG+  E  YPY
Sbjct:   143 ACIEGQLFKKTGKLIPLSVQNLMDCSVSYGTKGCDGGRPYDAFQYVKNNGGLEAEATYPY 202

Query:   233 RGKNDRCQTDKTKHHAVTITGYEAIP 258
               K   C+  + +   V +  +  +P
Sbjct:   203 EAKAKHCRY-RPERSVVKVNRFFVVP 227

 Score = 150 (57.9 bits), Expect = 7.6e-46, Sum P(2) = 7.6e-46
 Identities = 38/88 (43%), Positives = 49/88 (55%)

Query:   256 AIPARYA-FQLYSHGVFDE-YCGHQ-LNHGVTVVGYG-EDH---GEKYWLVKNSWGTSWG 308
             AI   +A F  Y  G++ E  C    L+HG+ +VGYG E H     KYWL+KNS G  WG
Sbjct:   246 AIDGSHASFHSYRGGIYHEPKCRKDTLDHGLLLVGYGYEGHESENRKYWLLKNSHGERWG 305

Query:   309 EAGYIRMARNSPSSNIGICGILMQASYP 336
             E GY+++ R    +N   CGI   A YP
Sbjct:   306 ENGYMKLPRGQ--NNY--CGIASYAMYP 329

 Score = 45 (20.9 bits), Expect = 7.7e-09, Sum P(2) = 7.7e-09
 Identities = 13/47 (27%), Positives = 20/47 (42%)

Query:    45 WSEGYPQKYDPQSMEER---FEN---WLKQYSREYGSEDEWQRRFGI 85
             W     + Y P+  ++R   +E    W+KQ+  E G    W   F I
Sbjct:    32 WKRSNDRTYSPEEEKQRRAVWEGNVKWIKQHIMENGL---WMNNFTI 75


>GENEDB_PFALCIPARUM|PF11_0161
            symbol:PF11_0161 "falcipain-2 precursor,
            putative" species:5833 "Plasmodium falciparum" [GO:0020020 "food
            vacuole" evidence=TAS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014186 HSSP:O65039 OMA:NNVEHIN GO:GO:0020020
            MEROPS:C01.046 HOGENOM:HOG000065857 ProtClustDB:PTZ00021
            RefSeq:XP_001347832.1 ProteinModelPortal:Q8I6U5 SMR:Q8I6U5
            IntAct:Q8I6U5 MINT:MINT-1546851 EnsemblProtists:PF11_0161:mRNA
            GeneID:810708 KEGG:pfa:PF11_0161 EuPathDB:PlasmoDB:PF3D7_1115300
            Uniprot:Q8I6U5
        Length = 482

 Score = 345 (126.5 bits), Expect = 7.6e-46, Sum P(2) = 7.6e-46
 Identities = 81/216 (37%), Positives = 122/216 (56%)

Query:    61 RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS-FKLTDNKFADLSNEEFI 119
             +F  ++K  +++Y S +E + RF ++  N   +   N+   S +K   N+FADL+  EF 
Sbjct:   162 QFYTFIKTNNKQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSLYKKELNRFADLTYHEFK 221

Query:   120 STYLGY--NKPYNEPRW-------PSV--QYLGLP----ASVDWRKEGAVTPVKDQGQCG 164
             S YL    +KP    ++        +V  +Y G      A+ DWR    VTPVKDQ  CG
Sbjct:   222 SKYLTLRSSKPLKNSKYLLDQINYDAVIKKYKGNENFDHAAYDWRLHSGVTPVKDQKNCG 281

Query:   165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
             SCWAFS++ +VE    ++  KL++LSEQELVDC    +N GCNGG +  AFE + ++GG+
Sbjct:   282 SCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSF--KNYGCNGGLINNAFEDMIELGGI 339

Query:   225 TTEDDYPY-RGKNDRCQTDK-TKHHAVTITGYEAIP 258
              T+DDYPY     + C  D+ T+ +   I  Y ++P
Sbjct:   340 CTDDDYPYVSDAPNLCNIDRCTEKYG--IKNYLSVP 373

 Score = 338 (124.0 bits), Expect = 4.1e-45, Sum P(2) = 4.1e-45
 Identities = 89/260 (34%), Positives = 135/260 (51%)

Query:    51 QKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNV-QYIDYINSQNLSFKLTDNK 109
             Q   P  M+ERF+ +L Q + +    +  ++   +Y   + ++ D    +  S  LT   
Sbjct:   173 QYNSPNEMKERFQVFL-QNAHKVKMHNNNKK--SLYKKELNRFADLTYHEFKSKYLTLRS 229

Query:   110 FADLSNEEFISTYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAF 169
                L N +++   + Y+      ++   +     A+ DWR    VTPVKDQ  CGSCWAF
Sbjct:   230 SKPLKNSKYLLDQINYDAVIK--KYKGNENFD-HAAYDWRLHSGVTPVKDQKNCGSCWAF 286

Query:   170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
             S++ +VE    ++  KL++LSEQELVDC    +N GCNGG +  AFE + ++GG+ T+DD
Sbjct:   287 SSIGSVESQYAIRKNKLITLSEQELVDCSF--KNYGCNGGLINNAFEDMIELGGICTDDD 344

Query:   230 YPY--RGKN----DRCQTDK--TKHH----------AVTITGYEAIPARYA--FQLYSHG 269
             YPY     N    DRC T+K   K++          A+   G  +I    +  F  Y  G
Sbjct:   345 YPYVSDAPNLCNIDRC-TEKYGIKNYLSVPDNKLKEALRFLGPISISIAVSDDFPFYKEG 403

Query:   270 VFDEYCGHQLNHGVTVVGYG 289
             +FD  CG +LNH V +VG+G
Sbjct:   404 IFDGECGDELNHAVMLVGFG 423

 Score = 153 (58.9 bits), Expect = 7.6e-46, Sum P(2) = 7.6e-46
 Identities = 33/85 (38%), Positives = 46/85 (54%)

Query:   263 FQLYSHGVFDEYCGHQLNHGVTVVGYGEDH--------GEK--YWLVKNSWGTSWGEAGY 312
             F  Y  G+FD  CG +LNH V +VG+G           GEK  Y+++KNSWG  WGE G+
Sbjct:   397 FPFYKEGIFDGECGDELNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGF 456

Query:   313 IRMARNSPSSNIGICGILMQASYPV 337
             I +  +  S  +  CG+   A  P+
Sbjct:   457 INIETDE-SGLMRKCGLGTDAFIPL 480

 Score = 61 (26.5 bits), Expect = 3.6e-10, Sum P(2) = 3.6e-10
 Identities = 16/76 (21%), Positives = 37/76 (48%)

Query:    62 FENWLKQYSREYGSEDEWQ-RRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIS 120
             ++N +K  ++   +      +   ++  N    ++I+++N    + D+KF  ++N E I+
Sbjct:   103 YDNKMKDINKNNNNNTSSDFKGLSLFKENKPSNNFIHNENYFINVFDHKFL-MNNVEHIN 161

Query:   121 TYLGY----NKPYNEP 132
              +  +    NK YN P
Sbjct:   162 QFYTFIKTNNKQYNSP 177


>UNIPROTKB|Q8I6U5
            symbol:PF11_0161 "Falcipain-2B" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            EMBL:AE014186 HSSP:O65039 OMA:NNVEHIN GO:GO:0020020 MEROPS:C01.046
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347832.1
            ProteinModelPortal:Q8I6U5 SMR:Q8I6U5 IntAct:Q8I6U5
            MINT:MINT-1546851 EnsemblProtists:PF11_0161:mRNA GeneID:810708
            KEGG:pfa:PF11_0161 EuPathDB:PlasmoDB:PF3D7_1115300 Uniprot:Q8I6U5
        Length = 482

 Score = 345 (126.5 bits), Expect = 7.6e-46, Sum P(2) = 7.6e-46
 Identities = 81/216 (37%), Positives = 122/216 (56%)

Query:    61 RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS-FKLTDNKFADLSNEEFI 119
             +F  ++K  +++Y S +E + RF ++  N   +   N+   S +K   N+FADL+  EF 
Sbjct:   162 QFYTFIKTNNKQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSLYKKELNRFADLTYHEFK 221

Query:   120 STYLGY--NKPYNEPRW-------PSV--QYLGLP----ASVDWRKEGAVTPVKDQGQCG 164
             S YL    +KP    ++        +V  +Y G      A+ DWR    VTPVKDQ  CG
Sbjct:   222 SKYLTLRSSKPLKNSKYLLDQINYDAVIKKYKGNENFDHAAYDWRLHSGVTPVKDQKNCG 281

Query:   165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
             SCWAFS++ +VE    ++  KL++LSEQELVDC    +N GCNGG +  AFE + ++GG+
Sbjct:   282 SCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSF--KNYGCNGGLINNAFEDMIELGGI 339

Query:   225 TTEDDYPY-RGKNDRCQTDK-TKHHAVTITGYEAIP 258
              T+DDYPY     + C  D+ T+ +   I  Y ++P
Sbjct:   340 CTDDDYPYVSDAPNLCNIDRCTEKYG--IKNYLSVP 373

 Score = 338 (124.0 bits), Expect = 4.1e-45, Sum P(2) = 4.1e-45
 Identities = 89/260 (34%), Positives = 135/260 (51%)

Query:    51 QKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNV-QYIDYINSQNLSFKLTDNK 109
             Q   P  M+ERF+ +L Q + +    +  ++   +Y   + ++ D    +  S  LT   
Sbjct:   173 QYNSPNEMKERFQVFL-QNAHKVKMHNNNKK--SLYKKELNRFADLTYHEFKSKYLTLRS 229

Query:   110 FADLSNEEFISTYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAF 169
                L N +++   + Y+      ++   +     A+ DWR    VTPVKDQ  CGSCWAF
Sbjct:   230 SKPLKNSKYLLDQINYDAVIK--KYKGNENFD-HAAYDWRLHSGVTPVKDQKNCGSCWAF 286

Query:   170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
             S++ +VE    ++  KL++LSEQELVDC    +N GCNGG +  AFE + ++GG+ T+DD
Sbjct:   287 SSIGSVESQYAIRKNKLITLSEQELVDCSF--KNYGCNGGLINNAFEDMIELGGICTDDD 344

Query:   230 YPY--RGKN----DRCQTDK--TKHH----------AVTITGYEAIPARYA--FQLYSHG 269
             YPY     N    DRC T+K   K++          A+   G  +I    +  F  Y  G
Sbjct:   345 YPYVSDAPNLCNIDRC-TEKYGIKNYLSVPDNKLKEALRFLGPISISIAVSDDFPFYKEG 403

Query:   270 VFDEYCGHQLNHGVTVVGYG 289
             +FD  CG +LNH V +VG+G
Sbjct:   404 IFDGECGDELNHAVMLVGFG 423

 Score = 153 (58.9 bits), Expect = 7.6e-46, Sum P(2) = 7.6e-46
 Identities = 33/85 (38%), Positives = 46/85 (54%)

Query:   263 FQLYSHGVFDEYCGHQLNHGVTVVGYGEDH--------GEK--YWLVKNSWGTSWGEAGY 312
             F  Y  G+FD  CG +LNH V +VG+G           GEK  Y+++KNSWG  WGE G+
Sbjct:   397 FPFYKEGIFDGECGDELNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGF 456

Query:   313 IRMARNSPSSNIGICGILMQASYPV 337
             I +  +  S  +  CG+   A  P+
Sbjct:   457 INIETDE-SGLMRKCGLGTDAFIPL 480

 Score = 61 (26.5 bits), Expect = 3.6e-10, Sum P(2) = 3.6e-10
 Identities = 16/76 (21%), Positives = 37/76 (48%)

Query:    62 FENWLKQYSREYGSEDEWQ-RRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIS 120
             ++N +K  ++   +      +   ++  N    ++I+++N    + D+KF  ++N E I+
Sbjct:   103 YDNKMKDINKNNNNNTSSDFKGLSLFKENKPSNNFIHNENYFINVFDHKFL-MNNVEHIN 161

Query:   121 TYLGY----NKPYNEP 132
              +  +    NK YN P
Sbjct:   162 QFYTFIKTNNKQYNSP 177


>UNIPROTKB|E9PTT3
            symbol:Ctsr "Protein Ctsr" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            IPI:IPI00627092 Ensembl:ENSRNOT00000024115 RGD:631422
            Uniprot:E9PTT3
        Length = 334

 Score = 330 (121.2 bits), Expect = 1.2e-45, Sum P(2) = 1.2e-45
 Identities = 82/230 (35%), Positives = 122/230 (53%)

Query:    38 LGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN 97
             LG+  GA +      +DP S++  + +   +Y + Y  E+E  RR  ++  N++ I   N
Sbjct:    12 LGVGXGALA------FDP-SLDAEWHDXKTEYEKSYTMEEEGHRR-AVWEENMKMIKLHN 63

Query:    98 SQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSV---QYLG--LPASVDW 148
              +N      F +  N+F DL+ EEF    +  N P    R   +   + +G  LP  VDW
Sbjct:    64 RENSLGKNGFIMEMNEFGDLTAEEFRKMMV--NIPIRSHRKGKIIRKRDVGNVLPKFVDW 121

Query:   149 RKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNG 208
             RK+G VT V++Q  C SCWAF+   A+EG    KTG+L  LS Q LVDC  +  N+GC  
Sbjct:   122 RKKGYVTRVQNQKFCNSCWAFAVTGAIEGQMFNKTGQLTPLSVQNLVDCTKSQGNEGCQW 181

Query:   209 GYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             G    A+E++   GG+  E  YPY+GK   C+ +  KH    ITG+ ++P
Sbjct:   182 GDPHIAYEYVLNNGGLEAEATYPYKGKEGVCRYNP-KHSKAEITGFVSLP 230

 Score = 166 (63.5 bits), Expect = 1.2e-45, Sum P(2) = 1.2e-45
 Identities = 36/88 (40%), Positives = 47/88 (53%)

Query:   256 AIPARY-AFQLYSHGVFDE-YCGHQ-LNHGVTVVGYG----EDHGEKYWLVKNSWGTSWG 308
             A+ A + +F  Y  G++DE  C +  +NH V VVGYG    E  G  YWL+KNSWG  WG
Sbjct:   249 AVDASFNSFGFYKKGLYDEPNCSNNTVNHSVLVVGYGFEGNETDGNSYWLIKNSWGRKWG 308

Query:   309 EAGYIRMARNSPSSNIGICGILMQASYP 336
               GY+++    P      C I   A YP
Sbjct:   309 LRGYMKI----PKDQNNFCAIASYAHYP 332


>ZFIN|ZDB-GENE-040426-1583
            symbol:ctssa "cathepsin S, a" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-040426-1583
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            GeneTree:ENSGT00660000095458 EMBL:CR548627 IPI:IPI00491948
            UniGene:Dr.81560 SMR:Q1L8W8 Ensembl:ENSDART00000053638 OMA:RNTREER
            OrthoDB:EOG480HX9 Uniprot:Q1L8W8
        Length = 328

 Score = 289 (106.8 bits), Expect = 1.2e-45, Sum P(2) = 1.2e-45
 Identities = 68/209 (32%), Positives = 112/209 (53%)

Query:    58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNKFADL 113
             +  ++  W  Q+++ Y +  E + R  ++  N+Q I   N        S+ L  N+ +D+
Sbjct:    23 LTNQWTTWKSQHNKTYRNTREERLRRSVWKQNLQDILLHNEAAAVGLHSYTLGLNQLSDM 82

Query:   114 SNEEF--ISTYLGYNKPYNEPRW--PSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAF 169
             + +E   ++  L  + P     +  PS+Q L  P  V+W + G V+PV++QG CGSCWAF
Sbjct:    83 TADEVNDMNGLLEEDFPDVNATFSPPSLQTL--PQRVNWTEHGMVSPVQNQGPCGSCWAF 140

Query:   170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
             SAV ++E   K +T  LV LS Q L+DC V+  N+GC GG++ +AF ++ +  G+ +   
Sbjct:   141 SAVGSLEAQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRAFLYVIQNRGIDSSTF 200

Query:   230 YPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             YPY  K   C+       A   TG+  +P
Sbjct:   201 YPYEHKEGVCRYS-VSGRAGYCTGFRIVP 228

 Score = 207 (77.9 bits), Expect = 1.2e-45, Sum P(2) = 1.2e-45
 Identities = 41/77 (53%), Positives = 49/77 (63%)

Query:   262 AFQLYSHGVF-DEYCGHQL-NHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
             +F  Y  G++ D  C   L NH V VVGYG ++G+ YWLVKNSWGT+WGE GYIRMARN 
Sbjct:   255 SFHRYRSGIYNDPKCSSALINHAVLVVGYGSENGQDYWLVKNSWGTAWGENGYIRMARNK 314

Query:   320 PSSNIGICGILMQASYP 336
                   +CGI     YP
Sbjct:   315 -----NMCGISSFGIYP 326


>TAIR|locus:2082687
            symbol:AT3G54940 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002686 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HSSP:P53634
            OMA:GGGLMTN EMBL:AY070063 IPI:IPI00528988 RefSeq:NP_567010.5
            UniGene:At.28412 ProteinModelPortal:Q8VYS0 SMR:Q8VYS0 PRIDE:Q8VYS0
            EnsemblPlants:AT3G54940.2 GeneID:824659 KEGG:ath:AT3G54940
            TAIR:At3g54940 PhylomeDB:Q8VYS0 ProtClustDB:CLSN2718801
            ArrayExpress:Q8VYS0 Genevestigator:Q8VYS0 Uniprot:Q8VYS0
        Length = 367

 Score = 370 (135.3 bits), Expect = 1.6e-45, Sum P(2) = 1.6e-45
 Identities = 85/227 (37%), Positives = 117/227 (51%)

Query:    59 EERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEF 118
             E +F  ++  Y + Y + +E+  R GI++ NV         + S      +F+DL+ EEF
Sbjct:    48 ESKFRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAAEHQMMDPSAVHGVTQFSDLTEEEF 107

Query:   119 ISTYLGYNKPYNE------PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
                Y G                P V+  GLP   DWR++G VT VK+QG CGSCWAFS  
Sbjct:   108 KRMYTGVADVGGSRGGTVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTT 167

Query:   173 AAVEGINKLKTGKLVSLSEQELVDCDVNSE-------NQGCNGGYMEKAFEFITKIGGVT 225
              A EG + + TGKL+SLSEQ+LVDCD   +       + GC GG M  A+E++ + GG+ 
Sbjct:   168 GAAEGAHFVSTGKLLSLSEQQLVDCDQACDPKDKKACDNGCGGGLMTNAYEYLMEAGGLE 227

Query:   226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA---RYAFQLYSHG 269
              E  YPY GK   C+ D  K  AV +  +  IP    + A  L  HG
Sbjct:   228 EERSYPYTGKRGHCKFDPEKV-AVRVLNFTTIPLDENQIAANLVRHG 273

 Score = 125 (49.1 bits), Expect = 1.6e-45, Sum P(2) = 1.6e-45
 Identities = 30/75 (40%), Positives = 38/75 (50%)

Query:   264 QLYSHGVFDEY-CGHQ-LNHGVTVVGYGED-------HGEKYWLVKNSWGTSWGEAGYIR 314
             Q Y  GV     C  + +NHGV +VGYG           + YW++KNSWG  WGE GY +
Sbjct:   285 QTYIGGVSCPLICSKRNVNHGVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYK 344

Query:   315 MARNSPSSNIGICGI 329
             + R        ICGI
Sbjct:   345 LCRGHD-----ICGI 354


>UNIPROTKB|F1NT07
            symbol:LOC100857883 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 EMBL:AADN02044012
            EMBL:AADN02044013 EMBL:AADN02044014 IPI:IPI00577314
            Ensembl:ENSGALT00000000192 OMA:IYKHGPV Uniprot:F1NT07
        Length = 317

 Score = 312 (114.9 bits), Expect = 1.6e-45, Sum P(2) = 1.6e-45
 Identities = 78/220 (35%), Positives = 119/220 (54%)

Query:    62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
             F ++ ++  R YGS  E + R  I++ +++++   N   LS+ L  N  AD + +E ++ 
Sbjct:    12 FHHYRRRLGRPYGSAREMEHRQRIFAHHMRFVHSKNRAALSYSLALNHLADRTPQE-MAA 70

Query:   122 YLGYNK---PYNEPRWPSVQYLG--LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVE 176
               G  +   P +   +P+  Y G  LP S+DWR  GAVTPVKDQ  CGSCW+F+   A+E
Sbjct:    71 LRGRRRSGDPNHGLPFPAEHYTGIILPESLDWRMYGAVTPVKDQAVCGSCWSFATTGAME 130

Query:   177 GINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV-TTED--DYPYR 233
             G   LKTG L  LS+Q L+DC     N  C+GG   +A  +I K GG+ +TE    +P  
Sbjct:   131 GALFLKTGVLTPLSQQVLIDCSWGKGNYACDGGEEWRAKGWIKKHGGIASTESPPSFPLV 190

Query:   234 GKNDRCQTDKTKHHAVTITGYEAIPA----RYAFQLYSHG 269
              +N  C  ++++  A  ITGY  + +         +Y HG
Sbjct:   191 LQNGLCHYNQSEMLA-KITGYVNVTSGNITAVKTAIYKHG 229

 Score = 183 (69.5 bits), Expect = 1.6e-45, Sum P(2) = 1.6e-45
 Identities = 41/97 (42%), Positives = 56/97 (57%)

Query:   245 KHHAVTITGYEAIPARYAFQLYSHGVFDE-YCGH---QLNHGVTVVGYGEDHGEKYWLVK 300
             KH  V ++  +A  +   F  YS+G++ E  C +   QL+H V  VGYG   GE YWL+K
Sbjct:   227 KHGPVAVS-IDA--SHKTFSFYSNGIYYEPKCANKPGQLDHAVLAVGYGVLQGETYWLIK 283

Query:   301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
             NSW T WG  GYI MA     +N   CG+  +A+YP+
Sbjct:   284 NSWSTYWGNDGYILMAMKD--NN---CGVATEATYPI 315


>UNIPROTKB|Q0VCU3
            symbol:CTSF "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            HOVERGEN:HBG011513 MEROPS:C01.018 CTD:8722 OMA:LAPPEWD
            OrthoDB:EOG4CC41T GeneTree:ENSGT00660000095458 EMBL:DAAA02063594
            EMBL:BC120003 IPI:IPI00717812 RefSeq:NP_001068884.1 UniGene:Bt.7264
            SMR:Q0VCU3 Ensembl:ENSBTAT00000014587 GeneID:509715 KEGG:bta:509715
            InParanoid:Q0VCU3 NextBio:20869091 Uniprot:Q0VCU3
        Length = 460

 Score = 344 (126.2 bits), Expect = 2.0e-45, Sum P(2) = 2.0e-45
 Identities = 81/203 (39%), Positives = 114/203 (56%)

Query:    50 PQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNV---QYIDYINSQNLSFKLT 106
             PQ +  + M   F++++  Y+R Y S++E   R  ++++N+   Q I  ++     + +T
Sbjct:   152 PQDFSVK-MASIFKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTARYGVT 210

Query:   107 DNKFADLSNEEFISTYLG---YNKPYNEPRWPSVQYLGLPASV-DWRKEGAVTPVKDQGQ 162
               KF+DL+ EEF + YL     + P    R P+     +P    DWR +GAVT VKDQG 
Sbjct:   211 --KFSDLTEEEFRTIYLNPLLKDAPGRNMR-PAQPVTDVPPPQWDWRNKGAVTNVKDQGM 267

Query:   163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
             CGSCWAFS    VEG   LK G L+SLSEQEL+DCD    ++ C GG    A+  I  +G
Sbjct:   268 CGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCD--KTDKACLGGLPSNAYSAIRTLG 325

Query:   223 GVTTEDDYPYRGKNDRCQTDKTK 245
             G+ TEDDY YRG+   C     K
Sbjct:   326 GLETEDDYSYRGRLQTCSFSAEK 348

 Score = 150 (57.9 bits), Expect = 2.0e-45, Sum P(2) = 2.0e-45
 Identities = 35/86 (40%), Positives = 44/86 (51%)

Query:   256 AIPARYAFQLYSHGV---FDEYCGHQL-NHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAG 311
             AI A +  Q Y HG+       C   L +H V +VGYG      +W +KNSWGT WGE G
Sbjct:   379 AINA-FGMQFYRHGISHPLRPLCSPWLIDHAVLLVGYGNRSAIPFWAIKNSWGTDWGEEG 437

Query:   312 YIRMARNSPSSNIGICGILMQASYPV 337
             Y  + R S     G CG+ + AS  V
Sbjct:   438 YYYLHRGS-----GACGVNIMASSAV 458


>UNIPROTKB|Q9UBX1
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=TAS] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_6900 GO:GO:0019886 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0043202
            GO:GO:0004197 HOVERGEN:HBG011513 EMBL:AJ007331 EMBL:AF088886
            EMBL:AF132894 EMBL:AF136279 EMBL:AF071748 EMBL:AF071749
            EMBL:AK313657 EMBL:BC011682 EMBL:BC036451 EMBL:AL137742
            IPI:IPI00002816 RefSeq:NP_003784.2 UniGene:Hs.11590 PDB:1D5U
            PDB:1M6D PDBsum:1D5U PDBsum:1M6D ProteinModelPortal:Q9UBX1
            SMR:Q9UBX1 STRING:Q9UBX1 MEROPS:C01.018 PhosphoSite:Q9UBX1
            DMDM:12643325 PaxDb:Q9UBX1 PeptideAtlas:Q9UBX1 PRIDE:Q9UBX1
            DNASU:8722 Ensembl:ENST00000310325 GeneID:8722 KEGG:hsa:8722
            UCSC:uc001oip.3 CTD:8722 GeneCards:GC11M066332 HGNC:HGNC:2531
            HPA:CAB002141 MIM:603539 neXtProt:NX_Q9UBX1 PharmGKB:PA27031
            InParanoid:Q9UBX1 OMA:LAPPEWD OrthoDB:EOG4CC41T PhylomeDB:Q9UBX1
            BindingDB:Q9UBX1 ChEMBL:CHEMBL2517 ChiTaRS:CTSF
            EvolutionaryTrace:Q9UBX1 GenomeRNAi:8722 NextBio:32715
            ArrayExpress:Q9UBX1 Bgee:Q9UBX1 CleanEx:HS_CTSF
            Genevestigator:Q9UBX1 GermOnline:ENSG00000174080 Uniprot:Q9UBX1
        Length = 484

 Score = 343 (125.8 bits), Expect = 4.1e-45, Sum P(2) = 4.1e-45
 Identities = 83/204 (40%), Positives = 113/204 (55%)

Query:    50 PQKYD-PQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNV---QYIDYINSQNLSFKL 105
             P   D P  M   F+N++  Y+R Y S++E + R  ++ +N+   Q I  ++     + +
Sbjct:   174 PLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGV 233

Query:   106 TDNKFADLSNEEFISTYLGY---NKPYNEPRWP-SVQYLGLPASVDWRKEGAVTPVKDQG 161
             T  KF+DL+ EEF + YL      +P N+ +   SV  L  P   DWR +GAVT VKDQG
Sbjct:   234 T--KFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLA-PPEWDWRSKGAVTKVKDQG 290

Query:   162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKI 221
              CGSCWAFS    VEG   L  G L+SLSEQEL+DCD    ++ C GG    A+  I  +
Sbjct:   291 MCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD--KMDKACMGGLPSNAYSAIKNL 348

Query:   222 GGVTTEDDYPYRGKNDRCQTDKTK 245
             GG+ TEDDY Y+G    C     K
Sbjct:   349 GGLETEDDYSYQGHMQSCNFSAEK 372

 Score = 148 (57.2 bits), Expect = 4.1e-45, Sum P(2) = 4.1e-45
 Identities = 35/86 (40%), Positives = 43/86 (50%)

Query:   256 AIPARYAFQLYSHGV---FDEYCGHQL-NHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAG 311
             AI A +  Q Y HG+       C   L +H V +VGYG      +W +KNSWGT WGE G
Sbjct:   403 AINA-FGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWGTDWGEKG 461

Query:   312 YIRMARNSPSSNIGICGILMQASYPV 337
             Y  + R S     G CG+   AS  V
Sbjct:   462 YYYLHRGS-----GACGVNTMASSAV 482


>DICTYBASE|DDB_G0272298
            symbol:DDB_G0272298 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0272298 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246
            SMART:SM00848 EMBL:AAFI02000008 KO:K01365 RefSeq:XP_645281.1
            ProteinModelPortal:Q559Q3 MEROPS:C01.A53 EnsemblProtists:DDB0203746
            GeneID:8618447 KEGG:ddi:DDB_G0272298 InParanoid:Q559Q3 OMA:PANINWR
            Uniprot:Q559Q3
        Length = 305

 Score = 341 (125.1 bits), Expect = 4.1e-45, Sum P(2) = 4.1e-45
 Identities = 72/203 (35%), Positives = 116/203 (57%)

Query:    68 QYSREYGSEDEWQRRFGIYSSNVQYI-DYINSQNLSFKLTDNKFADLSNEEFISTYLGYN 126
             +Y++ Y +  E+ +RF I+  N  +I ++ N    + ++  N+++DL+ +EF   +  + 
Sbjct:     3 KYNKHYKNNKEYLKRFDIFQDNYNFILNHRNKNGENIEMDLNEYSDLTQKEFADKF--FE 60

Query:   127 KPYNEPRWPSVQYLG-----------LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
             K   EPR   +  +            +P S DWR  GAV  VK+QG C SCW+FSA+ A+
Sbjct:    61 KLVPEPRSGPINDIKATPFKHNVNATIPKSFDWRDHGAVGKVKNQGSCASCWSFSALGAL 120

Query:   176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
             EG   +K G+L+ LSEQ LVDC      +GC  G+M  AF++I   GGV  E  YPY GK
Sbjct:   121 EGHYYIKYGELLDLSEQNLVDCATPFGPKGCKTGWMHDAFKYIISSGGVNLESQYPYTGK 180

Query:   236 NDRCQTDKTKHHAVTITGYEAIP 258
             ++ C+ ++++  A  ++G+  IP
Sbjct:   181 DEVCKFNQSEKEA-KVSGFVMIP 202

 Score = 150 (57.9 bits), Expect = 4.1e-45, Sum P(2) = 4.1e-45
 Identities = 36/96 (37%), Positives = 52/96 (54%)

Query:   248 AVTITGYEAIP---ARYAFQLYSHGVF-DEYCGH-QLNHGVTVVGYGED-HGEKYWLVKN 301
             A+ + G  A+P   +   FQ  S G++  + C      H V  +GYG D +G  Y+L+KN
Sbjct:   212 AIALYGPVAVPIDTSTKEFQHLSGGIYYSDSCDPWNTIHAVLAIGYGTDENGVDYFLMKN 271

Query:   302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
             SWG SWG  G+ ++ R       G CGI+  ASYP+
Sbjct:   272 SWGKSWGTNGFFKVKRGVK----GKCGIVTAASYPI 303


>UNIPROTKB|E2RR02
            symbol:CTSF "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            OMA:LAPPEWD GeneTree:ENSGT00660000095458 EMBL:AAEX03011628
            Ensembl:ENSCAFT00000019742 Uniprot:E2RR02
        Length = 460

 Score = 336 (123.3 bits), Expect = 5.3e-45, Sum P(2) = 5.3e-45
 Identities = 79/203 (38%), Positives = 111/203 (54%)

Query:    50 PQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNV---QYIDYINSQNLSFKLT 106
             PQ +  + M   F+ ++  Y+R Y +++E + R  ++S+N+   Q I  ++     + +T
Sbjct:   151 PQDFSVK-MASVFKEFVTTYNRTYETKEEAEWRMSVFSNNMVRAQKIQALDRGTAQYGIT 209

Query:   107 DNKFADLSNEEFISTYLG----YNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQ 162
               KF+DL+ EEF + YL      N+        S+     P   DWR +GAVT VKDQG 
Sbjct:   210 --KFSDLTEEEFRTIYLNPLLRENRGKKMRLAKSISDHAPPPEWDWRSKGAVTKVKDQGM 267

Query:   163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
             CGSCWAFS    VEG   LK G L+SLSEQEL+DCD    ++ C GG    A+  I  +G
Sbjct:   268 CGSCWAFSVTGNVEGQWFLKEGTLLSLSEQELLDCD--KVDKACLGGLPSNAYSAIMTLG 325

Query:   223 GVTTEDDYPYRGKNDRCQTDKTK 245
             G+ TEDDY Y+G    C     K
Sbjct:   326 GLETEDDYSYQGHLQACSFSAKK 348

 Score = 154 (59.3 bits), Expect = 5.3e-45, Sum P(2) = 5.3e-45
 Identities = 36/86 (41%), Positives = 44/86 (51%)

Query:   256 AIPARYAFQLYSHGV---FDEYCGHQL-NHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAG 311
             AI A +  Q Y HG+       C   L +H V +VGYG   G  +W +KNSWGT WGE G
Sbjct:   379 AINA-FGMQFYRHGISHPLRPLCSPWLIDHAVLLVGYGNRSGIPFWAIKNSWGTDWGEEG 437

Query:   312 YIRMARNSPSSNIGICGILMQASYPV 337
             Y  + R S     G CG+   AS  V
Sbjct:   438 YYYLHRGS-----GACGVNTMASSAV 458


>MGI|MGI:1861434
            symbol:Ctsf "cathepsin F" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=ISS]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISS] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:1861434 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513 MEROPS:C01.018
            CTD:8722 OMA:LAPPEWD OrthoDB:EOG4CC41T EMBL:AF136280 EMBL:AF217224
            EMBL:AJ131851 EMBL:AK075862 EMBL:BC058758 IPI:IPI00126769
            RefSeq:NP_063914.1 UniGene:Mm.29561 ProteinModelPortal:Q9R013
            SMR:Q9R013 STRING:Q9R013 PhosphoSite:Q9R013 PaxDb:Q9R013
            PRIDE:Q9R013 Ensembl:ENSMUST00000119694 GeneID:56464 KEGG:mmu:56464
            UCSC:uc008gbc.1 GeneTree:ENSGT00660000095458 InParanoid:Q9R013
            NextBio:312722 Bgee:Q9R013 CleanEx:MM_CTSF Genevestigator:Q9R013
            GermOnline:ENSMUSG00000006458 Uniprot:Q9R013
        Length = 462

 Score = 335 (123.0 bits), Expect = 1.1e-44, Sum P(2) = 1.1e-44
 Identities = 79/196 (40%), Positives = 109/196 (55%)

Query:    50 PQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNV---QYIDYINSQNLSFKLT 106
             PQ +  + M   F++++  Y+R Y S +E Q R  +++ N+   Q I  ++     + +T
Sbjct:   154 PQDFSVK-MAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGIT 212

Query:   107 DNKFADLSNEEFISTYLG--YNKPYNEPRWPSVQYLGL-PASVDWRKEGAVTPVKDQGQC 163
               KF+DL+ EEF + YL     K       P+     L P   DWRK+GAVT VK+QG C
Sbjct:   213 --KFSDLTEEEFHTIYLNPLLQKESGRKMSPAKSINDLAPPEWDWRKKGAVTEVKNQGMC 270

Query:   164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
             GSCWAFS    VEG   L  G L+SLSEQEL+DCD    ++ C GG    A+  I  +GG
Sbjct:   271 GSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCD--KVDKACLGGLPSNAYAAIKNLGG 328

Query:   224 VTTEDDYPYRGKNDRC 239
             + TEDDY Y+G    C
Sbjct:   329 LETEDDYGYQGHVQTC 344

 Score = 152 (58.6 bits), Expect = 1.1e-44, Sum P(2) = 1.1e-44
 Identities = 35/86 (40%), Positives = 44/86 (51%)

Query:   256 AIPARYAFQLYSHGV---FDEYCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAG 311
             AI A +  Q Y HG+   F   C    ++H V +VGYG      YW +KNSWG+ WGE G
Sbjct:   381 AINA-FGMQFYRHGIAHPFRPLCSPWFIDHAVLLVGYGNRSNIPYWAIKNSWGSDWGEEG 439

Query:   312 YIRMARNSPSSNIGICGILMQASYPV 337
             Y  + R S     G CG+   AS  V
Sbjct:   440 YYYLYRGS-----GACGVNTMASSAV 460


>MGI|MGI:1927229
            symbol:Ctsm "cathepsin M" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008152 "metabolic process" evidence=ISS] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015645 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1927229 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF202528
            EMBL:AY014777 EMBL:AY057446 EMBL:AK005550 EMBL:AK005428
            IPI:IPI00131133 RefSeq:NP_071721.2 UniGene:Mm.279933
            ProteinModelPortal:Q9JL96 SMR:Q9JL96 STRING:Q9JL96 MEROPS:C01.023
            PRIDE:Q9JL96 DNASU:64139 Ensembl:ENSMUST00000099451 GeneID:64139
            KEGG:mmu:64139 UCSC:uc007qwj.1 CTD:64139 InParanoid:Q9JL96
            KO:K09600 OrthoDB:EOG4TTGKR NextBio:319931 Bgee:Q9JL96
            CleanEx:MM_CTSM Genevestigator:Q9JL96 GermOnline:ENSMUSG00000074484
            GermOnline:ENSMUSG00000074871 PANTHER:PTHR12411:SF58 Uniprot:Q9JL96
        Length = 333

 Score = 324 (119.1 bits), Expect = 1.1e-44, Sum P(2) = 1.1e-44
 Identities = 74/212 (34%), Positives = 112/212 (52%)

Query:    54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNK 109
             DP  ++  ++ W  +Y + Y  E+E Q+R  ++  N++ I   N +N      F +  N 
Sbjct:    22 DP-ILDVEWQKWKIKYGKAYSLEEEGQKR-AVWEDNMKKIKLHNGENGLGKHGFTMEMNA 79

Query:   110 FADLSNEEFISTYLGYNKPYNEPRWPSVQY---LGLPASVDWRKEGAVTPVKDQGQCGSC 166
             F D++ EEF    +    P    +  SVQ    + LP  ++W+K G VTPV+ QG+C SC
Sbjct:    80 FGDMTLEEFRKVMIEIPVP-TVKKGKSVQKRLSVNLPKFINWKKRGYVTPVQTQGRCNSC 138

Query:   167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
             WAFS   A+EG    KTG+L+ LS Q LVDC     N GC  G    A  ++ + GG+ +
Sbjct:   139 WAFSVTGAIEGQMFRKTGQLIPLSVQNLVDCSRPQGNWGCYLGNTYLALHYVMENGGLES 198

Query:   227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             E  YPY  K+  C+    ++    ITG+E +P
Sbjct:   199 EATYPYEEKDGSCRYSP-ENSTANITGFEFVP 229

 Score = 163 (62.4 bits), Expect = 1.1e-44, Sum P(2) = 1.1e-44
 Identities = 38/88 (43%), Positives = 50/88 (56%)

Query:   256 AIPARYA-FQLYSHGVFDE-YCGH-QLNHGVTVVGYG----EDHGEKYWLVKNSWGTSWG 308
             AI AR+A F  Y  G++ E  C    + H + +VGYG    E  G KYWLVKNS GT WG
Sbjct:   248 AIDARHASFLFYKRGIYYEPNCSSCVVTHSMLLVGYGFTGRESDGRKYWLVKNSMGTQWG 307

Query:   309 EAGYIRMARNSPSSNIGICGILMQASYP 336
               GY++++R+  +     CGI   A YP
Sbjct:   308 NKGYMKISRDKGNH----CGIATYALYP 331


>FB|FBgn0250848
            symbol:26-29-p "26-29kD-proteinase" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005811
            "lipid particle" evidence=IDA] [GO:0005875 "microtubule associated
            complex" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005875 EMBL:AE014296 GO:GO:0005811 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 MEROPS:I29.003 HSSP:O65039
            EMBL:AY122222 EMBL:AB011376 RefSeq:NP_620470.1 UniGene:Dm.3049
            SMR:Q9V3U6 MINT:MINT-890485 STRING:Q9V3U6
            EnsemblMetazoa:FBtr0075766 GeneID:39547 KEGG:dme:Dmel_CG8947
            UCSC:CG8947-RA CTD:39547 FlyBase:FBgn0250848 InParanoid:Q9V3U6
            OMA:IHSKNRA OrthoDB:EOG4BVQ8T GenomeRNAi:39547 NextBio:814210
            Uniprot:Q9V3U6
        Length = 549

 Score = 318 (117.0 bits), Expect = 1.5e-44, Sum P(2) = 1.5e-44
 Identities = 78/226 (34%), Positives = 120/226 (53%)

Query:    56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSN 115
             + +++ F ++ +++   Y S+ E + R  I+  N++YI   N   L++ L  N  AD + 
Sbjct:   239 EHVDKAFHHFKRKHGVAYHSDTEHEHRKNIFRQNLRYIHSKNRAKLTYTLAVNHLADKTE 298

Query:   116 EEFISTYLGYNKP--YNEPR-WP-SV-QYLG-LPASVDWRKEGAVTPVKDQGQCGSCWAF 169
             EE +    GY     YN  + +P  V +Y   +P   DWR  GAVTPVKDQ  CGSCW+F
Sbjct:   299 EE-LKARRGYKSSGIYNTGKPFPYDVPKYKDEIPDQYDWRLYGAVTPVKDQSVCGSCWSF 357

Query:   170 SAVAAVEGINKLKTG-KLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
               +  +EG   LK G  LV LS+Q L+DC     N GC+GG   + ++++ + GGV TE+
Sbjct:   358 GTIGHLEGAFFLKNGGNLVRLSQQALIDCSWAYGNNGCDGGEDFRVYQWMLQSGGVPTEE 417

Query:   229 DY-PYRGKNDRCQTDKTKHHAVTITGYEAI----PARYAFQLYSHG 269
             +Y PY G++  C  +     A  I G+  +    P  +   L  HG
Sbjct:   418 EYGPYLGQDGYCHVNNVTLVA-PIKGFVNVTSNDPNAFKLALLKHG 462

 Score = 178 (67.7 bits), Expect = 1.5e-44, Sum P(2) = 1.5e-44
 Identities = 40/95 (42%), Positives = 56/95 (58%)

Query:   245 KHHAVTITGYEAIPARYAFQLYSHGVFDE-YCGHQ---LNHGVTVVGYGEDHGEKYWLVK 300
             KH  +++   +A P  ++F  YSHGV+ E  C +    L+H V  VGYG  +GE YWLVK
Sbjct:   460 KHGPLSVA-IDASPKTFSF--YSHGVYYEPTCKNDVDGLDHAVLAVGYGSINGEDYWLVK 516

Query:   301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASY 335
             NSW T WG  GYI M+  +  +N   CG++   +Y
Sbjct:   517 NSWSTYWGNDGYILMS--AKKNN---CGVMTMPTY 546


>TAIR|locus:2117979
            symbol:AT4G23520 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002687 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            KO:K01376 IPI:IPI00527171 RefSeq:NP_567686.2 UniGene:At.32421
            ProteinModelPortal:F4JNL3 SMR:F4JNL3 MEROPS:C01.A22 PRIDE:F4JNL3
            EnsemblPlants:AT4G23520.1 GeneID:828452 KEGG:ath:AT4G23520
            OMA:PANDEIS ArrayExpress:F4JNL3 Uniprot:F4JNL3
        Length = 356

 Score = 467 (169.5 bits), Expect = 2.4e-44, P = 2.4e-44
 Identities = 103/242 (42%), Positives = 148/242 (61%)

Query:    29 VLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEER---FENWLKQYSREYGSE-DEWQRRFG 84
             +L L +++VL  P+ A           +S EE    F+ W+ ++ + Y +   E +RRF 
Sbjct:    11 ILFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFIFQMWMSKHGKTYTNALGEKERRFQ 70

Query:    85 IYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYL---G 141
              +  N+++ID  N++NLS++L   +FADL+ +E+   + G  KP       S +Y+   G
Sbjct:    71 NFKDNLRFIDQHNAKNLSYQLGLTRFADLTVQEYRDLFPGSPKPKQRNLKTSRRYVPLAG 130

Query:   142 --LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDV 199
               LP SVDWR+EGAV+ +KDQG C SCWAFS VAAVEG+NK+ TG+L+SLSEQELVDC  
Sbjct:   131 DQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQELVDC-- 188

Query:   200 NSENQGCNG-GYMEKAFEFITKIGGVTTEDDYPYRGKNDRC-QTDKTKHHAVTITGYEAI 257
             N  N GC G G M+ AF+F+    G+ +E DYPY+G    C +   T +  +TI  YE +
Sbjct:   189 NLVNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQSTSNKVITIDSYEDV 248

Query:   258 PA 259
             PA
Sbjct:   249 PA 250

 Score = 253 (94.1 bits), Expect = 1.1e-21, P = 1.1e-21
 Identities = 48/115 (41%), Positives = 66/115 (57%)

Query:   224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQLYSHGVFDEYCGHQLNHGV 283
             V T D Y     ND     K   H     G +       F LY   +++  CG  L+H +
Sbjct:   239 VITIDSYEDVPANDEISLQKAVAHQPVSVGVDK--KSQEFMLYRSCIYNGPCGTNLDHAL 296

Query:   284 TVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
              +VGYG ++G+ YW+V+NSWGT+WG+AGYI++ARN      G+CGI M ASYP+K
Sbjct:   297 VIVGYGSENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPK-GLCGIAMLASYPIK 350


>FB|FBgn0033874
            symbol:CG6347 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE013599 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 HSSP:P53634 EMBL:AY069609
            RefSeq:NP_610906.1 UniGene:Dm.608 SMR:Q7K0S6 MEROPS:C01.A29
            EnsemblMetazoa:FBtr0087637 GeneID:36531 KEGG:dme:Dmel_CG6347
            UCSC:CG6347-RA FlyBase:FBgn0033874 InParanoid:Q7K0S6 OMA:FEYIRDH
            OrthoDB:EOG4FQZ74 GenomeRNAi:36531 NextBio:799046 Uniprot:Q7K0S6
        Length = 352

 Score = 274 (101.5 bits), Expect = 2.9e-44, Sum P(2) = 2.9e-44
 Identities = 76/231 (32%), Positives = 122/231 (52%)

Query:    31 SLFLLWVLGIPA-GAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
             +++L   LG+   GA S    Q +      + F+++L+Q  + Y  E+   R   I+++ 
Sbjct:     6 TMWLQMTLGLALLGAVSLQQLQSFPKLCDVQNFDDFLRQTGKVYSDEERVYRE-SIFAAK 64

Query:    90 VQYIDYINSQ---NLS-FKLTDNKFADLSNEEFISTYLGYN-KPYNEPRWPS--VQYL-- 140
             +  I   N      +S F+L  N  AD++ +E I+T LG     + E R+ +  + ++  
Sbjct:    65 MSLITLSNKNADNGVSGFRLGVNTLADMTRKE-IATLLGSKISEFGE-RYTNGHINFVTA 122

Query:   141 ------GLPASVDWRKEGAVTPVKDQGQ-CGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
                    LP   DWR++G VTP   QG  CG+CW+F+   A+EG    +TG L SLS+Q 
Sbjct:   123 RNPASANLPEMFDWREKGGVTPPGFQGVGCGACWSFATTGALEGHLFRRTGVLASLSQQN 182

Query:   194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKT 244
             LVDC  +  N GC+GG+ E  FE+I +  GVT  + YPY     +C+ ++T
Sbjct:   183 LVDCADDYGNMGCDGGFQEYGFEYI-RDHGVTLANKYPYTQTEMQCRQNET 232

 Score = 209 (78.6 bits), Expect = 2.9e-44, Sum P(2) = 2.9e-44
 Identities = 37/78 (47%), Positives = 54/78 (69%)

Query:   262 AFQLYSHGVF-DEYCGH-QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
             +F+ YS G++ DE C   +LNH VTVVGYG ++G  YW++KNS+  +WGE G++R+ RN+
Sbjct:   278 SFEQYSGGIYEDEECNQGELNHSVTVVGYGTENGRDYWIIKNSYSQNWGEGGFMRILRNA 337

Query:   320 PSSNIGICGILMQASYPV 337
                  G CGI  + SYP+
Sbjct:   338 G----GFCGIASECSYPI 351

 Score = 37 (18.1 bits), Expect = 3.1e-26, Sum P(2) = 3.1e-26
 Identities = 6/20 (30%), Positives = 13/20 (65%)

Query:   235 KNDRCQTDKTKHHAVTITGY 254
             +++ C   +  +H+VT+ GY
Sbjct:   288 EDEECNQGEL-NHSVTVVGY 306


>MGI|MGI:1861723
            symbol:Ctsr "cathepsin R" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=ISA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=ISA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0030163 "protein
            catabolic process" evidence=ISA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1861723 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0030163
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF245399
            EMBL:AY014778 EMBL:AK014432 EMBL:AK005429 IPI:IPI00120321
            RefSeq:NP_064680.1 UniGene:Mm.315715 ProteinModelPortal:Q9JIA9
            SMR:Q9JIA9 MEROPS:C01.042 PRIDE:Q9JIA9 Ensembl:ENSMUST00000021889
            GeneID:56835 KEGG:mmu:56835 CTD:56835 InParanoid:Q9JIA9 KO:K09601
            OMA:ASHESFK OrthoDB:EOG4ZCT6D NextBio:313379 Bgee:Q9JIA9
            CleanEx:MM_CTSR Genevestigator:Q9JIA9 GermOnline:ENSMUSG00000055679
            Uniprot:Q9JIA9
        Length = 334

 Score = 321 (118.1 bits), Expect = 3.7e-44, Sum P(2) = 3.7e-44
 Identities = 71/209 (33%), Positives = 114/209 (54%)

Query:    57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNKFAD 112
             S++  +++W  +Y++ Y  ++E  +R  ++   ++ I   N +N      F +  N+F D
Sbjct:    24 SLDAEWQDWKIKYNKSYSLKEEKLKRV-VWEEKLKMIKLHNRENSLGKNGFTMKMNEFGD 82

Query:   113 LSNEEFISTYLGYNK-PYNEPRWPSVQYLG--LPASVDWRKEGAVTPVKDQGQCGSCWAF 169
              ++EEF    +  +   + E +    +  G  LP  VDWRK+G VTPV+ QG C +CWAF
Sbjct:    83 QTDEEFRKMMIEISVWTHREGKSIMKREAGSILPKFVDWRKKGYVTPVRRQGDCDACWAF 142

Query:   170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
             +   A+E     +TGKL  LS Q LVDC     N GC GG    AF+++   GG+ +E  
Sbjct:   143 AVTGAIEAQAIWQTGKLTPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVLHNGGLESEAT 202

Query:   230 YPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             YPY GK+  C+ +  K+    ITG+ ++P
Sbjct:   203 YPYEGKDGPCRYNP-KNSKAEITGFVSLP 230

 Score = 161 (61.7 bits), Expect = 3.7e-44, Sum P(2) = 3.7e-44
 Identities = 34/81 (41%), Positives = 46/81 (56%)

Query:   262 AFQLYSHGVFDE-YCGHQ-LNHGVTVVGYG----EDHGEKYWLVKNSWGTSWGEAGYIRM 315
             +F+ Y  G++ E  C    + HGV VVGYG    E  G  YWL+KNSWG  WG  GY+++
Sbjct:   256 SFKNYKGGIYHEPNCSSDTVTHGVLVVGYGFKGIETDGNHYWLIKNSWGKRWGIRGYMKL 315

Query:   316 ARNSPSSNIGICGILMQASYP 336
             A++  +     CGI   A YP
Sbjct:   316 AKDKNNH----CGIASYAHYP 332


>TAIR|locus:505006391
            symbol:CEP3 "cysteine endopeptidase 3" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005783 EMBL:CP002686
            GenomeReviews:BA000014_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 PROSITE:PS00014
            EMBL:AL049659 HSSP:O65039 HOGENOM:HOG000230773 KO:K16292
            EMBL:AK119026 IPI:IPI00525150 PIR:T06707 RefSeq:NP_566901.1
            UniGene:At.3162 ProteinModelPortal:Q9STL5 SMR:Q9STL5 MEROPS:C01.A02
            PRIDE:Q9STL5 EnsemblPlants:AT3G48350.1 GeneID:823993
            KEGG:ath:AT3G48350 TAIR:At3g48350 InParanoid:Q9STL5 OMA:DITHHEF
            PhylomeDB:Q9STL5 ProtClustDB:CLSN2917387 Genevestigator:Q9STL5
            Uniprot:Q9STL5
        Length = 364

 Score = 464 (168.4 bits), Expect = 5.0e-44, P = 5.0e-44
 Identities = 98/207 (47%), Positives = 132/207 (63%)

Query:    62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
             +E W   +S    S  E  +RF ++  NV ++   N +N  +KL  N+FAD+++ EF S+
Sbjct:    38 YERWRGHHSVSRASH-EAIKRFNVFRHNVLHVHRTNKKNKPYKLKINRFADITHHEFRSS 96

Query:   122 YLGYNKPYNE----PRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
             Y G N  ++     P+  S  ++      +P+SVDWR++GAVT VK+Q  CGSCWAFS V
Sbjct:    97 YAGSNVKHHRMLRGPKRGSGGFMYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTV 156

Query:   173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
             AAVEGINK++T KLVSLSEQELVDCD   ENQGC GG ME AFEFI   GG+ TE+ YPY
Sbjct:   157 AAVEGINKIRTNKLVSLSEQELVDCDTE-ENQGCAGGLMEPAFEFIKNNGGIKTEETYPY 215

Query:   233 RGKNDR-CQTDKTKHHAVTITGYEAIP 258
                + + C+ +      VTI G+E +P
Sbjct:   216 DSSDVQFCRANSIGGETVTIDGHEHVP 242

 Score = 279 (103.3 bits), Expect = 2.0e-24, P = 2.0e-24
 Identities = 63/122 (51%), Positives = 77/122 (63%)

Query:   221 IGGVT-TEDDYPYRGKNDRCQTDKT-KHHAVTITGYEAIPARYA-FQLYSHGVFDEYCGH 277
             IGG T T D + +  +ND  +  K   H  V++    AI A  + FQLYS GVF   CG 
Sbjct:   228 IGGETVTIDGHEHVPENDEEELLKAVAHQPVSV----AIDAGSSDFQLYSEGVFIGECGT 283

Query:   278 QLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
             QLNHGV +VGYGE  +G KYW+V+NSWG  WGE GY+R+ R   S N G CGI M+ASYP
Sbjct:   284 QLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIERGI-SENEGRCGIAMEASYP 342

Query:   337 VK 338
              K
Sbjct:   343 TK 344


>RGD|1308181
            symbol:Ctsf "cathepsin F" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1308181 eggNOG:COG4870 HOGENOM:HOG000230774
            KO:K01373 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 MEROPS:C01.018
            CTD:8722 OMA:LAPPEWD OrthoDB:EOG4CC41T GeneTree:ENSGT00660000095458
            EMBL:CH473953 EMBL:BC099780 EMBL:EU253481 IPI:IPI00201100
            RefSeq:NP_001029282.1 UniGene:Rn.25087 SMR:Q499S6
            Ensembl:ENSRNOT00000026718 GeneID:361704 KEGG:rno:361704
            UCSC:RGD:1308181 InParanoid:Q499S6 NextBio:677325
            Genevestigator:Q499S6 Uniprot:Q499S6
        Length = 462

 Score = 330 (121.2 bits), Expect = 5.9e-44, Sum P(2) = 5.9e-44
 Identities = 80/197 (40%), Positives = 109/197 (55%)

Query:    50 PQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNV---QYIDYINSQNLSFKLT 106
             PQ +  + M   F++++  Y+R Y S +E Q R  +++ N+   Q I  ++     + +T
Sbjct:   154 PQDFSVK-MATLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGIT 212

Query:   107 DNKFADLSNEEFISTYLG--YNKPYNEPRW--PSVQYLGLPASVDWRKEGAVTPVKDQGQ 162
               KF+DL+ EEF + YL     K          S+  L  P   DWRK+GAVT VKDQG 
Sbjct:   213 --KFSDLTEEEFHTIYLNPLLQKESGGKMSLAKSINDLA-PPEWDWRKKGAVTEVKDQGM 269

Query:   163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
             CGSCWAFS    VEG   L  G L+SLSEQEL+DCD    ++ C GG    A+  I  +G
Sbjct:   270 CGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCD--KMDKACMGGLPSNAYTAIKNLG 327

Query:   223 GVTTEDDYPYRGKNDRC 239
             G+ TEDDY Y+G    C
Sbjct:   328 GLETEDDYGYQGHVQAC 344

 Score = 150 (57.9 bits), Expect = 5.9e-44, Sum P(2) = 5.9e-44
 Identities = 35/86 (40%), Positives = 43/86 (50%)

Query:   256 AIPARYAFQLYSHGV---FDEYCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAG 311
             AI A +  Q Y HG+   F   C    ++H V +VGYG      YW +KNSWG  WGE G
Sbjct:   381 AINA-FGMQFYRHGIAHPFRPLCSPWFIDHAVLLVGYGNRSNIPYWAIKNSWGRDWGEEG 439

Query:   312 YIRMARNSPSSNIGICGILMQASYPV 337
             Y  + R S     G CG+   AS  V
Sbjct:   440 YYYLYRGS-----GACGVNTMASSAV 460


>ZFIN|ZDB-GENE-030131-9831
            symbol:ctsf "cathepsin F" species:7955 "Danio
            rerio" [GO:0004869 "cysteine-type endopeptidase inhibitor activity"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000010 InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00031 Pfam:PF00112 PRINTS:PR00705 SMART:SM00043
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-030131-9831
            GO:GO:0004869 eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 HOVERGEN:HBG011513 CTD:8722 OrthoDB:EOG4CC41T
            MEROPS:I25.006 EMBL:BC124243 IPI:IPI00503226 RefSeq:NP_001071036.1
            UniGene:Dr.81265 ProteinModelPortal:Q08CH0 SMR:Q08CH0 GeneID:565588
            KEGG:dre:565588 InParanoid:Q08CH0 NextBio:20885952
            ArrayExpress:Q08CH0 Uniprot:Q08CH0
        Length = 473

 Score = 341 (125.1 bits), Expect = 9.6e-44, Sum P(2) = 9.6e-44
 Identities = 76/185 (41%), Positives = 105/185 (56%)

Query:    62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSNEEFIS 120
             F+N++  Y+R Y S++E ++R  I+  N++    + S +  S +    KF+DL+ +EF  
Sbjct:   175 FKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSDLTEDEFRM 234

Query:   121 TYLGYNKPYNEPRWPSVQYLGLPASV------DWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
              YL  N   ++          +PAS       DWR  GAV+PVK+QG CGSCWAFS    
Sbjct:   235 MYL--NPMLSQWSLKKEMKPAIPASAPAPDTWDWRDHGAVSPVKNQGMCGSCWAFSVTGN 292

Query:   175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
             +EG    KTG+L+SLSEQELVDCD    +Q C GG    A+E I  +GG+ TE DY Y G
Sbjct:   293 IEGQWFKKTGQLLSLSEQELVDCD--KLDQACGGGLPSNAYEAIENLGGLETETDYSYTG 350

Query:   235 KNDRC 239
                 C
Sbjct:   351 HKQSC 355

 Score = 137 (53.3 bits), Expect = 9.6e-44, Sum P(2) = 9.6e-44
 Identities = 28/73 (38%), Positives = 41/73 (56%)

Query:   261 YAFQLYSHGVFDE---YCGH-QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMA 316
             +A Q Y  GV      +C    ++H V +VG+G+ +G  +W +KNSWG  +GE GY  + 
Sbjct:   396 FAMQFYRKGVSHPLKIFCNPWMIDHAVLLVGFGQRNGVPFWAIKNSWGEDYGEQGYYYLY 455

Query:   317 RNSPSSNIGICGI 329
             R S     G+CGI
Sbjct:   456 RGS-----GLCGI 463


>TAIR|locus:2128243
            symbol:AT4G11310 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005618 "cell wall"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 GO:GO:0005618 EMBL:CP002687
            GenomeReviews:CT486007_GR EMBL:AL096882 EMBL:AL161531
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            HOGENOM:HOG000230773 KO:K01376 EMBL:AY093066 EMBL:BT000099
            IPI:IPI00520496 PIR:T13022 RefSeq:NP_567376.1 UniGene:At.43189
            ProteinModelPortal:Q9SUT0 SMR:Q9SUT0 IntAct:Q9SUT0 STRING:Q9SUT0
            MEROPS:C01.A20 PaxDb:Q9SUT0 PRIDE:Q9SUT0 EnsemblPlants:AT4G11310.1
            GeneID:826733 KEGG:ath:AT4G11310 TAIR:At4g11310 InParanoid:Q9SUT0
            OMA:EVCHGAD PhylomeDB:Q9SUT0 ProtClustDB:CLSN2689395
            Genevestigator:Q9SUT0 GermOnline:AT4G11310 Uniprot:Q9SUT0
        Length = 364

 Score = 461 (167.3 bits), Expect = 1.0e-43, P = 1.0e-43
 Identities = 98/207 (47%), Positives = 132/207 (63%)

Query:    62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
             FE+W+ ++ + YGS  E +RR  I+  N+++I+  N++NLS++L    FADLS  E+   
Sbjct:    49 FESWMVKHGKVYGSVAEKERRLTIFEDNLRFINNRNAENLSYRLGLTGFADLSLHEYKEV 108

Query:   122 YLGYNK--PYNE------PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
               G +   P N        R+ +     LP SVDWR EGAVT VKDQG C SCWAFS V 
Sbjct:   109 CHGADPRPPRNHVFMTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVG 168

Query:   174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
             AVEG+NK+ TG+LV+LSEQ+L++C  N EN GC GG +E A+EFI K GG+ T++DYPY+
Sbjct:   169 AVEGLNKIVTGELVTLSEQDLINC--NKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYK 226

Query:   234 GKNDRCQTD-KTKHHAVTITGYEAIPA 259
               N  C    K  +  V I GYE +PA
Sbjct:   227 AVNGVCDGRLKENNKNVMIDGYENLPA 253

 Score = 271 (100.5 bits), Expect = 1.4e-23, P = 1.4e-23
 Identities = 57/111 (51%), Positives = 68/111 (61%)

Query:   228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQLYSHGVFDEYCGHQLNHGVTVVG 287
             D Y     ND     K   H   +T      +R  FQLY  GVFD  CG  LNHGV VVG
Sbjct:   246 DGYENLPANDESALMKAVAHQ-PVTAVIDSSSR-EFQLYESGVFDGSCGTNLNHGVVVVG 303

Query:   288 YGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
             YG ++G  YWLVKNS G +WGEAGY++MARN  +   G+CGI M+ASYP+K
Sbjct:   304 YGTENGRDYWLVKNSRGITWGEAGYMKMARNIANPR-GLCGIAMRASYPLK 353


>RGD|1309226
            symbol:Cts7 "cathepsin 7" species:10116 "Rattus norvegicus"
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005768 "endosome" evidence=IEA] [GO:0005794 "Golgi apparatus"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0007067
            "mitosis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IEA] [GO:0051301 "cell division" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 RGD:1309226 GO:GO:0005634
            GO:GO:0005794 GO:GO:0048471 GO:GO:0005615 GO:GO:0051301
            GO:GO:0007067 GO:GO:0005768 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 MEROPS:C01.016 CTD:56092
            GeneTree:ENSGT00560000076577 OrthoDB:EOG44QT2S EMBL:CH474032
            IPI:IPI00870531 RefSeq:NP_001099569.1 UniGene:Rn.218615
            Ensembl:ENSRNOT00000043686 GeneID:290970 KEGG:rno:290970
            UCSC:RGD:1309226 OMA:VESFNAN Uniprot:D3ZZ07
        Length = 331

 Score = 319 (117.4 bits), Expect = 2.0e-43, Sum P(2) = 2.0e-43
 Identities = 69/206 (33%), Positives = 107/206 (51%)

Query:    57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN-L---SFKLTDNKFAD 112
             S++  +E W +  ++ Y  E+E QRR  ++  NV+ I +   QN L   +F +  N+F D
Sbjct:    24 SLDAEWEEWKRNNAKTYSPEEEKQRR-AVWEENVKMIKWHTMQNGLWMNNFTIEMNEFGD 82

Query:   113 LSNEEFISTYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
             ++ EE               +    + + +P ++DWR  G V PV+ QG CG+CWAFS  
Sbjct:    83 MTGEEMRMMTDSSALTLRNGKHIQKRNVKIPKTLDWRDTGCVAPVRSQGGCGACWAFSVA 142

Query:   173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
             A++E     KTGKL+ LS Q L+DC V   N  C+GG    AF+++   GG+  E  YPY
Sbjct:   143 ASIESQLFKKTGKLIPLSVQNLIDCTVTYGNNDCSGGKPYTAFQYVKNNGGLEAEATYPY 202

Query:   233 RGKNDRCQTDKTKHHAVTITGYEAIP 258
               K   C+  + +   V I  +  +P
Sbjct:   203 EAKLRHCRY-RPERSVVKIARFFVVP 227

 Score = 156 (60.0 bits), Expect = 2.0e-43, Sum P(2) = 2.0e-43
 Identities = 38/89 (42%), Positives = 52/89 (58%)

Query:   256 AIPARYA-FQLYSHGVFDE-YCGHQ-LNHGVTVVGYG-EDH---GEKYWLVKNSWGTSWG 308
             AI   +A F+ Y  G++ E  C    L+HG+ +VGYG E H     KYWL+KNS G  WG
Sbjct:   246 AIDGSHASFKRYRGGIYHEPKCRRDTLDHGLLLVGYGYEGHESENRKYWLLKNSHGEQWG 305

Query:   309 EAGYIRMARNSPSSNIGICGILMQASYPV 337
             E GY+++ R+   +N   CGI   A YP+
Sbjct:   306 ERGYMKLPRDQ--NNY--CGIASYAMYPL 330


>FB|FBgn0032228
            symbol:CG5367 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE014134 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 HSSP:P80067
            RefSeq:NP_609387.1 UniGene:Dm.26782 ProteinModelPortal:Q9VKY4
            SMR:Q9VKY4 MEROPS:C01.A30 EnsemblMetazoa:FBtr0080055 GeneID:34401
            KEGG:dme:Dmel_CG5367 UCSC:CG5367-RA FlyBase:FBgn0032228
            InParanoid:Q9VKY4 OMA:QIVDCSV OrthoDB:EOG4THT8X PhylomeDB:Q9VKY4
            GenomeRNAi:34401 NextBio:788324 ArrayExpress:Q9VKY4 Bgee:Q9VKY4
            Uniprot:Q9VKY4
        Length = 338

 Score = 302 (111.4 bits), Expect = 2.5e-43, Sum P(2) = 2.5e-43
 Identities = 74/227 (32%), Positives = 119/227 (52%)

Query:    46 SEGYPQKYDPQSMEERFENWL-KQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----N 100
             SEG     + +S  E+F+N   ++Y R Y   DE  R +  +  N + I+  N       
Sbjct:    23 SEGNSSSANCKSEFEKFKNNNNRKYLRTY---DE-MRSYKAFEENFKVIEEHNQNYKEGQ 78

Query:   101 LSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSV-QYLG------LPASVDWRKEGA 153
              SF+L  N FAD+S + ++  +L   K   E    ++ + +G      +P S+DWR +G 
Sbjct:    79 TSFRLKPNIFADMSTDGYLKGFLRLLKSNIEDSADNMAEIVGSPLMANVPESLDWRSKGF 138

Query:   154 VTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEK 213
             +TP  +Q  CGSC+AFS   ++ G    +TGK++SLS+Q++VDC V+  NQGC GG +  
Sbjct:   139 ITPPYNQLSCGSCYAFSIAESIMGQVFKRTGKILSLSKQQIVDCSVSHGNQGCVGGSLRN 198

Query:   214 AFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR 260
                ++   GG+  + DYPY  +  +CQ        V +T +  +P R
Sbjct:   199 TLSYLQSTGGIMRDQDYPYVARKGKCQFVPDLS-VVNVTSWAILPVR 244

 Score = 172 (65.6 bits), Expect = 2.5e-43, Sum P(2) = 2.5e-43
 Identities = 38/95 (40%), Positives = 56/95 (58%)

Query:   248 AVTITGYEAIPARYA---FQLYSHGVFDE-YCGH-QLNHGVTVVGYGEDHGEKYWLVKNS 302
             AVT  G  AI    +   FQLYS G++D+  C    +NH + V+G+G+D    YW++KN 
Sbjct:   252 AVTHIGPVAISINASPKTFQLYSDGIYDDPLCSSASVNHAMVVIGFGKD----YWILKNW 307

Query:   303 WGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
             WG +WGE GYIR+ +      + +CGI   A+Y +
Sbjct:   308 WGQNWGENGYIRIRKG-----VNMCGIANYAAYAI 337


>UNIPROTKB|F1P3U9
            symbol:CTSH "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005829 "cytosol" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=IEA] [GO:0010628 "positive regulation of gene expression"
            evidence=IEA] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=IEA] [GO:0010813 "neuropeptide catabolic
            process" evidence=IEA] [GO:0010815 "bradykinin catabolic process"
            evidence=IEA] [GO:0016505 "apoptotic protease activator activity"
            evidence=IEA] [GO:0030108 "HLA-A specific activating MHC class I
            receptor activity" evidence=IEA] [GO:0031638 "zymogen activation"
            evidence=IEA] [GO:0031648 "protein destabilization" evidence=IEA]
            [GO:0032526 "response to retinoic acid" evidence=IEA] [GO:0033619
            "membrane protein proteolysis" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0043129
            "surfactant homeostasis" evidence=IEA] [GO:0045766 "positive
            regulation of angiogenesis" evidence=IEA] [GO:0060448 "dichotomous
            subdivision of terminal units involved in lung branching"
            evidence=IEA] [GO:0070324 "thyroid hormone binding" evidence=IEA]
            [GO:0070371 "ERK1 and ERK2 cascade" evidence=IEA] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA]
            [GO:0097208 "alveolar lamellar body" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066
            GO:GO:0005615 GO:GO:0008284 GO:GO:0070371 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0032526 GO:GO:0010628
            GO:GO:0070324 GO:GO:0016505 GO:GO:0010634 GO:GO:0004197
            GO:GO:0042599 GO:GO:0031648 GO:GO:0097067 GO:GO:0031638
            GO:GO:0001913 GeneTree:ENSGT00660000095458 OMA:STSCHKT
            GO:GO:0030108 GO:GO:0010815 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 EMBL:AADN02038832 EMBL:AADN02038831 IPI:IPI00594147
            Ensembl:ENSGALT00000013440 Uniprot:F1P3U9
        Length = 261

 Score = 301 (111.0 bits), Expect = 5.2e-43, Sum P(2) = 5.2e-43
 Identities = 70/162 (43%), Positives = 94/162 (58%)

Query:   103 FKLTDNKFADLSNEEFISTYLGYNKPYN--EPRWPSVQYLG-LPASVDWRKEGA-VTPVK 158
             F +  N+F+D++  EF   YL +++P N    R   ++  G  P +VDWRK+G  VTPVK
Sbjct:     1 FLVALNQFSDMTFAEFKKLYL-WSEPQNCSATRGNFLRSDGPCPEAVDWRKKGNFVTPVK 59

Query:   159 DQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFI 218
             +QG CGSCW FS    +E    + TGKL+SL+EQ LVDC     N GC+GG   +AFE+I
Sbjct:    60 NQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQLLVDCAQAFNNHGCSGGLPSQAFEYI 119

Query:   219 TKIGGVTTEDDYPYRGKNDRC--QTDKT---KHHAVTITGYE 255
                 G+  ED YPYR +N  C  Q DK        + IT Y+
Sbjct:   120 LYNKGLMGEDAYPYRAQNGTCKFQPDKAIAFVKDVINITQYD 161

 Score = 170 (64.9 bits), Expect = 5.2e-43, Sum P(2) = 5.2e-43
 Identities = 39/97 (40%), Positives = 49/97 (50%)

Query:   245 KHHAVTITGYEAIPARYAFQLYSHGVFDE-YCGH---QLNHGVTVVGYGEDHGEKYWLVK 300
             KH+ V+     A      F  Y  GV+    C H   ++NH V  VGYGE+ G  YW+VK
Sbjct:   171 KHNPVSF----AFEVTSDFMHYRKGVYSNPRCEHTPDKVNHAVLAVGYGEEDGRPYWIVK 226

Query:   301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
             NSWG  WG  GY  + R        +CG+   ASYPV
Sbjct:   227 NSWGPLWGMDGYFLIERGK-----NMCGLAACASYPV 258


>RGD|631421
            symbol:Ctsq "cathepsin Q" species:10116 "Rattus norvegicus"
            [GO:0005764 "lysosome" evidence=NAS] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 RGD:631421 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 UniGene:Rn.34875 EMBL:AF187323 IPI:IPI00214897
            PIR:JC7183 RefSeq:NP_640355.1 UniGene:Rn.35820
            ProteinModelPortal:Q9QZE3 SMR:Q9QZE3 STRING:Q9QZE3 MEROPS:C01.039
            PRIDE:Q9QZE3 Ensembl:ENSRNOT00000024208 GeneID:246147
            KEGG:rno:246147 UCSC:RGD:631421 CTD:104002 InParanoid:Q9QZE3
            OMA:ESEDVLM OrthoDB:EOG4HHP48 NextBio:623425 Genevestigator:Q9QZE3
            GermOnline:ENSRNOG00000017946 Uniprot:Q9QZE3
        Length = 343

 Score = 301 (111.0 bits), Expect = 5.2e-43, Sum P(2) = 5.2e-43
 Identities = 74/218 (33%), Positives = 112/218 (51%)

Query:    57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSS--NVQYIDYINSQNLS-FKLTDNKFADL 113
             S++ +++ W  +Y + Y  E+E  +R     +   ++  +  NS   + + +  N FAD+
Sbjct:    24 SLDVQWQEWKIKYEKLYSPEEEVLKRVVWEENVKKIELHNRENSLGKNTYTMEINDFADM 83

Query:   114 SNEEFISTYLGYNKP-YN-EPR-WPSV--QYL--------GLPASVDWRKEGAVTPVKDQ 160
             ++EEF    +G+  P +N E R W      +          LP  VDWR EG VT V+ Q
Sbjct:    84 TDEEFKDMIIGFQLPVHNTEKRLWKRALGSFFPNSWNWRDALPKFVDWRNEGYVTRVRKQ 143

Query:   161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
             G C SCWAF    A+EG    KTGKL+ LS Q L+DC     N+GC  G    AF+++  
Sbjct:   144 GGCSSCWAFPVTGAIEGQMFKKTGKLIPLSVQNLIDCSKPQGNRGCLWGNTYNAFQYVLH 203

Query:   221 IGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
              GG+  E  YPY  K   C+ +  K+ +  ITG+  +P
Sbjct:   204 NGGLEAEATYPYERKEGVCRYNP-KNSSAKITGFVVLP 240

 Score = 170 (64.9 bits), Expect = 5.2e-43, Sum P(2) = 5.2e-43
 Identities = 37/90 (41%), Positives = 49/90 (54%)

Query:   252 TGYEAIPARYAFQLYSHGVFDE-YCGHQLNHGVTVVGYG----EDHGEKYWLVKNSWGTS 306
             TG   I +  +F+ Y  GV+ E  C   +NH V VVGYG    E  G  YWL+KNSWG  
Sbjct:   258 TGVHVISS--SFRFYQKGVYHEPKCSSYVNHAVLVVGYGFEGNETDGNNYWLIKNSWGKR 315

Query:   307 WGEAGYIRMARNSPSSNIGICGILMQASYP 336
             WG  GY+++A++  +     C I   A YP
Sbjct:   316 WGLRGYMKIAKDRNNH----CAIASLAQYP 341


>FB|FBgn0260462
            symbol:CG12163 species:7227 "Drosophila melanogaster"
            [GO:0035071 "salivary gland cell autophagic cell death"
            evidence=IEP] [GO:0048102 "autophagic cell death" evidence=IEP]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0004869 "cysteine-type
            endopeptidase inhibitor activity" evidence=IEA] [GO:0045169
            "fusome" evidence=IDA] [GO:0035220 "wing disc development"
            evidence=IGI] [GO:0022416 "chaeta development" evidence=IGI]
            InterPro:IPR000010 InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00043 SMART:SM00645
            InterPro:IPR000169 EMBL:AE014297 GO:GO:0004869 eggNOG:COG4870
            KO:K01373 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0022416 GO:GO:0035220 GO:GO:0035071
            GO:GO:0045169 GeneTree:ENSGT00660000095458 EMBL:AY121614
            EMBL:BT003231 RefSeq:NP_649521.1 RefSeq:NP_730901.1
            RefSeq:NP_730902.2 UniGene:Dm.7315 ProteinModelPortal:Q9VN93
            SMR:Q9VN93 DIP:DIP-17491N IntAct:Q9VN93 MINT:MINT-763966
            STRING:Q9VN93 MEROPS:C01.A27 PaxDb:Q9VN93
            EnsemblMetazoa:FBtr0078823 GeneID:40628 KEGG:dme:Dmel_CG12163
            UCSC:CG12163-RA FlyBase:FBgn0260462 InParanoid:Q9VN93 OMA:GPRWGEQ
            OrthoDB:EOG4CC2G9 PhylomeDB:Q9VN93 GenomeRNAi:40628 NextBio:819744
            Bgee:Q9VN93 GermOnline:CG12163 Uniprot:Q9VN93
        Length = 614

 Score = 348 (127.6 bits), Expect = 6.5e-43, Sum P(2) = 6.5e-43
 Identities = 76/216 (35%), Positives = 121/216 (56%)

Query:    49 YPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL-SFKLTD 107
             +  ++D   ++  F  +  ++ R Y S  E Q R  I+  N++ I+ +N+  + S K   
Sbjct:   297 HSHRFD--KVDHLFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGI 354

Query:   108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQ----YLG-LPASVDWRKEGAVTPVKDQGQ 162
              +FAD+++ E+      + +   +    S      Y G LP   DWR++ AVT VK+QG 
Sbjct:   355 TEFADMTSSEYKERTGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGS 414

Query:   163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
             CGSCWAFS    +EG+  +KTG+L   SEQEL+DCD    +  CNGG M+ A++ I  IG
Sbjct:   415 CGSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTT--DSACNGGLMDNAYKAIKDIG 472

Query:   223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             G+  E +YPY+ K ++C  ++T  H V + G+  +P
Sbjct:   473 GLEYEAEYPYKAKKNQCHFNRTLSH-VQVAGFVDLP 507

 Score = 134 (52.2 bits), Expect = 6.5e-43, Sum P(2) = 6.5e-43
 Identities = 35/86 (40%), Positives = 44/86 (51%)

Query:   262 AFQLYSHGVFDEY---CGHQ-LNHGVTVVGYG-EDHGE-----KYWLVKNSWGTSWGEAG 311
             A Q Y  GV   +   C  + L+HGV VVGYG  D+        YW+VKNSWG  WGE G
Sbjct:   532 AMQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQG 591

Query:   312 YIRMARNSPSSNIGICGILMQASYPV 337
             Y R+ R   +     CG+   A+  V
Sbjct:   592 YYRVYRGDNT-----CGVSEMATSAV 612

 Score = 38 (18.4 bits), Expect = 2.5e-05, Sum P(2) = 2.5e-05
 Identities = 9/28 (32%), Positives = 15/28 (53%)

Query:   104 KLTDNKFADLSNEEFISTYLGYNKPYNE 131
             +L + K+   S     +  LG +KPY+E
Sbjct:   169 RLLNEKYVHRSRRS-ANDILGRHKPYDE 195


>TAIR|locus:2128253
            symbol:AT4G11320 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002687 GenomeReviews:CT486007_GR EMBL:AL096882 EMBL:AL161531
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 OMA:ICHGADP
            HOGENOM:HOG000230773 KO:K01376 ProtClustDB:CLSN2689395
            EMBL:AY035055 EMBL:AY051062 IPI:IPI00520480 PIR:T13023
            RefSeq:NP_567377.1 UniGene:At.25206 ProteinModelPortal:Q9SUS9
            SMR:Q9SUS9 STRING:Q9SUS9 MEROPS:C01.A21 PaxDb:Q9SUS9 PRIDE:Q9SUS9
            EnsemblPlants:AT4G11320.1 GeneID:826734 KEGG:ath:AT4G11320
            TAIR:At4g11320 InParanoid:Q9SUS9 PhylomeDB:Q9SUS9
            Genevestigator:Q9SUS9 GermOnline:AT4G11320 Uniprot:Q9SUS9
        Length = 371

 Score = 453 (164.5 bits), Expect = 7.3e-43, P = 7.3e-43
 Identities = 99/207 (47%), Positives = 133/207 (64%)

Query:    62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
             FE+W+ ++ + Y S  E +RR  I+  N+++I   N++NLS++L  N+FADLS  E+   
Sbjct:    56 FESWMVKHGKVYDSVAEKERRLTIFEDNLRFITNRNAENLSYRLGLNRFADLSLHEYGEI 115

Query:   122 YLGYNK--PYNEPRWPSV-QYL---G--LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
               G +   P N     S  +Y    G  LP SVDWR EGAVT VKDQG C SCWAFS V 
Sbjct:   116 CHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVG 175

Query:   174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
             AVEG+NK+ TG+LV+LSEQ+L++C  N EN GC GG +E A+EFI   GG+ T++DYPY+
Sbjct:   176 AVEGLNKIVTGELVTLSEQDLINC--NKENNGCGGGKVETAYEFIMNNGGLGTDNDYPYK 233

Query:   234 GKNDRCQTD-KTKHHAVTITGYEAIPA 259
               N  C+   K  +  V I GYE +PA
Sbjct:   234 ALNGVCEGRLKEDNKNVMIDGYENLPA 260

 Score = 271 (100.5 bits), Expect = 1.4e-23, P = 1.4e-23
 Identities = 56/111 (50%), Positives = 68/111 (61%)

Query:   228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQLYSHGVFDEYCGHQLNHGVTVVG 287
             D Y     ND     K   H   +T      +R  FQLY  GVFD  CG  LNHGV VVG
Sbjct:   253 DGYENLPANDEAALMKAVAHQ-PVTAVVDSSSR-EFQLYESGVFDGTCGTNLNHGVVVVG 310

Query:   288 YGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
             YG ++G  YW+VKNS G +WGEAGY++MARN  +   G+CGI M+ASYP+K
Sbjct:   311 YGTENGRDYWIVKNSRGDTWGEAGYMKMARNIANPR-GLCGIAMRASYPLK 360


>DICTYBASE|DDB_G0291191
            symbol:DDB_G0291191 "cysteine protease" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0291191
            GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AAFI02000175 MEROPS:C01.022
            ProtClustDB:CLSZ2429603 RefSeq:XP_635374.1
            ProteinModelPortal:Q54F16 PRIDE:Q54F16 EnsemblProtists:DDB0252831
            GeneID:8628022 KEGG:ddi:DDB_G0291191 OMA:NETQIAS Uniprot:Q54F16
        Length = 352

 Score = 289 (106.8 bits), Expect = 8.5e-43, Sum P(2) = 8.5e-43
 Identities = 74/188 (39%), Positives = 99/188 (52%)

Query:   108 NKFADLSNEEFISTYLGYNKPY---NEPRWPSVQ---YLGLPASVDWRKEGA-------- 153
             NKFADLS EEF   YL   +     + P  P++        PA+ DWR  G         
Sbjct:    76 NKFADLSKEEFKKYYLSSKEARLTDDLPMLPNLSDDIISATPAAFDWRNTGGSTKFPQGT 135

Query:   154 -VTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD---VNSENQ----- 204
              VT VK+QGQCGSCW+FS    VEG + L TG LV LSEQ LVDCD   +  EN+     
Sbjct:   136 PVTAVKNQGQCGSCWSFSTTGNVEGQHYLSTGTLVGLSEQNLVDCDHTCMTYENENVCNA 195

Query:   205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP---ARY 261
             GC+GG    A+ +I K GG+ TE  YPY   +  C+ +  +  A  I+ +  +P    + 
Sbjct:   196 GCDGGLQPNAYNYIIKNGGIQTEATYPYTAVDGECKFNSAQVGA-KISSFTMVPQNETQI 254

Query:   262 AFQLYSHG 269
             A  L+++G
Sbjct:   255 ASYLFNNG 262

 Score = 180 (68.4 bits), Expect = 8.5e-43, Sum P(2) = 8.5e-43
 Identities = 32/67 (47%), Positives = 45/67 (67%)

Query:   263 FQLYSHGVFDEYCGHQLNHGVTVVGYG-EDH--GEK--YWLVKNSWGTSWGEAGYIRMAR 317
             +Q Y  GVFD  CG  L+HG+ +VGYG +D   G+   YW++KNSWG  WGEAGY+++ R
Sbjct:   273 WQFYMGGVFDFPCGQTLDHGILIVGYGAQDTIVGKNTPYWIIKNSWGADWGEAGYLKVER 332

Query:   318 NSPSSNI 324
             N+    +
Sbjct:   333 NTDKCGV 339

 Score = 115 (45.5 bits), Expect = 1.2e-19, Sum P(2) = 1.2e-19
 Identities = 37/110 (33%), Positives = 55/110 (50%)

Query:    57 SMEE-RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSF----KLTDNKFA 111
             S+EE +F  +  +Y++ Y +E E+  +F  + SN+  ID +N Q  +     K   NKFA
Sbjct:    21 SVEESQFIAFQNKYNKIYSAE-EYLVKFETFKSNLLNIDALNKQATTIGSDTKFGVNKFA 79

Query:   112 DLSNEEFISTYLGYNKPY---NEPRWPSVQ---YLGLPASVDWRKEGAVT 155
             DLS EEF   YL   +     + P  P++        PA+ DWR  G  T
Sbjct:    80 DLSKEEFKKYYLSSKEARLTDDLPMLPNLSDDIISATPAAFDWRNTGGST 129


>UNIPROTKB|G3V9F8
            symbol:Ctsm "RCG24133" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015645 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 EMBL:CH474032
            PANTHER:PTHR12411:SF58 Ensembl:ENSRNOT00000045830 RGD:631420
            Uniprot:G3V9F8
        Length = 333

 Score = 309 (113.8 bits), Expect = 3.6e-42, Sum P(2) = 3.6e-42
 Identities = 72/211 (34%), Positives = 108/211 (51%)

Query:    54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNK 109
             DP  ++  ++ W  +Y + Y  E+E Q+R  ++  N++ I   N +N      F +  N 
Sbjct:    22 DPV-LDAEWQKWKIKYEKTYSLEEEGQKR-AVWEENMKKIKLHNGENGLGKHGFTMEMNA 79

Query:   110 FADLSNEEFISTYLGYNKPY--NEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
             F D++ EEF    +    P    E      Q + +P  ++WRK G VTPV+ QG+C  CW
Sbjct:    80 FGDMTIEEFRKLMIEIPIPTVKKENSVQKRQAVNVPNFINWRKRGYVTPVRRQGRCNVCW 139

Query:   168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
             AFS   A+EG    KTG+L+ LS Q LVDC     N GC  G    A +++ + GG+ +E
Sbjct:   140 AFSVAGAIEGQMFQKTGQLIPLSVQNLVDCSRPQGNLGCYLGNTYLALQYVKENGGLESE 199

Query:   228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
               YPY  K   C+       A +IT +E +P
Sbjct:   200 ATYPYEEKEGSCRYHPDNSTA-SITDFEFVP 229

 Score = 154 (59.3 bits), Expect = 3.6e-42, Sum P(2) = 3.6e-42
 Identities = 34/88 (38%), Positives = 50/88 (56%)

Query:   256 AIPARY-AFQLYSHGVFDE-YCGHQL-NHGVTVVGYG----EDHGEKYWLVKNSWGTSWG 308
             AI AR+ +F  Y +G++ E  C   +  H + +VGYG    E  G KYW++KNS G  WG
Sbjct:   248 AIDARHESFLFYRNGIYHEPNCSSSVVTHAMLLVGYGFVGEESDGRKYWILKNSMGNKWG 307

Query:   309 EAGYIRMARNSPSSNIGICGILMQASYP 336
               GY+++A++  +     CGI   A YP
Sbjct:   308 NRGYMKIAKDQGNH----CGIATYALYP 331


>TAIR|locus:2055440
            symbol:AT2G34080 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002685 GenomeReviews:CT485783_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 MEROPS:I29.003 EMBL:AC002341
            HOGENOM:HOG000230773 HSSP:P53634 IPI:IPI00530325 PIR:B84752
            RefSeq:NP_565780.1 UniGene:At.28613 UniGene:At.37859
            ProteinModelPortal:O22961 SMR:O22961 EnsemblPlants:AT2G34080.1
            GeneID:817969 KEGG:ath:AT2G34080 TAIR:At2g34080 InParanoid:O22961
            OMA:SENDYSY PhylomeDB:O22961 ProtClustDB:CLSN2688064
            ArrayExpress:O22961 Genevestigator:O22961 Uniprot:O22961
        Length = 345

 Score = 436 (158.5 bits), Expect = 4.6e-41, P = 4.6e-41
 Identities = 91/217 (41%), Positives = 136/217 (62%)

Query:    56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLS 114
             QSM ++ E W+ ++SREY  E E   R  ++  N+++I+  N + N S+KL  N+FAD +
Sbjct:    33 QSMVDKHEQWMARFSREYRDELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWT 92

Query:   115 NEEFISTYLGYN------------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQ 162
             NEEF++ + G              K  +   W +V  + +  S DWR EGAVTPVK QGQ
Sbjct:    93 NEEFLAIHTGLKGLTEVSPSKVVAKTISSQTW-NVSDM-VVESKDWRAEGAVTPVKYQGQ 150

Query:   163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
             CG CWAFSAVAAVEG+ K+  G LVSLSEQ+L+DCD    ++GC+GG M  AF ++ +  
Sbjct:   151 CGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCD-REYDRGCDGGIMSDAFNYVVQNR 209

Query:   223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA 259
             G+ +E+DY Y+G +  C+++     A  I+G++ +P+
Sbjct:   210 GIASENDYSYQGSDGGCRSNARP--AARISGFQTVPS 244

 Score = 219 (82.2 bits), Expect = 2.8e-16, P = 2.8e-16
 Identities = 57/144 (39%), Positives = 72/144 (50%)

Query:   199 VNSEN----QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
             + SEN    QG +GG    A     +I G  T    P    N+R   +      V+++  
Sbjct:   211 IASENDYSYQGSDGGCRSNARP-AARISGFQT---VP--SNNERALLEAVSRQPVSVS-M 263

Query:   255 EAIPARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYI 313
             +A      F  YS GV+D  CG   NH VT VGYG    G KYWL KNSWG +WGE GYI
Sbjct:   264 DATGD--GFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGTKYWLAKNSWGETWGEKGYI 321

Query:   314 RMARNSPSSNIGICGILMQASYPV 337
             R+ R+      G+CG+   A YPV
Sbjct:   322 RIRRDVAWPQ-GMCGVAQYAFYPV 344


>MGI|MGI:88564
            symbol:Ctsl "cathepsin L" species:10090 "Mus musculus"
            [GO:0004177 "aminopeptidase activity" evidence=ISO] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISO] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005730 "nucleolus"
            evidence=NAS] [GO:0005737 "cytoplasm" evidence=ISO] [GO:0005764
            "lysosome" evidence=ISO] [GO:0005773 "vacuole" evidence=ISO]
            [GO:0005902 "microvillus" evidence=ISO] [GO:0006508 "proteolysis"
            evidence=ISO;IDA] [GO:0007154 "cell communication" evidence=ISO]
            [GO:0008152 "metabolic process" evidence=TAS] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISO;TAS] [GO:0009897 "external side of
            plasma membrane" evidence=ISO] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0030141 "secretory granule" evidence=ISO]
            [GO:0030984 "kininogen binding" evidence=ISO] [GO:0032403 "protein
            complex binding" evidence=ISO] [GO:0042277 "peptide binding"
            evidence=ISO] [GO:0042393 "histone binding" evidence=ISO;NAS]
            [GO:0043005 "neuron projection" evidence=ISO] [GO:0043204
            "perikaryon" evidence=ISO] [GO:0045177 "apical part of cell"
            evidence=ISO] [GO:0048863 "stem cell differentiation" evidence=NAS]
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IMP] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:88564 GO:GO:0005730 GO:GO:0009897 GO:GO:0034698
            GO:GO:0043204 GO:GO:0009749 GO:GO:0030141 GO:GO:0048863
            GO:GO:0051384 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045177 GO:GO:0043005
            GO:GO:0007283 GO:GO:0004177 GO:GO:0005764 GO:GO:0042277
            GO:GO:0009267 GO:GO:0021675 GO:GO:0042393 GO:GO:0005902
            GO:GO:0010259 GO:GO:0004197 GO:GO:0048102 GO:GO:0046697
            HOVERGEN:HBG011513 KO:K01365 OMA:EEFRATH OrthoDB:EOG48PMKF
            MEROPS:C01.032 BRENDA:3.4.22.15 ChiTaRS:CTSL1 EMBL:X06086
            EMBL:J02583 EMBL:M20495 EMBL:AF121837 EMBL:AF121838 EMBL:AF121839
            EMBL:BC068163 EMBL:X04392 IPI:IPI00128154 PIR:S01177
            RefSeq:NP_034114.1 UniGene:Mm.930 PDB:1MVV PDBsum:1MVV
            ProteinModelPortal:P06797 SMR:P06797 STRING:P06797
            PhosphoSite:P06797 PaxDb:P06797 PRIDE:P06797
            Ensembl:ENSMUST00000021933 GeneID:13039 KEGG:mmu:13039 CTD:13039
            InParanoid:P06797 BioCyc:MetaCyc:MONOMER-14812 BindingDB:P06797
            ChEMBL:CHEMBL5291 NextBio:282928 Bgee:P06797 CleanEx:MM_CTSL
            Genevestigator:P06797 GermOnline:ENSMUSG00000021477 GO:GO:0060008
            Uniprot:P06797
        Length = 334

 Score = 435 (158.2 bits), Expect = 5.9e-41, P = 5.9e-41
 Identities = 93/215 (43%), Positives = 131/215 (60%)

Query:    52 KYDPQSMEERFENWLKQYSREYGS-EDEWQRRFGIYSSNVQYI-----DYINSQNLSFKL 105
             K+D Q+    +  W   + R YG+ E+EW+R   I+  N++ I     +Y N Q+  F +
Sbjct:    20 KFD-QTFSAEWHQWKSTHRRLYGTNEEEWRR--AIWEKNMRMIQLHNGEYSNGQH-GFSM 75

Query:   106 TDNKFADLSNEEFISTYLGY-NKPYNEPR-WPSVQYLGLPASVDWRKEGAVTPVKDQGQC 163
               N F D++NEEF     GY ++ + + R +     L +P SVDWR++G VTPVK+QGQC
Sbjct:    76 EMNAFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLKIPKSVDWREKGCVTPVKNQGQC 135

Query:   164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
             GSCWAFSA   +EG   LKTGKL+SLSEQ LVDC     NQGCNGG M+ AF++I + GG
Sbjct:   136 GSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGG 195

Query:   224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             + +E+ YPY  K+  C+  + +      TG+  IP
Sbjct:   196 LDSEESYPYEAKDGSCKY-RAEFAVANDTGFVDIP 229

 Score = 190 (71.9 bits), Expect = 1.0e-12, P = 1.0e-12
 Identities = 52/144 (36%), Positives = 76/144 (52%)

Query:   207 NGGY-MEKAFEFITKIGGVTTEDDYPYR---GKNDRCQTDKTKHHAVTITG--YEAIPAR 260
             NGG   E+++ +  K G      ++      G  D  Q +K    AV   G    A+ A 
Sbjct:   193 NGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPISVAMDAS 252

Query:   261 Y-AFQLYSHGVFDE-YCGHQ-LNHGVTVVGYG----EDHGEKYWLVKNSWGTSWGEAGYI 313
             + + Q YS G++ E  C  + L+HGV +VGYG    + +  KYWLVKNSWG+ WG  GYI
Sbjct:   253 HPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYI 312

Query:   314 RMARNSPSSNIGICGILMQASYPV 337
             ++A++  +     CG+   ASYPV
Sbjct:   313 KIAKDRDNH----CGLATAASYPV 332


>RGD|2448
            symbol:Ctsl1 "cathepsin L1" species:10116 "Rattus norvegicus"
          [GO:0002250 "adaptive immune response" evidence=ISO] [GO:0004177
          "aminopeptidase activity" evidence=IDA] [GO:0004197 "cysteine-type
          endopeptidase activity" evidence=ISO;IDA] [GO:0005576 "extracellular
          region" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IDA]
          [GO:0005764 "lysosome" evidence=ISO;IDA] [GO:0005773 "vacuole"
          evidence=IDA] [GO:0005902 "microvillus" evidence=IDA] [GO:0006508
          "proteolysis" evidence=IEP;ISO] [GO:0007154 "cell communication"
          evidence=IDA] [GO:0007283 "spermatogenesis" evidence=IEP] [GO:0008234
          "cysteine-type peptidase activity" evidence=ISO] [GO:0008584 "male
          gonad development" evidence=IEP] [GO:0009267 "cellular response to
          starvation" evidence=IEP] [GO:0009749 "response to glucose stimulus"
          evidence=IEP] [GO:0009897 "external side of plasma membrane"
          evidence=IDA] [GO:0010259 "multicellular organismal aging"
          evidence=IEP] [GO:0014070 "response to organic cyclic compound"
          evidence=IEP] [GO:0021675 "nerve development" evidence=IEP]
          [GO:0030984 "kininogen binding" evidence=IPI] [GO:0032403 "protein
          complex binding" evidence=IPI] [GO:0034698 "response to gonadotropin
          stimulus" evidence=IEP] [GO:0042277 "peptide binding" evidence=IDA]
          [GO:0042393 "histone binding" evidence=ISO] [GO:0043005 "neuron
          projection" evidence=IDA] [GO:0043204 "perikaryon" evidence=IDA]
          [GO:0046697 "decidualization" evidence=IEP] [GO:0048102 "autophagic
          cell death" evidence=IEP] [GO:0051384 "response to glucocorticoid
          stimulus" evidence=IEP] [GO:0060008 "Sertoli cell differentiation"
          evidence=IEP] [GO:0097067 "cellular response to thyroid hormone
          stimulus" evidence=ISO] [GO:0030141 "secretory granule" evidence=IDA]
          [GO:0045177 "apical part of cell" evidence=IDA] [GO:0060441
          "epithelial tube branching involved in lung morphogenesis"
          evidence=ISO] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
          PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:Y00697 RGD:2448
          GO:GO:0005576 GO:GO:0009897 GO:GO:0034698 GO:GO:0043204 GO:GO:0009749
          GO:GO:0051384 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
          InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
          PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
          PROSITE:PS00139 PROSITE:PS00639 GO:GO:0043005 GO:GO:0007283
          GO:GO:0004177 GO:GO:0005764 GO:GO:0042277 GO:GO:0009267 GO:GO:0021675
          GO:GO:0005902 GO:GO:0010259 GO:GO:0004197 GO:GO:0048102 GO:GO:0046697
          GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 KO:K01365
          OrthoDB:EOG48PMKF MEROPS:C01.032 OMA:FDQNLDT CTD:1514
          BRENDA:3.4.22.15 GO:GO:0060008 EMBL:AF025476 EMBL:BC063175
          EMBL:S85184 IPI:IPI00326070 PIR:S07098 RefSeq:NP_037288.1
          UniGene:Rn.1294 ProteinModelPortal:P07154 SMR:P07154 IntAct:P07154
          STRING:P07154 PhosphoSite:P07154 PRIDE:P07154
          Ensembl:ENSRNOT00000025462 GeneID:25697 KEGG:rno:25697 UCSC:RGD:2448
          InParanoid:P07154 SABIO-RK:P07154 BindingDB:P07154 ChEMBL:CHEMBL2305
          NextBio:607715 Genevestigator:P07154 GermOnline:ENSRNOG00000018566
          Uniprot:P07154
        Length = 334

 Score = 432 (157.1 bits), Expect = 1.2e-40, P = 1.2e-40
 Identities = 90/215 (41%), Positives = 134/215 (62%)

Query:    52 KYDPQSMEERFENWLKQYSREYGS-EDEWQRRFGIYSSNVQYI-----DYINSQNLSFKL 105
             K+D Q+   ++  W   + R YG+ E+EW+R   ++  N++ I     +Y N ++  F +
Sbjct:    20 KFD-QTFNAQWHQWKSTHRRLYGTNEEEWRR--AVWEKNMRMIQLHNGEYSNGKH-GFTM 75

Query:   106 TDNKFADLSNEEFISTYLGY-NKPYNEPR-WPSVQYLGLPASVDWRKEGAVTPVKDQGQC 163
               N F D++NEEF     GY ++ + + R +     L +P +VDWR++G VTPVK+QGQC
Sbjct:    76 EMNAFGDMTNEEFRQIVNGYRHQKHKKGRLFQEPLMLQIPKTVDWREKGCVTPVKNQGQC 135

Query:   164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
             GSCWAFSA   +EG   LKTGKL+SLSEQ LVDC  +  NQGCNGG M+ AF++I + GG
Sbjct:   136 GSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGG 195

Query:   224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             + +E+ YPY  K+  C+  + ++     TG+  IP
Sbjct:   196 LDSEESYPYEAKDGSCKY-RAEYAVANDTGFVDIP 229

 Score = 194 (73.4 bits), Expect = 3.3e-13, P = 3.3e-13
 Identities = 53/144 (36%), Positives = 76/144 (52%)

Query:   207 NGGY-MEKAFEFITKIGGVTTEDDYPYR---GKNDRCQTDKTKHHAVTITG--YEAIPAR 260
             NGG   E+++ +  K G      +Y      G  D  Q +K    AV   G    A+ A 
Sbjct:   193 NGGLDSEESYPYEAKDGSCKYRAEYAVANDTGFVDIPQQEKALMKAVATVGPISVAMDAS 252

Query:   261 Y-AFQLYSHGVFDE-YCGHQ-LNHGVTVVGYG----EDHGEKYWLVKNSWGTSWGEAGYI 313
             + + Q YS G++ E  C  + L+HGV VVGYG    + + +KYWLVKNSWG  WG  GYI
Sbjct:   253 HPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYI 312

Query:   314 RMARNSPSSNIGICGILMQASYPV 337
             ++A++  +     CG+   ASYP+
Sbjct:   313 KIAKDRNNH----CGLATAASYPI 332


>DICTYBASE|DDB_G0282991
            symbol:DDB_G0282991 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0282991 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            EMBL:AAFI02000049 eggNOG:NOG331187 RefSeq:XP_639299.1
            ProteinModelPortal:Q54RQ2 EnsemblProtists:DDB0185304 GeneID:8623870
            KEGG:ddi:DDB_G0282991 InParanoid:Q54RQ2 OMA:PENGNEY Uniprot:Q54RQ2
        Length = 339

 Score = 305 (112.4 bits), Expect = 1.4e-40, Sum P(2) = 1.4e-40
 Identities = 69/186 (37%), Positives = 103/186 (55%)

Query:    58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEE 117
             +E  F  W  +Y++ Y S  E+  RF  +  N +Y+D  N + L   L  N FADLS  E
Sbjct:    23 IENLFIEWTNKYNKIY-SNKEFYMRFNNFKKNKEYVDQWNEKQLETILELNFFADLSRNE 81

Query:   118 FISTYLGYNKPYNEPRWPSVQYLG-LP-------ASVDWRKEGAVTPVKDQGQC-GSCWA 168
             +I+ YL      +     + +Y G L         S+DWR   AVTPVK+QG C G+ ++
Sbjct:    82 YINNYLASFIDISNIEQKNTKYEGNLKNNFNNSIKSIDWRNFDAVTPVKNQGLCSGAGYS 141

Query:   169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
             FSA+  +E  + +K  +L++LSEQ ++DC  +  N GC GG    AF++I K  G+ +E 
Sbjct:   142 FSAIGVIESSHFIKNKELITLSEQNIIDCTTDMGNNGCMGGLALIAFDYIIKQKGIDSEF 201

Query:   229 DYPYRG 234
             +YPY G
Sbjct:   202 NYPYEG 207

 Score = 143 (55.4 bits), Expect = 1.4e-40, Sum P(2) = 1.4e-40
 Identities = 28/75 (37%), Positives = 49/75 (65%)

Query:   259 ARYAFQLYSHGVF-DEYCGHQ-LNHGVTVVGYG--EDHGEKYWLVKNSWGTSWGEAGYIR 314
             ++ +F LY  GV+ D  C    LNHG+  +G+G   ++G +Y+++KNS+G+ WG  GYI 
Sbjct:   260 SQLSFMLYKSGVYKDPSCSSTILNHGILNIGFGVTPENGNEYYILKNSFGSKWGMKGYIY 319

Query:   315 MARNSPSSNIGICGI 329
             ++RN  +++ GI  +
Sbjct:   320 LSRNF-NNHCGISSV 333

 Score = 44 (20.5 bits), Expect = 7.6e-08, Sum P(2) = 7.6e-08
 Identities = 15/50 (30%), Positives = 26/50 (52%)

Query:    92 YIDYINSQNLSFKLTDNKFADL-SNEEFISTYLGY--NKPYNEPRWPSVQ 138
             Y+  +  +NL  + T NK+  + SN+EF   +  +  NK Y + +W   Q
Sbjct:    17 YLCNLEIENLFIEWT-NKYNKIYSNKEFYMRFNNFKKNKEYVD-QWNEKQ 64


>RGD|1309354
            symbol:Ctsw "cathepsin W" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1309354 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 MEROPS:C01.037
            CTD:1521 eggNOG:NOG288820 HOVERGEN:HBG100117 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 EMBL:CH473953 EMBL:BC093401 IPI:IPI00371471
            RefSeq:NP_001019413.1 UniGene:Rn.34406 Ensembl:ENSRNOT00000037404
            GeneID:293676 KEGG:rno:293676 UCSC:RGD:1309354 InParanoid:Q561Q9
            NextBio:636716 Genevestigator:Q561Q9 Uniprot:Q561Q9
        Length = 371

 Score = 284 (105.0 bits), Expect = 8.3e-40, Sum P(3) = 8.3e-40
 Identities = 65/202 (32%), Positives = 108/202 (53%)

Query:    55 PQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL-SFKLTDNKFADL 113
             P  ++E F+ +  Q++R Y +  E+ RR GI++ N+     +  ++L + +     F+DL
Sbjct:    33 PLELKEVFKLFQIQFNRSYSNPAEYTRRLGIFAHNLAQAQRLQEEDLGTAEFGQTPFSDL 92

Query:   114 SNEEFISTY---------LGYNKPYNEPRWPSVQYLGLPASVDWRK-EGAVTPVKDQGQC 163
             + EEF   Y         L   K     RW       +P + DWRK +  ++ +K+QG C
Sbjct:    93 TEEEFGQLYGHQRAPERILNMAKKVKSERWGE----SVPPTCDWRKVKNIISSIKNQGNC 148

Query:   164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
               CWA +A   ++ + ++KT + V +S QEL+DCD    N GCNGG++  A+  +    G
Sbjct:   149 RCCWAIAAADNIQTLWRIKTQQFVDVSVQELLDCD-RCGN-GCNGGFVWDAYITVLNNSG 206

Query:   224 VTTEDDYPYRG--KNDRCQTDK 243
             + +E+DYP++G  K  RC  DK
Sbjct:   207 LASEEDYPFQGHQKPHRCLADK 228

 Score = 112 (44.5 bits), Expect = 8.3e-40, Sum P(3) = 8.3e-40
 Identities = 19/44 (43%), Positives = 26/44 (59%)

Query:   296 YWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
             YW++KNSWG  WGE GY R+ R + +  I    I  +   PVK+
Sbjct:   321 YWILKNSWGAEWGEKGYFRLYRGNNTCGIAKYPITARVDRPVKK 364

 Score = 55 (24.4 bits), Expect = 8.3e-40, Sum P(3) = 8.3e-40
 Identities = 13/34 (38%), Positives = 18/34 (52%)

Query:   264 QLYSHGVFD---EYCG-HQLNHGVTVVGYGEDHG 293
             Q Y  GV       C  H +NH V +VG+G++ G
Sbjct:   268 QYYQKGVIKATPSTCDPHLVNHSVLLVGFGKEKG 301


>UNIPROTKB|P83654
            symbol:P83654 "Ervatamin-C" species:52861 "Tabernaemontana
            divaricata" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0006508 "proteolysis" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005576 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197 PDB:1O0E PDB:2PNS
            PDBsum:1O0E PDBsum:2PNS MEROPS:C01.116 EvolutionaryTrace:P83654
            Uniprot:P83654
        Length = 208

 Score = 416 (151.5 bits), Expect = 6.1e-39, P = 6.1e-39
 Identities = 94/215 (43%), Positives = 121/215 (56%)

Query:   142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
             LP  +DWRK+GAVTPVK+QG CGSCWAFS V+ VE IN+++TG L+SLSEQELVDCD   
Sbjct:     1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCD--K 58

Query:   202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP--- 258
             +N GC GG    A+++I   GG+ T+ +YPY+     CQ   +K   V+I GY  +P   
Sbjct:    59 KNHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPCQA-ASK--VVSIDGYNGVPFCN 115

Query:   259 ---ARYAFQLYSHGV-FDEYCGH--QLNHGVTVVGYGE--DHG-------EKYWLVKNSW 303
                 + A  +    V  D       Q + G+     G   +HG         YW+V+NSW
Sbjct:   116 EXALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQANYWIVRNSW 175

Query:   304 GTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
             G  WGE GYIRM R       G+CGI     YP K
Sbjct:   176 GRYWGEKGYIRMLR---VGGCGLCGIARLPYYPTK 207


>UNIPROTKB|F1MHV4
            symbol:CTSW "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 OMA:GRCGDGC EMBL:DAAA02063574
            IPI:IPI00716321 Ensembl:ENSBTAT00000027681 Uniprot:F1MHV4
        Length = 375

 Score = 290 (107.1 bits), Expect = 6.2e-38, Sum P(3) = 6.2e-38
 Identities = 76/230 (33%), Positives = 112/230 (48%)

Query:    32 LFLLWVLGIPAGAWSEGYPQKYDPQSME--ERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
             L  L V G+  G       Q   PQ +E  E F  +  QY+R Y +  E+ RR  I++ N
Sbjct:    10 LLALLVAGLAQGIKDSLRGQDPGPQPLELKEVFRLFQMQYNRSYPNPAEYARRLDIFAQN 69

Query:    90 VQYIDYINSQNLS---FKLTDNKFADLSNEEFISTY--------LGYNKPYNEPRWPSVQ 138
             +     +  ++L    F +T  +F+DL+ EEF+  Y        LG ++      W   +
Sbjct:    70 LAKAQRLQEEDLGTAEFGVT--QFSDLTEEEFVQLYGSQVAGEALGVSRKVGSEEWGESE 127

Query:   139 YLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQ-ELVDC 197
                 P + DWRK G ++PV+DQ  C  CWA +A   +E +  +K    V +S Q EL+DC
Sbjct:   128 ----PQTCDWRKVGTISPVRDQRNCNCCWAMAAAGNIEALWAIKFRHFVEVSVQPELLDC 183

Query:   198 DVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG--KNDRCQTDKTK 245
             D    N GC GG++  AF  +    G+ +E DYP+ G  K  RC   K K
Sbjct:   184 D-RCGN-GCRGGFVWDAFLTVLNNSGLASEKDYPFNGSGKTHRCLAKKYK 231

 Score = 104 (41.7 bits), Expect = 6.2e-38, Sum P(3) = 6.2e-38
 Identities = 18/44 (40%), Positives = 25/44 (56%)

Query:   296 YWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
             YW++KNSWG  WGE GY R+ R S +  I    +  +   P K+
Sbjct:   325 YWILKNSWGPQWGEEGYFRLHRGSNTCGITKFPVTARVDKPKKQ 368

 Score = 39 (18.8 bits), Expect = 6.2e-38, Sum P(3) = 6.2e-38
 Identities = 6/13 (46%), Positives = 11/13 (84%)

Query:   278 QLNHGVTVVGYGE 290
             Q++H V +VG+G+
Sbjct:   287 QVDHSVLLVGFGK 299


>UNIPROTKB|F1S4J6
            symbol:Ssc.54235 "Cathepsin L1" species:9823 "Sus scrofa"
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197
            GO:GO:0002250 GeneTree:ENSGT00560000076577 GO:GO:0097067
            EMBL:CU571031 RefSeq:XP_003130681.1 Ensembl:ENSSSCT00000011983
            GeneID:100515919 KEGG:ssc:100515919 OMA:IAICATK Uniprot:F1S4J6
        Length = 332

 Score = 400 (145.9 bits), Expect = 3.0e-37, P = 3.0e-37
 Identities = 94/233 (40%), Positives = 135/233 (57%)

Query:    32 LFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQ 91
             L   + LGI + A     P ++D  S++  +  W   + + YG  +E +RR  I+  N++
Sbjct:     6 LLAAFCLGIASAA-----P-RHD-HSLDADWYKWKATHRKLYGLNEEGRRR-AIWEKNMK 57

Query:    92 YIDYINSQNL----SFKLTDNKFADLSNEEFISTYLGY-NKPYNEPR-WPSVQYLGLPAS 145
              I+  N ++     SF +  N F D++NEEF  T  G+ N+ + + + +        P S
Sbjct:    58 MIERHNWEHRQGKHSFTMAMNAFGDMTNEEFRKTMNGFQNQKHKKGKVFLDAGSALTPHS 117

Query:   146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQG 205
             VDWR++G VT VK+QG CGSCWAFSA  A+EG    KT KL+SLSEQ LVDC     N+G
Sbjct:   118 VDWREKGYVTAVKNQGHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSWPEGNEG 177

Query:   206 CNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             CNGG M+ AF++I   GG+ +E+ YPY GK+  C+  K +  A   TGY  IP
Sbjct:   178 CNGGLMDNAFQYIKDNGGLDSEESYPYFGKDGSCKY-KPQSSAANDTGYVDIP 229

 Score = 197 (74.4 bits), Expect = 1.4e-13, P = 1.4e-13
 Identities = 55/143 (38%), Positives = 74/143 (51%)

Query:   207 NGGY-MEKAFEFITKIGGVTTEDDYPYR---GKNDRCQTDKTKHHAVTITG--YEAIPAR 260
             NGG   E+++ +  K G    +         G  D  + +K    AV   G     I A 
Sbjct:   193 NGGLDSEESYPYFGKDGSCKYKPQSSAANDTGYVDIPKQEKALMKAVATVGPISVGIDAS 252

Query:   261 Y-AFQLYSHGV-FDEYCGHQ-LNHGVTVVGYGED--HGE-KYWLVKNSWGTSWGEAGYIR 314
             + +FQ YS G+ F+  C  + L+HGV VVGYG +  H   KYWLVKNSWG +WG  GYI+
Sbjct:   253 HESFQFYSTGIYFEPQCSSEDLDHGVLVVGYGVEGAHSNNKYWLVKNSWGNTWGMDGYIK 312

Query:   315 MARNSPSSNIGICGILMQASYPV 337
             M ++  +     CGI   ASYPV
Sbjct:   313 MTKDQNNH----CGIATMASYPV 331


>UNIPROTKB|P83443
            symbol:P83443 "Macrodontain-1" species:203992 "Pseudananas
            sagenarius" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0006508 "proteolysis" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197
            ProteinModelPortal:P83443 SMR:P83443 MEROPS:C01.028 Uniprot:P83443
        Length = 213

 Score = 397 (144.8 bits), Expect = 6.3e-37, P = 6.3e-37
 Identities = 85/214 (39%), Positives = 120/214 (56%)

Query:   142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
             +P S+DWR  GAV  VK+QG CG CWAF+A+A VEGI K++ G LV LSEQE++DC V+ 
Sbjct:     2 VPQSIDWRDYGAVNEVKNQGPCGGCWAFAAIATVEGIYKIRKGNLVYLSEQEVLDCAVS- 60

Query:   202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR- 260
                GC GG++ +A++FI    GVTT+++YPYR     C  +   + A  ITGY  +    
Sbjct:    61 --YGCKGGWVNRAYDFIISNNGVTTDENYPYRAYQGTCNANYFPNSAY-ITGYSYVRRND 117

Query:   261 -----YAFQLYSHGVFDEYCGHQLNH---GVTV--VGYGEDHG--------EKYWLVKNS 302
                  YA          +  G    +   GV     G+  +H         + YW+V+NS
Sbjct:   118 ESHMMYAVSNQPIAALIDASGDNFQYYKGGVYSGPCGFSLNHAITIIGYGRDSYWIVRNS 177

Query:   303 WGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
             WG+SWG+ GY+R+ R+   S  G+CGI M   +P
Sbjct:   178 WGSSWGQGGYVRIRRDVSHSG-GVCGIAMSPLFP 210


>DICTYBASE|DDB_G0272742
            symbol:DDB_G0272742 species:44689 "Dictyostelium
            discoideum" [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0272742 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639 EMBL:AAFI02000008
            eggNOG:NOG331187 RefSeq:XP_644986.1 ProteinModelPortal:Q7KWP5
            PRIDE:Q7KWP5 EnsemblProtists:DDB0168242 GeneID:8618663
            KEGG:ddi:DDB_G0272742 InParanoid:Q7KWP5 OMA:ATESAHF Uniprot:Q7KWP5
        Length = 345

 Score = 266 (98.7 bits), Expect = 1.0e-36, Sum P(2) = 1.0e-36
 Identities = 72/212 (33%), Positives = 110/212 (51%)

Query:    62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
             F  W+    R Y S  E+  R+  + SN+ +I+  NS+     L  N+FAD+SNEE+   
Sbjct:    29 FTAWMTSNQRTYASS-EFTNRYNTFKSNLDFINQWNSKGSKTVLALNEFADISNEEYRKN 87

Query:   122 YLGYNKPYN-----------EPRWPSVQYLGLPAS-VDWRKEGAVTPVKDQ-GQCGSCWA 168
             YL  +   N           +    S    G  +S +DWRK+GAV  VK Q G CGS W 
Sbjct:    88 YLRNDNNINKLSSLLINDKEDKEIKSSSSSGSGSSGIDWRKKGAVPSVKSQIGGCGS-WP 146

Query:   169 FSAVAAVEGINKLKTGK--LVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
              +AV A E  + L   K   +SLS Q L+DC  ++ N+ C  G + +AF++I + GG+ +
Sbjct:   147 ITAVGATESAHFLANPKDPFISLSMQNLIDC--SNLNKQCYQGTVNEAFQYIIENGGIDS 204

Query:   227 EDDYPYRG-KNDRCQTDKTKHHAVTITGYEAI 257
             E+ Y + G +  +C+ + +   A  IT YE +
Sbjct:   205 EESYKFSGGEPGKCKYNSSNSVA-KITSYEKV 235

 Score = 145 (56.1 bits), Expect = 1.0e-36, Sum P(2) = 1.0e-36
 Identities = 33/87 (37%), Positives = 49/87 (56%)

Query:   262 AFQLYSHGVFDE-YCGH-QLNHGVTVVGYGE------D---HGEKYWLVKNSWGTSWGEA 310
             +FQ YS G++ E  C    LNH + +VG+ +      D   H   YW+V+NS+G +WGE 
Sbjct:   262 SFQFYSSGIYYEPSCNSTDLNHSILIVGFSDFSTTPTDSLKHSSNYWIVQNSFGKNWGEN 321

Query:   311 GYIRMARNSPSSNIGICGILMQASYPV 337
             GYI M+++   +    CGI   ASY +
Sbjct:   322 GYIFMSKDRDDN----CGISKMASYVI 344


>MGI|MGI:1338045
            symbol:Ctsw "cathepsin W" species:10090 "Mus musculus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 MGI:MGI:1338045 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 MEROPS:C01.037
            CTD:1521 eggNOG:NOG288820 HOVERGEN:HBG100117 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 EMBL:AF014941 EMBL:AC122861 IPI:IPI00111727
            RefSeq:NP_034115.2 UniGene:Mm.113590 ProteinModelPortal:P56203
            SMR:P56203 PhosphoSite:P56203 PRIDE:P56203 DNASU:13041
            Ensembl:ENSMUST00000025844 GeneID:13041 KEGG:mmu:13041
            InParanoid:P56203 NextBio:282936 Bgee:P56203 CleanEx:MM_CTSW
            Genevestigator:P56203 GermOnline:ENSMUSG00000024910 Uniprot:P56203
        Length = 371

 Score = 260 (96.6 bits), Expect = 2.8e-36, Sum P(3) = 2.8e-36
 Identities = 61/200 (30%), Positives = 106/200 (53%)

Query:    55 PQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL-SFKLTDNKFADL 113
             P  ++E F+ +  +++R Y +  E+ RR  I++ N+     +  ++L + +  +  F+DL
Sbjct:    33 PLELKEVFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLGTAEFGETPFSDL 92

Query:   114 SNEEFISTYLGYNKPYNEPRWPS-VQYL----GLPASVDWRK-EGAVTPVKDQGQCGSCW 167
             + EEF   Y     P   P     V+       +P + DWRK +  ++ VK+QG C  CW
Sbjct:    93 TEEEFGQLYGQERSPERTPNMTKKVESNTWGESVPRTCDWRKAKNIISSVKNQGSCKCCW 152

Query:   168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
             A +A   ++ + ++K  + V +S QEL+DC+    N GCNGG++  A+  +    G+ +E
Sbjct:   153 AMAAADNIQALWRIKHQQFVDVSVQELLDCE-RCGN-GCNGGFVWDAYLTVLNNSGLASE 210

Query:   228 DDYPYRG--KNDRCQTDKTK 245
              DYP++G  K  RC   K K
Sbjct:   211 KDYPFQGDRKPHRCLAKKYK 230

 Score = 115 (45.5 bits), Expect = 2.8e-36, Sum P(3) = 2.8e-36
 Identities = 19/48 (39%), Positives = 26/48 (54%)

Query:   292 HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
             H   YW++KNSWG  WGE GY R+ R + +  +       Q   PVK+
Sbjct:   317 HSSPYWILKNSWGAHWGEKGYFRLYRGNNTCGVTKYPFTAQVDSPVKK 364

 Score = 42 (19.8 bits), Expect = 2.8e-36, Sum P(3) = 2.8e-36
 Identities = 11/32 (34%), Positives = 17/32 (53%)

Query:   264 QLYSHGVFD---EYCG-HQLNHGVTVVGYGED 291
             Q Y  GV       C   Q++H V +VG+G++
Sbjct:   268 QHYQKGVIKATPSSCDPRQVDHSVLLVGFGKE 299


>UNIPROTKB|Q5T8F0
            symbol:CTSL1 "Cathepsin L1 light chain" species:9606 "Homo
            sapiens" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            EMBL:AL160279 UniGene:Hs.731507 UniGene:Hs.731952 HGNC:HGNC:2537
            ChiTaRS:CTSL1 IPI:IPI00640540 SMR:Q5T8F0 Ensembl:ENST00000342020
            ChEMBL:CHEMBL1293261 Uniprot:Q5T8F0
        Length = 225

 Score = 388 (141.6 bits), Expect = 5.7e-36, P = 5.7e-36
 Identities = 78/182 (42%), Positives = 111/182 (60%)

Query:    57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFAD 112
             S+E ++  W   ++R YG  +E  RR  ++  N++ I+  N +      SF +  N F D
Sbjct:    24 SLEAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYREGKHSFTMAMNAFGD 82

Query:   113 LSNEEFISTYLGYN--KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
             +++EEF     G+   KP     +    +   P SVDWR++G VTPVK+QGQCGSCWAFS
Sbjct:    83 MTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFS 142

Query:   171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
             A  A+EG    KTG+L+SLSEQ LVDC     N+GCNGG M+ AF+++   GG+ +E+ Y
Sbjct:   143 ATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESY 202

Query:   231 PY 232
             PY
Sbjct:   203 PY 204


>GENEDB_PFALCIPARUM|PF11_0162
            symbol:PF11_0162 "falcipain-3" species:5833
            "Plasmodium falciparum" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 HSSP:P43235 EMBL:AE014186 GO:GO:0020020
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347833.1
            ProteinModelPortal:Q8IIL0 SMR:Q8IIL0 MEROPS:C01.063
            EnsemblProtists:PF11_0162:mRNA GeneID:810709 KEGG:pfa:PF11_0162
            EuPathDB:PlasmoDB:PF3D7_1115400 OMA:ENDEDYW ChEMBL:CHEMBL1250373
            Uniprot:Q8IIL0
        Length = 492

 Score = 268 (99.4 bits), Expect = 9.2e-36, Sum P(2) = 9.2e-36
 Identities = 79/218 (36%), Positives = 110/218 (50%)

Query:    51 QKYDP-QSMEERF----ENWLKQYSREYGSEDEWQR---RFGIYSSNVQYIDYINSQNLS 102
             +KY+  + M++RF    EN+ K       +   ++R   +FG  S      +   S+ L+
Sbjct:   180 KKYETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFGDLSP-----EEFRSKYLN 234

Query:   103 FKLTDNKFADLSNE-EFISTYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQG 161
              K T   F  LS    + + Y    K Y     P+   L   A  DWR  G VTPVKDQ 
Sbjct:   235 LK-THGPFKTLSPPVSYEANYEDVIKKYK----PADAKLDRIA-YDWRLHGGVTPVKDQA 288

Query:   162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKI 221
              CGSCWAFS+V +VE    ++   L   SEQELVDC V  +N GC GGY+  AF+ +  +
Sbjct:   289 LCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCSV--KNNGCYGGYITNAFDDMIDL 346

Query:   222 GGVTTEDDYPYRGK-NDRCQTDKTKHHAVTITGYEAIP 258
             GG+ ++DDYPY     + C   K  +   TI  Y +IP
Sbjct:   347 GGLCSQDDYPYVSNLPETCNL-KRCNERYTIKSYVSIP 383

 Score = 145 (56.1 bits), Expect = 9.2e-36, Sum P(2) = 9.2e-36
 Identities = 36/92 (39%), Positives = 47/92 (51%)

Query:   256 AIPARYAFQLYSHGVFDEYCGHQLNHGVTVVGYG------EDHG--EK--YWLVKNSWGT 305
             +I A   F  Y  G +D  CG   NH V +VGYG      ED G  EK  Y+++KNSWG+
Sbjct:   400 SIAASDDFAFYRGGFYDGECGAAPNHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGS 459

Query:   306 SWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
              WGE GYI +  +        C I  +A  P+
Sbjct:   460 DWGEGGYINLETDENGYK-KTCSIGTEAYVPL 490

 Score = 116 (45.9 bits), Expect = 6.5e-15, Sum P(2) = 6.5e-15
 Identities = 30/83 (36%), Positives = 49/83 (59%)

Query:    44 AWSEGYPQKYDPQSME--ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-N 100
             ++S  +  K+   ++E    F  +LK+ +++Y + +E Q+RF I+S N + I+  N + N
Sbjct:   151 SYSNLFDTKFLMDNLETVNLFYIFLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTN 210

Query:   101 LSFKLTDNKFADLSNEEFISTYL 123
               +K   NKF DLS EEF S YL
Sbjct:   211 SLYKRGMNKFGDLSPEEFRSKYL 233

 Score = 59 (25.8 bits), Expect = 5.2e-09, Sum P(2) = 5.2e-09
 Identities = 15/66 (22%), Positives = 32/66 (48%)

Query:    51 QKYDPQSMEERFENWLKQYS-REYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNK 109
             +K+    +EE   ++ K+   R  G+E+      GI   + + + ++N +N + K+ +N 
Sbjct:    89 KKFIVSKLEELISSYDKEKKMRTTGAEENNMNMNGIDDKDNKSVSFVNKKNGNLKVNNNN 148

Query:   110 FADLSN 115
                 SN
Sbjct:   149 QVSYSN 154

 Score = 45 (20.9 bits), Expect = 0.00027, Sum P(2) = 0.00027
 Identities = 12/45 (26%), Positives = 21/45 (46%)

Query:   212 EKAFEFITKIG-GVTTEDDYP-YRGKNDRCQTDKTKHHAVTITGY 254
             ++A  ++  I   +   DD+  YRG     +     +HAV + GY
Sbjct:   388 KEALRYLGPISISIAASDDFAFYRGGFYDGECGAAPNHAVILVGY 432


>UNIPROTKB|Q8IIL0
            symbol:PF11_0162 "Falcipain-3" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 HSSP:P43235 EMBL:AE014186 GO:GO:0020020
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347833.1
            ProteinModelPortal:Q8IIL0 SMR:Q8IIL0 MEROPS:C01.063
            EnsemblProtists:PF11_0162:mRNA GeneID:810709 KEGG:pfa:PF11_0162
            EuPathDB:PlasmoDB:PF3D7_1115400 OMA:ENDEDYW ChEMBL:CHEMBL1250373
            Uniprot:Q8IIL0
        Length = 492

 Score = 268 (99.4 bits), Expect = 9.2e-36, Sum P(2) = 9.2e-36
 Identities = 79/218 (36%), Positives = 110/218 (50%)

Query:    51 QKYDP-QSMEERF----ENWLKQYSREYGSEDEWQR---RFGIYSSNVQYIDYINSQNLS 102
             +KY+  + M++RF    EN+ K       +   ++R   +FG  S      +   S+ L+
Sbjct:   180 KKYETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFGDLSP-----EEFRSKYLN 234

Query:   103 FKLTDNKFADLSNE-EFISTYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQG 161
              K T   F  LS    + + Y    K Y     P+   L   A  DWR  G VTPVKDQ 
Sbjct:   235 LK-THGPFKTLSPPVSYEANYEDVIKKYK----PADAKLDRIA-YDWRLHGGVTPVKDQA 288

Query:   162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKI 221
              CGSCWAFS+V +VE    ++   L   SEQELVDC V  +N GC GGY+  AF+ +  +
Sbjct:   289 LCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCSV--KNNGCYGGYITNAFDDMIDL 346

Query:   222 GGVTTEDDYPYRGK-NDRCQTDKTKHHAVTITGYEAIP 258
             GG+ ++DDYPY     + C   K  +   TI  Y +IP
Sbjct:   347 GGLCSQDDYPYVSNLPETCNL-KRCNERYTIKSYVSIP 383

 Score = 145 (56.1 bits), Expect = 9.2e-36, Sum P(2) = 9.2e-36
 Identities = 36/92 (39%), Positives = 47/92 (51%)

Query:   256 AIPARYAFQLYSHGVFDEYCGHQLNHGVTVVGYG------EDHG--EK--YWLVKNSWGT 305
             +I A   F  Y  G +D  CG   NH V +VGYG      ED G  EK  Y+++KNSWG+
Sbjct:   400 SIAASDDFAFYRGGFYDGECGAAPNHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGS 459

Query:   306 SWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
              WGE GYI +  +        C I  +A  P+
Sbjct:   460 DWGEGGYINLETDENGYK-KTCSIGTEAYVPL 490

 Score = 116 (45.9 bits), Expect = 6.5e-15, Sum P(2) = 6.5e-15
 Identities = 30/83 (36%), Positives = 49/83 (59%)

Query:    44 AWSEGYPQKYDPQSME--ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-N 100
             ++S  +  K+   ++E    F  +LK+ +++Y + +E Q+RF I+S N + I+  N + N
Sbjct:   151 SYSNLFDTKFLMDNLETVNLFYIFLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTN 210

Query:   101 LSFKLTDNKFADLSNEEFISTYL 123
               +K   NKF DLS EEF S YL
Sbjct:   211 SLYKRGMNKFGDLSPEEFRSKYL 233

 Score = 59 (25.8 bits), Expect = 5.2e-09, Sum P(2) = 5.2e-09
 Identities = 15/66 (22%), Positives = 32/66 (48%)

Query:    51 QKYDPQSMEERFENWLKQYS-REYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNK 109
             +K+    +EE   ++ K+   R  G+E+      GI   + + + ++N +N + K+ +N 
Sbjct:    89 KKFIVSKLEELISSYDKEKKMRTTGAEENNMNMNGIDDKDNKSVSFVNKKNGNLKVNNNN 148

Query:   110 FADLSN 115
                 SN
Sbjct:   149 QVSYSN 154

 Score = 45 (20.9 bits), Expect = 0.00027, Sum P(2) = 0.00027
 Identities = 12/45 (26%), Positives = 21/45 (46%)

Query:   212 EKAFEFITKIG-GVTTEDDYP-YRGKNDRCQTDKTKHHAVTITGY 254
             ++A  ++  I   +   DD+  YRG     +     +HAV + GY
Sbjct:   388 KEALRYLGPISISIAASDDFAFYRGGFYDGECGAAPNHAVILVGY 432


>UNIPROTKB|F1RU23
            symbol:CTSW "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 KO:K08569 EMBL:CU928325
            RefSeq:XP_003122571.1 UniGene:Ssc.28940 Ensembl:ENSSSCT00000014177
            GeneID:100525853 KEGG:ssc:100525853 OMA:CWAMAAV Uniprot:F1RU23
        Length = 367

 Score = 267 (99.0 bits), Expect = 1.5e-35, Sum P(2) = 1.5e-35
 Identities = 79/233 (33%), Positives = 119/233 (51%)

Query:    24 MLRNAVLSLFLLWVLGIPAGAWSEGY-PQKYDPQSM--EERFENWLKQYSREYGSEDEWQ 80
             M   A LS  L+ V+  PA    +    Q   PQ M  +E F  +  QY+R Y +  E  
Sbjct:     1 MALTAHLSCLLVLVVAGPAQGLKDALRSQDPGPQPMGLKEVFTLFQIQYNRSYSNPAEHA 60

Query:    81 RRFGIYSSNVQYIDYINSQNLS---FKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSV 137
             RR  I++ N+     +  ++L    F +T   F+DL+ EEF   + G++  +   + PS+
Sbjct:    61 RRLDIFAQNLAKAQRLQEEDLGTAEFGVTP--FSDLTEEEFGQLH-GHH--WGAGKAPSM 115

Query:   138 QY-LG-------LPASVDWRKE-GAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
                +G       +P S DWRK+ G ++ +K Q  C  CWA +AV  VE    +K  + V 
Sbjct:   116 GIKVGSEESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQ 175

Query:   189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG--KNDRC 239
             LS Q+++DCD    N GCNGG++  AF  +    G+ +E DYPY+G  K  RC
Sbjct:   176 LSVQQVLDCD-RCGN-GCNGGFVWDAFLTVLNTSGLASEQDYPYKGTVKTHRC 226

 Score = 133 (51.9 bits), Expect = 1.5e-35, Sum P(2) = 1.5e-35
 Identities = 32/91 (35%), Positives = 43/91 (47%)

Query:   264 QLYSHGVF---DEYCG-HQLNHGVTVVGYGED-----------HGEKYWLVKNSWGTSWG 308
             Q Y  GV       C  H +NH V +VG+G+            H   YW++KNSWG  WG
Sbjct:   270 QQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGRRPRPGHSIPYWILKNSWGPDWG 329

Query:   309 EAGYIRMARNSPSSNIGICGILMQASYPVKR 339
             E GY R+ R S +  I    +  +   PVK+
Sbjct:   330 EEGYFRLHRGSNTCGITKYPVTARVDKPVKK 360


>FB|FBgn0037396
            symbol:CG11459 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE014297 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 KO:K01365 HSSP:P07711 EMBL:AY060710
            RefSeq:NP_649608.1 UniGene:Dm.3894 SMR:Q9VNK6 MEROPS:C01.A31
            EnsemblMetazoa:FBtr0078623 GeneID:40741 KEGG:dme:Dmel_CG11459
            UCSC:CG11459-RA FlyBase:FBgn0037396 InParanoid:Q9VNK6 OMA:NYDEREL
            OrthoDB:EOG4MGQPX ChiTaRS:CG11459 GenomeRNAi:40741 NextBio:820359
            Uniprot:Q9VNK6
        Length = 336

 Score = 250 (93.1 bits), Expect = 2.4e-35, Sum P(2) = 2.4e-35
 Identities = 64/207 (30%), Positives = 103/207 (49%)

Query:    59 EERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNKFADLS 114
             +  ++ +  +Y+++Y + D++ R   +Y   V  ++  N   L    +FK+  NKF+D  
Sbjct:    27 DTEWDQYKAKYNKQYRNRDKYHR--ALYEQRVLAVESHNQLYLQGKVAFKMGLNKFSDTD 84

Query:   115 NEEFISTYLGYNKPYNEPR---WPSVQYLG---LPASVDWRKEGAVTPVKDQG-QCGSCW 167
                  +       P          +V Y     +   +DWR+ G ++PV DQG +C SCW
Sbjct:    85 QRILFNYRSSIPAPLETSTNALTETVNYKRYDQITEGIDWRQYGYISPVGDQGTECLSCW 144

Query:   168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
             AFS    +E     K G LV LS + LVDC V   N GC+GG++  AF + T+  G+ T+
Sbjct:   145 AFSTSGVLEAHMAKKYGNLVPLSPKHLVDC-VPYPNNGCSGGWVSVAFNY-TRDHGIATK 202

Query:   228 DDYPYRGKNDRCQTDKTKHHAVTITGY 254
             + YPY   +  C   K+   A T++GY
Sbjct:   203 ESYPYEPVSGECLW-KSDRSAGTLSGY 228

 Score = 148 (57.2 bits), Expect = 2.4e-35, Sum P(2) = 2.4e-35
 Identities = 33/80 (41%), Positives = 48/80 (60%)

Query:   263 FQLYSHGVFD-EYCGHQ---LNHGVTVVGYGEDH--GEKYWLVKNSWGTSWGEAGYIRMA 316
             F  YS GV     C  +   L H V +VG+G     G+ YW++KNS+GT WGE+GY+++A
Sbjct:   260 FDQYSGGVLSIPACRSKRQDLTHSVLLVGFGTHRKWGD-YWIIKNSYGTDWGESGYLKLA 318

Query:   317 RNSPSSNIGICGILMQASYP 336
             RN+  +N+  CG+     YP
Sbjct:   319 RNA--NNM--CGVASLPQYP 334


>UNIPROTKB|F1PGK4
            symbol:CTSO "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 OMA:SNVCGIA
            EMBL:AAEX03010073 Ensembl:ENSCAFT00000013638 Uniprot:F1PGK4
        Length = 316

 Score = 269 (99.8 bits), Expect = 3.1e-35, Sum P(2) = 3.1e-35
 Identities = 63/172 (36%), Positives = 90/172 (52%)

Query:    95 YINS----QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWP-----SVQYLGLPAS 145
             Y+NS    +N S     N+F+ LS EEF + YL  +KP   PR+P     S++ + LP  
Sbjct:    48 YLNSVFPRENSSAVYGINQFSYLSPEEFKAIYLR-SKPSRSPRYPAEVRTSIRNVSLPLR 106

Query:   146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQG 205
              DWR +  VT V++Q  CG CWAFS V AVE    +K   L  +S Q+++DC  N  N G
Sbjct:   107 FDWRDKRVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGKPLADISVQQVIDCSYN--NYG 164

Query:   206 CNGGYMEKAFEFITKIGGVTTED-DYPYRGKNDRCQTDKTKHHAVTITGYEA 256
             C+GG    A  ++ K       D +YP++ +N  C      +   +I GY A
Sbjct:   165 CSGGSTLNALNWLNKTQVKLVRDSEYPFKAQNGLCHYFSDSYSGFSIRGYSA 216

 Score = 128 (50.1 bits), Expect = 3.1e-35, Sum P(2) = 3.1e-35
 Identities = 26/71 (36%), Positives = 39/71 (54%)

Query:   262 AFQLYSHGVFDEYCGH-QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGY--IRMARN 318
             ++Q Y  G+   +C   + NH V + G+ +     YW+V+NSWG+SWG  GY  ++M  N
Sbjct:   244 SWQDYLGGIIQHHCSSGEANHAVLITGFDKIGSTPYWIVRNSWGSSWGVDGYAHVKMGGN 303

Query:   319 SPSSNIGICGI 329
                    ICGI
Sbjct:   304 -------ICGI 307


>ZFIN|ZDB-GENE-080724-8
            symbol:ctso "cathepsin O" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            ZFIN:ZDB-GENE-080724-8 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 CTD:1519 KO:K01374 EMBL:CR931784
            IPI:IPI00513613 RefSeq:XP_695717.3 UniGene:Dr.88386
            Ensembl:ENSDART00000074786 GeneID:567333 KEGG:dre:567333
            NextBio:20888622 Uniprot:E7FA09
        Length = 334

 Score = 255 (94.8 bits), Expect = 3.9e-35, Sum P(2) = 3.9e-35
 Identities = 63/189 (33%), Positives = 98/189 (51%)

Query:    77 DEWQRRFGIYSSNVQYIDYINS----QNLSFKLTDNKFADLSNEEFISTYLGYNK---P- 128
             +E  +R+  Y S++Q   ++NS     N S +   N+F+ LS ++F   YL       P 
Sbjct:    48 NELYQRWINYQSSLQRQAFLNSALGKSNQSAQYGVNQFSYLSQKQFKEQYLTARAEAAPK 107

Query:   129 YNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
             +++ +         P   DWR  G V PV +QG CG CWAFS V A+E ++     KL  
Sbjct:   108 FDQSKSEIKVKANNPPRFDWRDHGVVGPVHNQGSCGGCWAFSIVEAIESVSAKGGEKLQQ 167

Query:   189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG-GVTTEDDYPYRGKNDRCQTDKTKHH 247
             LS Q+++DC    +NQGCNGG   +A  ++T+    + +E +YP++G +  CQ     H 
Sbjct:   168 LSVQQVIDCSY--QNQGCNGGSPVEALYWLTQSKLKLVSEAEYPFKGADGVCQFFPQAHA 225

Query:   248 AVTITGYEA 256
              V +  Y A
Sbjct:   226 GVAVRNYSA 234

 Score = 141 (54.7 bits), Expect = 3.9e-35, Sum P(2) = 3.9e-35
 Identities = 25/53 (47%), Positives = 35/53 (66%)

Query:   262 AFQLYSHGVFDEYCG-HQLNHGVTVVGYGEDHGE-KYWLVKNSWGTSWGEAGY 312
             ++Q Y  G+   +C  H+ NH V + GY +  GE  YW+V+NSWGTSWG+ GY
Sbjct:   262 SWQDYLGGIIQHHCSSHKANHAVLITGY-DTTGEVPYWIVRNSWGTSWGDDGY 313

 Score = 45 (20.9 bits), Expect = 4.4e-25, Sum P(2) = 4.4e-25
 Identities = 9/17 (52%), Positives = 12/17 (70%)

Query:   239 CQTDKTKHHAVTITGYE 255
             C + K  +HAV ITGY+
Sbjct:   275 CSSHKA-NHAVLITGYD 290


>UNIPROTKB|P25774
            symbol:CTSS "Cathepsin S" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0016020 "membrane"
            evidence=IEA] [GO:0005576 "extracellular region" evidence=NAS]
            [GO:0005764 "lysosome" evidence=IDA;NAS] [GO:0097067 "cellular
            response to thyroid hormone stimulus" evidence=IEP] [GO:0002250
            "adaptive immune response" evidence=IEP] [GO:0019882 "antigen
            processing and presentation" evidence=TAS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=TAS] [GO:0006955
            "immune response" evidence=TAS] [GO:0002474 "antigen processing and
            presentation of peptide antigen via MHC class I" evidence=TAS]
            [GO:0002480 "antigen processing and presentation of exogenous
            peptide antigen via MHC class I, TAP-independent" evidence=TAS]
            [GO:0019886 "antigen processing and presentation of exogenous
            peptide antigen via MHC class II" evidence=TAS] [GO:0036021
            "endolysosome lumen" evidence=TAS] [GO:0042590 "antigen processing
            and presentation of exogenous peptide antigen via MHC class I"
            evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
            [GO:0045087 "innate immune response" evidence=TAS] [GO:0043231
            "intracellular membrane-bounded organelle" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 Reactome:REACT_118779
            Reactome:REACT_6900 GO:GO:0005576 GO:GO:0002480 GO:GO:0016020
            GO:GO:0019886 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087 EMBL:CH471121
            GO:GO:0004197 GO:GO:0002250 HOVERGEN:HBG011513 GO:GO:0097067
            GO:GO:0036021 EMBL:AL356292 CTD:1520 KO:K01368 OMA:KAMDQKC
            OrthoDB:EOG4JM7Q2 EMBL:S93414 EMBL:M86553 EMBL:M90696 EMBL:U07374
            EMBL:U07370 EMBL:U07371 EMBL:U07372 EMBL:U07373 EMBL:CR541676
            EMBL:AK301472 EMBL:AK314482 EMBL:BC002642 IPI:IPI00299150
            IPI:IPI00910216 PIR:A42482 RefSeq:NP_001186668.1 RefSeq:NP_004070.3
            UniGene:Hs.181301 PDB:1BXF PDB:1GLO PDB:1MS6 PDB:1NPZ PDB:1NQC
            PDB:2C0Y PDB:2F1G PDB:2FQ9 PDB:2FRA PDB:2FRQ PDB:2FT2 PDB:2FUD
            PDB:2FYE PDB:2G6D PDB:2G7Y PDB:2H7J PDB:2HH5 PDB:2HHN PDB:2HXZ
            PDB:2OP3 PDB:2R9M PDB:2R9N PDB:2R9O PDB:3IEJ PDB:3KWN PDB:3MPE
            PDB:3MPF PDB:3N3G PDB:3N4C PDB:3OVX PDBsum:1BXF PDBsum:1GLO
            PDBsum:1MS6 PDBsum:1NPZ PDBsum:1NQC PDBsum:2C0Y PDBsum:2F1G
            PDBsum:2FQ9 PDBsum:2FRA PDBsum:2FRQ PDBsum:2FT2 PDBsum:2FUD
            PDBsum:2FYE PDBsum:2G6D PDBsum:2G7Y PDBsum:2H7J PDBsum:2HH5
            PDBsum:2HHN PDBsum:2HXZ PDBsum:2OP3 PDBsum:2R9M PDBsum:2R9N
            PDBsum:2R9O PDBsum:3IEJ PDBsum:3KWN PDBsum:3MPE PDBsum:3MPF
            PDBsum:3N3G PDBsum:3N4C PDBsum:3OVX ProteinModelPortal:P25774
            SMR:P25774 IntAct:P25774 STRING:P25774 MEROPS:I29.004
            PhosphoSite:P25774 DMDM:88984046 PaxDb:P25774 PeptideAtlas:P25774
            PRIDE:P25774 DNASU:1520 Ensembl:ENST00000368985
            Ensembl:ENST00000448301 GeneID:1520 KEGG:hsa:1520 UCSC:uc001evn.3
            GeneCards:GC01M150702 HGNC:HGNC:2545 HPA:CAB000460 HPA:HPA002988
            MIM:116845 neXtProt:NX_P25774 PharmGKB:PA27041 InParanoid:P25774
            PhylomeDB:P25774 BRENDA:3.4.22.27 BindingDB:P25774
            ChEMBL:CHEMBL2954 ChiTaRS:CTSS EvolutionaryTrace:P25774
            GenomeRNAi:1520 NextBio:6291 PMAP-CutDB:P25774 ArrayExpress:P25774
            Bgee:P25774 CleanEx:HS_CTSS Genevestigator:P25774
            GermOnline:ENSG00000163131 Uniprot:P25774
        Length = 331

 Score = 380 (138.8 bits), Expect = 4.0e-35, P = 4.0e-35
 Identities = 84/215 (39%), Positives = 124/215 (57%)

Query:    54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNK 109
             DP +++  +  W K Y ++Y  ++E   R  I+  N++++   N ++     S+ L  N 
Sbjct:    21 DP-TLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNH 79

Query:   110 FADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCG 164
               D+++EE +S       P    R  ++ Y       LP SVDWR++G VT VK QG CG
Sbjct:    80 LGDMTSEEVMSLMSSLRVPSQWQR--NITYKSNPNRILPDSVDWREKGCVTEVKYQGSCG 137

Query:   165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE-NQGCNGGYMEKAFEFITKIGG 223
             +CWAFSAV A+E   KLKTGKLVSLS Q LVDC      N+GCNGG+M  AF++I    G
Sbjct:   138 ACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKG 197

Query:   224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             + ++  YPY+  + +CQ D +K+ A T + Y  +P
Sbjct:   198 IDSDASYPYKAMDQKCQYD-SKYRAATCSKYTELP 231

 Score = 213 (80.0 bits), Expect = 1.3e-15, P = 1.3e-15
 Identities = 52/130 (40%), Positives = 74/130 (56%)

Query:   211 MEKAFEFITKIGGVTTED--DYPYRGKNDRCQTDKTKHHAVTITGYEAIPARY-AFQLYS 267
             M++  ++ +K    T     + PY G+ D  +        V++     + AR+ +F LY 
Sbjct:   209 MDQKCQYDSKYRAATCSKYTELPY-GREDVLKEAVANKGPVSV----GVDARHPSFFLYR 263

Query:   268 HGVFDE-YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
              GV+ E  C   +NHGV VVGYG+ +G++YWLVKNSWG ++GE GYIRMARN  +     
Sbjct:   264 SGVYYEPSCTQNVNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNH---- 319

Query:   327 CGILMQASYP 336
             CGI    SYP
Sbjct:   320 CGIASFPSYP 329


>UNIPROTKB|P43234
            symbol:CTSO "Cathepsin O" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 Reactome:REACT_6900
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0004197
            CleanEx:HS_CTSO EMBL:X77383 EMBL:BC049206 IPI:IPI00017257
            PIR:A55090 RefSeq:NP_001325.1 UniGene:Hs.75262
            ProteinModelPortal:P43234 SMR:P43234 IntAct:P43234 STRING:P43234
            MEROPS:C01.035 PhosphoSite:P43234 DMDM:1168795 PRIDE:P43234
            DNASU:1519 Ensembl:ENST00000433477 GeneID:1519 KEGG:hsa:1519
            UCSC:uc003ipg.3 CTD:1519 GeneCards:GC04M156845 HGNC:HGNC:2542
            HPA:HPA002041 MIM:600550 neXtProt:NX_P43234 PharmGKB:PA27040
            HOVERGEN:HBG105050 InParanoid:P43234 KO:K01374 OMA:SNVCGIA
            OrthoDB:EOG4V6ZH1 PhylomeDB:P43234 BindingDB:P43234
            ChEMBL:CHEMBL3035 GenomeRNAi:1519 NextBio:6287 Bgee:P43234
            Genevestigator:P43234 GermOnline:ENSG00000151792 Uniprot:P43234
        Length = 321

 Score = 261 (96.9 bits), Expect = 5.0e-35, Sum P(2) = 5.0e-35
 Identities = 64/187 (34%), Positives = 94/187 (50%)

Query:    80 QRRFGIYSSNVQYIDYINS----QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWP 135
             +R    +  ++    Y+NS    +N +     N+F+ L  EEF + YL  +KP   PR+ 
Sbjct:    38 EREAAAFRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLR-SKPSKFPRYS 96

Query:   136 -----SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLS 190
                  S+  + LP   DWR +  VT V++Q  CG CWAFS V AVE    +K   L  LS
Sbjct:    97 AEVHMSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLS 156

Query:   191 EQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED-DYPYRGKNDRCQTDKTKHHAV 249
              Q+++DC  N  N GCNGG    A  ++ K+     +D +YP++ +N  C      H   
Sbjct:   157 VQQVIDCSYN--NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGF 214

Query:   250 TITGYEA 256
             +I GY A
Sbjct:   215 SIKGYSA 221

 Score = 134 (52.2 bits), Expect = 5.0e-35, Sum P(2) = 5.0e-35
 Identities = 28/91 (30%), Positives = 43/91 (47%)

Query:   240 QTDKTKHHAVTITGYEAIPARYAFQLYSHGVFDEYCGH-QLNHGVTVVGYGEDHGEKYWL 298
             Q D+     +T      I    ++Q Y  G+   +C   + NH V + G+ +     YW+
Sbjct:   227 QEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHCSSGEANHAVLITGFDKTGSTPYWI 286

Query:   299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
             V+NSWG+SWG  GY  +   S      +CGI
Sbjct:   287 VRNSWGSSWGVDGYAHVKMGS-----NVCGI 312


>UNIPROTKB|E2RPX3
            symbol:CTSW "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 CTD:1521 KO:K08569 OMA:GRCGDGC
            EMBL:AAEX03011632 RefSeq:XP_540846.2 Ensembl:ENSCAFT00000020910
            GeneID:483725 KEGG:cfa:483725 Uniprot:E2RPX3
        Length = 374

 Score = 254 (94.5 bits), Expect = 2.7e-34, Sum P(3) = 2.7e-34
 Identities = 73/237 (30%), Positives = 118/237 (49%)

Query:    30 LSLFLLWVLGIPAGAWSEGYP-----QKYDPQSMEER--FENWLKQYSREYGSEDEWQRR 82
             L+++L  +L +   + + G       Q   PQ +E +  F  +  QY+R Y + +E+ RR
Sbjct:     3 LTIYLSCLLALSVASLAHGIKRSLKNQDPGPQPLELKQVFALFQIQYNRSYSNPEEYARR 62

Query:    83 FGIYSSNVQYIDYINSQNLS---FKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSV-- 137
               I++ N+     +  ++L    F +T   F+DL+ EEF   Y G+ +   E   PSV  
Sbjct:    63 LDIFAHNLAQAQQLEDEDLGTAEFGVTP--FSDLTEEEFGQFY-GHQRMAGEA--PSVGR 117

Query:   138 ----QYLG--LPASVDWRK-EGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLS 190
                 +  G  +P + DWRK  G ++P+K QG C  CWA +A   +E +  ++  + V +S
Sbjct:   118 KVESEEWGEPVPPTCDWRKLPGIISPIKQQGNCRCCWAMAAAGNIEALWGIRYHQPVEVS 177

Query:   191 EQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG--KNDRCQTDKTK 245
              QEL+DC    +  GC GG+   AF  +    G+ +  DYP+ G  K  RC   K K
Sbjct:   178 VQELLDCGRCGD--GCKGGFTWDAFITVLNNSGLASAKDYPFLGNTKPHRCLAKKYK 232

 Score = 106 (42.4 bits), Expect = 2.7e-34, Sum P(3) = 2.7e-34
 Identities = 21/46 (45%), Positives = 26/46 (56%)

Query:   292 HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
             H   YW++KNSWG  WGE GY R+ R + +     CGI     YPV
Sbjct:   320 HPIPYWILKNSWGAEWGEEGYFRLHRGNNT-----CGI---TKYPV 357

 Score = 38 (18.4 bits), Expect = 2.7e-34, Sum P(3) = 2.7e-34
 Identities = 11/31 (35%), Positives = 16/31 (51%)

Query:   264 QLYSHGVFDEY---CGHQ-LNHGVTVVGYGE 290
             Q Y  GV       C  Q ++H V +VG+G+
Sbjct:   270 QHYQKGVIQATHTTCDPQRVDHSVLLVGFGK 300


>UNIPROTKB|Q5QP40
            symbol:CTSK "Cathepsin K" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015644 Pfam:PF00112
            InterPro:IPR000169 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 EMBL:AL355860 HOVERGEN:HBG011513
            PANTHER:PTHR12411:SF55 EMBL:AL356292 UniGene:Hs.632466
            HGNC:HGNC:2536 IPI:IPI00514633 SMR:Q5QP40 STRING:Q5QP40
            Ensembl:ENST00000443913 Uniprot:Q5QP40
        Length = 258

 Score = 366 (133.9 bits), Expect = 1.2e-33, P = 1.2e-33
 Identities = 86/209 (41%), Positives = 128/209 (61%)

Query:    33 FLLW---VLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSS 88
             F++W   VL +P  +++  YP++     ++  +E W K + ++Y ++ DE  RR  I+  
Sbjct:    58 FVMWGLKVLLLPVVSFAL-YPEEI----LDTHWELWKKTHRKQYNNKVDEISRRL-IWEK 111

Query:    89 NVQYIDYINSQ-NL---SFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYL---- 140
             N++YI   N + +L   +++L  N   D+++EE +    G   P +  R     Y+    
Sbjct:   112 NLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWE 171

Query:   141 G-LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDV 199
             G  P SVD+RK+G VTPVK+QGQCGSCWAFS+V A+EG  K KTGKL++LS Q LVDC  
Sbjct:   172 GRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV- 230

Query:   200 NSENQGCNGGYMEKAFEFITKIGGVTTED 228
              SEN GC GGYM  AF+++ K  G+ +ED
Sbjct:   231 -SENDGCGGGYMTNAFQYVQKNRGIDSED 258


>GENEDB_PFALCIPARUM|PF14_0553
            symbol:PF14_0553 "cysteine proteinase
            falcipain-1" species:5833 "Plasmodium falciparum" [GO:0042540
            "hemoglobin catabolic process" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014187 KO:K01376 HSSP:P07688 GO:GO:0042540
            RefSeq:XP_001348727.1 ProteinModelPortal:Q8I6V0 PRIDE:Q8I6V0
            EnsemblProtists:PF14_0553:mRNA GeneID:812135 KEGG:pfa:PF14_0553
            EuPathDB:PlasmoDB:PF3D7_1458000 HOGENOM:HOG000065906
            ProtClustDB:CLSZ2457715 ChEMBL:CHEMBL1250371 Uniprot:Q8I6V0
        Length = 569

 Score = 241 (89.9 bits), Expect = 1.8e-33, Sum P(3) = 1.8e-33
 Identities = 57/168 (33%), Positives = 91/168 (54%)

Query:   142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
             +P  +D+R++G V   KDQG CGSCWAF++V  +E +   K   ++S SEQE+VDC  + 
Sbjct:   333 VPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDC--SK 390

Query:   202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKND------RCQTDKT-------KHHA 248
             +N GC+GG+   +F ++ +   +   D+Y Y+ K+D      RC+   +       K + 
Sbjct:   391 DNFGCDGGHPFYSFLYVLQ-NELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQ 449

Query:   249 VTITGYEAIP------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGE 290
             + +   E  P          F  YS GV++  C  +LNH V +VGYG+
Sbjct:   450 LILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLVGYGQ 497

 Score = 125 (49.1 bits), Expect = 1.8e-33, Sum P(3) = 1.8e-33
 Identities = 19/42 (45%), Positives = 28/42 (66%)

Query:   296 YWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
             YW++KNSW   WGE G++R++RN    N+  CGI  +  YP+
Sbjct:   528 YWIIKNSWSKKWGENGFMRLSRNKNGDNV-FCGIGEEVFYPI 568

 Score = 80 (33.2 bits), Expect = 1.8e-33, Sum P(3) = 1.8e-33
 Identities = 19/59 (32%), Positives = 34/59 (57%)

Query:    61 RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN--SQNLSFKLTDNKFADLSNEE 117
             +F  ++K++++ Y + DE  R+F I+  N   I   N  ++N  +K   N+F+D S EE
Sbjct:   224 KFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEE 282


>UNIPROTKB|Q8I6V0
            symbol:PF14_0553 "Cysteine proteinase falcipain-1"
            species:36329 "Plasmodium falciparum 3D7" [GO:0042540 "hemoglobin
            catabolic process" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014187 KO:K01376 HSSP:P07688 GO:GO:0042540
            RefSeq:XP_001348727.1 ProteinModelPortal:Q8I6V0 PRIDE:Q8I6V0
            EnsemblProtists:PF14_0553:mRNA GeneID:812135 KEGG:pfa:PF14_0553
            EuPathDB:PlasmoDB:PF3D7_1458000 HOGENOM:HOG000065906
            ProtClustDB:CLSZ2457715 ChEMBL:CHEMBL1250371 Uniprot:Q8I6V0
        Length = 569

 Score = 241 (89.9 bits), Expect = 1.8e-33, Sum P(3) = 1.8e-33
 Identities = 57/168 (33%), Positives = 91/168 (54%)

Query:   142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
             +P  +D+R++G V   KDQG CGSCWAF++V  +E +   K   ++S SEQE+VDC  + 
Sbjct:   333 VPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDC--SK 390

Query:   202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKND------RCQTDKT-------KHHA 248
             +N GC+GG+   +F ++ +   +   D+Y Y+ K+D      RC+   +       K + 
Sbjct:   391 DNFGCDGGHPFYSFLYVLQ-NELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQ 449

Query:   249 VTITGYEAIP------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGE 290
             + +   E  P          F  YS GV++  C  +LNH V +VGYG+
Sbjct:   450 LILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLVGYGQ 497

 Score = 125 (49.1 bits), Expect = 1.8e-33, Sum P(3) = 1.8e-33
 Identities = 19/42 (45%), Positives = 28/42 (66%)

Query:   296 YWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
             YW++KNSW   WGE G++R++RN    N+  CGI  +  YP+
Sbjct:   528 YWIIKNSWSKKWGENGFMRLSRNKNGDNV-FCGIGEEVFYPI 568

 Score = 80 (33.2 bits), Expect = 1.8e-33, Sum P(3) = 1.8e-33
 Identities = 19/59 (32%), Positives = 34/59 (57%)

Query:    61 RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN--SQNLSFKLTDNKFADLSNEE 117
             +F  ++K++++ Y + DE  R+F I+  N   I   N  ++N  +K   N+F+D S EE
Sbjct:   224 KFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEE 282


>WB|WBGene00019314
            symbol:K02E7.10 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 SMART:SM00645 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 HSSP:P07711
            EMBL:FO080411 PIR:T32392 RefSeq:NP_493904.1 UniGene:Cel.14828
            ProteinModelPortal:O17255 SMR:O17255 EnsemblMetazoa:K02E7.10
            GeneID:186889 KEGG:cel:CELE_K02E7.10 UCSC:K02E7.10 CTD:186889
            WormBase:K02E7.10 eggNOG:NOG331187 HOGENOM:HOG000114005
            InParanoid:O17255 OMA:GNANEAR NextBio:933344 Uniprot:O17255
        Length = 299

 Score = 212 (79.7 bits), Expect = 1.4e-32, Sum P(2) = 1.4e-32
 Identities = 45/103 (43%), Positives = 63/103 (61%)

Query:   146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGIN-KLKTGKLVSLSEQELVDCDVNSENQ 204
             +DWR++G V PVKDQG+C + +AF+A+AA+E +  K   GKL+S SEQ+++DC  N  N 
Sbjct:    84 LDWREKGIVGPVKDQGKCNASYAFAAIAAIESMYAKANNGKLLSFSEQQIIDC-ANFTNP 142

Query:   205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKND--RCQTDKTK 245
              C             K  GV TE DYPY GK +  +C+ D +K
Sbjct:   143 -CQENLENVLSNRFLKENGVGTEADYPYVGKENVGKCEYDSSK 184

 Score = 183 (69.5 bits), Expect = 1.4e-32, Sum P(2) = 1.4e-32
 Identities = 39/97 (40%), Positives = 57/97 (58%)

Query:   247 HAVTI-TGYEAIPARYAFQLYSHGVFD---EYCGHQLN-HGVTVVGYGEDHGEKYWLVKN 301
             H  T  TGY  + +  +F  Y  G+++   E CG+      + +VGYG+D  EKYW+VK 
Sbjct:   204 HITTFGTGYFRMRSPPSFFHYKTGIYNPTKEECGNANEARSLAIVGYGKDGAEKYWIVKG 263

Query:   302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
             S+GTSWGE GY+++ARN     +  CG+    S P+K
Sbjct:   264 SFGTSWGEHGYMKLARN-----VNACGMAESISIPIK 295


>UNIPROTKB|P56202
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 GO:GO:0006955 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AF013611
            EMBL:AF015954 EMBL:AF055903 EMBL:AP001201 EMBL:BC048255
            IPI:IPI00328978 RefSeq:NP_001326.2 UniGene:Hs.416848
            ProteinModelPortal:P56202 SMR:P56202 STRING:P56202 MEROPS:C01.037
            PhosphoSite:P56202 DMDM:259016196 PaxDb:P56202 PRIDE:P56202
            Ensembl:ENST00000307886 GeneID:1521 KEGG:hsa:1521 UCSC:uc001ogc.1
            CTD:1521 GeneCards:GC11P065647 HGNC:HGNC:2546 HPA:CAB016345
            MIM:602364 neXtProt:NX_P56202 PharmGKB:PA27042 eggNOG:NOG288820
            HOVERGEN:HBG100117 InParanoid:P56202 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 PhylomeDB:P56202 GenomeRNAi:1521 NextBio:6295
            ArrayExpress:P56202 Bgee:P56202 CleanEx:HS_CTSW
            Genevestigator:P56202 GermOnline:ENSG00000172543 Uniprot:P56202
        Length = 376

 Score = 262 (97.3 bits), Expect = 2.6e-32, Sum P(2) = 2.6e-32
 Identities = 74/227 (32%), Positives = 112/227 (49%)

Query:    32 LFLLWVLGIPAGAWSEGYPQKYDPQSME--ERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
             L  L V G+  G       Q   PQ +E  E F+ +  Q++R Y S +E   R  I++ N
Sbjct:    10 LLALLVAGLAQGIRGPLRAQDLGPQPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHN 69

Query:    90 VQYIDYINSQNLS---FKLTDNKFADLSNEEFISTYLGYNK-----PY--NEPRWPSVQY 139
             +     +  ++L    F +T   F+DL+ EEF   Y GY +     P    E R    + 
Sbjct:    70 LAQAQRLQEEDLGTAEFGVTP--FSDLTEEEFGQLY-GYRRAAGGVPSMGREIRSEEPEE 126

Query:   140 LGLPASVDWRK-EGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD 198
               +P S DWRK   A++P+KDQ  C  CWA +A   +E + ++     V +S QEL+DC 
Sbjct:   127 -SVPFSCDWRKVASAISPIKDQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVQELLDCG 185

Query:   199 VNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK--NDRCQTDK 243
                +  GC+GG++  AF  +    G+ +E DYP++GK    RC   K
Sbjct:   186 RCGD--GCHGGFVWDAFITVLNNSGLASEKDYPFQGKVRAHRCHPKK 230

 Score = 107 (42.7 bits), Expect = 2.6e-32, Sum P(2) = 2.6e-32
 Identities = 17/33 (51%), Positives = 21/33 (63%)

Query:   292 HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
             H   YW++KNSWG  WGE GY R+ R S +  I
Sbjct:   322 HPTPYWILKNSWGAQWGEKGYFRLHRGSNTCGI 354

 Score = 48 (22.0 bits), Expect = 4.0e-26, Sum P(2) = 4.0e-26
 Identities = 15/39 (38%), Positives = 19/39 (48%)

Query:   264 QLYSHGVFDEY---CGHQL-NHGVTVVGYGEDHGEK-YW 297
             QLY  GV       C  QL +H V +VG+G    E+  W
Sbjct:   270 QLYRKGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEGIW 308


>GENEDB_PFALCIPARUM|PF11_0165
            symbol:PF11_0165 "falcipain 2 precursor"
            species:5833 "Plasmodium falciparum" [GO:0020020 "food vacuole"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014186 HSSP:O65039 GO:GO:0020020
            RefSeq:XP_001347836.1 ProteinModelPortal:Q8I6U4 SMR:Q8I6U4
            IntAct:Q8I6U4 MINT:MINT-1559493 MEROPS:C01.046
            EnsemblProtists:PF11_0165:mRNA GeneID:810712 KEGG:pfa:PF11_0165
            EuPathDB:PlasmoDB:PF3D7_1115700 HOGENOM:HOG000065857 OMA:NESLHAN
            ProtClustDB:PTZ00021 BindingDB:Q8I6U4 ChEMBL:CHEMBL3470
            Uniprot:Q8I6U4
        Length = 484

 Score = 348 (127.6 bits), Expect = 9.8e-32, P = 9.8e-32
 Identities = 81/216 (37%), Positives = 126/216 (58%)

Query:    61 RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYID-YINSQNLSFKLTDNKFADLSNEEFI 119
             +F  ++K  +++Y S +E + RF ++  N   ++ + N++N  +K   N+FADL+  EF 
Sbjct:   164 QFYMFIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKELNRFADLTYHEFK 223

Query:   120 STYLGY--NKP-----Y--NEPRWPSV--QYLGLP----ASVDWRKEGAVTPVKDQGQCG 164
             + YL    +KP     Y  ++  +  V  +Y G      A+ DWR    VTPVKDQ  CG
Sbjct:   224 NKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYKGNENFDHAAYDWRLHSGVTPVKDQKNCG 283

Query:   165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
             SCWAFS++ +VE    ++  KL++LSEQELVDC    +N GCNGG +  AFE + ++GG+
Sbjct:   284 SCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSF--KNYGCNGGLINNAFEDMIELGGI 341

Query:   225 TTEDDYPY-RGKNDRCQTDK-TKHHAVTITGYEAIP 258
              T+DDYPY     + C  D+ T+ +   I  Y ++P
Sbjct:   342 CTDDDYPYVSDAPNLCNIDRCTEKYG--IKNYLSVP 375

 Score = 160 (61.4 bits), Expect = 8.1e-09, P = 8.1e-09
 Identities = 78/316 (24%), Positives = 132/316 (41%)

Query:    51 QKYDPQSMEERFENWLKQYSR----EYGSEDEWQRRFGIYSSNVQYIDYIN---SQNLSF 103
             Q   P  M+ERF+ +L+   +           +++    ++ ++ Y ++ N   S   S 
Sbjct:   175 QYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKELNRFA-DLTYHEFKNKYLSLRSSK 233

Query:   104 KLTDNKFA-DLSN-EEFISTYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQG 161
              L ++K+  D  N EE I  Y G N+ ++   +    + G+    D +  G+       G
Sbjct:   234 PLKNSKYLLDQMNYEEVIKKYKG-NENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIG 292

Query:   162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD---VNS--ENQGCNGGYM-EKAF 215
                S +A      +     L   +LV  S +    C+   +N+  E+    GG   +  +
Sbjct:   293 SVESQYAIRKNKLIT----LSEQELVDCSFKNY-GCNGGLINNAFEDMIELGGICTDDDY 347

Query:   216 EFITKIGGVTTED--DYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA--FQLYSHGVF 271
              +++    +   D     Y  KN     D     A+   G  +I    +  F  Y  G+F
Sbjct:   348 PYVSDAPNLCNIDRCTEKYGIKNYLSVPDNKLKEALRFLGPISISVAVSDDFAFYKEGIF 407

Query:   272 DEYCGHQLNHGVTVVGYGEDH--------GEK--YWLVKNSWGTSWGEAGYIRMARNSPS 321
             D  CG QLNH V +VG+G           GEK  Y+++KNSWG  WGE G+I +  +  S
Sbjct:   408 DGECGDQLNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDE-S 466

Query:   322 SNIGICGILMQASYPV 337
               +  CG+   A  P+
Sbjct:   467 GLMRKCGLGTDAFIPL 482


>UNIPROTKB|Q8I6U4
            symbol:PF11_0165 "Falcipain-2A" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            EMBL:AE014186 HSSP:O65039 GO:GO:0020020 RefSeq:XP_001347836.1
            ProteinModelPortal:Q8I6U4 SMR:Q8I6U4 IntAct:Q8I6U4
            MINT:MINT-1559493 MEROPS:C01.046 EnsemblProtists:PF11_0165:mRNA
            GeneID:810712 KEGG:pfa:PF11_0165 EuPathDB:PlasmoDB:PF3D7_1115700
            HOGENOM:HOG000065857 OMA:NESLHAN ProtClustDB:PTZ00021
            BindingDB:Q8I6U4 ChEMBL:CHEMBL3470 Uniprot:Q8I6U4
        Length = 484

 Score = 348 (127.6 bits), Expect = 9.8e-32, P = 9.8e-32
 Identities = 81/216 (37%), Positives = 126/216 (58%)

Query:    61 RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYID-YINSQNLSFKLTDNKFADLSNEEFI 119
             +F  ++K  +++Y S +E + RF ++  N   ++ + N++N  +K   N+FADL+  EF 
Sbjct:   164 QFYMFIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKELNRFADLTYHEFK 223

Query:   120 STYLGY--NKP-----Y--NEPRWPSV--QYLGLP----ASVDWRKEGAVTPVKDQGQCG 164
             + YL    +KP     Y  ++  +  V  +Y G      A+ DWR    VTPVKDQ  CG
Sbjct:   224 NKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYKGNENFDHAAYDWRLHSGVTPVKDQKNCG 283

Query:   165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
             SCWAFS++ +VE    ++  KL++LSEQELVDC    +N GCNGG +  AFE + ++GG+
Sbjct:   284 SCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSF--KNYGCNGGLINNAFEDMIELGGI 341

Query:   225 TTEDDYPY-RGKNDRCQTDK-TKHHAVTITGYEAIP 258
              T+DDYPY     + C  D+ T+ +   I  Y ++P
Sbjct:   342 CTDDDYPYVSDAPNLCNIDRCTEKYG--IKNYLSVP 375

 Score = 160 (61.4 bits), Expect = 8.1e-09, P = 8.1e-09
 Identities = 78/316 (24%), Positives = 132/316 (41%)

Query:    51 QKYDPQSMEERFENWLKQYSR----EYGSEDEWQRRFGIYSSNVQYIDYIN---SQNLSF 103
             Q   P  M+ERF+ +L+   +           +++    ++ ++ Y ++ N   S   S 
Sbjct:   175 QYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKELNRFA-DLTYHEFKNKYLSLRSSK 233

Query:   104 KLTDNKFA-DLSN-EEFISTYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQG 161
              L ++K+  D  N EE I  Y G N+ ++   +    + G+    D +  G+       G
Sbjct:   234 PLKNSKYLLDQMNYEEVIKKYKG-NENFDHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIG 292

Query:   162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD---VNS--ENQGCNGGYM-EKAF 215
                S +A      +     L   +LV  S +    C+   +N+  E+    GG   +  +
Sbjct:   293 SVESQYAIRKNKLIT----LSEQELVDCSFKNY-GCNGGLINNAFEDMIELGGICTDDDY 347

Query:   216 EFITKIGGVTTED--DYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA--FQLYSHGVF 271
              +++    +   D     Y  KN     D     A+   G  +I    +  F  Y  G+F
Sbjct:   348 PYVSDAPNLCNIDRCTEKYGIKNYLSVPDNKLKEALRFLGPISISVAVSDDFAFYKEGIF 407

Query:   272 DEYCGHQLNHGVTVVGYGEDH--------GEK--YWLVKNSWGTSWGEAGYIRMARNSPS 321
             D  CG QLNH V +VG+G           GEK  Y+++KNSWG  WGE G+I +  +  S
Sbjct:   408 DGECGDQLNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGERGFINIETDE-S 466

Query:   322 SNIGICGILMQASYPV 337
               +  CG+   A  P+
Sbjct:   467 GLMRKCGLGTDAFIPL 482


>UNIPROTKB|Q10991
            symbol:CTSL "Cathepsin L1" species:9940 "Ovis aries"
            [GO:0005515 "protein binding" evidence=IPI] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            MEROPS:C01.032 ProteinModelPortal:Q10991 SMR:Q10991 Uniprot:Q10991
        Length = 217

 Score = 337 (123.7 bits), Expect = 1.4e-30, P = 1.4e-30
 Identities = 66/119 (55%), Positives = 81/119 (68%)

Query:   142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
             +P SVDW K+G VTPVK+QGQCGSCWAFSA  A+EG    KTGKLVSLSEQ LVD     
Sbjct:     1 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQ 60

Query:   202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR 260
              NQGCNGG M+ AF++I + GG+ +E+ YPY   +  C   K ++ A   TG+  IP R
Sbjct:    61 GNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSCNY-KPEYSAAKDTGFVDIPQR 118

 Score = 194 (73.4 bits), Expect = 4.7e-15, P = 4.7e-15
 Identities = 45/103 (43%), Positives = 59/103 (57%)

Query:   240 QTDKTKHHAVTITG--YEAIPARYA-FQLYSHGVF-DEYCGHQ-LNHGVTVVGYG-EDHG 293
             Q +K    AV   G    AI A ++ FQ Y  G++ D  C  + L+HGV VVGYG E   
Sbjct:   117 QREKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTN 176

Query:   294 EKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
              K+W+VKNSWG  WG  GY++MA++  +     CGI   ASYP
Sbjct:   177 NKFWIVKNSWGPEWGNKGYVKMAKDQNNH----CGIATAASYP 215


>UNIPROTKB|F1NHB8
            symbol:F1NHB8 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 EMBL:AADN02044011
            IPI:IPI00586027 Ensembl:ENSGALT00000021873 OMA:SELDHAV
            Uniprot:F1NHB8
        Length = 329

 Score = 336 (123.3 bits), Expect = 1.8e-30, P = 1.8e-30
 Identities = 82/228 (35%), Positives = 127/228 (55%)

Query:    54 DPQSMEER-FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFAD 112
             D + +  R F ++ +++ + Y SE+E + R   +  N++++   N   LS+ L  N  AD
Sbjct:    17 DTEHVHHRLFHHYKERFGKRYSSEEEHEHRKRTFIHNMRFVHSKNRAALSYSLALNHLAD 76

Query:   113 LSNEEFISTYLGYNKPYNEPR--WP-SVQ-Y--LGLPASVDWRKEGAVTPVKDQGQCGSC 166
              + +E  +  L   +   +P+   P S+Q Y  L LP S+DWR  GAVTPVKDQ  CGSC
Sbjct:    77 RTPQEMAA--LRGRRRSGDPKSGQPFSMQLYASLVLPESLDWRLYGAVTPVKDQAVCGSC 134

Query:   167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
             W+F+   A+EG   LKTG L  LS+Q L+DC     N  C+GG   +A+E+I K GG+ +
Sbjct:   135 WSFATTGAMEGALFLKTGVLTPLSQQVLIDCSWGFGNYACDGGEEWRAYEWIKKHGGIAS 194

Query:   227 EDDY-PYRGKNDRCQTDKTKHHAVTITGYEAIPARYA----FQLYSHG 269
              + Y PY G+N  C  ++++  A  + GY  + +  A      L+ HG
Sbjct:   195 TESYGPYLGQNGYCHYNQSELVA-PLAGYVTVESGNAEALKAALFKHG 241

 Score = 213 (80.0 bits), Expect = 1.2e-15, P = 1.2e-15
 Identities = 57/192 (29%), Positives = 89/192 (46%)

Query:   150 KEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGG 209
             K G +TP+  Q      W F   A  +G  + +  + +      +   +      G NG 
Sbjct:   150 KTGVLTPLSQQVLIDCSWGFGNYAC-DGGEEWRAYEWIK-KHGGIASTESYGPYLGQNGY 207

Query:   210 YMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQLYSHG 269
                   E +  + G  T +     G  +  +    KH  V +   +A  +  +F  Y++G
Sbjct:   208 CHYNQSELVAPLAGYVTVES----GNAEALKAALFKHGPVAVN-IDA--SHKSFTFYANG 260

Query:   270 VFDE-YCGHQ---LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIG 325
             V++E +CG++   L+H V  VGYG  HG+ YWL+KNSW T WG  GYI MA     +N  
Sbjct:   261 VYEEPHCGNETSELDHAVLAVGYGVLHGKSYWLIKNSWSTYWGNDGYILMAMKD--NN-- 316

Query:   326 ICGILMQASYPV 337
              CG+   AS+P+
Sbjct:   317 -CGVATAASFPI 327


>TAIR|locus:2030027
            symbol:AT1G29110 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            EMBL:CP002684 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            IPI:IPI00544534 RefSeq:NP_564322.1 UniGene:At.51816
            ProteinModelPortal:F4HZW2 SMR:F4HZW2 EnsemblPlants:AT1G29110.1
            GeneID:839786 KEGG:ath:AT1G29110 OMA:SCRANAR Uniprot:F4HZW2
        Length = 334

 Score = 335 (123.0 bits), Expect = 2.3e-30, P = 2.3e-30
 Identities = 83/246 (33%), Positives = 131/246 (53%)

Query:    28 AVLSLFL-LWVLGIPAGAWSEGYPQ-KYDPQSMEERFENWLKQYSREYGSEDEWQRRFGI 85
             +V S+F+ L +L +     S+  P    + QS+ +  + W+ Q+SR Y  E E + R  +
Sbjct:     3 SVRSVFVALTILSMDLRI-SQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKV 61

Query:    86 YSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLG-----------YNKPYNEPR 133
             +  N+++I+  N+  N S+ L  N+F D   EEF++T+ G           +NK      
Sbjct:    62 FKKNLKFIENFNNMGNQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRN 121

Query:   134 WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
             W          S DWR EGAVTPVK QG C        +  + G N      L++LSEQ+
Sbjct:   122 WNMSDIDMEDESKDWRDEGAVTPVKYQGAC-------RLTKISGKN------LLTLSEQQ 168

Query:   194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITG 253
             L+DCD+  +N GCNGG  E+AF++I K GGV+ E +YPY+ K + C+ +  +     I G
Sbjct:   169 LIDCDIE-KNGGCNGGEFEEAFKYIIKNGGVSLETEYPYQVKKESCRANARRAPHTQIRG 227

Query:   254 YEAIPA 259
             ++ +P+
Sbjct:   228 FQMVPS 233

 Score = 234 (87.4 bits), Expect = 1.2e-18, P = 1.2e-18
 Identities = 72/213 (33%), Positives = 101/213 (47%)

Query:   145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGK--LVSLSEQELVDCDVNSE 202
             S DWR EGAVTPVK QG C        +  + G N L   +  L+    ++   C+    
Sbjct:   133 SKDWRDEGAVTPVKYQGAC-------RLTKISGKNLLTLSEQQLIDCDIEKNGGCNGGEF 185

Query:   203 NQGC-----NGGY-MEKAFEF-ITKIGGVTTEDDYPY---RG------KNDRCQTDKTKH 246
              +       NGG  +E  + + + K          P+   RG       N+R   +  + 
Sbjct:   186 EEAFKYIIKNGGVSLETEYPYQVKKESCRANARRAPHTQIRGFQMVPSHNERALLEAVRR 245

Query:   247 HAVTITGYEAIPARY-AFQLYSHGVFDEY-CGHQLNHGVTVVGYGEDHGEKYWLVKNSWG 304
               V++     I AR  +F  Y  GV+    CG  +NH VT+VGYG   G  YW++KNSWG
Sbjct:   246 QPVSVL----IDARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGTMSGLNYWVLKNSWG 301

Query:   305 TSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
              SWGE GY+R+ R+      G+CGI   A+YPV
Sbjct:   302 ESWGENGYMRIRRDVEWPQ-GMCGIAQVAAYPV 333


>RGD|1564827
            symbol:RGD1564827 "similar to cathepsin M" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 IPI:IPI00192321
            Ensembl:ENSRNOT00000023990 ArrayExpress:D3ZY04 Uniprot:D3ZY04
        Length = 338

 Score = 218 (81.8 bits), Expect = 2.5e-30, Sum P(2) = 2.5e-30
 Identities = 41/91 (45%), Positives = 53/91 (58%)

Query:   160 QGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFIT 219
             QG+C SCWAF  V A+EG    KTGKL  LS Q LVDC     N+GC GG    AF+++ 
Sbjct:   139 QGRCNSCWAFPVVGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVL 198

Query:   220 KIGGVTTEDDYPYRGKNDRCQTDKTKHHAVT 250
             + GG+ +E  YPY GK   C+ +      +T
Sbjct:   199 QNGGLESEATYPYEGKEGLCRYNPNSSAKIT 229

 Score = 165 (63.1 bits), Expect = 2.5e-30, Sum P(2) = 2.5e-30
 Identities = 32/81 (39%), Positives = 47/81 (58%)

Query:   262 AFQLYSHGVFDE-YCGHQLNHGVTVVGYG----EDHGEKYWLVKNSWGTSWGEAGYIRMA 316
             + + Y  G++ E  C + +NH V VVGYG    E  G  YWL++NSWG  WG  GY+++A
Sbjct:   261 SLRFYKKGIYHEPKCNNYVNHAVLVVGYGFEGNETDGNNYWLIQNSWGERWGLNGYMKIA 320

Query:   317 RNSPSSNIGICGILMQASYPV 337
             ++  +     CGI   A YP+
Sbjct:   321 KDRNNH----CGIATFAQYPI 337


>UNIPROTKB|H0YD65
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 EMBL:AP002748
            HGNC:HGNC:2531 ChiTaRS:CTSF Ensembl:ENST00000524994 Uniprot:H0YD65
        Length = 283

 Score = 334 (122.6 bits), Expect = 3.0e-30, P = 3.0e-30
 Identities = 83/205 (40%), Positives = 112/205 (54%)

Query:    50 PQKYD-PQSMEERFENWLKQYSREYGSED-EWQRRFGIYSSNV---QYIDYINSQNLSFK 104
             P   D P  M   F+N++  Y+R Y S++  W  R  ++ +N+   Q I  ++     + 
Sbjct:    23 PLSQDLPVKMASIFKNFVITYNRTYESKEARW--RLSVFVNNMVRAQKIQALDRGTAQYG 80

Query:   105 LTDNKFADLSNEEFISTYLGY---NKPYNEPRWP-SVQYLGLPASVDWRKEGAVTPVKDQ 160
             +T  KF+DL+ EEF + YL      +P N+ +   SV  L  P   DWR +GAVT VKDQ
Sbjct:    81 VT--KFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLA-PPEWDWRSKGAVTKVKDQ 137

Query:   161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
             G CGSCWAFS    VEG   L  G L+SLSEQEL+DCD    ++ C GG    A+  I  
Sbjct:   138 GMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD--KMDKACMGGLPSNAYSAIKN 195

Query:   221 IGGVTTEDDYPYRGKNDRCQTDKTK 245
             +GG+ TEDDY Y+G    C     K
Sbjct:   196 LGGLETEDDYSYQGHMQSCNFSAEK 220


>DICTYBASE|DDB_G0274385
            symbol:DDB_G0274385 "Cysteine proteinase 1,
            mitochondrial" species:44689 "Dictyostelium discoideum" [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0274385 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 EMBL:AAFI02000012 RefSeq:XP_644301.1
            ProteinModelPortal:Q86KD4 EnsemblProtists:DDB0167535 GeneID:8619729
            KEGG:ddi:DDB_G0274385 InParanoid:Q86KD4 OMA:SICVDAS Uniprot:Q86KD4
        Length = 358

 Score = 240 (89.5 bits), Expect = 5.7e-30, Sum P(2) = 5.7e-30
 Identities = 71/219 (32%), Positives = 107/219 (48%)

Query:    45 WSEGYPQKY-DPQSMEERFENWLKQYSR--EYGSEDEWQRRFGIYS-SNVQYIDYINSQ- 99
             W++ + + Y D   ME RF N+ +   +  E  S    + +F     S++   ++ N   
Sbjct:    47 WAKKHSKIYKDSIEMENRFSNFKENMKKNIELNSMHAGKAKFESNGFSDLSEEEFSNFHL 106

Query:   100 NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKD 159
             N +FK   +   +    +    +   N  Y E     +  L    S+DWRK+G VTPVKD
Sbjct:   107 NKAFKGKPSHLRNSIKPQPTPHHSLING-YKEMENGDLNEL---YSIDWRKKGLVTPVKD 162

Query:   160 QGQCGSCWAFSAVAAVEGINKLKTG-KLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFI 218
             QGQCGSC+ FSAV  +E    +K G K + LSEQ+ VDCD   + Q C GG     +E+ 
Sbjct:   163 QGQCGSCYIFSAVEQIETA-WIKAGNKPILLSEQQAVDCDPY-DGQ-CGGGDPYTVYEYF 219

Query:   219 TKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI 257
             +++GGV+T   YPY   +  C        AV +  Y  +
Sbjct:   220 SQVGGVSTNAQYPYTATDGTCVN---MSRAVPVVSYHYV 255

 Score = 128 (50.1 bits), Expect = 5.7e-30, Sum P(2) = 5.7e-30
 Identities = 28/77 (36%), Positives = 43/77 (55%)

Query:   263 FQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGE-----KYWLVKNSWGTSWGEAGYIRMAR 317
             +Q YS G+    CG  ++H V VVG   D  +     +Y++++NSWGT WG  GYI +A 
Sbjct:   283 WQSYSGGIITTGCGKNIDHCVQVVGLEVDKTDPSNPVQYYIIRNSWGTDWGIDGYIYVAT 342

Query:   318 NSPSSNIGICGILMQAS 334
              S      +CGI  +++
Sbjct:   343 GSD-----LCGITYEST 354

 Score = 108 (43.1 bits), Expect = 1.4e-12, Sum P(2) = 1.4e-12
 Identities = 32/114 (28%), Positives = 57/114 (50%)

Query:    21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQK----YDPQSMEERFENWLKQYSREYGSE 76
             M+++L   +L +F+     I     ++GY +     +   SM + F +W K++S+ Y   
Sbjct:     1 MKLLLCLIIL-VFICLTNAININV-NQGYHRNDGIIHSDSSMRDTFNHWAKKHSKIYKDS 58

Query:    77 DEWQRRFGIYSSNVQYIDYINSQNLS-FKLTDNKFADLSNEEFISTYLGYNKPY 129
              E + RF  +  N++    +NS +    K   N F+DLS EEF + +L  NK +
Sbjct:    59 IEMENRFSNFKENMKKNIELNSMHAGKAKFESNGFSDLSEEEFSNFHL--NKAF 110


>WB|WBGene00011102
            symbol:R07E3.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            EMBL:Z49207 HSSP:P53634 PIR:T24030 RefSeq:NP_001041280.1
            ProteinModelPortal:Q21810 SMR:Q21810 STRING:Q21810 MEROPS:C01.A43
            PaxDb:Q21810 EnsemblMetazoa:R07E3.1a GeneID:181242
            KEGG:cel:CELE_R07E3.1 UCSC:R07E3.1a CTD:181242 WormBase:R07E3.1a
            HOGENOM:HOG000021028 InParanoid:Q21810 OMA:ACKNEVI NextBio:913066
            ArrayExpress:Q21810 Uniprot:Q21810
        Length = 402

 Score = 236 (88.1 bits), Expect = 6.7e-30, Sum P(2) = 6.7e-30
 Identities = 64/193 (33%), Positives = 100/193 (51%)

Query:    56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL--SFKLTDNKFADL 113
             Q++ + +  + +++ + Y +  E  +R   Y +  + I   N QN   S +   N  +D 
Sbjct:    84 QNIAKEYIAYTEKFDKSYATSQESLKRLNAYYNTDENIANWNIQNEHGSAEYGHNDMSDW 143

Query:   114 SNEEFISTYLG---YNKPYNEPRW--PSVQYL----G-----LPASVDWRKEGAVTPVKD 159
             ++EEF  T L    Y + + E  +  P  + L    G      P   DWR +  +TPVK 
Sbjct:   144 TDEEFEKTLLPKSFYKRLHKEAEFIEPIPESLTAKKGESSSPFPDFFDWRDKNVITPVKA 203

Query:   160 QGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFIT 219
             QGQCGSCWAF++ A VE    +  G+  +LSEQ L+DCD+  +N  C+GG  +KAF +I 
Sbjct:   204 QGQCGSCWAFASTATVEAAWAIAHGEKRNLSEQTLLDCDL-VDN-ACDGGDEDKAFRYIH 261

Query:   220 KIGGVTTEDDYPY 232
             +  G+    D PY
Sbjct:   262 R-NGLANAVDLPY 273

 Score = 142 (55.0 bits), Expect = 6.7e-30, Sum P(2) = 6.7e-30
 Identities = 34/71 (47%), Positives = 41/71 (57%)

Query:   266 YSHGVF--DEY-CGHQLN--HGVTVVGYGEDH-GEKYWLVKNSWGTSWG-EAGYIRMARN 318
             Y  GVF   EY C +++   H + + GYG    GEKYW+VKNSWG +WG E GYI  AR 
Sbjct:   328 YKGGVFTPSEYACKNEVIGLHALLITGYGTSKTGEKYWIVKNSWGNTWGVEHGYIYFARG 387

Query:   319 SPSSNIGICGI 329
                  I  CGI
Sbjct:   388 -----INACGI 393

 Score = 43 (20.2 bits), Expect = 1.5e-19, Sum P(2) = 1.5e-19
 Identities = 12/33 (36%), Positives = 16/33 (48%)

Query:   222 GGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
             GGV T  +Y        C+ +    HA+ ITGY
Sbjct:   330 GGVFTPSEYA-------CKNEVIGLHALLITGY 355


>UNIPROTKB|E9PI30
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            EMBL:AP001201 HGNC:HGNC:2546 IPI:IPI00984532
            ProteinModelPortal:E9PI30 SMR:E9PI30 Ensembl:ENST00000528419
            ArrayExpress:E9PI30 Bgee:E9PI30 Uniprot:E9PI30
        Length = 364

 Score = 262 (97.3 bits), Expect = 2.3e-29, Sum P(2) = 2.3e-29
 Identities = 74/227 (32%), Positives = 112/227 (49%)

Query:    32 LFLLWVLGIPAGAWSEGYPQKYDPQSME--ERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
             L  L V G+  G       Q   PQ +E  E F+ +  Q++R Y S +E   R  I++ N
Sbjct:    10 LLALLVAGLAQGIRGPLRAQDLGPQPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHN 69

Query:    90 VQYIDYINSQNLS---FKLTDNKFADLSNEEFISTYLGYNK-----PY--NEPRWPSVQY 139
             +     +  ++L    F +T   F+DL+ EEF   Y GY +     P    E R    + 
Sbjct:    70 LAQAQRLQEEDLGTAEFGVTP--FSDLTEEEFGQLY-GYRRAAGGVPSMGREIRSEEPEE 126

Query:   140 LGLPASVDWRK-EGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD 198
               +P S DWRK   A++P+KDQ  C  CWA +A   +E + ++     V +S QEL+DC 
Sbjct:   127 -SVPFSCDWRKVASAISPIKDQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVQELLDCG 185

Query:   199 VNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK--NDRCQTDK 243
                +  GC+GG++  AF  +    G+ +E DYP++GK    RC   K
Sbjct:   186 RCGD--GCHGGFVWDAFITVLNNSGLASEKDYPFQGKVRAHRCHPKK 230

 Score = 79 (32.9 bits), Expect = 2.3e-29, Sum P(2) = 2.3e-29
 Identities = 11/18 (61%), Positives = 13/18 (72%)

Query:   292 HGEKYWLVKNSWGTSWGE 309
             H   YW++KNSWG  WGE
Sbjct:   322 HPTPYWILKNSWGAQWGE 339

 Score = 48 (22.0 bits), Expect = 4.0e-26, Sum P(2) = 4.0e-26
 Identities = 15/39 (38%), Positives = 19/39 (48%)

Query:   264 QLYSHGVFDEY---CGHQL-NHGVTVVGYGEDHGEK-YW 297
             QLY  GV       C  QL +H V +VG+G    E+  W
Sbjct:   270 QLYRKGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEGIW 308


>DICTYBASE|DDB_G0281079
            symbol:DDB_G0281079 species:44689 "Dictyostelium
            discoideum" [GO:0030246 "carbohydrate binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR008965 InterPro:IPR013128 InterPro:IPR019028
            Pfam:PF00112 Pfam:PF09478 PRINTS:PR00705 SMART:SM00645
            SMART:SM01063 InterPro:IPR000169 dictyBase:DDB_G0281079
            GO:GO:0030246 EMBL:AAFI02000040 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 SUPFAM:SSF49384 RefSeq:XP_640804.1
            ProteinModelPortal:Q54UH2 EnsemblProtists:DDB0204000 GeneID:8622858
            KEGG:ddi:DDB_G0281079 InParanoid:Q54UH2 OMA:ALESHYY
            ProtClustDB:CLSZ2430562 Uniprot:Q54UH2
        Length = 664

 Score = 258 (95.9 bits), Expect = 7.3e-28, Sum P(2) = 7.3e-28
 Identities = 59/165 (35%), Positives = 89/165 (53%)

Query:    79 WQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSV 137
             W+    + SS+     YI  + +++  L+ NK    S+    S     N   +EP    +
Sbjct:   413 WKNSIEVGSSHT--FGYIQKAYSINPLLSVNKVCQESSSSSSS-----NITTDEPSKSRL 465

Query:   138 QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDC 197
                  P S+DWR  G V+ VK+QG CGSC+AFS V A+E     K  +++ LSEQ LVDC
Sbjct:   466 LKWSRPISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALESHYYRKNNRMLDLSEQNLVDC 525

Query:   198 DVNSE--NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQ 240
               +++  N GC+GG+M   + +I + GG+  E  YPY GK  +C+
Sbjct:   526 TASNKYRNGGCSGGWMHNCYSYIQENGGINQESTYPYEGKFGQCR 570

 Score = 87 (35.7 bits), Expect = 7.3e-28, Sum P(2) = 7.3e-28
 Identities = 15/40 (37%), Positives = 24/40 (60%)

Query:   263 FQLYSHGVF-DEYCG-HQLNHGVTVVGYGEDHGEKYWLVK 300
             F  YS G++  + C  ++  H V VVGY  ++G  YW++K
Sbjct:   615 FMYYSRGIYYSDNCNKYRTTHAVVVVGYDNENGVDYWIIK 654

 Score = 80 (33.2 bits), Expect = 0.00025, Sum P(2) = 0.00025
 Identities = 15/66 (22%), Positives = 39/66 (59%)

Query:    59 EERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN--SQNLSFKLTDNKFADLSNE 116
             +  F  W  Q++R Y + D++  ++  +  + ++I+     +QN + +L   +F+D++++
Sbjct:   158 QNSFIQWSNQFNRTYRA-DQFLLKYEAFKDSSRFIEQYKRENQNSTMELGLTQFSDMTHD 216

Query:   117 EFISTY 122
             EF++ Y
Sbjct:   217 EFLNVY 222

 Score = 46 (21.3 bits), Expect = 1.4e-23, Sum P(2) = 1.4e-23
 Identities = 13/33 (39%), Positives = 17/33 (51%)

Query:   225 TTEDDYPYRG--KNDRCQTDKTKHHAVTITGYE 255
             T E  Y  RG   +D C   +T H AV + GY+
Sbjct:   612 TREFMYYSRGIYYSDNCNKYRTTH-AVVVVGYD 643


>DICTYBASE|DDB_G0281077
            symbol:DDB_G0281077 species:44689 "Dictyostelium
            discoideum" [GO:0030246 "carbohydrate binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR008965 InterPro:IPR013128 InterPro:IPR019028
            Pfam:PF00112 Pfam:PF09478 PRINTS:PR00705 SMART:SM00645
            SMART:SM01063 InterPro:IPR000169 dictyBase:DDB_G0281077
            GO:GO:0030246 EMBL:AAFI02000040 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 SUPFAM:SSF49384 ProtClustDB:CLSZ2430562
            RefSeq:XP_640803.1 ProteinModelPortal:Q54UH3
            EnsemblProtists:DDB0203998 GeneID:8622857 KEGG:ddi:DDB_G0281077
            InParanoid:Q54UH3 OMA:LINDFNF Uniprot:Q54UH3
        Length = 662

 Score = 252 (93.8 bits), Expect = 4.1e-27, Sum P(2) = 4.1e-27
 Identities = 58/163 (35%), Positives = 86/163 (52%)

Query:    79 WQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSV 137
             W+    + SS+     YI  + +++  L+ NK    S+    S     N   +EP    +
Sbjct:   414 WKNSIEVGSSHT--FGYIQKAYSINPLLSVNKVCQESSSSSSS-----NITTDEPSKSRL 466

Query:   138 QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDC 197
                  P S+DWR  G V+ VK+QG CGSC+AFS V A+E     K  ++++LSEQ LVDC
Sbjct:   467 LKWSRPISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALEAHYYRKNNRMLNLSEQNLVDC 526

Query:   198 DVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQ 240
               N  N  C+GG+M   F +I + GG+  +  YPY G+   C+
Sbjct:   527 TRNYGNGECSGGWMHNCFRYIKENGGINLQSTYPYEGRVGLCR 569

 Score = 86 (35.3 bits), Expect = 4.1e-27, Sum P(2) = 4.1e-27
 Identities = 15/40 (37%), Positives = 26/40 (65%)

Query:   263 FQLYSHGVFD-EYCG-HQLNHGVTVVGYGEDHGEKYWLVK 300
             F  YS G+++ + C  ++  H V VVGYG ++G  +W++K
Sbjct:   614 FMYYSSGIYNSDSCDKYRTTHAVVVVGYGIENGVDFWIIK 653

 Score = 79 (32.9 bits), Expect = 0.00039, Sum P(2) = 0.00039
 Identities = 15/66 (22%), Positives = 39/66 (59%)

Query:    59 EERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN--SQNLSFKLTDNKFADLSNE 116
             +  F  W  Q++R Y + D++  ++  +  + ++I+     +QN + +L   +F+D++++
Sbjct:   159 QNSFIQWSNQFNRTYRA-DQFLLKYEAFKDSSRFIEQYKRENQNSTMELGLTQFSDMTHD 217

Query:   117 EFISTY 122
             EF++ Y
Sbjct:   218 EFLNIY 223


>UNIPROTKB|F1NWG2
            symbol:CTSC "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 InterPro:IPR014882 Pfam:PF08773
            OMA:YDDFLHY GO:GO:0001913 EMBL:AADN02004805 IPI:IPI00577371
            Ensembl:ENSGALT00000027869 Uniprot:F1NWG2
        Length = 463

 Score = 189 (71.6 bits), Expect = 2.5e-26, Sum P(2) = 2.5e-26
 Identities = 56/183 (30%), Positives = 93/183 (50%)

Query:    81 RRFGIYSSNVQYIDYINSQNLSFKLTD-NKFADLSNEEFI----STYLGYNKPYNEPRWP 135
             RRF ++  N  +++ IN+   S++ T   ++ + S EE        Y   ++P   P  P
Sbjct:   166 RRF-VH--NFDFVNAINAHQKSWRATRYEEYENFSLEELTRRAGGLYSRTSRPKPAPLTP 222

Query:   136 SV--QYLGLPASVDWRK-EGA--VTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS-- 188
              +  +  GLP S DWR   G   V+PV++Q  CGSC+AF+++  +E   ++ T       
Sbjct:   223 ELLKKVSGLPESWDWRNVNGVNYVSPVRNQASCGSCYAFASMGMLEARIRILTNNTQKPV 282

Query:   189 LSEQELVDCDVNSENQGCNGGYMEK-AFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHH 247
              S Q++V C   S  QGC+GG+    A +++   G V  ED +PY  K+  C   ++ +H
Sbjct:   283 FSPQQVVSCSQYS--QGCDGGFPYLIAGKYVQDFG-VVEEDCFPYTAKDTPCLFKRSCYH 339

Query:   248 AVT 250
               T
Sbjct:   340 YYT 342

 Score = 174 (66.3 bits), Expect = 2.5e-26, Sum P(2) = 2.5e-26
 Identities = 34/63 (53%), Positives = 44/63 (69%)

Query:   261 YAFQLYSH-GVFDEYCGHQL-NHGVTVVGYGED--HGEKYWLVKNSWGTSWGEAGYIRMA 316
             Y   +Y H G+ DE+   +L NH V +VGYG+D   GEK+W+VKNSWGTSWGE GY R+ 
Sbjct:   383 YKEGIYHHTGLKDEFNPFELTNHAVLLVGYGKDPESGEKFWIVKNSWGTSWGEDGYFRIR 442

Query:   317 RNS 319
             R +
Sbjct:   443 RGT 445


>UNIPROTKB|E9PKT6
            symbol:CTSH "Cathepsin H" species:9606 "Homo sapiens"
            [GO:0001520 "outer dense fiber" evidence=IEA] [GO:0001656
            "metanephros development" evidence=IEA] [GO:0001669 "acrosomal
            vesicle" evidence=IEA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0007283 "spermatogenesis" evidence=IEA] [GO:0008284 "positive
            regulation of cell proliferation" evidence=IEA] [GO:0010634
            "positive regulation of epithelial cell migration" evidence=IEA]
            [GO:0016505 "apoptotic protease activator activity" evidence=IEA]
            [GO:0030984 "kininogen binding" evidence=IEA] [GO:0031638 "zymogen
            activation" evidence=IEA] [GO:0031648 "protein destabilization"
            evidence=IEA] [GO:0032403 "protein complex binding" evidence=IEA]
            [GO:0032526 "response to retinoic acid" evidence=IEA] [GO:0033619
            "membrane protein proteolysis" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0043621 "protein
            self-association" evidence=IEA] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IEA] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            InterPro:IPR000169 GO:GO:0043066 GO:GO:0008284 PANTHER:PTHR12411
            PROSITE:PS00139 GO:GO:0045766 GO:GO:0004252 GO:GO:0032526
            GO:GO:0016505 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 GO:GO:0060448 GO:GO:0033619
            EMBL:AC011944 HGNC:HGNC:2535 IPI:IPI00375426
            ProteinModelPortal:E9PKT6 SMR:E9PKT6 PRIDE:E9PKT6
            Ensembl:ENST00000528741 ArrayExpress:E9PKT6 Bgee:E9PKT6
            Uniprot:E9PKT6
        Length = 134

 Score = 296 (109.3 bits), Expect = 3.2e-26, P = 3.2e-26
 Identities = 62/132 (46%), Positives = 82/132 (62%)

Query:   108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGL---PASVDWRKEGA-VTPVKDQGQC 163
             N+F+D+S  E    YL +++P N     S    G    P SVDWRK+G  V+PVK+QG C
Sbjct:     4 NQFSDMSFAEIKHKYL-WSEPQNCSATKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGAC 62

Query:   164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
             GSCW FS   A+E    + TGK++SL+EQ+LVDC  +  N GC GG   +AFE+I    G
Sbjct:    63 GSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKG 122

Query:   224 VTTEDDYPYRGK 235
             +  ED YPY+GK
Sbjct:   123 IMGEDTYPYQGK 134


>WB|WBGene00012747
            symbol:Y40H7A.10 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000230773 EMBL:AL033510
            HSSP:P80067 MEROPS:C01.A48 PIR:T26792 RefSeq:NP_502836.1
            ProteinModelPortal:Q9XWA4 SMR:Q9XWA4 STRING:Q9XWA4
            EnsemblMetazoa:Y40H7A.10 GeneID:189809 KEGG:cel:CELE_Y40H7A.10
            UCSC:Y40H7A.10 CTD:189809 WormBase:Y40H7A.10 eggNOG:NOG286423
            InParanoid:Q9XWA4 OMA:NGPMIVC NextBio:943702 Uniprot:Q9XWA4
        Length = 343

 Score = 290 (107.1 bits), Expect = 1.4e-25, P = 1.4e-25
 Identities = 74/188 (39%), Positives = 107/188 (56%)

Query:    62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTD--NKFADLSNEEFI 119
             F+N+L +Y REY +E E  +RF I+S N+  ++  N ++   K+T   N F+DL+ EE+ 
Sbjct:    51 FQNFLVKYLREYPNEYEIVKRFTIFSRNLDLVERYNKEDAG-KVTYELNDFSDLTEEEW- 108

Query:   120 STYLGYNKP-YNEPRW-PS--VQYLGLPASVDWRK-EGA--VTPVKDQGQCGSCWAFSAV 172
               YL   KP ++E    P   +    LP SVDWR   G   VT +K QG CGSCWAF+  
Sbjct:   109 KKYLMTPKPDHSEKSLKPKTLIDKKNLPNSVDWRNVNGTNHVTGIKYQGPCGSCWAFATA 168

Query:   173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
             AA+E    +  G L SLS Q+L+DC V S+   C GG   +A ++  +  G+TT  +YPY
Sbjct:   169 AAIESAVSISGGGLQSLSSQQLLDCTVVSDK--CGGGEPVEALKY-AQSHGITTAHNYPY 225

Query:   233 RGKNDRCQ 240
                  +C+
Sbjct:   226 YFWTTKCR 233

 Score = 286 (105.7 bits), Expect = 3.6e-25, P = 3.6e-25
 Identities = 74/211 (35%), Positives = 107/211 (50%)

Query:   142 LPASVDWRK-EGA--VTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD 198
             LP SVDWR   G   VT +K QG CGSCWAF+  AA+E    +  G L SLS Q+L+DC 
Sbjct:   135 LPNSVDWRNVNGTNHVTGIKYQGPCGSCWAFATAAAIESAVSISGGGLQSLSSQQLLDCT 194

Query:   199 VNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRC----------------QTD 242
             V S+   C GG   +A ++  +  G+TT  +YPY     +C                +++
Sbjct:   195 VVSDK--CGGGEPVEALKY-AQSHGITTAHNYPYYFWTTKCRETVPTVARISSWMKAESE 251

Query:   243 KTKHHAVTITGYEAIPARYAF---QLYSHGVF-DEYCGHQLNHGVTVVGYGEDHGEKYWL 298
                   V + G   + A +A    + Y  G+  D  CG +  H + V+GYG D    YW+
Sbjct:   252 DEMAQIVALNGPMIVCANFATNKNRFYHSGIAEDPDCGTEPTHALIVIGYGPD----YWI 307

Query:   299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
             +KN++   WGE GY+R+ R+     +  CGI
Sbjct:   308 LKNTYSKVWGEKGYMRVKRD-----VNWCGI 333


>WB|WBGene00044760
            symbol:Y71H2AM.25 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0004181
            "metallocarboxypeptidase activity" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            GeneTree:ENSGT00560000076599 EMBL:FO081822 eggNOG:NOG331187
            HOGENOM:HOG000114005 RefSeq:NP_001040887.1
            ProteinModelPortal:Q2AAB9 SMR:Q2AAB9 EnsemblMetazoa:Y71H2AM.25
            GeneID:4363054 KEGG:cel:CELE_Y71H2AM.25 UCSC:Y71H2AM.25 CTD:4363054
            WormBase:Y71H2AM.25 InParanoid:Q2AAB9 NextBio:959635 Uniprot:Q2AAB9
        Length = 299

 Score = 199 (75.1 bits), Expect = 3.2e-25, Sum P(2) = 3.2e-25
 Identities = 43/102 (42%), Positives = 60/102 (58%)

Query:   146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGIN-KLKTGKLVSLSEQELVDCDVNSENQ 204
             +DWR +G V PVKDQG+C +  AF+  +++E +  K   G L+S SEQ+L+DCD +   +
Sbjct:    86 LDWRDKGIVGPVKDQGKCNASHAFAISSSIESMYAKATNGSLLSFSEQQLIDCDDHGF-K 144

Query:   205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGK-NDRCQTDKTK 245
             GC       A  +     G+ TE DYPY GK N +C  D TK
Sbjct:   145 GCEEQPAINAVSYFI-FHGIETEADYPYAGKENGKCTFDSTK 185

 Score = 136 (52.9 bits), Expect = 3.2e-25, Sum P(2) = 3.2e-25
 Identities = 26/65 (40%), Positives = 43/65 (66%)

Query:   256 AIPARYAFQLYSHGVFDEYCG--HQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYI 313
             A P+ Y +++  +    E C   H++   + +VGYG +  +KYW+VK S+GTSWGE GY+
Sbjct:   219 APPSLYDYKIGIYNPSIEECTSTHEIR-SMVIVGYGIEGVQKYWIVKGSFGTSWGEQGYM 277

Query:   314 RMARN 318
             ++AR+
Sbjct:   278 KLARD 282


>WB|WBGene00022189
            symbol:Y71H2AR.2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0004181
            "metallocarboxypeptidase activity" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] [GO:0008340 "determination of adult lifespan"
            evidence=IMP] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0008340 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 HSSP:P07711
            eggNOG:NOG331187 HOGENOM:HOG000114005 EMBL:FO081570
            RefSeq:NP_497627.1 UniGene:Cel.28419 ProteinModelPortal:Q9BL26
            SMR:Q9BL26 EnsemblMetazoa:Y71H2AR.2 GeneID:190615
            KEGG:cel:CELE_Y71H2AR.2 UCSC:Y71H2AR.2 CTD:190615
            WormBase:Y71H2AR.2 InParanoid:Q9BL26 OMA:CAMATTI NextBio:946382
            Uniprot:Q9BL26
        Length = 345

 Score = 201 (75.8 bits), Expect = 8.6e-25, Sum P(2) = 8.6e-25
 Identities = 41/102 (40%), Positives = 63/102 (61%)

Query:   146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGIN-KLKTGKLVSLSEQELVDCDVNSENQ 204
             +DWR++G V PVKDQG+C +  AF+  +++E +  K   G L+S SEQ+L+DC+ +   +
Sbjct:    86 LDWREKGIVGPVKDQGKCNASHAFAITSSIESMYAKATNGTLLSFSEQQLIDCN-DQGYK 144

Query:   205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGK-NDRCQTDKTK 245
             GC   +   A  ++    G+ TE DYPY  K N++C  D TK
Sbjct:   145 GCEEQFAMNAIGYLAT-HGIETEADYPYVDKTNEKCTFDSTK 185

 Score = 137 (53.3 bits), Expect = 8.6e-25, Sum P(2) = 8.6e-25
 Identities = 26/65 (40%), Positives = 43/65 (66%)

Query:   256 AIPARYAFQLYSHGVFDEYCG--HQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYI 313
             A P+ Y +++  +    E C   H++   + +VGYG +  +KYW+VK S+GTSWGE GY+
Sbjct:   219 APPSLYDYKIGIYNPSIEECTSTHEIR-SMVIVGYGIEGEQKYWIVKGSFGTSWGEQGYM 277

Query:   314 RMARN 318
             ++AR+
Sbjct:   278 KLARD 282


>WB|WBGene00013764
            symbol:Y113G7B.15 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 GeneTree:ENSGT00560000076599
            EMBL:AL110477 HOGENOM:HOG000019851 RefSeq:NP_507904.2
            ProteinModelPortal:Q9U2X1 SMR:Q9U2X1 DIP:DIP-25339N IntAct:Q9U2X1
            MINT:MINT-1058673 STRING:Q9U2X1 MEROPS:C01.A47
            EnsemblMetazoa:Y113G7B.15 GeneID:190976 KEGG:cel:CELE_Y113G7B.15
            UCSC:Y113G7B.15 CTD:190976 WormBase:Y113G7B.15 eggNOG:NOG302449
            OMA:AEEDIME Uniprot:Q9U2X1
        Length = 362

 Score = 174 (66.3 bits), Expect = 6.7e-24, Sum P(3) = 6.7e-24
 Identities = 33/79 (41%), Positives = 45/79 (56%)

Query:   154 VTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEK 213
             V PVKDQ QCG CWAF+  A  E  N L +    SLS+QE+ DC  + +  GC GG    
Sbjct:   148 VGPVKDQEQCGCCWAFATTAITEAANTLYSKSFTSLSDQEICDCADSGDTPGCVGGDPRN 207

Query:   214 AFEFITKIGGVTTEDDYPY 232
               + +  + G +++ DYPY
Sbjct:   208 GLKMV-HLRGQSSDGDYPY 225

 Score = 132 (51.5 bits), Expect = 6.7e-24, Sum P(3) = 6.7e-24
 Identities = 27/61 (44%), Positives = 37/61 (60%)

Query:   263 FQLYSHGVFD-EYCGHQLN----HGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMA 316
             F+ Y+ GV   E C +Q+     H V +VGYG  D G  YWLV+NSW + WG  GY+++ 
Sbjct:   285 FEWYTSGVLQSEDC-YQMTPAEWHSVAIVGYGTSDDGVPYWLVRNSWNSDWGLHGYVKIR 343

Query:   317 R 317
             R
Sbjct:   344 R 344

 Score = 65 (27.9 bits), Expect = 6.7e-24, Sum P(3) = 6.7e-24
 Identities = 19/68 (27%), Positives = 33/68 (48%)

Query:    56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ------NLSFKLTDNK 109
             Q +   F N+   + + Y +  E  RR   ++ N Q I  +N++      N++F    NK
Sbjct:    24 QEVLSHFNNFTMHHKKHYRTPAEKDRRLAHFAKNHQKIQELNAKARREGRNVTFGW--NK 81

Query:   110 FADLSNEE 117
             FAD + +E
Sbjct:    82 FADKNRQE 89


>UNIPROTKB|F1N455
            symbol:CTSC "Dipeptidyl peptidase 1 exclusion domain chain"
            species:9913 "Bos taurus" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 IPI:IPI00697314 UniGene:Bt.49573
            InterPro:IPR014882 Pfam:PF08773 OMA:YDDFLHY GO:GO:0001913
            EMBL:DAAA02062487 EMBL:DAAA02062488 Ensembl:ENSBTAT00000014735
            Uniprot:F1N455
        Length = 463

 Score = 181 (68.8 bits), Expect = 7.2e-24, Sum P(2) = 7.2e-24
 Identities = 57/190 (30%), Positives = 92/190 (48%)

Query:    63 ENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDN-KFADLSNEEFIST 121
             EN     +R  G E+ +  R  +Y  N  ++  IN+   S+      ++  L+ +E I  
Sbjct:   147 ENVNVNTARLAGLEETYSNR--LYRYNHDFVKAINAIQKSWTAAPYMEYETLTLKEMIRR 204

Query:   122 YLGYNK----PYNEPRWPSVQ--YLGLPASVDWRK-EGA--VTPVKDQGQCGSCWAFSAV 172
               G+++    P   P    +Q   L LP S DWR   G   VTPV++QG CGSC++F+++
Sbjct:   205 GGGHSRRIPRPKPAPITAEIQKKILHLPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASM 264

Query:   173 AAVEGINKLKTGKLVS--LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
               +E   ++ T    +  LS QE+V C   +  QGC GG+         +  G+  ED +
Sbjct:   265 GMMEARIRILTNNTQTPILSPQEVVSCSQYA--QGCEGGFPYLIAGKYAQDFGLVEEDCF 322

Query:   231 PYRGKNDRCQ 240
             PY G +  C+
Sbjct:   323 PYTGTDSPCR 332

 Score = 160 (61.4 bits), Expect = 7.2e-24, Sum P(2) = 7.2e-24
 Identities = 37/94 (39%), Positives = 48/94 (51%)

Query:   232 YRGKNDRCQTDKTKHHAVTITGYEAIP--ARYAFQLYSH-GVFDEYCGHQL-NHGVTVVG 287
             Y G N+     +  H       +E       Y   +Y H G+ D +   +L NH V +VG
Sbjct:   352 YGGCNEALMKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLRDPFNPFELTNHAVLLVG 411

Query:   288 YGED--HGEKYWLVKNSWGTSWGEAGYIRMARNS 319
             YG D   G  YW+VKNSWGTSWGE GY R+ R +
Sbjct:   412 YGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGT 445

 Score = 38 (18.4 bits), Expect = 3.5e-11, Sum P(2) = 3.5e-11
 Identities = 8/25 (32%), Positives = 12/25 (48%)

Query:   230 YPYRGKNDRCQTDKTKHHAVTITGY 254
             Y + G  D     +  +HAV + GY
Sbjct:   388 YHHTGLRDPFNPFELTNHAVLLVGY 412


>UNIPROTKB|Q3ZCJ8
            symbol:CTSC "Dipeptidyl peptidase 1" species:9913 "Bos
            taurus" [GO:0031638 "zymogen activation" evidence=IDA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 EMBL:BC102115 IPI:IPI00697314 RefSeq:NP_001028789.1
            UniGene:Bt.49573 ProteinModelPortal:Q3ZCJ8 SMR:Q3ZCJ8 STRING:Q3ZCJ8
            PRIDE:Q3ZCJ8 GeneID:352958 KEGG:bta:352958 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 InParanoid:Q3ZCJ8 KO:K01275
            OrthoDB:EOG4H19VZ BindingDB:Q3ZCJ8 ChEMBL:CHEMBL1075050
            NextBio:20812686 GO:GO:0031638 InterPro:IPR014882 Pfam:PF08773
            Uniprot:Q3ZCJ8
        Length = 463

 Score = 181 (68.8 bits), Expect = 7.2e-24, Sum P(2) = 7.2e-24
 Identities = 57/190 (30%), Positives = 92/190 (48%)

Query:    63 ENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDN-KFADLSNEEFIST 121
             EN     +R  G E+ +  R  +Y  N  ++  IN+   S+      ++  L+ +E I  
Sbjct:   147 ENVNVNTARLAGLEETYSNR--LYRYNHDFVKAINAIQKSWTAAPYMEYETLTLKEMIRR 204

Query:   122 YLGYNK----PYNEPRWPSVQ--YLGLPASVDWRK-EGA--VTPVKDQGQCGSCWAFSAV 172
               G+++    P   P    +Q   L LP S DWR   G   VTPV++QG CGSC++F+++
Sbjct:   205 GGGHSRRIPRPKPAPITAEIQKKILHLPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASM 264

Query:   173 AAVEGINKLKTGKLVS--LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
               +E   ++ T    +  LS QE+V C   +  QGC GG+         +  G+  ED +
Sbjct:   265 GMMEARIRILTNNTQTPILSPQEVVSCSQYA--QGCEGGFPYLIAGKYAQDFGLVEEDCF 322

Query:   231 PYRGKNDRCQ 240
             PY G +  C+
Sbjct:   323 PYTGTDSPCR 332

 Score = 160 (61.4 bits), Expect = 7.2e-24, Sum P(2) = 7.2e-24
 Identities = 37/94 (39%), Positives = 48/94 (51%)

Query:   232 YRGKNDRCQTDKTKHHAVTITGYEAIP--ARYAFQLYSH-GVFDEYCGHQL-NHGVTVVG 287
             Y G N+     +  H       +E       Y   +Y H G+ D +   +L NH V +VG
Sbjct:   352 YGGCNEALMKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLRDPFNPFELTNHAVLLVG 411

Query:   288 YGED--HGEKYWLVKNSWGTSWGEAGYIRMARNS 319
             YG D   G  YW+VKNSWGTSWGE GY R+ R +
Sbjct:   412 YGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGT 445

 Score = 38 (18.4 bits), Expect = 3.5e-11, Sum P(2) = 3.5e-11
 Identities = 8/25 (32%), Positives = 12/25 (48%)

Query:   230 YPYRGKNDRCQTDKTKHHAVTITGY 254
             Y + G  D     +  +HAV + GY
Sbjct:   388 YHHTGLRDPFNPFELTNHAVLLVGY 412


>ZFIN|ZDB-GENE-030619-9
            symbol:ctsc "cathepsin C" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030619-9 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 MEROPS:C01.070 HSSP:P43235
            EMBL:BC064286 IPI:IPI00486570 RefSeq:NP_999887.1 UniGene:Dr.32463
            ProteinModelPortal:Q6P2V1 SMR:Q6P2V1 PRIDE:Q6P2V1 GeneID:368704
            KEGG:dre:368704 InParanoid:Q6P2V1 NextBio:20813127
            ArrayExpress:Q6P2V1 Bgee:Q6P2V1 Uniprot:Q6P2V1
        Length = 455

 Score = 183 (69.5 bits), Expect = 9.8e-24, Sum P(2) = 9.8e-24
 Identities = 55/177 (31%), Positives = 85/177 (48%)

Query:    86 YSSNVQYIDYINSQNLSFKLTDNKFAD-LSNEEFISTYLGYNKPYNEPRWP------SVQ 138
             Y++N+ ++D INS   S+  T   F + LS  E +    G          P      S  
Sbjct:   161 YTNNMMFVDEINSVQKSWTATAYSFHETLSIHEMLRRSGGPASRIPRRVRPVTVAADSKA 220

Query:   139 YLGLPASVDWRK-EGA--VTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS--LSEQE 193
               GLP   DWR   G   V+PV++Q QCGSC++F+ +  +E   +++T        S Q+
Sbjct:   221 ASGLPQHWDWRNVNGVNFVSPVRNQAQCGSCYSFATMGMLEARVRIQTNNTQQPVFSPQQ 280

Query:   194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDK--TKHHA 248
             +V C   S  QGC+GG+     ++I   G +  ED +PY G +  C      TK++A
Sbjct:   281 VVSCSQYS--QGCDGGFPYLIGKYIQDFG-IVEEDCFPYTGSDSPCNLPAKCTKYYA 334

 Score = 156 (60.0 bits), Expect = 9.8e-24, Sum P(2) = 9.8e-24
 Identities = 31/63 (49%), Positives = 41/63 (65%)

Query:   261 YAFQLYSH-GVFDEYCGHQL-NHGVTVVGYGEDH--GEKYWLVKNSWGTSWGEAGYIRMA 316
             Y   +Y H G+ D     +L NH V +VGYG+ H  GEKYW+VKNSWG+ WGE G+ R+ 
Sbjct:   375 YKEGIYHHTGLRDANNPFELTNHAVLLVGYGQCHKTGEKYWIVKNSWGSGWGENGFFRIR 434

Query:   317 RNS 319
             R +
Sbjct:   435 RGT 437


>UNIPROTKB|F1STR1
            symbol:CTSC "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0004252
            "serine-type endopeptidase activity" evidence=IEA] [GO:0001913 "T
            cell mediated cytotoxicity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 KO:K01275 InterPro:IPR014882
            Pfam:PF08773 OMA:YDDFLHY GO:GO:0001913 EMBL:CU855751
            RefSeq:XP_003129789.1 UniGene:Ssc.6155 Ensembl:ENSSSCT00000016280
            GeneID:100522387 KEGG:ssc:100522387 Uniprot:F1STR1
        Length = 463

 Score = 171 (65.3 bits), Expect = 1.1e-23, Sum P(2) = 1.1e-23
 Identities = 53/174 (30%), Positives = 84/174 (48%)

Query:    80 QRRFG--IYSSNVQYIDYINSQNLSFKLTDN-KFADLSNEEFISTYLGYN----KPYNEP 132
             Q+++   +Y  N  ++  IN    S+  T   ++  L+ +E      GYN    +P   P
Sbjct:   160 QKKYSNRLYKYNHDFVKAINGIQKSWTATAYMEYETLTLKEMTQRGGGYNQRLPRPKPAP 219

Query:   133 RWPSVQY--LGLPASVDWRK-EGA--VTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLV 187
                 +Q   L LPAS DWR   G   VTPV++Q  CGSC++F+++  +E   ++ T    
Sbjct:   220 ITAEIQEKSLHLPASWDWRNVRGTNFVTPVRNQASCGSCYSFASMGMMEARIRILTNNTQ 279

Query:   188 S--LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRC 239
             +  LS QE+V C   +  QGC GG+         +  G+  E  +PY G +  C
Sbjct:   280 TPILSPQEVVSCSQYA--QGCAGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPC 331

 Score = 170 (64.9 bits), Expect = 1.1e-23, Sum P(2) = 1.1e-23
 Identities = 38/94 (40%), Positives = 49/94 (52%)

Query:   232 YRGKNDRCQTDKTKHHAVTITGYEAIP--ARYAFQLYSH-GVFDEYCGHQL-NHGVTVVG 287
             Y G N+     +  HH      +E       Y   +Y H G+ D +   +L NH V +VG
Sbjct:   352 YGGCNEALMKLELVHHGPMAVAFEVYDDFLHYRKGIYHHTGLRDPFNPFELTNHAVLLVG 411

Query:   288 YGED--HGEKYWLVKNSWGTSWGEAGYIRMARNS 319
             YG D   G  YW+VKNSWGTSWGE GY R+ R +
Sbjct:   412 YGTDLASGMDYWIVKNSWGTSWGEDGYFRIRRGT 445

 Score = 38 (18.4 bits), Expect = 5.7e-10, Sum P(2) = 5.7e-10
 Identities = 8/25 (32%), Positives = 12/25 (48%)

Query:   230 YPYRGKNDRCQTDKTKHHAVTITGY 254
             Y + G  D     +  +HAV + GY
Sbjct:   388 YHHTGLRDPFNPFELTNHAVLLVGY 412


>UNIPROTKB|E1BPI9
            symbol:CTSO "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 CTD:1519 KO:K01374 OMA:SNVCGIA
            EMBL:DAAA02044933 IPI:IPI01004081 RefSeq:XP_002694471.2
            RefSeq:XP_874012.4 Ensembl:ENSBTAT00000014691 GeneID:616804
            KEGG:bta:616804 Uniprot:E1BPI9
        Length = 313

 Score = 262 (97.3 bits), Expect = 1.3e-22, P = 1.3e-22
 Identities = 63/172 (36%), Positives = 91/172 (52%)

Query:    95 YINS----QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY-----LGLPAS 145
             Y+NS    +N +     N+F+ L  EEF + YL  + P   PR+P+ +Y     L LP  
Sbjct:    45 YLNSLFPYENSTAVYGINQFSYLFPEEFKAIYLR-SSPSRFPRFPAEEYTSISNLSLPLR 103

Query:   146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQG 205
              DWR +  VT V++Q  CG CWAFS V AVE +  +K   L  LS Q+++DC  +  N G
Sbjct:   104 FDWRDKHVVTQVRNQKTCGGCWAFSVVGAVESVCAIKGQPLEVLSVQQVIDCSYS--NYG 161

Query:   206 CNGGYMEKAFEFITKIGGVTTED-DYPYRGKNDRCQTDKTKHHAVTITGYEA 256
             CNGG    A  ++ K+      D +YP++ +N  C+     H   +I GY A
Sbjct:   162 CNGGSPLSALYWLNKLQVKLVRDSEYPFQAQNGLCRYFSDSHSGSSIKGYSA 213

 Score = 144 (55.7 bits), Expect = 2.1e-07, P = 2.1e-07
 Identities = 47/165 (28%), Positives = 73/165 (44%)

Query:   166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
             C   S ++A+  +NKL+  KLV  SE         ++N  C   Y   +    + I G +
Sbjct:   162 CNGGSPLSALYWLNKLQV-KLVRDSEYPF-----QAQNGLCR--YFSDSHSG-SSIKGYS 212

Query:   226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQLYSHGVFDEYCGH-QLNHGVT 284
                 Y + G     Q DK     + +     +    ++Q Y  G+   +C   + NH V 
Sbjct:   213 A---YDFSG-----QEDKMAEALLALGPLIVVVDAMSWQDYLGGIIQHHCSSGEANHAVL 264

Query:   285 VVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
             V G+ +     YW+V+NSWGTSWG  GY+R+          +CGI
Sbjct:   265 VTGFDKTGSIPYWIVRNSWGTSWGIDGYVRVKMGG-----NVCGI 304


>MGI|MGI:109553
            symbol:Ctsc "cathepsin C" species:10090 "Mus musculus"
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IGI]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IMP]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0005783 "endoplasmic
            reticulum" evidence=ISO] [GO:0005794 "Golgi apparatus"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=ISO;IMP]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0010033
            "response to organic substance" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0031404 "chloride ion
            binding" evidence=ISO] [GO:0042802 "identical protein binding"
            evidence=ISO] [GO:0043621 "protein self-association" evidence=ISO]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 MGI:MGI:109553 GO:GO:0005783
            GO:GO:0005794 GO:GO:0007568 GO:GO:0010033 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004252 GO:GO:0005764 GO:GO:0031404 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 MEROPS:C01.070 OMA:YDDFLHY
            GO:GO:0001913 EMBL:U89269 EMBL:U74683 EMBL:BC067063 IPI:IPI00130015
            RefSeq:NP_034112.3 UniGene:Mm.322945 ProteinModelPortal:P97821
            SMR:P97821 STRING:P97821 PhosphoSite:P97821 PaxDb:P97821
            PRIDE:P97821 Ensembl:ENSMUST00000032779 GeneID:13032 KEGG:mmu:13032
            InParanoid:P97821 BindingDB:P97821 ChEMBL:CHEMBL3454 ChiTaRS:CTSC
            NextBio:282904 Bgee:P97821 CleanEx:MM_CTSC Genevestigator:P97821
            Uniprot:P97821
        Length = 462

 Score = 173 (66.0 bits), Expect = 1.8e-22, Sum P(2) = 1.8e-22
 Identities = 52/179 (29%), Positives = 89/179 (49%)

Query:    74 GSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNK-FADLSNEEFISTYLGYN----KP 128
             G ++ +  R  +Y+ N  ++  IN+   S+  T  K +  +S  + I    G++    +P
Sbjct:   158 GLQERYSER--LYTHNHNFVKAINTVQKSWTATAYKEYEKMSLRDLIRRS-GHSQRIPRP 214

Query:   129 YNEPRWPSVQY--LGLPASVDWRK-EGA--VTPVKDQGQCGSCWAFSAVAAVEGINKLKT 183
                P    +Q   L LP S DWR  +G   V+PV++Q  CGSC++F+++  +E   ++ T
Sbjct:   215 KPAPMTDEIQQQILNLPESWDWRNVQGVNYVSPVRNQESCGSCYSFASMGMLEARIRILT 274

Query:   184 GKLVS--LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQ 240
                 +  LS QE+V C   +  QGC+GG+         +  GV  E  +PY  K+  C+
Sbjct:   275 NNSQTPILSPQEVVSCSPYA--QGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDSPCK 331

 Score = 156 (60.0 bits), Expect = 1.8e-22, Sum P(2) = 1.8e-22
 Identities = 34/94 (36%), Positives = 50/94 (53%)

Query:   232 YRGKNDRCQTDKTKHHAVTITGYEAIP--ARYAFQLYSH-GVFDEYCGHQL-NHGVTVVG 287
             Y G N+     +   H      +E       Y   +Y H G+ D +   +L NH V +VG
Sbjct:   351 YGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVG 410

Query:   288 YGEDH--GEKYWLVKNSWGTSWGEAGYIRMARNS 319
             YG D   G +YW++KNSWG++WGE+GY R+ R +
Sbjct:   411 YGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRGT 444

 Score = 40 (19.1 bits), Expect = 2.0e-10, Sum P(2) = 2.0e-10
 Identities = 9/29 (31%), Positives = 14/29 (48%)

Query:   230 YPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             Y + G +D     +  +HAV + GY   P
Sbjct:   387 YHHTGLSDPFNPFELTNHAVLLVGYGRDP 415


>RGD|2445
            symbol:Ctsc "cathepsin C" species:10116 "Rattus norvegicus"
          [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA;ISO]
          [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
          [GO:0004252 "serine-type endopeptidase activity" evidence=IEA;ISO]
          [GO:0005764 "lysosome" evidence=IDA;TAS] [GO:0005783 "endoplasmic
          reticulum" evidence=IDA] [GO:0005794 "Golgi apparatus" evidence=IDA]
          [GO:0006508 "proteolysis" evidence=IEP;ISO;TAS] [GO:0007568 "aging"
          evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
          evidence=ISO] [GO:0010033 "response to organic substance"
          evidence=IDA] [GO:0031404 "chloride ion binding" evidence=IDA]
          [GO:0042802 "identical protein binding" evidence=IDA] [GO:0043621
          "protein self-association" evidence=IDA] InterPro:IPR000668
          InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
          InterPro:IPR000169 RGD:2445 GO:GO:0005783 GO:GO:0005794 GO:GO:0007568
          GO:GO:0010033 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
          InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139
          PROSITE:PS00639 GO:GO:0004252 GO:GO:0005764 GO:GO:0043621
          GO:GO:0042802 GO:GO:0031404 GO:GO:0004197
          GeneTree:ENSGT00560000076599 CTD:1075 HOGENOM:HOG000068022
          HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ InterPro:IPR014882
          Pfam:PF08773 MEROPS:C01.070 OMA:YDDFLHY GO:GO:0001913 EMBL:D90404
          IPI:IPI00193765 PIR:A41158 RefSeq:NP_058793.1 UniGene:Rn.203177
          PDB:1JQP PDBsum:1JQP ProteinModelPortal:P80067 SMR:P80067
          STRING:P80067 PhosphoSite:P80067 PRIDE:P80067
          Ensembl:ENSRNOT00000022342 GeneID:25423 KEGG:rno:25423
          InParanoid:P80067 SABIO-RK:P80067 EvolutionaryTrace:P80067
          NextBio:606591 ArrayExpress:P80067 Genevestigator:P80067
          GermOnline:ENSRNOG00000016496 Uniprot:P80067
        Length = 462

 Score = 172 (65.6 bits), Expect = 1.8e-22, Sum P(2) = 1.8e-22
 Identities = 54/178 (30%), Positives = 89/178 (50%)

Query:    74 GSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTD-NKFADLSNEEFI--STYLG-YNKPY 129
             G ++++  R  +YS N  ++  INS   S+  T   ++  LS  + I  S + G   +P 
Sbjct:   158 GLQEKYSER--LYSHNHNFVKAINSVQKSWTATTYEEYEKLSIRDLIRRSGHSGRILRPK 215

Query:   130 NEPRWPSVQY--LGLPASVDWRK-EGA--VTPVKDQGQCGSCWAFSAVAAVEGINKLKTG 184
               P    +Q   L LP S DWR   G   V+PV++Q  CGSC++F+++  +E   ++ T 
Sbjct:   216 PAPITDEIQQQILSLPESWDWRNVRGINFVSPVRNQESCGSCYSFASLGMLEARIRILTN 275

Query:   185 KLVS--LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQ 240
                +  LS QE+V C   +  QGC+GG+         +  GV  E+ +PY   +  C+
Sbjct:   276 NSQTPILSPQEVVSCSPYA--QGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDAPCK 331

 Score = 157 (60.3 bits), Expect = 1.8e-22, Sum P(2) = 1.8e-22
 Identities = 40/114 (35%), Positives = 57/114 (50%)

Query:   232 YRGKNDRCQTDKTKHHAVTITGYEAIP--ARYAFQLYSH-GVFDEYCGHQL-NHGVTVVG 287
             Y G N+     +   H      +E       Y   +Y H G+ D +   +L NH V +VG
Sbjct:   351 YGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVG 410

Query:   288 YGEDH--GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
             YG+D   G  YW+VKNSWG+ WGE+GY R+ R +      I  I M A+ P+ +
Sbjct:   411 YGKDPVTGLDYWIVKNSWGSQWGESGYFRIRRGT--DECAIESIAM-AAIPIPK 461

 Score = 40 (19.1 bits), Expect = 2.6e-10, Sum P(2) = 2.6e-10
 Identities = 9/29 (31%), Positives = 14/29 (48%)

Query:   230 YPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
             Y + G +D     +  +HAV + GY   P
Sbjct:   387 YHHTGLSDPFNPFELTNHAVLLVGYGKDP 415


>ZFIN|ZDB-GENE-040426-2650
            symbol:ctsba "cathepsin B, a" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0031101 "fin regeneration"
            evidence=IEP] [GO:0008233 "peptidase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 ZFIN:ZDB-GENE-040426-2650 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790
            GO:GO:0004197 GO:GO:0031101 MEROPS:C01.060 HOVERGEN:HBG003480
            PANTHER:PTHR12411:SF16 HSSP:P07688 EMBL:BC044517 IPI:IPI00485996
            UniGene:Dr.3374 ProteinModelPortal:Q803E4 SMR:Q803E4 STRING:Q803E4
            PRIDE:Q803E4 InParanoid:Q803E4 ArrayExpress:Q803E4 Bgee:Q803E4
            Uniprot:Q803E4
        Length = 330

 Score = 168 (64.2 bits), Expect = 9.4e-22, Sum P(2) = 9.4e-22
 Identities = 49/145 (33%), Positives = 75/145 (51%)

Query:    91 QYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWP-SVQY---LGLPASV 146
             + +++IN  N ++    N F D+ +  ++    G       P+ P  VQY   L LP + 
Sbjct:    28 EMVNFINKANTTWTAGHN-FRDV-DYSYVKRLCGTF--LKGPKLPVMVQYTEGLKLPKNF 83

Query:   147 DWRKEGAVTP----VKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS--LSEQELVDCDVN 200
             D R++    P    ++DQG CGSCWAF A  A+     +++   VS  +S Q+L+ C  +
Sbjct:    84 DAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIQSNAKVSVEISSQDLLTC-CD 142

Query:   201 SENQGCNGGYMEKAFEFITKIGGVT 225
             S   GCNGGY   A++F T  G VT
Sbjct:   143 SCGMGCNGGYPSAAWDFWTTDGLVT 167

 Score = 148 (57.2 bits), Expect = 9.4e-22, Sum P(2) = 9.4e-22
 Identities = 25/63 (39%), Positives = 35/63 (55%)

Query:   263 FQLYSHGVFDEYCGHQLN-HGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
             F LY  GV+    G  L  H + ++G+GE++G  YWL  NSW T WG+ GY ++ R    
Sbjct:   258 FLLYKSGVYQHMSGSALGGHAIKILGWGEENGVPYWLAANSWNTDWGDNGYFKILRGEDH 317

Query:   322 SNI 324
               I
Sbjct:   318 CGI 320


>UNIPROTKB|P53634
            symbol:CTSC "Dipeptidyl peptidase 1" species:9606 "Homo
            sapiens" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0005794
            "Golgi apparatus" evidence=IEA] [GO:0007568 "aging" evidence=IEA]
            [GO:0010033 "response to organic substance" evidence=IEA]
            [GO:0031404 "chloride ion binding" evidence=IEA] [GO:0042802
            "identical protein binding" evidence=IEA] [GO:0043621 "protein
            self-association" evidence=IEA] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0006508 "proteolysis" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0006955
            "immune response" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005783 GO:GO:0005794 Reactome:REACT_6900
            GO:GO:0006955 GO:GO:0007568 GO:GO:0010033 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004252 GO:GO:0005764 GO:GO:0031404 CTD:1075
            HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ InterPro:IPR014882
            Pfam:PF08773 MEROPS:C01.070 EMBL:X87212 EMBL:U79415 EMBL:AF234263
            EMBL:AF234264 EMBL:AF254757 EMBL:AF525032 EMBL:AF525033
            EMBL:AK292117 EMBL:AK311923 EMBL:AK223038 EMBL:BX537913
            EMBL:AC011088 EMBL:CH471185 EMBL:BC054028 EMBL:BC100891
            EMBL:BC100892 EMBL:BC100893 EMBL:BC100894 EMBL:BC109386
            EMBL:BC110071 EMBL:BC113850 EMBL:BC113897 IPI:IPI00022810
            IPI:IPI00171323 IPI:IPI00872258 PIR:S23941 PIR:S66504
            RefSeq:NP_001107645.1 RefSeq:NP_001805.3 RefSeq:NP_680475.1
            UniGene:Hs.128065 PDB:1K3B PDB:2DJF PDB:2DJG PDB:3PDF PDBsum:1K3B
            PDBsum:2DJF PDBsum:2DJG PDBsum:3PDF ProteinModelPortal:P53634
            SMR:P53634 IntAct:P53634 MINT:MINT-4655964 STRING:P53634
            PhosphoSite:P53634 DMDM:1705632 PaxDb:P53634 PRIDE:P53634
            DNASU:1075 Ensembl:ENST00000227266 Ensembl:ENST00000524463
            Ensembl:ENST00000529974 GeneID:1075 KEGG:hsa:1075 UCSC:uc001pck.4
            UCSC:uc001pcm.4 GeneCards:GC11M088026 HGNC:HGNC:2528 HPA:CAB025364
            MIM:170650 MIM:245000 MIM:245010 MIM:602365 neXtProt:NX_P53634
            Orphanet:2342 Orphanet:678 PharmGKB:PA27028 HOGENOM:HOG000127503
            InParanoid:P53634 OMA:YDDFLHY PhylomeDB:P53634
            BioCyc:MetaCyc:HS03265-MONOMER SABIO-RK:P53634 BindingDB:P53634
            ChEMBL:CHEMBL2252 EvolutionaryTrace:P53634 GenomeRNAi:1075
            NextBio:4488 PMAP-CutDB:P53634 ArrayExpress:P53634 Bgee:P53634
            Genevestigator:P53634 GermOnline:ENSG00000109861 GO:GO:0001913
            Uniprot:P53634
        Length = 463

 Score = 166 (63.5 bits), Expect = 9.5e-22, Sum P(2) = 9.5e-22
 Identities = 37/94 (39%), Positives = 48/94 (51%)

Query:   232 YRGKNDRCQTDKTKHHAVTITGYEAIP--ARYAFQLYSH-GVFDEYCGHQL-NHGVTVVG 287
             Y G N+     +  HH      +E       Y   +Y H G+ D +   +L NH V +VG
Sbjct:   352 YGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVG 411

Query:   288 YGEDH--GEKYWLVKNSWGTSWGEAGYIRMARNS 319
             YG D   G  YW+VKNSWGT WGE GY R+ R +
Sbjct:   412 YGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGT 445

 Score = 157 (60.3 bits), Expect = 9.5e-22, Sum P(2) = 9.5e-22
 Identities = 50/178 (28%), Positives = 86/178 (48%)

Query:    75 SEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDN-KFADLSNEEFISTYLGYNK----PY 129
             S++++  R   Y  N  ++  IN+   S+  T   ++  L+  + I    G+++    P 
Sbjct:   159 SQEKYSNRLYKYDHN--FVKAINAIQKSWTATTYMEYETLTLGDMIRRSGGHSRKIPRPK 216

Query:   130 NEPRWPSVQY--LGLPASVDWRK-EGA--VTPVKDQGQCGSCWAFSAVAAVEGINKLKTG 184
               P    +Q   L LP S DWR   G   V+PV++Q  CGSC++F+++  +E   ++ T 
Sbjct:   217 PAPLTAEIQQKILHLPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTN 276

Query:   185 KLVS--LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQ 240
                +  LS QE+V C   +  QGC GG+         +  G+  E  +PY G +  C+
Sbjct:   277 NSQTPILSPQEVVSCSQYA--QGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCK 332

 Score = 38 (18.4 bits), Expect = 2.5e-08, Sum P(2) = 2.5e-08
 Identities = 8/25 (32%), Positives = 12/25 (48%)

Query:   230 YPYRGKNDRCQTDKTKHHAVTITGY 254
             Y + G  D     +  +HAV + GY
Sbjct:   388 YHHTGLRDPFNPFELTNHAVLLVGY 412


>WB|WBGene00013076
            symbol:Y51A2D.8 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            EMBL:AL021497 HSSP:P53634 HOGENOM:HOG000019851 PIR:T27079
            RefSeq:NP_507627.1 ProteinModelPortal:Q9XXQ7 SMR:Q9XXQ7
            MEROPS:C01.A49 EnsemblMetazoa:Y51A2D.8 GeneID:180208
            KEGG:cel:CELE_Y51A2D.8 UCSC:Y51A2D.8 CTD:180208 WormBase:Y51A2D.8
            eggNOG:NOG307864 InParanoid:Q9XXQ7 OMA:VAVYFKV NextBio:908434
            Uniprot:Q9XXQ7
        Length = 386

 Score = 197 (74.4 bits), Expect = 1.1e-21, Sum P(2) = 1.1e-21
 Identities = 65/219 (29%), Positives = 107/219 (48%)

Query:    49 YPQKY-DPQSMEERFENWLKQYSR--EYGSEDE---WQRRFGI--YS--SNVQY----ID 94
             Y +KY D    ++RF N++K Y+   +  ++ +   +  +FGI  +S  S  ++     +
Sbjct:    50 YNRKYKDESENQQRFNNFVKSYNNVDKLNAKSKAAGYDTQFGINKFSDLSTAEFHGRLSN 109

Query:    95 YINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGLPASVDWRKE--- 151
              + S N    + +  F D    +F +  +  NK  ++ R  S +Y   P   D R E   
Sbjct:   110 VVPSNNTGLPMLN--F-DKKKPDFRAADM--NKTRHKRR--STRY---PDYFDLRNEKIN 159

Query:   152 GA--VTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGG 209
             G   V P+KDQGQC  CW F+  A VE +    +GK  SLS+QE+ DC       GC GG
Sbjct:   160 GRYIVGPIKDQGQCACCWGFAVTALVETVYAAHSGKFKSLSDQEVCDCGTEG-TPGCKGG 218

Query:   210 YMEKAFEFITKIGGVTTEDDYPY----RGKNDRCQTDKT 244
              +    +++ K G ++ ++DYPY      +  RC+  +T
Sbjct:   219 SLTLGVQYVKKYG-LSGDEDYPYDQNRANQGRRCRLRET 256

 Score = 116 (45.9 bits), Expect = 1.1e-21, Sum P(2) = 1.1e-21
 Identities = 25/62 (40%), Positives = 34/62 (54%)

Query:   263 FQLYSHGVF-DEYCGHQLN-HGVTVVGYG--ED-HGEK--YWLVKNSWGTSWGEAGYIRM 315
             F+ Y  GV  ++ C      H   +VGY   ED  G    YW++KNSWG  W E+GY+R+
Sbjct:   300 FKEYKEGVIIEDDCRRATQWHAGAIVGYDTVEDSRGRSHDYWIIKNSWGGDWAESGYVRV 359

Query:   316 AR 317
              R
Sbjct:   360 VR 361

 Score = 102 (41.0 bits), Expect = 1.8e-10, Sum P(2) = 1.8e-10
 Identities = 22/68 (32%), Positives = 41/68 (60%)

Query:    55 PQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSF----KLTDNKF 110
             P+ + + FE++ K+Y+R+Y  E E Q+RF  +  +   +D +N+++ +     +   NKF
Sbjct:    36 PEKLYKAFEDFKKKYNRKYKDESENQQRFNNFVKSYNNVDKLNAKSKAAGYDTQFGINKF 95

Query:   111 ADLSNEEF 118
             +DLS  EF
Sbjct:    96 SDLSTAEF 103

 Score = 38 (18.4 bits), Expect = 1.4e-13, Sum P(2) = 1.4e-13
 Identities = 8/21 (38%), Positives = 12/21 (57%)

Query:   237 DRCQTDKTKHHAVTITGYEAI 257
             D C+   T+ HA  I GY+ +
Sbjct:   311 DDCRR-ATQWHAGAIVGYDTV 330


>WB|WBGene00008861
            symbol:F15D4.4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 SMART:SM00848 PROSITE:PS00640
            GeneTree:ENSGT00560000076599 EMBL:Z80344 HSSP:P53634
            eggNOG:NOG310593 PIR:T20981 ProteinModelPortal:Q93512 SMR:Q93512
            MEROPS:C01.A45 EnsemblMetazoa:F15D4.4 KEGG:cel:CELE_F15D4.4
            UCSC:F15D4.4 CTD:184530 WormBase:F15D4.4 InParanoid:Q93512
            OMA:ITMEQNI NextBio:925068 Uniprot:Q93512
        Length = 608

 Score = 172 (65.6 bits), Expect = 6.7e-21, Sum P(2) = 6.7e-21
 Identities = 34/76 (44%), Positives = 46/76 (60%)

Query:   242 DKTKHHAVTITGYEAIPARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKN 301
             DK +   + + G  A P  Y    YS GV+D  CG  +NH V +VG+ +D    YW+++N
Sbjct:   361 DKVRKGPIAV-GMAAGPDIYK---YSEGVYDGDCGTIINHAVVIVGFTDD----YWIIRN 412

Query:   302 SWGTSWGEAGYIRMAR 317
             SWG SWGEAGY R+ R
Sbjct:   413 SWGASWGEAGYFRVKR 428

 Score = 146 (56.5 bits), Expect = 6.7e-21, Sum P(2) = 6.7e-21
 Identities = 50/179 (27%), Positives = 82/179 (45%)

Query:    78 EWQRRFGIYSSNVQYIDYINSQ-NL---SFKLTDNKFADLSNEEFISTYLGYNK--PYNE 131
             E  +RF +YS   + +D  N    L   S+K++ N+F+   + E     L  +   P   
Sbjct:   150 EGLKRFNVYSKVKKEVDEHNIMYELGMSSYKMSTNQFSVALDGEVAPLTLNLDALTPTAT 209

Query:   132 --PRW-PSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
               P    S +      +VDWR    + P+ DQ  CG CWAFS ++ +E    ++     S
Sbjct:   210 VIPATISSRKKRDTEPTVDWRP--FLKPILDQSTCGGCWAFSMISMIESFFAIQGYNTSS 267

Query:   189 LSEQELVDCD--VNSE----NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQT 241
             LS Q+L+ CD  V+S     N GC GGY + A  ++ ++         P+  ++  C +
Sbjct:   268 LSVQQLLTCDTKVDSTYGLANVGCKGGYFQIAGSYL-EVSAARDASLIPFDLEDTSCDS 325


>UNIPROTKB|A1E295
            symbol:CTSB "Cathepsin B" species:9823 "Sus scrofa"
            [GO:0005615 "extracellular space" evidence=IDA] [GO:0042470
            "melanosome" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730
            "nucleolus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0042470
            GO:GO:0005615 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 MEROPS:C01.060 CTD:1508
            GeneTree:ENSGT00560000076599 HOVERGEN:HBG003480 KO:K01363
            OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16 EMBL:EF095956
            RefSeq:NP_001090927.1 UniGene:Ssc.53773 ProteinModelPortal:A1E295
            SMR:A1E295 PRIDE:A1E295 Ensembl:ENSSSCT00000026923 GeneID:100037961
            KEGG:ssc:100037961 Uniprot:A1E295
        Length = 335

 Score = 173 (66.0 bits), Expect = 1.1e-20, Sum P(2) = 1.1e-20
 Identities = 48/143 (33%), Positives = 76/143 (53%)

Query:    91 QYIDYINSQNLSFKLTDNKF-ADLSN-EEFISTYLGYNKPYNEPRWPSVQYLGLPASVDW 148
             + +++IN QN ++    N +  DLS  ++   T+LG   P    R      + LP S D 
Sbjct:    29 ELVNFINKQNTTWTAGHNFYNVDLSYVKKLCGTFLG--GPKLPQRAAFAADMILPKSFDA 86

Query:   149 RKEGAVTP----VKDQGQCGSCWAFSAVAAV-EGINKLKTGKL-VSLSEQELVDCDVNSE 202
             R++    P    ++DQG CGSCWAF AV A+ + I     G++ V +S ++++ C  +  
Sbjct:    87 REQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDMLTCCGDEC 146

Query:   203 NQGCNGGYMEKAFEFITKIGGVT 225
               GCNGG+   A+ F TK G V+
Sbjct:   147 GDGCNGGFPSGAWNFWTKKGLVS 169

 Score = 132 (51.5 bits), Expect = 1.1e-20, Sum P(2) = 1.1e-20
 Identities = 22/63 (34%), Positives = 34/63 (53%)

Query:   263 FQLYSHGVFDEYCGHQLN-HGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
             F  Y  GV+    G  +  H + ++G+G ++G  YWLV NSW T WG+ G+ ++ R    
Sbjct:   259 FLQYKSGVYQHVTGDLMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDH 318

Query:   322 SNI 324
               I
Sbjct:   319 CGI 321


>UNIPROTKB|P07688
            symbol:CTSB "Cathepsin B" species:9913 "Bos taurus"
            [GO:0042470 "melanosome" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730
            "nucleolus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0042470
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 EMBL:L06075 EMBL:M64620
            EMBL:U16336 EMBL:U16337 EMBL:U16338 EMBL:U16339 EMBL:U16341
            EMBL:U16342 EMBL:U16343 EMBL:BC102997 IPI:IPI00692061 PIR:S38328
            RefSeq:NP_776456.1 UniGene:Bt.393 PDB:1ITO PDB:1QDQ PDB:1SP4
            PDB:2DC6 PDB:2DC7 PDB:2DC8 PDB:2DC9 PDB:2DCA PDB:2DCB PDB:2DCC
            PDB:2DCD PDBsum:1ITO PDBsum:1QDQ PDBsum:1SP4 PDBsum:2DC6
            PDBsum:2DC7 PDBsum:2DC8 PDBsum:2DC9 PDBsum:2DCA PDBsum:2DCB
            PDBsum:2DCC PDBsum:2DCD ProteinModelPortal:P07688 SMR:P07688
            STRING:P07688 MEROPS:C01.060 PRIDE:P07688
            Ensembl:ENSBTAT00000036795 GeneID:281105 KEGG:bta:281105 CTD:1508
            eggNOG:NOG315657 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 InParanoid:P07688 KO:K01363 OMA:GYPSGAW
            OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 BindingDB:P07688
            ChEMBL:CHEMBL2323 EvolutionaryTrace:P07688 NextBio:20805177
            ArrayExpress:P07688 GO:GO:0097067 PANTHER:PTHR12411:SF16
            Uniprot:P07688
        Length = 335

 Score = 167 (63.8 bits), Expect = 1.5e-20, Sum P(2) = 1.5e-20
 Identities = 48/144 (33%), Positives = 75/144 (52%)

Query:    91 QYIDYINSQNLSFKLTDNKF-ADLSN-EEFISTYLGYNK-PYNEPRWPSVQYLGLPASVD 147
             + ++++N QN ++K   N +  DLS  ++     LG  K P  +     V    LP S D
Sbjct:    29 ELVNFVNKQNTTWKAGHNFYNVDLSYVKKLCGAILGGPKLPQRDAFAADVV---LPESFD 85

Query:   148 WRKEGAVTP----VKDQGQCGSCWAFSAVAAV-EGINKLKTGKL-VSLSEQELVDCDVNS 201
              R++    P    ++DQG CGSCWAF AV A+ + I     G++ V +S ++++ C    
Sbjct:    86 AREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCGGE 145

Query:   202 ENQGCNGGYMEKAFEFITKIGGVT 225
                GCNGG+   A+ F TK G V+
Sbjct:   146 CGDGCNGGFPSGAWNFWTKKGLVS 169

 Score = 138 (53.6 bits), Expect = 1.5e-20, Sum P(2) = 1.5e-20
 Identities = 23/63 (36%), Positives = 35/63 (55%)

Query:   263 FQLYSHGVFDEYCGHQLN-HGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
             F LY  GV+    G  +  H + ++G+G ++G  YWLV NSW T WG+ G+ ++ R    
Sbjct:   259 FLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDH 318

Query:   322 SNI 324
               I
Sbjct:   319 CGI 321


>UNIPROTKB|O97578
            symbol:CTSC "Dipeptidyl peptidase 1" species:9615 "Canis
            lupus familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 EMBL:AF060171 RefSeq:NP_001182763.1
            UniGene:Cfa.28653 ProteinModelPortal:O97578 SMR:O97578
            MEROPS:C01.070 PRIDE:O97578 GeneID:403458 KEGG:cfa:403458
            InParanoid:O97578 NextBio:20816976 Uniprot:O97578
        Length = 435

 Score = 161 (61.7 bits), Expect = 1.6e-20, Sum P(2) = 1.6e-20
 Identities = 47/166 (28%), Positives = 80/166 (48%)

Query:    85 IYSSNVQYIDYINSQNLSFKLTDN-KFADLSNEEFISTYLGYN--KPYNEPRWPSV--QY 139
             +Y  N +++  IN+   S+  T   ++  L+  + ++   G    +P   P    +  + 
Sbjct:   142 LYKYNYEFVKAINTIQKSWTATRYIEYETLTLRDMMTRVGGRKIPRPKPTPLTAEIHEEI 201

Query:   140 LGLPASVDWRK-EGA--VTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS--LSEQEL 194
               LP S DWR   G   V+PV++Q  CGSC+AF++ A +E   ++ T    +  LS QE+
Sbjct:   202 SRLPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEI 261

Query:   195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQ 240
             V C   +  QGC GG+         +  G+  E  +PY G +  C+
Sbjct:   262 VSCSQYA--QGCEGGFPYLIAGKYAQDFGLVEEACFPYAGSDSPCK 305

 Score = 150 (57.9 bits), Expect = 1.6e-20, Sum P(2) = 1.6e-20
 Identities = 34/94 (36%), Positives = 48/94 (51%)

Query:   232 YRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQ--LYSH-GVFDEYCGHQL-NHGVTVVG 287
             Y   N+     +   H      +E     + +Q  +Y H G+ D +   +L NH V +VG
Sbjct:   324 YGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTNHAVLLVG 383

Query:   288 YGEDH--GEKYWLVKNSWGTSWGEAGYIRMARNS 319
             YG D   G  YW+VKNSWG+ WGE GY R+ R +
Sbjct:   384 YGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGT 417

 Score = 37 (18.1 bits), Expect = 8.4e-09, Sum P(2) = 8.4e-09
 Identities = 8/25 (32%), Positives = 12/25 (48%)

Query:   230 YPYRGKNDRCQTDKTKHHAVTITGY 254
             Y + G  D     +  +HAV + GY
Sbjct:   360 YYHTGLRDPFNPFELTNHAVLLVGY 384


>MGI|MGI:88561
            symbol:Ctsb "cathepsin B" species:10090 "Mus musculus"
            [GO:0004175 "endopeptidase activity" evidence=ISO] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISO] [GO:0005576
            "extracellular region" evidence=ISO] [GO:0005615 "extracellular
            space" evidence=ISO] [GO:0005737 "cytoplasm" evidence=ISO]
            [GO:0005739 "mitochondrion" evidence=ISO;IDA] [GO:0005764
            "lysosome" evidence=ISO;IDA] [GO:0005901 "caveola" evidence=ISO]
            [GO:0006508 "proteolysis" evidence=ISO] [GO:0008233 "peptidase
            activity" evidence=ISO] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISO] [GO:0009897 "external side of plasma
            membrane" evidence=ISO] [GO:0009986 "cell surface" evidence=ISO]
            [GO:0016324 "apical plasma membrane" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0030984 "kininogen binding"
            evidence=ISO] [GO:0032403 "protein complex binding" evidence=ISO]
            [GO:0042277 "peptide binding" evidence=ISO] [GO:0042383
            "sarcolemma" evidence=ISO] [GO:0043621 "protein self-association"
            evidence=ISO] [GO:0048471 "perinuclear region of cytoplasm"
            evidence=ISO] [GO:0050790 "regulation of catalytic activity"
            evidence=IEA] [GO:0060548 "negative regulation of cell death"
            evidence=ISO] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 MGI:MGI:88561
            GO:GO:0005739 GO:GO:0042470 GO:GO:0048471 GO:GO:0005615
            GO:GO:0009612 GO:GO:0009611 GO:GO:0009897 GO:GO:0045471
            GO:GO:0016324 GO:GO:0009749 GO:GO:0006914 GO:GO:0043434
            eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0042383 GO:GO:0014070
            GO:GO:0042277 GO:GO:0060548 GO:GO:0005901 GO:GO:0014075
            GO:GO:0004197 GO:GO:0070670 GO:GO:0007519 MEROPS:C01.060 CTD:1508
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 KO:K01363 OMA:GYPSGAW OrthoDB:EOG4K6G4C
            BRENDA:3.4.22.1 GO:GO:0097067 PANTHER:PTHR12411:SF16 ChiTaRS:CTSB
            EMBL:M65270 EMBL:M65263 EMBL:M65264 EMBL:M65265 EMBL:M65266
            EMBL:M65267 EMBL:M65268 EMBL:M65269 EMBL:M14222 EMBL:X54966
            EMBL:S69034 EMBL:AK083393 EMBL:AK147192 EMBL:AK149884 EMBL:AK151790
            EMBL:AK167361 EMBL:BC006656 IPI:IPI00113517 PIR:A38458
            RefSeq:NP_031824.1 UniGene:Mm.236553 UniGene:Mm.489070
            ProteinModelPortal:P10605 SMR:P10605 IntAct:P10605 STRING:P10605
            PhosphoSite:P10605 SWISS-2DPAGE:P10605 PaxDb:P10605 PRIDE:P10605
            Ensembl:ENSMUST00000006235 GeneID:13030 KEGG:mmu:13030
            UCSC:uc007uhh.1 InParanoid:P10605 BioCyc:MetaCyc:MONOMER-14810
            BindingDB:P10605 ChEMBL:CHEMBL5187 NextBio:282900 Bgee:P10605
            CleanEx:MM_CTSB Genevestigator:P10605 GermOnline:ENSMUSG00000021939
            Uniprot:P10605
        Length = 339

 Score = 178 (67.7 bits), Expect = 2.4e-20, Sum P(2) = 2.4e-20
 Identities = 50/141 (35%), Positives = 76/141 (53%)

Query:    93 IDYINSQNLSFKLTDNKF-ADLSN-EEFISTYLGYNKPYNEPRWPSVQYLGLPASVDWRK 150
             I+YIN QN +++   N +  D+S  ++   T LG   P    R    + + LP + D R+
Sbjct:    31 INYINKQNTTWQAGRNFYNVDISYLKKLCGTVLG--GPKLPGRVAFGEDIDLPETFDARE 88

Query:   151 EGAVTP----VKDQGQCGSCWAFSAVAAVEGINKLKT-GKL-VSLSEQELVDCDVNSENQ 204
             + +  P    ++DQG CGSCWAF AV A+     + T G++ V +S ++L+ C       
Sbjct:    89 QWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGD 148

Query:   205 GCNGGYMEKAFEFITKIGGVT 225
             GCNGGY   A+ F TK G V+
Sbjct:   149 GCNGGYPSGAWSFWTKKGLVS 169

 Score = 123 (48.4 bits), Expect = 2.4e-20, Sum P(2) = 2.4e-20
 Identities = 19/56 (33%), Positives = 31/56 (55%)

Query:   263 FQLYSHGVFDEYCGHQLN-HGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
             F  Y  GV+    G  +  H + ++G+G ++G  YWL  NSW   WG+ G+ ++ R
Sbjct:   259 FLTYKSGVYKHEAGDMMGGHAIRILGWGVENGVPYWLAANSWNLDWGDNGFFKILR 314

 Score = 38 (18.4 bits), Expect = 6.7e-05, Sum P(2) = 6.7e-05
 Identities = 6/8 (75%), Positives = 7/8 (87%)

Query:    39 GIPAGAWS 46
             G P+GAWS
Sbjct:   153 GYPSGAWS 160


>UNIPROTKB|F1N9D7
            symbol:CTSB "Cathepsin B" species:9031 "Gallus gallus"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005730 "nucleolus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005739 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0005764
            GO:GO:0004197 GeneTree:ENSGT00560000076599 OMA:GYPSGAW
            GO:GO:0097067 PANTHER:PTHR12411:SF16 IPI:IPI00573387
            EMBL:AADN02018292 Ensembl:ENSGALT00000026896
            Ensembl:ENSGALT00000036723 Uniprot:F1N9D7
        Length = 340

 Score = 162 (62.1 bits), Expect = 4.3e-20, Sum P(2) = 4.3e-20
 Identities = 46/141 (32%), Positives = 72/141 (51%)

Query:    93 IDYINSQNLSFKLTDNKF-ADLSN-EEFISTYLGYNKPYNEPRWPSVQYLGLPASVDWRK 150
             +++IN  N ++K   N    D+S  ++   T+LG   P    R      + LP + D RK
Sbjct:    31 VNHINKLNTTWKAGHNFHNTDMSYVKKLCGTFLG--GPKLPERVDFAADMDLPDTFDSRK 88

Query:   151 EG----AVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL--SEQELVDCDVNSENQ 204
             +      ++ ++DQG CGSCWAF AV A+     + T   VS+  S ++L+ C       
Sbjct:    89 QWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLLSCCGFECGM 148

Query:   205 GCNGGYMEKAFEFITKIGGVT 225
             GCNGGY   A+ + T+ G V+
Sbjct:   149 GCNGGYPSGAWRYWTERGLVS 169

 Score = 140 (54.3 bits), Expect = 4.3e-20, Sum P(2) = 4.3e-20
 Identities = 22/63 (34%), Positives = 35/63 (55%)

Query:   263 FQLYSHGVFDEYCGHQLN-HGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
             F +Y  GV+    G Q+  H + ++G+G ++G  YWL  NSW T WG+ G+ ++ R    
Sbjct:   260 FLMYKSGVYQHVSGEQVGGHAIRILGWGVENGTPYWLAANSWNTDWGDNGFFKILRGEDH 319

Query:   322 SNI 324
               I
Sbjct:   320 CGI 322

 Score = 38 (18.4 bits), Expect = 7.2e-07, Sum P(2) = 7.2e-07
 Identities = 9/39 (23%), Positives = 17/39 (43%)

Query:     7 IAIYTNLHLKIAIDMRMMLRNAVLSLFLLWVLGIPAGAW 45
             I ++TN  + + +    +L        +    G P+GAW
Sbjct:   121 ICVHTNAKVSVEVSAEDLLSCCGFECGMGCNGGYPSGAW 159


>WB|WBGene00008231
            symbol:tag-329 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            eggNOG:NOG288820 EMBL:Z70750 HSSP:P53634 HOGENOM:HOG000019851
            PIR:T20110 RefSeq:NP_505458.1 ProteinModelPortal:Q18740 SMR:Q18740
            MEROPS:C01.A36 EnsemblMetazoa:C50F4.3 GeneID:183677
            KEGG:cel:CELE_C50F4.3 UCSC:C50F4.3 CTD:183677 WormBase:C50F4.3
            InParanoid:Q18740 OMA:WIFRNSW NextBio:921986 Uniprot:Q18740
        Length = 374

 Score = 194 (73.4 bits), Expect = 6.4e-20, Sum P(2) = 6.4e-20
 Identities = 57/181 (31%), Positives = 80/181 (44%)

Query:   108 NKFADLSNEEFISTYLGYNKPYNEPRWPSV---------QYLGLPASVDWR--KEGA--- 153
             NKF+DLS +E    Y  +  P N    P           Q  GLP + D R  K G    
Sbjct:    97 NKFSDLSKKEIHGMYSKFGPPKNNTNVPKFNLKNLRVKRQMEGLPKTFDLRNKKVGGHYI 156

Query:   154 VTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEK 213
             + P+K Q  C  CW F+A A  E    +   K ++LSEQE+ DC       GCNGG    
Sbjct:   157 IGPIKTQDSCACCWGFAATAVAEAALTVHLKKAMNLSEQEVCDC-APKHGPGCNGGDPVD 215

Query:   214 AFEFITKIGGVTTEDDYPYRGKND----RCQTDKTKHHAVTIT-GYEAI-PARYAFQLYS 267
               E+I ++G +T   +YP+         RC+++K       +   Y AI P    +Q+  
Sbjct:   216 GLEYIKEMG-LTGGKEYPFNVNRSTQLGRCESEKYDRELNPLELDYYAIDPFNAEYQMTH 274

Query:   268 H 268
             H
Sbjct:   275 H 275

 Score = 102 (41.0 bits), Expect = 6.4e-20, Sum P(2) = 6.4e-20
 Identities = 21/51 (41%), Positives = 26/51 (50%)

Query:   272 DEYCGHQLNHGVTVVGYGEDHGEK-----YWLVKNSWGTSWGEAGYIRMAR 317
             DE  GH   H   +VGYG           YW+ +NSW T WG+ GY R+ R
Sbjct:   308 DEKGGHW--HSGAIVGYGTTKNSAGRTVDYWIFRNSWWTDWGDDGYARIVR 356

 Score = 90 (36.7 bits), Expect = 1.1e-07, Sum P(2) = 1.1e-07
 Identities = 35/149 (23%), Positives = 62/149 (41%)

Query:    21 MRMMLRNAVLSLFLLWVLGIPAGA-WSEGYPQ----KYDPQSMEERFENWLKQYSREYGS 75
             M  +L    + +F+  V     GA + + + +    + +P+ + + FE+++ +Y R Y  
Sbjct:     1 MASLLALFFIQIFIFTVTSFDVGANFEDSFFEINIDRNNPEKLYKEFEDFIVKYKRNYKD 60

Query:    76 EDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGYNKPYNE 131
             E E + RF  + +    +  +N          K   NKF+DLS +E    Y  +  P N 
Sbjct:    61 EIEKKFRFQQFVATHNRVGKMNKAAKKAGHDTKYGINKFSDLSKKEIHGMYSKFGPPKNN 120

Query:   132 PRWPSV---------QYLGLPASVDWRKE 151
                P           Q  GLP + D R +
Sbjct:   121 TNVPKFNLKNLRVKRQMEGLPKTFDLRNK 149


>UNIPROTKB|P43233
            symbol:CTSB "Cathepsin B" species:9031 "Gallus gallus"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 MEROPS:C01.060 CTD:1508
            HOGENOM:HOG000241341 HOVERGEN:HBG003480 KO:K01363 OrthoDB:EOG4K6G4C
            PANTHER:PTHR12411:SF16 EMBL:U18083 IPI:IPI00573387 PIR:S58770
            RefSeq:NP_990702.1 UniGene:Gga.3854 ProteinModelPortal:P43233
            SMR:P43233 STRING:P43233 PRIDE:P43233 GeneID:396329 KEGG:gga:396329
            InParanoid:P43233 NextBio:20816377 Uniprot:P43233
        Length = 340

 Score = 160 (61.4 bits), Expect = 1.6e-19, Sum P(2) = 1.6e-19
 Identities = 45/141 (31%), Positives = 72/141 (51%)

Query:    93 IDYINSQNLSFKLTDNKF-ADLSN-EEFISTYLGYNKPYNEPRWPSVQYLGLPASVDWRK 150
             +++IN  N + +   N    D+S  ++   T+LG   P    R    + + LP + D RK
Sbjct:    31 VNHINKLNTTGRAGHNFHNTDMSYVKKLCGTFLG--GPKAPERVDFAEDMDLPDTFDTRK 88

Query:   151 EG----AVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL--SEQELVDCDVNSENQ 204
             +      ++ ++DQG CGSCWAF AV A+     + T   VS+  S ++L+ C       
Sbjct:    89 QWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLLSCCGFECGM 148

Query:   205 GCNGGYMEKAFEFITKIGGVT 225
             GCNGGY   A+ + T+ G V+
Sbjct:   149 GCNGGYPSGAWRYWTERGLVS 169

 Score = 137 (53.3 bits), Expect = 1.6e-19, Sum P(2) = 1.6e-19
 Identities = 22/63 (34%), Positives = 34/63 (53%)

Query:   263 FQLYSHGVFDEYCGHQLN-HGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
             F +Y  GV+    G Q+  H + ++G+G ++G  YWL  NSW T WG  G+ ++ R    
Sbjct:   260 FLMYKSGVYQHVSGEQVGGHAIRILGWGVENGTPYWLAANSWNTDWGITGFFKILRGEDH 319

Query:   322 SNI 324
               I
Sbjct:   320 CGI 322

 Score = 38 (18.4 bits), Expect = 1.6e-06, Sum P(2) = 1.6e-06
 Identities = 9/39 (23%), Positives = 17/39 (43%)

Query:     7 IAIYTNLHLKIAIDMRMMLRNAVLSLFLLWVLGIPAGAW 45
             I ++TN  + + +    +L        +    G P+GAW
Sbjct:   121 ICVHTNAKVSVEVSAEDLLSCCGFECGMGCNGGYPSGAW 159


>UNIPROTKB|J9P219
            symbol:J9P219 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 InterPro:IPR014882
            Pfam:PF08773 OMA:YDDFLHY EMBL:AAEX03012741
            Ensembl:ENSCAFT00000050015 Uniprot:J9P219
        Length = 406

 Score = 151 (58.2 bits), Expect = 1.9e-19, Sum P(2) = 1.9e-19
 Identities = 48/168 (28%), Positives = 80/168 (47%)

Query:    85 IYSSNVQYIDYINSQNLSFKLTDN-KFADLSNEEFISTYLGYN---KPYNEPRWPSV--Q 138
             +Y  N +++  IN+   S+  T   ++  L+  + ++   G     KP   P    +  +
Sbjct:   111 LYKYNYEFVKAINTIQKSWTATRYIEYETLTLRDMMTRGGGRKIPRKPKPTPLTAEIHEE 170

Query:   139 YLGLPASVDWRK-EGA--VTPVKDQG-QCGSCWAFSAVAAVEGINKLKTGKLVS--LSEQ 192
                LP S DWR   G   V+PV++Q   CGSC+AF++ A +E   ++ T    +  LS Q
Sbjct:   171 ISRLPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNNTQTPILSPQ 230

Query:   193 ELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQ 240
             E+V C   +  QGC GG+         +  G+  E  +PY G +  C+
Sbjct:   231 EIVSCSQYA--QGCEGGFPYLIAGKYAQDFGLVEEACFPYAGSDSPCK 276

 Score = 150 (57.9 bits), Expect = 1.9e-19, Sum P(2) = 1.9e-19
 Identities = 34/94 (36%), Positives = 48/94 (51%)

Query:   232 YRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQ--LYSH-GVFDEYCGHQL-NHGVTVVG 287
             Y   N+     +   H      +E     + +Q  +Y H G+ D +   +L NH V +VG
Sbjct:   295 YGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTNHAVLLVG 354

Query:   288 YGEDH--GEKYWLVKNSWGTSWGEAGYIRMARNS 319
             YG D   G  YW+VKNSWG+ WGE GY R+ R +
Sbjct:   355 YGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGT 388

 Score = 37 (18.1 bits), Expect = 9.7e-08, Sum P(2) = 9.7e-08
 Identities = 8/25 (32%), Positives = 12/25 (48%)

Query:   230 YPYRGKNDRCQTDKTKHHAVTITGY 254
             Y + G  D     +  +HAV + GY
Sbjct:   331 YYHTGLRDPFNPFELTNHAVLLVGY 355


>UNIPROTKB|F1PSK8
            symbol:F1PSK8 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 InterPro:IPR014882
            Pfam:PF08773 EMBL:AAEX03012741 Ensembl:ENSCAFT00000007054
            Uniprot:F1PSK8
        Length = 405

 Score = 150 (57.9 bits), Expect = 2.4e-19, Sum P(2) = 2.4e-19
 Identities = 34/94 (36%), Positives = 48/94 (51%)

Query:   232 YRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQ--LYSH-GVFDEYCGHQL-NHGVTVVG 287
             Y   N+     +   H      +E     + +Q  +Y H G+ D +   +L NH V +VG
Sbjct:   294 YGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPFNPFELTNHAVLLVG 353

Query:   288 YGEDH--GEKYWLVKNSWGTSWGEAGYIRMARNS 319
             YG D   G  YW+VKNSWG+ WGE GY R+ R +
Sbjct:   354 YGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGT 387

 Score = 150 (57.9 bits), Expect = 2.4e-19, Sum P(2) = 2.4e-19
 Identities = 47/167 (28%), Positives = 80/167 (47%)

Query:    85 IYSSNVQYIDYINSQNLSFKLTDN-KFADLSNEEFISTYLGYN--KPYNEPRWPSV--QY 139
             +Y  N +++  IN+   S+  T   ++  L+  + ++   G    +P   P    +  + 
Sbjct:   111 LYKYNYEFVKAINTIQKSWTATRYIEYETLTLRDMMTRGGGRKIPRPKPTPLTAEIHEEI 170

Query:   140 LGLPASVDWRK-EGA--VTPVKDQG-QCGSCWAFSAVAAVEGINKLKTGKLVS--LSEQE 193
               LP S DWR   G   V+PV++Q   CGSC+AF++ A +E   ++ T    +  LS QE
Sbjct:   171 SRLPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNNTQTPILSPQE 230

Query:   194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQ 240
             +V C   +  QGC GG+         +  G+  E  +PY G +  C+
Sbjct:   231 IVSCSQYA--QGCEGGFPYLIAGKYAQDFGLVEEACFPYAGSDSPCK 275

 Score = 37 (18.1 bits), Expect = 1.2e-07, Sum P(2) = 1.2e-07
 Identities = 8/25 (32%), Positives = 12/25 (48%)

Query:   230 YPYRGKNDRCQTDKTKHHAVTITGY 254
             Y + G  D     +  +HAV + GY
Sbjct:   330 YYHTGLRDPFNPFELTNHAVLLVGY 354


>FB|FBgn0034709
            symbol:Swim "Secreted Wg-interacting molecule" species:7227
            "Drosophila melanogaster" [GO:0004197 "cysteine-type endopeptidase
            activity" evidence=ISS] [GO:0006955 "immune response" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044 "scavenger
            receptor activity" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0042600 "chorion" evidence=IDA]
            [GO:0035593 "positive regulation of Wnt receptor signaling pathway
            by establishment of Wnt protein localization to extracellular
            region" evidence=IMP] [GO:0030177 "positive regulation of Wnt
            receptor signaling pathway" evidence=IDA] [GO:0005615
            "extracellular space" evidence=IDA] [GO:0017147 "Wnt-protein
            binding" evidence=IDA] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR013128 Pfam:PF00112 PROSITE:PS50958 SMART:SM00201
            SMART:SM00645 EMBL:AE013599 GO:GO:0005615 GO:GO:0006955
            GO:GO:0030247 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GO:GO:0017147 GO:GO:0005044
            GeneTree:ENSGT00560000076599 GO:GO:0042600 eggNOG:NOG310046
            OMA:DNCNRCT HSSP:P80067 EMBL:AY113377 RefSeq:NP_611652.2
            RefSeq:NP_726176.1 UniGene:Dm.732 SMR:Q7JWQ7 IntAct:Q7JWQ7
            EnsemblMetazoa:FBtr0071784 EnsemblMetazoa:FBtr0071785 GeneID:37537
            KEGG:dme:Dmel_CG3074 UCSC:CG3074-RA FlyBase:FBgn0034709
            HOGENOM:HOG000264150 InParanoid:Q7JWQ7 OrthoDB:EOG48CZ9P
            GenomeRNAi:37537 NextBio:804155 GO:GO:0035593 Uniprot:Q7JWQ7
        Length = 431

 Score = 155 (59.6 bits), Expect = 3.3e-19, Sum P(2) = 3.3e-19
 Identities = 36/81 (44%), Positives = 47/81 (58%)

Query:   263 FQLYSHGVFDEYCGHQLN----HGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMAR 317
             F  YS GV+ E   ++      H V +VG+GE+H GEKYW+  NSWG+ WGE GY R+ R
Sbjct:   345 FFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEHNGEKYWIAANSWGSWWGEHGYFRILR 404

Query:   318 NSPSSNIGICGI--LMQASYP 336
              S       CGI   + AS+P
Sbjct:   405 GSNE-----CGIEEYVLASWP 420

 Score = 144 (55.7 bits), Expect = 3.3e-19, Sum P(2) = 3.3e-19
 Identities = 36/112 (32%), Positives = 59/112 (52%)

Query:   141 GLPASVDWRKEGA--VTPVKDQGQCGSCWAFSAVAAVEGINKLKT-GKL-VSLSEQELVD 196
             GLP+S +   + +  ++ V DQG CG+ W  S  +       +++ GK  V LS Q ++ 
Sbjct:   186 GLPSSFNALDKWSSYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNILS 245

Query:   197 CDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
             C      QGC GG+++ A+ ++ K  GV  E+ YPY    D C   K +H++
Sbjct:   246 C--TRRQQGCEGGHLDAAWRYLHK-KGVVDENCYPYTQHRDTC---KIRHNS 291


>MGI|MGI:2139628
            symbol:Ctso "cathepsin O" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:2139628 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 GeneTree:ENSGT00560000076599 MEROPS:C01.035 CTD:1519
            HOVERGEN:HBG105050 KO:K01374 OMA:SNVCGIA OrthoDB:EOG4V6ZH1
            EMBL:AK034490 EMBL:AK049470 EMBL:AK165930 EMBL:AK166103
            EMBL:BC044664 IPI:IPI00453524 RefSeq:NP_808330.1 UniGene:Mm.254642
            ProteinModelPortal:Q8BM88 SMR:Q8BM88 STRING:Q8BM88
            PhosphoSite:Q8BM88 PRIDE:Q8BM88 Ensembl:ENSMUST00000029649
            GeneID:229445 KEGG:mmu:229445 UCSC:uc008pon.1 InParanoid:Q8BM88
            NextBio:379433 Bgee:Q8BM88 CleanEx:MM_CTSO Genevestigator:Q8BM88
            GermOnline:ENSMUSG00000028015 Uniprot:Q8BM88
        Length = 312

 Score = 229 (85.7 bits), Expect = 4.0e-19, P = 4.0e-19
 Identities = 63/201 (31%), Positives = 98/201 (48%)

Query:    95 YINS---QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPS-----VQYLGLPASV 146
             Y+NS   +N +     N+F+ L  EEF + YLG    +  PR+P+     +  + LP   
Sbjct:    45 YLNSFPHENSTAFYGVNQFSYLFPEEFKALYLGSKYAW-APRYPAEGQRPIPNVSLPLRF 103

Query:   147 DWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGC 206
             DWR +  V PV++Q  CG CWAFS V+A+E    ++   L  LS Q+++DC  N  N GC
Sbjct:   104 DWRDKHVVNPVRNQEMCGGCWAFSVVSAIESARAIQGKSLDYLSVQQVIDCSFN--NSGC 161

Query:   207 NGGYMEKAFEFITKIG-GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQL 265
              GG    A  ++ +    +  +  YP++  N +C+        V++  + A    Y F+ 
Sbjct:   162 LGGSPLCALRWLNETQLKLVADSQYPFKAVNGQCRHFPQSQAGVSVKDFSA----YNFR- 216

Query:   266 YSHGVFDEYCGHQLNHGVTVV 286
                G  DE     L+ G  VV
Sbjct:   217 ---GQEDEMARALLSFGPLVV 234

 Score = 139 (54.0 bits), Expect = 7.7e-07, P = 7.7e-07
 Identities = 34/110 (30%), Positives = 52/110 (47%)

Query:   223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQLYSHGVFDEYCGH-QLNH 281
             GV+ +D   Y   N R Q D+     ++      I    ++Q Y  G+   +C   + NH
Sbjct:   204 GVSVKDFSAY---NFRGQEDEMARALLSFGPLVVIVDAMSWQDYLGGIIQHHCSSGEANH 260

Query:   282 GVTVVGYGEDHGEKYWLVKNSWGTSWGEAGY--IRMARNSPSSNIGICGI 329
              V + G+       YW+V+NSWG+SWG  GY  ++M  N       +CGI
Sbjct:   261 AVLITGFDRTGNTPYWMVRNSWGSSWGVEGYAHVKMGGN-------VCGI 303


>UNIPROTKB|J9NSE7
            symbol:CTSC "Dipeptidyl peptidase 1" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 InterPro:IPR014882 Pfam:PF08773
            EMBL:AAEX03017125 Ensembl:ENSCAFT00000014269 OMA:INGQICH
            Uniprot:J9NSE7
        Length = 458

 Score = 151 (58.2 bits), Expect = 1.6e-18, Sum P(2) = 1.6e-18
 Identities = 46/173 (26%), Positives = 80/173 (46%)

Query:    85 IYSSNVQYIDYINSQNLSFKLTDN-KFADLSNEEFISTYLGYN--KPYNEPRWPSV--QY 139
             +Y  N +++  IN+   S+  T   ++  L+  + +    G    +P   P    +  + 
Sbjct:   165 LYKYNYEFVKAINTIQKSWTATRYIEYETLTLRDMMRRAGGRKIPRPKPTPLTAEIHEEI 224

Query:   140 LGLPASVDWRK-EGA--VTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS--LSEQEL 194
               LP S DWR   G   V+PV++Q  CGSC+AF++   +E   ++ T    +  LS QE+
Sbjct:   225 SRLPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTVMLEARIRILTNNTQTPILSPQEI 284

Query:   195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHH 247
             V C   +  QGC GG+         +  G+  E  + Y G +  C+ +   H+
Sbjct:   285 VSCSQYA--QGCEGGFPYLIAGKYAQDFGLVDEACFSYAGSDSPCKPNDCFHY 335

 Score = 143 (55.4 bits), Expect = 1.6e-18, Sum P(2) = 1.6e-18
 Identities = 33/94 (35%), Positives = 47/94 (50%)

Query:   232 YRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQ--LYSH-GVFDEYCGHQL-NHGVTVVG 287
             Y   N+     +   H      +E     + +Q  +Y H G+ D     +L NH V +VG
Sbjct:   347 YGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPINPFELTNHAVLLVG 406

Query:   288 YGEDH--GEKYWLVKNSWGTSWGEAGYIRMARNS 319
             YG D   G  YW+VKNSWG+ WGE GY ++ R +
Sbjct:   407 YGTDSASGMDYWIVKNSWGSRWGEDGYFQICRGT 440


>WB|WBGene00000781
            symbol:cpr-1 species:6239 "Caenorhabditis elegans"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008340 "determination
            of adult lifespan" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008340 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            eggNOG:NOG315657 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            KO:K01363 PANTHER:PTHR12411:SF16 EMBL:M74797 EMBL:Z78012 PIR:T20148
            RefSeq:NP_506002.2 ProteinModelPortal:P25807 SMR:P25807
            DIP:DIP-25619N MINT:MINT-1058393 STRING:P25807 MEROPS:C01.A32
            PaxDb:P25807 EnsemblMetazoa:C52E4.1 GeneID:179637
            KEGG:cel:CELE_C52E4.1 UCSC:C52E4.1 CTD:179637 WormBase:C52E4.1
            InParanoid:P25807 OMA:CSLSCQS NextBio:906250 Uniprot:P25807
        Length = 329

 Score = 146 (56.5 bits), Expect = 8.5e-18, Sum P(2) = 8.5e-18
 Identities = 24/63 (38%), Positives = 35/63 (55%)

Query:   263 FQLYSHGVFDEYCGHQLN-HGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
             F  Y  GV+    G  L  H + ++G+G + G  YWLV NSWG +WGE+G+ ++ R    
Sbjct:   256 FYKYKSGVYKHTAGKYLGGHAIKIIGWGTESGSPYWLVANSWGVNWGESGFFKIYRGDDQ 315

Query:   322 SNI 324
               I
Sbjct:   316 CGI 318

 Score = 136 (52.9 bits), Expect = 8.5e-18, Sum P(2) = 8.5e-18
 Identities = 40/141 (28%), Positives = 63/141 (44%)

Query:    96 INSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGLPASVD----WRKE 151
             I  + + FKL D K+A   ++E  +T               V    +PA+ D    W + 
Sbjct:    51 ITEEEMKFKLMDGKYAAAHSDEIRATE------------QEVVLASVPATFDSRTQWSEC 98

Query:   152 GAVTPVKDQGQCGSCWAFSAVAAVEGINKLKT--GKLVSLSEQELVDCDVNSENQGCNGG 209
              ++  ++DQ  CGSCWAF A   +     ++T   +   +S  +L+ C  +S   GC GG
Sbjct:    99 KSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLLSCCGSSCGNGCEGG 158

Query:   210 YMEKAFEFITKIGGVTTEDDY 230
             Y  +A  +     GV T  DY
Sbjct:   159 YPIQALRWWDS-KGVVTGGDY 178


>WB|WBGene00010204
            symbol:F57F5.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0006898
            "receptor-mediated endocytosis" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0009792 GO:GO:0006898
            GO:GO:0040007 GO:GO:0002119 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0040011
            EMBL:Z75953 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            PANTHER:PTHR12411:SF16 RefSeq:NP_506011.2 ProteinModelPortal:Q20950
            SMR:Q20950 DIP:DIP-24447N IntAct:Q20950 MINT:MINT-211137
            STRING:Q20950 MEROPS:C01.A42 EnsemblMetazoa:F57F5.1 GeneID:179645
            KEGG:cel:CELE_F57F5.1 UCSC:F57F5.1 CTD:179645 WormBase:F57F5.1
            OMA:ADDINAC Uniprot:Q20950
        Length = 351

 Score = 142 (55.0 bits), Expect = 1.3e-17, Sum P(2) = 1.3e-17
 Identities = 29/69 (42%), Positives = 37/69 (53%)

Query:   263 FQLYSHGVFDEYCGHQLN-HGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
             F+ YS GV+    G  L  H V ++G+G D+G  YWL  NSW   WGE GY R+ R    
Sbjct:   278 FEHYSGGVYVHTAGASLGGHAVKMLGWGVDNGTPYWLCANSWNEDWGENGYFRIIRGVNE 337

Query:   322 SNI--GICG 328
               I  G+ G
Sbjct:   338 CGIEGGVVG 346

 Score = 140 (54.3 bits), Expect = 1.3e-17, Sum P(2) = 1.3e-17
 Identities = 50/185 (27%), Positives = 81/185 (43%)

Query:    91 QYIDYINSQNLSFKLT----DNKFADLSNEEFISTYL-GYNKPYN--EPRWPSVQYLGLP 143
             + +DY+N    SFK       + + D   ++ +   +    + Y   E   P V+   +P
Sbjct:    39 ELVDYVNKVQTSFKAELGSYFSSYPDTIKKQLMGAKMVEIPEEYRVFEMTHPEVEDAAVP 98

Query:   144 ASVD----WRKEGAVTPVKDQGQCGSCWAFSAVAAV-EGINKLKTGK-LVSLSEQELVDC 197
              S D    W    +++ ++DQ  CGSCWA SA   + + I      K ++S+S  ++  C
Sbjct:    99 DSFDSRTAWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNAKTILSISADDINAC 158

Query:   198 DVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQT---DKTKHHAVTITGY 254
                    GCNGGY  +A+    K G VT      Y+ K   C+       +HH V  T Y
Sbjct:   159 CGMVCGNGCNGGYPIEAWRHYVKKGYVTGGS---YQDKTG-CKPYPYPPCEHH-VNGTHY 213

Query:   255 EAIPA 259
             +  P+
Sbjct:   214 KPCPS 218


>WB|WBGene00000784
            symbol:cpr-4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39895 EMBL:L39926 EMBL:FO081381
            PIR:T37280 RefSeq:NP_504682.1 UniGene:Cel.5404
            ProteinModelPortal:P43508 SMR:P43508 DIP:DIP-25376N
            MINT:MINT-1069892 STRING:P43508 MEROPS:C01.A34 PaxDb:P43508
            EnsemblMetazoa:F44C4.3 GeneID:179053 KEGG:cel:CELE_F44C4.3
            UCSC:F44C4.3 CTD:179053 WormBase:F44C4.3 InParanoid:P43508
            OMA:CCGFLCG NextBio:903704 Uniprot:P43508
        Length = 335

 Score = 148 (57.2 bits), Expect = 3.6e-17, Sum P(2) = 3.6e-17
 Identities = 26/63 (41%), Positives = 36/63 (57%)

Query:   263 FQLYSHGVFDEYCGHQLN-HGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
             F  Y  GV+    G +L  H + ++G+G D+G  YWLV NSW  +WGE GY R+ R +  
Sbjct:   262 FYQYKTGVYVHTTGQELGGHAIRILGWGTDNGTPYWLVANSWNVNWGENGYFRIIRGTNE 321

Query:   322 SNI 324
               I
Sbjct:   322 CGI 324

 Score = 128 (50.1 bits), Expect = 3.6e-17, Sum P(2) = 3.6e-17
 Identities = 39/145 (26%), Positives = 68/145 (46%)

Query:    94 DYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-------LPASV 146
             +Y+NS+   +K    K  D++ E+     +     +  P  P V+ +        +PA+ 
Sbjct:    30 EYVNSKQSLWKAEIPK--DITIEQVKKRLM--RTEFVAPHTPDVEVVKHDINEDTIPATF 85

Query:   147 D----WRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS--LSEQELVDCDVN 200
             D    W    ++  ++DQ  CGSCWAF+A  A      + +   V+  LS ++++ C  N
Sbjct:    86 DARTQWPNCMSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCSN 145

Query:   201 SENQGCNGGYMEKAFEFITKIGGVT 225
                 GC GGY   A++++ K G  T
Sbjct:   146 C-GYGCEGGYPINAWKYLVKSGFCT 169


>WB|WBGene00009158
            symbol:F26E4.3 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005576
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005044
            GeneTree:ENSGT00560000076599 HSSP:P07711 EMBL:Z81070
            eggNOG:NOG310046 HOGENOM:HOG000241342 OMA:DNCNRCT PIR:T21421
            RefSeq:NP_492593.2 ProteinModelPortal:P90850 SMR:P90850
            PaxDb:P90850 EnsemblMetazoa:F26E4.3.1 EnsemblMetazoa:F26E4.3.2
            GeneID:172827 KEGG:cel:CELE_F26E4.3 UCSC:F26E4.3.1 CTD:172827
            WormBase:F26E4.3 InParanoid:P90850 NextBio:877161 Uniprot:P90850
        Length = 452

 Score = 155 (59.6 bits), Expect = 4.8e-17, Sum P(2) = 4.8e-17
 Identities = 37/95 (38%), Positives = 58/95 (61%)

Query:   142 LPASVDWR-KEGA-VTPVKDQGQCGSCWAFSAVA-AVEGINKLKTGKLVS-LSEQELVDC 197
             LP   D R K G  + PV DQG CGS W+ S  A + + +  +  G++ S LS Q+L+ C
Sbjct:   184 LPEHFDARDKWGPLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLLSC 243

Query:   198 DVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
             + + + +GC GGY+++A+ +I K+G V  +  YPY
Sbjct:   244 NQHRQ-KGCEGGYLDRAWWYIRKLG-VVGDHCYPY 276

 Score = 124 (48.7 bits), Expect = 4.8e-17, Sum P(2) = 4.8e-17
 Identities = 23/41 (56%), Positives = 28/41 (68%)

Query:   281 HGVTVVGYGEDH--GE--KYWLVKNSWGTSWGEAGYIRMAR 317
             H V V+G+G DH  G+  KYWL  NSWGT WGE GY ++ R
Sbjct:   374 HSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGYFKVLR 414


>UNIPROTKB|F1P0K2
            symbol:CTSO "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            OMA:SNVCGIA EMBL:AADN02016534 IPI:IPI00651180
            Ensembl:ENSGALT00000015270 Uniprot:F1P0K2
        Length = 320

 Score = 222 (83.2 bits), Expect = 4.9e-17, P = 4.9e-17
 Identities = 53/175 (30%), Positives = 88/175 (50%)

Query:    87 SSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRW---PSVQYLGLP 143
             +  ++ ++  ++ N S     N+F+ L  EEF + YL  + PY  PR+   P  +   LP
Sbjct:    50 AKRIRLLNSPSNDNGSAFYGKNQFSHLFPEEFKAIYLR-SIPYKLPRYIKVPKGEEKPLP 108

Query:   144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN 203
                DWR +  +  V++Q  CG CWAFS V  +E    +K   L  LS Q+++DC  +  N
Sbjct:   109 KKFDWRDKKVIAEVRNQQTCGGCWAFSVVGGIESAYAIKGHNLEELSVQQVIDCSYS--N 166

Query:   204 QGCNGGYMEKAFEFI--TKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEA 256
              GC+GG    A  ++  TK+  +  + +Y ++ +   C         V+ITG+ A
Sbjct:   167 YGCSGGSTITALSWLNQTKVK-LVRDSEYTFKAQTGLCHYFPHSDFGVSITGFAA 220

 Score = 124 (48.7 bits), Expect = 4.1e-05, P = 4.1e-05
 Identities = 43/165 (26%), Positives = 72/165 (43%)

Query:   166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
             C   S + A+  +N+ K  KLV  SE         ++   C+  Y   + +F   I G  
Sbjct:   169 CSGGSTITALSWLNQTKV-KLVRDSEYTF-----KAQTGLCH--YFPHS-DFGVSITGFA 219

Query:   226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQLYSHGVFDEYCGH-QLNHGVT 284
                 Y + G+ +           + +T  +A+    ++Q Y  G+   +C   + NH V 
Sbjct:   220 A---YDFSGQEEEMMRVLVDWGPLAVT-VDAV----SWQDYLGGIIQYHCSSGKANHAVL 271

Query:   285 VVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
             + G+       YW+V+NSWG +WG  GY+R+   S      +CGI
Sbjct:   272 ITGFDTTGIIPYWIVQNSWGRTWGIDGYVRVKIGS-----NVCGI 311


>DICTYBASE|DDB_G0283401
            symbol:ctsZ "cathepsin Z precursor" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            dictyBase:DDB_G0283401 GO:GO:0005615 GenomeReviews:CM000153_GR
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 EMBL:AAFI02000055 KO:K08568 OMA:QCGTCTE
            eggNOG:NOG275763 RefSeq:XP_639036.1 ProteinModelPortal:Q54R55
            IntAct:Q54R55 MEROPS:C01.A60 PRIDE:Q54R55
            EnsemblProtists:DDB0233836 GeneID:8624061 KEGG:ddi:DDB_G0283401
            InParanoid:Q54R55 Uniprot:Q54R55
        Length = 296

 Score = 143 (55.4 bits), Expect = 6.1e-17, Sum P(2) = 6.1e-17
 Identities = 26/72 (36%), Positives = 43/72 (59%)

Query:   256 AIPARYAFQLYSHGVFDEYCGHQL-NHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIR 314
             +I A    + Y+ G+F E+    L NH ++V+G+G      YW+V+NSWG+ +GE G+  
Sbjct:   211 SIDATSKLEAYTSGIFKEFKLDPLPNHIISVIGWGVQDSTPYWIVRNSWGSYYGEGGFFN 270

Query:   315 MARNSPSSNIGI 326
             + + S   N+GI
Sbjct:   271 IVQGSLFENLGI 282

 Score = 129 (50.5 bits), Expect = 6.1e-17, Sum P(2) = 6.1e-17
 Identities = 37/116 (31%), Positives = 56/116 (48%)

Query:   135 PSVQYLGLPASVDWRKEGAV---TPVKDQG---QCGSCWAFSAVAAVEGINKLKTGKL-- 186
             P    L +P S DWR    V   T  ++Q     CG CWAF++ +++    K++      
Sbjct:    51 PKDMNLEVPQSWDWRNVSGVNYLTMNRNQHIPQYCGGCWAFASTSSISDRIKIQRKAAFP 110

Query:   187 -VSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN--DRC 239
              V+++ Q L+DC+       C+GG    AF FI +  G+  E   PY+ KN  D C
Sbjct:   111 DVNVAPQHLIDCNGGGT---CDGGDPGDAFAFINE-NGIVDETCKPYQAKNLPDEC 162


>ZFIN|ZDB-GENE-070323-1
            symbol:ctsbb "cathepsin B, b" species:7955 "Danio
            rerio" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-070323-1 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197
            GeneTree:ENSGT00560000076599 PANTHER:PTHR12411:SF16 OMA:CCGFLCG
            EMBL:CU207296 EMBL:CABZ01037785 IPI:IPI00877452
            Ensembl:ENSDART00000097263 Bgee:F1QZT5 Uniprot:F1QZT5
        Length = 326

 Score = 147 (56.8 bits), Expect = 6.7e-17, Sum P(2) = 6.7e-17
 Identities = 25/63 (39%), Positives = 36/63 (57%)

Query:   263 FQLYSHGVFDEYCGHQLN-HGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
             F LY  GV+    G  L  H V ++G+GE++G  +WLV NSW + WG+ GY ++ R    
Sbjct:   253 FPLYKSGVYQHLTGSALGGHAVKILGWGEENGTPFWLVANSWNSDWGDNGYFKILRGHDE 312

Query:   322 SNI 324
               I
Sbjct:   313 CGI 315

 Score = 126 (49.4 bits), Expect = 6.7e-17, Sum P(2) = 6.7e-17
 Identities = 35/104 (33%), Positives = 57/104 (54%)

Query:   132 PRWP-SVQY---LGLPASVD----WRKEGAVTPVKDQGQCGSCWAFSAVAAV-EGINKLK 182
             PR P +V++   + LP S D    W     +  ++DQG CGSCWAF AV ++ + I    
Sbjct:    61 PRLPHTVKHSTNVKLPDSFDLRDQWPNCKTLNQIRDQGSCGSCWAFGAVESISDRICIHS 120

Query:   183 TGKLV-SLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
              GK    +S ++L+ C  +    GC+GG+  +A+++  + G VT
Sbjct:   121 KGKQSPEISAEDLLSC-CDQCGFGCSGGFPAEAWDYWRRSGLVT 163


>DICTYBASE|DDB_G0276111
            symbol:DDB_G0276111 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0276111 Pfam:PF00188
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            PROSITE:PS00139 EMBL:AAFI02000014 Gene3D:3.40.33.10
            InterPro:IPR001283 InterPro:IPR014044 PRINTS:PR00837 SMART:SM00198
            SUPFAM:SSF55797 ProtClustDB:CLSZ2429919 RefSeq:XP_643261.1
            ProteinModelPortal:Q75JH0 EnsemblProtists:DDB0169514 GeneID:8620304
            KEGG:ddi:DDB_G0276111 InParanoid:Q75JH0 OMA:GFVTSIK Uniprot:Q75JH0
        Length = 415

 Score = 224 (83.9 bits), Expect = 1.6e-16, P = 1.6e-16
 Identities = 62/195 (31%), Positives = 84/195 (43%)

Query:   146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKL---VSLSEQELVDCDVNSE 202
             VDW+  G VT +K+QGQCG C++F+  AA+E    +K       + LSEQ  V C     
Sbjct:   213 VDWKSLGFVTSIKNQGQCGGCYSFATCAALESAYLIKNNLPNTDIDLSEQNFVSC----V 268

Query:   203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA-RY 261
             N GC GG  +   + + K  G+  E  YPY+     C            TGY  I   + 
Sbjct:   269 NYGCGGGNGQSCLDKL-KSTGIMYETSYPYKAVTGSCPNVIQSPQPFKWTGYSNIQGNKE 327

Query:   262 AF-----------QLYSHGVFDEY------CGHQLNHGVTVVGYGEDHGEKYWLVKNSWG 304
             AF            LY    F  Y      C         +   G    +  +L+KNSWG
Sbjct:   328 AFLNALKSGPIYASLYVDSGFQLYKSGIYSCSQSSTPNHAITIVGYSSADNSYLIKNSWG 387

Query:   305 TSWGEAGYIRMARNS 319
             T +GE+GYIR+   S
Sbjct:   388 TIYGESGYIRLKEGS 402


>UNIPROTKB|H0YDT2
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            EMBL:AP001201 HGNC:HGNC:2546 Ensembl:ENST00000526034 Bgee:H0YDT2
            Uniprot:H0YDT2
        Length = 211

 Score = 171 (65.3 bits), Expect = 1.8e-16, Sum P(2) = 1.8e-16
 Identities = 54/174 (31%), Positives = 82/174 (47%)

Query:    32 LFLLWVLGIPAGAWSEGYPQKYDPQSME--ERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
             L  L V G+  G       Q   PQ +E  E F+ +  Q++R Y S +E   R  I++ N
Sbjct:     9 LLALLVAGLAQGIRGPLRAQDLGPQPLELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHN 68

Query:    90 VQYIDYINSQNLS---FKLTDNKFADLSNEEFISTYLGYNK-----PY--NEPRWPSVQY 139
             +     +  ++L    F +T   F+DL+ EEF   Y GY +     P    E R    + 
Sbjct:    69 LAQAQRLQEEDLGTAEFGVTP--FSDLTEEEFGQLY-GYRRAAGGVPSMGREIRSEEPEE 125

Query:   140 LGLPASVDWRK-EGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQ 192
               +P S DWRK   A++P+KDQ  C  CWA +A   +E + ++     V +S Q
Sbjct:   126 -SVPFSCDWRKVASAISPIKDQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVQ 178

 Score = 56 (24.8 bits), Expect = 1.8e-16, Sum P(2) = 1.8e-16
 Identities = 11/24 (45%), Positives = 15/24 (62%)

Query:   222 GGVTTEDDYPYRGK--NDRCQTDK 243
             GG+ +E DYP++GK    RC   K
Sbjct:   179 GGLASEKDYPFQGKVRAHRCHPKK 202


>WB|WBGene00021072
            symbol:W07B8.4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:FO081739 PIR:T31728 RefSeq:NP_503382.1
            HSSP:P53634 ProteinModelPortal:O16288 SMR:O16288 STRING:O16288
            MEROPS:C01.A39 PaxDb:O16288 EnsemblMetazoa:W07B8.4 GeneID:178611
            KEGG:cel:CELE_W07B8.4 UCSC:W07B8.4 CTD:178611 WormBase:W07B8.4
            InParanoid:O16288 OMA:ESQYGCK NextBio:901836 Uniprot:O16288
        Length = 335

 Score = 151 (58.2 bits), Expect = 2.2e-16, Sum P(2) = 2.2e-16
 Identities = 29/68 (42%), Positives = 38/68 (55%)

Query:   263 FQLYSHGVFDEYCGHQLN-HGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
             F LY  G++    G +L  H V ++G+G D+G  YWL  NSW T WGE GY R+ R    
Sbjct:   258 FYLYKTGIYTHVAGGELGGHAVKMLGWGVDNGTPYWLAANSWNTVWGEKGYFRILRG--- 314

Query:   322 SNIGICGI 329
               +  CGI
Sbjct:   315 --VDECGI 320

 Score = 117 (46.2 bits), Expect = 2.2e-16, Sum P(2) = 2.2e-16
 Identities = 30/92 (32%), Positives = 47/92 (51%)

Query:   142 LPASVD----WRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS--LSEQELV 195
             +P S D    W +  +V  ++DQ  CGSCWA +A  A+     + +   V+  LS ++++
Sbjct:    73 IPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVNTLLSAEDIL 132

Query:   196 DCDVNSEN--QGCNGGYMEKAFEFITKIGGVT 225
              C     N   GC GGY  +A+ +  K G VT
Sbjct:   133 TCCTGKFNCGDGCEGGYPIQAWRYWVKNGLVT 164


>WB|WBGene00000786
            symbol:cpr-6 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            PANTHER:PTHR12411:SF16 EMBL:L39894 EMBL:L39939 EMBL:FO080666
            PIR:T37274 RefSeq:NP_741818.1 UniGene:Cel.18138
            ProteinModelPortal:P43510 SMR:P43510 DIP:DIP-25139N
            MINT:MINT-1074025 STRING:P43510 MEROPS:C01.A51 PaxDb:P43510
            PRIDE:P43510 EnsemblMetazoa:C25B8.3a GeneID:180931
            KEGG:cel:CELE_C25B8.3 UCSC:C25B8.3a CTD:180931 WormBase:C25B8.3a
            InParanoid:P43510 OMA:KAKWGLM NextBio:911608 ArrayExpress:P43510
            Uniprot:P43510
        Length = 379

 Score = 143 (55.4 bits), Expect = 2.5e-16, Sum P(2) = 2.5e-16
 Identities = 55/184 (29%), Positives = 85/184 (46%)

Query:    93 IDYIN-SQNLSFKLTDNKFADLSNEEFISTY--LGYN----KPYNEPRWPSVQYLGL--P 143
             IDY+N +QNL       +F+ +  E   + +  +G N        +      + L L  P
Sbjct:    47 IDYVNENQNLWTAKKQRRFSSVYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDLDIP 106

Query:   144 ASVD----WRKEGAVTPVKDQGQCGSCWAFSAVAAV-EGINKLKTGKL-VSLSEQELVDC 197
              S D    W K  ++  ++DQ  CGSCWAF AV A+ + I     G+L V+LS  +L+ C
Sbjct:   107 ESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSC 166

Query:   198 DVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQT---DKTKHHAVTITGY 254
                S   GCNGG    A+ +  K  G+ T  +Y     N+ C+       +HH+   T +
Sbjct:   167 -CKSCGFGCNGGDPLAAWRYWVK-DGIVTGSNYT---ANNGCKPYPFPPCEHHSKK-THF 220

Query:   255 EAIP 258
             +  P
Sbjct:   221 DPCP 224

 Score = 128 (50.1 bits), Expect = 2.5e-16, Sum P(2) = 2.5e-16
 Identities = 28/70 (40%), Positives = 37/70 (52%)

Query:   263 FQLYSHGVFDEYCGHQLN--HGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
             F  Y  GV+  + G +L   H V ++G+G D G  YW V NSW T WGE G+ R+ R   
Sbjct:   286 FLNYDGGVY-VHTGGKLGGGHAVKLIGWGIDDGIPYWTVANSWNTDWGEDGFFRILRGVD 344

Query:   321 SSNI--GICG 328
                I  G+ G
Sbjct:   345 ECGIESGVVG 354


>WB|WBGene00000782
            symbol:cpr-2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 eggNOG:NOG315657 GeneTree:ENSGT00560000076599
            HOGENOM:HOG000241341 PANTHER:PTHR12411:SF16 EMBL:Z81531
            RefSeq:NP_507186.3 ProteinModelPortal:O45466 SMR:O45466
            MEROPS:C01.A40 PaxDb:O45466 EnsemblMetazoa:F36D3.9 GeneID:185355
            KEGG:cel:CELE_F36D3.9 CTD:185355 WormBase:F36D3.9 OMA:FDARLRW
            Uniprot:O45466
        Length = 326

 Score = 136 (52.9 bits), Expect = 2.6e-16, Sum P(2) = 2.6e-16
 Identities = 25/68 (36%), Positives = 36/68 (52%)

Query:   263 FQLYSHGVFDEYCGHQLN-HGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
             F+ Y  G++    G     H V ++G+G + G  YWL  NSWG+ WGE+G  R+ R    
Sbjct:   254 FEKYKSGIYRHIAGRSKGGHAVKLIGWGTERGTPYWLAVNSWGSQWGESGTFRILRG--- 310

Query:   322 SNIGICGI 329
               +  CGI
Sbjct:   311 --VDECGI 316

 Score = 133 (51.9 bits), Expect = 2.6e-16, Sum P(2) = 2.6e-16
 Identities = 39/147 (26%), Positives = 71/147 (48%)

Query:    93 IDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPY-NEPRWPSVQYL--GLPASVD-- 147
             +D+INS   +F+ T+N           S +  +N P+ +E R    +++    P + D  
Sbjct:    32 VDHINSAASTFQ-TENYAVTHEKMHTRSMHEKFNAPFPDEFRATEREFVLDATPLNFDAR 90

Query:   148 --WRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS--LSEQELVDCDVNSEN 203
               W +  ++  +++Q  CGSCWAFS    +     + +       +S  +L+ C   S  
Sbjct:    91 TRWPQCKSMKLIREQSNCGSCWAFSTAEVISDRTCIASNGTQQPIISPTDLLTCCGMSCG 150

Query:   204 QGCNGGYMEKAFEFITKIGGVTTEDDY 230
             +GC+GG+  +AF++  +  GV T  DY
Sbjct:   151 EGCDGGFPYRAFQWWAR-RGVVTGGDY 176


>WB|WBGene00000789
            symbol:cpz-2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GeneTree:ENSGT00560000076599 KO:K08568 EMBL:Z81103
            HSSP:P80067 PIR:T23720 RefSeq:NP_506318.1 ProteinModelPortal:P92005
            SMR:P92005 STRING:P92005 MEROPS:C01.A41 PaxDb:P92005
            EnsemblMetazoa:M04G12.2 GeneID:179818 KEGG:cel:CELE_M04G12.2
            UCSC:M04G12.2 CTD:179818 WormBase:M04G12.2 eggNOG:NOG275763
            InParanoid:P92005 OMA:VEYWIAR NextBio:906990 Uniprot:P92005
        Length = 467

 Score = 140 (54.3 bits), Expect = 3.6e-16, Sum P(2) = 3.6e-16
 Identities = 40/120 (33%), Positives = 54/120 (45%)

Query:   132 PR-WPSVQYLG--LPASVDWRKEGAV---TPVKDQG---QCGSCWAFSAVAAV-EGINKL 181
             PR W S  +    LP   DWR    V   +P ++Q     CGSCW F    A+ +  N  
Sbjct:   208 PREWESSSFKSNDLPTGWDWRNVSGVNYCSPTRNQHIPVYCGSCWVFGTTGALNDRFNVA 267

Query:   182 KTGK--LVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRC 239
             + G+  +  LS QE++DC  N +   C GG +    E   KI G+  E    YR  N  C
Sbjct:   268 RKGRWPMTQLSPQEIIDC--NGKGN-CQGGEIGNVLEH-AKIQGLVEEGCNVYRATNGEC 323

 Score = 133 (51.9 bits), Expect = 3.6e-16, Sum P(2) = 3.6e-16
 Identities = 25/62 (40%), Positives = 39/62 (62%)

Query:   256 AIPARYAFQL-YSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYI 313
             AI A   F+  Y  GV+ E    + NH +++ G+G D +G +YW+ +NSWG +WGE G+ 
Sbjct:   373 AIGATKKFEYEYVKGVYSEKSDLESNHIISLTGWGVDENGVEYWIARNSWGEAWGELGWF 432

Query:   314 RM 315
             R+
Sbjct:   433 RV 434

 Score = 40 (19.1 bits), Expect = 9.1e-06, Sum P(2) = 9.1e-06
 Identities = 5/6 (83%), Positives = 6/6 (100%)

Query:   162 QCGSCW 167
             +CGSCW
Sbjct:   328 RCGSCW 333


>DICTYBASE|DDB_G0288563
            symbol:DDB_G0288563 species:44689 "Dictyostelium
            discoideum" [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0288563
            GO:GO:0005615 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197
            EMBL:AAFI02000117 PANTHER:PTHR12411:SF16 RefSeq:XP_636643.1
            MEROPS:C01.A58 PRIDE:Q54IS1 EnsemblProtists:DDB0187993
            GeneID:8626689 KEGG:ddi:DDB_G0288563 InParanoid:Q54IS1 OMA:AWEYMEL
            Uniprot:Q54IS1
        Length = 314

 Score = 148 (57.2 bits), Expect = 4.6e-16, Sum P(2) = 4.6e-16
 Identities = 38/103 (36%), Positives = 55/103 (53%)

Query:   142 LPASVDWRKE--GAVTPVKDQGQCGSCWAFSAVAA------VEGINKLKTGKLVSLSEQE 193
             +P S D R +    + P+ +Q QCGSCWAFS+         +   NK   G   +LS Q 
Sbjct:    88 IPTSFDSRVQWPDCIHPILNQEQCGSCWAFSSSEVLSDRLCIASNNKTNPG---ALSPQT 144

Query:   194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN 236
             LV CDV   N GC+GG  + A+E++ ++ G+ T+   PY   N
Sbjct:   145 LVACDVYG-NDGCSGGIPQLAWEYM-ELKGLPTDSCVPYTAGN 185

 Score = 116 (45.9 bits), Expect = 4.6e-16, Sum P(2) = 4.6e-16
 Identities = 27/67 (40%), Positives = 36/67 (53%)

Query:   263 FQLYSHGVFDEYCGHQL--NHGVTVVGYGEDHGEK--YWLVKNSWGTSWGEAGY--IRMA 316
             F  YS GV+    G  L   H + +VG+G D   +  YW+V NSWG  WG+ G+  I M 
Sbjct:   240 FMSYSSGVYVMTPGSSLLGGHAIKIVGWGFDQTSQLNYWIVANSWGADWGQQGFFFISME 299

Query:   317 RNSPSSN 323
               S SS+
Sbjct:   300 TCSISSD 306


>UNIPROTKB|E2QV47
            symbol:CTSH "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0097208 "alveolar lamellar body"
            evidence=IEA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0070371 "ERK1 and ERK2 cascade"
            evidence=IEA] [GO:0070324 "thyroid hormone binding" evidence=IEA]
            [GO:0060448 "dichotomous subdivision of terminal units involved in
            lung branching" evidence=IEA] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IEA] [GO:0043129 "surfactant homeostasis"
            evidence=IEA] [GO:0043066 "negative regulation of apoptotic
            process" evidence=IEA] [GO:0033619 "membrane protein proteolysis"
            evidence=IEA] [GO:0032526 "response to retinoic acid" evidence=IEA]
            [GO:0031648 "protein destabilization" evidence=IEA] [GO:0031638
            "zymogen activation" evidence=IEA] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=IEA] [GO:0016505
            "apoptotic protease activator activity" evidence=IEA] [GO:0010815
            "bradykinin catabolic process" evidence=IEA] [GO:0010813
            "neuropeptide catabolic process" evidence=IEA] [GO:0010634
            "positive regulation of epithelial cell migration" evidence=IEA]
            [GO:0010628 "positive regulation of gene expression" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=IEA] [GO:0005829 "cytosol" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0004252 "serine-type endopeptidase activity"
            evidence=IEA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IEA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 GO:GO:0016505 GO:GO:0010634
            GO:GO:0004197 GO:GO:0042599 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 Ensembl:ENSCAFT00000036196 Uniprot:E2QV47
        Length = 136

 Score = 171 (65.3 bits), Expect = 4.8e-16, Sum P(2) = 4.8e-16
 Identities = 33/79 (41%), Positives = 43/79 (54%)

Query:   263 FQLYSHGVFDEYCGHQ----LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
             F +Y  G++     H+    +NH V  VGYGE +G  YW+VKNSWG  WG  GY  M R 
Sbjct:    60 FMMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEQNGIPYWIVKNSWGPQWGMNGYFLMERG 119

Query:   319 SPSSNIGICGILMQASYPV 337
                    +CG+   ASYP+
Sbjct:   120 K-----NMCGLAACASYPI 133

 Score = 52 (23.4 bits), Expect = 4.8e-16, Sum P(2) = 4.8e-16
 Identities = 8/19 (42%), Positives = 13/19 (68%)

Query:   227 EDDYPYRGKNDRCQTDKTK 245
             ED YPY+G++  C+   +K
Sbjct:     3 EDSYPYKGQDGDCKYQPSK 21


>DICTYBASE|DDB_G0280187
            symbol:DDB_G0280187 "cathepsin Z-like protein"
            species:44689 "Dictyostelium discoideum" [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            dictyBase:DDB_G0280187 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            EMBL:AAFI02000035 KO:K08568 RefSeq:XP_641294.1
            ProteinModelPortal:Q54VR1 MEROPS:C01.A61 PRIDE:Q54VR1
            EnsemblProtists:DDB0233838 GeneID:8622427 KEGG:ddi:DDB_G0280187
            InParanoid:Q54VR1 OMA:VWKVGDY Uniprot:Q54VR1
        Length = 291

 Score = 147 (56.8 bits), Expect = 7.9e-16, Sum P(2) = 7.9e-16
 Identities = 24/58 (41%), Positives = 40/58 (68%)

Query:   262 AFQLYSHGVFDEYCGH--QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
             AF+ Y+ GVF    G   ++NH ++++G+G ++G  YW+ +NSWGT +GE G+ R+ R
Sbjct:   214 AFESYTSGVFTSSVGSTGEINHEISIIGWGTENGVDYWIGRNSWGTYFGELGFFRIQR 271

 Score = 113 (44.8 bits), Expect = 7.9e-16, Sum P(2) = 7.9e-16
 Identities = 34/112 (30%), Positives = 57/112 (50%)

Query:   142 LPASVDWRK-EGA--VTPVKDQG---QCGSCWAFSAVAAVEGINKLKTGKL-----VSLS 190
             LP   DWR   G+  +T  ++Q     CGSCWA    +A+ G +++K G+      V L+
Sbjct:    49 LPTQYDWRNISGSSYITITRNQHLPQYCGSCWAHGTTSAL-G-DRIKIGRKGTFPEVVLA 106

Query:   191 EQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTD 242
              Q L++C    +N  C+GG   +A+ ++    G+T E   PY   ++ C  +
Sbjct:   107 PQVLLNC-AGPDNT-CDGGDPTEAYAYMAA-KGITDETCAPYEAIDNECNAE 155


>DICTYBASE|DDB_G0288221
            symbol:DDB_G0288221 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0288221 Pfam:PF00188 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 PANTHER:PTHR12411 EMBL:AAFI02000109 Gene3D:3.40.33.10
            InterPro:IPR001283 InterPro:IPR014044 SMART:SM00198 SUPFAM:SSF55797
            MEROPS:C01.A52 ProtClustDB:CLSZ2429919 RefSeq:XP_636852.1
            ProteinModelPortal:Q54J84 EnsemblProtists:DDB0187839 GeneID:8626520
            KEGG:ddi:DDB_G0288221 InParanoid:Q54J84 Uniprot:Q54J84
        Length = 395

 Score = 216 (81.1 bits), Expect = 1.4e-15, P = 1.4e-15
 Identities = 65/216 (30%), Positives = 95/216 (43%)

Query:   145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTG----KLVSLSEQELVDCDVN 200
             SVDW      TPV+DQG+C SCW F ++AA+E    +K G      + LS Q  ++C + 
Sbjct:   191 SVDW--SDYQTPVRDQGECKSCWVFGSLAALESRYLIKNGVSEKSTLHLSAQNAMNC-IT 247

Query:   201 SENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG-KNDRCQTDKTKHHAVTITGYEAIP- 258
             S   GC  G+    F++     G+  E DYPY    +D C +   K      +GY+++  
Sbjct:   248 S---GCESGWPANVFDYFES-SGIAFEKDYPYDAIGSDNCTSSSNKFE---YSGYDSVEN 300

Query:   259 -----------ARYAFQLYSHGVFDEYCG-------HQLNHGVTVVGYGEDHGEKYWLVK 300
                              LYS   F  Y G          +    V+  G D     W +K
Sbjct:   301 TKDSLIQELKNGPITIALYSDTAFQSYAGGIYDSVEEYKDVNHIVLLVGYDKPTDSWKIK 360

Query:   301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
             NS GT WGE GY R+  ++    +GI  +L  + +P
Sbjct:   361 NSLGTKWGELGYARITASN--DKLGI--LLYNSFFP 392


>UNIPROTKB|E9PQM1
            symbol:CTSB "Cathepsin B heavy chain" species:9606 "Homo
            sapiens" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0005739
            "mitochondrion" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005730 "nucleolus" evidence=IDA] [GO:0043231 "intracellular
            membrane-bounded organelle" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 InterPro:IPR000169 GO:GO:0005739
            GO:GO:0048471 GO:GO:0005615 GO:GO:0009612 GO:GO:0009611
            GO:GO:0005730 GO:GO:0009897 GO:GO:0045471 GO:GO:0016324
            GO:GO:0009749 GO:GO:0006914 GO:GO:0043434 GO:GO:0006508
            PANTHER:PTHR12411 PROSITE:PS00139 GO:GO:0050790 GO:GO:0005764
            GO:GO:0042383 GO:GO:0014070 GO:GO:0042277 GO:GO:0060548
            GO:GO:0005901 GO:GO:0014075 GO:GO:0004197 GO:GO:0070670
            GO:GO:0007519 PANTHER:PTHR12411:SF16 HGNC:HGNC:2527 ChiTaRS:CTSB
            EMBL:AC069185 EMBL:AC025857 IPI:IPI01011636
            ProteinModelPortal:E9PQM1 SMR:E9PQM1 Ensembl:ENST00000534636
            ArrayExpress:E9PQM1 Bgee:E9PQM1 Uniprot:E9PQM1
        Length = 175

 Score = 192 (72.6 bits), Expect = 7.9e-15, P = 7.9e-15
 Identities = 50/143 (34%), Positives = 78/143 (54%)

Query:    91 QYIDYINSQNLSFKLTDNKF-ADLSN-EEFISTYLGYNKPYNEPRWPSVQYLGLPASVDW 148
             + ++Y+N +N +++   N +  D+S  +    T+LG  KP    R    + L LPAS D 
Sbjct:    29 ELVNYVNKRNTTWQAGHNFYNVDMSYLKRLCGTFLGGPKP--PQRVMFTEDLKLPASFDA 86

Query:   149 RKEGAVTP----VKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL--SEQELVDCDVNSE 202
             R++    P    ++DQG CGSCWAF AV A+     + T   VS+  S ++L+ C  +  
Sbjct:    87 REQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMC 146

Query:   203 NQGCNGGYMEKAFEFITKIGGVT 225
               GCNGGY  +A+ F T+ G V+
Sbjct:   147 GDGCNGGYPAEAWNFWTRKGLVS 169


>UNIPROTKB|E9PSG5
            symbol:CTSB "Cathepsin B heavy chain" species:9606 "Homo
            sapiens" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0005739
            "mitochondrion" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005730 "nucleolus" evidence=IDA] [GO:0043231 "intracellular
            membrane-bounded organelle" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 InterPro:IPR000169 GO:GO:0005739
            GO:GO:0048471 GO:GO:0005615 GO:GO:0009612 GO:GO:0009611
            GO:GO:0005730 GO:GO:0009897 GO:GO:0045471 GO:GO:0016324
            GO:GO:0009749 GO:GO:0006914 GO:GO:0043434 GO:GO:0006508
            PANTHER:PTHR12411 PROSITE:PS00139 GO:GO:0050790 GO:GO:0005764
            GO:GO:0042383 GO:GO:0014070 GO:GO:0042277 GO:GO:0060548
            GO:GO:0005901 GO:GO:0014075 GO:GO:0004197 GO:GO:0070670
            GO:GO:0007519 PANTHER:PTHR12411:SF16 HGNC:HGNC:2527 ChiTaRS:CTSB
            EMBL:AC069185 EMBL:AC025857 IPI:IPI00976781
            ProteinModelPortal:E9PSG5 SMR:E9PSG5 Ensembl:ENST00000533572
            ArrayExpress:E9PSG5 Bgee:E9PSG5 Uniprot:E9PSG5
        Length = 170

 Score = 192 (72.6 bits), Expect = 7.9e-15, P = 7.9e-15
 Identities = 50/143 (34%), Positives = 78/143 (54%)

Query:    91 QYIDYINSQNLSFKLTDNKF-ADLSN-EEFISTYLGYNKPYNEPRWPSVQYLGLPASVDW 148
             + ++Y+N +N +++   N +  D+S  +    T+LG  KP    R    + L LPAS D 
Sbjct:    29 ELVNYVNKRNTTWQAGHNFYNVDMSYLKRLCGTFLGGPKP--PQRVMFTEDLKLPASFDA 86

Query:   149 RKEGAVTP----VKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL--SEQELVDCDVNSE 202
             R++    P    ++DQG CGSCWAF AV A+     + T   VS+  S ++L+ C  +  
Sbjct:    87 REQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMC 146

Query:   203 NQGCNGGYMEKAFEFITKIGGVT 225
               GCNGGY  +A+ F T+ G V+
Sbjct:   147 GDGCNGGYPAEAWNFWTRKGLVS 169


>WB|WBGene00000783
            symbol:cpr-3 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0009792 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39890 EMBL:L39925 EMBL:Z81119
            EMBL:Z82057 PIR:T37282 RefSeq:NP_506790.1 UniGene:Cel.23503
            ProteinModelPortal:P43507 SMR:P43507 MEROPS:C01.A33
            EnsemblMetazoa:T10H4.12 GeneID:180033 KEGG:cel:CELE_T10H4.12
            UCSC:T10H4.12 CTD:180033 WormBase:T10H4.12 eggNOG:NOG240190
            InParanoid:P43507 OMA:PVEASYK NextBio:907824 Uniprot:P43507
        Length = 370

 Score = 135 (52.6 bits), Expect = 8.3e-15, Sum P(2) = 8.3e-15
 Identities = 33/102 (32%), Positives = 49/102 (48%)

Query:   229 DYPYRGKNDRCQTDKTKHHAVT-ITGYEAIPARYA----FQLYSHGVFDEYCGHQLN-HG 282
             D  Y     +  T K+     T I  Y  + A Y     F  Y  GV+    G  +  H 
Sbjct:   226 DKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHYKSGVYHYTSGKLVGGHA 285

Query:   283 VTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
             V ++G+G ++G  YWL+ NSWGTS+GE G+ ++ R +    I
Sbjct:   286 VKIIGWGVENGVDYWLIANSWGTSFGEKGFFKIRRGTNECQI 327

 Score = 122 (48.0 bits), Expect = 8.3e-15, Sum P(2) = 8.3e-15
 Identities = 37/137 (27%), Positives = 58/137 (42%)

Query:    96 INSQNLSFKLTDNKFAD-LSNEEFISTYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAV 154
             I+   + FK+ D KFA+ L  +  +++ L           P        A   W     +
Sbjct:    53 ISEFEMKFKVMDVKFAEPLEKDSDVASELFVRGEIVPEPLPDT----FDAREKWPDCNTI 108

Query:   155 TPVKDQGQCGSCWAFSAVAAVEG---INKLKTGKLVSLSEQELVDCDVNSENQGCNGGYM 211
               +++Q  CGSCWAF A   +     I    T + V +S ++++ C   +   GC GGY 
Sbjct:   109 KLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPV-ISVEDILSCCGTTCGYGCKGGYS 167

Query:   212 EKAFEFITKIGGVTTED 228
              +A  F    G VT  D
Sbjct:   168 IEALRFWASSGAVTGGD 184


>UNIPROTKB|P05689
            symbol:CTSZ "Cathepsin Z" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:BC122603
            EMBL:X01809 IPI:IPI00708474 PIR:A29172 RefSeq:NP_001071303.1
            UniGene:Bt.4902 ProteinModelPortal:P05689 SMR:P05689 MEROPS:C01.013
            PRIDE:P05689 GeneID:404187 KEGG:bta:404187 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 InParanoid:P05689 KO:K08568
            OrthoDB:EOG42Z4QN BRENDA:3.4.18.1 NextBio:20817615 Uniprot:P05689
        Length = 304

 Score = 140 (54.3 bits), Expect = 1.2e-14, Sum P(2) = 1.2e-14
 Identities = 23/55 (41%), Positives = 37/55 (67%)

Query:   266 YSHGVFDEYCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
             Y+ G++ EY     +NH V+V G+G   G +YW+V+NSWG  WGE G++R+  ++
Sbjct:   226 YTGGIYSEYNDQAFINHIVSVAGWGVSDGMEYWIVRNSWGEPWGEHGWMRIVTST 280

 Score = 111 (44.1 bits), Expect = 1.2e-14, Sum P(2) = 1.2e-14
 Identities = 49/171 (28%), Positives = 74/171 (43%)

Query:   121 TYLGYNKPYNEPRWPSVQYLG---LPASVDWRKEGAV---TPVKDQG---QCGSCWAFSA 171
             T LG  + Y  P     +YL    LP S DWR    V   +  ++Q     CGSCWA  +
Sbjct:    44 TQLG-RRTYPRPH----EYLSPSDLPKSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGS 98

Query:   172 VAAV-EGINKLKTGKLVS--LSEQELVDC-DVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
              +A+ + IN  + G   S  LS Q ++DC D  S    C GG     +E+  +  G+  E
Sbjct:    99 TSAMADRINIKRKGAWPSTLLSVQHVIDCGDAGS----CEGGNDLPVWEYAHR-HGIPDE 153

Query:   228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQLYSHGVFDEYCGHQ 278
                 Y+ K+  C  DK  +   T T ++       + L+  G +    G +
Sbjct:   154 TCNNYQAKDQEC--DKF-NQCGTCTEFKECHVIKNYTLWKVGDYGSLSGRE 201

 Score = 55 (24.4 bits), Expect = 7.5e-09, Sum P(2) = 7.5e-09
 Identities = 21/97 (21%), Positives = 34/97 (35%)

Query:   132 PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSE 191
             P W      G+P       +          QCG+C  F     ++     K G   SLS 
Sbjct:   140 PVWEYAHRHGIPDETCNNYQAKDQECDKFNQCGTCTEFKECHVIKNYTLWKVGDYGSLSG 199

Query:   192 QELVDCDVNSENQ-GCNGGYMEKAFEFITKIGGVTTE 227
             +E +  ++ +     C     EK   +    GG+ +E
Sbjct:   200 REKMMAEIYTNGPISCGIMATEKMSNYT---GGIYSE 233


>UNIPROTKB|F1MW68
            symbol:CTSZ "Cathepsin Z" species:9913 "Bos taurus"
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0060441
            GeneTree:ENSGT00560000076599 IPI:IPI00708474 UniGene:Bt.4902
            OMA:QCGTCTE EMBL:DAAA02036315 PRIDE:F1MW68
            Ensembl:ENSBTAT00000025007 Uniprot:F1MW68
        Length = 304

 Score = 140 (54.3 bits), Expect = 2.0e-14, Sum P(2) = 2.0e-14
 Identities = 23/55 (41%), Positives = 37/55 (67%)

Query:   266 YSHGVFDEYCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
             Y+ G++ EY     +NH V+V G+G   G +YW+V+NSWG  WGE G++R+  ++
Sbjct:   226 YTGGIYSEYNDQAFINHIVSVAGWGVSDGMEYWIVRNSWGEPWGEHGWMRIVTST 280

 Score = 109 (43.4 bits), Expect = 2.0e-14, Sum P(2) = 2.0e-14
 Identities = 49/171 (28%), Positives = 74/171 (43%)

Query:   121 TYLGYNKPYNEPRWPSVQYLG---LPASVDWRKEGAV---TPVKDQG---QCGSCWAFSA 171
             T LG  + Y  P     +YL    LP S DWR    V   +  ++Q     CGSCWA  +
Sbjct:    44 TQLG-RRTYPRPH----EYLSPSDLPKSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGS 98

Query:   172 VAAV-EGINKLKTGKLVS--LSEQELVDC-DVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
              +A+ + IN  + G   S  LS Q ++DC D  S    C GG     +E+  +  G+  E
Sbjct:    99 TSAMADRINIKRKGAWPSTLLSVQHVLDCGDAGS----CEGGNDLPVWEYAHR-HGIPDE 153

Query:   228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQLYSHGVFDEYCGHQ 278
                 Y+ K+  C  DK  +   T T ++       + L+  G +    G +
Sbjct:   154 TCNNYQAKDQEC--DKF-NQCGTCTEFKECHVIKNYTLWKVGDYGSLSGRE 201

 Score = 55 (24.4 bits), Expect = 7.5e-09, Sum P(2) = 7.5e-09
 Identities = 21/97 (21%), Positives = 34/97 (35%)

Query:   132 PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSE 191
             P W      G+P       +          QCG+C  F     ++     K G   SLS 
Sbjct:   140 PVWEYAHRHGIPDETCNNYQAKDQECDKFNQCGTCTEFKECHVIKNYTLWKVGDYGSLSG 199

Query:   192 QELVDCDVNSENQ-GCNGGYMEKAFEFITKIGGVTTE 227
             +E +  ++ +     C     EK   +    GG+ +E
Sbjct:   200 REKMMAEIYTNGPISCGIMATEKMSNYT---GGIYSE 233


>UNIPROTKB|E1C4M3
            symbol:CTSZ "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0060441 "epithelial tube branching
            involved in lung morphogenesis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GeneTree:ENSGT00560000076599 CTD:1522 KO:K08568 OMA:QCGTCTE
            EMBL:AADN02019004 IPI:IPI00596430 RefSeq:XP_417483.3
            Ensembl:ENSGALT00000012067 GeneID:419311 KEGG:gga:419311
            Uniprot:E1C4M3
        Length = 305

 Score = 143 (55.4 bits), Expect = 2.2e-14, Sum P(2) = 2.2e-14
 Identities = 25/64 (39%), Positives = 41/64 (64%)

Query:   257 IPARYAFQLYSHGVFDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRM 315
             I A      Y+ G++ EY     +NH V+V G+G ++G +YW+V+NSWG  WGE G++R+
Sbjct:   217 IMATEKLDAYTGGLYTEYNPSPTVNHIVSVAGWGVENGTEYWIVRNSWGEPWGERGWLRI 276

Query:   316 ARNS 319
               ++
Sbjct:   277 VTSA 280

 Score = 105 (42.0 bits), Expect = 2.2e-14, Sum P(2) = 2.2e-14
 Identities = 35/109 (32%), Positives = 51/109 (46%)

Query:   142 LPASVDWRKEGAV---TPVKDQG---QCGSCWAFSAVAAV-EGINKLKTGKLVS--LSEQ 192
             LP S DWR    V   +  ++Q     CGSCWA  + +A+ + IN  + G   S  LS Q
Sbjct:    63 LPQSWDWRNVNGVNYASTTRNQHIPQYCGSCWAHGSTSALADRINIKRKGAWPSAYLSVQ 122

Query:   193 ELVDCDVNSENQG-CNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQ 240
              ++DC     N G C GG     + +     G+  E    Y+ KN +C+
Sbjct:   123 NVIDC----ANAGSCEGGDHTGVWMYAHD-HGIPDETCNNYQAKNQKCK 166


>UNIPROTKB|E9PLY3
            symbol:CTSB "Cathepsin B heavy chain" species:9606 "Homo
            sapiens" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0005739
            "mitochondrion" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005730 "nucleolus" evidence=IDA] [GO:0043231 "intracellular
            membrane-bounded organelle" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 InterPro:IPR000169 GO:GO:0005739
            GO:GO:0048471 GO:GO:0005615 GO:GO:0009612 GO:GO:0009611
            GO:GO:0005730 GO:GO:0009897 GO:GO:0045471 GO:GO:0016324
            GO:GO:0009749 GO:GO:0006914 GO:GO:0043434 GO:GO:0006508
            PANTHER:PTHR12411 PROSITE:PS00139 GO:GO:0050790 GO:GO:0005764
            GO:GO:0042383 GO:GO:0014070 GO:GO:0042277 GO:GO:0060548
            GO:GO:0005901 GO:GO:0014075 GO:GO:0004197 GO:GO:0070670
            GO:GO:0007519 PANTHER:PTHR12411:SF16 HGNC:HGNC:2527 ChiTaRS:CTSB
            EMBL:AC069185 EMBL:AC025857 IPI:IPI00978678
            ProteinModelPortal:E9PLY3 SMR:E9PLY3 Ensembl:ENST00000530296
            ArrayExpress:E9PLY3 Bgee:E9PLY3 Uniprot:E9PLY3
        Length = 165

 Score = 188 (71.2 bits), Expect = 2.3e-14, P = 2.3e-14
 Identities = 48/138 (34%), Positives = 75/138 (54%)

Query:    91 QYIDYINSQNLSFKLTDNKF-ADLSN-EEFISTYLGYNKPYNEPRWPSVQYLGLPASVDW 148
             + ++Y+N +N +++   N +  D+S  +    T+LG  KP    R    + L LPAS D 
Sbjct:    29 ELVNYVNKRNTTWQAGHNFYNVDMSYLKRLCGTFLGGPKP--PQRVMFTEDLKLPASFDA 86

Query:   149 RKEGAVTP----VKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL--SEQELVDCDVNSE 202
             R++    P    ++DQG CGSCWAF AV A+     + T   VS+  S ++L+ C  +  
Sbjct:    87 REQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMC 146

Query:   203 NQGCNGGYMEKAFEFITK 220
               GCNGGY  +A+ F T+
Sbjct:   147 GDGCNGGYPAEAWNFWTR 164


>MGI|MGI:2137617
            symbol:Tinagl1 "tubulointerstitial nephritis antigen-like 1"
            species:10090 "Mus musculus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            [GO:0043236 "laminin binding" evidence=IPI] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 MGI:MGI:2137617
            GO:GO:0005737 GO:GO:0005576 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GO:GO:0031012 CleanEx:MM_ARG1 GO:GO:0005044
            GeneTree:ENSGT00560000076599 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OMA:DNCNRCT
            OrthoDB:EOG4BG8W0 EMBL:AB047402 EMBL:AB050626 EMBL:BC005738
            EMBL:BC018539 IPI:IPI00115458 RefSeq:NP_001161805.1
            RefSeq:NP_075965.2 UniGene:Mm.15801 ProteinModelPortal:Q99JR5
            SMR:Q99JR5 STRING:Q99JR5 PhosphoSite:Q99JR5 PaxDb:Q99JR5
            PRIDE:Q99JR5 Ensembl:ENSMUST00000030560 Ensembl:ENSMUST00000105998
            Ensembl:ENSMUST00000105999 GeneID:94242 KEGG:mmu:94242
            InParanoid:Q99JR5 NextBio:352247 Bgee:Q99JR5 Genevestigator:Q99JR5
            GermOnline:ENSMUSG00000028776 Uniprot:Q99JR5
        Length = 466

 Score = 143 (55.4 bits), Expect = 3.1e-14, Sum P(2) = 3.1e-14
 Identities = 40/133 (30%), Positives = 63/133 (47%)

Query:   120 STYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA-AVEGI 178
             ST +  N+ Y       V      AS  W     +    DQG C   WAFS  A A + +
Sbjct:   184 STVMNMNEIYTVLGQGEVLPTAFEASEKW--PNLIHEPLDQGNCAGSWAFSTAAVASDRV 241

Query:   179 NKLKTGKLVS-LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK-- 235
             +    G +   LS Q L+ CD + + QGC GG ++ A+ F+ +  GV +++ YP+ G+  
Sbjct:   242 SIHSLGHMTPILSPQNLLSCDTHHQ-QGCRGGRLDGAWWFLRR-RGVVSDNCYPFSGREQ 299

Query:   236 NDRCQTDKTKHHA 248
             N+   T +   H+
Sbjct:   300 NEASPTPRCMMHS 312

 Score = 111 (44.1 bits), Expect = 3.1e-14, Sum P(2) = 3.1e-14
 Identities = 25/69 (36%), Positives = 35/69 (50%)

Query:   261 YAFQLYSHGVFDEYCGHQLNHGVTVVGYGED---HGE--KYWLVKNSWGTSWGEAGYIRM 315
             Y+    S G  ++Y  H   H V + G+GE+    G   KYW   NSWG  WGE G+ R+
Sbjct:   380 YSHTPVSQGRPEQYRRHG-THSVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRI 438

Query:   316 ARNSPSSNI 324
              R +   +I
Sbjct:   439 VRGTNECDI 447


>UNIPROTKB|F1PIF2
            symbol:CTSZ "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0060441 "epithelial tube branching involved
            in lung morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0005783 GO:GO:0005615 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GO:GO:0060441 GeneTree:ENSGT00560000076599 OMA:QCGTCTE
            EMBL:AAEX03014054 Ensembl:ENSCAFT00000019357 Uniprot:F1PIF2
        Length = 261

 Score = 139 (54.0 bits), Expect = 4.3e-14, Sum P(2) = 4.3e-14
 Identities = 23/55 (41%), Positives = 37/55 (67%)

Query:   266 YSHGVFDEYCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
             Y+ G+  EY     +NH ++VVG+G   G +YW+V+NSWG  WGE G++R+  ++
Sbjct:   183 YTGGIHAEYQEQAYINHVISVVGWGVSDGTEYWIVRNSWGEPWGERGWMRIVTST 237

 Score = 103 (41.3 bits), Expect = 4.3e-14, Sum P(2) = 4.3e-14
 Identities = 45/166 (27%), Positives = 73/166 (43%)

Query:   127 KPYNEPRWPSV-QYLG---LPASVDWRKEGAV---TPVKDQG---QCGSCWAFSAVAAV- 175
             +P +   +P   +YL    LP S DWR    V   +  ++Q     CGSCWA  + +A+ 
Sbjct:     1 RPVSSRTYPRPHEYLSPSDLPKSWDWRNVNGVNYASATRNQHIPQYCGSCWAHGSTSAMA 60

Query:   176 EGINKLKTGKLVS--LSEQELVDCDVNSENQG-CNGGYMEKAFEFITKIGGVTTEDDYPY 232
             + IN  + G   S  LS Q ++DC     N G C GG     + +  +  G+  E    Y
Sbjct:    61 DRINIKRKGAWPSTLLSVQHVLDC----ANAGSCEGGNDLPVWSYAHE-HGIPDETCNNY 115

Query:   233 RGKNDRCQTDKTKHHAVTITGYEAIPARYAFQLYSHGVFDEYCGHQ 278
             + K+  C  +K  +   T T ++   A   + L+  G +    G +
Sbjct:   116 QAKDQEC--NKF-NQCGTCTEFKECHAIQNYTLWRVGDYGSLSGRE 158

 Score = 61 (26.5 bits), Expect = 9.5e-10, Sum P(2) = 9.5e-10
 Identities = 21/96 (21%), Positives = 36/96 (37%)

Query:   132 PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSE 191
             P W      G+P       +          QCG+C  F    A++     + G   SLS 
Sbjct:    97 PVWSYAHEHGIPDETCNNYQAKDQECNKFNQCGTCTEFKECHAIQNYTLWRVGDYGSLSG 156

Query:   192 QELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
             +E +  ++ + N   + G M    + +   GG+  E
Sbjct:   157 REKMMAEIYA-NGPISCGIMATE-KMVNYTGGIHAE 190


>UNIPROTKB|F1RWA9
            symbol:CTSO "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 EMBL:CU855637
            Ensembl:ENSSSCT00000009707 OMA:WAFSIVG Uniprot:F1RWA9
        Length = 194

 Score = 184 (69.8 bits), Expect = 6.3e-14, P = 6.3e-14
 Identities = 38/96 (39%), Positives = 52/96 (54%)

Query:   162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKI 221
             QCG CWAFS V+AVE    +K   L  LS Q+++DC  N  N GCNGG    A  ++ K 
Sbjct:     1 QCGGCWAFSVVSAVESAYAIKGQPLEVLSVQQVIDCSYN--NYGCNGGSTLNALYWLNKT 58

Query:   222 G-GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEA 256
                V ++ +YP++ +N  C      H  V+I  Y A
Sbjct:    59 QVKVVSDSEYPFKAQNGLCHYFSCSHSGVSIKDYSA 94

 Score = 134 (52.2 bits), Expect = 2.6e-07, P = 2.6e-07
 Identities = 36/112 (32%), Positives = 56/112 (50%)

Query:   223 GVTTED--DYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQLYSHGVFDEYCGH-QL 279
             GV+ +D   Y + G+ D  +  KT    +T+     I    ++Q Y  G+   +C   + 
Sbjct:    86 GVSIKDYSAYDFSGQED--EMAKT---LLTLGPLIVIVDAVSWQDYLGGIIQHHCSSGEA 140

Query:   280 NHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGY--IRMARNSPSSNIGICGI 329
             NH V V G+ +     YW+V+NSWG++WG  GY  ++M  N       ICGI
Sbjct:   141 NHAVLVTGFDKTGSTPYWIVRNSWGSAWGIDGYALVKMGGN-------ICGI 185


>TAIR|locus:2204873
            symbol:AT1G02300 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002684 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 KO:K01363
            PANTHER:PTHR12411:SF16 OMA:ADDINAC IPI:IPI00534431
            RefSeq:NP_563647.1 UniGene:At.43952 ProteinModelPortal:F4HVZ1
            SMR:F4HVZ1 MEROPS:C01.A10 EnsemblPlants:AT1G02300.1 GeneID:839576
            KEGG:ath:AT1G02300 ArrayExpress:F4HVZ1 Uniprot:F4HVZ1
        Length = 379

 Score = 130 (50.8 bits), Expect = 7.0e-14, Sum P(2) = 7.0e-14
 Identities = 24/64 (37%), Positives = 35/64 (54%)

Query:   263 FQLYSHGVFDEYCGHQLN-HGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
             F  Y  GV+    G ++  H V ++G+G  D GE YWL+ N W  SWG+ GY ++ R + 
Sbjct:   287 FAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTN 346

Query:   321 SSNI 324
                I
Sbjct:   347 ECGI 350

 Score = 119 (46.9 bits), Expect = 7.0e-14, Sum P(2) = 7.0e-14
 Identities = 28/72 (38%), Positives = 39/72 (54%)

Query:   161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
             G CGSCWAF AV ++     +K    VSLS  +++ C       GCNGG+   A+ +  K
Sbjct:   146 GHCGSCWAFGAVESLSDRFCIKYNLNVSLSANDVIACCGLLCGFGCNGGFPMGAWLYF-K 204

Query:   221 IGGVTTEDDYPY 232
               GV T++  PY
Sbjct:   205 YHGVVTQECDPY 216


>UNIPROTKB|Q9UBR2
            symbol:CTSZ "Cathepsin Z" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0060441 "epithelial tube
            branching involved in lung morphogenesis" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            Reactome:REACT_11123 Reactome:REACT_17015 InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 EMBL:CH471077 GO:GO:0005615 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AL109840 GO:GO:0060441 MEROPS:C01.013 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN
            BRENDA:3.4.18.1 EMBL:AF073890 EMBL:AF032906 EMBL:AF136273
            EMBL:AF136276 EMBL:AF136274 EMBL:AF136275 EMBL:AK314931
            EMBL:BC042168 EMBL:AF009923 IPI:IPI00002745 RefSeq:NP_001327.2
            UniGene:Hs.252549 PDB:1DEU PDB:1EF7 PDBsum:1DEU PDBsum:1EF7
            ProteinModelPortal:Q9UBR2 SMR:Q9UBR2 STRING:Q9UBR2 DMDM:12643324
            PaxDb:Q9UBR2 PeptideAtlas:Q9UBR2 PRIDE:Q9UBR2 DNASU:1522
            Ensembl:ENST00000217131 GeneID:1522 KEGG:hsa:1522 UCSC:uc002yai.2
            GeneCards:GC20M057570 HGNC:HGNC:2547 HPA:CAB025114 MIM:603169
            neXtProt:NX_Q9UBR2 PharmGKB:PA27043 InParanoid:Q9UBR2 OMA:QCGTCTE
            PhylomeDB:Q9UBR2 BindingDB:Q9UBR2 ChEMBL:CHEMBL4160 ChiTaRS:CTSZ
            EvolutionaryTrace:Q9UBR2 GenomeRNAi:1522 NextBio:6299 Bgee:Q9UBR2
            CleanEx:HS_CTSZ Genevestigator:Q9UBR2 GermOnline:ENSG00000101160
            Uniprot:Q9UBR2
        Length = 303

 Score = 139 (54.0 bits), Expect = 8.3e-14, Sum P(2) = 8.3e-14
 Identities = 23/55 (41%), Positives = 37/55 (67%)

Query:   266 YSHGVFDEYCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
             Y+ G++ EY     +NH V+V G+G   G +YW+V+NSWG  WGE G++R+  ++
Sbjct:   225 YTGGIYAEYQDTTYINHVVSVAGWGISDGTEYWIVRNSWGEPWGERGWLRIVTST 279

 Score = 104 (41.7 bits), Expect = 8.3e-14, Sum P(2) = 8.3e-14
 Identities = 46/165 (27%), Positives = 74/165 (44%)

Query:   128 PYNEPRWPSV-QYLG---LPASVDWRK-EG----AVTPVKDQGQ-CGSCWAFSAVAAV-E 176
             P     +P   +YL    LP S DWR  +G    ++T  +   Q CGSCWA ++ +A+ +
Sbjct:    44 PLGRSTYPRPHEYLSPADLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMAD 103

Query:   177 GINKLKTGKLVS--LSEQELVDCDVNSENQG-CNGGYMEKAFEFITKIGGVTTEDDYPYR 233
              IN  + G   S  LS Q ++DC     N G C GG     +++  +  G+  E    Y+
Sbjct:   104 RINIKRKGAWPSTLLSVQNVIDCG----NAGSCEGGNDLSVWDYAHQ-HGIPDETCNNYQ 158

Query:   234 GKNDRCQTDKTKHHAVTITGYEAIPARYAFQLYSHGVFDEYCGHQ 278
              K+  C  DK  +   T   ++   A   + L+  G +    G +
Sbjct:   159 AKDQEC--DKF-NQCGTCNEFKECHAIRNYTLWRVGDYGSLSGRE 200


>UNIPROTKB|A5GFX7
            symbol:CTSZ "Cathepsin Z" species:9823 "Sus scrofa"
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GO:GO:0060441 GeneTree:ENSGT00560000076599 MEROPS:C01.013 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN
            OMA:QCGTCTE EMBL:CR956646 RefSeq:NP_001116576.1 UniGene:Ssc.16769
            ProteinModelPortal:A5GFX7 SMR:A5GFX7 STRING:A5GFX7
            Ensembl:ENSSSCT00000008249 GeneID:100141405 KEGG:ssc:100141405
            ArrayExpress:A5GFX7 Uniprot:A5GFX7
        Length = 304

 Score = 139 (54.0 bits), Expect = 8.4e-14, Sum P(2) = 8.4e-14
 Identities = 23/55 (41%), Positives = 37/55 (67%)

Query:   266 YSHGVFDEYCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
             Y+ G++ EY     +NH V+V G+G   G +YW+V+NSWG  WGE G++R+  ++
Sbjct:   226 YTGGIYAEYKDQAYINHIVSVAGWGVSGGTEYWIVRNSWGEPWGERGWMRIVTST 280

 Score = 104 (41.7 bits), Expect = 8.4e-14, Sum P(2) = 8.4e-14
 Identities = 48/171 (28%), Positives = 74/171 (43%)

Query:   121 TYLGYNKPYNEPRWPSVQYLG---LPASVDWRKEGAV---TPVKDQG---QCGSCWAFSA 171
             T LG+ + Y  P     +YL    LP S DWR    V   +  ++Q     CGSCWA  +
Sbjct:    44 TQLGH-RTYPRPH----EYLSPSDLPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGS 98

Query:   172 VAAV-EGINKLKTGKLVS--LSEQELVDCDVNSENQG-CNGGYMEKAFEFITKIGGVTTE 227
              +A+ + IN  + G   S  LS Q ++DC     N G C GG     + +  +  G+  E
Sbjct:    99 TSAMADRINIKRKGAWPSTLLSVQHVIDCG----NAGSCEGGDDLPVWAYAHR-HGIPDE 153

Query:   228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQLYSHGVFDEYCGHQ 278
                 Y+ K+  C  DK  +   T T ++       + L+  G +    G +
Sbjct:   154 TCNNYQAKDQVC--DKF-NQCGTCTEFKECHVIQNYTLWKVGDYGSVSGRE 201

 Score = 56 (24.8 bits), Expect = 7.8e-09, Sum P(2) = 7.8e-09
 Identities = 25/104 (24%), Positives = 38/104 (36%)

Query:   132 PRWPSVQYLGLPASVDWRKEGAVTPVKDQ-GQCGSCWAFSAVAAVEGINKLKTGKLVSLS 190
             P W      G+P         A   V D+  QCG+C  F     ++     K G   S+S
Sbjct:   140 PVWAYAHRHGIPDET-CNNYQAKDQVCDKFNQCGTCTEFKECHVIQNYTLWKVGDYGSVS 198

Query:   191 EQELVDCDVNSENQ-GCNGGYMEKAFEFITKIGGVTTE-DDYPY 232
              +E +  ++ +     C     EK   +    GG+  E  D  Y
Sbjct:   199 GREKMMAEIYANGPISCGIMATEKMSNYT---GGIYAEYKDQAY 239


>UNIPROTKB|F1SVA2
            symbol:TINAGL1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005615 "extracellular space" evidence=IDA] [GO:0043236
            "laminin binding" evidence=IEA] [GO:0031012 "extracellular matrix"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044
            "scavenger receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005737
            GO:GO:0005615 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0031012 GO:GO:0005044 GeneTree:ENSGT00560000076599
            OMA:DNCNRCT EMBL:CU856262 Ensembl:ENSSSCT00000003995 Uniprot:F1SVA2
        Length = 467

 Score = 131 (51.2 bits), Expect = 8.7e-14, Sum P(2) = 8.7e-14
 Identities = 29/78 (37%), Positives = 45/78 (57%)

Query:   159 DQGQCGSCWAFSAVA-AVEGINKLKTGKLVS-LSEQELVDCDVNSENQGCNGGYMEKAFE 216
             DQG C   WAFS  A A + ++    G +   LS Q L+ CD +++ QGC GG ++ A+ 
Sbjct:   222 DQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHNQ-QGCQGGRLDGAWW 280

Query:   217 FITKIGGVTTEDDYPYRG 234
             F+ +  GV ++  YP+ G
Sbjct:   281 FLRR-RGVVSDHCYPFSG 297

 Score = 120 (47.3 bits), Expect = 8.7e-14, Sum P(2) = 8.7e-14
 Identities = 26/69 (37%), Positives = 35/69 (50%)

Query:   261 YAFQLYSHGVFDEYCGHQLNHGVTVVGYGED---HGE--KYWLVKNSWGTSWGEAGYIRM 315
             Y+    SHG  + Y  H   H V + G+GE+    G   KYW   NSWG  WGE G+ R+
Sbjct:   381 YSHTPVSHGRPERYRRHG-THSVKITGWGEETLPDGRMLKYWTAANSWGPGWGERGHFRI 439

Query:   316 ARNSPSSNI 324
              R +   +I
Sbjct:   440 VRGANECDI 448


>WB|WBGene00000788
            symbol:cpz-1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] [GO:0009792 "embryo development ending in
            birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0010171 "body morphogenesis" evidence=IMP]
            [GO:0018996 "molting cycle, collagen and cuticulin-based cuticle"
            evidence=IMP] [GO:0031012 "extracellular matrix" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0009792 GO:GO:0040007 GO:GO:0002119
            GO:GO:0018996 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0010171 GO:GO:0031012
            GeneTree:ENSGT00560000076599 KO:K08568 OMA:QCGTCTE EMBL:FO081275
            EMBL:BK001409 PIR:T29872 RefSeq:NP_491023.2 HSSP:Q9UBR2
            ProteinModelPortal:G5EGP8 SMR:G5EGP8 IntAct:G5EGP8 MEROPS:C01.A38
            EnsemblMetazoa:F32B5.8 GeneID:171829 KEGG:cel:CELE_F32B5.8
            CTD:171829 WormBase:F32B5.8 NextBio:872879 Uniprot:G5EGP8
        Length = 306

 Score = 144 (55.7 bits), Expect = 9.0e-14, Sum P(2) = 9.0e-14
 Identities = 27/71 (38%), Positives = 42/71 (59%)

Query:   257 IPARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDH--GEKYWLVKNSWGTSWGEAGYIR 314
             I A  AF+ Y+ G++ E     ++H ++V G+G DH  G +YW+ +NSWG  WGE G+ +
Sbjct:   219 IAATKAFETYAGGIYKEVTDEDIDHIISVHGWGVDHESGVEYWIGRNSWGEPWGEHGWFK 278

Query:   315 MARNSPSSNIG 325
             +   S   N G
Sbjct:   279 IV-TSQYKNAG 288

 Score = 98 (39.6 bits), Expect = 9.0e-14, Sum P(2) = 9.0e-14
 Identities = 32/107 (29%), Positives = 50/107 (46%)

Query:   142 LPASVDWRKEGAVTPVK-DQGQ-----CGSCWAFSAVAAV-EGIN-KLKTG-KLVSLSEQ 192
             LP + DWR    +     D+ Q     CGSCWAF A +A+ + IN K K       LS Q
Sbjct:    65 LPKTWDWRDANGINYASADRNQHIPQYCGSCWAFGATSALADRINIKRKNAWPQAYLSVQ 124

Query:   193 ELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRC 239
             E++DC  +       GG     +++  +  G+  E    Y+ ++ +C
Sbjct:   125 EVIDC--SGAGTCVMGGEPGGVYKYAHE-HGIPHETCNNYQARDGKC 168


>WB|WBGene00016306
            symbol:C32B5.13 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 EMBL:FO080745
            PIR:T25581 RefSeq:NP_493866.1 UniGene:Cel.15740 HSSP:P00785
            ProteinModelPortal:P91110 SMR:P91110 EnsemblMetazoa:C32B5.13
            GeneID:183116 KEGG:cel:CELE_C32B5.13 UCSC:C32B5.13 CTD:183116
            WormBase:C32B5.13 eggNOG:KOG1543 HOGENOM:HOG000115376
            InParanoid:P91110 NextBio:919978 Uniprot:P91110
        Length = 150

 Score = 109 (43.4 bits), Expect = 1.0e-13, Sum P(2) = 1.0e-13
 Identities = 26/67 (38%), Positives = 38/67 (56%)

Query:   180 KLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK-NDR 238
             K     ++S SEQ+++DC   +    C    +  + EFI K  GV TE DYPY GK N++
Sbjct:     4 KANNRTVLSFSEQQIIDC--GNFTSPCQENIL--SHEFIKK-NGVVTEADYPYVGKENEK 58

Query:   239 CQTDKTK 245
             C+ D+ K
Sbjct:    59 CKYDENK 65

 Score = 102 (41.0 bits), Expect = 1.0e-13, Sum P(2) = 1.0e-13
 Identities = 22/57 (38%), Positives = 33/57 (57%)

Query:   253 GYEAIPARYAFQLYSHGVFD---EYCGHQLN-HGVTVVGYGEDHGEKYWLVKNSWGT 305
             GY  + A  +F  Y  G++    E CG   +   +T+VGYG + G+ YW+VK S+GT
Sbjct:    94 GYFRMKAPPSFFNYKTGIYSPTQEECGKATDARSLTIVGYGIEGGQNYWIVKGSFGT 150


>RGD|708479
            symbol:Ctsz "cathepsin Z" species:10116 "Rattus norvegicus"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=TAS]
            [GO:0005615 "extracellular space" evidence=IEA;ISO] [GO:0005783
            "endoplasmic reticulum" evidence=IEA;ISO] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0060441 "epithelial tube branching involved in
            lung morphogenesis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:708479 GO:GO:0005576 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004197 MEROPS:C01.013 CTD:1522 HOVERGEN:HBG004456 KO:K08568
            EMBL:AB023781 EMBL:BC091110 IPI:IPI00207663 RefSeq:NP_899159.1
            UniGene:Rn.1475 ProteinModelPortal:Q9R1T3 SMR:Q9R1T3 PRIDE:Q9R1T3
            GeneID:252929 KEGG:rno:252929 BindingDB:Q9R1T3 NextBio:624097
            Genevestigator:Q9R1T3 Uniprot:Q9R1T3
        Length = 306

 Score = 135 (52.6 bits), Expect = 1.0e-13, Sum P(2) = 1.0e-13
 Identities = 23/62 (37%), Positives = 39/62 (62%)

Query:   266 YSHGVFDEYCGHQL-NHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
             Y+ G++ EY    + NH ++V G+G  + G +YW+V+NSWG  WGE G++R+  ++    
Sbjct:   227 YTGGIYTEYQNQAIINHIISVAGWGVSNDGIEYWIVRNSWGEPWGERGWMRIVTSTYKGG 286

Query:   324 IG 325
              G
Sbjct:   287 TG 288

 Score = 108 (43.1 bits), Expect = 1.0e-13, Sum P(2) = 1.0e-13
 Identities = 42/147 (28%), Positives = 65/147 (44%)

Query:   142 LPASVDWRKEGAV---TPVKDQG---QCGSCWAFSAVAAV-EGINKLKTGKLVS--LSEQ 192
             LP + DWR    V   +  ++Q     CGSCWA  + +A+ + IN  + G   S  LS Q
Sbjct:    64 LPKNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSALADRINIKRKGAWPSTLLSVQ 123

Query:   193 ELVDCDVNSENQG-CNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTI 251
              ++DC     N G C GG     +E+  K  G+  E    Y+ K+  C  DK  +   T 
Sbjct:   124 NVIDCG----NAGSCEGGNDLPVWEYAHK-HGIPDETCNNYQAKDQEC--DKF-NQCGTC 175

Query:   252 TGYEAIPARYAFQLYSHGVFDEYCGHQ 278
             T ++       + L+  G +    G +
Sbjct:   176 TEFKECHTIQNYTLWRVGDYGSLSGRE 202

 Score = 57 (25.1 bits), Expect = 1.9e-08, Sum P(2) = 1.9e-08
 Identities = 23/97 (23%), Positives = 38/97 (39%)

Query:   132 PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSE 191
             P W      G+P       +          QCG+C  F     ++     + G   SLS 
Sbjct:   141 PVWEYAHKHGIPDETCNNYQAKDQECDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSLSG 200

Query:   192 QELVDCDVNSENQGCNGGYMEKAFEFITKI-GGVTTE 227
             +E +  ++ + N   + G M  A E ++   GG+ TE
Sbjct:   201 REKMMAEIYA-NGPISCGIM--ATERMSNYTGGIYTE 234


>DICTYBASE|DDB_G0283921
            symbol:ctsB "cathepsin B precursor" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0283921 GO:GO:0005615
            GenomeReviews:CM000153_GR GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:AAFI02000058
            eggNOG:NOG315657 PANTHER:PTHR12411:SF16 OMA:CSLSCQS
            RefSeq:XP_638805.1 HSSP:P07688 MEROPS:C01.A59
            EnsemblProtists:DDB0233997 GeneID:8624329 KEGG:ddi:DDB_G0283921
            Uniprot:Q54QD9
        Length = 311

 Score = 142 (55.0 bits), Expect = 1.1e-13, Sum P(2) = 1.1e-13
 Identities = 37/122 (30%), Positives = 61/122 (50%)

Query:   119 ISTYLGYNKPYNEPRWPSVQY--LG--LPAS----VDWRKEGAVTPVKDQGQCGSCWAFS 170
             +   LG+ +  N P+     Y  LG  +P S     +W     ++ +++Q +CGSCWAF 
Sbjct:    52 VGQLLGFKRSPNRPKLQIKSYDPLGVQIPTSFNAQTNWPNCTTISQIQNQARCGSCWAFG 111

Query:   171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
             A  +      +   + V LS  ++V CD  ++N GC GG    A+ ++ K G V+ E+  
Sbjct:   112 ATESATDRLCIHNNENVQLSFMDMVTCD-ETDN-GCEGGDAFSAWNWLRKQGAVS-EECL 168

Query:   231 PY 232
             PY
Sbjct:   169 PY 170

 Score = 100 (40.3 bits), Expect = 1.1e-13, Sum P(2) = 1.1e-13
 Identities = 25/68 (36%), Positives = 32/68 (47%)

Query:   263 FQLYSHGVFDEYCGHQLN-HGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
             F  Y  GV+    G  L  H V +VG+G  +G  Y+   N W TSWG+ G   + R    
Sbjct:   242 FLAYKSGVYVHTTGKDLGGHCVKLVGFGTLNGVDYYAANNQWTTSWGDNGTFLIKR---- 297

Query:   322 SNIGICGI 329
                G CGI
Sbjct:   298 ---GDCGI 302


>ZFIN|ZDB-GENE-060503-240
            symbol:tinagl1 "tubulointerstitial nephritis
            antigen-like 1" species:7955 "Danio rerio" [GO:0005044 "scavenger
            receptor activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0030414 "peptidase inhibitor activity" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0002040 "sprouting
            angiogenesis" evidence=IMP] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR008037 InterPro:IPR013128 Pfam:PF00112 Pfam:PF05375
            PROSITE:PS00524 PROSITE:PS50958 SMART:SM00201 SMART:SM00645
            ZFIN:ZDB-GENE-060503-240 GO:GO:0006955 GO:GO:0030247 GO:GO:0030414
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 GO:GO:0002040
            GO:GO:0005044 GeneTree:ENSGT00560000076599 GO:GO:0010466
            SUPFAM:SSF57283 HOVERGEN:HBG053961 MEROPS:C01.975 OMA:DNCNRCT
            EMBL:BX950864 IPI:IPI00609339 UniGene:Dr.103937
            Ensembl:ENSDART00000087096 Ensembl:ENSDART00000126228
            InParanoid:Q1LUC6 Uniprot:Q1LUC6
        Length = 471

 Score = 134 (52.2 bits), Expect = 1.3e-13, Sum P(2) = 1.3e-13
 Identities = 34/91 (37%), Positives = 50/91 (54%)

Query:   145 SVD-WRKEGAVTPVKDQGQCGSCWAFSAVA-AVEGINKLKTGKLV-SLSEQELVDCDVNS 201
             +VD W   G +    DQG C + WAFS  A A + I+    G +   LS Q L+ CD   
Sbjct:   206 AVDKW--PGKIHEPLDQGNCNASWAFSTAAVASDRISIQSMGHMTPQLSPQNLISCDTRH 263

Query:   202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
             ++ GC GG ++ A+ F+ +  GV T+D YP+
Sbjct:   264 QD-GCAGGRIDGAWWFMRR-RGVVTQDCYPF 292

 Score = 115 (45.5 bits), Expect = 1.3e-13, Sum P(2) = 1.3e-13
 Identities = 22/50 (44%), Positives = 29/50 (58%)

Query:   273 EYCGHQLNHGVTVVGYGEDHG-----EKYWLVKNSWGTSWGEAGYIRMAR 317
             +Y  H   H V + G+GE+        KYW+  NSWG +WGE GY R+AR
Sbjct:   389 QYRKHA-THSVRITGWGEERDYSGRTRKYWIGANSWGKNWGEDGYFRIAR 437


>ZFIN|ZDB-GENE-041010-139
            symbol:ctsz "cathepsin Z" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0001525 "angiogenesis"
            evidence=IMP] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 ZFIN:ZDB-GENE-041010-139 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0001525
            CTD:1522 HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568
            OrthoDB:EOG42Z4QN UniGene:Dr.935 eggNOG:NOG275763 EMBL:BC083369
            IPI:IPI00483065 RefSeq:NP_001006043.1 ProteinModelPortal:Q5XJD4
            SMR:Q5XJD4 STRING:Q5XJD4 GeneID:450022 KEGG:dre:450022
            InParanoid:Q5XJD4 NextBio:20833005 ArrayExpress:Q5XJD4
            Uniprot:Q5XJD4
        Length = 301

 Score = 136 (52.9 bits), Expect = 2.3e-13, Sum P(2) = 2.3e-13
 Identities = 24/62 (38%), Positives = 40/62 (64%)

Query:   266 YSHGVFDEYCGHQ-LNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
             Y+ G++ EY     +NH V+V G+G D +G ++W+V+NSWG  WGE G++R+  ++    
Sbjct:   217 YTGGLYSEYVQEPYINHIVSVAGWGVDENGVEFWVVRNSWGEPWGEKGWLRIVTSAYKGG 276

Query:   324 IG 325
              G
Sbjct:   277 SG 278

 Score = 103 (41.3 bits), Expect = 2.3e-13, Sum P(2) = 2.3e-13
 Identities = 38/120 (31%), Positives = 57/120 (47%)

Query:   132 PR-WPSVQYLGLPASVDWRK-EGA--VTPVKDQG---QCGSCWAFSAVAAV-EGIN-KLK 182
             PR + S+    LP   DWR  +G   V+  ++Q     CGSCWA  + +A+ + IN K K
Sbjct:    43 PRPYESMNLKELPKEWDWRNIKGVNYVSTTRNQHIPQYCGSCWAHGSTSALADRINIKRK 102

Query:   183 TG-KLVSLSEQELVDC-DVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQ 240
                    LS Q ++DC D  S    C+GG     +E+     G+  E    Y+ K+  C+
Sbjct:   103 AAWPSAYLSVQNVIDCGDAGS----CSGGDHSGVWEYAHN-KGIPDETCNNYQAKDQDCK 157

 Score = 51 (23.0 bits), Expect = 5.5e-08, Sum P(2) = 5.5e-08
 Identities = 15/68 (22%), Positives = 23/68 (33%)

Query:   134 WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
             W      G+P       +      K   QCG+C  F     V+     K G   S S  +
Sbjct:   133 WEYAHNKGIPDETCNNYQAKDQDCKPFNQCGTCTTFGVCNIVKNFTLWKVGDYGSASGLD 192

Query:   194 LVDCDVNS 201
              +  ++ S
Sbjct:   193 KMKAEIYS 200


>DICTYBASE|DDB_G0292462
            symbol:DDB_G0292462 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0292462 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            EMBL:AAFI02000190 RefSeq:XP_629634.1 MEROPS:C01.A56
            EnsemblProtists:DDB0184413 GeneID:8628698 KEGG:ddi:DDB_G0292462
            InParanoid:Q54D62 OMA:NTQVESH Uniprot:Q54D62
        Length = 323

 Score = 122 (48.0 bits), Expect = 2.6e-13, Sum P(2) = 2.6e-13
 Identities = 45/136 (33%), Positives = 66/136 (48%)

Query:   142 LPASVDWRKE-G-AVTPVKDQGQCGSCWA--FSAVAAVEG-INKLKTGKLVSLSEQELVD 196
             +PAS D R   G  ++PV++Q  CGSCWA   S + A    I   K  K++ LS Q L+D
Sbjct:    46 IPASFDVRTNWGDCMSPVREQQSCGSCWAQVTSGILADRMCIESDKNIKML-LSPQYLMD 104

Query:   197 CD-------VNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDR-CQTDKTKHHA 248
             CD       V+  N GC GG++  A   +    G+ +++   Y+   D  C T       
Sbjct:   105 CDGSCVSDGVSGCNNGCKGGFVGLALTRLIN-EGIVSDECLSYQASKDSSCPTTCDDGSP 163

Query:   249 VTITG-YEAIPARYAF 263
             ++ T  Y+A   R AF
Sbjct:   164 ISNTTIYKATSCR-AF 178

 Score = 120 (47.3 bits), Expect = 2.6e-13, Sum P(2) = 2.6e-13
 Identities = 28/70 (40%), Positives = 37/70 (52%)

Query:   258 PARYAFQLYS----H--GVFDEYCGHQL-NHGVTVVGYGE-DHGEKYWLVKNSWGTSWGE 309
             P    F LYS    H   V+ +    Q+ +H V VVG+G    G  YW+  NSWGT WG+
Sbjct:   193 PVIATFMLYSDFKPHKWDVYIKSSNTQVESHAVRVVGWGTTSDGVDYWIAANSWGTGWGD 252

Query:   310 AGYIRMARNS 319
              GY ++ R S
Sbjct:   253 KGYFKIRRGS 262


>FB|FBgn0033873
            symbol:CG6337 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 EMBL:AE013599
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 HSSP:P80067 EMBL:AY084123
            RefSeq:NP_610905.1 UniGene:Dm.5230 SMR:Q7JYA0 IntAct:Q7JYA0
            EnsemblMetazoa:FBtr0087646 GeneID:36530 KEGG:dme:Dmel_CG6337
            UCSC:CG6337-RA FlyBase:FBgn0033873 eggNOG:NOG310593
            InParanoid:Q7JYA0 OMA:NRTTYRE OrthoDB:EOG4MCVFZ GenomeRNAi:36530
            NextBio:799041 Uniprot:Q7JYA0
        Length = 340

 Score = 128 (50.1 bits), Expect = 4.3e-13, Sum P(2) = 4.3e-13
 Identities = 59/234 (25%), Positives = 99/234 (42%)

Query:    37 VLGIPAGAWSEGYPQKY-DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN--VQYI 93
             +L I +G W+  + Q   D Q+ E+ F    K Y+    S   +   + IY+ N   Q+ 
Sbjct:    10 LLTIDSG-WAFNHGQDLVDFQTYEDNFN---KTYAST--SARNFANYYFIYNRNQVAQHN 63

Query:    94 DYINSQNLSFKLTDNKFADLSNEEFISTY-LGYNKPYNEPRWPSVQYLGLPASVDWRKE- 151
                +    +++   N+F+D+   +F +      N   +    P        AS D   + 
Sbjct:    64 AQADRNRTTYREAVNQFSDIRLIQFAALLPKAVNTVTSAASDPPASQAA-SASFDIITDF 122

Query:   152 GAVTPVKDQG-QCGSCWAFSAVAAVEGINKLKTGKLV--SLSEQELVDCDVNSENQGCNG 208
             G    V+DQG  C S WA++   AVE +N ++T   +  SLS Q+L+DC       GC+ 
Sbjct:   123 GLTVAVEDQGVNCSSSWAYATAKAVEIMNAVQTANPLPSSLSAQQLLDCA--GMGTGCST 180

Query:   209 GYMEKAFEFITKIGG--VTTEDDYPYRGK---NDRCQTDKTKHHAVTITGYEAI 257
                  A  ++T++    +  E DYP          CQ   +    V + GY  +
Sbjct:   181 QTPLAALNYLTQLTDAYLYPEVDYPNNNSLKTPGMCQPPSSVSVGVKLAGYSTV 234

 Score = 112 (44.5 bits), Expect = 4.3e-13, Sum P(2) = 4.3e-13
 Identities = 27/68 (39%), Positives = 34/68 (50%)

Query:   258 PARYAFQLYSHGVFDEYCGHQLN----HGVTVVGYGEDHGEK--YWLVKNSWGTSWGEAG 311
             PA + F  YS GV+ +      N      + VVGY  D      YW   NS+G +WGE G
Sbjct:   258 PATFGFMQYSSGVYVQETRALTNPKSSQFLVVVGYDHDVDSNLDYWRCLNSFGDTWGEEG 317

Query:   312 YIRMARNS 319
             YIR+ R S
Sbjct:   318 YIRIVRRS 325


>UNIPROTKB|E2QXH3
            symbol:TINAGL1 "Uncharacterized protein" species:9615
            "Canis lupus familiaris" [GO:0043236 "laminin binding"
            evidence=IEA] [GO:0031012 "extracellular matrix" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006955 "immune response" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044 "scavenger
            receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005737
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639 GO:GO:0031012
            GO:GO:0005044 GeneTree:ENSGT00560000076599 CTD:64129 OMA:DNCNRCT
            EMBL:AAEX03001668 RefSeq:XP_535330.3 Ensembl:ENSCAFT00000035659
            GeneID:478155 KEGG:cfa:478155 NextBio:20853523 Uniprot:E2QXH3
        Length = 467

 Score = 131 (51.2 bits), Expect = 5.9e-13, Sum P(2) = 5.9e-13
 Identities = 29/79 (36%), Positives = 46/79 (58%)

Query:   159 DQGQCGSCWAFSAVA-AVEGINKLKTGKLVS-LSEQELVDCDVNSENQGCNGGYMEKAFE 216
             DQG C   WAFS  A A + ++    G +   LS Q L+ CD +++ QGC GG ++ A+ 
Sbjct:   222 DQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHNQ-QGCRGGRLDGAWW 280

Query:   217 FITKIGGVTTEDDYPYRGK 235
             F+ +  GV ++  YP+ G+
Sbjct:   281 FLRR-RGVVSDHCYPFVGR 298

 Score = 112 (44.5 bits), Expect = 5.9e-13, Sum P(2) = 5.9e-13
 Identities = 27/74 (36%), Positives = 37/74 (50%)

Query:   261 YAFQLYSH-----GVFDEYCGHQLNHGVTVVGYGED---HGE--KYWLVKNSWGTSWGEA 310
             Y   +YSH     G  + Y  H   H V + G+GE+    G   KYW   NSWG +WGE 
Sbjct:   376 YQGGIYSHTPVSLGRPERYRRHG-THSVKITGWGEETLPDGRTLKYWTAANSWGPAWGER 434

Query:   311 GYIRMARNSPSSNI 324
             G+ R+ R +   +I
Sbjct:   435 GHFRIVRGANECDI 448


>UNIPROTKB|P07858
            symbol:CTSB "Cathepsin B" species:9606 "Homo sapiens"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0005739 "mitochondrion" evidence=IEA] [GO:0042470 "melanosome"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042981 "regulation of apoptotic process" evidence=TAS]
            [GO:0006508 "proteolysis" evidence=IDA] [GO:0005764 "lysosome"
            evidence=IDA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEP] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IDA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IDA] [GO:0005622 "intracellular" evidence=TAS]
            [GO:0036021 "endolysosome lumen" evidence=TAS] [GO:0045087 "innate
            immune response" evidence=TAS] [GO:0008233 "peptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0005730 "nucleolus" evidence=IDA] [GO:0043231 "intracellular
            membrane-bounded organelle" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_118779 GO:GO:0005739
            GO:GO:0042470 GO:GO:0048471 Reactome:REACT_6900 GO:GO:0005615
            GO:GO:0009612 GO:GO:0009611 GO:GO:0005730 GO:GO:0042981
            GO:GO:0009897 GO:GO:0045471 GO:GO:0016324 GO:GO:0009749
            GO:GO:0006914 GO:GO:0043434 eggNOG:COG4870 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087
            GO:GO:0050790 GO:GO:0042383 GO:GO:0014070 GO:GO:0042277
            GO:GO:0060548 GO:GO:0005901 GO:GO:0014075 GO:GO:0004197
            GO:GO:0070670 EMBL:CH471157 GO:GO:0007519 MEROPS:C01.060 CTD:1508
            HOGENOM:HOG000241341 HOVERGEN:HBG003480 KO:K01363 OMA:GYPSGAW
            OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 GO:GO:0097067
            PANTHER:PTHR12411:SF16 EMBL:M14221 EMBL:L16510 EMBL:AK092070
            EMBL:AK075393 EMBL:BC010240 EMBL:BC095408 EMBL:M13230
            IPI:IPI00295741 PIR:A26498 RefSeq:NP_001899.1 RefSeq:NP_680090.1
            RefSeq:NP_680091.1 RefSeq:NP_680092.1 RefSeq:NP_680093.1
            UniGene:Hs.520898 PDB:1CSB PDB:1GMY PDB:1HUC PDB:1PBH PDB:2IPP
            PDB:2PBH PDB:3AI8 PDB:3CBJ PDB:3CBK PDB:3K9M PDB:3PBH PDBsum:1CSB
            PDBsum:1GMY PDBsum:1HUC PDBsum:1PBH PDBsum:2IPP PDBsum:2PBH
            PDBsum:3AI8 PDBsum:3CBJ PDBsum:3CBK PDBsum:3K9M PDBsum:3PBH
            ProteinModelPortal:P07858 SMR:P07858 DIP:DIP-42785N IntAct:P07858
            MINT:MINT-1397666 STRING:P07858 PhosphoSite:P07858 DMDM:68067549
            SWISS-2DPAGE:P07858 UCD-2DPAGE:P07858 PaxDb:P07858
            PeptideAtlas:P07858 PRIDE:P07858 DNASU:1508 Ensembl:ENST00000345125
            Ensembl:ENST00000353047 Ensembl:ENST00000434271
            Ensembl:ENST00000453527 Ensembl:ENST00000530640
            Ensembl:ENST00000531089 Ensembl:ENST00000533455
            Ensembl:ENST00000534510 GeneID:1508 KEGG:hsa:1508 UCSC:uc003wum.3
            GeneCards:GC08M011700 H-InvDB:HIX0007320 HGNC:HGNC:2527
            HPA:CAB000457 HPA:HPA018156 MIM:116810 neXtProt:NX_P07858
            PharmGKB:PA27027 InParanoid:P07858 PhylomeDB:P07858
            BindingDB:P07858 ChEMBL:CHEMBL4072 ChiTaRS:CTSB
            EvolutionaryTrace:P07858 GenomeRNAi:1508 NextBio:6235
            PMAP-CutDB:P07858 ArrayExpress:P07858 Bgee:P07858 CleanEx:HS_CTSB
            Genevestigator:P07858 GermOnline:ENSG00000164733 GO:GO:0036021
            Uniprot:P07858
        Length = 339

 Score = 192 (72.6 bits), Expect = 6.1e-13, P = 6.1e-13
 Identities = 50/143 (34%), Positives = 78/143 (54%)

Query:    91 QYIDYINSQNLSFKLTDNKF-ADLSN-EEFISTYLGYNKPYNEPRWPSVQYLGLPASVDW 148
             + ++Y+N +N +++   N +  D+S  +    T+LG  KP    R    + L LPAS D 
Sbjct:    29 ELVNYVNKRNTTWQAGHNFYNVDMSYLKRLCGTFLGGPKP--PQRVMFTEDLKLPASFDA 86

Query:   149 RKEGAVTP----VKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL--SEQELVDCDVNSE 202
             R++    P    ++DQG CGSCWAF AV A+     + T   VS+  S ++L+ C  +  
Sbjct:    87 REQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMC 146

Query:   203 NQGCNGGYMEKAFEFITKIGGVT 225
               GCNGGY  +A+ F T+ G V+
Sbjct:   147 GDGCNGGYPAEAWNFWTRKGLVS 169

 Score = 144 (55.7 bits), Expect = 2.5e-07, P = 2.5e-07
 Identities = 39/134 (29%), Positives = 57/134 (42%)

Query:   199 VNSENQGCNG-GYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI 257
             VN     C G G   K  +        T + D  Y G N    ++  K     I  Y+  
Sbjct:   191 VNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHY-GYNSYSVSNSEKDIMAEI--YKNG 247

Query:   258 PARYAFQLYSH------GVFDEYCGHQLN-HGVTVVGYGEDHGEKYWLVKNSWGTSWGEA 310
             P   AF +YS       GV+    G  +  H + ++G+G ++G  YWLV NSW T WG+ 
Sbjct:   248 PVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDN 307

Query:   311 GYIRMARNSPSSNI 324
             G+ ++ R      I
Sbjct:   308 GFFKILRGQDHCGI 321


>UNIPROTKB|E1B9H1
            symbol:TINAGL1 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0043236 "laminin binding" evidence=IEA] [GO:0031012
            "extracellular matrix" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0005044 "scavenger receptor
            activity" evidence=IEA] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0005737 GO:GO:0006955
            GO:GO:0030247 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00639 GO:GO:0031012 GO:GO:0005044
            GeneTree:ENSGT00560000076599 OMA:DNCNRCT EMBL:DAAA02006255
            IPI:IPI00732137 Ensembl:ENSBTAT00000038022 Uniprot:E1B9H1
        Length = 469

 Score = 129 (50.5 bits), Expect = 8.0e-13, Sum P(2) = 8.0e-13
 Identities = 29/78 (37%), Positives = 45/78 (57%)

Query:   159 DQGQCGSCWAFSAVA-AVEGINKLKTGKLVS-LSEQELVDCDVNSENQGCNGGYMEKAFE 216
             DQG C   WAFS  A A + ++    G +   LS Q L+ CD +++ QGC GG ++ A+ 
Sbjct:   224 DQGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQNLLSCDTHNQ-QGCRGGRLDGAWW 282

Query:   217 FITKIGGVTTEDDYPYRG 234
             F+ +  GV ++  YP+ G
Sbjct:   283 FLRR-RGVVSDHCYPFSG 299

 Score = 113 (44.8 bits), Expect = 8.0e-13, Sum P(2) = 8.0e-13
 Identities = 27/74 (36%), Positives = 37/74 (50%)

Query:   261 YAFQLYSH-----GVFDEYCGHQLNHGVTVVGYGED---HGE--KYWLVKNSWGTSWGEA 310
             Y   +YSH     G  + Y  H   H V + G+GE+    G   KYW   NSWG +WGE 
Sbjct:   378 YQSGIYSHTPVSLGRPERYRRHG-THSVKITGWGEETLPDGRTIKYWTAANSWGPAWGER 436

Query:   311 GYIRMARNSPSSNI 324
             G+ R+ R +   +I
Sbjct:   437 GHFRIVRGANECDI 450


>RGD|70956
            symbol:Tinagl1 "tubulointerstitial nephritis antigen-like 1"
           species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
           activity" evidence=IEA] [GO:0005576 "extracellular region"
           evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA;ISO] [GO:0006508
           "proteolysis" evidence=IEA] [GO:0006955 "immune response"
           evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
           evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
           [GO:0031012 "extracellular matrix" evidence=IEA;ISO] [GO:0043236
           "laminin binding" evidence=IEA;ISO] InterPro:IPR000668
           InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
           PROSITE:PS50958 SMART:SM00201 SMART:SM00645 RGD:70956 GO:GO:0005737
           GO:GO:0005576 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
           GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
           GO:GO:0031012 GO:GO:0005044 eggNOG:NOG310046 HOGENOM:HOG000241342
           HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OrthoDB:EOG4BG8W0
           EMBL:AB050717 IPI:IPI00190428 RefSeq:NP_446034.1 UniGene:Rn.1256
           ProteinModelPortal:Q9EQT5 PRIDE:Q9EQT5 GeneID:94174 KEGG:rno:94174
           UCSC:RGD:70956 InParanoid:Q9EQT5 NextBio:617830 ArrayExpress:Q9EQT5
           Genevestigator:Q9EQT5 GermOnline:ENSRNOG00000013179 Uniprot:Q9EQT5
        Length = 467

 Score = 133 (51.9 bits), Expect = 9.0e-13, Sum P(2) = 9.0e-13
 Identities = 36/127 (28%), Positives = 59/127 (46%)

Query:   120 STYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA-AVEGI 178
             S+ +  N+ Y       V      AS  W     +    DQG C   WAFS  A A + +
Sbjct:   184 SSVMNMNEIYTVLGQGEVLPTAFEASEKW--PNLIHEPLDQGNCAGSWAFSTAAVASDRV 241

Query:   179 NKLKTGKLVS-LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKND 237
             +    G +   LS Q L+ CD + + +GC GG ++ A+ F+ +  GV +++ YP+ G+  
Sbjct:   242 SIHSLGHMTPILSPQNLLSCDTHHQ-KGCRGGRLDGAWWFLRR-RGVVSDNCYPFSGREQ 299

Query:   238 RCQTDKT 244
               +   T
Sbjct:   300 NDEASPT 306

 Score = 108 (43.1 bits), Expect = 9.0e-13, Sum P(2) = 9.0e-13
 Identities = 24/62 (38%), Positives = 32/62 (51%)

Query:   261 YAFQLYSHGVFDEYCGHQLNHGVTVVGYGED---HGE--KYWLVKNSWGTSWGEAGYIRM 315
             Y+    S G  ++Y  H   H V + G+GE+    G   KYW   NSWG  WGE G+ R+
Sbjct:   381 YSHTPVSQGRPEQYRRHG-THSVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRI 439

Query:   316 AR 317
              R
Sbjct:   440 VR 441


>UNIPROTKB|Q9EQT5
            symbol:Tinagl1 "Tubulointerstitial nephritis antigen-like"
            species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 RGD:70956 GO:GO:0005737
            GO:GO:0005576 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0031012 GO:GO:0005044 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OrthoDB:EOG4BG8W0
            EMBL:AB050717 IPI:IPI00190428 RefSeq:NP_446034.1 UniGene:Rn.1256
            ProteinModelPortal:Q9EQT5 PRIDE:Q9EQT5 GeneID:94174 KEGG:rno:94174
            UCSC:RGD:70956 InParanoid:Q9EQT5 NextBio:617830 ArrayExpress:Q9EQT5
            Genevestigator:Q9EQT5 GermOnline:ENSRNOG00000013179 Uniprot:Q9EQT5
        Length = 467

 Score = 133 (51.9 bits), Expect = 9.0e-13, Sum P(2) = 9.0e-13
 Identities = 36/127 (28%), Positives = 59/127 (46%)

Query:   120 STYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA-AVEGI 178
             S+ +  N+ Y       V      AS  W     +    DQG C   WAFS  A A + +
Sbjct:   184 SSVMNMNEIYTVLGQGEVLPTAFEASEKW--PNLIHEPLDQGNCAGSWAFSTAAVASDRV 241

Query:   179 NKLKTGKLVS-LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKND 237
             +    G +   LS Q L+ CD + + +GC GG ++ A+ F+ +  GV +++ YP+ G+  
Sbjct:   242 SIHSLGHMTPILSPQNLLSCDTHHQ-KGCRGGRLDGAWWFLRR-RGVVSDNCYPFSGREQ 299

Query:   238 RCQTDKT 244
               +   T
Sbjct:   300 NDEASPT 306

 Score = 108 (43.1 bits), Expect = 9.0e-13, Sum P(2) = 9.0e-13
 Identities = 24/62 (38%), Positives = 32/62 (51%)

Query:   261 YAFQLYSHGVFDEYCGHQLNHGVTVVGYGED---HGE--KYWLVKNSWGTSWGEAGYIRM 315
             Y+    S G  ++Y  H   H V + G+GE+    G   KYW   NSWG  WGE G+ R+
Sbjct:   381 YSHTPVSQGRPEQYRRHG-THSVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRI 439

Query:   316 AR 317
              R
Sbjct:   440 VR 441


>UNIPROTKB|Q9GZM7
            symbol:TINAGL1 "Tubulointerstitial nephritis antigen-like"
            species:9606 "Homo sapiens" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=IEA] [GO:0043236 "laminin binding" evidence=IEA]
            [GO:0016197 "endosomal transport" evidence=TAS] [GO:0005201
            "extracellular matrix structural constituent" evidence=NAS]
            [GO:0005576 "extracellular region" evidence=NAS] [GO:0031012
            "extracellular matrix" evidence=IDA] [GO:0005615 "extracellular
            space" evidence=ISS] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0005737 GO:GO:0005615
            GO:GO:0006955 GO:GO:0030247 EMBL:CH471059 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0016197 EMBL:AC114488 GO:GO:0005044 GO:GO:0005201
            eggNOG:NOG310046 HOGENOM:HOG000241342 HOVERGEN:HBG053961
            EMBL:AF236155 EMBL:AF236151 EMBL:AF236152 EMBL:AF236153
            EMBL:AF236154 EMBL:AF236150 EMBL:AF205436 EMBL:AB050716
            EMBL:AB050719 EMBL:AK074124 EMBL:AY358421 EMBL:AF289569
            EMBL:AK027839 EMBL:AK292770 EMBL:AK298382 EMBL:AK075398
            EMBL:BC009048 EMBL:BC064633 IPI:IPI00005563 IPI:IPI00439435
            IPI:IPI00910801 RefSeq:NP_001191343.1 RefSeq:NP_001191344.1
            RefSeq:NP_071447.1 UniGene:Hs.199368 ProteinModelPortal:Q9GZM7
            SMR:Q9GZM7 IntAct:Q9GZM7 MINT:MINT-253718 STRING:Q9GZM7
            MEROPS:C01.975 PhosphoSite:Q9GZM7 DMDM:61213628 PaxDb:Q9GZM7
            PRIDE:Q9GZM7 Ensembl:ENST00000271064 Ensembl:ENST00000457433
            GeneID:64129 KEGG:hsa:64129 UCSC:uc001bta.3 CTD:64129
            GeneCards:GC01P032042 HGNC:HGNC:19168 HPA:HPA048695
            neXtProt:NX_Q9GZM7 PharmGKB:PA38810 InParanoid:Q9GZM7 OMA:DNCNRCT
            OrthoDB:EOG4BG8W0 PhylomeDB:Q9GZM7 ChiTaRS:TINAGL1 GenomeRNAi:64129
            NextBio:66016 ArrayExpress:Q9GZM7 Bgee:Q9GZM7 CleanEx:HS_TINAGL1
            Genevestigator:Q9GZM7 GermOnline:ENSG00000142910 Uniprot:Q9GZM7
        Length = 467

 Score = 132 (51.5 bits), Expect = 9.3e-13, Sum P(2) = 9.3e-13
 Identities = 29/79 (36%), Positives = 45/79 (56%)

Query:   159 DQGQCGSCWAFSAVA-AVEGINKLKTGKLVS-LSEQELVDCDVNSENQGCNGGYMEKAFE 216
             DQG C   WAFS  A A + ++    G +   LS Q L+ CD + + QGC GG ++ A+ 
Sbjct:   222 DQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQQ-QGCRGGRLDGAWW 280

Query:   217 FITKIGGVTTEDDYPYRGK 235
             F+ +  GV ++  YP+ G+
Sbjct:   281 FLRR-RGVVSDHCYPFSGR 298

 Score = 109 (43.4 bits), Expect = 9.3e-13, Sum P(2) = 9.3e-13
 Identities = 26/67 (38%), Positives = 34/67 (50%)

Query:   261 YAFQLYSH-----GVFDEYCGHQLNHGVTVVGYGED---HGE--KYWLVKNSWGTSWGEA 310
             Y   +YSH     G  + Y  H   H V + G+GE+    G   KYW   NSWG +WGE 
Sbjct:   376 YKGGIYSHTPVSLGRPERYRRHG-THSVKITGWGEETLPDGRTLKYWTAANSWGPAWGER 434

Query:   311 GYIRMAR 317
             G+ R+ R
Sbjct:   435 GHFRIVR 441


>UNIPROTKB|F1RKR7
            symbol:CTSH "Cathepsin H light chain" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] InterPro:IPR013128 GO:GO:0008234 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            GeneTree:ENSGT00660000095458 EMBL:CU326382
            Ensembl:ENSSSCT00000001985 ArrayExpress:F1RKR7 Uniprot:F1RKR7
        Length = 197

 Score = 173 (66.0 bits), Expect = 1.1e-12, P = 1.1e-12
 Identities = 53/161 (32%), Positives = 88/161 (54%)

Query:    28 AVLSLFLL--WVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGI 85
             AVLSL     W+LG PA   S      ++    +  F++W+ Q+ ++Y  E E+  R  +
Sbjct:     3 AVLSLLCAGAWLLGPPACGASNLAVSSFE----KLHFKSWMVQHQKKYSLE-EYHHRLQV 57

Query:    86 YSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYN--EPRWPSVQYLG-L 142
             + SN + I+  N+ N +FKL  N+F+D+S +E    YL +++P N    +   ++  G  
Sbjct:    58 FVSNWRKINAHNAGNHTFKLGLNQFSDMSFDEIRHKYL-WSEPQNCSATKGNYLRGTGPY 116

Query:   143 PASVDWRKEGA-VTPVKDQGQCGSCWAF---SAVAAVEGIN 179
             P S+DWRK+G  V+PVK+Q    S W     S + A +G++
Sbjct:   117 PPSMDWRKKGNFVSPVKNQNS--SWWTAPRTSTITAAKGVS 155


>MGI|MGI:1891190
            symbol:Ctsz "cathepsin Z" species:10090 "Mus musculus"
            [GO:0005615 "extracellular space" evidence=ISO] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1891190 GO:GO:0005783 GO:GO:0005615 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0060441 GeneTree:ENSGT00560000076599 MEROPS:C01.013 CTD:1522
            HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN OMA:QCGTCTE
            ChiTaRS:CTSZ EMBL:AJ242663 EMBL:AF136277 EMBL:AF136278
            EMBL:BC008619 IPI:IPI00986833 RefSeq:NP_071720.1 UniGene:Mm.156919
            ProteinModelPortal:Q9WUU7 SMR:Q9WUU7 IntAct:Q9WUU7 STRING:Q9WUU7
            PaxDb:Q9WUU7 PRIDE:Q9WUU7 Ensembl:ENSMUST00000016400 GeneID:64138
            KEGG:mmu:64138 InParanoid:Q9WUU7 NextBio:319927 Bgee:Q9WUU7
            CleanEx:MM_CTSZ Genevestigator:Q9WUU7 GermOnline:ENSMUSG00000016256
            Uniprot:Q9WUU7
        Length = 306

 Score = 128 (50.1 bits), Expect = 1.1e-12, Sum P(2) = 1.1e-12
 Identities = 22/62 (35%), Positives = 39/62 (62%)

Query:   266 YSHGVFDEYCGHQ-LNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
             Y+ G++ E+     +NH ++V G+G  + G +YW+V+NSWG  WGE G++R+  ++    
Sbjct:   227 YTGGIYAEHQDQAVINHIISVAGWGVSNDGIEYWIVRNSWGEPWGEKGWMRIVTSTYKGG 286

Query:   324 IG 325
              G
Sbjct:   287 TG 288

 Score = 106 (42.4 bits), Expect = 1.1e-12, Sum P(2) = 1.1e-12
 Identities = 42/147 (28%), Positives = 65/147 (44%)

Query:   142 LPASVDWRKEGAV---TPVKDQG---QCGSCWAFSAVAAV-EGINKLKTGKLVS--LSEQ 192
             LP + DWR    V   +  ++Q     CGSCWA  + +A+ + IN  + G   S  LS Q
Sbjct:    64 LPKNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSILLSVQ 123

Query:   193 ELVDCDVNSENQG-CNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTI 251
              ++DC     N G C GG     +E+  K  G+  E    Y+ K+  C  DK  +   T 
Sbjct:   124 NVIDCG----NAGSCEGGNDLPVWEYAHK-HGIPDETCNNYQAKDQDC--DKF-NQCGTC 175

Query:   252 TGYEAIPARYAFQLYSHGVFDEYCGHQ 278
             T ++       + L+  G +    G +
Sbjct:   176 TEFKECHTIQNYTLWRVGDYGSLSGRE 202

 Score = 55 (24.4 bits), Expect = 2.1e-07, Sum P(2) = 2.1e-07
 Identities = 22/97 (22%), Positives = 37/97 (38%)

Query:   132 PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSE 191
             P W      G+P       +          QCG+C  F     ++     + G   SLS 
Sbjct:   141 PVWEYAHKHGIPDETCNNYQAKDQDCDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSLSG 200

Query:   192 QELVDCDVNSENQGCNGGYMEKAFEFITKI-GGVTTE 227
             +E +  ++ + N   + G M  A E ++   GG+  E
Sbjct:   201 REKMMAEIYA-NGPISCGIM--ATEMMSNYTGGIYAE 234


>UNIPROTKB|E1BTI7
            symbol:TINAG "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0005044 "scavenger receptor activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0006955 "immune
            response" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0030247 "polysaccharide binding"
            evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
            [GO:0007155 "cell adhesion" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GO:GO:0007155 GO:GO:0005604 GO:GO:0005044
            GeneTree:ENSGT00560000076599 CTD:27283 OMA:WGQLTSS
            EMBL:AADN02002720 EMBL:AADN02002721 IPI:IPI00581566
            RefSeq:XP_419905.3 UniGene:Gga.11215 Ensembl:ENSGALT00000026295
            GeneID:421888 KEGG:gga:421888 Uniprot:E1BTI7
        Length = 467

 Score = 125 (49.1 bits), Expect = 3.6e-12, Sum P(2) = 3.6e-12
 Identities = 29/75 (38%), Positives = 44/75 (58%)

Query:   159 DQGQCGSCWAFS-AVAAVEGINKLKTGKLV-SLSEQELVDCDVNSENQGCNGGYMEKAFE 216
             DQ  CG+ WAFS A  A + I     G++  +LS Q L+ CD  ++ +GCNGG ++ A+ 
Sbjct:   241 DQRNCGASWAFSTASVAADRITIHSDGQITDNLSVQNLISCDTGNQ-RGCNGGSIDGAWR 299

Query:   217 FITKIGGVTTEDDYP 231
             ++T   GV +   YP
Sbjct:   300 YLTT-HGVVSYACYP 313

 Score = 111 (44.1 bits), Expect = 3.6e-12, Sum P(2) = 3.6e-12
 Identities = 28/79 (35%), Positives = 40/79 (50%)

Query:   255 EAIPARYA-FQLYSHGVF-DEY-CGHQLN-HGVTVVGYGEDHG-----EKYWLVKNSWGT 305
             +AI   Y  F LY  G++   Y  G +   H V ++G+G   G     +K+W+  NSWG 
Sbjct:   380 QAIMKVYEDFFLYKEGIYRHSYKAGSKWKTHSVKLLGWGSLPGKNGQKQKFWIAANSWGK 439

Query:   306 SWGEAGYIRMARNSPSSNI 324
              WGE GY R+ R     +I
Sbjct:   440 YWGENGYFRILRGQNECDI 458

 Score = 37 (18.1 bits), Expect = 0.00015, Sum P(2) = 0.00015
 Identities = 6/18 (33%), Positives = 12/18 (66%)

Query:   243 KTKHHAVTITGYEAIPAR 260
             K K H+V + G+ ++P +
Sbjct:   406 KWKTHSVKLLGWGSLPGK 423


>UNIPROTKB|H0YE42
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 PANTHER:PTHR12411 PROSITE:PS00139 EMBL:AP002748
            HGNC:HGNC:2531 ChiTaRS:CTSF Ensembl:ENST00000525733 Uniprot:H0YE42
        Length = 82

 Score = 165 (63.1 bits), Expect = 8.4e-12, P = 8.4e-12
 Identities = 40/78 (51%), Positives = 48/78 (61%)

Query:   118 FISTYLGYNKPYNEPRWP-SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVE 176
             +++T L   +P N+ +   SV  L  P   DWR +GAVT VKDQG CGSCWAFS    VE
Sbjct:     5 YLNTLLR-KEPGNKMKQAKSVGDLA-PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVE 62

Query:   177 GINKLKTGKLVSLSEQEL 194
             G   L  G L+SLSEQ L
Sbjct:    63 GQWFLNQGTLLSLSEQAL 80


>UNIPROTKB|E2R6Q7
            symbol:CTSB "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730 "nucleolus"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790
            GO:GO:0005764 GO:GO:0004197 CTD:1508 GeneTree:ENSGT00560000076599
            KO:K01363 OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16
            EMBL:AAEX03014318 RefSeq:XP_543203.3 Ensembl:ENSCAFT00000012692
            GeneID:486077 KEGG:cfa:486077 NextBio:20859923 Uniprot:E2R6Q7
        Length = 339

 Score = 179 (68.1 bits), Expect = 2.2e-11, P = 2.2e-11
 Identities = 51/154 (33%), Positives = 79/154 (51%)

Query:    80 QRRFGIYSSNVQYIDYINSQNLSFKLTDNKF-ADLSN-EEFISTYLGYNKPYNEPRWPSV 137
             Q R    + + + +DY+N +N ++K   N    D S       T+LG   P    R    
Sbjct:    18 QSRLPFRALSDELVDYVNKRNTTWKAGHNFHNVDPSYLRRLCGTFLG--GPKLPQRVQFA 75

Query:   138 QYLGLPASVDWRKEGAVTP----VKDQGQCGSCWAFSAVAAVEGINKLKT-GKL-VSLSE 191
             + L LP S D R++    P    ++DQG CGSCWAF AV A+     ++T G + V +S 
Sbjct:    76 KNLILPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEVSA 135

Query:   192 QELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
             ++++ C  +    GCNGG+  +A+ F TK G V+
Sbjct:   136 EDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVS 169

 Score = 141 (54.7 bits), Expect = 5.5e-07, P = 5.5e-07
 Identities = 39/125 (31%), Positives = 56/125 (44%)

Query:   199 VNSENQGCNG-GYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTI--TG-Y 254
             VN     C G G   K  +        + ++D  Y G +    +D  K     I   G  
Sbjct:   191 VNGSRPPCTGEGDTPKCSKICEPGYSPSYKEDKHY-GCSSYSVSDNEKEIMAEIYKNGPV 249

Query:   255 EAIPARYA-FQLYSHGVFDEYCGHQLN-HGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGY 312
             EA    Y+ F LY  GV+    G  +  H V ++G+G + G  YWLV NSW T WG+ G+
Sbjct:   250 EAAFTVYSDFLLYKSGVYQHVTGEMMGGHAVRILGWGVEDGTPYWLVGNSWNTDWGDNGF 309

Query:   313 IRMAR 317
              ++ R
Sbjct:   310 FKILR 314
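
Each alignment in this report opens with a header of the form "Score = N (B bits), Expect = E, Sum P(n) = P" or, for a single ungrouped HSP, "Score = N (B bits), Expect = E, P = P". A minimal sketch of pulling those fields out of such a line, assuming the layout seen in this report; the names HSP_HEADER and parse_hsp_header are illustrative and not part of any BLAST tool:

```python
import re

# Pattern for the HSP header lines as they appear in this report.
# The trailing field is either "Sum P(n) = ..." or plain "P = ...".
HSP_HEADER = re.compile(
    r"Score = (?P<score>\d+) \((?P<bits>[\d.]+) bits\), "
    r"Expect = (?P<expect>[\d.eE+-]+), "
    r"(?:Sum P\((?P<n>\d+)\)|P) = (?P<p>[\d.eE+-]+)"
)


def parse_hsp_header(line):
    """Return the header fields as a dict, or None if the line does not match."""
    m = HSP_HEADER.search(line)
    if m is None:
        return None
    return {
        "score": int(m.group("score")),
        "bits": float(m.group("bits")),
        "expect": float(m.group("expect")),
        # A plain "P =" field is treated here as a group of one HSP.
        "group_size": int(m.group("n")) if m.group("n") else 1,
        "p": float(m.group("p")),
    }


grouped = parse_hsp_header(
    " Score = 136 (52.9 bits), Expect = 2.3e-13, Sum P(2) = 2.3e-13"
)
single = parse_hsp_header(
    " Score = 192 (72.6 bits), Expect = 6.1e-13, P = 6.1e-13"
)
print(grouped)
print(single)
```

Lines that are not HSP headers (for example the "Identities = ..." line) return None, so the function can be applied to every line of the report as a filter.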

WARNING:  HSPs involving 50 database sequences were not reported due to the
          limiting value of parameter B = 250.


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.318   0.135   0.430    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      340       340   0.00095  116 3  11 22  0.41    34
                                                     33  0.45    37
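
The bit scores printed in parentheses beside each raw score in this report appear consistent with the Lambda and K values on the Q=9,R=2 line above (0.244 and 0.0300), using the standard Karlin-Altschul conversion bits = (Lambda * S - ln K) / ln 2. A minimal sketch under that assumption; the function name bit_score is illustrative and not part of the BLAST output:

```python
import math

# Values copied from the "Q=9,R=2" row of the parameter table above.
# Treating them as the parameters behind the reported bit scores is an
# assumption that the spot checks below support.
GAPPED_LAMBDA = 0.244
GAPPED_K = 0.0300


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores taken from alignments in this report, with the bit
# scores the report prints next to them.
for raw, reported in [(136, 52.9), (192, 72.6), (103, 41.3), (51, 23.0)]:
    print(raw, round(bit_score(raw), 1), reported)
```

Running the same conversion with the ungapped BLOSUM62 row (Lambda 0.318, K 0.135) gives noticeably higher values than those printed, which is why the Q=9,R=2 row is used here.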


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  300
  No. of states in DFA:  631 (67 KB)
  Total size of DFA:  280 KB (2145 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  27.05u 0.14s 27.19t   Elapsed:  00:00:01
  Total cpu time:  27.10u 0.14s 27.24t   Elapsed:  00:00:01
  Start:  Thu May  9 22:26:48 2013   End:  Thu May  9 22:26:49 2013
WARNINGS ISSUED:  2
