BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>018104
MKRVYLLAAFLLALVLGIVEGFDFHEKELESEEGLWDLYERWRSHHTVSRSLDEKHKRFN
VFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMY
GKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVD
CDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENV
PANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTGECGTELNHGVAAVGYGTTLDGT
KYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPIKKSATNPTGPSDYPKDEL

High Scoring Gene Products

Symbol | Full name | Information | P value
CEP1 | cysteine endopeptidase 1 | protein from Arabidopsis thaliana | 2.2e-144
CEP3 | cysteine endopeptidase 3 | protein from Arabidopsis thaliana | 1.3e-130
XCP2 | AT1G20850 | protein from Arabidopsis thaliana | 4.3e-93
RD21A | responsive to dehydration 21A | protein from Arabidopsis thaliana | 1.3e-91
RD21B | responsive to dehydration 21B | protein from Arabidopsis thaliana | 1.7e-91
XCP1 | xylem cysteine peptidase 1 | protein from Arabidopsis thaliana | 3.1e-90
AT3G19390 | - | protein from Arabidopsis thaliana | 1.1e-87
SAG12 | senescence-associated gene 12 | protein from Arabidopsis thaliana | 2.9e-87
AT3G19400 | - | protein from Arabidopsis thaliana | 5.8e-82
AT1G06260 | - | protein from Arabidopsis thaliana | 6.6e-81
XBCP3 | xylem bark cysteine peptidase 3 | protein from Arabidopsis thaliana | 1.6e-79
AT2G27420 | - | protein from Arabidopsis thaliana | 1.6e-79
CP1 | cysteine protease 1 | protein from Arabidopsis thaliana | 8.7e-79
CP2 | cysteine protease 2 | protein from Arabidopsis thaliana | 3.0e-78
AT4G23520 | - | protein from Arabidopsis thaliana | 4.3e-77
AT2G34080 | - | protein from Arabidopsis thaliana | 1.4e-73
AT1G29090 | - | protein from Arabidopsis thaliana | 1.2e-72
AT3G49340 | - | protein from Arabidopsis thaliana | 1.2e-72
AT1G29080 | - | protein from Arabidopsis thaliana | 3.3e-70
CTSL2 | Uncharacterized protein | protein from Gallus gallus | 1.7e-68
AT3G43960 | - | protein from Arabidopsis thaliana | 1.2e-67
ctsl1a | cathepsin L, 1 a | gene_product from Danio rerio | 9.4e-66
cprB | cysteine proteinase 2 | gene from Dictyostelium discoideum | 3.7e-64
Cp1 | Cysteine proteinase-1 | protein from Drosophila melanogaster | 7.6e-64
Ssc.54235 | Uncharacterized protein | protein from Sus scrofa | 7.6e-64
CTSL1 | Cathepsin L1 | protein from Sus scrofa | 7.6e-64
CTSL1 | Cathepsin L1 | protein from Bos taurus | 3.3e-63
Ctsl1 | cathepsin L1 | gene from Rattus norvegicus | 4.2e-63
Ctsl | cathepsin L | protein from Mus musculus | 1.1e-62
CTSL2 | Cathepsin L2 | protein from Bos taurus | 3.0e-62
CTSL1 | Cathepsin L1 | protein from Canis lupus familiaris | 4.8e-62
P83654 | Ervatamin-C | protein from Tabernaemontana divaricata | 1.6e-61
cprD | cysteine proteinase 4 | gene from Dictyostelium discoideum | 6.9e-61
CTSL2 | Uncharacterized protein | protein from Gallus gallus | 7.1e-61
CTSL1 | Cathepsin L1 | protein from Homo sapiens | 1.5e-60
AT1G29110 | - | protein from Arabidopsis thaliana | 1.9e-60
wu:fb37b09 | - | gene_product from Danio rerio | 2.4e-60
ctsl.1 | cathepsin L.1 | gene_product from Danio rerio | 2.4e-60
CTSL2 | Cathepsin L2 | protein from Homo sapiens | 3.1e-60
Ctsll3 | cathepsin L-like 3 | gene from Rattus norvegicus | 5.0e-60
ctsll | cathepsin L, like | gene_product from Danio rerio | 5.0e-60
CTSL1 | Cathepsin L1 | protein from Gallus gallus | 1.0e-59
zgc:174153 | - | gene_product from Danio rerio | 1.0e-59
zgc:174855 | - | gene_product from Danio rerio | 1.3e-59
cprC | cysteine proteinase 3 | gene from Dictyostelium discoideum | 1.7e-59
cprF | cysteine proteinase 6 | gene from Dictyostelium discoideum | 2.6e-59
CTSL1 | Cathepsin L1 | protein from Canis lupus familiaris | 4.5e-59
RGD1308751 | similar to Cathepsin L precursor (Major excreted protein) (MEP) | gene from Rattus norvegicus | 4.5e-59
ctsl1b | cathepsin L, 1 b | gene_product from Danio rerio | 9.3e-59
CTSL | Cathepsin L1 | protein from Ovis aries | 1.5e-58
CTSL1 | CTSL1 protein | protein from Bos taurus | 1.9e-58
CTSK | Cathepsin K | protein from Canis lupus familiaris | 1.9e-58
CTSK | Cathepsin K | protein from Canis lupus familiaris | 1.9e-58
AT3G45310 | - | protein from Arabidopsis thaliana | 3.2e-58
Ctsk | cathepsin K | gene from Rattus norvegicus | 1.1e-57
CTSK | Cathepsin K | protein from Bos taurus | 1.7e-57
cpl-1 | - | gene from Caenorhabditis elegans | 1.7e-57
cprG | cysteine proteinase 7 | gene from Dictyostelium discoideum | 2.0e-57
CTSK | Cathepsin K | protein from Sus scrofa | 2.2e-57
CTSK | Cathepsin K | protein from Homo sapiens | 2.8e-57
CTSK | Cathepsin K | protein from Gallus gallus | 3.6e-57
Ctss | cathepsin S | protein from Mus musculus | 1.2e-56
Ctsk | cathepsin K | protein from Mus musculus | 1.6e-56
CTSH | Pro-cathepsin H | protein from Sus scrofa | 2.0e-56
ALP | aleurain-like protease | protein from Arabidopsis thaliana | 2.5e-56
ctsk | cathepsin K | gene_product from Danio rerio | 2.5e-56
ctssb.2 | cathepsin S, b.2 | gene_product from Danio rerio | 2.5e-56
ctsh | cathepsin H | gene_product from Danio rerio | 3.3e-56
Cat-1 | Cathepsin L-like proteinase | protein from Fasciola hepatica | 5.3e-56
Cys | Crustapain | protein from Pandalus borealis | 5.3e-56
CTSH | Pro-cathepsin H | protein from Homo sapiens | 2.3e-55
CTSH | Pro-cathepsin H | protein from Bos taurus | 2.9e-55
cprH | cysteine proteinase 8 | gene from Dictyostelium discoideum | 3.7e-55
CTSH | Uncharacterized protein | protein from Gorilla gorilla gorilla | 3.7e-55
CTSH | Uncharacterized protein | protein from Macaca mulatta | 6.1e-55
CTSH | Uncharacterized protein | protein from Ailuropoda melanoleuca | 6.1e-55
CTSH | Uncharacterized protein | protein from Nomascus leucogenys | 6.1e-55
LOC100662496 | Uncharacterized protein | protein from Loxodonta africana | 7.8e-55
cfaD | peptidase C1A family protein | gene from Dictyostelium discoideum | 1.6e-54
CTSS | Cathepsin S | protein from Bos taurus | 1.6e-54
CTSH | Uncharacterized protein | protein from Callithrix jacchus | 1.6e-54
CTSH | Uncharacterized protein | protein from Callithrix jacchus | 1.6e-54
CTSS | Cathepsin S | protein from Canis lupus familiaris | 4.3e-54
Testin | testin gene | gene from Rattus norvegicus | 4.3e-54
CTSS | Cathepsin S | protein from Canis lupus familiaris | 5.5e-54
CTSH | Uncharacterized protein | protein from Canis lupus familiaris | 7.0e-54
CTSH | Uncharacterized protein | protein from Equus caballus | 7.0e-54
CTSS | Uncharacterized protein | protein from Sus scrofa | 8.9e-54
cprE | cysteine proteinase 5 | gene from Dictyostelium discoideum | 1.9e-53
4930486L24Rik | RIKEN cDNA 4930486L24 gene | protein from Mus musculus | 2.4e-53
CTSS | Cathepsin S | protein from Homo sapiens | 3.0e-53
Ctss | cathepsin S | gene from Rattus norvegicus | 4.9e-53
P83443 | Macrodontain-1 | protein from Pseudananas sagenarius | 6.3e-53
PF11_0161 | falcipain-2 precursor, putative | gene from Plasmodium falciparum | 2.1e-52
PF11_0161 | Falcipain-2B | protein from Plasmodium falciparum 3D7 | 2.1e-52
LOC420160 | Uncharacterized protein | protein from Gallus gallus | 4.4e-52
Ctsh | cathepsin H | protein from Mus musculus | 1.2e-51
CTSH | Uncharacterized protein | protein from Oryctolagus cuniculus | 1.5e-51
PF11_0165 | falcipain 2 precursor | gene from Plasmodium falciparum | 2.4e-51
PF11_0165 | Falcipain-2A | protein from Plasmodium falciparum 3D7 | 2.4e-51

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  018104
        (360 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2157712 - symbol:CEP1 "cysteine endopeptidase ...  1411  2.2e-144  1
TAIR|locus:505006391 - symbol:CEP3 "cysteine endopeptidas...  1281  1.3e-130  1
TAIR|locus:2030427 - symbol:XCP2 "xylem cysteine peptidas...   927  4.3e-93   1
TAIR|locus:2825832 - symbol:RD21A "responsive to dehydrat...   913  1.3e-91   1
TAIR|locus:2167821 - symbol:RD21B "responsive to dehydrat...   912  1.7e-91   1
TAIR|locus:2122113 - symbol:XCP1 "xylem cysteine peptidas...   900  3.1e-90   1
TAIR|locus:2090614 - symbol:AT3G19390 species:3702 "Arabi...   876  1.1e-87   1
TAIR|locus:2152445 - symbol:SAG12 "senescence-associated ...   872  2.9e-87   1
TAIR|locus:2090629 - symbol:AT3G19400 species:3702 "Arabi...   822  5.8e-82   1
TAIR|locus:2038515 - symbol:AT1G06260 species:3702 "Arabi...   812  6.6e-81   1
TAIR|locus:2024362 - symbol:XBCP3 "xylem bark cysteine pe...   799  1.6e-79   1
TAIR|locus:2038588 - symbol:AT2G27420 species:3702 "Arabi...   799  1.6e-79   1
TAIR|locus:2128243 - symbol:AT4G11310 species:3702 "Arabi...   792  8.7e-79   1
TAIR|locus:2128253 - symbol:AT4G11320 species:3702 "Arabi...   787  3.0e-78   1
TAIR|locus:2117979 - symbol:AT4G23520 species:3702 "Arabi...   776  4.3e-77   1
TAIR|locus:2055440 - symbol:AT2G34080 species:3702 "Arabi...   743  1.4e-73   1
TAIR|locus:2029924 - symbol:AT1G29090 species:3702 "Arabi...   734  1.2e-72   1
TAIR|locus:2082881 - symbol:AT3G49340 species:3702 "Arabi...   734  1.2e-72   1
TAIR|locus:2029934 - symbol:AT1G29080 species:3702 "Arabi...   711  3.3e-70   1
UNIPROTKB|F1NYJ1 - symbol:CTSL2 "Uncharacterized protein"...   695  1.7e-68   1
TAIR|locus:2097104 - symbol:AT3G43960 species:3702 "Arabi...   687  1.2e-67   1
ZFIN|ZDB-GENE-030131-106 - symbol:ctsl1a "cathepsin L, 1 ...   669  9.4e-66   1
DICTYBASE|DDB_G0279799 - symbol:cprB "cysteine proteinase...   539  3.7e-64   2
FB|FBgn0013770 - symbol:Cp1 "Cysteine proteinase-1" speci...   651  7.6e-64   1
UNIPROTKB|F1S4J6 - symbol:Ssc.54235 "Cathepsin L1" specie...   651  7.6e-64   1
UNIPROTKB|Q28944 - symbol:CTSL1 "Cathepsin L1" species:98...   651  7.6e-64   1
UNIPROTKB|P25975 - symbol:CTSL1 "Cathepsin L1" species:99...   645  3.3e-63   1
RGD|2448 - symbol:Ctsl1 "cathepsin L1" species:10116 "Rat...   644  4.2e-63   1
MGI|MGI:88564 - symbol:Ctsl "cathepsin L" species:10090 "...   640  1.1e-62   1
UNIPROTKB|Q5E998 - symbol:CTSL2 "Cathepsin L2" species:99...   636  3.0e-62   1
UNIPROTKB|Q9GL24 - symbol:CTSL1 "Cathepsin L1" species:96...   634  4.8e-62   1
UNIPROTKB|P83654 - symbol:P83654 "Ervatamin-C" species:52...   629  1.6e-61   1
DICTYBASE|DDB_G0278721 - symbol:cprD "cysteine proteinase...   520  6.9e-61   2
UNIPROTKB|F1NEC8 - symbol:CTSL2 "Uncharacterized protein"...   623  7.1e-61   1
UNIPROTKB|P07711 - symbol:CTSL1 "Cathepsin L1" species:96...   620  1.5e-60   1
TAIR|locus:2030027 - symbol:AT1G29110 species:3702 "Arabi...   619  1.9e-60   1
ZFIN|ZDB-GENE-030131-572 - symbol:wu:fb37b09 "wu:fb37b09"...   618  2.4e-60   1
ZFIN|ZDB-GENE-040718-61 - symbol:ctsl.1 "cathepsin L.1" s...   618  2.4e-60   1
UNIPROTKB|O60911 - symbol:CTSL2 "Cathepsin L2" species:96...   617  3.1e-60   1
RGD|1560071 - symbol:Ctsll3 "cathepsin L-like 3" species:...   615  5.0e-60   1
ZFIN|ZDB-GENE-041010-76 - symbol:ctsll "cathepsin L, like...   615  5.0e-60   1
UNIPROTKB|P09648 - symbol:CTSL1 "Cathepsin L1" species:90...   612  1.0e-59   1
ZFIN|ZDB-GENE-080215-7 - symbol:zgc:174153 "zgc:174153" s...   612  1.0e-59   1
ZFIN|ZDB-GENE-071004-74 - symbol:zgc:174855 "zgc:174855" ...   611  1.3e-59   1
DICTYBASE|DDB_G0283867 - symbol:cprC "cysteine proteinase...   610  1.7e-59   1
DICTYBASE|DDB_G0279185 - symbol:cprF "cysteine proteinase...   500  2.6e-59   2
UNIPROTKB|F1PMM9 - symbol:CTSL1 "Cathepsin L1" species:96...   606  4.5e-59   1
RGD|1308751 - symbol:RGD1308751 "similar to Cathepsin L p...   606  4.5e-59   1
ZFIN|ZDB-GENE-980526-285 - symbol:ctsl1b "cathepsin L, 1 ...   603  9.3e-59   1
UNIPROTKB|Q10991 - symbol:CTSL "Cathepsin L1" species:994...   601  1.5e-58   1
UNIPROTKB|A4IFS7 - symbol:CTSL1 "CTSL1 protein" species:9...   600  1.9e-58   1
UNIPROTKB|G1K2A7 - symbol:CTSK "Cathepsin K" species:9615...   600  1.9e-58   1
UNIPROTKB|Q3ZKN1 - symbol:CTSK "Cathepsin K" species:9615...   600  1.9e-58   1
TAIR|locus:2078312 - symbol:AT3G45310 species:3702 "Arabi...   598  3.2e-58   1
RGD|61810 - symbol:Ctsk "cathepsin K" species:10116 "Ratt...   593  1.1e-57   1
UNIPROTKB|Q5E968 - symbol:CTSK "Cathepsin K" species:9913...   591  1.7e-57   1
WB|WBGene00000776 - symbol:cpl-1 species:6239 "Caenorhabd...   591  1.7e-57   1
DICTYBASE|DDB_G0279187 - symbol:cprG "cysteine proteinase...   490  2.0e-57   2
UNIPROTKB|Q9GLE3 - symbol:CTSK "Cathepsin K" species:9823...   590  2.2e-57   1
UNIPROTKB|P43235 - symbol:CTSK "Cathepsin K" species:9606...   589  2.8e-57   1
UNIPROTKB|Q90686 - symbol:CTSK "Cathepsin K" species:9031...   588  3.6e-57   1
MGI|MGI:107341 - symbol:Ctss "cathepsin S" species:10090 ...   583  1.2e-56   1
MGI|MGI:107823 - symbol:Ctsk "cathepsin K" species:10090 ...   582  1.6e-56   1
UNIPROTKB|O46427 - symbol:CTSH "Pro-cathepsin H" species:...   581  2.0e-56   1
TAIR|locus:2175088 - symbol:ALP "aleurain-like protease" ...   580  2.5e-56   1
ZFIN|ZDB-GENE-001205-4 - symbol:ctsk "cathepsin K" specie...   580  2.5e-56   1
ZFIN|ZDB-GENE-050626-55 - symbol:ctssb.2 "cathepsin S, b....   580  2.5e-56   1
ZFIN|ZDB-GENE-030131-3539 - symbol:ctsh "cathepsin H" spe...   579  3.3e-56   1
UNIPROTKB|Q24940 - symbol:Cat-1 "Cathepsin L-like protein...   577  5.3e-56   1
UNIPROTKB|Q86GF7 - symbol:Cys "Crustapain" species:6703 "...   577  5.3e-56   1
UNIPROTKB|P09668 - symbol:CTSH "Pro-cathepsin H" species:...   571  2.3e-55   1
UNIPROTKB|Q3T0I2 - symbol:CTSH "Pro-cathepsin H" species:...   570  2.9e-55   1
DICTYBASE|DDB_G0278401 - symbol:cprH "cysteine proteinase...   569  3.7e-55   1
UNIPROTKB|G3R9A7 - symbol:CTSH "Uncharacterized protein" ...   569  3.7e-55   1
UNIPROTKB|F6R7P5 - symbol:CTSH "Uncharacterized protein" ...   567  6.1e-55   1
UNIPROTKB|G1M0X4 - symbol:CTSH "Uncharacterized protein" ...   567  6.1e-55   1
UNIPROTKB|G1RBY1 - symbol:CTSH "Uncharacterized protein" ...   567  6.1e-55   1
UNIPROTKB|G3SSC1 - symbol:CTSH "Uncharacterized protein" ...   566  7.8e-55   1
DICTYBASE|DDB_G0281605 - symbol:cfaD "peptidase C1A famil...   563  1.6e-54   1
UNIPROTKB|P25326 - symbol:CTSS "Cathepsin S" species:9913...   563  1.6e-54   1
UNIPROTKB|F7B939 - symbol:CTSH "Uncharacterized protein" ...   563  1.6e-54   1
UNIPROTKB|F7BRD4 - symbol:CTSH "Uncharacterized protein" ...   563  1.6e-54   1
UNIPROTKB|Q8HY81 - symbol:CTSS "Cathepsin S" species:9615...   559  4.3e-54   1
RGD|708447 - symbol:Testin "testin gene" species:10116 "R...   559  4.3e-54   1
UNIPROTKB|F1PAK0 - symbol:CTSS "Cathepsin S" species:9615...   558  5.5e-54   1
UNIPROTKB|F6X9C1 - symbol:CTSH "Uncharacterized protein" ...   557  7.0e-54   1
UNIPROTKB|F7BJD8 - symbol:CTSH "Uncharacterized protein" ...   557  7.0e-54   1
UNIPROTKB|F1SS93 - symbol:CTSS "Uncharacterized protein" ...   556  8.9e-54   1
DICTYBASE|DDB_G0272815 - symbol:cprE "cysteine proteinase...   553  1.9e-53   1
MGI|MGI:1922258 - symbol:4930486L24Rik "RIKEN cDNA 493048...   552  2.4e-53   1
UNIPROTKB|P25774 - symbol:CTSS "Cathepsin S" species:9606...   551  3.0e-53   1
RGD|621513 - symbol:Ctss "cathepsin S" species:10116 "Rat...   549  4.9e-53   1
UNIPROTKB|P83443 - symbol:P83443 "Macrodontain-1" species...   548  6.3e-53   1
GENEDB_PFALCIPARUM|PF11_0161 - symbol:PF11_0161 "falcipai...   543  2.1e-52   1
UNIPROTKB|Q8I6U5 - symbol:PF11_0161 "Falcipain-2B" specie...   543  2.1e-52   1
UNIPROTKB|F1NZ37 - symbol:LOC420160 "Uncharacterized prot...   540  4.4e-52   1
MGI|MGI:107285 - symbol:Ctsh "cathepsin H" species:10090 ...   536  1.2e-51   1
UNIPROTKB|G1SQF0 - symbol:CTSH "Uncharacterized protein" ...   535  1.5e-51   1
GENEDB_PFALCIPARUM|PF11_0165 - symbol:PF11_0165 "falcipai...   533  2.4e-51   1
UNIPROTKB|Q8I6U4 - symbol:PF11_0165 "Falcipain-2A" specie...   533  2.4e-51   1

WARNING:  Descriptions of 197 database sequences were not reported due to the
          limiting value of parameter V = 100.
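
Note: the one-line hit summaries above follow a regular layout (database and accession, a truncated description, then Score, P(N) and N), so they can be read into records with a small parser. The Python sketch below is illustrative only and is not part of WU-BLAST. The names Hit, HIT_LINE and parse_hit_line are hypothetical, and the regular expression is an assumption inferred solely from the lines shown in this report.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    database: str    # e.g. "TAIR", "UNIPROTKB"
    accession: str   # e.g. "locus:2157712"
    score: int       # "High Score" column
    p_value: float   # "Smallest Sum Probability P(N)" column
    n: int           # "N" column


# Assumed layout (inferred from this report, not from WU-BLAST documentation):
#   <DB>|<accession> - <truncated description>  <score>  <P(N)>  <N>
HIT_LINE = re.compile(
    r"^(?P<db>[^|\s]+)\|(?P<acc>\S+) - .*?"
    r"\s+(?P<score>\d+)\s+(?P<p>\d+(?:\.\d*)?(?:[eE][-+]?\d+)?)\s+(?P<n>\d+)\s*$"
)


def parse_hit_line(line: str) -> Optional[Hit]:
    """Return a Hit for a summary line, or None if the line does not match."""
    m = HIT_LINE.match(line)
    if m is None:
        return None
    return Hit(m["db"], m["acc"], int(m["score"]), float(m["p"]), int(m["n"]))
```

Lines that do not match, such as the column header or the WARNING text, yield None and can simply be skipped when iterating over the report.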


>TAIR|locus:2157712
            symbol:CEP1 "cysteine endopeptidase 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005783 EMBL:CP002688
            GenomeReviews:BA000015_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 PROSITE:PS00014
            EMBL:AB024031 MEROPS:I29.003 EMBL:HM367092 EMBL:AY091087
            IPI:IPI00516991 RefSeq:NP_568722.1 UniGene:At.7918 HSSP:O65039
            ProteinModelPortal:Q9FGR9 SMR:Q9FGR9 PaxDb:Q9FGR9 PRIDE:Q9FGR9
            EnsemblPlants:AT5G50260.1 GeneID:835091 KEGG:ath:AT5G50260
            TAIR:At5g50260 HOGENOM:HOG000230773 InParanoid:Q9FGR9 KO:K16292
            OMA:WHSKKYH PhylomeDB:Q9FGR9 ProtClustDB:CLSN2689970
            Genevestigator:Q9FGR9 Uniprot:Q9FGR9
        Length = 361

 Score = 1411 (501.8 bits), Expect = 2.2e-144, P = 2.2e-144
 Identities = 258/343 (75%), Positives = 291/343 (84%)

Query:    20 EGFDFHEKELESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPY 79
             +G DFH K++ESE  LW+LYERWRSHHTV+RSL+EK KRFNVFK NV H+H+TNK DK Y
Sbjct:    19 KGLDFHNKDVESENSLWELYERWRSHHTVARSLEEKAKRFNVFKHNVKHIHETNKKDKSY 78

Query:    80 KLKLNKFADMTNHEFASTYAGSKIKHHRMFQGTR-GNGTFMYGKVTSIPPSVDWRKKGSV 138
             KLKLNKF DMT+ EF  TYAGS IKHHRMFQG +    +FMY  V ++P SVDWRK G+V
Sbjct:    79 KLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAV 138

Query:   139 TAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAF 198
             T VK+QGQCGSCWAFST+ AVEGIN I T KL SLSEQELVDCDT+QNQGCNGGLM+LAF
Sbjct:   139 TPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCDTNQNQGCNGGLMDLAF 198

Query:   199 EFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVS 258
             EFIK+KGG+T+E  YPY+A+D TCD +KE++P VSIDGHE+VP N ED L+KAVA QPVS
Sbjct:   199 EFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVS 258

Query:   259 VAIDAGSSDFQFYSEGVFTGECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGY 318
             VAIDAG SDFQFYSEGVFTG CGTELNHGVA VGYGTT+DGTKYWIV+NSWG EWGEKGY
Sbjct:   259 VAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGY 318

Query:   319 IRMQRGISDKKGLCGIAMEASYPIKKSATNPTGPS-DYPKDEL 360
             IRMQRGI  K+GLCGIAMEASYP+K S TNP+  S D  KDEL
Sbjct:   319 IRMQRGIRHKEGLCGIAMEASYPLKNSNTNPSRLSLDSLKDEL 361
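
Note: each alignment in this report is preceded by a "Score = ..." line and an "Identities = ..." line. The Python sketch below shows one way to read those two lines and to recompute the percentages. It is a hedged illustration: SCORE_RE, IDENT_RE, truncated_percent and parse_hsp_header are hypothetical names, the patterns cover only the single-HSP header form seen here, and the truncating (round-down) percentage rule is inferred from the numbers in this report rather than taken from WU-BLAST documentation.

```python
import re

# Patterns assumed from the header lines shown in this report.
SCORE_RE = re.compile(
    r"Score = (?P<score>\d+) \((?P<bits>[\d.]+) bits\), "
    r"Expect = (?P<expect>[^,\s]+), P = (?P<p>\S+)"
)
IDENT_RE = re.compile(
    r"Identities = (?P<id_num>\d+)/(?P<id_den>\d+) \((?P<id_pct>\d+)%\), "
    r"Positives = (?P<pos_num>\d+)/(?P<pos_den>\d+) \((?P<pos_pct>\d+)%\)"
)


def truncated_percent(numerator: int, denominator: int) -> int:
    """Percentage rounded down, which matches the values printed in this report."""
    return (100 * numerator) // denominator


def parse_hsp_header(score_line: str, ident_line: str) -> dict:
    """Extract score, bits, Expect, P and the identity/positive counts."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(ident_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP header")
    return {
        "score": int(s["score"]),
        "bits": float(s["bits"]),
        "expect": float(s["expect"]),
        "p": float(s["p"]),
        # (matches, alignment length, percentage as printed)
        "identities": (int(i["id_num"]), int(i["id_den"]), int(i["id_pct"])),
        "positives": (int(i["pos_num"]), int(i["pos_den"]), int(i["pos_pct"])),
    }
```

For the CEP1 alignment above, 258/343 identities gives 75% and 291/343 positives gives 84% under this rule, in agreement with the printed values.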


>TAIR|locus:505006391
            symbol:CEP3 "cysteine endopeptidase 3" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005783 EMBL:CP002686
            GenomeReviews:BA000014_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 PROSITE:PS00014
            EMBL:AL049659 HSSP:O65039 HOGENOM:HOG000230773 KO:K16292
            EMBL:AK119026 IPI:IPI00525150 PIR:T06707 RefSeq:NP_566901.1
            UniGene:At.3162 ProteinModelPortal:Q9STL5 SMR:Q9STL5 MEROPS:C01.A02
            PRIDE:Q9STL5 EnsemblPlants:AT3G48350.1 GeneID:823993
            KEGG:ath:AT3G48350 TAIR:At3g48350 InParanoid:Q9STL5 OMA:DITHHEF
            PhylomeDB:Q9STL5 ProtClustDB:CLSN2917387 Genevestigator:Q9STL5
            Uniprot:Q9STL5
        Length = 364

 Score = 1281 (456.0 bits), Expect = 1.3e-130, P = 1.3e-130
 Identities = 239/346 (69%), Positives = 278/346 (80%)

Query:    20 EGFDFHEKELESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPY 79
             +GFDF EKELE+EE +W LYERWR HH+VSR+  E  KRFNVF+ NV+HVH+TNK +KPY
Sbjct:    19 KGFDFDEKELETEENVWKLYERWRGHHSVSRASHEAIKRFNVFRHNVLHVHRTNKKNKPY 78

Query:    80 KLKLNKFADMTNHEFASTYAGSKIKHHRMFQGT-RGNGTFMYGKVTSIPPSVDWRKKGSV 138
             KLK+N+FAD+T+HEF S+YAGS +KHHRM +G  RG+G FMY  VT +P SVDWR+KG+V
Sbjct:    79 KLKINRFADITHHEFRSSYAGSNVKHHRMLRGPKRGSGGFMYENVTRVPSSVDWREKGAV 138

Query:   139 TAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAF 198
             T VK+Q  CGSCWAFST+AAVEGIN I TNKLVSLSEQELVDCDT++NQGC GGLME AF
Sbjct:   139 TEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQGCAGGLMEPAF 198

Query:   199 EFIKKKGGVTTEAKYPYQANDGT-CDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPV 257
             EFIK  GG+ TE  YPY ++D   C  +      V+IDGHE+VP N E+ LLKAVA QPV
Sbjct:   199 EFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAVAHQPV 258

Query:   258 SVAIDAGSSDFQFYSEGVFTGECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKG 317
             SVAIDAGSSDFQ YSEGVF GECGT+LNHGV  VGYG T +GTKYWIVRNSWGPEWGE G
Sbjct:   259 SVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGG 318

Query:   318 YIRMQRGISDKKGLCGIAMEASYPIKKSATNPTGPS---DYPKDEL 360
             Y+R++RGIS+ +G CGIAMEASYP K S+T  T  S   D  KDEL
Sbjct:   319 YVRIERGISENEGRCGIAMEASYPTKLSSTPSTHESVVRDDVKDEL 364
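
Note: every alignment header in this report prints the same value for Expect and P. This is consistent with the Poisson relation P = 1 - exp(-E) that is standard in BLAST statistics, since for very small E the two quantities are numerically indistinguishable. The Python sketch below evaluates that relation. Its application to this program's P column is an assumption, and p_from_expect is a hypothetical helper name.

```python
import math


def p_from_expect(expect: float) -> float:
    """Probability of at least one chance hit, given an expected count E.

    Uses P = 1 - exp(-E); expm1 keeps precision when E is tiny.
    """
    return -math.expm1(-expect)
```

With E = 1.3e-130, as in the CEP3 alignment above, the function returns a value equal to E to within floating-point precision. The two quantities only diverge as E approaches 1, where P is roughly 0.63.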


>TAIR|locus:2030427
            symbol:XCP2 "xylem cysteine peptidase 2" species:3702
            "Arabidopsis thaliana" [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009507 "chloroplast" evidence=ISM] [GO:0008233 "peptidase
            activity" evidence=ISS] [GO:0005618 "cell wall" evidence=IDA]
            [GO:0010623 "developmental programmed cell death" evidence=IMP]
            [GO:0010075 "regulation of meristem growth" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0005886 GO:GO:0005618 GO:GO:0005773
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AC069251 EMBL:AC007369 GO:GO:0010623
            OMA:YKEIPEG HOGENOM:HOG000230773 KO:K16290 EMBL:AF191028
            EMBL:BT004822 IPI:IPI00526722 PIR:A86341 RefSeq:NP_564126.1
            UniGene:At.21316 ProteinModelPortal:Q9LM66 SMR:Q9LM66 IntAct:Q9LM66
            STRING:Q9LM66 MEROPS:C01.120 PaxDb:Q9LM66 PRIDE:Q9LM66
            ProMEX:Q9LM66 EnsemblPlants:AT1G20850.1 GeneID:838677
            KEGG:ath:AT1G20850 GeneFarm:5034 TAIR:At1g20850 InParanoid:Q9LM66
            PhylomeDB:Q9LM66 ProtClustDB:CLSN2917031 Genevestigator:Q9LM66
            GermOnline:AT1G20850 Uniprot:Q9LM66
        Length = 356

 Score = 927 (331.4 bits), Expect = 4.3e-93, P = 4.3e-93
 Identities = 174/321 (54%), Positives = 227/321 (70%)

Query:    24 FHEKELESEEGLWDLYERWRSHHTVS-RSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLK 82
             +  ++LES + L +L+E W S+   +  +++EK  RF VFK N+ H+ +TNK  K Y L 
Sbjct:    36 YSPEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKSYWLG 95

Query:    83 LNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVK 142
             LN+FAD+++ EF   Y G K    R  +  R    F Y  V ++P SVDWRKKG+V  VK
Sbjct:    96 LNEFADLSHEEFKKMYLGLKTDIVRRDE-ERSYAEFAYRDVEAVPKSVDWRKKGAVAEVK 154

Query:   143 DQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIK 202
             +QG CGSCWAFST+AAVEGIN I+T  L +LSEQEL+DCDT  N GCNGGLM+ AFE+I 
Sbjct:   155 NQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIV 214

Query:   203 KKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAID 262
             K GG+  E  YPY   +GTC++ K+ S  V+I+GH++VP N E +LLKA+A QP+SVAID
Sbjct:   215 KNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAID 274

Query:   263 AGSSDFQFYSEGVFTGECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQ 322
             A   +FQFYS GVF G CG +L+HGVAAVGYG++  G+ Y IV+NSWGP+WGEKGYIR++
Sbjct:   275 ASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSS-KGSDYIIVKNSWGPKWGEKGYIRLK 333

Query:   323 RGISDKKGLCGIAMEASYPIK 343
             R     +GLCGI   AS+P K
Sbjct:   334 RNTGKPEGLCGINKMASFPTK 354


>TAIR|locus:2825832
            symbol:RD21A "responsive to dehydration 21A" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;IMP]
            [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISS;IDA;IMP] [GO:0009507 "chloroplast" evidence=IDA]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0008233 "peptidase
            activity" evidence=IDA] [GO:0048046 "apoplast" evidence=IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005829 "cytosol"
            evidence=RCA] [GO:0009506 "plasmodesma" evidence=IDA] [GO:0050832
            "defense response to fungus" evidence=IMP] [GO:0006096 "glycolysis"
            evidence=RCA] [GO:0006833 "water transport" evidence=RCA]
            [GO:0006972 "hyperosmotic response" evidence=RCA] [GO:0007030
            "Golgi organization" evidence=RCA] [GO:0009266 "response to
            temperature stimulus" evidence=RCA] [GO:0009651 "response to salt
            stress" evidence=RCA] [GO:0015996 "chlorophyll catabolic process"
            evidence=RCA] [GO:0042744 "hydrogen peroxide catabolic process"
            evidence=RCA] [GO:0046686 "response to cadmium ion" evidence=RCA]
            [GO:0009414 "response to water deprivation" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0009506 GO:GO:0009507 GO:GO:0005773
            GO:GO:0050832 GO:GO:0048046 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC083835
            HOGENOM:HOG000230773 KO:K01376 InterPro:IPR000118 Pfam:PF00396
            SMART:SM00277 UniGene:At.43549 EMBL:D13043 EMBL:AY072130
            EMBL:AY133781 IPI:IPI00530094 PIR:JN0719 RefSeq:NP_564497.1
            UniGene:At.47599 UniGene:At.71705 ProteinModelPortal:P43297
            SMR:P43297 IntAct:P43297 STRING:P43297 MEROPS:C01.064 PaxDb:P43297
            PRIDE:P43297 ProMEX:P43297 EnsemblPlants:AT1G47128.1 GeneID:841122
            KEGG:ath:AT1G47128 TAIR:At1g47128 InParanoid:P43297 OMA:EAWLVKH
            PhylomeDB:P43297 ProtClustDB:CLSN2688498 Genevestigator:P43297
            GermOnline:AT1G47128 Uniprot:P43297
        Length = 462

 Score = 913 (326.5 bits), Expect = 1.3e-91, P = 1.3e-91
 Identities = 174/330 (52%), Positives = 224/330 (67%)

Query:    31 SEEGLWDLYERWRSHHTVSRS---LDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFA 87
             SE  +  +YE W   H  ++S   L EK +RF +FK N+  V + N+ +  Y+L L +FA
Sbjct:    42 SEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFA 101

Query:    88 DMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKV-TSIPPSVDWRKKGSVTAVKDQGQ 146
             D+TN E+ S Y G+K++     +G R        +V   +P S+DWRKKG+V  VKDQG 
Sbjct:   102 DLTNDEYRSKYLGAKMEK----KGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGG 157

Query:   147 CGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGG 206
             CGSCWAFSTI AVEGIN I+T  L++LSEQELVDCDT  N+GCNGGLM+ AFEFI K GG
Sbjct:   158 CGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGG 217

Query:   207 VTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSS 266
             + T+  YPY+  DGTCD  ++++  V+ID +E+VP   E++L KAVA QP+S+AI+AG  
Sbjct:   218 IDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGR 277

Query:   267 DFQFYSEGVFTGECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGIS 326
              FQ Y  G+F G CGT+L+HGV AVGYGT  +G  YWIVRNSWG  WGE GY+RM R I+
Sbjct:   278 AFQLYDSGIFDGSCGTQLDHGVVAVGYGTE-NGKDYWIVRNSWGKSWGESGYLRMARNIA 336

Query:   327 DKKGLCGIAMEASYPIKKSATNPTGPSDYP 356
                G CGIA+E SYPIK +  NP  P   P
Sbjct:   337 SSSGKCGIAIEPSYPIK-NGENPPNPGPSP 365


>TAIR|locus:2167821
            symbol:RD21B "responsive to dehydration 21B" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS] [GO:0005773
            "vacuole" evidence=IDA] [GO:0009651 "response to salt stress"
            evidence=IEP] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005829 "cytosol" evidence=IDA] [GO:0052541 "plant-type cell
            wall cellulose metabolic process" evidence=RCA] [GO:0052546 "cell
            wall pectin metabolic process" evidence=RCA] [GO:0005783
            "endoplasmic reticulum" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005783 GO:GO:0005829 EMBL:CP002688
            GO:GO:0005773 GO:GO:0009651 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AB008267 HSSP:O65039
            HOGENOM:HOG000230773 KO:K01376 InterPro:IPR000118 Pfam:PF00396
            SMART:SM00277 ProtClustDB:CLSN2688498 EMBL:AY062608 EMBL:AY114661
            IPI:IPI00520971 RefSeq:NP_568620.1 UniGene:At.24130 SMR:Q9FMH8
            IntAct:Q9FMH8 STRING:Q9FMH8 MEROPS:C01.A12
            EnsemblPlants:AT5G43060.1 GeneID:834321 KEGG:ath:AT5G43060
            TAIR:At5g43060 InParanoid:Q9FMH8 OMA:ENSEASL Genevestigator:Q9FMH8
            Uniprot:Q9FMH8
        Length = 463

 Score = 912 (326.1 bits), Expect = 1.7e-91, P = 1.7e-91
 Identities = 181/324 (55%), Positives = 219/324 (67%)

Query:    38 LYERWRSHHTVSRSLD-----EKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNH 92
             +YE W   H   +        EK +RF +FK N+  + + N  +  YKL L +FAD+TN 
Sbjct:    49 IYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTKNLSYKLGLTRFADLTNE 108

Query:    93 EFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWA 152
             E+ S Y G+K    R+ + T        G   ++P SVDWRK+G+V  VKDQG CGSCWA
Sbjct:   109 EYRSMYLGAK-PTKRVLK-TSDRYQARVGD--ALPDSVDWRKEGAVADVKDQGSCGSCWA 164

Query:   153 FSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAK 212
             FSTI AVEGIN I+T  L+SLSEQELVDCDT  NQGCNGGLM+ AFEFI K GG+ TEA 
Sbjct:   165 FSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEAD 224

Query:   213 YPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYS 272
             YPY+A DG CD +++++  V+ID +E+VP N E +L KA+A QP+SVAI+AG   FQ YS
Sbjct:   225 YPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYS 284

Query:   273 EGVFTGECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLC 332
              GVF G CGTEL+HGV AVGYGT  +G  YWIVRNSWG  WGE GYI+M R I    G C
Sbjct:   285 SGVFDGLCGTELDHGVVAVGYGTE-NGKDYWIVRNSWGNRWGESGYIKMARNIEAPTGKC 343

Query:   333 GIAMEASYPIKKSATNPTGPSDYP 356
             GIAMEASYPIKK   NP  P   P
Sbjct:   344 GIAMEASYPIKKGQ-NPPNPGPSP 366


>TAIR|locus:2122113
            symbol:XCP1 "xylem cysteine peptidase 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0000325 "plant-type vacuole" evidence=IDA] [GO:0005634
            "nucleus" evidence=IDA] [GO:0010623 "developmental programmed cell
            death" evidence=IMP] [GO:0010413 "glucuronoxylan metabolic process"
            evidence=RCA] [GO:0045492 "xylan biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005886
            GO:GO:0005634 EMBL:CP002687 GenomeReviews:CT486007_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0000325
            EMBL:AL022604 EMBL:AL161587 GO:GO:0010623 MEROPS:I29.003
            HOGENOM:HOG000230773 EMBL:AF191027 EMBL:AK117394 EMBL:BT005179
            IPI:IPI00532220 PIR:T06122 RefSeq:NP_567983.1 UniGene:At.2280
            UniGene:At.67622 ProteinModelPortal:O65493 SMR:O65493 STRING:O65493
            PaxDb:O65493 PRIDE:O65493 EnsemblPlants:AT4G35350.1 GeneID:829688
            KEGG:ath:AT4G35350 GeneFarm:5033 TAIR:At4g35350 InParanoid:O65493
            KO:K16290 OMA:FEVFREN PhylomeDB:O65493 ProtClustDB:CLSN2689772
            Genevestigator:O65493 Uniprot:O65493
        Length = 355

 Score = 900 (321.9 bits), Expect = 3.1e-90, P = 3.1e-90
 Identities = 173/319 (54%), Positives = 222/319 (69%)

Query:    27 KELESEEGLWDLYERWRSHHTVS-RSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNK 85
             + L + + L +L+E W S H+ + +S++EK  RF VF++N+MH+ Q N     Y L LN+
Sbjct:    39 EHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNE 98

Query:    86 FADMTNHEFASTYAG-SKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQ 144
             FAD+T+ EF   Y G +K +  R  Q +     F Y  +T +P SVDWRKKG+V  VKDQ
Sbjct:    99 FADLTHEEFKGRYLGLAKPQFSRKRQPS---ANFRYRDITDLPKSVDWRKKGAVAPVKDQ 155

Query:   145 GQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKK 204
             GQCGSCWAFST+AAVEGIN I T  L SLSEQEL+DCDT  N GCNGGLM+ AF++I   
Sbjct:   156 GQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIIST 215

Query:   205 GGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAG 264
             GG+  E  YPY   +G C   KE    V+I G+E+VP N +++L+KA+A QPVSVAI+A 
Sbjct:   216 GGLHKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEAS 275

Query:   265 SSDFQFYSEGVFTGECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRG 324
               DFQFY  GVF G+CGT+L+HGVAAVGYG++  G+ Y IV+NSWGP WGEKG+IRM+R 
Sbjct:   276 GRDFQFYKGGVFNGKCGTDLDHGVAAVGYGSS-KGSDYVIVKNSWGPRWGEKGFIRMKRN 334

Query:   325 ISDKKGLCGIAMEASYPIK 343
                 +GLCGI   ASYP K
Sbjct:   335 TGKPEGLCGINKMASYPTK 353


>TAIR|locus:2090614
            symbol:AT3G19390 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA;ISS] [GO:0000041 "transition metal ion
            transport" evidence=RCA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:CP002686 GenomeReviews:BA000014_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AB025624
            MEROPS:I29.003 OMA:KAMDQKC HSSP:O65039 HOGENOM:HOG000230773
            InterPro:IPR000118 Pfam:PF00396 SMART:SM00277 EMBL:AY062725
            EMBL:AY093350 IPI:IPI00520189 RefSeq:NP_566633.1 UniGene:At.27473
            ProteinModelPortal:Q9LT78 SMR:Q9LT78 IntAct:Q9LT78 STRING:Q9LT78
            PaxDb:Q9LT78 PRIDE:Q9LT78 EnsemblPlants:AT3G19390.1 GeneID:821473
            KEGG:ath:AT3G19390 TAIR:At3g19390 InParanoid:Q9LT78
            PhylomeDB:Q9LT78 ProtClustDB:CLSN2917188 Genevestigator:Q9LT78
            Uniprot:Q9LT78
        Length = 452

 Score = 876 (313.4 bits), Expect = 1.1e-87, P = 1.1e-87
 Identities = 165/318 (51%), Positives = 217/318 (68%)

Query:    38 LYERWRSHHTVS-RSLDEKHKRFNVFKQNVMHVHQTNKM-DKPYKLKLNKFADMTNHEFA 95
             +YERW   +  +   L EK +RF +FK N+  V + + + ++ Y++ L +FAD+TN EF 
Sbjct:    42 MYERWLVENRKNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFR 101

Query:    96 STYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFST 155
             + Y  SK++  R+    +G   ++Y    S+P ++DWR KG+V  VKDQG CGSCWAFS 
Sbjct:   102 AIYLRSKMERTRV--PVKGE-KYLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSA 158

Query:   156 IAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPY 215
             I AVEGIN I T +L+SLSEQELVDCDT  N GC GGLM+ AF+FI + GG+ TE  YPY
Sbjct:   159 IGAVEGINQIKTGELISLSEQELVDCDTSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPY 218

Query:   216 QAND-GTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEG 274
              A D   C+  K+++  V+IDG+E+VP N E +L KA+A QP+SVAI+AG   FQ Y+ G
Sbjct:   219 IATDVNVCNSDKKNTRVVTIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYTSG 278

Query:   275 VFTGECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGI 334
             VFTG CGT L+HGV AVGYG+   G  YWIVRNSWG  WGE GY +++R I +  G CG+
Sbjct:   279 VFTGTCGTSLDHGVVAVGYGSE-GGQDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGV 337

Query:   335 AMEASYPIKKSATNPTGP 352
             AM ASYP K S +NP  P
Sbjct:   338 AMMASYPTKSSGSNPPKP 355


>TAIR|locus:2152445
            symbol:SAG12 "senescence-associated gene 12" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
            [GO:0007568 "aging" evidence=IEP;TAS] [GO:0010150 "leaf senescence"
            evidence=IEP;TAS] [GO:0010282 "senescence-associated vacuole"
            evidence=IDA] [GO:0009817 "defense response to fungus, incompatible
            interaction" evidence=IEP] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:CP002688 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0010150 GO:GO:0009817 EMBL:AB016870
            HSSP:O65039 OMA:NDEQALM EMBL:AF370131 EMBL:AY040073 IPI:IPI00544181
            RefSeq:NP_568651.1 UniGene:At.75256 UniGene:At.7710
            ProteinModelPortal:Q9FJ47 SMR:Q9FJ47 IntAct:Q9FJ47 STRING:Q9FJ47
            MEROPS:C01.117 PRIDE:Q9FJ47 ProMEX:Q9FJ47 EnsemblPlants:AT5G45890.1
            GeneID:834629 KEGG:ath:AT5G45890 TAIR:At5g45890 InParanoid:Q9FJ47
            PhylomeDB:Q9FJ47 ProtClustDB:CLSN2917735 ArrayExpress:Q9FJ47
            Genevestigator:Q9FJ47 GO:GO:0010282 Uniprot:Q9FJ47
        Length = 346

 Score = 872 (312.0 bits), Expect = 2.9e-87, P = 2.9e-87
 Identities = 165/320 (51%), Positives = 218/320 (68%)

Query:    27 KELESEEGLWDLYERWRSHH-TVSRSLDEKHKRFNVFKQNVMHVHQTNKMD--KPYKLKL 83
             + L++E  +   +  W + H  V   + E++ R+ VFK NV  +   N +   + +KL +
Sbjct:    26 RPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAV 85

Query:    84 NKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTS--IPPSVDWRKKGSVTAV 141
             N+FAD+TN EF S Y G K       Q       F Y  V+S  +P SVDWRKKG+VT +
Sbjct:    86 NQFADLTNDEFRSMYTGFKGVSALSSQSQTKMSPFRYQNVSSGALPVSVDWRKKGAVTPI 145

Query:   142 KDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFI 201
             K+QG CG CWAFS +AA+EG   I   KL+SLSEQ+LVDCDT+ + GC GGLM+ AFE I
Sbjct:   146 KNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN-DFGCEGGLMDTAFEHI 204

Query:   202 KKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAI 261
             K  GG+TTE+ YPY+  D TC+  K +  A SI G+E+VP N E AL+KAVA QPVSV I
Sbjct:   205 KATGGLTTESNYPYKGEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAHQPVSVGI 264

Query:   262 DAGSSDFQFYSEGVFTGECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRM 321
             + G  DFQFYS GVFTGEC T L+H V A+GYG + +G+KYWI++NSWG +WGE GY+R+
Sbjct:   265 EGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGYMRI 324

Query:   322 QRGISDKKGLCGIAMEASYP 341
             Q+ + DK+GLCG+AM+ASYP
Sbjct:   325 QKDVKDKQGLCGLAMKASYP 344


>TAIR|locus:2090629
            symbol:AT3G19400 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0019344 "cysteine biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            EMBL:CP002686 GenomeReviews:BA000014_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AB025624
            MEROPS:I29.003 HOGENOM:HOG000230773 EMBL:AK118509 IPI:IPI00543468
            RefSeq:NP_566634.2 UniGene:At.38409 ProteinModelPortal:Q9LT77
            SMR:Q9LT77 PaxDb:Q9LT77 PRIDE:Q9LT77 EnsemblPlants:AT3G19400.1
            GeneID:821474 KEGG:ath:AT3G19400 TAIR:At3g19400 InParanoid:Q9LT77
            OMA:IGEHERR ProtClustDB:CLSN2679975 Genevestigator:Q9LT77
            Uniprot:Q9LT77
        Length = 362

 Score = 822 (294.4 bits), Expect = 5.8e-82, P = 5.8e-82
 Identities = 162/326 (49%), Positives = 213/326 (65%)

Query:    26 EKELE-SEEGLWDLYERWRSHHTVS-RSLDEKHKRFNVFKQNVMHVHQTNKM-DKPYKLK 82
             E E+E +E  +  +YE+W   +  +   L EK +RF +FK N+  V + N + D+ +++ 
Sbjct:    30 ETEIERNETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVG 89

Query:    83 LNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVK 142
             L +FAD+TN EF + Y   K++  +    T     ++Y +   +P  VDWR  G+V +VK
Sbjct:    90 LTRFADLTNEEFRAIYLRKKMERTKDSVKTE---RYLYKEGDVLPDEVDWRANGAVVSVK 146

Query:   143 DQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTD-QNQGCNGGLMELAFEFI 201
             DQG CGSCWAFS + AVEGIN I T +L+SLSEQELVDCD    N GC+GG+M  AFEFI
Sbjct:   147 DQGNCGSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFI 206

Query:   202 KKKGGVTTEAKYPYQAND-GTCDVSKESSP-AVSIDGHENVPANHEDALLKAVAKQPVSV 259
              K GG+ T+  YPY AND G C+  K ++   V+IDG+E+VP + E +L KAVA QPVSV
Sbjct:   207 MKNGGIETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSV 266

Query:   260 AIDAGSSDFQFYSEGVFTGECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYI 319
             AI+A S  FQ Y  GV TG CG  L+HGV  VGYG+T  G  YWI+RNSWG  WG+ GY+
Sbjct:   267 AIEASSQAFQLYKSGVMTGTCGISLDHGVVVVGYGST-SGEDYWIIRNSWGLNWGDSGYV 325

Query:   320 RMQRGISDKKGLCGIAMEASYPIKKS 345
             ++QR I D  G CGIAM  SYP K S
Sbjct:   326 KLQRNIDDPFGKCGIAMMPSYPTKSS 351


>TAIR|locus:2038515
            symbol:AT1G06260 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0048046 "apoplast"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0048046 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC025290
            MEROPS:I29.003 HSSP:O65039 HOGENOM:HOG000230773 OMA:METAFEF
            IPI:IPI00525965 PIR:D86198 RefSeq:NP_563764.1 UniGene:At.24617
            ProteinModelPortal:Q9LNC1 SMR:Q9LNC1 PaxDb:Q9LNC1 PRIDE:Q9LNC1
            EnsemblPlants:AT1G06260.1 GeneID:837137 KEGG:ath:AT1G06260
            TAIR:At1g06260 InParanoid:Q9LNC1 PhylomeDB:Q9LNC1
            ProtClustDB:CLSN2916975 Genevestigator:Q9LNC1 Uniprot:Q9LNC1
        Length = 343

 Score = 812 (290.9 bits), Expect = 6.6e-81, P = 6.6e-81
 Identities = 156/307 (50%), Positives = 204/307 (66%)

Query:    39 YERW-RSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFAST 97
             +E+W ++H  +    DE   RF +++ NV  +   N +  P+KL  N+FADMTN EF + 
Sbjct:    43 FEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEFKAH 102

Query:    98 YAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIA 157
             + G      R+ +  R     +     ++P +VDWR +G+VT +++QG+CG CWAFS +A
Sbjct:   103 FLGLNTSSLRLHKKQRP----VCDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVA 158

Query:   158 AVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEAKYPYQ 216
             A+EGIN I T  LVSLSEQ+L+DCD    N+GC+GGLME AFEFIK  GG+ TE  YPY 
Sbjct:   159 AIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKTNGGLATETDYPYT 218

Query:   217 ANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVF 276
               +GTCD  K  +  V+I G++ V A +E +L  A A+QPVSV IDAG   FQ YS GVF
Sbjct:   219 GIEGTCDQEKSKNKVVTIQGYQKV-AQNEASLQIAAAQQPVSVGIDAGGFIFQLYSSGVF 277

Query:   277 TGECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAM 336
             T  CGT LNHGV  VGYG   D  KYWIV+NSWG  WGE+GYIRM+RG+S+  G CGIAM
Sbjct:   278 TNYCGTNLNHGVTVVGYGVEGD-QKYWIVKNSWGTGWGEEGYIRMERGVSEDTGKCGIAM 336

Query:   337 EASYPIK 343
              ASYP++
Sbjct:   337 MASYPLQ 343


>TAIR|locus:2024362
            symbol:XBCP3 "xylem bark cysteine peptidase 3"
            species:3702 "Arabidopsis thaliana" [GO:0005576 "extracellular
            region" evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0005783 "endoplasmic
            reticulum" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005783 EMBL:CP002684 GO:GO:0005773 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 MEROPS:I29.003
            HOGENOM:HOG000230773 InterPro:IPR000118 Pfam:PF00396 SMART:SM00277
            UniGene:At.10233 OMA:CEIESAV EMBL:BT026490 EMBL:AK226753
            IPI:IPI00536687 RefSeq:NP_563855.1 ProteinModelPortal:Q0WVJ5
            SMR:Q0WVJ5 PRIDE:Q0WVJ5 EnsemblPlants:AT1G09850.1 GeneID:837517
            KEGG:ath:AT1G09850 TAIR:At1g09850 InParanoid:Q0WVJ5
            PhylomeDB:Q0WVJ5 ProtClustDB:CLSN2687747 Genevestigator:Q0WVJ5
            Uniprot:Q0WVJ5
        Length = 437

 Score = 799 (286.3 bits), Expect = 1.6e-79, P = 1.6e-79
 Identities = 156/325 (48%), Positives = 204/325 (62%)

Query:    31 SEEGLWDLYERW-RSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKM-DKPYKLKLNKFAD 88
             S + + +L++ W + H     S +E+ +R  +FK N   V Q N + +  Y L LN FAD
Sbjct:    24 SSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFAD 83

Query:    89 MTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCG 148
             +T+HEF ++  G  +    +   ++G    + G V  +P SVDWRKKG+VT VKDQG CG
Sbjct:    84 LTHHEFKASRLGLSVSAPSVIMASKGQS--LGGSV-KVPDSVDWRKKGAVTNVKDQGSCG 140

Query:   149 SCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVT 208
             +CW+FS   A+EGIN I+T  L+SLSEQEL+DCD   N GCNGGLM+ AFEF+ K  G+ 
Sbjct:   141 ACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGID 200

Query:   209 TEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDF 268
             TE  YPYQ  DGTC   K     V+ID +  V +N E AL++AVA QPVSV I      F
Sbjct:   201 TEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAF 260

Query:   269 QFYSEGVFTGECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDK 328
             Q YS G+F+G C T L+H V  VGYG+  +G  YWIV+NSWG  WG  G++ MQR   + 
Sbjct:   261 QLYSSGIFSGPCSTSLDHAVLIVGYGSQ-NGVDYWIVKNSWGKSWGMDGFMHMQRNTENS 319

Query:   329 KGLCGIAMEASYPIKKSATNPTGPS 353
              G+CGI M ASYPIK +  NP  PS
Sbjct:   320 DGVCGINMLASYPIK-THPNPPPPS 343


>TAIR|locus:2038588
            symbol:AT2G27420 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002685
            GenomeReviews:CT485783_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC006232
            MEROPS:I29.003 OMA:EEFRATH HOGENOM:HOG000230773 HSSP:P53634
            ProtClustDB:CLSN2688476 EMBL:AY064033 EMBL:AY096388 IPI:IPI00539752
            PIR:F84672 RefSeq:NP_565649.1 UniGene:At.27094
            ProteinModelPortal:Q9ZQH7 SMR:Q9ZQH7 PRIDE:Q9ZQH7
            EnsemblPlants:AT2G27420.1 GeneID:817287 KEGG:ath:AT2G27420
            TAIR:At2g27420 InParanoid:Q9ZQH7 PhylomeDB:Q9ZQH7
            ArrayExpress:Q9ZQH7 Genevestigator:Q9ZQH7 Uniprot:Q9ZQH7
        Length = 348

 Score = 799 (286.3 bits), Expect = 1.6e-79, P = 1.6e-79
 Identities = 153/313 (48%), Positives = 211/313 (67%)

Query:    39 YERWRSH-HTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDK-PYKLKLNKFADMTNHEFAS 96
             +E+W +  + V     EK  RFN+FK+N+  V   N  +K  YK+ +N+F+D+T+ EF +
Sbjct:    35 HEQWMARFNRVYSDETEKRNRFNIFKKNLEFVQNFNMNNKITYKVDINEFSDLTDEEFRA 94

Query:    97 TYAGSKIKHH--RMFQGTRGNGT--FMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWA 152
             T+ G  +     R+   + G  T  F YG V+    S+DWR++G+VT VK QG+CG CWA
Sbjct:    95 THTGLVVPEAITRISTLSSGKNTVPFRYGNVSDNGESMDWRQEGAVTPVKYQGRCGGCWA 154

Query:   153 FSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAK 212
             FS +AAVEGI  I   +LVSLSEQ+L+DCD D NQGC GG+M  AFE+I K  G+TTE  
Sbjct:   155 FSAVAAVEGITKITKGELVSLSEQQLLDCDRDYNQGCRGGIMSKAFEYIIKNQGITTEDN 214

Query:   213 YPYQANDGTCDVS---KESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQ 269
             YPYQ +  TC  S     S  A +I G+E VP N+E+ALL+AV++QPVSV I+   + F+
Sbjct:   215 YPYQESQQTCSSSTTLSSSFRAATISGYETVPMNNEEALLQAVSQQPVSVGIEGTGAAFR 274

Query:   270 FYSEGVFTGECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKK 329
              YS GVF GECGT+L+H V  VGYG + +GTKYW+V+NSWG  WGE GY+R++R +   +
Sbjct:   275 HYSGGVFNGECGTDLHHAVTIVGYGMSEEGTKYWVVKNSWGETWGENGYMRIKRDVDAPQ 334

Query:   330 GLCGIAMEASYPI 342
             G+CG+A+ A YP+
Sbjct:   335 GMCGLAILAFYPL 347


>TAIR|locus:2128243
            symbol:AT4G11310 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005618 "cell wall"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 GO:GO:0005618 EMBL:CP002687
            GenomeReviews:CT486007_GR EMBL:AL096882 EMBL:AL161531
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            HOGENOM:HOG000230773 KO:K01376 EMBL:AY093066 EMBL:BT000099
            IPI:IPI00520496 PIR:T13022 RefSeq:NP_567376.1 UniGene:At.43189
            ProteinModelPortal:Q9SUT0 SMR:Q9SUT0 IntAct:Q9SUT0 STRING:Q9SUT0
            MEROPS:C01.A20 PaxDb:Q9SUT0 PRIDE:Q9SUT0 EnsemblPlants:AT4G11310.1
            GeneID:826733 KEGG:ath:AT4G11310 TAIR:At4g11310 InParanoid:Q9SUT0
            OMA:EVCHGAD PhylomeDB:Q9SUT0 ProtClustDB:CLSN2689395
            Genevestigator:Q9SUT0 GermOnline:AT4G11310 Uniprot:Q9SUT0
        Length = 364

 Score = 792 (283.9 bits), Expect = 8.7e-79, P = 8.7e-79
 Identities = 151/310 (48%), Positives = 205/310 (66%)

Query:    38 LYERWR-SHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFAS 96
             ++E W   H  V  S+ EK +R  +F+ N+  ++  N  +  Y+L L  FAD++ HE+  
Sbjct:    48 IFESWMVKHGKVYGSVAEKERRLTIFEDNLRFINNRNAENLSYRLGLTGFADLSLHEYKE 107

Query:    97 TYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTI 156
                G+  +  R       +  +       +P SVDWR +G+VT VKDQG C SCWAFST+
Sbjct:   108 VCHGADPRPPRNHVFMTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTV 167

Query:   157 AAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQ 216
              AVEG+N I+T +LV+LSEQ+L++C+ + N GC GG +E A+EFI K GG+ T+  YPY+
Sbjct:   168 GAVEGLNKIVTGELVTLSEQDLINCNKENN-GCGGGKLETAYEFIMKNGGLGTDNDYPYK 226

Query:   217 ANDGTCDVS-KESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGV 275
             A +G CD   KE++  V IDG+EN+PAN E AL+KAVA QPV+  ID+ S +FQ Y  GV
Sbjct:   227 AVNGVCDGRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGV 286

Query:   276 FTGECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIA 335
             F G CGT LNHGV  VGYGT  +G  YW+V+NS G  WGE GY++M R I++ +GLCGIA
Sbjct:   287 FDGSCGTNLNHGVVVVGYGTE-NGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIA 345

Query:   336 MEASYPIKKS 345
             M ASYP+K S
Sbjct:   346 MRASYPLKNS 355


>TAIR|locus:2128253
            symbol:AT4G11320 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002687 GenomeReviews:CT486007_GR EMBL:AL096882 EMBL:AL161531
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 OMA:ICHGADP
            HOGENOM:HOG000230773 KO:K01376 ProtClustDB:CLSN2689395
            EMBL:AY035055 EMBL:AY051062 IPI:IPI00520480 PIR:T13023
            RefSeq:NP_567377.1 UniGene:At.25206 ProteinModelPortal:Q9SUS9
            SMR:Q9SUS9 STRING:Q9SUS9 MEROPS:C01.A21 PaxDb:Q9SUS9 PRIDE:Q9SUS9
            EnsemblPlants:AT4G11320.1 GeneID:826734 KEGG:ath:AT4G11320
            TAIR:At4g11320 InParanoid:Q9SUS9 PhylomeDB:Q9SUS9
            Genevestigator:Q9SUS9 GermOnline:AT4G11320 Uniprot:Q9SUS9
        Length = 371

 Score = 787 (282.1 bits), Expect = 3.0e-78, P = 3.0e-78
 Identities = 150/310 (48%), Positives = 204/310 (65%)

Query:    38 LYERWR-SHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFAS 96
             ++E W   H  V  S+ EK +R  +F+ N+  +   N  +  Y+L LN+FAD++ HE+  
Sbjct:    55 MFESWMVKHGKVYDSVAEKERRLTIFEDNLRFITNRNAENLSYRLGLNRFADLSLHEYGE 114

Query:    97 TYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTI 156
                G+  +  R       +  +       +P SVDWR +G+VT VKDQG C SCWAFST+
Sbjct:   115 ICHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTV 174

Query:   157 AAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQ 216
              AVEG+N I+T +LV+LSEQ+L++C+ + N GC GG +E A+EFI   GG+ T+  YPY+
Sbjct:   175 GAVEGLNKIVTGELVTLSEQDLINCNKENN-GCGGGKVETAYEFIMNNGGLGTDNDYPYK 233

Query:   217 ANDGTCDVS-KESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGV 275
             A +G C+   KE +  V IDG+EN+PAN E AL+KAVA QPV+  +D+ S +FQ Y  GV
Sbjct:   234 ALNGVCEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGV 293

Query:   276 FTGECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIA 335
             F G CGT LNHGV  VGYGT  +G  YWIV+NS G  WGE GY++M R I++ +GLCGIA
Sbjct:   294 FDGTCGTNLNHGVVVVGYGTE-NGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGLCGIA 352

Query:   336 MEASYPIKKS 345
             M ASYP+K S
Sbjct:   353 MRASYPLKNS 362


>TAIR|locus:2117979
            symbol:AT4G23520 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002687 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            KO:K01376 IPI:IPI00527171 RefSeq:NP_567686.2 UniGene:At.32421
            ProteinModelPortal:F4JNL3 SMR:F4JNL3 MEROPS:C01.A22 PRIDE:F4JNL3
            EnsemblPlants:AT4G23520.1 GeneID:828452 KEGG:ath:AT4G23520
            OMA:PANDEIS ArrayExpress:F4JNL3 Uniprot:F4JNL3
        Length = 356

 Score = 776 (278.2 bits), Expect = 4.3e-77, P = 4.3e-77
 Identities = 152/322 (47%), Positives = 209/322 (64%)

Query:    31 SEEGLWDLYERWRSHH--TVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFAD 88
             S E +  +++ W S H  T + +L EK +RF  FK N+  + Q N  +  Y+L L +FAD
Sbjct:    39 SNEEVEFIFQMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHNAKNLSYQLGLTRFAD 98

Query:    89 MTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCG 148
             +T  E+   + GS     R  + +R    ++      +P SVDWR++G+V+ +KDQG C 
Sbjct:    99 LTVQEYRDLFPGSPKPKQRNLKTSR---RYVPLAGDQLPESVDWRQEGAVSEIKDQGTCN 155

Query:   149 SCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNG-GLMELAFEFIKKKGGV 207
             SCWAFST+AAVEG+N I+T +L+SLSEQELVDC+   N GC G GLM+ AF+F+    G+
Sbjct:   156 SCWAFSTVAAVEGLNKIVTGELISLSEQELVDCNL-VNNGCYGSGLMDTAFQFLINNNGL 214

Query:   208 TTEAKYPYQANDGTCDVSKESS-PAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSS 266
              +E  YPYQ   G+C+  + +S   ++ID +E+VPAN E +L KAVA QPVSV +D  S 
Sbjct:   215 DSEKDYPYQGTQGSCNRKQSTSNKVITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQ 274

Query:   267 DFQFYSEGVFTGECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGIS 326
             +F  Y   ++ G CGT L+H +  VGYG+  +G  YWIVRNSWG  WG+ GYI++ R   
Sbjct:   275 EFMLYRSCIYNGPCGTNLDHALVIVGYGSE-NGQDYWIVRNSWGTTWGDAGYIKIARNFE 333

Query:   327 DKKGLCGIAMEASYPIKKSATN 348
             D KGLCGIAM ASYPIK SA+N
Sbjct:   334 DPKGLCGIAMLASYPIKNSASN 355


>TAIR|locus:2055440
            symbol:AT2G34080 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002685 GenomeReviews:CT485783_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 MEROPS:I29.003 EMBL:AC002341
            HOGENOM:HOG000230773 HSSP:P53634 IPI:IPI00530325 PIR:B84752
            RefSeq:NP_565780.1 UniGene:At.28613 UniGene:At.37859
            ProteinModelPortal:O22961 SMR:O22961 EnsemblPlants:AT2G34080.1
            GeneID:817969 KEGG:ath:AT2G34080 TAIR:At2g34080 InParanoid:O22961
            OMA:SENDYSY PhylomeDB:O22961 ProtClustDB:CLSN2688064
            ArrayExpress:O22961 Genevestigator:O22961 Uniprot:O22961
        Length = 345

 Score = 743 (266.6 bits), Expect = 1.4e-73, P = 1.4e-73
 Identities = 144/317 (45%), Positives = 205/317 (64%)

Query:    32 EEGLWDLYERWRSHHTVS-RSLDEKHKRFNVFKQNVMHVHQTNKM-DKPYKLKLNKFADM 89
             E+ + D +E+W +  +   R   EK+ R +VFK+N+  +   NK  +K YKL +N+FAD 
Sbjct:    32 EQSMVDKHEQWMARFSREYRDELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADW 91

Query:    90 TNHEFASTYAGSK----IKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQG 145
             TN EF + + G K    +   ++   T  + T+    +  +  S DWR +G+VT VK QG
Sbjct:    92 TNEEFLAIHTGLKGLTEVSPSKVVAKTISSQTWNVSDM--VVESKDWRAEGAVTPVKYQG 149

Query:   146 QCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKG 205
             QCG CWAFS +AAVEG+  I    LVSLSEQ+L+DCD + ++GC+GG+M  AF ++ +  
Sbjct:   150 QCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCDREYDRGCDGGIMSDAFNYVVQNR 209

Query:   206 GVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGS 265
             G+ +E  Y YQ +DG C     + PA  I G + VP+N+E ALL+AV++QPVSV++DA  
Sbjct:   210 GIASENDYSYQGSDGGC--RSNARPAARISGFQTVPSNNERALLEAVSRQPVSVSMDATG 267

Query:   266 SDFQFYSEGVFTGECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGI 325
               F  YS GV+ G CGT  NH V  VGYGT+ DGTKYW+ +NSWG  WGEKGYIR++R +
Sbjct:   268 DGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGTKYWLAKNSWGETWGEKGYIRIRRDV 327

Query:   326 SDKKGLCGIAMEASYPI 342
             +  +G+CG+A  A YP+
Sbjct:   328 AWPQGMCGVAQYAFYPV 344


>TAIR|locus:2029924
            symbol:AT1G29090 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002684 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            HOGENOM:HOG000230773 HSSP:P53634 ProtClustDB:CLSN2688064
            EMBL:BT004146 IPI:IPI00545702 RefSeq:NP_564321.2 UniGene:At.40814
            ProteinModelPortal:Q84W75 SMR:Q84W75 MEROPS:C01.A15
            EnsemblPlants:AT1G29090.1 GeneID:839784 KEGG:ath:AT1G29090
            TAIR:At1g29090 InParanoid:Q84W75 OMA:SIRGHED PhylomeDB:Q84W75
            ArrayExpress:Q84W75 Genevestigator:Q84W75 Uniprot:Q84W75
        Length = 355

 Score = 734 (263.4 bits), Expect = 1.2e-72, P = 1.2e-72
 Identities = 149/316 (47%), Positives = 204/316 (64%)

Query:    39 YERW--RSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKM-DKPYKLKLNKFADMTNHEFA 95
             +++W  R     S  L EK  RF+VFK+N+  + + NK  D+ YKL +N+FAD T  EF 
Sbjct:    47 HQQWMTRFSRVYSDEL-EKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTREEFI 105

Query:    96 STYAGSKIKH--------HRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQC 147
             +T+ G K  +          M      N + + G+ T      DWR +G+VT VK QGQC
Sbjct:   106 ATHTGLKGVNGIPSSEFVDEMIPSWNWNVSDVAGRETK-----DWRYEGAVTPVKYQGQC 160

Query:   148 GSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGV 207
             G CWAFS++AAVEG+  I+ N LVSLSEQ+L+DCD +++ GCNGG+M  AF +I K  G+
Sbjct:   161 GCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMSDAFSYIIKNRGI 220

Query:   208 TTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSD 267
              +EA YPYQA +GTC  +    P+  I G + VP+N+E ALL+AV+KQPVSV+IDA    
Sbjct:   221 ASEASYPYQAAEGTCRYN--GKPSAWIRGFQTVPSNNERALLEAVSKQPVSVSIDADGPG 278

Query:   268 FQFYSEGVFTGE-CGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGIS 326
             F  YS GV+    CGT +NH V  VGYGT+ +G KYW+ +NSWG  WGE GYIR++R ++
Sbjct:   279 FMHYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVA 338

Query:   327 DKKGLCGIAMEASYPI 342
               +G+CG+A  A YP+
Sbjct:   339 WPQGMCGVAQYAFYPV 354


>TAIR|locus:2082881
            symbol:AT3G49340 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002686
            GenomeReviews:BA000014_GR EMBL:AC012329 EMBL:AL132956
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 MEROPS:I29.003 HOGENOM:HOG000230773 HSSP:P07711
            KO:K01376 IPI:IPI00520642 PIR:T45839 RefSeq:NP_566920.1
            UniGene:At.53854 ProteinModelPortal:Q9SG15 SMR:Q9SG15
            EnsemblPlants:AT3G49340.1 GeneID:824096 KEGG:ath:AT3G49340
            TAIR:At3g49340 InParanoid:Q9SG15 OMA:PQNDEEA PhylomeDB:Q9SG15
            ProtClustDB:CLSN2688476 Genevestigator:Q9SG15 Uniprot:Q9SG15
        Length = 341

 Score = 734 (263.4 bits), Expect = 1.2e-72, P = 1.2e-72
 Identities = 143/309 (46%), Positives = 196/309 (63%)

Query:    39 YERWRSHHTVSRSLD-EKHKRFNVFKQNVMHVHQTN-KMDKPYKLKLNKFADMTNHEFAS 96
             +E+W S      S D EK  RF +F  N+  V   N   +K Y L +N+F+D+T+ EF +
Sbjct:    35 HEQWMSRFNRVYSDDSEKTSRFEIFTNNLKFVESINMNTNKTYTLDVNEFSDLTDEEFKA 94

Query:    97 TYAGSKIKHHRM-FQGTRGNGT--FMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAF 153
              Y G  +         T  + T  F Y  V     S+DW ++G+VT+VK Q QCG CWAF
Sbjct:    95 RYTGLVVPEGMTRISTTDSHETVSFRYENVGETGESMDWIQEGAVTSVKHQQQCGCCWAF 154

Query:   154 STIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKY 213
             S +AAVEG+  I   +LVSLSEQ+L+DC T+ N GC GG+M  AF++IK+  G+TTE  Y
Sbjct:   155 SAVAAVEGMTKIANGELVSLSEQQLLDCSTENN-GCGGGIMWKAFDYIKENQGITTEDNY 213

Query:   214 PYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSE 273
             PYQ    TC+ +  +  A +I G+E VP N E+ALLKAV++QPVSVAI+    +F  YS 
Sbjct:   214 PYQGAQQTCESNHLA--AATISGYETVPQNDEEALLKAVSQQPVSVAIEGSGYEFIHYSG 271

Query:   274 GVFTGECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCG 333
             G+F GECGT+L H V  VGYG + +G KYW+++NSWG  WGE GY+R+ R +   +G+CG
Sbjct:   272 GIFNGECGTQLTHAVTIVGYGVSEEGIKYWLLKNSWGESWGENGYMRIMRDVDSPQGMCG 331

Query:   334 IAMEASYPI 342
             +A  A YP+
Sbjct:   332 LASLAYYPV 340


>TAIR|locus:2029934
            symbol:AT1G29080 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002684 GenomeReviews:CT485782_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AC021043 MEROPS:I29.003 HOGENOM:HOG000230773
            HSSP:P53634 ProtClustDB:CLSN2688064 EMBL:DQ056468 IPI:IPI00521747
            PIR:C86413 RefSeq:NP_564320.1 UniGene:At.51814
            ProteinModelPortal:Q9LP39 SMR:Q9LP39 EnsemblPlants:AT1G29080.1
            GeneID:839783 KEGG:ath:AT1G29080 TAIR:At1g29080 InParanoid:Q9LP39
            OMA:KTWGENG PhylomeDB:Q9LP39 Genevestigator:Q9LP39 Uniprot:Q9LP39
        Length = 346

 Score = 711 (255.3 bits), Expect = 3.3e-70, P = 3.3e-70
 Identities = 140/313 (44%), Positives = 193/313 (61%)

Query:    37 DLYERWRSHHTVSRSLD---EKHKRFNVFKQNVMHVHQTNKM-DKPYKLKLNKFADMTNH 92
             D +++W      SR  D   EK  R  V  +N+  +   N M ++ YKL +N+F D T  
Sbjct:    37 DYHQQWMIQF--SRVYDDEFEKQLRLQVLTENLKFIESFNNMGNQSYKLGVNEFTDWTKE 94

Query:    93 EFASTYAGSK-IKHHRMFQGTRGNGTFMYGKVTSI-PPSVDWRKKGSVTAVKDQGQCGSC 150
             EF +TY G + +     F+            V+ +   + DWR +G+VT VK QG+CG C
Sbjct:    95 EFLATYTGLRGVNVTSPFEVVNETKPAWNWTVSDVLGTNKDWRNEGAVTPVKSQGECGGC 154

Query:   151 WAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTE 210
             WAFS IAAVEG+  I    L+SLSEQ+L+DC  +QN GC GG    AF +I K  G+++E
Sbjct:   155 WAFSAIAAVEGLTKIARGNLISLSEQQLLDCTREQNNGCKGGTFVNAFNYIIKHRGISSE 214

Query:   211 AKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQF 270
              +YPYQ  +G C     + PA+ I G ENVP+N+E ALL+AV++QPV+VAIDA  + F  
Sbjct:   215 NEYPYQVKEGPC--RSNARPAILIRGFENVPSNNERALLEAVSRQPVAVAIDASEAGFVH 272

Query:   271 YSEGVFTGE-CGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKK 329
             YS GV+    CGT +NH V  VGYGT+ +G KYW+ +NSWG  WGE GYIR++R +   +
Sbjct:   273 YSGGVYNARNCGTSVNHAVTLVGYGTSPEGMKYWLAKNSWGKTWGENGYIRIRRDVEWPQ 332

Query:   330 GLCGIAMEASYPI 342
             G+CG+A  ASYP+
Sbjct:   333 GMCGVAQYASYPV 345


>UNIPROTKB|F1NYJ1
            symbol:CTSL2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 IPI:IPI00602255
            OMA:DITHHEF EMBL:AADN02067812 Ensembl:ENSGALT00000020588
            ArrayExpress:F1NYJ1 Uniprot:F1NYJ1
        Length = 339

 Score = 695 (249.7 bits), Expect = 1.7e-68, P = 1.7e-68
 Identities = 153/319 (47%), Positives = 199/319 (62%)

Query:    36 WDLYERWRSHHTVSRSLDEKHKRFNVFKQNV--MHVHQTN-KMDK-PYKLKLNKFADMTN 91
             W L++ W S     R  +E  +R  V+++N+  + +H  +  + K  YKL +N+F DMT 
Sbjct:    30 WQLWKSWHSKDYHER--EESWRRV-VWEKNLKMIELHNLDHSLGKHSYKLGMNQFGDMTA 86

Query:    92 HEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCW 151
              EF     G   KH +  +  RG+  F+       P SVDWR+KG VT VKDQGQCGSCW
Sbjct:    87 EEFRQLMNG--YKHKKSERKYRGS-QFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCW 143

Query:   152 AFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTE 210
             AFST  A+EG +   T KLVSLSEQ LVDC   + NQGCNGGLM+ AF++++  GG+ +E
Sbjct:   144 AFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSE 203

Query:   211 AKYPYQANDGT-CDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDF 268
               YPY A D   C    E + A +  G  ++P  HE AL+KAVA   PVSVAIDAG S F
Sbjct:   204 ESYPYTAKDDEDCRYKAEYN-AANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGHSSF 262

Query:   269 QFYSEGVF-TGECGTE-LNHGVAAVGYG---TTLDGTKYWIVRNSWGPEWGEKGYIRMQR 323
             QFY  G++   +C +E L+HGV  VGYG     +DG KYWIV+NSWG +WG+KGYI M +
Sbjct:   263 QFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAK 322

Query:   324 GISDKKGLCGIAMEASYPI 342
                D+K  CGIA  ASYP+
Sbjct:   323 ---DRKNHCGIATAASYPL 338


>TAIR|locus:2097104
            symbol:AT3G43960 species:3702 "Arabidopsis thaliana"
            [GO:0005886 "plasma membrane" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0031225 "anchored to
            membrane" evidence=TAS] [GO:0048767 "root hair elongation"
            evidence=IMP] [GO:0016132 "brassinosteroid biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:CP002686
            GenomeReviews:BA000014_GR GO:GO:0031225 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0048767 MEROPS:I29.003 HOGENOM:HOG000230773
            EMBL:AL163975 EMBL:AK118634 IPI:IPI00526842 PIR:T48950
            RefSeq:NP_566867.1 UniGene:At.43352 ProteinModelPortal:Q9LXW3
            SMR:Q9LXW3 STRING:Q9LXW3 PaxDb:Q9LXW3 PRIDE:Q9LXW3
            EnsemblPlants:AT3G43960.1 GeneID:823513 KEGG:ath:AT3G43960
            TAIR:At3g43960 eggNOG:NOG286334 InParanoid:Q9LXW3 KO:K01376
            OMA:MAISFRT PhylomeDB:Q9LXW3 ProtClustDB:CLSN2917367
            Genevestigator:Q9LXW3 GermOnline:AT3G43960 Uniprot:Q9LXW3
        Length = 376

 Score = 687 (246.9 bits), Expect = 1.2e-67, P = 1.2e-67
 Identities = 145/336 (43%), Positives = 203/336 (60%)

Query:    28 ELESEEG-LWDLYERWRSHHTVS-RSLDEKHKRFNVFKQNVMHVHQTNK-MDKPYKLKLN 84
             E +  EG +  +YE+W   +  +   L EK +RF +FK N+  + + N   ++ Y+  LN
Sbjct:    29 ESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLN 88

Query:    85 KFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTA-VKD 143
             KF+D+T  EF ++Y G K++   +         + Y +   +P  VDWR++G+V   VK 
Sbjct:    89 KFSDLTADEFQASYLGGKMEKKSLSDVAE---RYQYKEGDVLPDEVDWRERGAVVPRVKR 145

Query:   144 QGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDT-DQNQGCNGGLMELAFEFIK 202
             QG+CGSCWAF+   AVEGIN I T +LVSLSEQEL+DCD  + N GC GG    AFEFIK
Sbjct:   146 QGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIK 205

Query:   203 KKGGVTTEAKYPYQAND-GTCD-VSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVA 260
             + GG+ ++  Y Y   D   C  +  +++  V+I+GHE VP N E +L KAVA QP+SV 
Sbjct:   206 ENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVM 265

Query:   261 IDAGS-SDFQFYSEGVFTGECGTEL-NHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGY 318
             I A + SD   Y  GV+ G C     +H V  VGYGT+ D   YW++RNSWGPEWGE GY
Sbjct:   266 ISAANMSD---YKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGY 322

Query:   319 IRMQRGISDKKGLCGIAMEASYPIKK-SATNPTGPS 353
             +R+QR   +  G C +A+   YPIK  S+++   PS
Sbjct:   323 LRLQRNFHEPTGKCAVAVAPVYPIKSNSSSHLLSPS 358


>ZFIN|ZDB-GENE-030131-106
            symbol:ctsl1a "cathepsin L, 1 a" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030131-106 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 HSSP:P43235
            KO:K01365 EMBL:BC066490 IPI:IPI00495935 RefSeq:NP_997749.1
            UniGene:Dr.104499 ProteinModelPortal:Q6NYR5 SMR:Q6NYR5
            MEROPS:C01.074 PRIDE:Q6NYR5 GeneID:321453 KEGG:dre:321453
            CTD:321453 InParanoid:Q6NYR5 NextBio:20807387 ArrayExpress:Q6NYR5
            Bgee:Q6NYR5 Uniprot:Q6NYR5
        Length = 337

 Score = 669 (240.6 bits), Expect = 9.4e-66, P = 9.4e-66
 Identities = 144/323 (44%), Positives = 195/323 (60%)

Query:    32 EEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTN---KMD-KPYKLKLNKFA 87
             ++ L D +++W+  H+      E+  R  ++++N+  +   N    M    Y+L +N F 
Sbjct:    22 DQQLNDHWDQWKKWHSKKYHATEEGWRRVIWEKNLKKIEMHNLEHSMGIHTYRLGMNHFG 81

Query:    88 DMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQC 147
             DMT+ EF     G K K  R F+G+     FM      +P  +DWR+KG VT VKDQG+C
Sbjct:    82 DMTHEEFRQVMNGFKHKKDRRFRGS----LFMEPNFIEVPNKLDWREKGYVTPVKDQGEC 137

Query:   148 GSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGG 206
             GSCWAFST  A+EG     T KLVSLSEQ LVDC   + N+GCNGGLM+ AF+++K + G
Sbjct:   138 GSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDQNG 197

Query:   207 VTTEAKYPYQANDGT-CDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAG 264
             + +E  YPY   D   C    ++S A +  G  ++P+  E AL+KA+A   PVSVAIDAG
Sbjct:   198 LDSEESYPYLGTDDQPCHFDPKNS-AANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAG 256

Query:   265 SSDFQFYSEGVF-TGECGTE-LNHGVAAVGYG---TTLDGTKYWIVRNSWGPEWGEKGYI 319
                FQFY  G++   EC +E L+HGV AVGYG     +DG KYWIV+NSW   WG+KGYI
Sbjct:   257 HESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVDGKKYWIVKNSWSENWGDKGYI 316

Query:   320 RMQRGISDKKGLCGIAMEASYPI 342
              M +   D+   CGIA  ASYP+
Sbjct:   317 YMAK---DRHNHCGIATAASYPL 336


>DICTYBASE|DDB_G0279799
            symbol:cprB "cysteine proteinase 2" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0279799 GenomeReviews:CM000152_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            MEROPS:I29.003 KO:K01365 EMBL:AAFI02000033 EMBL:M16039 EMBL:X03344
            PIR:A25439 RefSeq:XP_641494.1 ProteinModelPortal:P04989 SMR:P04989
            EnsemblProtists:DDB0214998 GeneID:8622234 KEGG:ddi:DDB_G0279799
            OMA:YVNITAG Uniprot:P04989
        Length = 376

 Score = 539 (194.8 bits), Expect = 3.7e-64, Sum P(2) = 3.7e-64
 Identities = 113/268 (42%), Positives = 154/268 (57%)

Query:    31 SEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTN-KMDKPYKLKLNKFADM 89
             SE      +  W        S  E   R+++FK N+ +V   N K D    L LN FAD+
Sbjct:    28 SESQYRTAFTEWTLKFNRQYSSSEFSNRYSIFKSNMDYVDNWNSKGDSQTVLGLNNFADI 87

Query:    90 TNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGS 149
             TN E+  TY G+++  H  + G  G        + + P S+DWR K +VT +KDQGQCGS
Sbjct:    88 TNEEYRKTYLGTRVNAHS-YNGYDGREVLNVEDLQTNPKSIDWRTKNAVTPIKDQGQCGS 146

Query:   150 CWAFSTIAAVEGINHIMTNKLVSLSEQELVDCD-TDQNQGCNGGLMELAFEFIKKKGGVT 208
             CW+FST  + EG + + T KLVSLSEQ LVDC   ++N GC+GGLM  AF++I K  G+ 
Sbjct:   147 CWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAFDYIIKNKGID 206

Query:   209 TEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDF 268
             TE+ YPY A  G+  +  +S    +I G+ N+ A  E +L       PVSVAIDA  + F
Sbjct:   207 TESSYPYTAETGSTCLFNKSDIGATIKGYVNITAGSEISLENGAQHGPVSVAIDASHNSF 266

Query:   269 QFYSEGVF-TGECG-TELNHGVAAVGYG 294
             Q Y+ G++   +C  TEL+HGV  VGYG
Sbjct:   267 QLYTSGIYYEPKCSPTELDHGVLVVGYG 294

 Score = 133 (51.9 bits), Expect = 3.7e-64, Sum P(2) = 3.7e-64
 Identities = 24/41 (58%), Positives = 29/41 (70%)

Query:   302 YWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPI 342
             YWIV+NSWG  WG KGYI M +   D+K  CGIA  +SYP+
Sbjct:   338 YWIVKNSWGTSWGIKGYILMSK---DRKNNCGIASVSSYPL 375


>FB|FBgn0013770
            symbol:Cp1 "Cysteine proteinase-1" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS;NAS] [GO:0030163 "protein catabolic process"
            evidence=NAS] [GO:0005764 "lysosome" evidence=NAS] [GO:0048102
            "autophagic cell death" evidence=IEP] [GO:0035071 "salivary gland
            cell autophagic cell death" evidence=IEP] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0045169 "fusome" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:AE013599 GO:GO:0007586 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0035071 GO:GO:0045169 GeneTree:ENSGT00660000095458 KO:K01365
            EMBL:U75652 EMBL:AF012089 EMBL:BT016071 EMBL:D31970
            RefSeq:NP_523735.2 RefSeq:NP_725347.1 RefSeq:NP_725348.1
            UniGene:Dm.7400 ProteinModelPortal:Q95029 SMR:Q95029 IntAct:Q95029
            MINT:MINT-814156 STRING:Q95029 MEROPS:C01.092 PaxDb:Q95029
            EnsemblMetazoa:FBtr0087593 GeneID:36546 KEGG:dme:Dmel_CG6692
            CTD:36546 FlyBase:FBgn0013770 InParanoid:Q95029 OMA:ICHGADP
            OrthoDB:EOG46M91C PhylomeDB:Q95029 GenomeRNAi:36546 NextBio:799136
            Bgee:Q95029 GermOnline:CG6692 Uniprot:Q95029
        Length = 371

 Score = 651 (234.2 bits), Expect = 7.6e-64, P = 7.6e-64
 Identities = 142/318 (44%), Positives = 190/318 (59%)

Query:    40 ERWRSHHTVSRS--LDEKHKRFN--VFKQNVMHVHQTNKM---DK-PYKLKLNKFADMTN 91
             E W +     R    DE  +RF   +F +N   + + N+     K  +KL +NK+AD+ +
Sbjct:    57 EEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLH 116

Query:    92 HEFASTYAGSKIKHHRMFQGTRGN--G-TFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCG 148
             HEF     G     H+  +    +  G TF+     ++P SVDWR KG+VTAVKDQG CG
Sbjct:   117 HEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCG 176

Query:   149 SCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGV 207
             SCWAFS+  A+EG +   +  LVSLSEQ LVDC T   N GCNGGLM+ AF +IK  GG+
Sbjct:   177 SCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGI 236

Query:   208 TTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSS 266
              TE  YPY+A D +C  +K +  A    G  ++P   E  + +AVA   PVSVAIDA   
Sbjct:   237 DTEKSYPYEAIDDSCHFNKGTVGATD-RGFTDIPQGDEKKMAEAVATVGPVSVAIDASHE 295

Query:   267 DFQFYSEGVFTG-ECGTE-LNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRG 324
              FQFYSEGV+   +C  + L+HGV  VG+GT   G  YW+V+NSWG  WG+KG+I+M R 
Sbjct:   296 SFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLR- 354

Query:   325 ISDKKGLCGIAMEASYPI 342
               +K+  CGIA  +SYP+
Sbjct:   355 --NKENQCGIASASSYPL 370


>UNIPROTKB|F1S4J6
            symbol:Ssc.54235 "Cathepsin L1" species:9823 "Sus scrofa"
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197
            GO:GO:0002250 GeneTree:ENSGT00560000076577 GO:GO:0097067
            EMBL:CU571031 RefSeq:XP_003130681.1 Ensembl:ENSSSCT00000011983
            GeneID:100515919 KEGG:ssc:100515919 OMA:IAICATK Uniprot:F1S4J6
        Length = 332

 Score = 651 (234.2 bits), Expect = 7.6e-64, P = 7.6e-64
 Identities = 144/336 (42%), Positives = 200/336 (59%)

Query:    17 GIVEGFDFHEKELESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTN--- 73
             GI      H+  L+++   W  Y +W++ H     L+E+ +R  ++++N+  + + N   
Sbjct:    13 GIASAAPRHDHSLDAD---W--Y-KWKATHRKLYGLNEEGRRRAIWEKNMKMIERHNWEH 66

Query:    74 KMDK-PYKLKLNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDW 132
             +  K  + + +N F DMTN EF  T  G + + H+     +G   F+       P SVDW
Sbjct:    67 RQGKHSFTMAMNAFGDMTNEEFRKTMNGFQNQKHK-----KGK-VFLDAGSALTPHSVDW 120

Query:   133 RKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNG 191
             R+KG VTAVK+QG CGSCWAFS   A+EG     T+KL+SLSEQ LVDC   + N+GCNG
Sbjct:   121 REKGYVTAVKNQGHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSWPEGNEGCNG 180

Query:   192 GLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKA 251
             GLM+ AF++IK  GG+ +E  YPY   DG+C    +SS A +  G+ ++P   E AL+KA
Sbjct:   181 GLMDNAFQYIKDNGGLDSEESYPYFGKDGSCKYKPQSS-AANDTGYVDIP-KQEKALMKA 238

Query:   252 VAKQ-PVSVAIDAGSSDFQFYSEGV-FTGECGTE-LNHGVAAVGYGT--TLDGTKYWIVR 306
             VA   P+SV IDA    FQFYS G+ F  +C +E L+HGV  VGYG        KYW+V+
Sbjct:   239 VATVGPISVGIDASHESFQFYSTGIYFEPQCSSEDLDHGVLVVGYGVEGAHSNNKYWLVK 298

Query:   307 NSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPI 342
             NSWG  WG  GYI+M +   D+   CGIA  ASYP+
Sbjct:   299 NSWGNTWGMDGYIKMTK---DQNNHCGIATMASYPV 331


>UNIPROTKB|Q28944
            symbol:CTSL1 "Cathepsin L1" species:9823 "Sus scrofa"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 KO:K01365 OrthoDB:EOG48PMKF MEROPS:C01.032
            CTD:1514 EMBL:D37917 EMBL:AJ315771 PIR:A58195 RefSeq:NP_999057.1
            UniGene:Ssc.54036 ProteinModelPortal:Q28944 SMR:Q28944
            STRING:Q28944 Ensembl:ENSSSCT00000012233 GeneID:396926
            KEGG:ssc:396926 OMA:DASETGK ArrayExpress:Q28944 Uniprot:Q28944
        Length = 334

 Score = 651 (234.2 bits), Expect = 7.6e-64, P = 7.6e-64
 Identities = 142/317 (44%), Positives = 194/317 (61%)

Query:    37 DLYERWRSHHTVSRSLDEKHKRFNVFKQNV----MHVHQTNKMDKPYKLKLNKFADMTNH 92
             D Y +W++ H     ++E+  R  V+++N+    +H  + ++    + + +N F DMTN 
Sbjct:    28 DWY-KWKATHGRLYGMNEEGWRRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAFGDMTNE 86

Query:    93 EFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWA 152
             EF     G + + H+     +G   F    V  +P SVDWR+KG VTAVK+QGQCGSCWA
Sbjct:    87 EFRQVMNGFQNQKHK-----KGK-VFHESLVLEVPKSVDWREKGYVTAVKNQGQCGSCWA 140

Query:   153 FSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEA 211
             FS   A+EG     T KLVSLSEQ LVDC   Q NQGCNGGLM+ AF+++K  GG+ TE 
Sbjct:   141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEE 200

Query:   212 KYPYQAND-GTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQ 269
              YPY   +  +C    E S A +  G  ++P   E AL+KAVA   P+SVAIDAG S FQ
Sbjct:   201 SYPYLGRETNSCTYKPECS-AANDTGFVDIP-QREKALMKAVATVGPISVAIDAGHSSFQ 258

Query:   270 FYSEGVFTG-ECGT-ELNHGVAAVGYG---TTLDGTKYWIVRNSWGPEWGEKGYIRMQRG 324
             FY  G++   +C + +L+HGV  VGYG   T  + +K+WIV+NSWGPEWG  GY++M + 
Sbjct:   259 FYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGPEWGWNGYVKMAK- 317

Query:   325 ISDKKGLCGIAMEASYP 341
               D+   CGI+  ASYP
Sbjct:   318 --DQNNHCGISTAASYP 332


>UNIPROTKB|P25975
            symbol:CTSL1 "Cathepsin L1" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 EMBL:X91755 EMBL:BC102312 EMBL:AB017648
            IPI:IPI00687440 PIR:S15845 RefSeq:NP_776457.1 UniGene:Bt.3987
            ProteinModelPortal:P25975 SMR:P25975 STRING:P25975
            Ensembl:ENSBTAT00000022710 Ensembl:ENSBTAT00000036427 GeneID:281108
            KEGG:bta:281108 CTD:1515 InParanoid:P25975 KO:K01365 OMA:EEFRATH
            OrthoDB:EOG48PMKF BindingDB:P25975 ChEMBL:CHEMBL2113
            NextBio:20805179 ArrayExpress:P25975 Uniprot:P25975
        Length = 334

 Score = 645 (232.1 bits), Expect = 3.3e-63, P = 3.3e-63
 Identities = 140/315 (44%), Positives = 192/315 (60%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNV----MHVHQTNKMDKPYKLKLNKFADMTNHEF 94
             + +W++ H     ++E+  R  V+++N     +H  + ++    +++ +N F DMTN EF
Sbjct:    29 WHQWKATHRRLYGMNEEEWRRAVWEKNKKIIDLHNQEYSEGKHGFRMAMNAFGDMTNEEF 88

Query:    95 ASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFS 154
                  G + + H+     +G   F    +  +P SVDW KKG VT VK+QGQCGSCWAFS
Sbjct:    89 RQVMNGFQNQKHK-----KGK-LFHEPLLVDVPKSVDWTKKGYVTPVKNQGQCGSCWAFS 142

Query:   155 TIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEAKY 213
                A+EG     T KLVSLSEQ LVDC   Q NQGCNGGLM+ AF++IK  GG+ +E  Y
Sbjct:   143 ATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGGLDSEESY 202

Query:   214 PYQAND-GTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFY 271
             PY A D  +C+   E S A +  G  ++P   E AL+KAVA   P+SVAIDAG + FQFY
Sbjct:   203 PYLATDTNSCNYKPECS-AANDTGFVDIP-QREKALMKAVATVGPISVAIDAGHTSFQFY 260

Query:   272 SEGVFTG-ECGT-ELNHGVAAVGYG---TTLDGTKYWIVRNSWGPEWGEKGYIRMQRGIS 326
               G++   +C + +L+HGV  VGYG   T  +  K+WIV+NSWGPEWG  GY++M +   
Sbjct:   261 KSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGWNGYVKMAK--- 317

Query:   327 DKKGLCGIAMEASYP 341
             D+   CGIA  ASYP
Sbjct:   318 DQNNHCGIATAASYP 332


>RGD|2448
            symbol:Ctsl1 "cathepsin L1" species:10116 "Rattus norvegicus"
          [GO:0002250 "adaptive immune response" evidence=ISO] [GO:0004177
          "aminopeptidase activity" evidence=IDA] [GO:0004197 "cysteine-type
          endopeptidase activity" evidence=ISO;IDA] [GO:0005576 "extracellular
          region" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IDA]
          [GO:0005764 "lysosome" evidence=ISO;IDA] [GO:0005773 "vacuole"
          evidence=IDA] [GO:0005902 "microvillus" evidence=IDA] [GO:0006508
          "proteolysis" evidence=IEP;ISO] [GO:0007154 "cell communication"
          evidence=IDA] [GO:0007283 "spermatogenesis" evidence=IEP] [GO:0008234
          "cysteine-type peptidase activity" evidence=ISO] [GO:0008584 "male
          gonad development" evidence=IEP] [GO:0009267 "cellular response to
          starvation" evidence=IEP] [GO:0009749 "response to glucose stimulus"
          evidence=IEP] [GO:0009897 "external side of plasma membrane"
          evidence=IDA] [GO:0010259 "multicellular organismal aging"
          evidence=IEP] [GO:0014070 "response to organic cyclic compound"
          evidence=IEP] [GO:0021675 "nerve development" evidence=IEP]
          [GO:0030984 "kininogen binding" evidence=IPI] [GO:0032403 "protein
          complex binding" evidence=IPI] [GO:0034698 "response to gonadotropin
          stimulus" evidence=IEP] [GO:0042277 "peptide binding" evidence=IDA]
          [GO:0042393 "histone binding" evidence=ISO] [GO:0043005 "neuron
          projection" evidence=IDA] [GO:0043204 "perikaryon" evidence=IDA]
          [GO:0046697 "decidualization" evidence=IEP] [GO:0048102 "autophagic
          cell death" evidence=IEP] [GO:0051384 "response to glucocorticoid
          stimulus" evidence=IEP] [GO:0060008 "Sertoli cell differentiation"
          evidence=IEP] [GO:0097067 "cellular response to thyroid hormone
          stimulus" evidence=ISO] [GO:0030141 "secretory granule" evidence=IDA]
          [GO:0045177 "apical part of cell" evidence=IDA] [GO:0060441
          "epithelial tube branching involved in lung morphogenesis"
          evidence=ISO] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
          PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:Y00697 RGD:2448
          GO:GO:0005576 GO:GO:0009897 GO:GO:0034698 GO:GO:0043204 GO:GO:0009749
          GO:GO:0051384 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
          InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
          PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
          PROSITE:PS00139 PROSITE:PS00639 GO:GO:0043005 GO:GO:0007283
          GO:GO:0004177 GO:GO:0005764 GO:GO:0042277 GO:GO:0009267 GO:GO:0021675
          GO:GO:0005902 GO:GO:0010259 GO:GO:0004197 GO:GO:0048102 GO:GO:0046697
          GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 KO:K01365
          OrthoDB:EOG48PMKF MEROPS:C01.032 OMA:FDQNLDT CTD:1514
          BRENDA:3.4.22.15 GO:GO:0060008 EMBL:AF025476 EMBL:BC063175
          EMBL:S85184 IPI:IPI00326070 PIR:S07098 RefSeq:NP_037288.1
          UniGene:Rn.1294 ProteinModelPortal:P07154 SMR:P07154 IntAct:P07154
          STRING:P07154 PhosphoSite:P07154 PRIDE:P07154
          Ensembl:ENSRNOT00000025462 GeneID:25697 KEGG:rno:25697 UCSC:RGD:2448
          InParanoid:P07154 SABIO-RK:P07154 BindingDB:P07154 ChEMBL:CHEMBL2305
          NextBio:607715 Genevestigator:P07154 GermOnline:ENSRNOG00000018566
          Uniprot:P07154
        Length = 334

 Score = 644 (231.8 bits), Expect = 4.2e-63, P = 4.2e-63
 Identities = 139/316 (43%), Positives = 193/316 (61%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNV----MHVHQTNKMDKPYKLKLNKFADMTNHEF 94
             + +W+S H      +E+  R  V+++N+    +H  + +     + +++N F DMTN EF
Sbjct:    29 WHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTNEEF 88

Query:    95 ASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFS 154
                  G + + H+     +G   F    +  IP +VDWR+KG VT VK+QGQCGSCWAFS
Sbjct:    89 RQIVNGYRHQKHK-----KGR-LFQEPLMLQIPKTVDWREKGCVTPVKNQGQCGSCWAFS 142

Query:   155 TIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEAKY 213
                 +EG   + T KL+SLSEQ LVDC  DQ NQGCNGGLM+ AF++IK+ GG+ +E  Y
Sbjct:   143 ASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESY 202

Query:   214 PYQANDGTCDVSKESSPAVSID-GHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFY 271
             PY+A DG+C    E   AV+ D G  ++P   E AL+KAVA   P+SVA+DA     QFY
Sbjct:   203 PYEAKDGSCKYRAEY--AVANDTGFVDIP-QQEKALMKAVATVGPISVAMDASHPSLQFY 259

Query:   272 SEGVF-TGECGT-ELNHGVAAVGYG---TTLDGTKYWIVRNSWGPEWGEKGYIRMQRGIS 326
             S G++    C + +L+HGV  VGYG   T  +  KYW+V+NSWG EWG  GYI++ +   
Sbjct:   260 SSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAK--- 316

Query:   327 DKKGLCGIAMEASYPI 342
             D+   CG+A  ASYPI
Sbjct:   317 DRNNHCGLATAASYPI 332


>MGI|MGI:88564
            symbol:Ctsl "cathepsin L" species:10090 "Mus musculus"
            [GO:0004177 "aminopeptidase activity" evidence=ISO] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISO] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005730 "nucleolus"
            evidence=NAS] [GO:0005737 "cytoplasm" evidence=ISO] [GO:0005764
            "lysosome" evidence=ISO] [GO:0005773 "vacuole" evidence=ISO]
            [GO:0005902 "microvillus" evidence=ISO] [GO:0006508 "proteolysis"
            evidence=ISO;IDA] [GO:0007154 "cell communication" evidence=ISO]
            [GO:0008152 "metabolic process" evidence=TAS] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISO;TAS] [GO:0009897 "external side of
            plasma membrane" evidence=ISO] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0030141 "secretory granule" evidence=ISO]
            [GO:0030984 "kininogen binding" evidence=ISO] [GO:0032403 "protein
            complex binding" evidence=ISO] [GO:0042277 "peptide binding"
            evidence=ISO] [GO:0042393 "histone binding" evidence=ISO;NAS]
            [GO:0043005 "neuron projection" evidence=ISO] [GO:0043204
            "perikaryon" evidence=ISO] [GO:0045177 "apical part of cell"
            evidence=ISO] [GO:0048863 "stem cell differentiation" evidence=NAS]
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IMP] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:88564 GO:GO:0005730 GO:GO:0009897 GO:GO:0034698
            GO:GO:0043204 GO:GO:0009749 GO:GO:0030141 GO:GO:0048863
            GO:GO:0051384 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045177 GO:GO:0043005
            GO:GO:0007283 GO:GO:0004177 GO:GO:0005764 GO:GO:0042277
            GO:GO:0009267 GO:GO:0021675 GO:GO:0042393 GO:GO:0005902
            GO:GO:0010259 GO:GO:0004197 GO:GO:0048102 GO:GO:0046697
            HOVERGEN:HBG011513 KO:K01365 OMA:EEFRATH OrthoDB:EOG48PMKF
            MEROPS:C01.032 BRENDA:3.4.22.15 ChiTaRS:CTSL1 EMBL:X06086
            EMBL:J02583 EMBL:M20495 EMBL:AF121837 EMBL:AF121838 EMBL:AF121839
            EMBL:BC068163 EMBL:X04392 IPI:IPI00128154 PIR:S01177
            RefSeq:NP_034114.1 UniGene:Mm.930 PDB:1MVV PDBsum:1MVV
            ProteinModelPortal:P06797 SMR:P06797 STRING:P06797
            PhosphoSite:P06797 PaxDb:P06797 PRIDE:P06797
            Ensembl:ENSMUST00000021933 GeneID:13039 KEGG:mmu:13039 CTD:13039
            InParanoid:P06797 BioCyc:MetaCyc:MONOMER-14812 BindingDB:P06797
            ChEMBL:CHEMBL5291 NextBio:282928 Bgee:P06797 CleanEx:MM_CTSL
            Genevestigator:P06797 GermOnline:ENSMUSG00000021477 GO:GO:0060008
            Uniprot:P06797
        Length = 334

 Score = 640 (230.4 bits), Expect = 1.1e-62, P = 1.1e-62
 Identities = 137/316 (43%), Positives = 193/316 (61%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNV----MHVHQTNKMDKPYKLKLNKFADMTNHEF 94
             + +W+S H      +E+  R  ++++N+    +H  + +     + +++N F DMTN EF
Sbjct:    29 WHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEF 88

Query:    95 ASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFS 154
                  G + + H+     +G   F    +  IP SVDWR+KG VT VK+QGQCGSCWAFS
Sbjct:    89 RQVVNGYRHQKHK-----KGR-LFQEPLMLKIPKSVDWREKGCVTPVKNQGQCGSCWAFS 142

Query:   155 TIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEAKY 213
                 +EG   + T KL+SLSEQ LVDC   Q NQGCNGGLM+ AF++IK+ GG+ +E  Y
Sbjct:   143 ASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESY 202

Query:   214 PYQANDGTCDVSKESSPAVSID-GHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFY 271
             PY+A DG+C    E   AV+ D G  ++P   E AL+KAVA   P+SVA+DA     QFY
Sbjct:   203 PYEAKDGSCKYRAEF--AVANDTGFVDIP-QQEKALMKAVATVGPISVAMDASHPSLQFY 259

Query:   272 SEGVF-TGECGTE-LNHGVAAVGYG---TTLDGTKYWIVRNSWGPEWGEKGYIRMQRGIS 326
             S G++    C ++ L+HGV  VGYG   T  +  KYW+V+NSWG EWG +GYI++ +   
Sbjct:   260 SSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAK--- 316

Query:   327 DKKGLCGIAMEASYPI 342
             D+   CG+A  ASYP+
Sbjct:   317 DRDNHCGLATAASYPV 332


>UNIPROTKB|Q5E998
            symbol:CTSL2 "Cathepsin L2" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            HOVERGEN:HBG011513 UniGene:Bt.3987 MEROPS:C01.032 EMBL:BT021022
            IPI:IPI00711962 ProteinModelPortal:Q5E998 SMR:Q5E998 STRING:Q5E998
            InParanoid:Q5E998 Uniprot:Q5E998
        Length = 334

 Score = 636 (228.9 bits), Expect = 3.0e-62, P = 3.0e-62
 Identities = 139/315 (44%), Positives = 191/315 (60%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNV----MHVHQTNKMDKPYKLKLNKFADMTNHEF 94
             + +W++ H     ++E+  R  V+++N     +H  + ++    +++ +N F DMTN EF
Sbjct:    29 WHQWKATHRRLYGMNEEEWRRAVWEKNKKIIDLHNQEYSEGKHGFRMAMNAFGDMTNEEF 88

Query:    95 ASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFS 154
                  G + + H+     +G   F    +  +P SVDW KKG VT VK+QGQCGSCWAFS
Sbjct:    89 RQVMNGFQNQKHK-----KGK-LFHEPLLVDVPKSVDWTKKGYVTPVKNQGQCGSCWAFS 142

Query:   155 TIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEAKY 213
                A+EG     T KLVSLSEQ LVDC   Q NQGCNGGLM+ AF++IK  G + +E  Y
Sbjct:   143 ATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGCLDSEESY 202

Query:   214 PYQAND-GTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFY 271
             PY A D  +C+   E S A +  G  ++P   E AL+KAVA   P+SVAIDAG + FQFY
Sbjct:   203 PYLATDTNSCNYKPECS-AANDTGFVDIP-QREKALMKAVATVGPISVAIDAGHTSFQFY 260

Query:   272 SEGVFTG-ECGT-ELNHGVAAVGYG---TTLDGTKYWIVRNSWGPEWGEKGYIRMQRGIS 326
               G++   +C + +L+HGV  VGYG   T  +  K+WIV+NSWGPEWG  GY++M +   
Sbjct:   261 KSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGWNGYVKMAK--- 317

Query:   327 DKKGLCGIAMEASYP 341
             D+   CGIA  ASYP
Sbjct:   318 DQNNHCGIATAASYP 332


>UNIPROTKB|Q9GL24
            symbol:CTSL1 "Cathepsin L1" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 CTD:1515 KO:K01365
            OrthoDB:EOG48PMKF EMBL:AJ279008 RefSeq:NP_001239115.1
            UniGene:Cfa.3571 ProteinModelPortal:Q9GL24 SMR:Q9GL24
            MEROPS:C01.032 Ensembl:ENSCAFT00000001770
            Ensembl:ENSCAFT00000023837 GeneID:100684364 KEGG:cfa:100684364
            InParanoid:Q9GL24 OMA:FDQNLDT NextBio:20817211 Uniprot:Q9GL24
        Length = 333

 Score = 634 (228.2 bits), Expect = 4.8e-62, P = 4.8e-62
 Identities = 140/312 (44%), Positives = 187/312 (59%)

Query:    41 RWRSHHTVSRSLDEKHKRFNVFKQNV----MHVHQTNKMDKPYKLKLNKFADMTNHEFAS 96
             +W++ H     ++E+  R  V+++N+    +H  + ++    + + +N F DMTN EF  
Sbjct:    31 QWKATHRRLYGMNEEGWRRAVWEKNMKMIELHNREYSQGKHGFTMAMNAFGDMTNEEFRQ 90

Query:    97 TYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTI 156
                G + + H+     +G   F       IP SVDWR+KG VT VK+QGQCGSCWAFS  
Sbjct:    91 VMNGFQNQKHK-----KGK-MFQEPLFAEIPKSVDWREKGYVTPVKNQGQCGSCWAFSAT 144

Query:   157 AAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEAKYPY 215
              A+EG     T KLVSLSEQ LVDC   Q N+GCNGGLM+ AF ++K  GG+ +E  YPY
Sbjct:   145 GALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAFRYVKDNGGLDSEESYPY 204

Query:   216 QANDG-TCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSE 273
                D  TC+   E S A +  G  ++P   E AL+KAVA   P+SVAIDAG   FQFY  
Sbjct:   205 LGRDTETCNYKPECS-AANDTGFVDLP-QREKALMKAVATLGPISVAIDAGHQSFQFYKS 262

Query:   274 GV-FTGECGT-ELNHGVAAVGYGT--TLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKK 329
             G+ F  +C + +L+HGV  VGYG   T    K+WIV+NSWGPEWG  GY++M +   D+ 
Sbjct:   263 GIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKFWIVKNSWGPEWGWNGYVKMAK---DQN 319

Query:   330 GLCGIAMEASYP 341
               CGIA  ASYP
Sbjct:   320 NHCGIATAASYP 331


>UNIPROTKB|P83654
            symbol:P83654 "Ervatamin-C" species:52861 "Tabernaemontana
            divaricata" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0006508 "proteolysis" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005576 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197 PDB:1O0E PDB:2PNS
            PDBsum:1O0E PDBsum:2PNS MEROPS:C01.116 EvolutionaryTrace:P83654
            Uniprot:P83654
        Length = 208

 Score = 629 (226.5 bits), Expect = 1.6e-61, P = 1.6e-61
 Identities = 125/218 (57%), Positives = 149/218 (68%)

Query:   126 IPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ 185
             +P  +DWRKKG+VT VK+QG CGSCWAFST++ VE IN I T  L+SLSEQELVDCD  +
Sbjct:     1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDK-K 59

Query:   186 NQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHE 245
             N GC GG    A+++I   GG+ T+A YPY+A  G C  +   S  VSIDG+  VP  +E
Sbjct:    60 NHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPCQAA---SKVVSIDGYNGVPFCNE 116

Query:   246 DALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTGECGTELNHGVAAVGYGTTLDGTKYWIV 305
              AL +AVA QP +VAIDA S+ FQ YS G+F+G CGT+LNHGV  VGY        YWIV
Sbjct:   117 XALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQAN-----YWIV 171

Query:   306 RNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPIK 343
             RNSWG  WGEKGYIRM R +    GLCGIA    YP K
Sbjct:   172 RNSWGRYWGEKGYIRMLR-VGGC-GLCGIARLPYYPTK 207


>DICTYBASE|DDB_G0278721
            symbol:cprD "cysteine proteinase 4" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0278721 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000024 EMBL:L36204 RefSeq:XP_641963.1
            ProteinModelPortal:P54639 SMR:P54639 MEROPS:C01.A57 PRIDE:P54639
            EnsemblProtists:DDB0214999 GeneID:8621695 KEGG:ddi:DDB_G0278721
            OMA:NAFADIT ProtClustDB:CLSZ2846820 Uniprot:P54639
        Length = 442

 Score = 520 (188.1 bits), Expect = 6.9e-61, Sum P(2) = 6.9e-61
 Identities = 119/276 (43%), Positives = 158/276 (57%)

Query:    27 KELESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKF 86
             K+  SE    + +  W   H  + S +E + R+ +FK N+ +VHQ N       L LN F
Sbjct:    18 KQQFSELQYRNAFTNWMQAHQRTYSSEEFNARYQIFKSNMDYVHQWNSKGGETVLGLNVF 77

Query:    87 ADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIP-PSVDWRKKGSVTAVKDQG 145
             AD+TN E+ +TY G+       F G+   GT    K+ S P P+VDWR +G+VT +K+QG
Sbjct:    78 ADITNQEYRTTYLGTP------FDGSALIGT-EEEKIFSTPAPTVDWRAQGAVTPIKNQG 130

Query:   146 QCGSCWAFSTIAAVEGINHIM--TNK-LVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFI 201
             QCG CW+FST  + EG + I   T K LVSLSEQ L+DC     N GC GGLM LAFE+I
Sbjct:   131 QCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSEQNLIDCSKSYGNNGCEGGLMTLAFEYI 190

Query:   202 KKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAI 261
                 G+ TE+ YPY A DG     K S+    I  ++NV +  E +L  A    PVSVAI
Sbjct:   191 INNKGIDTESSYPYTAEDGKECKFKTSNIGAQIVSYQNVTSGSEASLQSASNNAPVSVAI 250

Query:   262 DAGSSDFQFYSEGVF-TGECG-TELNHGVAAVGYGT 295
             DA +  FQ Y  G++    C  T+L+HGV  VGYG+
Sbjct:   251 DASNESFQLYESGIYYEPACSPTQLDHGVLVVGYGS 286

 Score = 121 (47.7 bits), Expect = 6.9e-61, Sum P(2) = 6.9e-61
 Identities = 23/44 (52%), Positives = 27/44 (61%)

Query:   302 YWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPIKKS 345
             YWIV+NSWG  WG  GYI M +   D+   CGIA  AS+P   S
Sbjct:   401 YWIVKNSWGTSWGMDGYIFMSK---DRNNNCGIATMASFPTASS 441


>UNIPROTKB|F1NEC8
            symbol:CTSL2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:AADN02067812 IPI:IPI00820956 Ensembl:ENSGALT00000037988
            ArrayExpress:F1NEC8 Uniprot:F1NEC8
        Length = 218

 Score = 623 (224.4 bits), Expect = 7.1e-61, P = 7.1e-61
 Identities = 126/221 (57%), Positives = 153/221 (69%)

Query:   127 PPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ- 185
             P SVDWR+KG VT VKDQGQCGSCWAFST  A+EG +   T KLVSLSEQ LVDC   + 
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEG 61

Query:   186 NQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGT-CDVSKESSPAVSIDGHENVPANH 244
             NQGCNGGLM+ AF++++  GG+ +E  YPY A D   C    E + A +  G  ++P  H
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYN-AANDTGFVDIPQGH 120

Query:   245 EDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVF-TGECGTE-LNHGVAAVGYGTTLDGTK 301
             E AL+KAVA   PVSVAIDAG S FQFY  G++   +C +E L+HGV  VGYG   DG K
Sbjct:   121 ERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFE-DGKK 179

Query:   302 YWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPI 342
             YWIV+NSWG +WG+KGYI M +   D+K  CGIA  ASYP+
Sbjct:   180 YWIVKNSWGEKWGDKGYIYMAK---DRKNHCGIATAASYPL 217


>UNIPROTKB|P07711
            symbol:CTSL1 "Cathepsin L1" species:9606 "Homo sapiens"
            [GO:0005576 "extracellular region" evidence=NAS] [GO:0005764
            "lysosome" evidence=IDA;NAS] [GO:0006508 "proteolysis"
            evidence=IDA] [GO:0002250 "adaptive immune response" evidence=IEP]
            [GO:0019882 "antigen processing and presentation" evidence=TAS]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0036021 "endolysosome lumen" evidence=TAS] [GO:0043202
            "lysosomal lumen" evidence=TAS] [GO:0045087 "innate immune
            response" evidence=TAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042393 "histone binding" evidence=IDA] [GO:0005634 "nucleus"
            evidence=TAS] [GO:0071888 "macrophage apoptotic process"
            evidence=NAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            Reactome:REACT_118779 EMBL:X12451 GO:GO:0005634 Reactome:REACT_6900
            GO:GO:0005576 GO:GO:0019886 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087
            GO:GO:0042393 GO:GO:0004197 GO:GO:0002250 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0036021 KO:K01365 OrthoDB:EOG48PMKF EMBL:M20496
            EMBL:CR457053 EMBL:BX537395 EMBL:AL160279 EMBL:BC012612 EMBL:X05256
            IPI:IPI00012887 PIR:S01002 RefSeq:NP_001244900.1
            RefSeq:NP_001244901.1 RefSeq:NP_001903.1 RefSeq:NP_666023.1
            UniGene:Hs.731507 UniGene:Hs.731952 PDB:1CJL PDB:1CS8 PDB:1ICF
            PDB:1MHW PDB:2NQD PDB:2VHS PDB:2XU1 PDB:2XU3 PDB:2XU4 PDB:2XU5
            PDB:2YJ2 PDB:2YJ8 PDB:2YJ9 PDB:2YJB PDB:2YJC PDB:3BC3 PDB:3H89
            PDB:3H8B PDB:3H8C PDB:3HHA PDB:3HWN PDB:3IV2 PDB:3K24 PDB:3KSE
            PDB:3OF8 PDB:3OF9 PDBsum:1CJL PDBsum:1CS8 PDBsum:1ICF PDBsum:1MHW
            PDBsum:2NQD PDBsum:2VHS PDBsum:2XU1 PDBsum:2XU3 PDBsum:2XU4
            PDBsum:2XU5 PDBsum:2YJ2 PDBsum:2YJ8 PDBsum:2YJ9 PDBsum:2YJB
            PDBsum:2YJC PDBsum:3BC3 PDBsum:3H89 PDBsum:3H8B PDBsum:3H8C
            PDBsum:3HHA PDBsum:3HWN PDBsum:3IV2 PDBsum:3K24 PDBsum:3KSE
            PDBsum:3OF8 PDBsum:3OF9 ProteinModelPortal:P07711 SMR:P07711
            IntAct:P07711 STRING:P07711 MEROPS:I29.001 PhosphoSite:P07711
            DMDM:115741 PaxDb:P07711 PeptideAtlas:P07711 PRIDE:P07711
            DNASU:1514 Ensembl:ENST00000340342 Ensembl:ENST00000343150
            GeneID:1514 KEGG:hsa:1514 UCSC:uc004aph.3 CTD:1514
            GeneCards:GC09P090341 H-InvDB:HIX0058839 H-InvDB:HIX0170314
            HGNC:HGNC:2537 HPA:CAB000459 MIM:116880 neXtProt:NX_P07711
            PharmGKB:PA162382890 InParanoid:P07711 OMA:REPLFAQ PhylomeDB:P07711
            BRENDA:3.4.22.15 BindingDB:P07711 ChEMBL:CHEMBL3837 ChiTaRS:CTSL1
            DrugBank:DB00040 EvolutionaryTrace:P07711 GenomeRNAi:1514
            NextBio:6271 PMAP-CutDB:P07711 ArrayExpress:P07711 Bgee:P07711
            CleanEx:HS_CTSL1 Genevestigator:P07711 GermOnline:ENSG00000135047
            GO:GO:0071888 Uniprot:P07711
        Length = 333

 Score = 620 (223.3 bits), Expect = 1.5e-60, P = 1.5e-60
 Identities = 133/314 (42%), Positives = 189/314 (60%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNV----MHVHQTNKMDKPYKLKLNKFADMTNHEF 94
             + +W++ H     ++E+  R  V+++N+    +H  +  +    + + +N F DMT+ EF
Sbjct:    29 WTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEF 88

Query:    95 ASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFS 154
                  G + +  R     +G   F        P SVDWR+KG VT VK+QGQCGSCWAFS
Sbjct:    89 RQVMNGFQNRKPR-----KGK-VFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFS 142

Query:   155 TIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEAKY 213
                A+EG     T +L+SLSEQ LVDC   Q N+GCNGGLM+ AF++++  GG+ +E  Y
Sbjct:   143 ATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESY 202

Query:   214 PYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYS 272
             PY+A + +C  + + S A +  G  ++P   E AL+KAVA   P+SVAIDAG   F FY 
Sbjct:   203 PYEATEESCKYNPKYSVA-NDTGFVDIP-KQEKALMKAVATVGPISVAIDAGHESFLFYK 260

Query:   273 EGV-FTGECGTE-LNHGVAAVGYG---TTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISD 327
             EG+ F  +C +E ++HGV  VGYG   T  D  KYW+V+NSWG EWG  GY++M +   D
Sbjct:   261 EGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAK---D 317

Query:   328 KKGLCGIAMEASYP 341
             ++  CGIA  ASYP
Sbjct:   318 RRNHCGIASAASYP 331


>TAIR|locus:2030027
            symbol:AT1G29110 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            EMBL:CP002684 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            IPI:IPI00544534 RefSeq:NP_564322.1 UniGene:At.51816
            ProteinModelPortal:F4HZW2 SMR:F4HZW2 EnsemblPlants:AT1G29110.1
            GeneID:839786 KEGG:ath:AT1G29110 OMA:SCRANAR Uniprot:F4HZW2
        Length = 334

 Score = 619 (223.0 bits), Expect = 1.9e-60, P = 1.9e-60
 Identities = 124/318 (38%), Positives = 189/318 (59%)

Query:    31 SEEGLWDLYERWRSHHT-VSRSLDEKHKRFNVFKQNVMHVHQTNKM-DKPYKLKLNKFAD 88
             +E+ + D +++W +  + V +   EK  R  VFK+N+  +   N M ++ Y L +N+F D
Sbjct:    30 NEQSIVDYHQQWMTQFSRVYKDESEKEMRLKVFKKNLKFIENFNNMGNQSYTLGVNEFTD 89

Query:    89 MTNHEFASTYAGSKIKH---HRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQG 145
                 EF +T+ G ++       +F  T+ +  +    +     S DWR +G+VT VK QG
Sbjct:    90 WKTEEFLATHTGLRVNVTSLSELFNKTKPSRNWNMSDIDMEDESKDWRDEGAVTPVKYQG 149

Query:   146 QCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKG 205
              C        +  + G N      L++LSEQ+L+DCD ++N GCNGG  E AF++I K G
Sbjct:   150 AC-------RLTKISGKN------LLTLSEQQLIDCDIEKNGGCNGGEFEEAFKYIIKNG 196

Query:   206 GVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGS 265
             GV+ E +YPYQ    +C  +   +P   I G + VP+++E ALL+AV +QPVSV IDA +
Sbjct:   197 GVSLETEYPYQVKKESCRANARRAPHTQIRGFQMVPSHNERALLEAVRRQPVSVLIDARA 256

Query:   266 SDFQFYSEGVFTG-ECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRG 324
               F  Y  GV+ G +CGT++NH V  VGYGT + G  YW+++NSWG  WGE GY+R++R 
Sbjct:   257 DSFGHYKGGVYAGLDCGTDVNHAVTIVGYGT-MSGLNYWVLKNSWGESWGENGYMRIRRD 315

Query:   325 ISDKKGLCGIAMEASYPI 342
             +   +G+CGIA  A+YP+
Sbjct:   316 VEWPQGMCGIAQVAAYPV 333


>ZFIN|ZDB-GENE-030131-572
            symbol:wu:fb37b09 "wu:fb37b09" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030131-572 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:BX465840 IPI:IPI00866294 RefSeq:XP_001923796.1
            UniGene:Dr.25683 PRIDE:E9QBE2 Ensembl:ENSDART00000133962
            GeneID:321853 KEGG:dre:321853 NextBio:20807556 Uniprot:E9QBE2
        Length = 335

 Score = 618 (222.6 bits), Expect = 2.4e-60, P = 2.4e-60
 Identities = 136/317 (42%), Positives = 183/317 (57%)

Query:    37 DLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKM----DKPYKLKLNKFADMTNH 92
             D +  W+S H  S   D +  R  ++++N+  + Q N      +  +K+ +N+F DMTN 
Sbjct:    26 DHWNSWKSQHGKSYHEDVEVGRRMIWEENLRKIEQHNFEYSLGNHTFKMGMNQFGDMTNE 85

Query:    93 EFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWA 152
             EF     G K   +R  QG      FM  K  + P  VDWR++G VT VKDQ QCGSCW+
Sbjct:    86 EFRQAMNGYKHDPNRTSQGP----LFMEPKFFAAPQQVDWRQRGYVTPVKDQKQCGSCWS 141

Query:   153 FSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEA 211
             FS+  A+EG     T KL+S+SEQ LVDC     NQGCNGGLM+ AF+++K+  G+ +E 
Sbjct:   142 FSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHGNQGCNGGLMDQAFQYVKENKGLDSEQ 201

Query:   212 KYPYQANDGT-CDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQ 269
              YPY A D   C      + A  I G  ++P  +E AL+ AVA   PVSVAIDA     Q
Sbjct:   202 SYPYLARDDLPCRYDPRFNVA-KITGFVDIPKGNELALMNAVAAVGPVSVAIDASHQSLQ 260

Query:   270 FYSEGVFTGE-CGTELNHGVAAVGYG---TTLDGTKYWIVRNSWGPEWGEKGYIRMQRGI 325
             FY  G++    C ++L+H V  VGYG     + G +YWIV+NSW  +WG+KGYI M +  
Sbjct:   261 FYQSGIYYERACTSQLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAK-- 318

Query:   326 SDKKGLCGIAMEASYPI 342
              DK   CGIA  ASYP+
Sbjct:   319 -DKNNHCGIATMASYPL 334


>ZFIN|ZDB-GENE-040718-61
            symbol:ctsl.1 "cathepsin L.1" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-040718-61
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            GeneTree:ENSGT00660000095458 MEROPS:C01.092 EMBL:FP015965
            EMBL:BC075887 IPI:IPI00513499 RefSeq:NP_001002368.1
            UniGene:Dr.85174 SMR:Q6DHT0 Ensembl:ENSDART00000017756
            GeneID:436641 KEGG:dre:436641 CTD:436641 InParanoid:Q6DHT0
            OMA:GGQMENA OrthoDB:EOG41ZFB9 NextBio:20831086 Uniprot:Q6DHT0
        Length = 334

 Score = 618 (222.6 bits), Expect = 2.4e-60, P = 2.4e-60
 Identities = 136/313 (43%), Positives = 188/313 (60%)

Query:    39 YERWRSHHTVS-RSLDEK-HKRFN-VFKQNVMHVHQ--TNKMDKPYKLKLNKFADMTNHE 93
             +  W+     S RS +E+ H++   +  + ++ VH    ++  K Y+L +  FADM+N E
Sbjct:    26 FHAWKLKFGKSYRSAEEESHRQLTWLTNRKLVLVHNMMADQGLKSYRLGMTYFADMSNEE 85

Query:    94 FASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAF 153
             +        +      +   G+  F   K   +P +VDWR KG VT +KDQ QCGSCWAF
Sbjct:    86 YRQLVFRGCLGSMNNTKARGGSTFFRLRKAAVVPDTVDWRDKGYVTDIKDQKQCGSCWAF 145

Query:   154 STIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEAK 212
             S   ++EG     T KLVSLSEQ+LVDC     N GC+GGLM+ AF++I+   G+ TE  
Sbjct:   146 SATGSLEGQTFRKTGKLVSLSEQQLVDCSGSYGNYGCDGGLMDQAFQYIEANKGLDTEDS 205

Query:   213 YPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDFQFY 271
             YPY+A DG C  +  S+   S  G+ ++ +  E AL +AVA   P+SVAIDAG S FQ Y
Sbjct:   206 YPYEAQDGECRFNP-STVGASCTGYVDIASGDESALQEAVATIGPISVAIDAGHSSFQLY 264

Query:   272 SEGVFTG-ECGT-ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKK 329
             S GV+   +C + EL+HGV AVGYG++ +G  YWIV+NSWG +WG +GYI M R   +K 
Sbjct:   265 SSGVYNEPDCSSSELDHGVLAVGYGSS-NGDDYWIVKNSWGLDWGVQGYILMSR---NKS 320

Query:   330 GLCGIAMEASYPI 342
               CGIA  ASYP+
Sbjct:   321 NQCGIATAASYPL 333


>UNIPROTKB|O60911
            symbol:CTSL2 "Cathepsin L2" species:9606 "Homo sapiens"
            [GO:0004177 "aminopeptidase activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA] [GO:0005902
            "microvillus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0007283 "spermatogenesis" evidence=IEA] [GO:0009267 "cellular
            response to starvation" evidence=IEA] [GO:0009749 "response to
            glucose stimulus" evidence=IEA] [GO:0009897 "external side of
            plasma membrane" evidence=IEA] [GO:0010259 "multicellular
            organismal aging" evidence=IEA] [GO:0021675 "nerve development"
            evidence=IEA] [GO:0030984 "kininogen binding" evidence=IEA]
            [GO:0032403 "protein complex binding" evidence=IEA] [GO:0034698
            "response to gonadotropin stimulus" evidence=IEA] [GO:0042277
            "peptide binding" evidence=IEA] [GO:0043005 "neuron projection"
            evidence=IEA] [GO:0043204 "perikaryon" evidence=IEA] [GO:0046697
            "decidualization" evidence=IEA] [GO:0048102 "autophagic cell death"
            evidence=IEA] [GO:0051384 "response to glucocorticoid stimulus"
            evidence=IEA] [GO:0060008 "Sertoli cell differentiation"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=TAS] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_118779 Reactome:REACT_6900
            GO:GO:0009897 GO:GO:0019886 GO:GO:0034698 GO:GO:0043204
            GO:GO:0009749 GO:GO:0030141 GO:GO:0051384 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0045177 GO:GO:0043005 GO:GO:0007283
            GO:GO:0004177 GO:GO:0042277 GO:GO:0009267 GO:GO:0021675
            GO:GO:0043202 GO:GO:0005902 GO:GO:0010259 GO:GO:0004197
            GO:GO:0048102 GO:GO:0046697 HOVERGEN:HBG011513 CTD:1515
            OrthoDB:EOG48PMKF OMA:FDQNLDT GO:GO:0060008 EMBL:Y14734
            EMBL:AB001928 EMBL:AF070448 EMBL:AB019534 EMBL:AY358641
            EMBL:AL445670 EMBL:BC023504 EMBL:BC110512 IPI:IPI00000013
            RefSeq:NP_001188504.1 RefSeq:NP_001324.2 UniGene:Hs.610096 PDB:1FH0
            PDB:3H6S PDB:3KFQ PDBsum:1FH0 PDBsum:3H6S PDBsum:3KFQ
            ProteinModelPortal:O60911 SMR:O60911 IntAct:O60911 STRING:O60911
            MEROPS:I29.010 PhosphoSite:O60911 PaxDb:O60911 PeptideAtlas:O60911
            PRIDE:O60911 Ensembl:ENST00000259470 Ensembl:ENST00000538255
            GeneID:1515 KEGG:hsa:1515 UCSC:uc004awt.3 GeneCards:GC09M099794
            HGNC:HGNC:2538 HPA:CAB017112 MIM:603308 neXtProt:NX_O60911
            PharmGKB:PA27036 InParanoid:O60911 KO:K01375 PhylomeDB:O60911
            BRENDA:3.4.22.43 SABIO-RK:O60911 BindingDB:O60911 ChEMBL:CHEMBL3272
            ChiTaRS:CTSL2 EvolutionaryTrace:O60911 GenomeRNAi:1515 NextBio:6277
            Bgee:O60911 CleanEx:HS_CTSL2 Genevestigator:O60911
            GermOnline:ENSG00000136943 Uniprot:O60911
        Length = 334

 Score = 617 (222.3 bits), Expect = 3.1e-60, P = 3.1e-60
 Identities = 142/327 (43%), Positives = 188/327 (57%)

Query:    32 EEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNV----MHVHQTNKMDKPYKLKLNKFA 87
             ++ L   + +W++ H      +E+  R  V+++N+    +H  + ++    + + +N F 
Sbjct:    22 DQNLDTKWYQWKATHRRLYGANEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFG 81

Query:    88 DMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKV------TSIPPSVDWRKKGSVTAV 141
             DMTN EF            R   G   N  F  GKV        +P SVDWRKKG VT V
Sbjct:    82 DMTNEEF------------RQMMGCFRNQKFRKGKVFREPLFLDLPKSVDWRKKGYVTPV 129

Query:   142 KDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEF 200
             K+Q QCGSCWAFS   A+EG     T KLVSLSEQ LVDC   Q NQGCNGG M  AF++
Sbjct:   130 KNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQY 189

Query:   201 IKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSV 259
             +K+ GG+ +E  YPY A D  C    E+S A +  G   V    E AL+KAVA   P+SV
Sbjct:   190 VKENGGLDSEESYPYVAVDEICKYRPENSVA-NDTGFTVVAPGKEKALMKAVATVGPISV 248

Query:   260 AIDAGSSDFQFYSEGV-FTGECGTE-LNHGVAAVGYG---TTLDGTKYWIVRNSWGPEWG 314
             A+DAG S FQFY  G+ F  +C ++ L+HGV  VGYG      + +KYW+V+NSWGPEWG
Sbjct:   249 AMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWG 308

Query:   315 EKGYIRMQRGISDKKGLCGIAMEASYP 341
               GY+++ +   DK   CGIA  ASYP
Sbjct:   309 SNGYVKIAK---DKNNHCGIATAASYP 332


>RGD|1560071
            symbol:Ctsll3 "cathepsin L-like 3" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1560071 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:CH474032 IPI:IPI00560469 RefSeq:XP_001065834.2
            RefSeq:XP_573976.3 UniGene:Rn.104851 MEROPS:C01.107
            Ensembl:ENSRNOT00000061398 GeneID:498691 KEGG:rno:498691
            UCSC:RGD:1560071 CTD:70202 OMA:NCGIASD OrthoDB:EOG4HDSTZ
            NextBio:700548 Uniprot:D3ZJV2
        Length = 330

 Score = 615 (221.5 bits), Expect = 5.0e-60, P = 5.0e-60
 Identities = 133/315 (42%), Positives = 189/315 (60%)

Query:    38 LYERWRSHHTVSRSLDEKHKRFNVFKQNV--MHVHQTN--KMDKPYKLKLNKFADMTNHE 93
             ++E W++ H  + + +E+ ++  V++ N+  +++H  +  K    + L++N F D+TN E
Sbjct:    28 VWEEWKTKHGKTYNTNEEGQKRAVWENNMKMINLHNEDYLKGKHGFSLEMNAFGDLTNTE 87

Query:    94 FASTYAGSKIKHHRMFQG--TRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCW 151
             F     G        FQG  T+    F    +  +P +VDWRK G VT VK+QG CGSCW
Sbjct:    88 FRELMTG--------FQGQKTKMMKVFPEPFLGDVPKTVDWRKHGYVTPVKNQGPCGSCW 139

Query:   152 AFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTE 210
             AFS + ++EG     T KLV LSEQ LVDC     N+GC+GGL + AF+++K  GG+ T 
Sbjct:   140 AFSAVGSLEGQVFRKTGKLVPLSEQNLVDCSWSHGNKGCDGGLPDFAFQYVKDNGGLDTS 199

Query:   211 AKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQ 269
               YPY+A +GTC  + + S A  + G  ++P + E+AL+KAVA   P+SV ID     FQ
Sbjct:   200 VSYPYEALNGTCRYNPKYS-AAKVVGFMSIPPS-ENALMKAVATVGPISVGIDIKHKSFQ 257

Query:   270 FYSEGVF-TGECG-TELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISD 327
             FY  G++   +C  T LNH V  VGYG   DG KYW+V+NSWG +WG  GYI+M +   D
Sbjct:   258 FYKGGMYYEPDCSSTNLNHAVLVVGYGEESDGRKYWLVKNSWGRDWGMDGYIKMAK---D 314

Query:   328 KKGLCGIAMEASYPI 342
                 CGIA +ASYPI
Sbjct:   315 WNNNCGIASDASYPI 329


>ZFIN|ZDB-GENE-041010-76
            symbol:ctsll "cathepsin L, like" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-041010-76
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            OrthoDB:EOG48PMKF MEROPS:C01.032 EMBL:BX119902 IPI:IPI00616622
            UniGene:Dr.79994 SMR:A2BEM8 Ensembl:ENSDART00000144226
            InParanoid:A2BEM8 OMA:PRYSAAN Uniprot:A2BEM8
        Length = 337

 Score = 615 (221.5 bits), Expect = 5.0e-60, P = 5.0e-60
 Identities = 136/319 (42%), Positives = 187/319 (58%)

Query:    36 WDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTN---KMDK-PYKLKLNKFADMTN 91
             W L++RW  H       +E  +R  V+++N+  +   N    + K  ++L +N+F DMTN
Sbjct:    29 WHLWKRW--HEKSYHEKEEGWRRM-VWEKNLKKIELHNLEHSVGKHTFRLGMNQFGDMTN 85

Query:    92 HEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCW 151
              EF     G     +R  +G+     F+     + P  +DWR+KG VT +KDQ +CGSCW
Sbjct:    86 EEFRQAMNGYNRDPNRKSKGS----LFIEPSFFTAPQQIDWRQKGYVTPIKDQKRCGSCW 141

Query:   152 AFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTE 210
             AFS+  A+EG     T KLVSLSEQ L+DC   Q N GC+GGLM+ AF++++   G+ +E
Sbjct:   142 AFSSTGALEGQVFRKTGKLVSLSEQNLMDCSRPQGNNGCDGGLMDQAFQYVQDNNGLDSE 201

Query:   211 AKYPYQANDGT-CDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDF 268
               YPY A D   C      S A ++ G  ++P+  E AL+KAVA   PV+VAIDAG   F
Sbjct:   202 ESYPYLATDDQPCHYDPRYS-AANVTGFVDIPSGKEHALMKAVAAVGPVAVAIDAGHESF 260

Query:   269 QFYSEGVFTGE-CGTE-LNHGVAAVGYG---TTLDGTKYWIVRNSWGPEWGEKGYIRMQR 323
             QFY  G++  + C TE L+HGV  VGYG     + G +YWIV+NSW   WG+KGYI M +
Sbjct:   261 QFYQSGIYYEKACSTEELDHGVLVVGYGYEGVDVAGRRYWIVKNSWTDRWGDKGYIYMAK 320

Query:   324 GISDKKGLCGIAMEASYPI 342
                D K  CGIA  ASYP+
Sbjct:   321 ---DLKNHCGIATSASYPL 336


>UNIPROTKB|P09648 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9031 "Gallus gallus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            OrthoDB:EOG48PMKF MEROPS:C01.032 IPI:IPI00602255 PIR:S00081
            UniGene:Gga.523 ProteinModelPortal:P09648 SMR:P09648 Uniprot:P09648
        Length = 218

 Score = 612 (220.5 bits), Expect = 1.0e-59, P = 1.0e-59
 Identities = 126/222 (56%), Positives = 152/222 (68%)

Query:   127 PPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTN-KLVSLSEQELVDCDTDQ 185
             P SVDWR+KG VT VKDQGQCGSCWAFST  A+EG  H  T  KLVSLSEQ LVDC   +
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEG-QHFRTKGKLVSLSEQNLVDCSRPE 60

Query:   186 -NQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGT-CDVSKESSPAVSIDGHENVPAN 243
              NQGCNGGLM+ AF++++  GG+ +E  YPY A D   C    E + A +  G  ++P  
Sbjct:    61 GNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYN-AANDTGFVDIPQG 119

Query:   244 HEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVF-TGECGTE-LNHGVAAVGYGTTLDGT 300
             HE AL+KAVA   PVSVAIDAG S FQFY  G++   +C +E L+HGV  VGYG    G 
Sbjct:   120 HERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFE-GGK 178

Query:   301 KYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPI 342
             KYWIV+NSWG +WG+KGYI M +   D+K  CGIA  ASYP+
Sbjct:   179 KYWIVKNSWGEKWGDKGYIYMAK---DRKNHCGIATAASYPL 217


>ZFIN|ZDB-GENE-080215-7 [details] [associations]
            symbol:zgc:174153 "zgc:174153" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-080215-7
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 EMBL:BX000534 EMBL:BX322603
            IPI:IPI00483644 Ensembl:ENSDART00000113654 OMA:ITLCISA Bgee:F1R8Y0
            Uniprot:F1R8Y0
        Length = 336

 Score = 612 (220.5 bits), Expect = 1.0e-59, P = 1.0e-59
 Identities = 134/318 (42%), Positives = 184/318 (57%)

Query:    37 DLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKM----DKPYKLKLNKFADMTNH 92
             D +  W+S H  S   D +  R  ++++N+  + Q N      +  +K+ +N+F DMTN 
Sbjct:    26 DHWNSWKSQHGKSYHEDVEVGRRMIWEENLRKIEQHNFEYSYGNHTFKMGMNQFGDMTNE 85

Query:    93 EFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWA 152
             EF     G K   ++  QG      FM     + P  VDWR++G VT VKDQ QCGSCW+
Sbjct:    86 EFRQAMNGYKHDPNQTSQGP----LFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWS 141

Query:   153 FSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEA 211
             FS+  A+EG     T KL+S+SEQ LVDC   Q NQGCNGGLM+ AF+++K+  G+ +E 
Sbjct:   142 FSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGLMDQAFQYVKENKGLDSEQ 201

Query:   212 KYPYQANDGT-CDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQ 269
              YPY A D   C      + A  I G  ++P+ +E AL+ AVA   PVSVAIDA     Q
Sbjct:   202 SYPYLARDDLPCRYDPRFNVA-KITGFVDIPSGNEPALMNAVAAVGPVSVAIDASHQSLQ 260

Query:   270 FYSEGVFTGE-CGTE-LNHGVAAVGYG---TTLDGTKYWIVRNSWGPEWGEKGYIRMQRG 324
             FY  G++    C +  L+H V  VGYG     + G +YWIV+NSW  +WG+KGYI M + 
Sbjct:   261 FYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAK- 319

Query:   325 ISDKKGLCGIAMEASYPI 342
               DK   CG+A +ASYP+
Sbjct:   320 --DKNNHCGVATKASYPL 335


>ZFIN|ZDB-GENE-071004-74 [details] [associations]
            symbol:zgc:174855 "zgc:174855" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-071004-74
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 MEROPS:C01.032 EMBL:BX000534 EMBL:BC152282
            IPI:IPI00773140 RefSeq:NP_001096592.1 UniGene:Dr.104905 SMR:A7MCR6
            STRING:A7MCR6 Ensembl:ENSDART00000109968 GeneID:569326
            KEGG:dre:569326 NextBio:20889622 Uniprot:A7MCR6
        Length = 335

 Score = 611 (220.1 bits), Expect = 1.3e-59, P = 1.3e-59
 Identities = 134/317 (42%), Positives = 182/317 (57%)

Query:    37 DLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKM----DKPYKLKLNKFADMTNH 92
             D +  W+S H  S   D +  R  ++++N+  + Q N      +  +K+ +N+F DMTN 
Sbjct:    26 DHWNSWKSQHGKSYHEDVEVGRRMIWEENLRKIEQHNFEYSLGNHTFKMGMNQFGDMTNE 85

Query:    93 EFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWA 152
             EF     G K   +R  +G      FM     + P  VDWR++G VT VKDQ QCGSCW+
Sbjct:    86 EFRQAMNGYKQDPNRTSKGA----LFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWS 141

Query:   153 FSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEA 211
             FS+  A+EG     T KL+S+SEQ LVDC   Q NQGCNGG+M+ AF+++K+  G+ +E 
Sbjct:   142 FSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQAFQYVKENKGLDSEQ 201

Query:   212 KYPYQANDGT-CDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQ 269
              YPY A D   C      + A  I G  ++P  +E AL+ AVA   PVSVAIDA     Q
Sbjct:   202 SYPYLARDDLPCRYDPRFNVA-KITGFVDIPRGNELALMNAVAAVGPVSVAIDASHQSLQ 260

Query:   270 FYSEGVFTGE-CGTELNHGVAAVGYG---TTLDGTKYWIVRNSWGPEWGEKGYIRMQRGI 325
             FY  G++    C + L+H V  VGYG     + G +YWIV+NSW  +WG+KGYI M +  
Sbjct:   261 FYQSGIYYERACTSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAK-- 318

Query:   326 SDKKGLCGIAMEASYPI 342
              DK   CGIA  ASYP+
Sbjct:   319 -DKNNHCGIATMASYPL 334


>DICTYBASE|DDB_G0283867 [details] [associations]
            symbol:cprC "cysteine proteinase 3" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0283867 GenomeReviews:CM000153_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:AAFI02000057
            KO:K01365 EMBL:X03930 RefSeq:XP_638859.1 ProteinModelPortal:Q23894
            SMR:Q23894 MEROPS:C01.114 EnsemblProtists:DDB0220784 GeneID:8624257
            KEGG:ddi:DDB_G0283867 OMA:NNVEHIN Uniprot:Q23894
        Length = 337

 Score = 610 (219.8 bits), Expect = 1.7e-59, P = 1.7e-59
 Identities = 137/313 (43%), Positives = 184/313 (58%)

Query:    37 DLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFAS 96
             D +  W   +  + +  E   R+  FK+N+ +VH  N       L LN+ AD++N E+  
Sbjct:    32 DSFIDWMRSNNKAYTHKEFMPRYEEFKKNMDYVHNWNSKGSKTVLGLNQHADLSNEEYRL 91

Query:    97 TYAGSKIKHHRMFQG--TRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFS 154
              Y G++   H    G   R  G  +       P +VDWR+K +VT VKDQGQCGSC++FS
Sbjct:    92 NYLGTRA--HIKLNGYHKRNLGLRLNRPQFKQPLNVDWREKDAVTPVKDQGQCGSCYSFS 149

Query:   155 TIAAVEGINHIMTNKLVSLSEQELVDCDTD-QNQGCNGGLMELAFEFIKKKGGVTTEAKY 213
             T  +VEG+  I T KLVSLSEQ ++DC +   N+GCNGGLM  AFE+I K  G+ +E +Y
Sbjct:   150 TTGSVEGVTAIKTGKLVSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQY 209

Query:   214 PYQ--ANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFY 271
             PY+   ND  C   +E S A  I  ++ + A  E+ L  A+   PVSVAIDA  + FQ Y
Sbjct:   210 PYEMKVND-ECKF-QEGSVAAKITSYKEIEAGDENDLQNALLLNPVSVAIDASHNSFQLY 267

Query:   272 SEGVF-TGECGTE-LNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKK 329
             + GV+    C +E L+HGV AVG GT  +G  Y+IV+NSWGP WG  GYI M R   +K 
Sbjct:   268 TAGVYYEPACSSEDLDHGVLAVGMGTD-NGEDYYIVKNSWGPSWGLNGYIHMAR---NKD 323

Query:   330 GLCGIAMEASYPI 342
               CGI+  ASYPI
Sbjct:   324 NNCGISTMASYPI 336


>DICTYBASE|DDB_G0279185 [details] [associations]
            symbol:cprF "cysteine proteinase 6" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0279185 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000030 HSSP:P07711 ProtClustDB:CLSZ2846820 EMBL:U72745
            RefSeq:XP_641725.1 ProteinModelPortal:Q94503 SMR:Q94503
            MEROPS:C01.081 PRIDE:Q94503 EnsemblProtists:DDB0215002
            GeneID:8621921 KEGG:ddi:DDB_G0279185 Uniprot:Q94503
        Length = 434

 Score = 500 (181.1 bits), Expect = 2.6e-59, Sum P(2) = 2.6e-59
 Identities = 111/280 (39%), Positives = 157/280 (56%)

Query:    27 KELESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKF 86
             K+  SE    + +  W   H    S +E + RFN+FK N+ ++++ N       L LN F
Sbjct:    18 KQQLSELQYRNAFTNWMIAHQRHYSSEEFNGRFNIFKANMDYINEWNTKGSETVLGLNVF 77

Query:    87 ADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQ 146
             AD+TN E+ +TY G+      + + T     F  G V +   SVDWR KG+VT +K+QG+
Sbjct:    78 ADITNEEYRATYLGTPFDASSL-EMTPSEKVF--GGVQA--NSVDWRAKGAVTPIKNQGE 132

Query:   147 CGSCWAFSTIAAVEGINHIMT--NKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKK 203
             CG CW+FS   A EG  +I    + L S+SEQ+L+DC     N GC GGLM LAFE+I  
Sbjct:   133 CGGCWSFSATGATEGAQYIANGDSDLTSVSEQQLIDCSGSYGNNGCEGGLMTLAFEYIIN 192

Query:   204 KGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDA 263
              GG+ TE+ YP+ AN   C  +  S+    +  + NV +  E  L   V + P SVAIDA
Sbjct:   193 NGGIDTESSYPFTANTEKCKYNP-SNIGAELSSYVNVTSGSESDLAAKVTQGPTSVAIDA 251

Query:   264 GSSDFQFYSEGVFTGE-CG-TELNHGVAAVGYGTTLDGTK 301
                 FQFYS G++    C  T+L+HGV AVG+G+   G++
Sbjct:   252 SQPSFQFYSSGIYNEPACSSTQLDHGVLAVGFGSGSSGSQ 291

 Score = 126 (49.4 bits), Expect = 2.6e-59, Sum P(2) = 2.6e-59
 Identities = 29/53 (54%), Positives = 32/53 (60%)

Query:   298 DGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYP--IKKSATN 348
             DG  YWIV+NSWG +WG  GYI M +   DK   CGIA  AS P  I KS  N
Sbjct:   386 DGN-YWIVKNSWGLDWGINGYILMSK---DKDNQCGIATMASIPQAIPKSKWN 434


>UNIPROTKB|F1PMM9 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9615 "Canis lupus
            familiaris" [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00639 GO:GO:0004197 GO:GO:0002250
            GeneTree:ENSGT00560000076577 GO:GO:0097067 EMBL:AAEX03000499
            Ensembl:ENSCAFT00000002029 OMA:EFKQVLN Uniprot:F1PMM9
        Length = 341

 Score = 606 (218.4 bits), Expect = 4.5e-59, P = 4.5e-59
 Identities = 133/315 (42%), Positives = 178/315 (56%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKM----DKPYKLKLNKFADMTNHEF 94
             + +W+  H      DE+  R  V+++N+  + Q N+     +  + L +N F DMTN EF
Sbjct:    37 WSQWKEAHGKLYDKDEEGWRRTVWERNMEMIEQHNQEYSQGEHSFTLAMNAFGDMTNEEF 96

Query:    95 ASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFS 154
                    KI+ H+     +G   F       +P SVDWR++G VT VKDQGQC  CWAFS
Sbjct:    97 KQVLNDFKIQKHK-----KGK-VFPAPLFAEVPSSVDWREQGYVTPVKDQGQCLGCWAFS 150

Query:   155 TIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEAKY 213
                A+EG     T KLVSLSEQ LVDC   Q N+GCNGGLME AF+++K  GG+ +E  Y
Sbjct:   151 ATGALEGQMFRKTGKLVSLSEQNLVDCSWSQGNRGCNGGLMEYAFQYVKDNGGLDSEESY 210

Query:   214 PYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYS 272
             PY A +  C    E S A ++     +  N ED L+  VA   PVS A+D+    FQFY 
Sbjct:   211 PYLARNEPCKYRPEKS-AANVTAFWPI-LNEEDGLMTTVATVGPVSAAVDSSPQSFQFYK 268

Query:   273 EGVFTG-ECGTEL-NHGVAAVGYG---TTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISD 327
             +G++   +C  +L NHGV  VGYG      D  KYWIV+NSWG  WG +GY+ + +   D
Sbjct:   269 KGIYYDPKCSNKLLNHGVLVVGYGFEGAESDNKKYWIVKNSWGTNWGMQGYMLLAK---D 325

Query:   328 KKGLCGIAMEASYPI 342
             +   CGIA  ASYP+
Sbjct:   326 RDNHCGIATRASYPV 340


>RGD|1308751 [details] [associations]
            symbol:RGD1308751 "similar to Cathepsin L precursor (Major
            excreted protein) (MEP)" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1308751 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:CH474032 IPI:IPI00365697 RefSeq:XP_001065885.2
            RefSeq:XP_225137.5 MEROPS:C01.069 Ensembl:ENSRNOT00000061391
            GeneID:290981 KEGG:rno:290981 UCSC:RGD:1308751 CTD:290981
            OMA:ESYAYEA OrthoDB:EOG42823G NextBio:631921 Uniprot:D3ZKC3
        Length = 330

 Score = 606 (218.4 bits), Expect = 4.5e-59, P = 4.5e-59
 Identities = 131/312 (41%), Positives = 186/312 (59%)

Query:    38 LYERWRSHHTVSRSLDEKHKRFNVFKQNV--MHVHQTN--KMDKPYKLKLNKFADMTNHE 93
             ++E W++ H  + + +E+ ++  V++ N+  +++H  +  K    + L++N F D+TN E
Sbjct:    28 VWEEWKTKHGKTYNTNEEGQKRAVWENNMKMINLHNEDYLKGKHGFSLEMNAFGDLTNTE 87

Query:    94 FASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAF 153
             F     G +        G +    F    +  IP S+DWR+ G VT VK+QGQCGSCWAF
Sbjct:    88 FRELMTGFQS------MGPKETTIFREPFLGDIPKSLDWREHGYVTPVKNQGQCGSCWAF 141

Query:   154 STIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEAK 212
             S + ++EG     T KLVSLSEQ LVDC     N GCNGGLME AF+++K+  G+ T   
Sbjct:   142 SAVGSLEGQIFKKTGKLVSLSEQNLVDCSWSYGNLGCNGGLMEFAFQYVKENRGLDTGES 201

Query:   213 YPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFY 271
             Y Y+A DG C  + + S A ++ G   VP + ED L+ AVA   PVSV ID+    F+FY
Sbjct:   202 YAYEAQDGLCRYNPKYS-AANVTGFVKVPLS-EDDLMSAVASVGPVSVGIDSHHQSFRFY 259

Query:   272 SEGVF-TGECG-TELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKK 329
             S G++   +C  TE++H V  VGYG   DG KYW+V+NSWG +WG  GYI+M +   D+ 
Sbjct:   260 SGGMYYEPDCSSTEMDHAVLVVGYGEESDGGKYWLVKNSWGEDWGMDGYIKMAK---DQN 316

Query:   330 GLCGIAMEASYP 341
               CGIA  A YP
Sbjct:   317 NNCGIATYAIYP 328


>ZFIN|ZDB-GENE-980526-285 [details] [associations]
            symbol:ctsl1b "cathepsin L, 1 b" species:7955
            "Danio rerio" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005576
            "extracellular region" evidence=IDA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-980526-285 GO:GO:0005576 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:BX465840 IPI:IPI00498443 Ensembl:ENSDART00000145570
            Bgee:F1R7B3 Uniprot:F1R7B3
        Length = 352

 Score = 603 (217.3 bits), Expect = 9.3e-59, P = 9.3e-59
 Identities = 133/318 (41%), Positives = 183/318 (57%)

Query:    37 DLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKM----DKPYKLKLNKFADMTNH 92
             D +  W+S H  S   D +  R  ++++N+  + Q N      +  +K+ +N+F DMTN 
Sbjct:    42 DHWNSWKSQHGKSYHEDVEVGRRMIWEENLRKIEQHNFEYSYGNHTFKMGMNQFGDMTNE 101

Query:    93 EFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWA 152
             EF     G     ++  QG      FM     + P  VDWR++G VT VKDQ QCGSCW+
Sbjct:   102 EFRQAMNGYTHDPNQTSQGP----LFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWS 157

Query:   153 FSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEA 211
             FS+  A+EG     T KL+S+SEQ LVDC   Q NQGCNGGLM+ AF+++K+  G+ +E 
Sbjct:   158 FSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGLMDQAFQYVKENKGLDSEQ 217

Query:   212 KYPYQANDGT-CDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQ 269
              YPY A D   C      + A  I G  ++P+ +E AL+ AVA   PVSVAIDA     Q
Sbjct:   218 SYPYLARDDLPCRYDPRFNVA-KITGFVDIPSGNELALMNAVAAVGPVSVAIDASHQSLQ 276

Query:   270 FYSEGVFTGE-CGTE-LNHGVAAVGYG---TTLDGTKYWIVRNSWGPEWGEKGYIRMQRG 324
             FY  G++    C +  L+H V  VGYG     + G +YWIV+NSW  +WG+KGYI M + 
Sbjct:   277 FYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYMAK- 335

Query:   325 ISDKKGLCGIAMEASYPI 342
               DK   CG+A +ASYP+
Sbjct:   336 --DKNNHCGVATKASYPL 351


>UNIPROTKB|Q10991 [details] [associations]
            symbol:CTSL "Cathepsin L1" species:9940 "Ovis aries"
            [GO:0005515 "protein binding" evidence=IPI] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            MEROPS:C01.032 ProteinModelPortal:Q10991 SMR:Q10991 Uniprot:Q10991
        Length = 217

 Score = 601 (216.6 bits), Expect = 1.5e-58, P = 1.5e-58
 Identities = 121/220 (55%), Positives = 149/220 (67%)

Query:   126 IPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ 185
             +P SVDW KKG VT VK+QGQCGSCWAFS   A+EG     T KLVSLSEQ LVD    Q
Sbjct:     1 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQ 60

Query:   186 -NQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANH 244
              NQGCNGGLM+ AF++IK+ GG+ +E  YPY+A D +C+   E S A    G  ++P   
Sbjct:    61 GNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKDT-GFVDIP-QR 118

Query:   245 EDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFTG-ECGT-ELNHGVAAVGYGTTLDGTK 301
             E AL+KAVA   P+SVAIDAG S FQFY  G++   +C + +L+HGV  VGYG      K
Sbjct:   119 EKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTNNK 178

Query:   302 YWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYP 341
             +WIV+NSWGPEWG KGY++M +   D+   CGIA  ASYP
Sbjct:   179 FWIVKNSWGPEWGNKGYVKMAK---DQNNHCGIATAASYP 215


>UNIPROTKB|A4IFS7 [details] [associations]
            symbol:CTSL1 "CTSL1 protein" species:9913 "Bos taurus"
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197 GO:GO:0002250
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 GO:GO:0097067
            OrthoDB:EOG48PMKF MEROPS:C01.032 CTD:1514 EMBL:DAAA02023987
            EMBL:BC134741 IPI:IPI00708619 RefSeq:NP_001077155.1
            UniGene:Bt.23199 SMR:A4IFS7 Ensembl:ENSBTAT00000000962
            GeneID:515200 KEGG:bta:515200 InParanoid:A4IFS7 OMA:NDEQALM
            NextBio:20871707 Uniprot:A4IFS7
        Length = 333

 Score = 600 (216.3 bits), Expect = 1.9e-58, P = 1.9e-58
 Identities = 132/314 (42%), Positives = 187/314 (59%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNV----MHVHQTNKMDKPYKLKLNKFADMTNHEF 94
             ++ W++ H     L+E+  R  V+K+N+    +H  + ++    + + +N F DMTN EF
Sbjct:    29 WKLWKAAHRKPYDLNEEGWRKAVWKKNMKMIELHNQEYSQGKHSFSMAMNAFGDMTNEEF 88

Query:    95 ASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFS 154
               T  G + + ++  +G   + T       SIPPSVDWR+KG VT VK+QG+CGSCWAFS
Sbjct:    89 RHTMNGFQRQKNK--KGKEFHETIF----ASIPPSVDWREKGYVTPVKNQGKCGSCWAFS 142

Query:   155 TIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEAKY 213
                A+EG     T KLVSLSEQ LVDC   + N+GC+GG ++ AF+++   GG+ +E  Y
Sbjct:   143 ATGALEGQMFQKTGKLVSLSEQNLVDCSQPEGNRGCHGGFIDNAFQYVLDVGGLDSEESY 202

Query:   214 PYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYS 272
             PY    GTC  +  +S A +  G  ++P   E AL+KAVA   P+SVA+DA +  FQFY 
Sbjct:   203 PYTGLVGTCLYNPNNS-AANETGFVDLP-KQEKALMKAVANLGPISVAVDAHNPSFQFYK 260

Query:   273 EGVF-TGECGTE-LNHGVAAVGYG---TTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISD 327
              G++    C +E ++H V  VGYG      D  KYW+V+NSWG  WG  GYI+M +   D
Sbjct:   261 SGIYYEPNCSSESVDHAVLVVGYGFEGADSDDNKYWLVKNSWGEHWGMNGYIKMAK---D 317

Query:   328 KKGLCGIAMEASYP 341
             +   CGIA  ASYP
Sbjct:   318 RNNHCGIATMASYP 331


>UNIPROTKB|G1K2A7 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9615 "Canis lupus
            familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 PANTHER:PTHR12411:SF55 OMA:LKVPPSH
            EMBL:AAEX03011051 Ensembl:ENSCAFT00000019202 Uniprot:G1K2A7
        Length = 333

 Score = 600 (216.3 bits), Expect = 1.9e-58, P = 1.9e-58
 Identities = 133/315 (42%), Positives = 187/315 (59%)

Query:    36 WDLYER-WRSHHTVSRSLDEKHKRFNVFKQNVMHV--H--QTNKMDKPYKLKLNKFADMT 90
             WDL+++ +R  +  +  +DE  +R  ++++N+ H+  H  + +     Y+L +N   DMT
Sbjct:    30 WDLWKKTYRKQY--NSKVDELSRRL-IWEKNLKHISIHNLEASLGVHTYELAMNHLGDMT 86

Query:    91 NHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTS-IPPSVDWRKKGSVTAVKDQGQCGS 149
             + E      G K+        +R N T       S  P SVD+RKKG VT VK+QGQCGS
Sbjct:    87 SEEVVQKMTGLKVPPSH----SRSNDTLYIPDWESRAPDSVDYRKKGYVTPVKNQGQCGS 142

Query:   150 CWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTT 209
             CWAFS++ A+EG     T KL++LS Q LVDC   +N GC GG M  AF++++K  G+ +
Sbjct:   143 CWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNAFQYVQKNRGIDS 201

Query:   210 EAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDF 268
             E  YPY   D +C +   +  A    G+  +P  +E AL +AVA+  P+SVAIDA  + F
Sbjct:   202 EDAYPYVGQDESC-MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSF 260

Query:   269 QFYSEGVFTGE-CGTE-LNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGIS 326
             QFYS+GV+  E C ++ LNH V AVGYG    G K+WI++NSWG  WG KGYI M R   
Sbjct:   261 QFYSKGVYYDENCNSDNLNHAVLAVGYGIQ-KGNKHWIIKNSWGENWGNKGYILMAR--- 316

Query:   327 DKKGLCGIAMEASYP 341
             +K   CGIA  AS+P
Sbjct:   317 NKNNACGIANLASFP 331


>UNIPROTKB|Q3ZKN1 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:AY738221
            RefSeq:NP_001029168.1 UniGene:Cfa.588 HSSP:P43235
            ProteinModelPortal:Q3ZKN1 SMR:Q3ZKN1 STRING:Q3ZKN1 GeneID:608843
            KEGG:cfa:608843 InParanoid:Q3ZKN1 NextBio:20894470 Uniprot:Q3ZKN1
        Length = 330

 Score = 600 (216.3 bits), Expect = 1.9e-58, P = 1.9e-58
 Identities = 133/315 (42%), Positives = 187/315 (59%)

Query:    36 WDLYER-WRSHHTVSRSLDEKHKRFNVFKQNVMHV--H--QTNKMDKPYKLKLNKFADMT 90
             WDL+++ +R  +  +  +DE  +R  ++++N+ H+  H  + +     Y+L +N   DMT
Sbjct:    27 WDLWKKTYRKQY--NSKVDELSRRL-IWEKNLKHISIHNLEASLGVHTYELAMNHLGDMT 83

Query:    91 NHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTS-IPPSVDWRKKGSVTAVKDQGQCGS 149
             + E      G K+        +R N T       S  P SVD+RKKG VT VK+QGQCGS
Sbjct:    84 SEEVVQKMTGLKVPPSH----SRSNDTLYIPDWESRAPDSVDYRKKGYVTPVKNQGQCGS 139

Query:   150 CWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTT 209
             CWAFS++ A+EG     T KL++LS Q LVDC   +N GC GG M  AF++++K  G+ +
Sbjct:   140 CWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNAFQYVQKNRGIDS 198

Query:   210 EAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDF 268
             E  YPY   D +C +   +  A    G+  +P  +E AL +AVA+  P+SVAIDA  + F
Sbjct:   199 EDAYPYVGQDESC-MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSF 257

Query:   269 QFYSEGVFTGE-CGTE-LNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGIS 326
             QFYS+GV+  E C ++ LNH V AVGYG    G K+WI++NSWG  WG KGYI M R   
Sbjct:   258 QFYSKGVYYDENCNSDNLNHAVLAVGYGIQ-KGNKHWIIKNSWGENWGNKGYILMAR--- 313

Query:   327 DKKGLCGIAMEASYP 341
             +K   CGIA  AS+P
Sbjct:   314 NKNNACGIANLASFP 328


>TAIR|locus:2078312 [details] [associations]
            symbol:AT3G45310 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005773 EMBL:CP002686
            GenomeReviews:BA000014_GR eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AL132953
            EMBL:AY091771 IPI:IPI00540369 PIR:T47471 RefSeq:NP_566880.1
            UniGene:At.25239 ProteinModelPortal:Q8RWQ9 SMR:Q8RWQ9
            MEROPS:C01.162 PaxDb:Q8RWQ9 PRIDE:Q8RWQ9 EnsemblPlants:AT3G45310.1
            GeneID:823669 KEGG:ath:AT3G45310 GeneFarm:5032 TAIR:At3g45310
            InParanoid:Q8RWQ9 KO:K01366 OMA:AFEVVHE PhylomeDB:Q8RWQ9
            ProtClustDB:CLSN2689015 Genevestigator:Q8RWQ9 Uniprot:Q8RWQ9
        Length = 358

 Score = 598 (215.6 bits), Expect = 3.2e-58, P = 3.2e-58
 Identities = 130/301 (43%), Positives = 176/301 (58%)

Query:    50 RSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTYAGSKIKHHRMF 109
             +S++E   RF+VFK+N+  +  TNK    YKL LN+FAD+T  EF     G+        
Sbjct:    71 QSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRYKLGAAQNCSATL 130

Query:   110 QGTRGNGTFMYGKVT--SIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMT 167
             +G+         K+T  ++P + DWR+ G V+ VK+QG CGSCW FST  A+E   H   
Sbjct:   131 KGSH--------KITEATVPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAF 182

Query:   168 NKLVSLSEQELVDC-DTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSK 226
              K +SLSEQ+LVDC  T  N GC+GGL   AFE+IK  GG+ TE  YPY   DG C  S 
Sbjct:   183 GKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDGGCKFSA 242

Query:   227 ESSPAVSIDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDFQFYSEGVFTGE-CGT-- 282
             ++   V +    N+    ED L  AV   +PVSVA +    +F+FY +GVFT   CG   
Sbjct:   243 KNI-GVQVRDSVNITLGAEDELKHAVGLVRPVSVAFEV-VHEFRFYKKGVFTSNTCGNTP 300

Query:   283 -ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYP 341
              ++NH V AVGYG   D   YW+++NSWG EWG+ GY +M+ G    K +CG+A  +SYP
Sbjct:   301 MDVNHAVLAVGYGVE-DDVPYWLIKNSWGGEWGDNGYFKMEMG----KNMCGVATCSSYP 355

Query:   342 I 342
             +
Sbjct:   356 V 356


>RGD|61810 [details] [associations]
            symbol:Ctsk "cathepsin K" species:10116 "Rattus norvegicus"
           [GO:0001957 "intramembranous ossification" evidence=IEP] [GO:0005615
           "extracellular space" evidence=IDA] [GO:0005737 "cytoplasm"
           evidence=IDA] [GO:0005764 "lysosome" evidence=IDA] [GO:0006508
           "proteolysis" evidence=TAS] [GO:0008234 "cysteine-type peptidase
           activity" evidence=TAS] [GO:0045453 "bone resorption" evidence=IMP]
           InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
           Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
           RGD:61810 GO:GO:0005615 eggNOG:COG4870 HOGENOM:HOG000230774
           GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
           InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
           PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
           GO:GO:0045453 GO:GO:0001957 GeneTree:ENSGT00560000076577
           HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
           OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:AF010306 EMBL:BC078793
           IPI:IPI00206378 RefSeq:NP_113748.1 UniGene:Rn.5598
           ProteinModelPortal:O35186 SMR:O35186 STRING:O35186
           PhosphoSite:O35186 PRIDE:O35186 Ensembl:ENSRNOT00000028730
           GeneID:29175 KEGG:rno:29175 UCSC:RGD:61810 InParanoid:O35186
           OMA:YKEIPEG BindingDB:O35186 ChEMBL:CHEMBL3034 NextBio:608248
           Genevestigator:O35186 GermOnline:ENSRNOG00000021155 Uniprot:O35186
        Length = 329

 Score = 593 (213.8 bits), Expect = 1.1e-57, P = 1.1e-57
 Identities = 131/322 (40%), Positives = 185/322 (57%)

Query:    29 LESEEGLWDLYERWRSHH--TVSRSLDEKHKRFNVFKQNV--MHVH--QTNKMDKPYKLK 82
             L  EE L   +E W+  H    +  +DE  +R  ++++N+  + VH  + +     Y+L 
Sbjct:    16 LSPEETLDTQWELWKKTHGKQYNSKVDEISRRL-IWEKNLKKISVHNLEASLGAHTYELA 74

Query:    83 LNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVK 142
             +N   DMT+ E      G ++   R F           G+V   P S+D+RKKG VT VK
Sbjct:    75 MNHLGDMTSEEVVQKMTGLRVPPSRSFSNDTLYTPEWEGRV---PDSIDYRKKGYVTPVK 131

Query:   143 DQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIK 202
             +QGQCGSCWAFS+  A+EG     T KL++LS Q LVDC   +N GC GG M  AF++++
Sbjct:   132 NQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDC-VSENYGCGGGYMTTAFQYVQ 190

Query:   203 KKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAI 261
             + GG+ +E  YPY   D +C +   ++ A    G+  +P  +E AL +AVA+  PVSV+I
Sbjct:   191 QNGGIDSEDAYPYVGQDESC-MYNATAKAAKCRGYREIPVGNEKALKRAVARVGPVSVSI 249

Query:   262 DAGSSDFQFYSEGVFTGE-CGTE-LNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYI 319
             DA  + FQFYS GV+  E C  + +NH V  VGYGT   G KYWI++NSWG  WG KGY+
Sbjct:   250 DASLTSFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQ-KGNKYWIIKNSWGESWGNKGYV 308

Query:   320 RMQRGISDKKGLCGIAMEASYP 341
              + R   +K   CGI   AS+P
Sbjct:   309 LLAR---NKNNACGITNLASFP 327


>UNIPROTKB|Q5E968
            symbol:CTSK "Cathepsin K" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:BT021052
            EMBL:BC109853 IPI:IPI00709374 RefSeq:NP_001029607.1
            UniGene:Bt.23218 ProteinModelPortal:Q5E968 SMR:Q5E968 STRING:Q5E968
            MEROPS:I29.007 PRIDE:Q5E968 Ensembl:ENSBTAT00000028016
            GeneID:513038 KEGG:bta:513038 CTD:1513 InParanoid:Q5E968 KO:K01371
            OrthoDB:EOG4SJ5FC NextBio:20870669 PANTHER:PTHR12411:SF55
            Uniprot:Q5E968
        Length = 329

 Score = 591 (213.1 bits), Expect = 1.7e-57, P = 1.7e-57
 Identities = 135/332 (40%), Positives = 194/332 (58%)

Query:    19 VEGFDFHEKELESEEGLWDLYER-WRSHHTVSRSLDEKHKRFNVFKQNVMHV--H--QTN 73
             V  F  + +E+   +  W+L+++ +R  +  S+  DE  +R  ++++N+ H+  H  + +
Sbjct:    11 VVSFALYPEEILDTQ--WELWKKTYRKQYN-SKG-DEISRRL-IWEKNLKHISIHNLEAS 65

Query:    74 KMDKPYKLKLNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGT-FMYGKVTSIPPSVDW 132
                  Y+L +N   DMT+ E      G K+   R    +R N T ++       P SVD+
Sbjct:    66 LGVHTYELAMNHLGDMTSEEVVQKMTGLKVPASR----SRSNDTLYIPDWEGRAPDSVDY 121

Query:   133 RKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGG 192
             RKKG VT VK+QGQCGSCWAFS++ A+EG     T KL++LS Q LVDC   +N GC GG
Sbjct:   122 RKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGG 180

Query:   193 LMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAV 252
              M  AF++++K  G+ +E  YPY   D  C +   +  A    G+  +P  +E AL +AV
Sbjct:   181 YMTNAFQYVQKNRGIDSEDAYPYVGQDENC-MYNPTGKAAKCRGYREIPEGNEKALKRAV 239

Query:   253 AKQ-PVSVAIDAGSSDFQFYSEGVFTGE-CGTE-LNHGVAAVGYGTTLDGTKYWIVRNSW 309
             A+  P+SVAIDA  + FQFY +GV+  E C ++ LNH V AVGYG    G K+WI++NSW
Sbjct:   240 ARVGPISVAIDASLTSFQFYRKGVYYDENCNSDNLNHAVLAVGYGIQ-KGNKHWIIKNSW 298

Query:   310 GPEWGEKGYIRMQRGISDKKGLCGIAMEASYP 341
             G  WG KGYI M R   +K   CGIA  AS+P
Sbjct:   299 GENWGNKGYILMAR---NKNNACGIANLASFP 327


>WB|WBGene00000776
            symbol:cpl-1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0040010 "positive regulation
            of growth rate" evidence=IMP] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] [GO:0040011
            "locomotion" evidence=IMP] [GO:0070265 "necrotic cell death"
            evidence=IMP] [GO:0031983 "vesicle lumen" evidence=IDA] [GO:0042718
            "yolk granule" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0009792 GO:GO:0040010 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0040011
            GO:GO:0031983 GO:GO:0070265 GeneTree:ENSGT00660000095458 KO:K01365
            GO:GO:0042718 MEROPS:I29.009 EMBL:Z92812 GeneID:180111
            KEGG:cel:CELE_T03E6.7 CTD:180111 PIR:T24387 RefSeq:NP_001256718.1
            HSSP:P80067 ProteinModelPortal:O45734 SMR:O45734 DIP:DIP-26616N
            IntAct:O45734 MINT:MINT-211563 STRING:O45734 PaxDb:O45734
            EnsemblMetazoa:T03E6.7.1 EnsemblMetazoa:T03E6.7.2 UCSC:T03E6.7.1
            WormBase:T03E6.7a InParanoid:O45734 OMA:HIENHNR NextBio:908128
            Uniprot:O45734
        Length = 337

 Score = 591 (213.1 bits), Expect = 1.7e-57, P = 1.7e-57
 Identities = 133/326 (40%), Positives = 184/326 (56%)

Query:    27 KELESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMD----KPYKLK 82
             +++ES    WD Y   +       S  E+      F +N++H+   N+      K +++ 
Sbjct:    23 RQIESAIEKWDDY---KEDFDKEYSESEEQTYMEAFVKNMIHIENHNRDHRLGRKTFEMG 79

Query:    83 LNKFADMTNHEFASTYAGSKIKHHRMFQGTR--GNGTFMYGKVTSIPPSVDWRKKGSVTA 140
             LN  AD+   ++          + R+F  +R   + +F+      +P  VDWR    VT 
Sbjct:    80 LNHIADLPFSQYRKLNG-----YRRLFGDSRIKNSSSFLAPFNVQVPDEVDWRDTHLVTD 134

Query:   141 VKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFE 199
             VK+QG CGSCWAFS   A+EG +     +LVSLSEQ LVDC T   N GCNGGLM+ AFE
Sbjct:   135 VKNQGMCGSCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFE 194

Query:   200 FIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVS 258
             +I+   GV TE  YPY+  D  C  +K++  A    G+ + P   E+ L  AVA Q P+S
Sbjct:   195 YIRDNHGVDTEESYPYKGRDMKCHFNKKTVGADD-KGYVDTPEGDEEQLKIAVATQGPIS 253

Query:   259 VAIDAGSSDFQFYSEGVFTGE-CGTE-LNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEK 316
             +AIDAG   FQ Y +GV+  E C +E L+HGV  VGYGT  +   YWIV+NSWG  WGEK
Sbjct:   254 IAIDAGHRSFQLYKKGVYYDEECSSEELDHGVLLVGYGTDPEHGDYWIVKNSWGAGWGEK 313

Query:   317 GYIRMQRGISDKKGLCGIAMEASYPI 342
             GYIR+ R   ++   CG+A +ASYP+
Sbjct:   314 GYIRIAR---NRNNHCGVATKASYPL 336


>DICTYBASE|DDB_G0279187
            symbol:cprG "cysteine proteinase 7" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0279187 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000030 ProtClustDB:CLSZ2846820 MEROPS:C01.081
            EMBL:U72746 RefSeq:XP_641720.2 ProteinModelPortal:Q94504 SMR:Q94504
            PRIDE:Q94504 EnsemblProtists:DDB0215005 GeneID:8621915
            KEGG:ddi:DDB_G0279187 OMA:INTETEK Uniprot:Q94504
        Length = 460

 Score = 490 (177.5 bits), Expect = 2.0e-57, Sum P(2) = 2.0e-57
 Identities = 109/275 (39%), Positives = 153/275 (55%)

Query:    27 KELESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKF 86
             K+  SE    + +  W   H    S +E + R+N+FK N+ +V++ N       L LN F
Sbjct:    18 KQQLSEVEYRNAFTNWMIAHQRHYSSEEFNGRYNIFKANMDYVNEWNTKGSETVLGLNVF 77

Query:    87 ADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQ 146
             AD++N E+ +TY G+      + + T  +      K+      VDWR +G+VT +K+QGQ
Sbjct:    78 ADISNEEYRATYLGTPFDASSL-EMTESD------KIFDASAQVDWRTQGAVTPIKNQGQ 130

Query:   147 CGSCWAFSTIAAVEGINHIMTNK--LVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKK 203
             CG CW+FST  A EG  ++   K  LVSLSEQ L+DC     N GC GGLM LAFE+I  
Sbjct:   131 CGGCWSFSTTGATEGAQYLANGKKNLVSLSEQNLIDCSGSYGNNGCEGGLMTLAFEYIIN 190

Query:   204 KGGVTTEAKYPYQANDGT-CDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAID 262
               G+ TE+ YPY A DG  C  + ++  A  +  + NV +  E  L   V + P SVAID
Sbjct:   191 NKGIDTESSYPYTAEDGKKCKFNPKNV-AAQLSSYVNVTSGSESDLAAKVTQGPTSVAID 249

Query:   263 AGSSDFQFYSEGVFTGE-CG-TELNHGVAAVGYGT 295
             A +  FQ Y  G++    C  T+L+HGV AVG+GT
Sbjct:   250 ASNQSFQLYVSGIYNEPACSSTQLDHGVLAVGFGT 284

 Score = 118 (46.6 bits), Expect = 2.0e-57, Sum P(2) = 2.0e-57
 Identities = 29/61 (47%), Positives = 35/61 (57%)

Query:   281 GTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASY 340
             G+  N GV    Y T  D   YWIV+NSWG  WG  GYI M +G +++   CGIA  AS 
Sbjct:   404 GSNSNGGV----YPTAGD---YWIVKNSWGTSWGMDGYILMTKGNNNQ---CGIATMASR 453

Query:   341 P 341
             P
Sbjct:   454 P 454


>UNIPROTKB|Q9GLE3
            symbol:CTSK "Cathepsin K" species:9823 "Sus scrofa"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0005576 "extracellular
            region" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005576 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 MEROPS:I29.007
            CTD:1513 KO:K01371 OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55
            OMA:LKVPPSH EMBL:AF292030 RefSeq:NP_999467.1 UniGene:Ssc.1020
            ProteinModelPortal:Q9GLE3 SMR:Q9GLE3 STRING:Q9GLE3
            Ensembl:ENSSSCT00000007283 GeneID:397569 KEGG:ssc:397569
            ArrayExpress:Q9GLE3 Uniprot:Q9GLE3
        Length = 330

 Score = 590 (212.7 bits), Expect = 2.2e-57, P = 2.2e-57
 Identities = 131/315 (41%), Positives = 187/315 (59%)

Query:    36 WDLYER-WRSHHTVSRSLDEKHKRFNVFKQNVMHV--H--QTNKMDKPYKLKLNKFADMT 90
             W+L+++ +R  +  +  +DE  +R  ++++N+ H+  H  + +     Y+L +N   DMT
Sbjct:    27 WELWKKTYRKQY--NSKVDEISRRL-IWEKNLKHISIHNLEASLGVHTYELAMNHLGDMT 83

Query:    91 NHEFASTYAGSKIKHHRMFQGTRGNGT-FMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGS 149
             + E      G K+        +R N T ++       P S+D+RKKG VT VK+QGQCGS
Sbjct:    84 SEEVVQKMTGLKVPPSH----SRSNDTLYIPDWEGRTPDSIDYRKKGYVTPVKNQGQCGS 139

Query:   150 CWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTT 209
             CWAFS++ A+EG     T KL++LS Q LVDC   +N GC GG M  AF++++K  G+ +
Sbjct:   140 CWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNAFQYVQKNRGIDS 198

Query:   210 EAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDF 268
             E  YPY   D  C +   +  A    G+  +P  +E AL +AVA+  PVSVAIDA  + F
Sbjct:   199 EDAYPYVGQDENC-MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSF 257

Query:   269 QFYSEGVFTGE-CGTE-LNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGIS 326
             QFYS+GV+  E C ++ LNH V AVGYG    G K+WI++NSWG  WG KGYI M R   
Sbjct:   258 QFYSKGVYYDENCNSDNLNHAVLAVGYGIQ-KGKKHWIIKNSWGENWGNKGYILMAR--- 313

Query:   327 DKKGLCGIAMEASYP 341
             +K   CGIA  AS+P
Sbjct:   314 NKNNACGIANLASFP 328


>UNIPROTKB|P43235
            symbol:CTSK "Cathepsin K" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0001957
            "intramembranous ossification" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0045453 "bone resorption"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=TAS] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=TAS] [GO:0036021 "endolysosome lumen" evidence=TAS]
            [GO:0045087 "innate immune response" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            Reactome:REACT_118779 Reactome:REACT_6900 GO:GO:0005615
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087 GO:GO:0045453
            EMBL:CH471121 EMBL:AL355860 GO:GO:0004197 GO:GO:0001957
            HOVERGEN:HBG011513 GO:GO:0036021 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:U13665 EMBL:X82153
            EMBL:U20280 EMBL:S79895 EMBL:CR541675 EMBL:AL356292 EMBL:BC016058
            IPI:IPI00300599 PIR:JC2476 RefSeq:NP_000387.1 UniGene:Hs.632466
            PDB:1ATK PDB:1AU0 PDB:1AU2 PDB:1AU3 PDB:1AU4 PDB:1AYU PDB:1AYV
            PDB:1AYW PDB:1BGO PDB:1BY8 PDB:1MEM PDB:1NL6 PDB:1NLJ PDB:1Q6K
            PDB:1SNK PDB:1TU6 PDB:1U9V PDB:1U9W PDB:1U9X PDB:1VSN PDB:1YK7
            PDB:1YK8 PDB:1YT7 PDB:2ATO PDB:2AUX PDB:2AUZ PDB:2BDL PDB:2R6N
            PDB:3C9E PDB:3H7D PDB:3KW9 PDB:3KWB PDB:3KWZ PDB:3KX1 PDB:3O0U
            PDB:3O1G PDB:3OVZ PDB:4DMX PDB:4DMY PDB:7PCK PDBsum:1ATK
            PDBsum:1AU0 PDBsum:1AU2 PDBsum:1AU3 PDBsum:1AU4 PDBsum:1AYU
            PDBsum:1AYV PDBsum:1AYW PDBsum:1BGO PDBsum:1BY8 PDBsum:1MEM
            PDBsum:1NL6 PDBsum:1NLJ PDBsum:1Q6K PDBsum:1SNK PDBsum:1TU6
            PDBsum:1U9V PDBsum:1U9W PDBsum:1U9X PDBsum:1VSN PDBsum:1YK7
            PDBsum:1YK8 PDBsum:1YT7 PDBsum:2ATO PDBsum:2AUX PDBsum:2AUZ
            PDBsum:2BDL PDBsum:2R6N PDBsum:3C9E PDBsum:3H7D PDBsum:3KW9
            PDBsum:3KWB PDBsum:3KWZ PDBsum:3KX1 PDBsum:3O0U PDBsum:3O1G
            PDBsum:3OVZ PDBsum:4DMX PDBsum:4DMY PDBsum:7PCK
            ProteinModelPortal:P43235 SMR:P43235 DIP:DIP-39993N IntAct:P43235
            STRING:P43235 PhosphoSite:P43235 DMDM:1168793 PaxDb:P43235
            PRIDE:P43235 DNASU:1513 Ensembl:ENST00000271651 GeneID:1513
            KEGG:hsa:1513 UCSC:uc001evp.2 GeneCards:GC01M150768 HGNC:HGNC:2536
            MIM:265800 MIM:601105 neXtProt:NX_P43235 Orphanet:763
            PharmGKB:PA27034 InParanoid:P43235 OMA:LKVPPSH PhylomeDB:P43235
            BindingDB:P43235 ChEMBL:CHEMBL268 EvolutionaryTrace:P43235
            GenomeRNAi:1513 NextBio:6267 ArrayExpress:P43235 Bgee:P43235
            CleanEx:HS_CTSK CleanEx:HS_CTSO Genevestigator:P43235
            GermOnline:ENSG00000143387 Uniprot:P43235
        Length = 329

 Score = 589 (212.4 bits), Expect = 2.8e-57, P = 2.8e-57
 Identities = 134/323 (41%), Positives = 187/323 (57%)

Query:    29 LESEEGLWDLYERWRSHHT--VSRSLDEKHKRFNVFKQNVMHV--H--QTNKMDKPYKLK 82
             L  EE L   +E W+  H    +  +DE  +R  ++++N+ ++  H  + +     Y+L 
Sbjct:    16 LYPEEILDTHWELWKKTHRKQYNNKVDEISRRL-IWEKNLKYISIHNLEASLGVHTYELA 74

Query:    83 LNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTS-IPPSVDWRKKGSVTAV 141
             +N   DMT+ E      G K+        +R N T    +     P SVD+RKKG VT V
Sbjct:    75 MNHLGDMTSEEVVQKMTGLKVP----LSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTPV 130

Query:   142 KDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFI 201
             K+QGQCGSCWAFS++ A+EG     T KL++LS Q LVDC   +N GC GG M  AF+++
Sbjct:   131 KNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNAFQYV 189

Query:   202 KKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVA 260
             +K  G+ +E  YPY   + +C +   +  A    G+  +P  +E AL +AVA+  PVSVA
Sbjct:   190 QKNRGIDSEDAYPYVGQEESC-MYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVA 248

Query:   261 IDAGSSDFQFYSEGVFTGE-CGTE-LNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGY 318
             IDA  + FQFYS+GV+  E C ++ LNH V AVGYG    G K+WI++NSWG  WG KGY
Sbjct:   249 IDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQ-KGNKHWIIKNSWGENWGNKGY 307

Query:   319 IRMQRGISDKKGLCGIAMEASYP 341
             I M R   +K   CGIA  AS+P
Sbjct:   308 ILMAR---NKNNACGIANLASFP 327


>UNIPROTKB|Q90686
            symbol:CTSK "Cathepsin K" species:9031 "Gallus gallus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            CTD:1513 KO:K01371 PANTHER:PTHR12411:SF55 EMBL:U37691
            IPI:IPI00575213 RefSeq:NP_990302.1 UniGene:Gga.51509
            ProteinModelPortal:Q90686 SMR:Q90686 MEROPS:C01.036 GeneID:395818
            KEGG:gga:395818 NextBio:20815886 Uniprot:Q90686
        Length = 334

 Score = 588 (212.0 bits), Expect = 3.6e-57, P = 3.6e-57
 Identities = 137/324 (42%), Positives = 185/324 (57%)

Query:    26 EKELESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMH--VHQTNKMDK-PYKLK 82
             E EL+++   WDL++R      V R         ++ ++  +H    +  ++ K  ++L 
Sbjct:    24 EPELDAQ---WDLWKR-TIQKAVQRQGGRNVPEVDLGEEPEVHRCPQRGARLGKHSFQLA 79

Query:    83 LNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTS-IPPSVDWRKKGSVTAV 141
             +N   DMT+ E   T  G ++   R     R NGT      +S  P +VDWR+KG VT V
Sbjct:    80 MNYLGDMTSEEVVRTMTGLRVPRSR----PRPNGTLYVPDWSSRAPAAVDWRRKGYVTPV 135

Query:   142 KDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFI 201
             KDQGQCGSCWAFS++ A+EG     T KL+SLS Q LV C    N GC GG M  AFE++
Sbjct:   136 KDQGQCGSCWAFSSVGALEGQLKRRTGKLLSLSPQNLVYC-VSNNNGCGGGYMTNAFEYV 194

Query:   202 KKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAK-QPVSVA 260
             +   G+ +E  YPY   D +C  S  +  A    G+  +P ++E AL +AVA+  PVSV 
Sbjct:   195 RLNRGIDSEDAYPYIGQDESCMYSP-TGKAAKCRGYREIPEDNEKALKRAVARIGPVSVG 253

Query:   261 IDAGSSDFQFYSEGVF--TGECGTE-LNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKG 317
             IDA    FQFYS GV+  TG C  E +NH V AVGYG    GTK+WI++NSWG EWG KG
Sbjct:   254 IDASLPSFQFYSRGVYYDTG-CNPENINHAVLAVGYGAQ-KGTKHWIIKNSWGTEWGNKG 311

Query:   318 YIRMQRGISDKKGLCGIAMEASYP 341
             Y+ + R +   K  CGIA  AS+P
Sbjct:   312 YVLLARNM---KQTCGIANLASFP 332


>MGI|MGI:107341
            symbol:Ctss "cathepsin S" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=ISO;IDA] [GO:0006508 "proteolysis"
            evidence=IDA] [GO:0008152 "metabolic process" evidence=ISO]
            [GO:0008233 "peptidase activity" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0009986 "cell
            surface" evidence=ISO] [GO:0016020 "membrane" evidence=IDA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0045453 "bone
            resorption" evidence=ISO] [GO:0051930 "regulation of sensory
            perception of pain" evidence=ISO] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:107341 GO:GO:0016020 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0008233 GO:GO:0031905 Reactome:REACT_102124
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 BRENDA:3.4.22.27
            ChiTaRS:CTSS EMBL:AF051732 EMBL:AF051727 EMBL:AF051728
            EMBL:AF051729 EMBL:AF051726 EMBL:AF051730 EMBL:AF051731
            EMBL:AF038546 EMBL:AJ002386 EMBL:AC092203 EMBL:Y18466 EMBL:AJ223208
            IPI:IPI00309520 UniGene:Mm.3619 PDB:1M0H PDBsum:1M0H
            ProteinModelPortal:O70370 SMR:O70370 STRING:O70370
            PhosphoSite:O70370 PaxDb:O70370 PRIDE:O70370
            Ensembl:ENSMUST00000116304 BindingDB:O70370 ChEMBL:CHEMBL4098
            NextBio:282932 Bgee:O70370 CleanEx:MM_CTSS Genevestigator:O70370
            GermOnline:ENSMUSG00000038642 Uniprot:O70370
        Length = 340

 Score = 583 (210.3 bits), Expect = 1.2e-56, P = 1.2e-56
 Identities = 130/315 (41%), Positives = 181/315 (57%)

Query:    36 WDLYERWRSHHTVSRSLDEKHKRFNVFKQNV--MHVHQTN-KMDK-PYKLKLNKFADMTN 91
             WDL+++  +H    +  +E+  R  ++++N+  + +H     M    Y++ +N   DMTN
Sbjct:    36 WDLWKK--THEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTN 93

Query:    92 HEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCW 151
              E        +I      Q  +   TF      ++P +VDWR+KG VT VK QG CG+CW
Sbjct:    94 EEILCRMGALRIPR----QSPK-TVTFRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACW 148

Query:   152 AFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ---NQGCNGGLMELAFEFIKKKGGVT 208
             AFS + A+EG   + T KL+SLS Q LVDC  ++   N+GC GG M  AF++I   GG+ 
Sbjct:   149 AFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIE 208

Query:   209 TEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVA-KQPVSVAIDAGSSD 267
              +A YPY+A D  C  + ++  A +   +  +P   EDAL +AVA K PVSV IDA  S 
Sbjct:   209 ADASYPYKATDEKCHYNSKNR-AATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSS 267

Query:   268 FQFYSEGVFTG-ECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGIS 326
             F FY  GV+    C   +NHGV  VGYGT LDG  YW+V+NSWG  +G++GYIRM R   
Sbjct:   268 FFFYKSGVYDDPSCTGNVNHGVLVVGYGT-LDGKDYWLVKNSWGLNFGDQGYIRMAR--- 323

Query:   327 DKKGLCGIAMEASYP 341
             + K  CGIA   SYP
Sbjct:   324 NNKNHCGIASYCSYP 338


>MGI|MGI:107823
            symbol:Ctsk "cathepsin K" species:10090 "Mus musculus"
            [GO:0005615 "extracellular space" evidence=ISO] [GO:0005737
            "cytoplasm" evidence=ISO] [GO:0005764 "lysosome" evidence=ISO]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0045453 "bone resorption" evidence=ISO]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:107823 GO:GO:0005615 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0045453
            GO:GO:0001957 HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 OMA:LKVPPSH EMBL:X94444
            EMBL:AJ006033 EMBL:BC046320 IPI:IPI00316575 PIR:S74227
            RefSeq:NP_031828.2 UniGene:Mm.272085 ProteinModelPortal:P55097
            SMR:P55097 MINT:MINT-3089515 STRING:P55097 PhosphoSite:P55097
            PRIDE:P55097 Ensembl:ENSMUST00000015664 GeneID:13038 KEGG:mmu:13038
            InParanoid:P55097 BioCyc:MetaCyc:MONOMER-14811 ChEMBL:CHEMBL1075277
            NextBio:282924 Bgee:P55097 CleanEx:MM_CTSK Genevestigator:P55097
            GermOnline:ENSMUSG00000028111 Uniprot:P55097
        Length = 329

 Score = 582 (209.9 bits), Expect = 1.6e-56, P = 1.6e-56
 Identities = 129/322 (40%), Positives = 183/322 (56%)

Query:    29 LESEEGLWDLYERWRSHHT--VSRSLDEKHKRFNVFKQNVMHVHQTNKMDK----PYKLK 82
             L  EE L   +E W+  H    +  +DE  +R  ++++N+  +   N         Y+L 
Sbjct:    16 LSPEEMLDTQWELWKKTHQKQYNSKVDEISRRL-IWEKNLKQISAHNLEASLGVHTYELA 74

Query:    83 LNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVK 142
             +N   DMT+ E      G +I   R +           G+V   P S+D+RKKG VT VK
Sbjct:    75 MNHLGDMTSEEVVQKMTGLRIPPSRSYSNDTLYTPEWEGRV---PDSIDYRKKGYVTPVK 131

Query:   143 DQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIK 202
             +QGQCGSCWAFS+  A+EG     T KL++LS Q LVDC T+ N GC GG M  AF++++
Sbjct:   132 NQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVTE-NYGCGGGYMTTAFQYVQ 190

Query:   203 KKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAI 261
             + GG+ +E  YPY   D +C +   ++ A    G+  +P  +E AL +AVA+  P+SV+I
Sbjct:   191 QNGGIDSEDAYPYVGQDESC-MYNATAKAAKCRGYREIPVGNEKALKRAVARVGPISVSI 249

Query:   262 DAGSSDFQFYSEGVFTGE-CGTE-LNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYI 319
             DA  + FQFYS GV+  E C  + +NH V  VGYGT   G+K+WI++NSWG  WG KGY 
Sbjct:   250 DASLASFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQ-KGSKHWIIKNSWGESWGNKGYA 308

Query:   320 RMQRGISDKKGLCGIAMEASYP 341
              + R   +K   CGI   AS+P
Sbjct:   309 LLAR---NKNNACGITNMASFP 327


>UNIPROTKB|O46427
            symbol:CTSH "Pro-cathepsin H" species:9823 "Sus scrofa"
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0032526 "response to retinoic acid" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0043129
            "surfactant homeostasis" evidence=ISS] [GO:0010815 "bradykinin
            catabolic process" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0005829 "cytosol"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0030335 "positive regulation of cell
            migration" evidence=ISS] [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0016505 "apoptotic protease activator
            activity" evidence=ISS] [GO:0004252 "serine-type endopeptidase
            activity" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=ISS] [GO:0031638 "zymogen activation"
            evidence=ISS] [GO:0030108 "HLA-A specific activating MHC class I
            receptor activity" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0010628 "positive regulation of gene
            expression" evidence=ISS] [GO:0070324 "thyroid hormone binding"
            evidence=ISS] [GO:0006508 "proteolysis" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0060448
            "dichotomous subdivision of terminal units involved in lung
            branching" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] [GO:0005764 "lysosome" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0004177 "aminopeptidase
            activity" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=ISS] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 MEROPS:C01.040 CTD:1512 OMA:STSCHKT
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 EMBL:AF001169
            RefSeq:NP_999094.1 UniGene:Ssc.3593 PDB:1NB3 PDB:1NB5 PDB:8PCH
            PDBsum:1NB3 PDBsum:1NB5 PDBsum:8PCH ProteinModelPortal:O46427
            SMR:O46427 Ensembl:ENSSSCT00000001983 GeneID:396969 KEGG:ssc:396969
            EvolutionaryTrace:O46427 ArrayExpress:O46427 Uniprot:O46427
        Length = 335

 Score = 581 (209.6 bits), Expect = 2.0e-56, P = 2.0e-56
 Identities = 128/311 (41%), Positives = 181/311 (58%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTY 98
             ++ W   H    SL+E H R  VF  N   ++  N  +  +KL LN+F+DM+  E    Y
Sbjct:    35 FKSWMVQHQKKYSLEEYHHRLQVFVSNWRKINAHNAGNHTFKLGLNQFSDMSFDEIRHKY 94

Query:    99 AGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGS-VTAVKDQGQCGSCWAFSTIA 157
               S+ ++      T+GN  ++ G     PPS+DWRKKG+ V+ VK+QG CGSCW FST  
Sbjct:    95 LWSEPQN---CSATKGN--YLRG-TGPYPPSMDWRKKGNFVSPVKNQGSCGSCWTFSTTG 148

Query:   158 AVEGINHIMTNKLVSLSEQELVDCDTD-QNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQ 216
             A+E    I T K++SL+EQ+LVDC  +  N GC GGL   AFE+I+   G+  E  YPY+
Sbjct:   149 ALESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYK 208

Query:   217 ANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDFQFYSEGV 275
               D  C    + + A   D   N+  N E+A+++AVA   PVS A +  ++DF  Y +G+
Sbjct:   209 GQDDHCKFQPDKAIAFVKDV-ANITMNDEEAMVEAVALYNPVSFAFEV-TNDFLMYRKGI 266

Query:   276 FTG-ECGT---ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGL 331
             ++   C     ++NH V AVGYG   +G  YWIV+NSWGP+WG  GY  ++RG    K +
Sbjct:   267 YSSTSCHKTPDKVNHAVLAVGYGEE-NGIPYWIVKNSWGPQWGMNGYFLIERG----KNM 321

Query:   332 CGIAMEASYPI 342
             CG+A  ASYPI
Sbjct:   322 CGLAACASYPI 332


>TAIR|locus:2175088
            symbol:ALP "aleurain-like protease" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0005829 "cytosol"
            evidence=RCA] [GO:0006096 "glycolysis" evidence=RCA] [GO:0006816
            "calcium ion transport" evidence=RCA] [GO:0006833 "water transport"
            evidence=RCA] [GO:0006972 "hyperosmotic response" evidence=RCA]
            [GO:0007030 "Golgi organization" evidence=RCA] [GO:0009266
            "response to temperature stimulus" evidence=RCA] [GO:0009651
            "response to salt stress" evidence=RCA] [GO:0009750 "response to
            fructose stimulus" evidence=RCA] [GO:0042744 "hydrogen peroxide
            catabolic process" evidence=RCA] [GO:0046686 "response to cadmium
            ion" evidence=RCA] [GO:0007568 "aging" evidence=IEP]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002688 GO:GO:0005773
            GO:GO:0007568 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AB011483 KO:K01366
            ProtClustDB:CLSN2689015 UniGene:At.25414 IPI:IPI00846287
            RefSeq:NP_001078774.1 ProteinModelPortal:A8MQZ1 SMR:A8MQZ1
            STRING:A8MQZ1 PRIDE:A8MQZ1 EnsemblPlants:AT5G60360.3 GeneID:836158
            KEGG:ath:AT5G60360 OMA:CGSTPMD Genevestigator:A8MQZ1 Uniprot:A8MQZ1
        Length = 361

 Score = 580 (209.2 bits), Expect = 2.5e-56, P = 2.5e-56
 Identities = 126/292 (43%), Positives = 170/292 (58%)

Query:    50 RSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTYAGSKIKHHRMF 109
             ++++E   RF++FK+N+  +  TNK    YKL +N+FAD+T  EF  T  G+        
Sbjct:    71 QNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQNCSATL 130

Query:   110 QGTRGNGTFMYGKVT--SIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMT 167
             +G+         KVT  ++P + DWR+ G V+ VKDQG CGSCW FST  A+E   H   
Sbjct:   131 KGSH--------KVTEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAF 182

Query:   168 NKLVSLSEQELVDC-DTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSK 226
              K +SLSEQ+LVDC     N GCNGGL   AFE+IK  GG+ TE  YPY   D TC  S 
Sbjct:   183 GKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKDETCKFSA 242

Query:   227 ESSPAVSIDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDFQFYSEGVFT-GECGT-- 282
             E+   V +    N+    ED L  AV   +PVS+A +   S F+ Y  GV+T   CG+  
Sbjct:   243 ENV-GVQVLNSVNITLGAEDELKHAVGLVRPVSIAFEVIHS-FRLYKSGVYTDSHCGSTP 300

Query:   283 -ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCG 333
              ++NH V AVGYG   DG  YW+++NSWG +WG+KGY +M+ G    K +CG
Sbjct:   301 MDVNHAVLAVGYGVE-DGVPYWLIKNSWGADWGDKGYFKMEMG----KNMCG 347


>ZFIN|ZDB-GENE-001205-4
            symbol:ctsk "cathepsin K" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-001205-4 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            CTD:1513 KO:K01371 OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55
            EMBL:BC092901 IPI:IPI00512751 RefSeq:NP_001017778.1
            UniGene:Dr.76224 ProteinModelPortal:Q568D6 SMR:Q568D6 GeneID:550475
            KEGG:dre:550475 InParanoid:Q568D6 NextBio:20879718
            ArrayExpress:Q568D6 Uniprot:Q568D6
        Length = 333

 Score = 580 (209.2 bits), Expect = 2.5e-56, P = 2.5e-56
 Identities = 133/317 (41%), Positives = 182/317 (57%)

Query:    37 DLYERWR-SHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDK----PYKLKLNKFADMTN 91
             + +E W+ +H      L+E+  R  ++++N++ +   NK  +     Y L +N F DMT 
Sbjct:    28 EAWESWKITHKREYNGLNEESIRRTIWEKNMLFIEAHNKEYELGIHTYDLGMNHFGDMTL 87

Query:    92 HEFASTYAGSKIKHHRMFQGTRGNGTFMYG-KVTSIPPSVDWRKKGSVTAVKDQGQCGSC 150
              E A    G ++  +R       N TF+   +V  +P S+D+RK G VT+VK+QG CGSC
Sbjct:    88 EEVAEKVMGLQMPMYR----DPAN-TFVPDDRVGKLPKSIDYRKLGYVTSVKNQGSCGSC 142

Query:   151 WAFSTIAAVEGINHIMTNK--LVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVT 208
             WAFS++ A+EG   +M  K  LV LS Q LVDC T+ N GC GG M  AF ++    G+ 
Sbjct:   143 WAFSSVGALEG--QLMKTKGQLVDLSPQNLVDCVTE-NDGCGGGYMTNAFRYVSNNQGID 199

Query:   209 TEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSD 267
             +E  YPY   D  C  +  S  A S  G++ +P  +E AL  AVA   PVSV IDA  S 
Sbjct:   200 SEESYPYVGTDQQCAYNT-SGVAASCRGYKEIPQGNERALTAAVANVGPVSVGIDAMQST 258

Query:   268 FQFYSEGVFTG-ECGTE-LNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGI 325
             F +Y  GV+    C  E +NH V AVGYG T  G KYWIV+NSWG EWG+KGY+ M R  
Sbjct:   259 FLYYKSGVYYDPNCNKEDVNHAVLAVGYGATPRGKKYWIVKNSWGEEWGKKGYVLMAR-- 316

Query:   326 SDKKGLCGIAMEASYPI 342
              ++   CGIA  AS+P+
Sbjct:   317 -NRNNACGIANLASFPV 332


>ZFIN|ZDB-GENE-050626-55
            symbol:ctssb.2 "cathepsin S, b.2" species:7955
            "Danio rerio" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-050626-55
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            KO:K01368 EMBL:BC093339 IPI:IPI00507098 RefSeq:NP_001017661.1
            UniGene:Dr.132688 ProteinModelPortal:Q566T8 SMR:Q566T8
            GeneID:337572 KEGG:dre:337572 CTD:337572 InParanoid:Q566T8
            NextBio:20812306 ArrayExpress:Q566T8 Uniprot:Q566T8
        Length = 330

 Score = 580 (209.2 bits), Expect = 2.5e-56, P = 2.5e-56
 Identities = 131/312 (41%), Positives = 179/312 (57%)

Query:    39 YERWRSHHTVSRSL-DEKHKRFNVFKQNV--MHVHQTN-KMDK-PYKLKLNKFADMTNHE 93
             +E W+  H    S  DE+  R  ++++N+  + +H     M    Y L +N  ADMT  E
Sbjct:    27 WELWKKKHVKLYSCEDEEVGRRELWERNLELIAIHNLEASMGMHSYDLAINHMADMTTEE 86

Query:    94 FASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAF 153
                T A +++     F+  R    ++      +P ++DWR KG VT+VK+QG CGSCWAF
Sbjct:    87 ILQTLAVTRVPPG--FK--RPTAEYVSSSFAVVPDTLDWRDKGYVTSVKNQGACGSCWAF 142

Query:   154 STIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEAK 212
             S++ A+EG     T KLV LS Q LVDC +   N GCNGG M  AF+++   GG+ +E+ 
Sbjct:   143 SSVGALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGYMSQAFQYVIDNGGIDSESS 202

Query:   213 YPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDFQFY 271
             YPYQ   G+C     S  A +   ++ V    E AL +A+A   PVSVAIDA    F FY
Sbjct:   203 YPYQGTQGSCRYDP-SQRAANCTSYKFVSQGDEQALKEALANIGPVSVAIDATRPQFIFY 261

Query:   272 SEGVFTG-ECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKG 330
               GV+    C  ++NHGV AVGYGT L G  YW+V+NSWG  +G+ GYIR+ R   +K  
Sbjct:   262 RSGVYDDPSCTQKVNHGVLAVGYGT-LSGQDYWLVKNSWGAGFGDGGYIRIAR---NKNN 317

Query:   331 LCGIAMEASYPI 342
             +CGIA EA YPI
Sbjct:   318 MCGIASEACYPI 329


>ZFIN|ZDB-GENE-030131-3539
            symbol:ctsh "cathepsin H" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-030131-3539
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 KO:K01366 HOVERGEN:HBG011513
            CTD:1512 OrthoDB:EOG4W9J43 MEROPS:I29.003 HSSP:P43235 EMBL:BC067615
            IPI:IPI00506892 RefSeq:NP_997853.1 UniGene:Dr.14176
            ProteinModelPortal:Q6NWF2 SMR:Q6NWF2 PRIDE:Q6NWF2 GeneID:324818
            KEGG:dre:324818 InParanoid:Q6NWF2 NextBio:20808976 Bgee:Q6NWF2
            Uniprot:Q6NWF2
        Length = 330

 Score = 579 (208.9 bits), Expect = 3.3e-56, P = 3.3e-56
 Identities = 130/321 (40%), Positives = 181/321 (56%)

Query:    29 LESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFAD 88
             L +EE  +  ++ W S +     ++E ++R  +F +N   + Q N+ +  + + LN+F+D
Sbjct:    21 LYTEEDEYH-FKSWMSQYNKKYEINEFYQRLQIFLENKKRIDQHNEGNHKFSMGLNQFSD 79

Query:    89 MTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGS-VTAVKDQGQC 147
             MT  EF  TY    +   +    TRGN     G     P ++DWR KG  +T VK+QG C
Sbjct:    80 MTFAEFKKTYL---LTEPQNCSATRGNHVSSNGLY---PDAIDWRTKGHYITDVKNQGPC 133

Query:   148 GSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTD-QNQGCNGGLMELAFEFIKKKGG 206
             GSCW FST   +E +  I T KL+ L+EQ+L+DC  D  N GCNGGL   AFE+I    G
Sbjct:   134 GSCWTFSTTGCLESVTAIATGKLLQLAEQQLIDCAGDFDNHGCNGGLPSHAFEYIMYNKG 193

Query:   207 VTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAK-QPVSVAIDAGS 265
             + TE  YPYQA  G C   K    A  +    N+    E  ++ AVA+  PVS A +  +
Sbjct:   194 LMTEDDYPYQAKGGQCRF-KPQLAAAFVKEVVNITKYDEMGMVDAVARLNPVSFAYEV-T 251

Query:   266 SDFQFYSEGVFTG-ECG--TEL-NHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRM 321
             SDF  Y +G++T  EC   T++ NH V AVGY    +GT YWIV+NSWG  WG KGY  +
Sbjct:   252 SDFMHYKDGIYTSTECHNTTDMVNHAVLAVGYAEE-NGTPYWIVKNSWGTNWGIKGYFYI 310

Query:   322 QRGISDKKGLCGIAMEASYPI 342
             +RG    K +CG+A  +SYPI
Sbjct:   311 ERG----KNMCGLAACSSYPI 327


>UNIPROTKB|Q24940
            symbol:Cat-1 "Cathepsin L-like proteinase" species:6192
            "Fasciola hepatica" [GO:0004175 "endopeptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005576 "extracellular region" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005576 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0004197 EMBL:L33771 PIR:S43991 PDB:2O6X
            PDBsum:2O6X ProteinModelPortal:Q24940 SMR:Q24940 MEROPS:C01.033
            EvolutionaryTrace:Q24940 Uniprot:Q24940
        Length = 326

 Score = 577 (208.2 bits), Expect = 5.3e-56, P = 5.3e-56
 Identities = 130/317 (41%), Positives = 185/317 (58%)

Query:    37 DLYERW-RSHHTVSRSLDEKHKRFNVFKQNVMHVHQTN-KMDK---PYKLKLNKFADMTN 91
             DL+ +W R ++      D++H+R N++++NV H+ + N + D     Y L LN+F DMT 
Sbjct:    19 DLWHQWKRMYNKEYNGADDQHRR-NIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTF 77

Query:    92 HEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCW 151
              EF + Y     +   +      +G        ++P  +DWR+ G VT VKDQG CGSCW
Sbjct:    78 EEFKAKYLTEMSRASDILS----HGVPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCW 133

Query:   152 AFSTIAAVEGINHIMTNKL--VSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVT 208
             AFST   +EG    M N+   +S SEQ+LVDC     N GC+GGLME A++++K+ G + 
Sbjct:   134 AFSTTGTMEG--QYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQFG-LE 190

Query:   209 TEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAV-AKQPVSVAIDAGSSD 267
             TE+ YPY A +G C  +K+   A  + G+  V +  E  L   V A++P +VA+D   SD
Sbjct:   191 TESSYPYTAVEGQCRYNKQLGVA-KVTGYYTVHSGSEVELKNLVGARRPAAVAVDV-ESD 248

Query:   268 FQFYSEGVFTGE-CGT-ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGI 325
             F  Y  G++  + C    +NH V AVGYGT   GT YWIV+NSWG  WGE+GYIRM R  
Sbjct:   249 FMMYRSGIYQSQTCSPLRVNHAVLAVGYGTQ-GGTDYWIVKNSWGTYWGERGYIRMAR-- 305

Query:   326 SDKKGLCGIAMEASYPI 342
              ++  +CGIA  AS P+
Sbjct:   306 -NRGNMCGIASLASLPM 321


>UNIPROTKB|Q86GF7
            symbol:Cys "Crustapain" species:6703 "Pandalus borealis"
            [GO:0005576 "extracellular region" evidence=IC] [GO:0007586
            "digestion" evidence=NAS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IDA] [GO:0030163 "protein catabolic process"
            evidence=NAS] [GO:0030574 "collagen catabolic process"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005576
            GO:GO:0007586 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0030163 GO:GO:0030574 EMBL:AB091669
            ProteinModelPortal:Q86GF7 SMR:Q86GF7 MEROPS:C01.030 Uniprot:Q86GF7
        Length = 323

 Score = 577 (208.2 bits), Expect = 5.3e-56, P = 5.3e-56
 Identities = 131/317 (41%), Positives = 178/317 (56%)

Query:    34 GLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTN-KMDK---PYKLKLNKFADM 89
             G W+ ++  +     + S +E H R +VF   +  + + N + DK    Y LK+N F+D+
Sbjct:    18 GEWENFKT-KFGKKYANSEEESH-RMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDL 75

Query:    90 TNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGS 149
             T+ E  +T  G   + H +    +   T      T +   VDWR KG+VT VKDQGQCGS
Sbjct:    76 THEEVLATKTGMTRRRHPLSVLPKSAPT------TPMAADVDWRNKGAVTPVKDQGQCGS 129

Query:   150 CWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVT 208
             CWAFS +AA+EG + + T  LVSLSEQ LVDC +   NQGCNGG    A+++I    G+ 
Sbjct:   130 CWAFSAVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRGID 189

Query:   209 TEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSD 267
             TE+ YPY+A D  C     +  A ++  +    +  E AL  AV  + PVSV IDAG S 
Sbjct:   190 TESSYPYKAIDDNCRYDAGNIGA-TVSSYVEPASGDESALQHAVQNEGPVSVCIDAGQSS 248

Query:   268 FQFYSEGVF-TGECGT-ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGI 325
             F  Y  GV+    C +   NH V AVGYGT  +G  YWIV+NSWG  WGE GYI+M R  
Sbjct:   249 FGSYGGGVYYEPNCDSWYANHAVTAVGYGTDANGGDYWIVKNSWGAWWGESGYIKMAR-- 306

Query:   326 SDKKGLCGIAMEASYPI 342
              ++   C IA  + YP+
Sbjct:   307 -NRDNNCAIATYSVYPV 322


>UNIPROTKB|P09668
            symbol:CTSH "Pro-cathepsin H" species:9606 "Homo sapiens"
            [GO:0001520 "outer dense fiber" evidence=IEA] [GO:0001669
            "acrosomal vesicle" evidence=IEA] [GO:0007283 "spermatogenesis"
            evidence=IEA] [GO:0030984 "kininogen binding" evidence=IEA]
            [GO:0032403 "protein complex binding" evidence=IEA] [GO:0043621
            "protein self-association" evidence=IEA] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0031648 "protein destabilization"
            evidence=IMP] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=ISS] [GO:0032526 "response to retinoic acid"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0030108 "HLA-A
            specific activating MHC class I receptor activity" evidence=IDA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEP] [GO:0010813 "neuropeptide catabolic process"
            evidence=IDA] [GO:0010815 "bradykinin catabolic process"
            evidence=IDA] [GO:0030335 "positive regulation of cell migration"
            evidence=IDA] [GO:0070371 "ERK1 and ERK2 cascade" evidence=IDA]
            [GO:0010628 "positive regulation of gene expression" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IDA;TAS] [GO:0031638 "zymogen
            activation" evidence=IDA] [GO:0016505 "apoptotic protease activator
            activity" evidence=IDA] [GO:0010952 "positive regulation of
            peptidase activity" evidence=IDA] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=ISS] [GO:0004252 "serine-type endopeptidase
            activity" evidence=ISS] [GO:0043066 "negative regulation of
            apoptotic process" evidence=ISS] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0033619 "membrane protein proteolysis"
            evidence=IDA] [GO:0004175 "endopeptidase activity" evidence=IDA]
            [GO:0004177 "aminopeptidase activity" evidence=IDA] [GO:0005764
            "lysosome" evidence=IDA] [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0002250 "adaptive immune response" evidence=IEP]
            [GO:0019882 "antigen processing and presentation" evidence=TAS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0070324 "thyroid hormone binding" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0008284
            "positive regulation of cell proliferation" evidence=ISS]
            [GO:0045766 "positive regulation of angiogenesis" evidence=ISS]
            [GO:0008233 "peptidase activity" evidence=IDA] [GO:0097208
            "alveolar lamellar body" evidence=IDA] [GO:0043129 "surfactant
            homeostasis" evidence=IDA] [GO:0004197 "cysteine-type endopeptidase
            activity" evidence=IDA;TAS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 Reactome:REACT_6900 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913 MEROPS:C01.040 CTD:1512
            OMA:STSCHKT OrthoDB:EOG4W9J43 GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 EMBL:X16832 EMBL:AF426247 EMBL:AK314698 EMBL:AC011944
            EMBL:BC002479 EMBL:X07549 IPI:IPI00297487 PIR:S12486
            RefSeq:NP_004381.2 UniGene:Hs.148641 PDB:1BZN PDBsum:1BZN
            ProteinModelPortal:P09668 SMR:P09668 IntAct:P09668 STRING:P09668
            PhosphoSite:P09668 DMDM:288558851 PaxDb:P09668 PRIDE:P09668
            DNASU:1512 Ensembl:ENST00000220166 GeneID:1512 KEGG:hsa:1512
            UCSC:uc021srk.1 GeneCards:GC15M079213 H-InvDB:HIX0012481
            HGNC:HGNC:2535 HPA:CAB000458 HPA:HPA003524 MIM:116820
            neXtProt:NX_P09668 PharmGKB:PA27033 InParanoid:P09668
            PhylomeDB:P09668 BRENDA:3.4.22.16 ChEMBL:CHEMBL2225 GenomeRNAi:1512
            NextBio:6261 ArrayExpress:P09668 Bgee:P09668 CleanEx:HS_CTSH
            Genevestigator:P09668 GermOnline:ENSG00000103811 GO:GO:0019882
            Uniprot:P09668
        Length = 335

 Score = 571 (206.1 bits), Expect = 2.3e-55, P = 2.3e-55
 Identities = 127/312 (40%), Positives = 177/312 (56%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTY 98
             ++ W S H  + S +E H R   F  N   ++  N  +  +K+ LN+F+DM+  E    Y
Sbjct:    35 FKSWMSKHRKTYSTEEYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKHKY 94

Query:    99 AGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGS-VTAVKDQGQCGSCWAFSTIA 157
               S+ ++      T+ N  ++ G     PPSVDWRKKG+ V+ VK+QG CGSCW FST  
Sbjct:    95 LWSEPQN---CSATKSN--YLRG-TGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTG 148

Query:   158 AVEGINHIMTNKLVSLSEQELVDCDTD-QNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQ 216
             A+E    I T K++SL+EQ+LVDC  D  N GC GGL   AFE+I    G+  E  YPYQ
Sbjct:   149 ALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQ 208

Query:   217 ANDGTCDVSKESSPAVS-IDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDFQFYSEG 274
               DG C    +   A+  +    N+    E+A+++AVA   PVS A +  + DF  Y  G
Sbjct:   209 GKDGYCKF--QPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEV-TQDFMMYRTG 265

Query:   275 VFTG-ECGT---ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKG 330
             +++   C     ++NH V AVGYG   +G  YWIV+NSWGP+WG  GY  ++RG    K 
Sbjct:   266 IYSSTSCHKTPDKVNHAVLAVGYGEK-NGIPYWIVKNSWGPQWGMNGYFLIERG----KN 320

Query:   331 LCGIAMEASYPI 342
             +CG+A  ASYPI
Sbjct:   321 MCGLAACASYPI 332


>UNIPROTKB|Q3T0I2
            symbol:CTSH "Pro-cathepsin H" species:9913 "Bos taurus"
            [GO:0031638 "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISS] [GO:0010815 "bradykinin
            catabolic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0010813 "neuropeptide catabolic
            process" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0045766 "positive regulation of angiogenesis" evidence=ISS]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=ISS] [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0033619 "membrane protein proteolysis" evidence=ISS]
            [GO:0043066 "negative regulation of apoptotic process"
            evidence=ISS] [GO:0004252 "serine-type endopeptidase activity"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0016505 "apoptotic protease activator activity"
            evidence=ISS] [GO:0010952 "positive regulation of peptidase
            activity" evidence=ISS] [GO:0030108 "HLA-A specific activating MHC
            class I receptor activity" evidence=ISS] [GO:0002764 "immune
            response-regulating signaling pathway" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0070324 "thyroid
            hormone binding" evidence=ISS] [GO:0006508 "proteolysis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0097208
            "alveolar lamellar body" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004175
            "endopeptidase activity" evidence=ISS] [GO:0032526 "response to
            retinoic acid" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 EMBL:BC102386 IPI:IPI00693034
            RefSeq:NP_001029557.1 UniGene:Bt.52393 ProteinModelPortal:Q3T0I2
            SMR:Q3T0I2 STRING:Q3T0I2 MEROPS:C01.040 PRIDE:Q3T0I2
            Ensembl:ENSBTAT00000014593 GeneID:510524 KEGG:bta:510524 CTD:1512
            InParanoid:Q3T0I2 OMA:STSCHKT OrthoDB:EOG4W9J43 NextBio:20869490
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 Uniprot:Q3T0I2
        Length = 335

 Score = 570 (205.7 bits), Expect = 2.9e-55, P = 2.9e-55
 Identities = 125/312 (40%), Positives = 179/312 (57%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTY 98
             ++ W   H    S +E + R   F  N+  ++  N  +  +K+ LN+F+DM+  E    Y
Sbjct:    35 FQSWMVQHQKKYSSEEYYHRLQAFASNLREINAHNARNHTFKMGLNQFSDMSFDELKRKY 94

Query:    99 AGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGS-VTAVKDQGQCGSCWAFSTIA 157
               S+ ++      T+ N  ++ G     PPS+DWRKKG+ VT VK+QG CGSCW FST  
Sbjct:    95 LWSEPQN---CSATKSN--YLRG-TGPYPPSMDWRKKGNFVTPVKNQGSCGSCWTFSTTG 148

Query:   158 AVEGINHIMTNKLVSLSEQELVDCDTD-QNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQ 216
             A+E    I T KL  L+EQ+LVDC  +  N GC GGL   AFE+I+   G+  E  YPY+
Sbjct:   149 ALESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYR 208

Query:   217 ANDGTCDVSKESSPAVS-IDGHENVPANHEDALLKAVA-KQPVSVAIDAGSSDFQFYSEG 274
               DG C    + S A++ +    N+  N E+A+++AVA   PVS A +  ++DF  Y +G
Sbjct:   209 GQDGDCKY--QPSKAIAFVKDVANITLNDEEAMVEAVALHNPVSFAFEV-TADFMMYRKG 265

Query:   275 VFTG-ECGT---ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKG 330
             +++   C     ++NH V AVGYG    G  YWIV+NSWGP WG KGY  ++RG    K 
Sbjct:   266 IYSSTSCHKTPDKVNHAVLAVGYGEE-KGIPYWIVKNSWGPNWGMKGYFLIERG----KN 320

Query:   331 LCGIAMEASYPI 342
             +CG+A  AS+PI
Sbjct:   321 MCGLAACASFPI 332


>DICTYBASE|DDB_G0278401
            symbol:cprH "cysteine proteinase 8" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0278401 EMBL:AAFI02000023
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 ProtClustDB:CLSZ2430780 RefSeq:XP_642342.1
            ProteinModelPortal:Q54Y60 MEROPS:C01.A62 EnsemblProtists:DDB0205428
            GeneID:8621547 KEGG:ddi:DDB_G0278401 InParanoid:Q54Y60 OMA:FANMENE
            Uniprot:Q54Y60
        Length = 337

 Score = 569 (205.4 bits), Expect = 3.7e-55, P = 3.7e-55
 Identities = 137/333 (41%), Positives = 181/333 (54%)

Query:    26 EKELESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNK 85
             ++EL SE    D +  W   +  S S  E   R+N+FK N  ++ + N       L LNK
Sbjct:    18 KQEL-SESQYRDAFTDWMISNQKSYSSSEFITRYNIFKTNFDYIEEWNSKGSETVLGLNK 76

Query:    86 FADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQG 145
              AD+TN E+ S Y G       +  GT+    F   K +S   +VDWRKKG+VT VK+Q 
Sbjct:    77 MADITNEEYRSLYLGKPFDASSLI-GTKEEILFS-NKFSS---TVDWRKKGAVTHVKNQQ 131

Query:   146 QCGSCWAFSTIAAVEGINHIM---TNKLVSLSEQELVDCDTD-QNQGCNGGLMELAFEFI 201
              C  CW+FS   A EG + +    TN+LVSLSEQ L+DC T   N GCNGG++  AFE+I
Sbjct:   132 SCSGCWSFSATGATEGAHKLANNGTNELVSLSEQNLIDCSTPFGNTGCNGGVITYAFEYI 191

Query:   202 KKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAI 261
                GG+ TE  YP++  DGTC    E+S A +I  + NV    E +L  AV   PV+ +I
Sbjct:   192 ISNGGIDTEKSYPFEGTDGTCRYKSENSGA-TISSYVNVTFGSESSLESAVNVNPVACSI 250

Query:   262 DAGSSDFQFYSEGV-FTGECG-TELNHGVAAVGYGT----TLDGTK------YWIVRNSW 309
             DA  S F FY  G+ F   C  T L+HGV  VGYGT    + D +       YWI +NSW
Sbjct:   251 DASHSSFLFYKSGIYFEPACSRTNLDHGVLVVGYGTENSQSQDSSSEPNHSNYWIAKNSW 310

Query:   310 GPEWGEKGYIRMQRGISDKKGLCGIAMEASYPI 342
             G      GYI M +   D+  +CGI+  AS+PI
Sbjct:   311 GIN----GYILMSK---DRDNMCGISTLASFPI 336


>UNIPROTKB|G3R9A7
            symbol:CTSH "Uncharacterized protein" species:9595 "Gorilla
            gorilla gorilla" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 OMA:STSCHKT GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 RefSeq:XP_004056662.1 Ensembl:ENSGGOT00000012331
            GeneID:101144312 Uniprot:G3R9A7
        Length = 335

 Score = 569 (205.4 bits), Expect = 3.7e-55, P = 3.7e-55
 Identities = 127/312 (40%), Positives = 176/312 (56%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTY 98
             +  W S H  + S +E H R   F  N   ++  N  +  +K+ LN+F+DM+  E    Y
Sbjct:    35 FRSWMSKHRKTYSTEEYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKHKY 94

Query:    99 AGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGS-VTAVKDQGQCGSCWAFSTIA 157
               S+ ++      T+ N  ++ G     PPSVDWRKKG+ V+ VK+QG CGSCW FST  
Sbjct:    95 LWSEPQN---CSATKSN--YLRG-TGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTG 148

Query:   158 AVEGINHIMTNKLVSLSEQELVDCDTD-QNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQ 216
             A+E    I T K++SL+EQ+LVDC  D  N GC GGL   AFE+I    G+  E  YPYQ
Sbjct:   149 ALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQ 208

Query:   217 ANDGTCDVSKESSPAVS-IDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDFQFYSEG 274
               DG C    +   A+  +    N+    E+A+++AVA   PVS A +  + DF  Y  G
Sbjct:   209 GKDGYCKF--QPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEV-TQDFMMYRTG 265

Query:   275 VFTG-ECGT---ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKG 330
             +++   C     ++NH V AVGYG   +G  YWIV+NSWGP+WG  GY  ++RG    K 
Sbjct:   266 IYSSTSCHKTPDKVNHAVLAVGYGEK-NGIPYWIVKNSWGPKWGMNGYFLIERG----KN 320

Query:   331 LCGIAMEASYPI 342
             +CG+A  ASYPI
Sbjct:   321 MCGLAACASYPI 332


>UNIPROTKB|F6R7P5
            symbol:CTSH "Uncharacterized protein" species:9544 "Macaca
            mulatta" [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=ISS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0004252
            "serine-type endopeptidase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005829 "cytosol" evidence=ISS] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0010815 "bradykinin catabolic
            process" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0016505 "apoptotic protease
            activator activity" evidence=ISS] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=ISS] [GO:0030335
            "positive regulation of cell migration" evidence=ISS] [GO:0031638
            "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0032526 "response to retinoic
            acid" evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            GO:GO:0043066 GO:GO:0005615 GO:GO:0008284 GO:GO:0070371
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458 CTD:1512
            OMA:STSCHKT GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 RefSeq:XP_001108862.1
            UniGene:Mmu.3000 Ensembl:ENSMMUT00000014095 GeneID:711437
            KEGG:mcc:711437 NextBio:19969972 Uniprot:F6R7P5
        Length = 335

 Score = 567 (204.7 bits), Expect = 6.1e-55, P = 6.1e-55
 Identities = 126/312 (40%), Positives = 176/312 (56%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTY 98
             ++ W S H  + S +E H R   F  N   ++  N  +  +K+ LN+F+DM+  E    Y
Sbjct:    35 FKSWMSKHHKTYSTEEYHHRMQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKHKY 94

Query:    99 AGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGS-VTAVKDQGQCGSCWAFSTIA 157
               S+ ++      T+ N  ++ G     PPS+DWRKKG+ V+ VK+QG CGSCW FST  
Sbjct:    95 LWSEPQN---CSATKSN--YLRG-TGPYPPSMDWRKKGNFVSPVKNQGACGSCWTFSTTG 148

Query:   158 AVEGINHIMTNKLVSLSEQELVDCDTD-QNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQ 216
             A+E    I T K++SL+EQ+LVDC  D  N GC GGL   AFE+I    G+  E  YPYQ
Sbjct:   149 ALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQ 208

Query:   217 ANDGTCDVSKESSPAVS-IDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDFQFYSEG 274
               DG C        A+  +    N+    E+A+++AVA   PVS A +  + DF  Y  G
Sbjct:   209 GKDGDCKF--RPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEV-TQDFMIYKTG 265

Query:   275 VFTG-ECGT---ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKG 330
             +++   C     ++NH V AVGYG   +G  YWIV+NSWGP+WG  GY  ++RG    K 
Sbjct:   266 IYSSTSCHKTPDKVNHAVLAVGYGEE-NGIPYWIVKNSWGPQWGMNGYFLIERG----KN 320

Query:   331 LCGIAMEASYPI 342
             +CG+A  ASYPI
Sbjct:   321 MCGLAACASYPI 332


>UNIPROTKB|G1M0X4
            symbol:CTSH "Uncharacterized protein" species:9646
            "Ailuropoda melanoleuca" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 EMBL:ACTA01057330 EMBL:ACTA01065330
            Ensembl:ENSAMET00000013529 Uniprot:G1M0X4
        Length = 337

 Score = 567 (204.7 bits), Expect = 6.1e-55, P = 6.1e-55
 Identities = 128/312 (41%), Positives = 177/312 (56%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTY 98
             ++ W   H    S +E   R   F  N   ++  N  +  +K+ LN+F+DM+  E    Y
Sbjct:    37 FKSWMVQHQKKYSSEEYQHRLRTFVGNWRKINAHNAGNHTFKMGLNQFSDMSFAEIKRKY 96

Query:    99 AGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGS-VTAVKDQGQCGSCWAFSTIA 157
               S+ ++      T+GN  ++ G     PP VDWRKKG  V+ VK+QG CGSCW FST  
Sbjct:    97 LWSEPQN---CSATKGN--YLRG-TGPYPPFVDWRKKGKFVSPVKNQGGCGSCWTFSTTG 150

Query:   158 AVEGINHIMTNKLVSLSEQELVDCDTD-QNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQ 216
             A+E    I T KL+SL+EQ+LVDC  D  N GC GGL   AFE+I+   G+  E  YPY+
Sbjct:   151 ALESAIAIKTGKLLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYIRYNRGIMGEDSYPYK 210

Query:   217 ANDGTCDVSKESSPAVS-IDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDFQFYSEG 274
               DG C    + S A++ +    N+  N E A+++AVA   PVS A +  + DF  Y +G
Sbjct:   211 GQDGDCKF--QPSKAIAFVKDVANITINDEQAMVEAVALFNPVSFAFEV-TGDFMMYRKG 267

Query:   275 VFTG-ECGT---ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKG 330
             V++   C     ++NH V AVGYG   +G  YWIV+NSWGP+WG  GY  ++RG    K 
Sbjct:   268 VYSSTSCHKTPDKVNHAVLAVGYGEQ-NGVPYWIVKNSWGPQWGMHGYFLIERG----KN 322

Query:   331 LCGIAMEASYPI 342
             +CG+A  ASYPI
Sbjct:   323 MCGLAACASYPI 334


>UNIPROTKB|G1RBY1
            symbol:CTSH "Uncharacterized protein" species:61853
            "Nomascus leucogenys" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 EMBL:ADFV01087552 RefSeq:XP_003275518.1
            Ensembl:ENSNLET00000011249 GeneID:100584322 Uniprot:G1RBY1
        Length = 335

 Score = 567 (204.7 bits), Expect = 6.1e-55, P = 6.1e-55
 Identities = 126/312 (40%), Positives = 177/312 (56%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTY 98
             ++ W S H  + S +E H R  +F  N   ++  N  +  +K+ LN+F+DM+  E    Y
Sbjct:    35 FKSWMSKHHKTYSTEEYHHRLQMFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKHKY 94

Query:    99 AGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGS-VTAVKDQGQCGSCWAFSTIA 157
               S+ ++      T+ N  ++ G     PPS+DWRKKG+ V+ VK+QG CGSCW FST  
Sbjct:    95 LWSEPQN---CSATKSN--YLRG-TGPYPPSMDWRKKGNFVSPVKNQGACGSCWTFSTTG 148

Query:   158 AVEGINHIMTNKLVSLSEQELVDCDTD-QNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQ 216
             A+E    I T K++SL+EQ+LVDC  D  N GC GGL   AFE+I    G+  E  YPYQ
Sbjct:   149 ALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQ 208

Query:   217 ANDGTCDVSKESSPAVS-IDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDFQFYSEG 274
               DG C        A+  +    N+    E+A+++AVA   PVS A +  + DF  Y  G
Sbjct:   209 GKDGYCKF--RPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEV-TQDFMMYRRG 265

Query:   275 VFTG-ECGT---ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKG 330
             +++   C     ++NH V AVGYG   +G  YWIV+NSWGP+WG  GY  ++RG    K 
Sbjct:   266 IYSSTSCHKTPDKVNHAVLAVGYGEK-NGIPYWIVKNSWGPQWGMNGYFLIERG----KN 320

Query:   331 LCGIAMEASYPI 342
             +CG+A  ASYPI
Sbjct:   321 MCGLAACASYPI 332


>UNIPROTKB|G3SSC1
            symbol:CTSH "Uncharacterized protein" species:9785
            "Loxodonta africana" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 RefSeq:XP_003413898.1
            Ensembl:ENSLAFT00000003415 GeneID:100662496 Uniprot:G3SSC1
        Length = 335

 Score = 566 (204.3 bits), Expect = 7.8e-55, P = 7.8e-55
 Identities = 128/311 (41%), Positives = 175/311 (56%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTY 98
             ++ W + H    S +E H+R   F  N   ++  N  +  +K+ LN+F+DMT  E    Y
Sbjct:    35 FQSWMAQHQKKYSSEEYHQRQQTFVSNWRKINAHNARNHTFKMALNQFSDMTFAEIKQKY 94

Query:    99 AGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGS-VTAVKDQGQCGSCWAFSTIA 157
               S+ ++      T+GN  ++ G     PP VDWRKKG  V+ VK+QG CGSCW FST  
Sbjct:    95 LWSEPQN---CSATKGN--YLRG-TGPYPPFVDWRKKGHFVSPVKNQGACGSCWTFSTTG 148

Query:   158 AVEGINHIMTNKLVSLSEQELVDCDTD-QNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQ 216
             A+E    I   KL+SL+EQ+LVDC  D  N GC GGL   AFE+I    G+  E  YPY+
Sbjct:   149 ALESAIAIAGGKLLSLAEQQLVDCAKDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYK 208

Query:   217 ANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDFQFYSEGV 275
               D  C    + + A   D   N+  N E+A+++AVA   PVS A +  + DF  YS+G+
Sbjct:   209 GQDDVCKFQPKKAIAFVKDV-ANITLNDEEAMVEAVALYNPVSFAFEV-TDDFMKYSKGI 266

Query:   276 FTG-ECGT---ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGL 331
             ++   C     ++NH V AVGYG    G  YWIV+NSWGP WG  GY  ++RG    K +
Sbjct:   267 YSSTSCHKTPDKVNHAVLAVGYGEE-KGIPYWIVKNSWGPYWGMDGYFLIERG----KNM 321

Query:   332 CGIAMEASYPI 342
             CG+A  ASYPI
Sbjct:   322 CGLAACASYPI 332


>DICTYBASE|DDB_G0281605
            symbol:cfaD "peptidase C1A family protein"
            species:44689 "Dictyostelium discoideum" [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IDA] [GO:0006508 "proteolysis" evidence=IDA] [GO:0031410
            "cytoplasmic vesicle" evidence=IDA] [GO:0031288 "sorocarp
            morphogenesis" evidence=IMP] [GO:0008285 "negative regulation of
            cell proliferation" evidence=IGI;IDA] [GO:0005576 "extracellular
            region" evidence=IEA;IDA] [GO:0005515 "protein binding"
            evidence=IPI] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0281605
            GO:GO:0008285 GO:GO:0005615 GenomeReviews:CM000152_GR
            eggNOG:COG4870 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0031410 EMBL:AAFI02000042
            GO:GO:0031288 RefSeq:XP_640530.1 HSSP:P07711
            ProteinModelPortal:Q54TR1 STRING:Q54TR1 PRIDE:Q54TR1
            EnsemblProtists:DDB0229857 GeneID:8623140 KEGG:ddi:DDB_G0281605
            InParanoid:Q54TR1 OMA:PSAHEHE ProtClustDB:CLSZ2430523
            Uniprot:Q54TR1
        Length = 531

 Score = 563 (203.2 bits), Expect = 1.6e-54, P = 1.6e-54
 Identities = 124/323 (38%), Positives = 184/323 (56%)

Query:    29 LESEEGLWDLYERWRSHHTVSRSLDEKH-KRFNVFK--QNVMHVHQTNKMDKPYKLKLNK 85
             L  EE   +L++ +++ +    S  ++H +RF  FK  + ++  H  N  +  YKL +N 
Sbjct:   215 LAKEEQASNLFKEYKAQYNKEYSSQDEHDERFINFKAARKIIATH--NAKESSYKLGMNH 272

Query:    86 FADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQG 145
             +AD++N EF +T    K+    +   T  +       + SIP +VDWR +  VT VKDQG
Sbjct:   273 YADLSNKEF-NTLVKPKVARPSV---TGADSVHDDESLRSIPSTVDWRNQNCVTPVKDQG 328

Query:   146 QCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDT-DQNQGCNGGLMELAFEFIKKK 204
              CGSCW F +  ++EG N +   +LVSLSEQ+LVDC     +QGC GG    AF+++ + 
Sbjct:   329 ICGSCWTFGSTGSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEI 388

Query:   205 GGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDA 263
             G + TE+ YPY   +G C     +   VSI G+ NV +  E AL  A+A   PV++AIDA
Sbjct:   389 GSLATESNYPYLMQNGLCRDRTVTPSGVSITGYVNVTSGSESALQNAIATTGPVAIAIDA 448

Query:   264 GSSDFQFYSEGVFTGE-CGT---ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYI 319
                DF++Y  GV+    C     +L+H V A+GYGT   G  Y++V+NSW   WG  GY+
Sbjct:   449 SVDDFRYYMSGVYNNPACKNGLDDLDHEVLAIGYGT-YQGQDYFLVKNSWSTNWGMDGYV 507

Query:   320 RMQRGISDKKGLCGIAMEASYPI 342
              M R  +D   LCG++ +A+YPI
Sbjct:   508 YMAR--NDNN-LCGVSSQATYPI 527


>UNIPROTKB|P25326
            symbol:CTSS "Cathepsin S" species:9913 "Bos taurus"
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0016020 "membrane" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0002250 "adaptive
            immune response" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0016020 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0002250 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            GO:GO:0097067 EMBL:BC102245 EMBL:M95211 EMBL:X62001 IPI:IPI00702008
            PIR:S15844 RefSeq:NP_001028787.1 UniGene:Bt.7938
            ProteinModelPortal:P25326 SMR:P25326 STRING:P25326 PRIDE:P25326
            Ensembl:ENSBTAT00000022774 GeneID:327711 KEGG:bta:327711 CTD:1520
            InParanoid:P25326 KO:K01368 OMA:KAMDQKC OrthoDB:EOG4JM7Q2
            NextBio:20810175 Uniprot:P25326
        Length = 331

 Score = 563 (203.2 bits), Expect = 1.6e-54, P = 1.6e-54
 Identities = 131/314 (41%), Positives = 180/314 (57%)

Query:    36 WDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHV--HQT-NKMDK-PYKLKLNKFADMTN 91
             WDL+++  ++    +  +E+  R  ++++N+  V  H   + M    Y+L +N   DMT+
Sbjct:    28 WDLWKK--TYGKQYKEKNEEVARRLIWEKNLKTVTLHNLEHSMGMHSYELGMNHLGDMTS 85

Query:    92 HEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCW 151
              E  S  +  ++      Q  R N T+       +P S+DWR+KG VT VK QG CGSCW
Sbjct:    86 EEVISLMSSLRVPS----QWPR-NVTYKSDPNQKLPDSMDWREKGCVTEVKYQGACGSCW 140

Query:   152 AFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ--NQGCNGGLMELAFEFIKKKGGVTT 209
             AFS + A+E    + T KLVSLS Q LVDC T +  N+GCNGG M  AF++I    G+ +
Sbjct:   141 AFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNNGIDS 200

Query:   210 EAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVA-KQPVSVAIDAGSSDF 268
             EA YPY+A DG C    ++  A +   +  +P   E+AL +AVA K PVSV IDA  S F
Sbjct:   201 EASYPYKAMDGKCQYDVKNR-AATCSRYIELPFGSEEALKEAVANKGPVSVGIDASHSSF 259

Query:   269 QFYSEGVFTG-ECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISD 327
               Y  GV+    C   +NHGV  VGYG  LDG  YW+V+NSWG  +G++GYIRM R   +
Sbjct:   260 FLYKTGVYYDPSCTQNVNHGVLVVGYGN-LDGKDYWLVKNSWGLHFGDQGYIRMARNSGN 318

Query:   328 KKGLCGIAMEASYP 341
                 CGIA   SYP
Sbjct:   319 H---CGIANYPSYP 329


>UNIPROTKB|F7B939
            symbol:CTSH "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458 CTD:1512
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 EMBL:ACFV01158341
            EMBL:ACFV01158342 EMBL:ACFV01158343 RefSeq:XP_002753411.1
            Ensembl:ENSCJAT00000004397 GeneID:100413104 Uniprot:F7B939
        Length = 336

 Score = 563 (203.2 bits), Expect = 1.6e-54, P = 1.6e-54
 Identities = 127/314 (40%), Positives = 178/314 (56%)

Query:    39 YERWRS-HH-TVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFAS 96
             ++ W + HH T SR  +E H+R   F  N   ++  N  +  +K+ +N+F+DM+  E   
Sbjct:    35 FKSWMAKHHKTYSRE-EEYHQRLQTFASNWRKINAHNNGNHTFKMAVNQFSDMSFAEIKR 93

Query:    97 TYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGS-VTAVKDQGQCGSCWAFST 155
              Y  S+ ++      T+ N  ++ G     PPSVDWRKKG  V+ VK+QG CGSCW FST
Sbjct:    94 KYLWSEPQN---CSATKSN--YLRG-TGPYPPSVDWRKKGHFVSPVKNQGACGSCWTFST 147

Query:   156 IAAVEGINHIMTNKLVSLSEQELVDCDTD-QNQGCNGGLMELAFEFIKKKGGVTTEAKYP 214
               A+E    I T K++SL+EQ+LVDC  D  N GC GGL   AFE+I    G+  E  YP
Sbjct:   148 TGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNNGIMGEDTYP 207

Query:   215 YQANDGTCDVSKESSPAVS-IDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDFQFYS 272
             YQ  D  C    +   A+  +    N+    EDA+++AVA   PVS A +  + DF  Y 
Sbjct:   208 YQGKDSDCKF--QPGKAIGFVKDVANITIYDEDAMVEAVALYNPVSFAFEV-TQDFMMYK 264

Query:   273 EGVFTG-ECGT---ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDK 328
              G+++   C     ++NH V AVGYG   +G  YWIV+NSWGP+WG  GY  ++RG    
Sbjct:   265 RGIYSSTSCHKTPDKVNHAVLAVGYGEE-NGIPYWIVKNSWGPQWGMNGYFLIERG---- 319

Query:   329 KGLCGIAMEASYPI 342
             K +CG+A  ASYP+
Sbjct:   320 KNMCGLAACASYPV 333


>UNIPROTKB|F7BRD4
            symbol:CTSH "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001656 "metanephros development"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0001656
            GeneTree:ENSGT00660000095458 EMBL:ACFV01158341 EMBL:ACFV01158342
            EMBL:ACFV01158343 Ensembl:ENSCJAT00000004396 Uniprot:F7BRD4
        Length = 336

 Score = 563 (203.2 bits), Expect = 1.6e-54, P = 1.6e-54
 Identities = 127/314 (40%), Positives = 178/314 (56%)

Query:    39 YERWRS-HH-TVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFAS 96
             ++ W + HH T SR  +E H+R   F  N   ++  N  +  +K+ +N+F+DM+  E   
Sbjct:    35 FKSWMAKHHKTYSRE-EEYHQRLQTFASNWRKINAHNNGNHTFKMAVNQFSDMSFAEIKR 93

Query:    97 TYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGS-VTAVKDQGQCGSCWAFST 155
              Y  S+ ++      T+ N  ++ G     PPSVDWRKKG  V+ VK+QG CGSCW FST
Sbjct:    94 KYLWSEPQN---CSATKSN--YLRG-TGPYPPSVDWRKKGHFVSPVKNQGACGSCWTFST 147

Query:   156 IAAVEGINHIMTNKLVSLSEQELVDCDTD-QNQGCNGGLMELAFEFIKKKGGVTTEAKYP 214
               A+E    I T K++SL+EQ+LVDC  D  N GC GGL   AFE+I    G+  E  YP
Sbjct:   148 TGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNNGIMGEDTYP 207

Query:   215 YQANDGTCDVSKESSPAVS-IDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDFQFYS 272
             YQ  D  C    +   A+  +    N+    EDA+++AVA   PVS A +  + DF  Y 
Sbjct:   208 YQGKDSDCKF--QPGKAIGFVKDVANITIYDEDAMVEAVALYNPVSFAFEV-TQDFMMYK 264

Query:   273 EGVFTG-ECGT---ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDK 328
              G+++   C     ++NH V AVGYG   +G  YWIV+NSWGP+WG  GY  ++RG    
Sbjct:   265 RGIYSSTSCHKTPDKVNHAVLAVGYGEE-NGIPYWIVKNSWGPQWGMNGYFLIERG---- 319

Query:   329 KGLCGIAMEASYPI 342
             K +CG+A  ASYP+
Sbjct:   320 KNMCGLAACASYPV 333


>UNIPROTKB|Q8HY81
            symbol:CTSS "Cathepsin S" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            CTD:1520 KO:K01368 OrthoDB:EOG4JM7Q2 EMBL:AY156692
            RefSeq:NP_001002938.2 UniGene:Cfa.1661 ProteinModelPortal:Q8HY81
            SMR:Q8HY81 STRING:Q8HY81 MEROPS:C01.034 GeneID:403400
            KEGG:cfa:403400 InParanoid:Q8HY81 NextBio:20816922 Uniprot:Q8HY81
        Length = 331

 Score = 559 (201.8 bits), Expect = 4.3e-54, P = 4.3e-54
 Identities = 130/314 (41%), Positives = 176/314 (56%)

Query:    36 WDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHV--HQT-NKMDK-PYKLKLNKFADMTN 91
             W+L+++  S      + +E  +R  ++++N+  V  H   + M    Y L +N   DMT 
Sbjct:    28 WNLWKKTYSKQYKEEN-EEVARRL-IWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTG 85

Query:    92 HEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCW 151
              E  S     ++      Q  R N T+       +P SVDWR+KG VT VK QG CG+CW
Sbjct:    86 EEVISLMGSLRVPS----QWQR-NVTYRSNSNQKLPDSVDWREKGCVTEVKYQGSCGACW 140

Query:   152 AFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ--NQGCNGGLMELAFEFIKKKGGVTT 209
             AFS + A+E    + T KLVSLS Q LVDC T++  N+GCNGG M  AF++I    G+ +
Sbjct:   141 AFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGIDS 200

Query:   210 EAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVA-KQPVSVAIDAGSSDF 268
             EA YPY+A +G C    +   A +   +  +P   EDAL +AVA K PVSVAIDA    F
Sbjct:   201 EASYPYKAMNGKCRYDSKKR-AATCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYSF 259

Query:   269 QFYSEGVF-TGECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISD 327
               Y  GV+    C   +NHGV  VGYG  L+G  YW+V+NSWG  +G++GYIRM R   +
Sbjct:   260 FLYRSGVYYEPSCTQNVNHGVLVVGYGN-LNGKDYWLVKNSWGLNFGDQGYIRMARNSGN 318

Query:   328 KKGLCGIAMEASYP 341
                 CGIA   SYP
Sbjct:   319 H---CGIASYPSYP 329


>RGD|708447
            symbol:Testin "testin gene" species:10116 "Rattus norvegicus"
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0030054 "cell junction" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 RGD:708447 GO:GO:0005576 GO:GO:0030054 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            MEROPS:C01.972 OMA:RYHAENS OrthoDB:EOG4XWG0N EMBL:U16858
            IPI:IPI00207173 PIR:I52525 PIR:PC1251 RefSeq:NP_775155.1
            UniGene:Rn.10029 ProteinModelPortal:P15242 SMR:P15242
            Ensembl:ENSRNOT00000024467 GeneID:286916 KEGG:rno:286916
            UCSC:RGD:708447 CTD:286916 InParanoid:P15242 NextBio:625036
            Genevestigator:P15242 GermOnline:ENSRNOG00000018028 Uniprot:P15242
        Length = 333

 Score = 559 (201.8 bits), Expect = 4.3e-54, P = 4.3e-54
 Identities = 121/318 (38%), Positives = 183/318 (57%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQN--VMHVHQTNKMD--KPYKLKLNKFADMTNHEF 94
             +  WR+ H  + +++E+  +  V+++N  ++ +H    ++    + + +N F D+TN EF
Sbjct:    29 WNEWRTKHGKTYNMNEERLKRAVWEKNFKMIELHNWEYLEGRHDFTMAMNAFGDLTNIEF 88

Query:    95 ASTYAG---SKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCW 151
                  G    KIK   +FQ  +    F+Y     +P  VDWR+ G VT VK+QG C S W
Sbjct:    89 VKMMTGFQRQKIKKTHIFQDHQ----FLY-----VPKRVDWRQLGYVTPVKNQGHCASSW 139

Query:   152 AFSTIAAVEGINHIMTNKLVSLSEQELVDC-DTDQNQGCNGGLMELAFEFIKKKGGVTTE 210
             AFS   ++EG     T +L+ LSEQ L+DC  ++   GC+GG M+ AF+++K  GG+ TE
Sbjct:   140 AFSATGSLEGQMFRKTERLIPLSEQNLLDCMGSNVTHGCSGGFMQYAFQYVKDNGGLATE 199

Query:   211 AKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQ 269
               YPY+     C    E+S A ++     +P + E+AL+KAVAK  P+SVA+DA    FQ
Sbjct:   200 ESYPYRGQGRECRYHAENS-AANVRDFVQIPGS-EEALMKAVAKVGPISVAVDASHGSFQ 257

Query:   270 FYSEGVF-TGECG-TELNHGVAAVGYG---TTLDGTKYWIVRNSWGPEWGEKGYIRMQRG 324
             FY  G++   +C    LNH V  VGYG      DG  +W+V+NSWG EWG KGY+++ + 
Sbjct:   258 FYGSGIYYEPQCKRVHLNHAVLVVGYGFEGEESDGNSFWLVKNSWGEEWGMKGYMKLAKD 317

Query:   325 ISDKKGLCGIAMEASYPI 342
              S+    CGIA  ++YPI
Sbjct:   318 WSNH---CGIATYSTYPI 332


>UNIPROTKB|F1PAK0
            symbol:CTSS "Cathepsin S" species:9615 "Canis lupus
            familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:AAEX03011051 Ensembl:ENSCAFT00000019176 OMA:YEPACTQ
            Uniprot:F1PAK0
        Length = 339

 Score = 558 (201.5 bits), Expect = 5.5e-54, P = 5.5e-54
 Identities = 130/314 (41%), Positives = 176/314 (56%)

Query:    36 WDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHV--HQT-NKMDK-PYKLKLNKFADMTN 91
             W+L+++  S      + +E  +R  ++++N+  V  H   + M    Y L +N   DMT 
Sbjct:    36 WNLWKKTYSKQYKEEN-EEVARRL-IWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTG 93

Query:    92 HEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCW 151
              E  S     ++      Q  R N T+       +P SVDWR+KG VT VK QG CG+CW
Sbjct:    94 EEVISLMGSLRVPS----QWQR-NVTYRSNSNQKLPDSVDWREKGCVTEVKYQGSCGACW 148

Query:   152 AFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ--NQGCNGGLMELAFEFIKKKGGVTT 209
             AFS + A+E    + T KLVSLS Q LVDC T++  N+GCNGG M  AF++I    G+ +
Sbjct:   149 AFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGIDS 208

Query:   210 EAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVA-KQPVSVAIDAGSSDF 268
             EA YPY+A +G C    +   A +   +  +P   EDAL +AVA K PVSVAIDA    F
Sbjct:   209 EASYPYKAVNGKCRYDSKKR-AATCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYSF 267

Query:   269 QFYSEGVF-TGECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISD 327
               Y  GV+    C   +NHGV  VGYG  L+G  YW+V+NSWG  +G++GYIRM R   +
Sbjct:   268 FLYRSGVYYEPSCTQNVNHGVLVVGYGN-LNGKDYWLVKNSWGLNFGDQGYIRMARNSGN 326

Query:   328 KKGLCGIAMEASYP 341
                 CGIA   SYP
Sbjct:   327 H---CGIASYPSYP 337


>UNIPROTKB|F6X9C1
            symbol:CTSH "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00660000095458
            OMA:STSCHKT Ensembl:ENSCAFT00000036196 EMBL:AAEX03002388
            Uniprot:F6X9C1
        Length = 305

 Score = 557 (201.1 bits), Expect = 7.0e-54, P = 7.0e-54
 Identities = 126/312 (40%), Positives = 177/312 (56%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTY 98
             ++ W   H    S +E  +R   F  N   ++  N  +  +K+ LN+F+DM   E    Y
Sbjct:     5 FKSWAVQHQKKYSSEEYLQRLQTFVGNWRKINAHNAGNHTFKMGLNQFSDMNFAEIKHKY 64

Query:    99 AGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGS-VTAVKDQGQCGSCWAFSTIA 157
               S+ ++      T+GN  ++ G     PP VDWRKKG  V+ VK+QG CGSCW FST  
Sbjct:    65 LWSEPQN---CSATKGN--YLRG-TGPYPPFVDWRKKGKFVSPVKNQGSCGSCWTFSTTG 118

Query:   158 AVEGINHIMTNKLVSLSEQELVDCDTD-QNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQ 216
             A+E    I + KL+SL+EQ+LVDC  +  N GC GG    AFE+I+   G+  E  YPY+
Sbjct:   119 ALESAIAIKSGKLLSLAEQQLVDCAQNFNNHGCQGGAPLQAFEYIRYNKGIMGEDSYPYK 178

Query:   217 ANDGTCDVSKESSPAVS-IDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDFQFYSEG 274
               DG C    + S A++ +    N+  N E A+++AVA   PVS A +  +SDF  Y +G
Sbjct:   179 GQDGDCKY--QPSKAIAFVKDVANITINDEQAMVEAVALYNPVSFAFEV-TSDFMMYRKG 235

Query:   275 VFTG-ECGT---ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKG 330
             +++   C     ++NH V AVGYG   +G  YWIV+NSWGP+WG  GY  M+RG    K 
Sbjct:   236 IYSSTSCHKTPDKVNHAVLAVGYGEQ-NGIPYWIVKNSWGPQWGMNGYFLMERG----KN 290

Query:   331 LCGIAMEASYPI 342
             +CG+A  ASYPI
Sbjct:   291 MCGLAACASYPI 302


>UNIPROTKB|F7BJD8
            symbol:CTSH "Uncharacterized protein" species:9796 "Equus
            caballus" [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=ISS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0004252
            "serine-type endopeptidase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005829 "cytosol" evidence=ISS] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0010815 "bradykinin catabolic
            process" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0016505 "apoptotic protease
            activator activity" evidence=ISS] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=ISS] [GO:0030335
            "positive regulation of cell migration" evidence=ISS] [GO:0031638
            "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0032526 "response to retinoic
            acid" evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            GO:GO:0043066 GO:GO:0005615 GO:GO:0008284 GO:GO:0070371
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458
            OMA:STSCHKT GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129
            Ensembl:ENSECAT00000013967 Uniprot:F7BJD8
        Length = 305

 Score = 557 (201.1 bits), Expect = 7.0e-54, P = 7.0e-54
 Identities = 125/311 (40%), Positives = 174/311 (55%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTY 98
             ++ W   H    S +E H R   F  N   ++  N  +  +++ LN+F+ M   E    Y
Sbjct:     5 FKSWMVQHQKKYSSEEYHHRLQTFVSNWRKINAHNTGNHTFRMGLNQFSAMNFAELKHKY 64

Query:    99 AGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGS-VTAVKDQGQCGSCWAFSTIA 157
               S+ ++      T+GN  ++ G     PPSVDWRKKG+ V+ VK+QG CGSCW FST  
Sbjct:    65 LWSEPQN---CSATKGN--YLRG-AGPYPPSVDWRKKGNFVSPVKNQGGCGSCWTFSTTG 118

Query:   158 AVEGINHIMTNKLVSLSEQELVDCDTD-QNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQ 216
             A+E    I + KL+SL+EQ+LVDC  +  N GC GGL   AFE+I+   G+  E  YPY+
Sbjct:   119 ALESAVAIASGKLLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYK 178

Query:   217 ANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDFQFYSEGV 275
               DG C      + A   D   N+  N E A+++AVA   PVS A +  + DF  Y +G+
Sbjct:   179 GQDGDCKFQPNKAIAFVKDV-ANITLNDEKAMVEAVALYNPVSFAFEV-TEDFMMYRKGI 236

Query:   276 FTG-ECGT---ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGL 331
             ++   C     ++NH V AVGYG   +G  YWIV+NSWGP WG  GY  ++RG    K +
Sbjct:   237 YSSTSCHKTPDKVNHAVLAVGYGEE-NGIPYWIVKNSWGPHWGMNGYFLIERG----KNM 291

Query:   332 CGIAMEASYPI 342
             CG+A  ASYPI
Sbjct:   292 CGLAACASYPI 302


>UNIPROTKB|F1SS93
            symbol:CTSS "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0002250 "adaptive immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0016020 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0002250
            GeneTree:ENSGT00560000076577 GO:GO:0097067 EMBL:CU463875
            Ensembl:ENSSSCT00000007284 OMA:CEIESAV Uniprot:F1SS93
        Length = 342

 Score = 556 (200.8 bits), Expect = 8.9e-54, P = 8.9e-54
 Identities = 131/314 (41%), Positives = 181/314 (57%)

Query:    36 WDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHV--HQT-NKMDK-PYKLKLNKFADMTN 91
             WDL+++  ++    +  +E+  R  ++++N+  V  H   + M    Y L +N   DMT+
Sbjct:    39 WDLWKK--TYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSYDLGMNHLGDMTS 96

Query:    92 HEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCW 151
              E  S  +  ++      Q  R N T+       +P S+DWR+KG VT VK QG CGSCW
Sbjct:    97 EEVISLMSCVRVPS----QWPR-NVTYKSNPNQKLPDSMDWREKGCVTEVKYQGSCGSCW 151

Query:   152 AFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ--NQGCNGGLMELAFEFIKKKGGVTT 209
             AFS + A+E    + T +LVSLS Q LVDC T++  N+GCNGG M  AF++I    G+ +
Sbjct:   152 AFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDS 211

Query:   210 EAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVA-KQPVSVAIDAGSSDF 268
             EA YPY+A DG C    ++  A +   +  +P   E AL +AVA K PVSVAIDA  S F
Sbjct:   212 EASYPYKAVDGKCKYDSKNR-AATCSRYTELPFADEYALKEAVANKGPVSVAIDAKHSSF 270

Query:   269 QFYSEGVFTG-ECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISD 327
              FY  GV+    C   +NHGV  VGYG  L+G  YW+V+NSWG  +G+ GYIRM R   +
Sbjct:   271 FFYRSGVYYDPSCTQNVNHGVLVVGYGN-LNGKDYWLVKNSWGLNFGDGGYIRMAR---N 326

Query:   328 KKGLCGIAMEASYP 341
              +  CGIA   SYP
Sbjct:   327 SENHCGIANYPSYP 340


>DICTYBASE|DDB_G0272815
            symbol:cprE "cysteine proteinase 5" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0272815 GO:GO:0005615
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GenomeReviews:CM000151_GR GO:GO:0005764
            EMBL:AAFI02000008 MEROPS:I29.003 KO:K01376 EMBL:L36205
            RefSeq:XP_644977.1 ProteinModelPortal:P54640 SMR:P54640
            PRIDE:P54640 EnsemblProtists:DDB0185092 GeneID:8618654
            KEGG:ddi:DDB_G0272815 OMA:METAFEF ProtClustDB:CLSZ2430780
            Uniprot:P54640
        Length = 344

 Score = 553 (199.7 bits), Expect = 1.9e-53, P = 1.9e-53
 Identities = 118/271 (43%), Positives = 161/271 (59%)

Query:    27 KELESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKF 86
             K+  SE    + +  W   H  S + +E   R+N+FK N+ +V Q N       L LN F
Sbjct:    18 KQQFSELQYRNAFTDWMITHQKSYTSEEFGARYNIFKANMDYVQQWNSKGSETVLGLNNF 77

Query:    87 ADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQ 146
             AD+TN E+ +TY G+K     +  GT+    F     TS   S DWR +G+VT VK+QGQ
Sbjct:    78 ADITNEEYRNTYLGTKFDASSLI-GTQEEKVF----TTSSAASKDWRSEGAVTPVKNQGQ 132

Query:   147 CGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGG 206
             CG CW+FST  + EG +     +LVSLSEQ L+DC T+ N GC+GGLM  AFE+I    G
Sbjct:   133 CGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCSTE-NSGCDGGLMTYAFEYIINNNG 191

Query:   207 VTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSS 266
             + TE+ YPY+A +G C+   E+S A ++  ++ V A  E +L  AV   PVSVAIDA   
Sbjct:   192 IDTESSYPYKAENGKCEYKSENSGA-TLSSYKTVTAGSESSLESAVNVNPVSVAIDASHQ 250

Query:   267 DFQFYSEGVF-TGECGTE-LNHGVAAVGYGT 295
              FQ Y+ G++   EC +E L+HGV AVGYG+
Sbjct:   251 SFQLYTSGIYYEPECSSENLDHGVLAVGYGS 281


>MGI|MGI:1922258
            symbol:4930486L24Rik "RIKEN cDNA 4930486L24 gene"
            species:10090 "Mus musculus" [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030054 "cell
            junction" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 MGI:MGI:1922258
            GO:GO:0005576 GO:GO:0030054 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 HSSP:P07711
            EMBL:AY146988 EMBL:AK145933 EMBL:BC061218 IPI:IPI00280732
            RefSeq:NP_835199.1 UniGene:Mm.19839 ProteinModelPortal:Q80UB0
            SMR:Q80UB0 MEROPS:C01.972 PRIDE:Q80UB0 Ensembl:ENSMUST00000091569
            GeneID:214639 KEGG:mmu:214639 UCSC:uc007qvs.1 InParanoid:Q80UB0
            OMA:RYHAENS OrthoDB:EOG4XWG0N NextBio:374408 Bgee:Q80UB0
            CleanEx:MM_4930486L24RIK Genevestigator:Q80UB0 Uniprot:Q80UB0
        Length = 333

 Score = 552 (199.4 bits), Expect = 2.4e-53, P = 2.4e-53
 Identities = 125/318 (39%), Positives = 178/318 (55%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQN--VMHVHQTNKMD--KPYKLKLNKFADMTNHEF 94
             +  WR+ H  + +++E+  R  V+++N  ++ +H    ++    + + +N F D+TN EF
Sbjct:    29 WNEWRTKHGKAYNVNEERLRRAVWEKNFKMIELHNWEYLEGKHDFTMTMNAFGDLTNTEF 88

Query:    95 ASTYAG---SKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCW 151
                  G    KIK   +FQ  +    F+Y     +P  VDWR  G VT VK+QG C S W
Sbjct:    89 VKMMTGFRRQKIKRMHVFQDHQ----FLY-----VPKYVDWRMLGYVTPVKNQGYCASSW 139

Query:   152 AFSTIAAVEGINHIMTNKLVSLSEQELVDC-DTDQNQGCNGGLMELAFEFIKKKGGVTTE 210
             AFS   ++EG     T +LV LSEQ L+DC  ++    C+GG M+ AF+++K  GG+ TE
Sbjct:   140 AFSATGSLEGQMFKKTGRLVPLSEQNLLDCMGSNVTHDCSGGFMQNAFQYVKDNGGLATE 199

Query:   211 AKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQ 269
               YPY      C    E+S A ++     +P   E+AL+KAVAK  P+SVA+DA    FQ
Sbjct:   200 ESYPYIGPGRKCRYHAENS-AANVRDFVQIPGR-EEALMKAVAKVGPISVAVDASHDSFQ 257

Query:   270 FYSEGVF-TGECG-TELNHGVAAVGYG---TTLDGTKYWIVRNSWGPEWGEKGYIRMQRG 324
             FY  G++   +C    LNH V  VGYG      DG  YW+V+NSWG EWG KGYI++ + 
Sbjct:   258 FYDSGIYYEPQCKRVHLNHAVLVVGYGFEGEESDGNSYWLVKNSWGEEWGMKGYIKIAK- 316

Query:   325 ISDKKGLCGIAMEASYPI 342
               D    CGIA  A+YPI
Sbjct:   317 --DWNNHCGIATLATYPI 332


>UNIPROTKB|P25774
            symbol:CTSS "Cathepsin S" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0016020 "membrane"
            evidence=IEA] [GO:0005576 "extracellular region" evidence=NAS]
            [GO:0005764 "lysosome" evidence=IDA;NAS] [GO:0097067 "cellular
            response to thyroid hormone stimulus" evidence=IEP] [GO:0002250
            "adaptive immune response" evidence=IEP] [GO:0019882 "antigen
            processing and presentation" evidence=TAS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=TAS] [GO:0006955
            "immune response" evidence=TAS] [GO:0002474 "antigen processing and
            presentation of peptide antigen via MHC class I" evidence=TAS]
            [GO:0002480 "antigen processing and presentation of exogenous
            peptide antigen via MHC class I, TAP-independent" evidence=TAS]
            [GO:0019886 "antigen processing and presentation of exogenous
            peptide antigen via MHC class II" evidence=TAS] [GO:0036021
            "endolysosome lumen" evidence=TAS] [GO:0042590 "antigen processing
            and presentation of exogenous peptide antigen via MHC class I"
            evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
            [GO:0045087 "innate immune response" evidence=TAS] [GO:0043231
            "intracellular membrane-bounded organelle" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 Reactome:REACT_118779
            Reactome:REACT_6900 GO:GO:0005576 GO:GO:0002480 GO:GO:0016020
            GO:GO:0019886 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087 EMBL:CH471121
            GO:GO:0004197 GO:GO:0002250 HOVERGEN:HBG011513 GO:GO:0097067
            GO:GO:0036021 EMBL:AL356292 CTD:1520 KO:K01368 OMA:KAMDQKC
            OrthoDB:EOG4JM7Q2 EMBL:S93414 EMBL:M86553 EMBL:M90696 EMBL:U07374
            EMBL:U07370 EMBL:U07371 EMBL:U07372 EMBL:U07373 EMBL:CR541676
            EMBL:AK301472 EMBL:AK314482 EMBL:BC002642 IPI:IPI00299150
            IPI:IPI00910216 PIR:A42482 RefSeq:NP_001186668.1 RefSeq:NP_004070.3
            UniGene:Hs.181301 PDB:1BXF PDB:1GLO PDB:1MS6 PDB:1NPZ PDB:1NQC
            PDB:2C0Y PDB:2F1G PDB:2FQ9 PDB:2FRA PDB:2FRQ PDB:2FT2 PDB:2FUD
            PDB:2FYE PDB:2G6D PDB:2G7Y PDB:2H7J PDB:2HH5 PDB:2HHN PDB:2HXZ
            PDB:2OP3 PDB:2R9M PDB:2R9N PDB:2R9O PDB:3IEJ PDB:3KWN PDB:3MPE
            PDB:3MPF PDB:3N3G PDB:3N4C PDB:3OVX PDBsum:1BXF PDBsum:1GLO
            PDBsum:1MS6 PDBsum:1NPZ PDBsum:1NQC PDBsum:2C0Y PDBsum:2F1G
            PDBsum:2FQ9 PDBsum:2FRA PDBsum:2FRQ PDBsum:2FT2 PDBsum:2FUD
            PDBsum:2FYE PDBsum:2G6D PDBsum:2G7Y PDBsum:2H7J PDBsum:2HH5
            PDBsum:2HHN PDBsum:2HXZ PDBsum:2OP3 PDBsum:2R9M PDBsum:2R9N
            PDBsum:2R9O PDBsum:3IEJ PDBsum:3KWN PDBsum:3MPE PDBsum:3MPF
            PDBsum:3N3G PDBsum:3N4C PDBsum:3OVX ProteinModelPortal:P25774
            SMR:P25774 IntAct:P25774 STRING:P25774 MEROPS:I29.004
            PhosphoSite:P25774 DMDM:88984046 PaxDb:P25774 PeptideAtlas:P25774
            PRIDE:P25774 DNASU:1520 Ensembl:ENST00000368985
            Ensembl:ENST00000448301 GeneID:1520 KEGG:hsa:1520 UCSC:uc001evn.3
            GeneCards:GC01M150702 HGNC:HGNC:2545 HPA:CAB000460 HPA:HPA002988
            MIM:116845 neXtProt:NX_P25774 PharmGKB:PA27041 InParanoid:P25774
            PhylomeDB:P25774 BRENDA:3.4.22.27 BindingDB:P25774
            ChEMBL:CHEMBL2954 ChiTaRS:CTSS EvolutionaryTrace:P25774
            GenomeRNAi:1520 NextBio:6291 PMAP-CutDB:P25774 ArrayExpress:P25774
            Bgee:P25774 CleanEx:HS_CTSS Genevestigator:P25774
            GermOnline:ENSG00000163131 Uniprot:P25774
        Length = 331

 Score = 551 (199.0 bits), Expect = 3.0e-53, P = 3.0e-53
 Identities = 127/303 (41%), Positives = 170/303 (56%)

Query:    43 RSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTYAGSK 102
             ++   V R + EK+ +F V   N+ H    +     Y L +N   DMT+ E  S  +  +
Sbjct:    42 KNEEAVRRLIWEKNLKF-VMLHNLEHSMGMHS----YDLGMNHLGDMTSEEVMSLMSSLR 96

Query:   103 IKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGI 162
             +      Q  R N T+       +P SVDWR+KG VT VK QG CG+CWAFS + A+E  
Sbjct:    97 VPS----QWQR-NITYKSNPNRILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQ 151

Query:   163 NHIMTNKLVSLSEQELVDCDTDQ--NQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDG 220
               + T KLVSLS Q LVDC T++  N+GCNGG M  AF++I    G+ ++A YPY+A D 
Sbjct:   152 LKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQ 211

Query:   221 TCDVSKESSPAVSIDGHENVPANHEDALLKAVA-KQPVSVAIDAGSSDFQFYSEGVF-TG 278
              C    +   A +   +  +P   ED L +AVA K PVSV +DA    F  Y  GV+   
Sbjct:   212 KCQYDSKYR-AATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEP 270

Query:   279 ECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEA 338
              C   +NHGV  VGYG  L+G +YW+V+NSWG  +GE+GYIRM R   +K   CGIA   
Sbjct:   271 SCTQNVNHGVLVVGYGD-LNGKEYWLVKNSWGHNFGEEGYIRMAR---NKGNHCGIASFP 326

Query:   339 SYP 341
             SYP
Sbjct:   327 SYP 329


>RGD|621513
            symbol:Ctss "cathepsin S" species:10116 "Rattus norvegicus"
            [GO:0001656 "metanephros development" evidence=IEP] [GO:0002250
            "adaptive immune response" evidence=ISO] [GO:0005764 "lysosome"
            evidence=IEA;ISO] [GO:0006508 "proteolysis" evidence=IEA;ISO]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0009986 "cell
            surface" evidence=IDA] [GO:0016020 "membrane" evidence=ISO]
            [GO:0043231 "intracellular membrane-bounded organelle"
            evidence=ISO] [GO:0045453 "bone resorption" evidence=IMP]
            [GO:0051930 "regulation of sensory perception of pain"
            evidence=IMP] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            RGD:621513 GO:GO:0009986 GO:GO:0051930 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0045453
            GO:GO:0001656 HOVERGEN:HBG011513 CTD:1520 KO:K01368 MEROPS:I29.004
            BRENDA:3.4.22.27 EMBL:L03201 IPI:IPI00210228 PIR:A45087
            RefSeq:NP_059016.1 UniGene:Rn.11347 ProteinModelPortal:Q02765
            PhosphoSite:Q02765 PRIDE:Q02765 GeneID:50654 KEGG:rno:50654
            UCSC:RGD:621513 ChEMBL:CHEMBL1075217 NextBio:610462
            Genevestigator:Q02765 Uniprot:Q02765
        Length = 330

 Score = 549 (198.3 bits), Expect = 4.9e-53, P = 4.9e-53
 Identities = 132/316 (41%), Positives = 179/316 (56%)

Query:    36 WDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHV--HQT-NKMDK-PYKLKLNKFADMTN 91
             WDL+++ R      ++ +E  +R  ++++N+  +  H   + M    Y + +N   DMT 
Sbjct:    26 WDLWKKTRMRRNTDQN-EEDVRRL-IWEKNLKFIMLHNLEHSMGMHSYSVGMNHMGDMTP 83

Query:    92 HEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCW 151
              E    Y GS     R+ +    +GT       ++P SVDWR+KG VT VK QG CGSCW
Sbjct:    84 EEVIG-YMGSL----RIPRPWNRSGTLKSSSNQTLPDSVDWREKGCVTNVKYQGSCGSCW 138

Query:   152 AFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ---NQGCNGGLMELAFEFIKKKGGVT 208
             AFS   A+EG   + T KLVSLS Q LVDC T++   N+GC GG M  AF++I     + 
Sbjct:   139 AFSAEGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEAFQYIIDTS-ID 197

Query:   209 TEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVA-KQPVSVAID-AGSS 266
             +EA YPY+A D  C +    + A +   +  +P   E+AL +AVA K PVSV ID A  S
Sbjct:   198 SEASYPYKAMDEKC-LYDPKNRAATCSRYIELPFGDEEALKEAVATKGPVSVGIDDASHS 256

Query:   267 DFQFYSEGVFTG-ECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGI 325
              F  Y  GV+    C   +NHGV  VGYGT LDG  YW+V+NSWG  +G++GYIRM R  
Sbjct:   257 SFFLYQSGVYDDPSCTENMNHGVLVVGYGT-LDGKDYWLVKNSWGLHFGDQGYIRMAR-- 313

Query:   326 SDKKGLCGIAMEASYP 341
              + K  CGIA   SYP
Sbjct:   314 -NNKNHCGIASYCSYP 328


>UNIPROTKB|P83443
            symbol:P83443 "Macrodontain-1" species:203992 "Pseudananas
            sagenarius" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0006508 "proteolysis" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197
            ProteinModelPortal:P83443 SMR:P83443 MEROPS:C01.028 Uniprot:P83443
        Length = 213

 Score = 548 (198.0 bits), Expect = 6.3e-53, P = 6.3e-53
 Identities = 101/217 (46%), Positives = 137/217 (63%)

Query:   125 SIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTD 184
             ++P S+DWR  G+V  VK+QG CG CWAF+ IA VEGI  I    LV LSEQE++DC   
Sbjct:     1 AVPQSIDWRDYGAVNEVKNQGPCGGCWAFAAIATVEGIYKIRKGNLVYLSEQEVLDCAV- 59

Query:   185 QNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANH 244
              + GC GG +  A++FI    GVTT+  YPY+A  GTC+ +   + A  I G+  V  N 
Sbjct:    60 -SYGCKGGWVNRAYDFIISNNGVTTDENYPYRAYQGTCNANYFPNSAY-ITGYSYVRRND 117

Query:   245 EDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTGECGTELNHGVAAVGYGTTLDGTKYWI 304
             E  ++ AV+ QP++  IDA   +FQ+Y  GV++G CG  LNH +  +GYG       YWI
Sbjct:   118 ESHMMYAVSNQPIAALIDASGDNFQYYKGGVYSGPCGFSLNHAITIIGYGRD----SYWI 173

Query:   305 VRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYP 341
             VRNSWG  WG+ GY+R++R +S   G+CGIAM   +P
Sbjct:   174 VRNSWGSSWGQGGYVRIRRDVSHSGGVCGIAMSPLFP 210


>GENEDB_PFALCIPARUM|PF11_0161
            symbol:PF11_0161 "falcipain-2 precursor,
            putative" species:5833 "Plasmodium falciparum" [GO:0020020 "food
            vacuole" evidence=TAS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014186 HSSP:O65039 OMA:NNVEHIN GO:GO:0020020
            MEROPS:C01.046 HOGENOM:HOG000065857 ProtClustDB:PTZ00021
            RefSeq:XP_001347832.1 ProteinModelPortal:Q8I6U5 SMR:Q8I6U5
            IntAct:Q8I6U5 MINT:MINT-1546851 EnsemblProtists:PF11_0161:mRNA
            GeneID:810708 KEGG:pfa:PF11_0161 EuPathDB:PlasmoDB:PF3D7_1115300
            Uniprot:Q8I6U5
        Length = 482

 Score = 543 (196.2 bits), Expect = 2.1e-52, P = 2.1e-52
 Identities = 135/343 (39%), Positives = 191/343 (55%)

Query:    22 FDFHEKELESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHV--HQTNKMDKPY 79
             FD H+  + + E +   Y   ++++    S +E  +RF VF QN   V  H  NK    Y
Sbjct:   148 FD-HKFLMNNVEHINQFYTFIKTNNKQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSL-Y 205

Query:    80 KLKLNKFADMTNHEFASTYA---GSK-IKHHRMFQGTRGNGTFM--Y-GKVTSIPPSVDW 132
             K +LN+FAD+T HEF S Y     SK +K+ +           +  Y G       + DW
Sbjct:   206 KKELNRFADLTYHEFKSKYLTLRSSKPLKNSKYLLDQINYDAVIKKYKGNENFDHAAYDW 265

Query:   133 RKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGG 192
             R    VT VKDQ  CGSCWAFS+I +VE    I  NKL++LSEQELVDC   +N GCNGG
Sbjct:   266 RLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSF-KNYGCNGG 324

Query:   193 LMELAFEFIKKKGGVTTEAKYPYQAN-DGTCDVSKESSPAVSIDGHENVPANHEDALLKA 251
             L+  AFE + + GG+ T+  YPY ++    C++ +  +    I  + +VP N     L+ 
Sbjct:   325 LINNAFEDMIELGGICTDDDYPYVSDAPNLCNIDR-CTEKYGIKNYLSVPDNKLKEALRF 383

Query:   252 VAKQPVSVAIDAGSSDFQFYSEGVFTGECGTELNHGVAAVGYGT-------TLDGTK--Y 302
             +   P+S++I A S DF FY EG+F GECG ELNH V  VG+G        T  G K  Y
Sbjct:   384 LG--PISISI-AVSDDFPFYKEGIFDGECGDELNHAVMLVGFGMKEIVNPLTKKGEKHYY 440

Query:   303 WIVRNSWGPEWGEKGYIRMQRGISDKKGL---CGIAMEASYPI 342
             +I++NSWG +WGE+G+I ++   +D+ GL   CG+  +A  P+
Sbjct:   441 YIIKNSWGQQWGERGFINIE---TDESGLMRKCGLGTDAFIPL 480


>UNIPROTKB|Q8I6U5
            symbol:PF11_0161 "Falcipain-2B" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            EMBL:AE014186 HSSP:O65039 OMA:NNVEHIN GO:GO:0020020 MEROPS:C01.046
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347832.1
            ProteinModelPortal:Q8I6U5 SMR:Q8I6U5 IntAct:Q8I6U5
            MINT:MINT-1546851 EnsemblProtists:PF11_0161:mRNA GeneID:810708
            KEGG:pfa:PF11_0161 EuPathDB:PlasmoDB:PF3D7_1115300 Uniprot:Q8I6U5
        Length = 482

 Score = 543 (196.2 bits), Expect = 2.1e-52, P = 2.1e-52
 Identities = 135/343 (39%), Positives = 191/343 (55%)

Query:    22 FDFHEKELESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHV--HQTNKMDKPY 79
             FD H+  + + E +   Y   ++++    S +E  +RF VF QN   V  H  NK    Y
Sbjct:   148 FD-HKFLMNNVEHINQFYTFIKTNNKQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSL-Y 205

Query:    80 KLKLNKFADMTNHEFASTYA---GSK-IKHHRMFQGTRGNGTFM--Y-GKVTSIPPSVDW 132
             K +LN+FAD+T HEF S Y     SK +K+ +           +  Y G       + DW
Sbjct:   206 KKELNRFADLTYHEFKSKYLTLRSSKPLKNSKYLLDQINYDAVIKKYKGNENFDHAAYDW 265

Query:   133 RKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGG 192
             R    VT VKDQ  CGSCWAFS+I +VE    I  NKL++LSEQELVDC   +N GCNGG
Sbjct:   266 RLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSF-KNYGCNGG 324

Query:   193 LMELAFEFIKKKGGVTTEAKYPYQAN-DGTCDVSKESSPAVSIDGHENVPANHEDALLKA 251
             L+  AFE + + GG+ T+  YPY ++    C++ +  +    I  + +VP N     L+ 
Sbjct:   325 LINNAFEDMIELGGICTDDDYPYVSDAPNLCNIDR-CTEKYGIKNYLSVPDNKLKEALRF 383

Query:   252 VAKQPVSVAIDAGSSDFQFYSEGVFTGECGTELNHGVAAVGYGT-------TLDGTK--Y 302
             +   P+S++I A S DF FY EG+F GECG ELNH V  VG+G        T  G K  Y
Sbjct:   384 LG--PISISI-AVSDDFPFYKEGIFDGECGDELNHAVMLVGFGMKEIVNPLTKKGEKHYY 440

Query:   303 WIVRNSWGPEWGEKGYIRMQRGISDKKGL---CGIAMEASYPI 342
             +I++NSWG +WGE+G+I ++   +D+ GL   CG+  +A  P+
Sbjct:   441 YIIKNSWGQQWGERGFINIE---TDESGLMRKCGLGTDAFIPL 480


>UNIPROTKB|F1NZ37
            symbol:LOC420160 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 EMBL:AADN02062018
            IPI:IPI00587784 Ensembl:ENSGALT00000006765 OMA:CGVANQA
            Uniprot:F1NZ37
        Length = 340

 Score = 540 (195.1 bits), Expect = 4.4e-52, P = 4.4e-52
 Identities = 121/319 (37%), Positives = 173/319 (54%)

Query:    35 LWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKP----YKLKLNKFADMT 90
             L + +ERW+S +      + +  R  V++ N+  + Q N  +      ++L +N + D+ 
Sbjct:    30 LEEAWERWKSLYAKEYPGEAELIRREVWENNLRRIEQHNWEESQGQHTFRLGMNHYGDLM 89

Query:    91 NHEFASTYAG-SKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGS 149
             + EF     G + ++H           TF        P  VDWR +G VT VK+QG CGS
Sbjct:    90 DEEFNQLLNGFAPVQHEEPAL------TFQASAAQKTPAEVDWRMRGYVTPVKNQGHCGS 143

Query:   150 CWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVT 208
             CWAFS   A+EG+    T KL  LSEQ L+DC     N GC GG M  AF+++   GG+ 
Sbjct:   144 CWAFSATGALEGLVFNWTGKLAVLSEQNLIDCSWKLGNNGCQGGYMTRAFQYVHDNGGMN 203

Query:   209 TEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSD 267
             +E  YPYQA D +      +  A +      V    E AL +AVA   PVSVA+DA S  
Sbjct:   204 SEHIYPYQATDTSSCRYNPADRAANCSTVWLVAQGSEAALEQAVATVGPVSVAVDASSFF 263

Query:   268 FQFYSEGVFTGE-CGTELNHGVAAVGYGTTLDGTK---YWIVRNSWGPEWGEKGYIRMQR 323
             F FY  G+F    C  ++NHG+ AVGYG + +  K   YWI++NSW   WGEKGYIR+ +
Sbjct:   264 FHFYKSGIFNSMFCSQKVNHGMLAVGYGISQEARKNVSYWILKNSWSEVWGEKGYIRLLK 323

Query:   324 GISDKKGLCGIAMEASYPI 342
             G+++    CG+A +AS+P+
Sbjct:   324 GVNNH---CGVANQASFPL 339


>MGI|MGI:107285
            symbol:Ctsh "cathepsin H" species:10090 "Mus musculus"
            [GO:0001520 "outer dense fiber" evidence=ISO] [GO:0001669
            "acrosomal vesicle" evidence=ISO] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=IGI] [GO:0002764 "immune response-regulating
            signaling pathway" evidence=ISO] [GO:0004175 "endopeptidase
            activity" evidence=ISO;IMP] [GO:0004177 "aminopeptidase activity"
            evidence=ISO] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISO;IDA] [GO:0004252 "serine-type endopeptidase activity"
            evidence=IMP] [GO:0005615 "extracellular space" evidence=ISO]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0005829 "cytosol"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=ISO;IMP]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0008284
            "positive regulation of cell proliferation" evidence=IMP]
            [GO:0010628 "positive regulation of gene expression" evidence=ISO]
            [GO:0010634 "positive regulation of epithelial cell migration"
            evidence=IMP] [GO:0010813 "neuropeptide catabolic process"
            evidence=ISO] [GO:0010815 "bradykinin catabolic process"
            evidence=ISO] [GO:0010952 "positive regulation of peptidase
            activity" evidence=IGI;ISO] [GO:0016505 "apoptotic protease
            activator activity" evidence=IGI;ISO] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0030108 "HLA-A specific activating MHC
            class I receptor activity" evidence=ISO] [GO:0030335 "positive
            regulation of cell migration" evidence=ISO] [GO:0030984 "kininogen
            binding" evidence=ISO] [GO:0031638 "zymogen activation"
            evidence=ISO;IMP] [GO:0031648 "protein destabilization"
            evidence=ISO;IMP] [GO:0032403 "protein complex binding"
            evidence=ISO] [GO:0032526 "response to retinoic acid" evidence=IDA]
            [GO:0033619 "membrane protein proteolysis" evidence=ISO;IMP]
            [GO:0035085 "cilium axoneme" evidence=ISO] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IMP] [GO:0043129
            "surfactant homeostasis" evidence=ISO] [GO:0043621 "protein
            self-association" evidence=ISO] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IMP] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=IMP]
            [GO:0070324 "thyroid hormone binding" evidence=ISO] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISO] [GO:0097208 "alveolar
            lamellar body" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:107285 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 EMBL:CH466560 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            HOVERGEN:HBG011513 GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT OrthoDB:EOG4W9J43
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 BRENDA:3.4.22.16
            EMBL:U06119 EMBL:AK149949 EMBL:AK150583 EMBL:AK157376 EMBL:AK160026
            EMBL:Y18464 IPI:IPI00118987 RefSeq:NP_031827.2 UniGene:Mm.2277
            ProteinModelPortal:P49935 SMR:P49935 STRING:P49935 MEROPS:I29.003
            PhosphoSite:P49935 PaxDb:P49935 PRIDE:P49935
            Ensembl:ENSMUST00000034915 GeneID:13036 KEGG:mmu:13036
            InParanoid:Q3UCD6 ChEMBL:CHEMBL1949491 NextBio:282920 Bgee:P49935
            CleanEx:MM_CTSH Genevestigator:P49935 GermOnline:ENSMUSG00000032359
            Uniprot:P49935
        Length = 333

 Score = 536 (193.7 bits), Expect = 1.2e-51, P = 1.2e-51
 Identities = 120/311 (38%), Positives = 175/311 (56%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTY 98
             ++ W   H  + S  E + R  +F  N   +   N+ +  +K+ LN+F+DM+  E    +
Sbjct:    33 FKSWMKQHQKTYSSVEYNHRLQMFANNWRKIQAHNQRNHTFKMALNQFSDMSFAEIKHKF 92

Query:    99 AGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTA-VKDQGQCGSCWAFSTIA 157
               S+ ++      T+ N  ++ G     P S+DWRKKG+V + VK+QG CGSCW FST  
Sbjct:    93 LWSEPQN---CSATKSN--YLRG-TGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTG 146

Query:   158 AVEGINHIMTNKLVSLSEQELVDC-DTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQ 216
             A+E    I + K++SL+EQ+LVDC     N GC GGL   AFE+I    G+  E  YPY 
Sbjct:   147 ALESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYILYNKGIMEEDSYPYI 206

Query:   217 ANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDFQFYSEGV 275
               D +C  + + + A  +    N+  N E A+++AVA   PVS A +  + DF  Y  GV
Sbjct:   207 GKDSSCRFNPQKAVAF-VKNVVNITLNDEAAMVEAVALYNPVSFAFEV-TEDFLMYKSGV 264

Query:   276 FTGE-CGT---ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGL 331
             ++ + C     ++NH V AVGYG   +G  YWIV+NSWG +WGE GY  ++RG    K +
Sbjct:   265 YSSKSCHKTPDKVNHAVLAVGYGEQ-NGLLYWIVKNSWGSQWGENGYFLIERG----KNM 319

Query:   332 CGIAMEASYPI 342
             CG+A  ASYPI
Sbjct:   320 CGLAACASYPI 330


>UNIPROTKB|G1SQF0
            symbol:CTSH "Uncharacterized protein" species:9986
            "Oryctolagus cuniculus" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 RefSeq:XP_002721635.1 UniGene:Ocu.7137
            Ensembl:ENSOCUT00000006138 GeneID:100101597 Uniprot:G1SQF0
        Length = 333

 Score = 535 (193.4 bits), Expect = 1.5e-51, P = 1.5e-51
 Identities = 121/311 (38%), Positives = 176/311 (56%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTY 98
             ++ W S H    S +E  +R   F +N   ++  N  +  +++ LN+F+DM+  E    Y
Sbjct:    33 FKSWMSQHHKKYSAEEYPRRLQTFVRNWRKINAHNNGNHTFQMGLNQFSDMSFAEIKHKY 92

Query:    99 AGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGS-VTAVKDQGQCGSCWAFSTIA 157
               ++ ++      T+ N  ++ G     P SVDWRKKG+ V+ VK+QG CGSCW FST  
Sbjct:    93 LWTEPQN---CSATKSN--YLRG-TGPYPSSVDWRKKGNFVSPVKNQGACGSCWTFSTTG 146

Query:   158 AVEGINHIMTNKLVSLSEQELVDCDTD-QNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQ 216
             A+E    I   K++SL+EQ+LVDC  +  N GC GGL   AFE+I    G+  E  YPY+
Sbjct:   147 ALESAVAIAGGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQAFEYILYNKGIMGEDSYPYR 206

Query:   217 ANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDFQFYSEGV 275
             A +G C    + + A   D   N+  N E+A+++AVA   PVS A +  + DF  Y +G+
Sbjct:   207 AMEGRCKFQPQKAIAFVKDV-ANITLNDEEAMVEAVALYNPVSFAFEV-TEDFMQYRKGI 264

Query:   276 FTG-ECGT---ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGL 331
             ++   C     ++NH V AVGYG   +G  YWIV+NSWG  WG  GY  ++RG    K +
Sbjct:   265 YSSTSCHKTPDKVNHAVLAVGYGEE-NGVPYWIVKNSWGSHWGMNGYFYIERG----KNM 319

Query:   332 CGIAMEASYPI 342
             CG+A  ASYPI
Sbjct:   320 CGLAACASYPI 330


>GENEDB_PFALCIPARUM|PF11_0165
            symbol:PF11_0165 "falcipain 2 precursor"
            species:5833 "Plasmodium falciparum" [GO:0020020 "food vacuole"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014186 HSSP:O65039 GO:GO:0020020
            RefSeq:XP_001347836.1 ProteinModelPortal:Q8I6U4 SMR:Q8I6U4
            IntAct:Q8I6U4 MINT:MINT-1559493 MEROPS:C01.046
            EnsemblProtists:PF11_0165:mRNA GeneID:810712 KEGG:pfa:PF11_0165
            EuPathDB:PlasmoDB:PF3D7_1115700 HOGENOM:HOG000065857 OMA:NESLHAN
            ProtClustDB:PTZ00021 BindingDB:Q8I6U4 ChEMBL:CHEMBL3470
            Uniprot:Q8I6U4
        Length = 484

 Score = 533 (192.7 bits), Expect = 2.4e-51, P = 2.4e-51
 Identities = 128/336 (38%), Positives = 190/336 (56%)

Query:    29 LESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNV--MHVHQTNKMDKPYKLKLNKF 86
             + + E +   Y   ++++    S +E  +RF VF QN   +++H  NK +  YK +LN+F
Sbjct:   156 MNNAEHINQFYMFIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNK-NSLYKKELNRF 214

Query:    87 ADMTNHEFASTYAG---SK-IKHHRMFQGTRGNGTFM--Y-GKVTSIPPSVDWRKKGSVT 139
             AD+T HEF + Y     SK +K+ +           +  Y G       + DWR    VT
Sbjct:   215 ADLTYHEFKNKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYKGNENFDHAAYDWRLHSGVT 274

Query:   140 AVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFE 199
              VKDQ  CGSCWAFS+I +VE    I  NKL++LSEQELVDC   +N GCNGGL+  AFE
Sbjct:   275 PVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSF-KNYGCNGGLINNAFE 333

Query:   200 FIKKKGGVTTEAKYPYQAN-DGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVS 258
              + + GG+ T+  YPY ++    C++ +  +    I  + +VP N     L+ +   P+S
Sbjct:   334 DMIELGGICTDDDYPYVSDAPNLCNIDR-CTEKYGIKNYLSVPDNKLKEALRFLG--PIS 390

Query:   259 VAIDAGSSDFQFYSEGVFTGECGTELNHGVAAVGYGT-------TLDGTK--YWIVRNSW 309
             +++ A S DF FY EG+F GECG +LNH V  VG+G        T  G K  Y+I++NSW
Sbjct:   391 ISV-AVSDDFAFYKEGIFDGECGDQLNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSW 449

Query:   310 GPEWGEKGYIRMQRGISDKKGL---CGIAMEASYPI 342
             G +WGE+G+I ++   +D+ GL   CG+  +A  P+
Sbjct:   450 GQQWGERGFINIE---TDESGLMRKCGLGTDAFIPL 482


>UNIPROTKB|Q8I6U4
            symbol:PF11_0165 "Falcipain-2A" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            EMBL:AE014186 HSSP:O65039 GO:GO:0020020 RefSeq:XP_001347836.1
            ProteinModelPortal:Q8I6U4 SMR:Q8I6U4 IntAct:Q8I6U4
            MINT:MINT-1559493 MEROPS:C01.046 EnsemblProtists:PF11_0165:mRNA
            GeneID:810712 KEGG:pfa:PF11_0165 EuPathDB:PlasmoDB:PF3D7_1115700
            HOGENOM:HOG000065857 OMA:NESLHAN ProtClustDB:PTZ00021
            BindingDB:Q8I6U4 ChEMBL:CHEMBL3470 Uniprot:Q8I6U4
        Length = 484

 Score = 533 (192.7 bits), Expect = 2.4e-51, P = 2.4e-51
 Identities = 128/336 (38%), Positives = 190/336 (56%)

Query:    29 LESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNV--MHVHQTNKMDKPYKLKLNKF 86
             + + E +   Y   ++++    S +E  +RF VF QN   +++H  NK +  YK +LN+F
Sbjct:   156 MNNAEHINQFYMFIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNK-NSLYKKELNRF 214

Query:    87 ADMTNHEFASTYAG---SK-IKHHRMFQGTRGNGTFM--Y-GKVTSIPPSVDWRKKGSVT 139
             AD+T HEF + Y     SK +K+ +           +  Y G       + DWR    VT
Sbjct:   215 ADLTYHEFKNKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYKGNENFDHAAYDWRLHSGVT 274

Query:   140 AVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFE 199
              VKDQ  CGSCWAFS+I +VE    I  NKL++LSEQELVDC   +N GCNGGL+  AFE
Sbjct:   275 PVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSF-KNYGCNGGLINNAFE 333

Query:   200 FIKKKGGVTTEAKYPYQAN-DGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVS 258
              + + GG+ T+  YPY ++    C++ +  +    I  + +VP N     L+ +   P+S
Sbjct:   334 DMIELGGICTDDDYPYVSDAPNLCNIDR-CTEKYGIKNYLSVPDNKLKEALRFLG--PIS 390

Query:   259 VAIDAGSSDFQFYSEGVFTGECGTELNHGVAAVGYGT-------TLDGTK--YWIVRNSW 309
             +++ A S DF FY EG+F GECG +LNH V  VG+G        T  G K  Y+I++NSW
Sbjct:   391 ISV-AVSDDFAFYKEGIFDGECGDQLNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSW 449

Query:   310 GPEWGEKGYIRMQRGISDKKGL---CGIAMEASYPI 342
             G +WGE+G+I ++   +D+ GL   CG+  +A  P+
Sbjct:   450 GQQWGERGFINIE---TDESGLMRKCGLGTDAFIPL 482


>RGD|2447
            symbol:Ctsh "cathepsin H" species:10116 "Rattus norvegicus"
            [GO:0001520 "outer dense fiber" evidence=IDA] [GO:0001656
            "metanephros development" evidence=IEP] [GO:0001669 "acrosomal
            vesicle" evidence=IDA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISO;ISS] [GO:0002250 "adaptive immune response"
            evidence=ISO] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISO;ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISO] [GO:0004177 "aminopeptidase activity" evidence=ISO;IDA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISO;ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISO;ISS]
            [GO:0005615 "extracellular space" evidence=ISO;ISS;IDA] [GO:0005764
            "lysosome" evidence=ISO;ISS;IDA] [GO:0005829 "cytosol"
            evidence=ISO;ISS] [GO:0006508 "proteolysis" evidence=IEP;ISO]
            [GO:0007283 "spermatogenesis" evidence=IEP] [GO:0008233 "peptidase
            activity" evidence=ISO] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISO] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISO;ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISO;ISS] [GO:0010634 "positive regulation
            of epithelial cell migration" evidence=ISO;ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISO;ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISO;ISS] [GO:0010952
            "positive regulation of peptidase activity" evidence=ISO;ISS]
            [GO:0016505 "apoptotic protease activator activity" evidence=ISO;ISS]
            [GO:0030108 "HLA-A specific activating MHC class I receptor activity"
            evidence=ISO;ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISO;ISS] [GO:0030984 "kininogen binding" evidence=IPI]
            [GO:0031638 "zymogen activation" evidence=ISO;ISS] [GO:0031648
            "protein destabilization" evidence=ISO;ISS] [GO:0032403 "protein
            complex binding" evidence=IPI] [GO:0032526 "response to retinoic
            acid" evidence=ISO;ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISO;ISS] [GO:0035085 "cilium axoneme" evidence=IDA]
            [GO:0043066 "negative regulation of apoptotic process"
            evidence=ISO;ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISO;ISS] [GO:0043621 "protein self-association"
            evidence=IDA] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISO;ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISO;ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISO;ISS] [GO:0070371 "ERK1 and
            ERK2 cascade" evidence=ISO;ISS] [GO:0097067 "cellular response to
            thyroid hormone stimulus" evidence=ISO;IEP] [GO:0097208 "alveolar
            lamellar body" evidence=ISO;ISS;IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:2447 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 GO:GO:0001669 eggNOG:COG4870
            HOGENOM:HOG000230774 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0007283
            GO:GO:0045766 GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0043621 GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 KO:K01366
            GO:GO:0016505 GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            HOVERGEN:HBG011513 GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT OrthoDB:EOG4W9J43
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 MEROPS:I29.003 EMBL:Y00708 EMBL:BC085352
            EMBL:M38135 IPI:IPI00212809 PIR:S00211 RefSeq:NP_037071.1
            UniGene:Rn.1997 ProteinModelPortal:P00786 SMR:P00786 STRING:P00786
            PRIDE:P00786 Ensembl:ENSRNOT00000019285 GeneID:25425 KEGG:rno:25425
            UCSC:RGD:2447 InParanoid:P00786 BindingDB:P00786 NextBio:606599
            Genevestigator:P00786 GermOnline:ENSRNOG00000014064 GO:GO:0035086
            GO:GO:0001520 Uniprot:P00786
        Length = 333

 Score = 533 (192.7 bits), Expect = 2.4e-51, P = 2.4e-51
 Identities = 121/311 (38%), Positives = 171/311 (54%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTY 98
             +  W   H  + S  E   R  VF  N   +   N+ +  +K+ LN+F+DM+  E    Y
Sbjct:    33 FTSWMKQHQKTYSSREYSHRLQVFANNWRKIQAHNQRNHTFKMGLNQFSDMSFAEIKHKY 92

Query:    99 AGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTA-VKDQGQCGSCWAFSTIA 157
               S+ ++      T+ N  ++ G     P S+DWRKKG+V + VK+QG CGSCW FST  
Sbjct:    93 LWSEPQN---CSATKSN--YLRG-TGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTG 146

Query:   158 AVEGINHIMTNKLVSLSEQELVDCDTD-QNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQ 216
             A+E    I + K+++L+EQ+LVDC  +  N GC GGL   AFE+I    G+  E  YPY 
Sbjct:   147 ALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYI 206

Query:   217 ANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDFQFYSEGV 275
               +G C  + E + A  +    N+  N E A+++AVA   PVS A +  + DF  Y  GV
Sbjct:   207 GKNGQCKFNPEKAVAF-VKNVVNITLNDEAAMVEAVALYNPVSFAFEV-TEDFMMYKSGV 264

Query:   276 FTGE-CGT---ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGL 331
             ++   C     ++NH V AVGYG   +G  YWIV+NSWG  WG  GY  ++RG    K +
Sbjct:   265 YSSNSCHKTPDKVNHAVLAVGYGEQ-NGLLYWIVKNSWGSNWGNNGYFLIERG----KNM 319

Query:   332 CGIAMEASYPI 342
             CG+A  ASYPI
Sbjct:   320 CGLAACASYPI 330


>WB|WBGene00007055
            symbol:tag-196 species:6239 "Caenorhabditis elegans"
            [GO:0004869 "cysteine-type endopeptidase inhibitor activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000010
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00031 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00043 SMART:SM00645 InterPro:IPR000169
            GO:GO:0004869 eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 EMBL:FO080488 PIR:T31871
            RefSeq:NP_505215.2 HSSP:Q9UBX1 ProteinModelPortal:O16454 SMR:O16454
            DIP:DIP-27400N IntAct:O16454 MINT:MINT-1044990 MEROPS:C01.A50
            PaxDb:O16454 EnsemblMetazoa:F41E6.6.1 EnsemblMetazoa:F41E6.6.2
            EnsemblMetazoa:F41E6.6.3 GeneID:179240 KEGG:cel:CELE_F41E6.6
            UCSC:F41E6.6.1 CTD:179240 WormBase:F41E6.6 InParanoid:O16454
            OMA:GGGLMTN NextBio:904514 Uniprot:O16454
        Length = 477

 Score = 530 (191.6 bits), Expect = 5.1e-51, P = 5.1e-51
 Identities = 130/308 (42%), Positives = 171/308 (55%)

Query:    37 DLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLK-LNKFADMTNHEFA 95
             D  +R    +T  R   E  KRF VFK+N   + +  K ++   +    KF+DMT  EF 
Sbjct:   176 DFVDRHEKKYTNKR---EVLKRFRVFKKNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFK 232

Query:    96 STYAGSKIKHHRMFQGTRGNGTFMYGKVT----SIPPSVDWRKKGSVTAVKDQGQCGSCW 151
                   + +   ++   + N  F    VT     +P S DWR+KG+VT VK+QG CGSCW
Sbjct:   233 KIMLPYQWEQP-VYPMEQAN--FEKHDVTINEEDLPESFDWREKGAVTQVKNQGNCGSCW 289

Query:   152 AFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEA 211
             AFST   VEG   I  NKLVSLSEQELVDCD+  +QGCNGGL   A++ I + GG+  E 
Sbjct:   290 AFSTTGNVEGAWFIAKNKLVSLSEQELVDCDS-MDQGCNGGLPSNAYKEIIRMGGLEPED 348

Query:   212 KYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFY 271
              YPY     TC + ++   AV I+G   +P +  +     V K P+S+ ++A +   QFY
Sbjct:   349 AYPYDGRGETCHLVRKDI-AVYINGSVELPHDEVEMQKWLVTKGPISIGLNANT--LQFY 405

Query:   272 SEGV---FTGECGT-ELNHGVAAVGYGTTLDGTK-YWIVRNSWGPEWGEKGYIRMQRGIS 326
               GV   F   C    LNHGV  VGYG   DG K YWIV+NSWGP WGE GY ++ RG  
Sbjct:   406 RHGVVHPFKIFCEPFMLNHGVLIVGYGK--DGRKPYWIVKNSWGPNWGEAGYFKLYRG-- 461

Query:   327 DKKGLCGI 334
               K +CG+
Sbjct:   462 --KNVCGV 467


>DICTYBASE|DDB_G0272298
            symbol:DDB_G0272298 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0272298 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246
            SMART:SM00848 EMBL:AAFI02000008 KO:K01365 RefSeq:XP_645281.1
            ProteinModelPortal:Q559Q3 MEROPS:C01.A53 EnsemblProtists:DDB0203746
            GeneID:8618447 KEGG:ddi:DDB_G0272298 InParanoid:Q559Q3 OMA:PANINWR
            Uniprot:Q559Q3
        Length = 305

 Score = 527 (190.6 bits), Expect = 1.1e-50, P = 1.1e-50
 Identities = 111/292 (38%), Positives = 164/292 (56%)

Query:    57 KRFNVFKQNVMHV-HQTNKMDKPYKLKLNKFADMTNHEFASTYAGSKIKHHRMFQGTRGN 115
             KRF++F+ N   + +  NK  +  ++ LN+++D+T  EFA  +    +   R        
Sbjct:    16 KRFDIFQDNYNFILNHRNKNGENIEMDLNEYSDLTQKEFADKFFEKLVPEPRSGPINDIK 75

Query:   116 GT-FMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLS 174
              T F +    +IP S DWR  G+V  VK+QG C SCW+FS + A+EG  +I   +L+ LS
Sbjct:    76 ATPFKHNVNATIPKSFDWRDHGAVGKVKNQGSCASCWSFSALGALEGHYYIKYGELLDLS 135

Query:   175 EQELVDCDTDQN-QGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVS 233
             EQ LVDC T    +GC  G M  AF++I   GGV  E++YPY   D  C  ++    A  
Sbjct:   136 EQNLVDCATPFGPKGCKTGWMHDAFKYIISSGGVNLESQYPYTGKDEVCKFNQSEKEA-K 194

Query:   234 IDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFTGECGTELN--HGVAA 290
             + G   +P   E AL++A+A   PV+V ID  + +FQ  S G++  +     N  H V A
Sbjct:   195 VSGFVMIPKFDESALMEAIALYGPVAVPIDTSTKEFQHLSGGIYYSDSCDPWNTIHAVLA 254

Query:   291 VGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPI 342
             +GYGT  +G  Y++++NSWG  WG  G+ +++RG+   KG CGI   ASYPI
Sbjct:   255 IGYGTDENGVDYFLMKNSWGKSWGTNGFFKVKRGV---KGKCGIVTAASYPI 303


>UNIPROTKB|J9P7C5
            symbol:J9P7C5 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 EMBL:AAEX03010953
            Ensembl:ENSCAFT00000012925 Uniprot:J9P7C5
        Length = 321

 Score = 527 (190.6 bits), Expect = 1.1e-50, P = 1.1e-50
 Identities = 124/310 (40%), Positives = 172/310 (55%)

Query:    41 RWRSHHTVSRSLDEKHKRFNVFKQNV----MHVHQTNKMDKPYKLKLNKFADMTNHEFAS 96
             +W++ H     ++E+  R  V+++N+    +H  + ++    + + +N F DMTN EF  
Sbjct:    26 QWKAMHRRLYGMNEEGWRRAVWEKNMKMIELHNREYSQGKHGFTMAMNAFGDMTNEEFRQ 85

Query:    97 TYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTI 156
                G + + H+     +G   F       IP SVDWR+KG VT VK+QGQCGSCWAFS  
Sbjct:    86 VINGFQNQKHK-----KGK-VFQEPLFAEIPKSVDWREKGYVTPVKNQGQCGSCWAFSAT 139

Query:   157 AAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQ 216
              A EG     T  LV LSEQ L       N+GCNGGLM+ AF+++K    + +E  YPY 
Sbjct:   140 GAFEGQMFWKTGNLVPLSEQNLAQ----GNEGCNGGLMDNAFQYVKDNRCLDSEESYPYL 195

Query:   217 AND-GTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEG 274
               D  TC+   E S A    G  ++P   E AL+KA+A    ++VAIDAG   FQFY   
Sbjct:   196 GRDTDTCNYKPECSAAHD-SGFVDLP-QREKALMKAMATLGSITVAIDAGHQYFQFYKSS 253

Query:   275 V-FTGECGT-ELNHGVAAVGYGTT-LDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGL 331
             + F  +C + +L+HGV  VGYG    D    WIV+NSW PEWG   Y++M +G ++    
Sbjct:   254 IYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKWIVKNSWSPEWGWNSYVKMAKGQNNH--- 310

Query:   332 CGIAMEASYP 341
             CGI   ASYP
Sbjct:   311 CGITA-ASYP 319


>RGD|1562210
            symbol:MGC114246 "similar to cathepsin R" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1562210 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 EMBL:CH474032 MEROPS:C01.042 KO:K09601
            OMA:ASHESFK OrthoDB:EOG4ZCT6D EMBL:BC091563 IPI:IPI00555186
            RefSeq:NP_001017509.1 UniGene:Rn.198321 SMR:Q5BJA0
            Ensembl:ENSRNOT00000061470 GeneID:498688 KEGG:rno:498688
            UCSC:RGD:1562210 InParanoid:Q5BJA0 NextBio:700535
            Genevestigator:Q5BJA0 Uniprot:Q5BJA0
        Length = 334

 Score = 526 (190.2 bits), Expect = 1.3e-50, P = 1.3e-50
 Identities = 120/336 (35%), Positives = 178/336 (52%)

Query:    17 GIVEGFDFHEKELESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNV--MHVHQ-TN 73
             G+  G    +  L++E   W   + W+  +  S SL+E+  R  V+++N+  + +H   N
Sbjct:    13 GVASGAPILDPSLDAE---W---QEWKKKYDKSYSLEEEELRRAVWEENLKMIKLHNGEN 66

Query:    74 KMDKP-YKLKLNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDW 132
              + K  + +++N+F D T  EF        ++ HR      G         +  P  VDW
Sbjct:    67 GLGKNGFTMEINEFGDTTGEEFRKMMVEFPVQTHR-----EGKSIMKRAAGSIFPKFVDW 121

Query:   133 RKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNG 191
             RKKG VT V+ QG C +CWAFS   A+E      + KL+ LS Q LVDC   Q N GC G
Sbjct:   122 RKKGYVTPVRRQGNCNACWAFSVTGAIEAQTIWQSGKLIPLSVQNLVDCSKPQGNNGCLG 181

Query:   192 GLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKA 251
             G    AF+++   GG+ +EA YPY+  DG C  + ++S A  I G  ++P + ED L+ A
Sbjct:   182 GDTYNAFQYVLHNGGLQSEATYPYEGKDGPCRYNPKNSSA-EITGFVSLPES-EDILMVA 239

Query:   252 VAK-QPVSVAIDAGSSDFQFYSEGVF-TGECGTE-LNHGVAAVGYG---TTLDGTKYWIV 305
             VA   P+S  IDA    F+FY +G++    C +  + HGV  VGYG       G  YW++
Sbjct:   240 VATIGPISAGIDASHESFKFYKKGIYHEPNCSSNSVTHGVLVVGYGFKGNDTGGDHYWLI 299

Query:   306 RNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYP 341
             +NSWG +WG +GY+++ +   DK   C IA  A YP
Sbjct:   300 KNSWGKQWGIRGYMKITK---DKNNHCAIASYAHYP 332


>ZFIN|ZDB-GENE-050208-336
            symbol:ctskl "cathepsin K, like" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-050208-336 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:BX465190
            GeneTree:ENSGT00660000095458 IPI:IPI00491185 RefSeq:XP_695425.1
            UniGene:Dr.110795 Ensembl:ENSDART00000062749 GeneID:567046
            KEGG:dre:567046 CTD:567046 NextBio:20888499 Bgee:F1QCP8
            Uniprot:F1QCP8
        Length = 349

 Score = 526 (190.2 bits), Expect = 1.3e-50, P = 1.3e-50
 Identities = 129/327 (39%), Positives = 182/327 (55%)

Query:    30 ESEEGLWDLYERWRSHHTVS--RSLDEKHKRFNVFKQNVMHVHQTNKMD-----KPYKLK 82
             ESEE     +  W+  H +S     ++ H++  +++ N+  + + N  D       +K+ 
Sbjct:    32 ESEEEAPTEWNLWKKKHEISYDEESEDVHRK-TIWETNMQKIWKNNN-DFSFGLSMFKMA 89

Query:    83 LNKFADMTNHEFASTYAGSKIKH--HRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTA 140
             +NK+ D+T+ E+     GSKIK   +R  + T      +  K   +  ++D+R KG VT 
Sbjct:    90 MNKYGDLTSVEY-KRLLGSKIKGTGNRKGKITSAQMLRLNAKRLGVT-NIDYRAKGYVTE 147

Query:   141 VKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQ-GCNGGLMELAFE 199
             VKDQG CGSCW+FST  A+EG  +  T +LVSLSEQ+LVDC       GC+G  M  A++
Sbjct:   148 VKDQGYCGSCWSFSTTGAIEGQMYKHTGRLVSLSEQQLVDCSRSYGTYGCSGAWMANAYD 207

Query:   200 FIKKKGGVTTEAKYPYQANDGT-CDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PV 257
             ++      +++  YPY + D   C   K  + A  I  +  VPA +E AL  AVA   PV
Sbjct:   208 YVINNALESSDT-YPYTSVDTQPCFYEKNLAMA-GISDYRFVPAGNEQALADAVATVGPV 265

Query:   258 SVAIDAGSSDFQFYSEGVFT-GECG-TELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGE 315
             SVAIDA +  F FYS G++    C    LNH V  VGYG+  +GT YWI++NSWG  WGE
Sbjct:   266 SVAIDADNPSFLFYSSGIYKESNCNPNNLNHAVLVVGYGSE-EGTDYWIIKNSWGTGWGE 324

Query:   316 KGYIRMQRGISDKKGLCGIAMEASYPI 342
              GY+RM   I + K  CGIA  A YPI
Sbjct:   325 GGYMRM---IRNGKNTCGIASYALYPI 348


>MGI|MGI:1927229
            symbol:Ctsm "cathepsin M" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008152 "metabolic process" evidence=ISS] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015645 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1927229 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF202528
            EMBL:AY014777 EMBL:AY057446 EMBL:AK005550 EMBL:AK005428
            IPI:IPI00131133 RefSeq:NP_071721.2 UniGene:Mm.279933
            ProteinModelPortal:Q9JL96 SMR:Q9JL96 STRING:Q9JL96 MEROPS:C01.023
            PRIDE:Q9JL96 DNASU:64139 Ensembl:ENSMUST00000099451 GeneID:64139
            KEGG:mmu:64139 UCSC:uc007qwj.1 CTD:64139 InParanoid:Q9JL96
            KO:K09600 OrthoDB:EOG4TTGKR NextBio:319931 Bgee:Q9JL96
            CleanEx:MM_CTSM Genevestigator:Q9JL96 GermOnline:ENSMUSG00000074484
            GermOnline:ENSMUSG00000074871 PANTHER:PTHR12411:SF58 Uniprot:Q9JL96
        Length = 333

 Score = 525 (189.9 bits), Expect = 1.7e-50, P = 1.7e-50
 Identities = 122/315 (38%), Positives = 180/315 (57%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNVMHV--HQ-TNKMDKP-YKLKLNKFADMTNHEF 94
             +++W+  +  + SL+E+ ++  V++ N+  +  H   N + K  + +++N F DMT  EF
Sbjct:    29 WQKWKIKYGKAYSLEEEGQKRAVWEDNMKKIKLHNGENGLGKHGFTMEMNAFGDMTLEEF 88

Query:    95 ASTYAGSKIKHHRMFQGTRGNGTFMYGKVT-SIPPSVDWRKKGSVTAVKDQGQCGSCWAF 153
                     +        T   G  +  +++ ++P  ++W+K+G VT V+ QG+C SCWAF
Sbjct:    89 RKVMIEIPVP-------TVKKGKSVQKRLSVNLPKFINWKKRGYVTPVQTQGRCNSCWAF 141

Query:   154 STIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEAK 212
             S   A+EG     T +L+ LS Q LVDC   Q N GC  G   LA  ++ + GG+ +EA 
Sbjct:   142 SVTGAIEGQMFRKTGQLIPLSVQNLVDCSRPQGNWGCYLGNTYLALHYVMENGGLESEAT 201

Query:   213 YPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDFQFY 271
             YPY+  DG+C  S E+S A +I G E VP N EDAL+ AVA   P+SVAIDA  + F FY
Sbjct:   202 YPYEEKDGSCRYSPENSTA-NITGFEFVPKN-EDALMNAVASIGPISVAIDARHASFLFY 259

Query:   272 SEGVF-TGECGT-ELNHGVAAVGYGTT---LDGTKYWIVRNSWGPEWGEKGYIRMQRGIS 326
               G++    C +  + H +  VGYG T    DG KYW+V+NS G +WG KGY+++ R   
Sbjct:   260 KRGIYYEPNCSSCVVTHSMLLVGYGFTGRESDGRKYWLVKNSMGTQWGNKGYMKISR--- 316

Query:   327 DKKGLCGIAMEASYP 341
             DK   CGIA  A YP
Sbjct:   317 DKGNHCGIATYALYP 331


>UNIPROTKB|D3ZZR3
            symbol:D3ZZR3 "Uncharacterized protein" species:10116
            "Rattus norvegicus" [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0016020 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0002250 GeneTree:ENSGT00560000076577 GO:GO:0097067
            OrthoDB:EOG4JM7Q2 IPI:IPI00210228 PRIDE:D3ZZR3
            Ensembl:ENSRNOT00000028732 Uniprot:D3ZZR3
        Length = 331

 Score = 525 (189.9 bits), Expect = 1.7e-50, P = 1.7e-50
 Identities = 125/317 (39%), Positives = 175/317 (55%)

Query:    36 WDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHV--HQT-NKMDK-PYKLKLNKFADMTN 91
             WDL+++  +H    +  +E+  R  ++++N+  +  H   + M    Y + +N   DM  
Sbjct:    25 WDLWKK--THEKEYKDQNEEDVRRLIWEKNLKFIMLHNLEHSMGMHSYSVGMNHMGDMV- 81

Query:    92 HEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDW--RKKGSVTAVKDQGQCGS 149
                A T  G ++   R+ +  +  G        ++P  V W  R KG    +  QG CGS
Sbjct:    82 ---AETIIG-EMGSERLPRKRKALGLIPSSVNQNLPAGVKWKERTKGCWKNLVFQGSCGS 137

Query:   150 CWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ---NQGCNGGLMELAFEFIKKKGG 206
             CWAFS + A+EG   + T KLVSLS Q LVDC T++   N+GC GG M  AF++I   GG
Sbjct:   138 CWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEAFQYIIDNGG 197

Query:   207 VTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVA-KQPVSVAIDAGS 265
             + +EA YPY+A D  C    ++  A +   +  +P   E+AL +AVA K PVSV IDA  
Sbjct:   198 IDSEASYPYKAMDEKCHYDPKNR-AATCSRYIELPFGDEEALKEAVATKGPVSVGIDASH 256

Query:   266 SDFQFYSEGVFTG-ECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRG 324
             S F  Y  GV+    C   +NHGV  VGYGT LDG  YW+V+NSWG  +G++GYIRM R 
Sbjct:   257 SSFFLYQSGVYDDPSCTENVNHGVLVVGYGT-LDGKDYWLVKNSWGLHFGDQGYIRMAR- 314

Query:   325 ISDKKGLCGIAMEASYP 341
               + K  CGIA   SYP
Sbjct:   315 --NNKNHCGIASYCSYP 329


>MGI|MGI:1349426
            symbol:Ctsj "cathepsin J" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=ISO]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0048471 "perinuclear region
            of cytoplasm" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:1349426 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF136272
            EMBL:AF158182 EMBL:AY034579 EMBL:AK005526 EMBL:AK131661
            EMBL:BC103769 IPI:IPI00126770 RefSeq:NP_036137.1 UniGene:Mm.31948
            ProteinModelPortal:Q9R014 SMR:Q9R014 MEROPS:C01.038 PRIDE:Q9R014
            Ensembl:ENSMUST00000071526 GeneID:26898 KEGG:mmu:26898
            UCSC:uc007qwa.1 CTD:26898 InParanoid:Q9R014 KO:K09599
            NextBio:304745 Bgee:Q9R014 CleanEx:MM_CTSJ Genevestigator:Q9R014
            GermOnline:ENSMUSG00000055298 Uniprot:Q9R014
        Length = 334

 Score = 523 (189.2 bits), Expect = 2.8e-50, P = 2.8e-50
 Identities = 125/336 (37%), Positives = 182/336 (54%)

Query:    17 GIVEGFDFHEKELESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNV--MHVH-QTN 73
             G+  G   H+ +L++E   W   + W++ +  S S  E+  R  V+++N+  + +H + N
Sbjct:    13 GVASGAQAHDPKLDAE---W---KDWKTKYAKSYSPKEEALRRAVWEENMRMIKLHNKEN 66

Query:    74 KMDKP-YKLKLNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDW 132
              + K  + +K+NKF D T+ EF  +     I    M      N   +      +P   DW
Sbjct:    67 SLGKNNFTMKMNKFGDQTSEEFRKSIDNIPIPA-AMTDPHAQNHVSI-----GLPDYKDW 120

Query:   133 RKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCD-TDQNQGCNG 191
             R++G VT V++QG+CGSCWAF+   A+EG     T  L  LS Q L+DC  T  N+GC  
Sbjct:   121 REEGYVTPVRNQGKCGSCWAFAAAGAIEGQMFWKTGNLTPLSVQNLLDCSKTVGNKGCQS 180

Query:   192 GLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKA 251
             G    AFE++ K  G+  EA YPY+  DG C    E++ A +I  + N+P N E  L  A
Sbjct:   181 GTAHQAFEYVLKNKGLEAEATYPYEGKDGPCRYRSENASA-NITDYVNLPPN-ELYLWVA 238

Query:   252 VAK-QPVSVAIDAGSSDFQFYSEGVF-TGECGTE-LNHGVAAVGYGT---TLDGTKYWIV 305
             VA   PVS AIDA    F+FY+ G++    C +  +NH V  VGYG+     DG  YW++
Sbjct:   239 VASIGPVSAAIDASHDSFRFYNGGIYYEPNCSSYFVNHAVLVVGYGSEGDVKDGNNYWLI 298

Query:   306 RNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYP 341
             +NSWG EWG  GY+++ +   D    CGIA  ASYP
Sbjct:   299 KNSWGEEWGMNGYMQIAK---DHNNHCGIASLASYP 331


>DICTYBASE|DDB_G0290957
            symbol:cprA "cysteine proteinase 1" species:44689
            "Dictyostelium discoideum" [GO:0006972 "hyperosmotic response"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0290957
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GenomeReviews:CM000154_GR GO:GO:0005764
            GO:GO:0006972 EMBL:AAFI02000174 KO:K01376 EMBL:X02407 PIR:A22827
            RefSeq:XP_635417.1 ProteinModelPortal:P04988 MEROPS:C01.022
            GlycoSuiteDB:P04988 SWISS-2DPAGE:P04988 EnsemblProtists:DDB0201647
            GeneID:8627918 KEGG:ddi:DDB_G0290957 OMA:KISNFTM
            ProtClustDB:CLSZ2429603 Uniprot:P04988
        Length = 343

 Score = 522 (188.8 bits), Expect = 3.6e-50, P = 3.6e-50
 Identities = 122/305 (40%), Positives = 174/305 (57%)

Query:    51 SLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLK----LNKFADMTNHEFASTYAGSKIKHH 106
             S +E  +RF +FK N+  + + N +   +K      +NKFAD+++ EF + Y  +K    
Sbjct:    41 SHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNK---E 97

Query:   107 RMFQGTRGNGTFMYGK-VTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHI 165
              +F        ++  + + SIP + DWR +G+VT VK+QGQCGSCW+FST   VEG + I
Sbjct:    98 AIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFI 157

Query:   166 MTNKLVSLSEQELVDCDTD-------Q--NQGCNGGLMELAFEFIKKKGGVTTEAKYPYQ 216
               NKLVSLSEQ LVDCD +       Q  ++GCNGGL   A+ +I K GG+ TE+ YPY 
Sbjct:   158 SQNKLVSLSEQNLVDCDHECMEYEGEQACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYT 217

Query:   217 ANDGT-CDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGV 275
             A  GT C+ +  +  A  I     +P N        V+  P+++A DA   ++QFY  GV
Sbjct:   218 AETGTQCNFNSANIGA-KISNFTMIPKNETVMAGYIVSTGPLAIAADA--VEWQFYIGGV 274

Query:   276 FTGECG-TELNHGVAAVGYGT--TL--DGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKG 330
             F   C    L+HG+  VGY    T+      YWIV+NSWG +WGE+GYI ++RG    K 
Sbjct:   275 FDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRG----KN 330

Query:   331 LCGIA 335
              CG++
Sbjct:   331 TCGVS 335


>DICTYBASE|DDB_G0291191
            symbol:DDB_G0291191 "cysteine protease" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0291191
            GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AAFI02000175 MEROPS:C01.022
            ProtClustDB:CLSZ2429603 RefSeq:XP_635374.1
            ProteinModelPortal:Q54F16 PRIDE:Q54F16 EnsemblProtists:DDB0252831
            GeneID:8628022 KEGG:ddi:DDB_G0291191 OMA:NETQIAS Uniprot:Q54F16
        Length = 352

 Score = 521 (188.5 bits), Expect = 4.6e-50, P = 4.6e-50
 Identities = 126/322 (39%), Positives = 171/322 (53%)

Query:    51 SLDEKHKRFNVFKQNVMHVHQTNK----MDKPYKLKLNKFADMTNHEFASTYAGSKIKHH 106
             S +E   +F  FK N++++   NK    +    K  +NKFAD++  EF   Y  SK    
Sbjct:    39 SAEEYLVKFETFKSNLLNIDALNKQATTIGSDTKFGVNKFADLSKEEFKKYYLSSK--EA 96

Query:   107 RMFQGTRGNGTFMYGKVTSIPPSVDWRKKGS---------VTAVKDQGQCGSCWAFSTIA 157
             R+              +++ P + DWR  G          VTAVK+QGQCGSCW+FST  
Sbjct:    97 RLTDDLPMLPNLSDDIISATPAAFDWRNTGGSTKFPQGTPVTAVKNQGQCGSCWSFSTTG 156

Query:   158 AVEGINHIMTNKLVSLSEQELVDCD----TDQNQ-----GCNGGLMELAFEFIKKKGGVT 208
              VEG +++ T  LV LSEQ LVDCD    T +N+     GC+GGL   A+ +I K GG+ 
Sbjct:   157 NVEGQHYLSTGTLVGLSEQNLVDCDHTCMTYENENVCNAGCDGGLQPNAYNYIIKNGGIQ 216

Query:   209 TEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDF 268
             TEA YPY A DG C  +     A  I     VP N            P+++A DA   ++
Sbjct:   217 TEATYPYTAVDGECKFNSAQVGA-KISSFTMVPQNETQIASYLFNNGPLAIAADA--EEW 273

Query:   269 QFYSEGVFTGECGTELNHGVAAVGYGT--TLDG--TKYWIVRNSWGPEWGEKGYIRMQRG 324
             QFY  GVF   CG  L+HG+  VGYG   T+ G  T YWI++NSWG +WGE GY++++R 
Sbjct:   274 QFYMGGVFDFPCGQTLDHGILIVGYGAQDTIVGKNTPYWIIKNSWGADWGEAGYLKVERN 333

Query:   325 ISDKKGLCGIAMEASYPIKKSA 346
              +DK   CG+A   S  I  S+
Sbjct:   334 -TDK---CGVANFVSSSIVGSS 351


>UNIPROTKB|Q4QRC2
            symbol:Ctsql2 "Protein Ctsql2" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 HOVERGEN:HBG011513 EMBL:CH474032
            RGD:1303225 EMBL:BC097257 IPI:IPI00421946 RefSeq:NP_001002813.2
            UniGene:Rn.128678 SMR:Q4QRC2 MEROPS:C01.111
            Ensembl:ENSRNOT00000038758 GeneID:408201 KEGG:rno:408201 CTD:408201
            InParanoid:Q4QRC2 OMA:NDEGALM NextBio:696394 Genevestigator:Q4QRC2
            Uniprot:Q4QRC2
        Length = 343

 Score = 521 (188.5 bits), Expect = 4.6e-50, P = 4.6e-50
 Identities = 121/342 (35%), Positives = 188/342 (54%)

Query:    17 GIVEGFDFHEKELESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNV--MHVH-QTN 73
             G+V G       L+ +   W   + W+  +    S +E+  +  V+++NV  + +H + N
Sbjct:    13 GVVSGASAFNLSLDVQ---W---QEWKMKYEKLYSPEEELLKRVVWEENVKKIELHNREN 66

Query:    74 KMDK-PYKLKLNKFADMTNHEFASTYAGSKIKHHRMFQGT--RGNGT-F---MYGKVTSI 126
              + K  Y +++N FAD+T+ EF     G  +  +   +    R  G+ F    Y +  ++
Sbjct:    67 SLGKNTYIMEINNFADLTDEEFKDMITGITLPINNTMKSLWKRALGSPFPNSWYWR-DAL 125

Query:   127 PPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ- 185
             P S+DWRK+G VT V++QG+C SCWAF    A+EG     T KL  LS Q LVDC   Q 
Sbjct:   126 PKSIDWRKEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCSKPQG 185

Query:   186 NQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHE 245
             N+GC GG    AF+++ + GG+ +EA YPY+  +G C  + +++ A  I     +P + E
Sbjct:   186 NKGCRGGTTYNAFQYVLQNGGLESEATYPYKGKEGLCKYNPKNAYA-KITRFVALPED-E 243

Query:   246 DALLKAVA-KQPVSVAIDAGSSDFQFYSEGVF-TGECGTELNHGVAAVGYG---TTLDGT 300
             D L+ A+A K PV+  I    S  +FY +G++   +C   +NH V  VGYG      DG 
Sbjct:   244 DVLMDALATKGPVAAGIHVVYSSLRFYKKGIYHEPKCNNRVNHAVLVVGYGFEGNETDGN 303

Query:   301 KYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPI 342
              YW+++NSWG +WG KGY+++ +   D+   CGIA  A YPI
Sbjct:   304 NYWLIKNSWGKQWGLKGYMKIAK---DRNNHCGIATFAQYPI 342


>ZFIN|ZDB-GENE-050522-559
            symbol:ctssb.1 "cathepsin S, b.1" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-050522-559 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 MEROPS:C01.034
            EMBL:BC095694 IPI:IPI00607338 UniGene:Dr.75553
            ProteinModelPortal:Q502H6 SMR:Q502H6 InParanoid:Q502H6
            ArrayExpress:Q502H6 Uniprot:Q502H6
        Length = 330

 Score = 519 (187.8 bits), Expect = 7.4e-50, P = 7.4e-50
 Identities = 121/314 (38%), Positives = 175/314 (55%)

Query:    36 WDLYERWRSHHTVSRSLDEKHKRFNVFKQNV--MHVHQTN-KMDK-PYKLKLNKFADMTN 91
             W+L+++  ++  +  +  E+  R  ++++N+  + VH     M    Y L +N   D+T 
Sbjct:    27 WELWKK--TYGKIYTTEVEEFGRRQLWERNLQLITVHNLEASMGMHSYDLSMNHMGDLTT 84

Query:    92 HEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCW 151
              E   T A + +     F+    N     G   ++P S+DWR+KG V++VK QG CGSCW
Sbjct:    85 EEILQTLALTHVPSG--FKRQIANIVGSSGD--AVPDSLDWREKGYVSSVKMQGACGSCW 140

Query:   152 AFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTE 210
             AFS++ A+EG     T KLV LS Q LVDC +   N+GCNGG M  AF+++   GG+ ++
Sbjct:   141 AFSSVGALEGQLKKTTGKLVDLSPQNLVDCSSKYGNKGCNGGFMSDAFQYVIDNGGIASD 200

Query:   211 AKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQ 269
             + YPY+     C  S  S  A +   +  V    E+AL +AVA   P+SVAIDA    F 
Sbjct:   201 SAYPYRGVQQQCSYSS-SQRAANCTKYYFVRQGDENALKQAVASVGPISVAIDATRPQFV 259

Query:   270 FYSEGVFTGE-CGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDK 328
              Y  GV+    C   +NH V  VGYGT L G  +W+V+NSWG  +G+ GYIRM R   +K
Sbjct:   260 LYHSGVYNDPTCSKRVNHAVLVVGYGT-LSGQDHWLVKNSWGTRFGDGGYIRMAR---NK 315

Query:   329 KGLCGIAMEASYPI 342
               +CGIA  A YP+
Sbjct:   316 NNMCGIASYACYPV 329


>RGD|1588248
            symbol:Cts8 "cathepsin 8" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1588248 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 IPI:IPI00765053
            RefSeq:NP_001121688.1 UniGene:Rn.220599 Ensembl:ENSRNOT00000061486
            GeneID:680718 KEGG:rno:680718 UCSC:RGD:1588248 CTD:56094
            OMA:DSEWQEW OrthoDB:EOG4JT07C NextBio:719350 Uniprot:D3ZP54
        Length = 333

 Score = 518 (187.4 bits), Expect = 9.5e-50, P = 9.5e-50
 Identities = 120/322 (37%), Positives = 176/322 (54%)

Query:    31 SEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTN-KMD---KPYKLKLNKF 86
             S+  L   ++ W++ +  + SL+E+ ++  V+++N+  V Q N + D   K + ++LN F
Sbjct:    21 SDPSLDSEWQEWKTKYEKNYSLEEEGQKRAVWEENMKVVKQHNIEYDQEKKNFTMELNAF 80

Query:    87 ADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQ 146
             ADMT  EF        +++ R  +       F Y     +P  VDWR++G VT+VK+QG 
Sbjct:    81 ADMTGEEFRKMMTNIPVQNLRKKKSIH-QPIFRY-----LPKFVDWRRRGYVTSVKNQGT 134

Query:   147 CGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKG 205
             C SCWAFS   A+EG     T +LVSLS Q LVDC   + N GC+ G    A +++   G
Sbjct:   135 CNSCWAFSVAGAIEGQMFRKTGRLVSLSPQNLVDCSRPEGNHGCHMGSTLYALKYVWSNG 194

Query:   206 GVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAK-QPVSVAIDAG 264
             G+  E+ YPY+  +G C      S A  + G   V A  E+AL+ AVA   P+SV IDA 
Sbjct:   195 GLEAESTYPYEGKEGPCRYLPRRS-AARVTGFSTV-ARSEEALMHAVATIGPISVGIDAS 252

Query:   265 SSDFQFYSEGVF-TGECGTE-LNHGVAAVGYG---TTLDGTKYWIVRNSWGPEWGEKGYI 319
                F+FY  G++    C +  +NH V  VGYG      DG KYW+++NS G  WG  GY+
Sbjct:   253 HVSFRFYRRGIYYEPRCSSNRINHSVLVVGYGYEGRESDGRKYWLIKNSHGVGWGMNGYM 312

Query:   320 RMQRGISDKKGLCGIAMEASYP 341
             ++ RG ++    CGIA    YP
Sbjct:   313 KLARGWNNH---CGIATYGFYP 331


>RGD|631421
            symbol:Ctsq "cathepsin Q" species:10116 "Rattus norvegicus"
            [GO:0005764 "lysosome" evidence=NAS] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 RGD:631421 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 UniGene:Rn.34875 EMBL:AF187323 IPI:IPI00214897
            PIR:JC7183 RefSeq:NP_640355.1 UniGene:Rn.35820
            ProteinModelPortal:Q9QZE3 SMR:Q9QZE3 STRING:Q9QZE3 MEROPS:C01.039
            PRIDE:Q9QZE3 Ensembl:ENSRNOT00000024208 GeneID:246147
            KEGG:rno:246147 UCSC:RGD:631421 CTD:104002 InParanoid:Q9QZE3
            OMA:ESEDVLM OrthoDB:EOG4HHP48 NextBio:623425 Genevestigator:Q9QZE3
            GermOnline:ENSRNOG00000017946 Uniprot:Q9QZE3
        Length = 343

 Score = 515 (186.3 bits), Expect = 2.0e-49, P = 2.0e-49
 Identities = 115/318 (36%), Positives = 176/318 (55%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNV--MHVH-QTNKMDK-PYKLKLNKFADMTNHEF 94
             ++ W+  +    S +E+  +  V+++NV  + +H + N + K  Y +++N FADMT+ EF
Sbjct:    29 WQEWKIKYEKLYSPEEEVLKRVVWEENVKKIELHNRENSLGKNTYTMEINDFADMTDEEF 88

Query:    95 ASTYAGSKIKHHRMFQGT--RGNGTFM---YGKVTSIPPSVDWRKKGSVTAVKDQGQCGS 149
                  G ++  H   +    R  G+F    +    ++P  VDWR +G VT V+ QG C S
Sbjct:    89 KDMIIGFQLPVHNTEKRLWKRALGSFFPNSWNWRDALPKFVDWRNEGYVTRVRKQGGCSS 148

Query:   150 CWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVT 208
             CWAF    A+EG     T KL+ LS Q L+DC   Q N+GC  G    AF+++   GG+ 
Sbjct:   149 CWAFPVTGAIEGQMFKKTGKLIPLSVQNLIDCSKPQGNRGCLWGNTYNAFQYVLHNGGLE 208

Query:   209 TEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVA-KQPVSVAIDAGSSD 267
              EA YPY+  +G C  + ++S A  I G   +P + ED L+ AVA K P++  +   SS 
Sbjct:   209 AEATYPYERKEGVCRYNPKNSSA-KITGFVVLPES-EDVLMDAVATKGPIATGVHVISSS 266

Query:   268 FQFYSEGVF-TGECGTELNHGVAAVGYG---TTLDGTKYWIVRNSWGPEWGEKGYIRMQR 323
             F+FY +GV+   +C + +NH V  VGYG      DG  YW+++NSWG  WG +GY+++ +
Sbjct:   267 FRFYQKGVYHEPKCSSYVNHAVLVVGYGFEGNETDGNNYWLIKNSWGKRWGLRGYMKIAK 326

Query:   324 GISDKKGLCGIAMEASYP 341
                D+   C IA  A YP
Sbjct:   327 ---DRNNHCAIASLAQYP 341


>UNIPROTKB|F1P3U9
            symbol:CTSH "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005829 "cytosol" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=IEA] [GO:0010628 "positive regulation of gene expression"
            evidence=IEA] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=IEA] [GO:0010813 "neuropeptide catabolic
            process" evidence=IEA] [GO:0010815 "bradykinin catabolic process"
            evidence=IEA] [GO:0016505 "apoptotic protease activator activity"
            evidence=IEA] [GO:0030108 "HLA-A specific activating MHC class I
            receptor activity" evidence=IEA] [GO:0031638 "zymogen activation"
            evidence=IEA] [GO:0031648 "protein destabilization" evidence=IEA]
            [GO:0032526 "response to retinoic acid" evidence=IEA] [GO:0033619
            "membrane protein proteolysis" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0043129
            "surfactant homeostasis" evidence=IEA] [GO:0045766 "positive
            regulation of angiogenesis" evidence=IEA] [GO:0060448 "dichotomous
            subdivision of terminal units involved in lung branching"
            evidence=IEA] [GO:0070324 "thyroid hormone binding" evidence=IEA]
            [GO:0070371 "ERK1 and ERK2 cascade" evidence=IEA] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA]
            [GO:0097208 "alveolar lamellar body" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066
            GO:GO:0005615 GO:GO:0008284 GO:GO:0070371 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0032526 GO:GO:0010628
            GO:GO:0070324 GO:GO:0016505 GO:GO:0010634 GO:GO:0004197
            GO:GO:0042599 GO:GO:0031648 GO:GO:0097067 GO:GO:0031638
            GO:GO:0001913 GeneTree:ENSGT00660000095458 OMA:STSCHKT
            GO:GO:0030108 GO:GO:0010815 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 EMBL:AADN02038832 EMBL:AADN02038831 IPI:IPI00594147
            Ensembl:ENSGALT00000013440 Uniprot:F1P3U9
        Length = 261

 Score = 514 (186.0 bits), Expect = 2.5e-49, P = 2.5e-49
 Identities = 120/271 (44%), Positives = 158/271 (58%)

Query:    79 YKLKLNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGS- 137
             + + LN+F+DMT  EF   Y  S+ ++      TRGN  F+       P +VDWRKKG+ 
Sbjct:     1 FLVALNQFSDMTFAEFKKLYLWSEPQN---CSATRGN--FLRSD-GPCPEAVDWRKKGNF 54

Query:   138 VTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDC-DTDQNQGCNGGLMEL 196
             VT VK+QG CGSCW FST   +E    I T KL+SL+EQ LVDC     N GC+GGL   
Sbjct:    55 VTPVKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQLLVDCAQAFNNHGCSGGLPSQ 114

Query:   197 AFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ- 255
             AFE+I    G+  E  YPY+A +GTC    + + A   D   N+    E  +++AV K  
Sbjct:   115 AFEYILYNKGLMGEDAYPYRAQNGTCKFQPDKAIAFVKDVI-NITQYDEAGMVEAVGKHN 173

Query:   256 PVSVAIDAGSSDFQFYSEGVFTG-ECG---TELNHGVAAVGYGTTLDGTKYWIVRNSWGP 311
             PVS A +  +SDF  Y +GV++   C     ++NH V AVGYG   DG  YWIV+NSWGP
Sbjct:   174 PVSFAFEV-TSDFMHYRKGVYSNPRCEHTPDKVNHAVLAVGYGEE-DGRPYWIVKNSWGP 231

Query:   312 EWGEKGYIRMQRGISDKKGLCGIAMEASYPI 342
              WG  GY  ++RG    K +CG+A  ASYP+
Sbjct:   232 LWGMDGYFLIERG----KNMCGLAACASYPV 258


>RGD|69241 [details] [associations]
            symbol:Ctsj "cathepsin J" species:10116 "Rattus norvegicus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0048471 "perinuclear region of cytoplasm"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:L14776
            RGD:69241 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246
            SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 HOVERGEN:HBG011513 MEROPS:C01.038 CTD:26898 KO:K09599
            EMBL:AF310623 EMBL:BC097263 IPI:IPI00205027 PIR:I58002
            RefSeq:NP_058817.1 UniGene:Rn.34875 ProteinModelPortal:Q63088
            SMR:Q63088 PRIDE:Q63088 GeneID:29174 KEGG:rno:29174 NextBio:608244
            Genevestigator:Q63088 Uniprot:Q63088
        Length = 334

 Score = 514 (186.0 bits), Expect = 2.5e-49, P = 2.5e-49
 Identities = 123/337 (36%), Positives = 180/337 (53%)

Query:    17 GIVEGFDFHEKELESEEGLWDLYERWRSHHTVSRS-LDEKHKRFNVFKQNVMHVHQTNKM 75
             G+  G    +  L++E   W   + W++ +  S S ++E+ KR  V+++N+  +   NK 
Sbjct:    13 GVASGAPARDPNLDAE---W---QDWKTKYAKSYSPVEEELKRA-VWEENLKMIQLHNKE 65

Query:    76 D----KPYKLKLNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVD 131
             +      + +++N FAD T  EF  +   S I    +      N +        +P   D
Sbjct:    66 NGLGKNGFTMEMNAFADTTGEEFRKSL--SDI----LIPAAVTNPSAQKQVSIGLPNFKD 119

Query:   132 WRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCN 190
             WRK+G VT V++QG+CGSCWAF+ + A+EG     T  L  LS Q L+DC   + N GC 
Sbjct:   120 WRKEGYVTPVRNQGKCGSCWAFAAVGAIEGQMFSKTGNLTPLSVQNLLDCSKSEGNNGCR 179

Query:   191 GGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLK 250
              G    AF ++ K  G+  EA YPY+  DG C    E++ A +I G  N+P N E  L  
Sbjct:   180 WGTAHQAFNYVLKNKGLEAEATYPYEGKDGPCRYHSENASA-NITGFVNLPPN-ELYLWV 237

Query:   251 AVAK-QPVSVAIDAGSSDFQFYSEGVF-TGECGTEL-NHGVAAVGYG---TTLDGTKYWI 304
             AVA   PVS AIDA    F+FYS GV+    C + + NH V  VGYG      DG  YW+
Sbjct:   238 AVASIGPVSAAIDASHDSFRFYSGGVYHEPNCSSYVVNHAVLVVGYGFEGNETDGNNYWL 297

Query:   305 VRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYP 341
             ++NSWG EWG  G++++ +   D+   CGIA +AS+P
Sbjct:   298 IKNSWGEEWGINGFMKIAK---DRNNHCGIASQASFP 331


>MGI|MGI:1861723 [details] [associations]
            symbol:Ctsr "cathepsin R" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=ISA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=ISA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0030163 "protein
            catabolic process" evidence=ISA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1861723 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0030163
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF245399
            EMBL:AY014778 EMBL:AK014432 EMBL:AK005429 IPI:IPI00120321
            RefSeq:NP_064680.1 UniGene:Mm.315715 ProteinModelPortal:Q9JIA9
            SMR:Q9JIA9 MEROPS:C01.042 PRIDE:Q9JIA9 Ensembl:ENSMUST00000021889
            GeneID:56835 KEGG:mmu:56835 CTD:56835 InParanoid:Q9JIA9 KO:K09601
            OMA:ASHESFK OrthoDB:EOG4ZCT6D NextBio:313379 Bgee:Q9JIA9
            CleanEx:MM_CTSR Genevestigator:Q9JIA9 GermOnline:ENSMUSG00000055679
            Uniprot:Q9JIA9
        Length = 334

 Score = 513 (185.6 bits), Expect = 3.2e-49, P = 3.2e-49
 Identities = 125/338 (36%), Positives = 183/338 (54%)

Query:    17 GIVEGFDFHEKELESEEGLWDLYERWRSHHTVSRSL-DEKHKRFNVFKQNV--MHVH-QT 72
             G+  G    +  L++E   W   + W+  +  S SL +EK KR  V+++ +  + +H + 
Sbjct:    13 GVASGVPVLDSSLDAE---W---QDWKIKYNKSYSLKEEKLKRV-VWEEKLKMIKLHNRE 65

Query:    73 NKMDKP-YKLKLNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPS-V 130
             N + K  + +K+N+F D T+ EF        +  HR  +G     + M  +  SI P  V
Sbjct:    66 NSLGKNGFTMKMNEFGDQTDEEFRKMMIEISVWTHR--EGK----SIMKREAGSILPKFV 119

Query:   131 DWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGC 189
             DWRKKG VT V+ QG C +CWAF+   A+E      T KL  LS Q LVDC   Q N GC
Sbjct:   120 DWRKKGYVTPVRRQGDCDACWAFAVTGAIEAQAIWQTGKLTPLSVQNLVDCSKPQGNNGC 179

Query:   190 NGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALL 249
              GG    AF+++   GG+ +EA YPY+  DG C  + ++S A  I G  ++P + ED L+
Sbjct:   180 LGGDTYNAFQYVLHNGGLESEATYPYEGKDGPCRYNPKNSKA-EITGFVSLPQS-EDILM 237

Query:   250 KAVAK-QPVSVAIDAGSSDFQFYSEGVF-TGECGTE-LNHGVAAVGYG---TTLDGTKYW 303
              AVA   P++  IDA    F+ Y  G++    C ++ + HGV  VGYG      DG  YW
Sbjct:   238 AAVATIGPITAGIDASHESFKNYKGGIYHEPNCSSDTVTHGVLVVGYGFKGIETDGNHYW 297

Query:   304 IVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYP 341
             +++NSWG  WG +GY+++ +   DK   CGIA  A YP
Sbjct:   298 LIKNSWGKRWGIRGYMKLAK---DKNNHCGIASYAHYP 332


>UNIPROTKB|E9PTT3 [details] [associations]
            symbol:Ctsr "Protein Ctsr" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            IPI:IPI00627092 Ensembl:ENSRNOT00000024115 RGD:631422
            Uniprot:E9PTT3
        Length = 334

 Score = 511 (184.9 bits), Expect = 5.2e-49, P = 5.2e-49
 Identities = 114/310 (36%), Positives = 175/310 (56%)

Query:    43 RSHHTVSRSLDEKHKRFNVFKQNV--MHVH-QTNKMDKP-YKLKLNKFADMTNHEFASTY 98
             ++ +  S +++E+  R  V+++N+  + +H + N + K  + +++N+F D+T  EF    
Sbjct:    33 KTEYEKSYTMEEEGHRRAVWEENMKMIKLHNRENSLGKNGFIMEMNEFGDLTAEEFRKMM 92

Query:    99 AGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAA 158
                 I+ HR  +  R       G V  +P  VDWRKKG VT V++Q  C SCWAF+   A
Sbjct:    93 VNIPIRSHRKGKIIRKRDV---GNV--LPKFVDWRKKGYVTRVQNQKFCNSCWAFAVTGA 147

Query:   159 VEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEAKYPYQA 217
             +EG     T +L  LS Q LVDC   Q N+GC  G   +A+E++   GG+  EA YPY+ 
Sbjct:   148 IEGQMFNKTGQLTPLSVQNLVDCTKSQGNEGCQWGDPHIAYEYVLNNGGLEAEATYPYKG 207

Query:   218 NDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDFQFYSEGVF 276
              +G C  + + S A  I G  ++P + ED L++AVA   P+SVA+DA  + F FY +G++
Sbjct:   208 KEGVCRYNPKHSKA-EITGFVSLPES-EDILMEAVATIGPISVAVDASFNSFGFYKKGLY 265

Query:   277 TG-ECGTE-LNHGVAAVGYG---TTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGL 331
                 C    +NH V  VGYG      DG  YW+++NSWG +WG +GY+++ +   D+   
Sbjct:   266 DEPNCSNNTVNHSVLVVGYGFEGNETDGNSYWLIKNSWGRKWGLRGYMKIPK---DQNNF 322

Query:   332 CGIAMEASYP 341
             C IA  A YP
Sbjct:   323 CAIASYAHYP 332


>UNIPROTKB|E9PSK9 [details] [associations]
            symbol:Ctsql2 "Protein Ctsql2" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            IPI:IPI00562656 Ensembl:ENSRNOT00000045847 RGD:1303225
            ArrayExpress:E9PSK9 Uniprot:E9PSK9
        Length = 342

 Score = 509 (184.2 bits), Expect = 8.5e-49, P = 8.5e-49
 Identities = 122/342 (35%), Positives = 187/342 (54%)

Query:    17 GIVEGFDFHEKELESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNV--MHVH-QTN 73
             G+V G       L+ +   W   + W+  +    S +E+  +  V+++NV  + +H + N
Sbjct:    13 GVVSGASAFNLSLDVQ---W---QEWKMKYEKLYSPEEELLKRVVWEENVKKIELHNREN 66

Query:    74 KMDK-PYKLKLNKFADMTNHEFASTYAGSKIKHHRMFQGT--RGNGT-F---MYGKVTSI 126
              + K  Y +++N FAD+T+ EF     G  +  +   +    R  G+ F    Y +  ++
Sbjct:    67 SLGKNTYIMEINNFADLTDEEFKDMITGITLPINNTMKSLWKRALGSPFPNSWYWR-DAL 125

Query:   127 PPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ- 185
             P S+DWRK+G VT V++QG+C SCWAF    A+EG     T KL  LS Q LVDC   Q 
Sbjct:   126 PKSIDWRKEGYVTRVREQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCSKPQG 185

Query:   186 NQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHE 245
             N+GC GG    AF+++ + GG+ +EA YPY+  +G C  + +++ A  I     +P + E
Sbjct:   186 NKGCRGGTTYNAFQYVLQNGGLESEATYPYKGKEGLCKYNPKNAYA-KITRFVALPED-E 243

Query:   246 DALLKAVA-KQPVSVAIDAGSSDFQFYSEGVF-TGECGTELNHGVAAVGYG---TTLDGT 300
             D L+ A+A K PV+  I    S F F S G++   +C   +NH V  VGYG      DG 
Sbjct:   244 DVLMDALATKGPVAAGIHVVYSYFHFVS-GIYHEPKCNNRVNHAVLVVGYGFEGNETDGN 302

Query:   301 KYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPI 342
              YW+++NSWG +WG KGY+++ +   D+   CGIA  A YPI
Sbjct:   303 NYWLIKNSWGKQWGLKGYMKIAK---DRNNHCGIATFAQYPI 341


>TAIR|locus:2050145 [details] [associations]
            symbol:AT2G21430 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002685 GenomeReviews:CT485783_GR
            EMBL:AC006841 EMBL:X74359 IPI:IPI00519637 PIR:B84601
            RefSeq:NP_565512.1 UniGene:At.14069 ProteinModelPortal:P43295
            SMR:P43295 MEROPS:C01.A04 PRIDE:P43295 EnsemblPlants:AT2G21430.1
            GeneID:816682 KEGG:ath:AT2G21430 TAIR:At2g21430 eggNOG:COG4870
            HOGENOM:HOG000230774 InParanoid:P43295 KO:K01373 OMA:GSIEEHY
            PhylomeDB:P43295 ProtClustDB:CLSN2688311 Genevestigator:P43295
            GermOnline:AT2G21430 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 Uniprot:P43295
        Length = 361

 Score = 507 (183.5 bits), Expect = 1.4e-48, P = 1.4e-48
 Identities = 120/328 (36%), Positives = 173/328 (52%)

Query:    23 DFHEKELESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLK 82
             D  E ++ S E  + L+++      V  S++E + RF+VFK N++   +  KMD   +  
Sbjct:    35 DETEPKVLSSEDHFTLFKK--KFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHG 92

Query:    83 LNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVK 142
             + +F+D+T  EF   + G K      F+  +           ++P   DWR +G+VT VK
Sbjct:    93 VTQFSDLTRSEFRRKHLGVK----GGFKLPKDANQAPILPTQNLPEEFDWRDRGAVTPVK 148

Query:   143 DQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCD--TDQNQ------GCNGGLM 194
             +QG CGSCW+FST  A+EG + + T KLVSLSEQ+LVDCD   D  +      GCNGGLM
Sbjct:   149 NQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLM 208

Query:   195 ELAFEFIKKKGGVTTEAKYPYQANDG-TCDVSKESSPAVSIDGHENVPANHEDALLKAVA 253
               AFE+  K GG+  E  YPY   DG +C + + S    S+     V  N +      + 
Sbjct:   209 NSAFEYTLKTGGLMREKDYPYTGTDGGSCKLDR-SKIVASVSNFSVVSINEDQIAANLIK 267

Query:   254 KQPVSVAIDAGSSDFQFYSEGVFTGE-CGTELNHGVAAVGYGTT------LDGTKYWIVR 306
               P++VAI+A     Q Y  GV     C   LNHGV  VGYG+       L    YWI++
Sbjct:   268 NGPLAVAINAAY--MQTYIGGVSCPYICSRRLNHGVLLVGYGSAGFSQARLKEKPYWIIK 325

Query:   307 NSWGPEWGEKGYIRMQRGISDKKGLCGI 334
             NSWG  WGE G+ ++ +G    + +CG+
Sbjct:   326 NSWGESWGENGFYKICKG----RNICGV 349


>UNIPROTKB|H9KYW5 [details] [associations]
            symbol:CTSS "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0002250 "adaptive immune response" evidence=IEA]
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 OMA:YEPACTQ EMBL:AADN02010496
            Ensembl:ENSGALT00000001122 Uniprot:H9KYW5
        Length = 245

 Score = 505 (182.8 bits), Expect = 2.3e-48, P = 2.3e-48
 Identities = 108/230 (46%), Positives = 138/230 (60%)

Query:   115 NGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLS 174
             N T  Y +    P ++DWR+KG VT VK+QG CG+CWAFS + A+E    + T KLVSLS
Sbjct:    19 NQTSTYRRRGGAPDAMDWREKGCVTEVKNQGACGACWAFSAVGALEAQVKLKTGKLVSLS 78

Query:   175 EQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVS 233
              Q LVDC     N+GC GG M  AF++I    G+ +E  YPY A +GTC  +  S+ A +
Sbjct:    79 AQNLVDCSMMYGNKGCGGGFMTRAFQYIIDNNGIDSEESYPYMAQNGTCQYNV-STRAAT 137

Query:   234 IDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFTG-ECGTELNHGVAAV 291
                +  +P   E AL  AVA   PVSVAIDA    F  Y  GV+    C  E+NHGV  V
Sbjct:   138 CSKYVELPYADEAALKDAVANVGPVSVAIDATQPTFFLYRSGVYDDPRCTQEVNHGVLVV 197

Query:   292 GYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYP 341
             GYGT L+   +W+V+NSWG  +G+ GYIRM R  ++    CGIA  ASYP
Sbjct:   198 GYGT-LNEKDFWLVKNSWGERFGDGGYIRMSRNHANH---CGIASYASYP 243


>UNIPROTKB|G3V9F8 [details] [associations]
            symbol:Ctsm "RCG24133" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015645 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 EMBL:CH474032
            PANTHER:PTHR12411:SF58 Ensembl:ENSRNOT00000045830 RGD:631420
            Uniprot:G3V9F8
        Length = 333

 Score = 500 (181.1 bits), Expect = 7.7e-48, P = 7.7e-48
 Identities = 115/314 (36%), Positives = 176/314 (56%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNVMHV--HQ-TNKMDKP-YKLKLNKFADMTNHEF 94
             +++W+  +  + SL+E+ ++  V+++N+  +  H   N + K  + +++N F DMT  EF
Sbjct:    29 WQKWKIKYEKTYSLEEEGQKRAVWEENMKKIKLHNGENGLGKHGFTMEMNAFGDMTIEEF 88

Query:    95 ASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFS 154
                    K+         +   +    +  ++P  ++WRK+G VT V+ QG+C  CWAFS
Sbjct:    89 R------KLMIEIPIPTVKKENSVQKRQAVNVPNFINWRKRGYVTPVRRQGRCNVCWAFS 142

Query:   155 TIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEAKY 213
                A+EG     T +L+ LS Q LVDC   Q N GC  G   LA +++K+ GG+ +EA Y
Sbjct:   143 VAGAIEGQMFQKTGQLIPLSVQNLVDCSRPQGNLGCYLGNTYLALQYVKENGGLESEATY 202

Query:   214 PYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYS 272
             PY+  +G+C    ++S A SI   E VP N EDAL+ AVA   P+SVAIDA    F FY 
Sbjct:   203 PYEEKEGSCRYHPDNSTA-SITDFEFVPKN-EDALMNAVATLGPISVAIDARHESFLFYR 260

Query:   273 EGVF-TGECGTEL-NHGVAAVGYGTT---LDGTKYWIVRNSWGPEWGEKGYIRMQRGISD 327
              G++    C + +  H +  VGYG      DG KYWI++NS G +WG +GY+++ +   D
Sbjct:   261 NGIYHEPNCSSSVVTHAMLLVGYGFVGEESDGRKYWILKNSMGNKWGNRGYMKIAK---D 317

Query:   328 KKGLCGIAMEASYP 341
             +   CGIA  A YP
Sbjct:   318 QGNHCGIATYALYP 331


>UNIPROTKB|F1NHB8 [details] [associations]
            symbol:F1NHB8 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 EMBL:AADN02044011
            IPI:IPI00586027 Ensembl:ENSGALT00000021873 OMA:SELDHAV
            Uniprot:F1NHB8
        Length = 329

 Score = 498 (180.4 bits), Expect = 1.2e-47, P = 1.2e-47
 Identities = 119/316 (37%), Positives = 168/316 (53%)

Query:    38 LYERWRSHHTVSRSLDEKHK-RFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFAS 96
             L+  ++       S +E+H+ R   F  N+  VH  N+    Y L LN  AD T  E A+
Sbjct:    25 LFHHYKERFGKRYSSEEEHEHRKRTFIHNMRFVHSKNRAALSYSLALNHLADRTPQEMAA 84

Query:    97 TYAGSKIKHHRMFQGTRGNGTF---MYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAF 153
                   ++  R     +    F   +Y  +  +P S+DWR  G+VT VKDQ  CGSCW+F
Sbjct:    85 ------LRGRRRSGDPKSGQPFSMQLYASLV-LPESLDWRLYGAVTPVKDQAVCGSCWSF 137

Query:   154 STIAAVEGINHIMTNKLVSLSEQELVDCDTD-QNQGCNGGLMELAFEFIKKKGGV-TTEA 211
             +T  A+EG   + T  L  LS+Q L+DC     N  C+GG    A+E+IKK GG+ +TE+
Sbjct:   138 ATTGAMEGALFLKTGVLTPLSQQVLIDCSWGFGNYACDGGEEWRAYEWIKKHGGIASTES 197

Query:   212 KYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQF 270
               PY   +G C  + +S     + G+  V + + +AL  A+ K  PV+V IDA    F F
Sbjct:   198 YGPYLGQNGYCHYN-QSELVAPLAGYVTVESGNAEALKAALFKHGPVAVNIDASHKSFTF 256

Query:   271 YSEGVFTG-ECG---TELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGIS 326
             Y+ GV+    CG   +EL+H V AVGYG  L G  YW+++NSW   WG  GYI M   + 
Sbjct:   257 YANGVYEEPHCGNETSELDHAVLAVGYGV-LHGKSYWLIKNSWSTYWGNDGYILM--AMK 313

Query:   327 DKKGLCGIAMEASYPI 342
             D    CG+A  AS+PI
Sbjct:   314 DNN--CGVATAASFPI 327


>FB|FBgn0260462 [details] [associations]
            symbol:CG12163 species:7227 "Drosophila melanogaster"
            [GO:0035071 "salivary gland cell autophagic cell death"
            evidence=IEP] [GO:0048102 "autophagic cell death" evidence=IEP]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0004869 "cysteine-type
            endopeptidase inhibitor activity" evidence=IEA] [GO:0045169
            "fusome" evidence=IDA] [GO:0035220 "wing disc development"
            evidence=IGI] [GO:0022416 "chaeta development" evidence=IGI]
            InterPro:IPR000010 InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00043 SMART:SM00645
            InterPro:IPR000169 EMBL:AE014297 GO:GO:0004869 eggNOG:COG4870
            KO:K01373 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0022416 GO:GO:0035220 GO:GO:0035071
            GO:GO:0045169 GeneTree:ENSGT00660000095458 EMBL:AY121614
            EMBL:BT003231 RefSeq:NP_649521.1 RefSeq:NP_730901.1
            RefSeq:NP_730902.2 UniGene:Dm.7315 ProteinModelPortal:Q9VN93
            SMR:Q9VN93 DIP:DIP-17491N IntAct:Q9VN93 MINT:MINT-763966
            STRING:Q9VN93 MEROPS:C01.A27 PaxDb:Q9VN93
            EnsemblMetazoa:FBtr0078823 GeneID:40628 KEGG:dme:Dmel_CG12163
            UCSC:CG12163-RA FlyBase:FBgn0260462 InParanoid:Q9VN93 OMA:GPRWGEQ
            OrthoDB:EOG4CC2G9 PhylomeDB:Q9VN93 GenomeRNAi:40628 NextBio:819744
            Bgee:Q9VN93 GermOnline:CG12163 Uniprot:Q9VN93
        Length = 614

 Score = 495 (179.3 bits), Expect = 2.6e-47, P = 2.6e-47
 Identities = 112/300 (37%), Positives = 167/300 (55%)

Query:    51 SLDEKHKRFNVFKQNVMHVHQTNKMDK-PYKLKLNKFADMTNHEFASTYAGSKIKHHRMF 109
             S  E+  R  +F+QN+  + + N  +    K  + +FADMT+ E+      + +      
Sbjct:   321 STAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSEYKER---TGLWQRDEA 377

Query:   110 QGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNK 169
             + T G+   +      +P   DWR+K +VT VK+QG CGSCWAFS    +EG+  + T +
Sbjct:   378 KATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCGSCWAFSVTGNIEGLYAVKTGE 437

Query:   170 LVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESS 229
             L   SEQEL+DCDT  +  CNGGLM+ A++ IK  GG+  EA+YPY+A    C  ++  S
Sbjct:   438 LKEFSEQELLDCDTTDS-ACNGGLMDNAYKAIKDIGGLEYEAEYPYKAKKNQCHFNRTLS 496

Query:   230 PAVSIDGHENVPANHEDALLK-AVAKQPVSVAIDAGSSDFQFYSEGV---FTGECGTE-L 284
               V + G  ++P  +E A+ +  +A  P+S+ I+A +   QFY  GV   +   C  + L
Sbjct:   497 H-VQVAGFVDLPKGNETAMQEWLLANGPISIGINANA--MQFYRGGVSHPWKALCSKKNL 553

Query:   285 NHGVAAVGYGTT----LDGT-KYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEAS 339
             +HGV  VGYG +       T  YWIV+NSWGP WGE+GY R+ RG       CG++  A+
Sbjct:   554 DHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYRVYRG----DNTCGVSEMAT 609


>MGI|MGI:1860262 [details] [associations]
            symbol:Cts7 "cathepsin 7" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005576
            "extracellular region" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005768 "endosome" evidence=IEA]
            [GO:0005794 "Golgi apparatus" evidence=IEA] [GO:0006508
            "proteolysis" evidence=ISA] [GO:0007049 "cell cycle" evidence=IEA]
            [GO:0007067 "mitosis" evidence=IEA] [GO:0008152 "metabolic process"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=ISA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0051301 "cell
            division" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:1860262 GO:GO:0005634 GO:GO:0005794 GO:GO:0048471
            GO:GO:0005615 GO:GO:0051301 GO:GO:0007067 GO:GO:0005768
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GO:GO:0008233 EMBL:CH466546
            EMBL:AY014779 EMBL:CT030645 EMBL:BC064740 EMBL:AF250837
            IPI:IPI00131132 RefSeq:NP_062412.1 UniGene:Mm.3692 HSSP:O60911
            ProteinModelPortal:Q91ZF2 SMR:Q91ZF2 STRING:Q91ZF2 MEROPS:C01.016
            PRIDE:Q91ZF2 Ensembl:ENSMUST00000021892 GeneID:56092 KEGG:mmu:56092
            UCSC:uc007qwi.1 CTD:56092 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 InParanoid:Q91ZF2 OMA:ERRVIWE OrthoDB:EOG44QT2S
            NextBio:311908 Bgee:Q91ZF2 Genevestigator:Q91ZF2 Uniprot:Q91ZF2
        Length = 331

 Score = 494 (179.0 bits), Expect = 3.3e-47, P = 3.3e-47
 Identities = 114/314 (36%), Positives = 168/314 (53%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNV----MHVHQTNKMDKPYKLKLNKFADMTNHEF 94
             +E W+  +  + S +E+ +R  V++ NV     H+ +       + +++N+F DMT  E 
Sbjct:    29 WEEWKRSNDRTYSPEEEKQRRAVWEGNVKWIKQHIMENGLWMNNFTIEMNEFGDMTGEEM 88

Query:    95 ASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFS 154
                   S             NG  +  +   IPP++DWRK+G VT V+ QG CG+CWAFS
Sbjct:    89 KMLTESSSYPLR--------NGKHIQKRNPKIPPTLDWRKEGYVTPVRRQGSCGACWAFS 140

Query:   155 TIAAVEGINHIMTNKLVSLSEQELVDCDTDQN-QGCNGGLMELAFEFIKKKGGVTTEAKY 213
               A +EG     T KL+ LS Q L+DC      +GC+GG    AF+++K  GG+  EA Y
Sbjct:   141 VTACIEGQLFKKTGKLIPLSVQNLMDCSVSYGTKGCDGGRPYDAFQYVKNNGGLEAEATY 200

Query:   214 PYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKA-VAKQPVSVAIDAGSSDFQFYS 272
             PY+A    C    E S  V ++    VP N E+ALL+A V   P++VAID   + F  Y 
Sbjct:   201 PYEAKAKHCRYRPERS-VVKVNRFFVVPRN-EEALLQALVTHGPIAVAIDGSHASFHSYR 258

Query:   273 EGVF-TGECGTE-LNHGVAAVGYGTT---LDGTKYWIVRNSWGPEWGEKGYIRMQRGISD 327
              G++   +C  + L+HG+  VGYG      +  KYW+++NS G  WGE GY+++ RG   
Sbjct:   259 GGIYHEPKCRKDTLDHGLLLVGYGYEGHESENRKYWLLKNSHGERWGENGYMKLPRG--- 315

Query:   328 KKGLCGIAMEASYP 341
             +   CGIA  A YP
Sbjct:   316 QNNYCGIASYAMYP 329


>TAIR|locus:2120222 [details] [associations]
            symbol:RD19 "RESPONSIVE TO DEHYDRATION 19" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009269 "response to desiccation" evidence=IEP] [GO:0006970
            "response to osmotic stress" evidence=IGI] [GO:0005634 "nucleus"
            evidence=IDA] [GO:0005773 "vacuole" evidence=IDA] [GO:0042742
            "defense response to bacterium" evidence=IMP] [GO:0006096
            "glycolysis" evidence=RCA] [GO:0006833 "water transport"
            evidence=RCA] [GO:0006972 "hyperosmotic response" evidence=RCA]
            [GO:0007030 "Golgi organization" evidence=RCA] [GO:0009266
            "response to temperature stimulus" evidence=RCA] [GO:0009651
            "response to salt stress" evidence=IEP;RCA] [GO:0046686 "response
            to cadmium ion" evidence=RCA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISS] [GO:0009414 "response to
            water deprivation" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005634 GO:GO:0005773 EMBL:CP002687
            GenomeReviews:CT486007_GR GO:GO:0009651 GO:GO:0042742
            eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            ProtClustDB:CLSN2688311 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AL035679 EMBL:AL161594 GO:GO:0004197
            MEROPS:C01.022 EMBL:D13042 EMBL:AY080598 EMBL:AY133844
            IPI:IPI00544363 PIR:JN0718 RefSeq:NP_568052.1 UniGene:At.2850
            UniGene:At.74924 ProteinModelPortal:P43296 SMR:P43296 STRING:P43296
            PaxDb:P43296 PRIDE:P43296 EnsemblPlants:AT4G39090.1 GeneID:830064
            KEGG:ath:AT4G39090 TAIR:At4g39090 InParanoid:P43296 OMA:EDFDWRD
            PhylomeDB:P43296 Genevestigator:P43296 GermOnline:AT4G39090
            Uniprot:P43296
        Length = 368

 Score = 494 (179.0 bits), Expect = 3.3e-47, P = 3.3e-47
 Identities = 119/325 (36%), Positives = 165/325 (50%)

Query:    26 EKELESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNK 85
             E ++ + E  + L++R      V  S +E   RF+VFK N+    +  K+D      + +
Sbjct:    41 EPQVLTSEDHFSLFKR--KFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHGVTQ 98

Query:    86 FADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQG 145
             F+D+T  EF   + G +      F+  +           ++P   DWR  G+VT VK+QG
Sbjct:    99 FSDLTRSEFRKKHLGVRSG----FKLPKDANKAPILPTENLPEDFDWRDHGAVTPVKNQG 154

Query:   146 QCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCD-------TDQ-NQGCNGGLMELA 197
              CGSCW+FS   A+EG N + T KLVSLSEQ+LVDCD        D  + GCNGGLM  A
Sbjct:   155 SCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSA 214

Query:   198 FEFIKKKGGVTTEAKYPYQANDG-TCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQP 256
             FE+  K GG+  E  YPY   DG TC + K S    S+     +  + E      V   P
Sbjct:   215 FEYTLKTGGLMKEEDYPYTGKDGKTCKLDK-SKIVASVSNFSVISIDEEQIAANLVKNGP 273

Query:   257 VSVAIDAGSSDFQFYSEGVFTGE-CGTELNHGVAAVGYGTT------LDGTKYWIVRNSW 309
             ++VAI+AG    Q Y  GV     C   LNHGV  VGYG             YWI++NSW
Sbjct:   274 LAVAINAGY--MQTYIGGVSCPYICTRRLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSW 331

Query:   310 GPEWGEKGYIRMQRGISDKKGLCGI 334
             G  WGE G+ ++ +G    + +CG+
Sbjct:   332 GETWGENGFYKICKG----RNICGV 352


>FB|FBgn0250848 [details] [associations]
            symbol:26-29-p "26-29kD-proteinase" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005811
            "lipid particle" evidence=IDA] [GO:0005875 "microtubule associated
            complex" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005875 EMBL:AE014296 GO:GO:0005811 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 MEROPS:I29.003 HSSP:O65039
            EMBL:AY122222 EMBL:AB011376 RefSeq:NP_620470.1 UniGene:Dm.3049
            SMR:Q9V3U6 MINT:MINT-890485 STRING:Q9V3U6
            EnsemblMetazoa:FBtr0075766 GeneID:39547 KEGG:dme:Dmel_CG8947
            UCSC:CG8947-RA CTD:39547 FlyBase:FBgn0250848 InParanoid:Q9V3U6
            OMA:IHSKNRA OrthoDB:EOG4BVQ8T GenomeRNAi:39547 NextBio:814210
            Uniprot:Q9V3U6
        Length = 549

 Score = 492 (178.3 bits), Expect = 5.4e-47, P = 5.4e-47
 Identities = 122/323 (37%), Positives = 168/323 (52%)

Query:    31 SEEGLWDLYERWRSHHTVSRSLDEKHK-RFNVFKQNVMHVHQTNKMDKPYKLKLNKFADM 89
             ++E +   +  ++  H V+   D +H+ R N+F+QN+ ++H  N+    Y L +N  AD 
Sbjct:   237 TDEHVDKAFHHFKRKHGVAYHSDTEHEHRKNIFRQNLRYIHSKNRAKLTYTLAVNHLADK 296

Query:    90 TNHEFASTYAGSKIKHHRMFQGTRGNGT-FMYG--KVTS-IPPSVDWRKKGSVTAVKDQG 145
             T  E        K +      G    G  F Y   K    IP   DWR  G+VT VKDQ 
Sbjct:   297 TEEEL-------KARRGYKSSGIYNTGKPFPYDVPKYKDEIPDQYDWRLYGAVTPVKDQS 349

Query:   146 QCGSCWAFSTIAAVEGINHIMTN-KLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKK 203
              CGSCW+F TI  +EG   +     LV LS+Q L+DC     N GC+GG     ++++ +
Sbjct:   350 VCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDCSWAYGNNGCDGGEDFRVYQWMLQ 409

Query:   204 KGGVTTEAKY-PYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAI 261
              GGV TE +Y PY   DG C V+  +  A  I G  NV +N  +A   A+ K  P+SVAI
Sbjct:   410 SGGVPTEEEYGPYLGQDGYCHVNNVTLVA-PIKGFVNVTSNDPNAFKLALLKHGPLSVAI 468

Query:   262 DAGSSDFQFYSEGVF-TGECGTE---LNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKG 317
             DA    F FYS GV+    C  +   L+H V AVGYG+ ++G  YW+V+NSW   WG  G
Sbjct:   469 DASPKTFSFYSHGVYYEPTCKNDVDGLDHAVLAVGYGS-INGEDYWLVKNSWSTYWGNDG 527

Query:   318 YIRMQRGISDKKGLCGIAMEASY 340
             YI M    S KK  CG+    +Y
Sbjct:   528 YILM----SAKKNNCGVMTMPTY 546


>RGD|1309226
            symbol:Cts7 "cathepsin 7" species:10116 "Rattus norvegicus"
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005768 "endosome" evidence=IEA] [GO:0005794 "Golgi apparatus"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0007067
            "mitosis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IEA] [GO:0051301 "cell division" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 RGD:1309226 GO:GO:0005634
            GO:GO:0005794 GO:GO:0048471 GO:GO:0005615 GO:GO:0051301
            GO:GO:0007067 GO:GO:0005768 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 MEROPS:C01.016 CTD:56092
            GeneTree:ENSGT00560000076577 OrthoDB:EOG44QT2S EMBL:CH474032
            IPI:IPI00870531 RefSeq:NP_001099569.1 UniGene:Rn.218615
            Ensembl:ENSRNOT00000043686 GeneID:290970 KEGG:rno:290970
            UCSC:RGD:1309226 OMA:VESFNAN Uniprot:D3ZZ07
        Length = 331

 Score = 485 (175.8 bits), Expect = 3.0e-46, P = 3.0e-46
 Identities = 113/315 (35%), Positives = 170/315 (53%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNVM----HVHQTNKMDKPYKLKLNKFADMTNHEF 94
             +E W+ ++  + S +E+ +R  V+++NV     H  Q       + +++N+F DMT  E 
Sbjct:    29 WEEWKRNNAKTYSPEEEKQRRAVWEENVKMIKWHTMQNGLWMNNFTIEMNEFGDMTGEEM 88

Query:    95 ASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFS 154
                   S +        T  NG  +  +   IP ++DWR  G V  V+ QG CG+CWAFS
Sbjct:    89 RMMTDSSAL--------TLRNGKHIQKRNVKIPKTLDWRDTGCVAPVRSQGGCGACWAFS 140

Query:   155 TIAAVEGINHIMTNKLVSLSEQELVDCD-TDQNQGCNGGLMELAFEFIKKKGGVTTEAKY 213
               A++E      T KL+ LS Q L+DC  T  N  C+GG    AF+++K  GG+  EA Y
Sbjct:   141 VAASIESQLFKKTGKLIPLSVQNLIDCTVTYGNNDCSGGKPYTAFQYVKNNGGLEAEATY 200

Query:   214 PYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKA-VAKQPVSVAIDAGSSDFQFYS 272
             PY+A    C    E S  V I     VP N E+AL++A V   P++VAID   + F+ Y 
Sbjct:   201 PYEAKLRHCRYRPERS-VVKIARFFVVPRN-EEALMQALVTYGPIAVAIDGSHASFKRYR 258

Query:   273 EGVF-TGECGTE-LNHGVAAVGYGTT---LDGTKYWIVRNSWGPEWGEKGYIRMQRGISD 327
              G++   +C  + L+HG+  VGYG      +  KYW+++NS G +WGE+GY+++ R   D
Sbjct:   259 GGIYHEPKCRRDTLDHGLLLVGYGYEGHESENRKYWLLKNSHGEQWGERGYMKLPR---D 315

Query:   328 KKGLCGIAMEASYPI 342
             +   CGIA  A YP+
Sbjct:   316 QNNYCGIASYAMYPL 330


>DICTYBASE|DDB_G0282991
            symbol:DDB_G0282991 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0282991 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            EMBL:AAFI02000049 eggNOG:NOG331187 RefSeq:XP_639299.1
            ProteinModelPortal:Q54RQ2 EnsemblProtists:DDB0185304 GeneID:8623870
            KEGG:ddi:DDB_G0282991 InParanoid:Q54RQ2 OMA:PENGNEY Uniprot:Q54RQ2
        Length = 339

 Score = 484 (175.4 bits), Expect = 3.8e-46, P = 3.8e-46
 Identities = 112/313 (35%), Positives = 173/313 (55%)

Query:    37 DLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFAS 96
             +L+  W + +    S  E + RFN FK+N  +V Q N+      L+LN FAD++ +E+ +
Sbjct:    25 NLFIEWTNKYNKIYSNKEFYMRFNNFKKNKEYVDQWNEKQLETILELNFFADLSRNEYIN 84

Query:    97 TYAGSKIKHHRMFQ-GTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQC-GSCWAFS 154
              Y  S I    + Q  T+  G        SI  S+DWR   +VT VK+QG C G+ ++FS
Sbjct:    85 NYLASFIDISNIEQKNTKYEGNLKNNFNNSIK-SIDWRNFDAVTPVKNQGLCSGAGYSFS 143

Query:   155 TIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEAKY 213
              I  +E  + I   +L++LSEQ ++DC TD  N GC GGL  +AF++I K+ G+ +E  Y
Sbjct:   144 AIGVIESSHFIKNKELITLSEQNIIDCTTDMGNNGCMGGLALIAFDYIIKQKGIDSEFNY 203

Query:   214 PYQA-------NDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSS 266
             PY+          G C  +   S A SI  +  +   +E+ L +++ K PVSV IDA   
Sbjct:   204 PYEGYLIEPYEGRGRCRYNSFYSKA-SISSYIEIERFNENELTQSLIKSPVSVMIDASQL 262

Query:   267 DFQFYSEGVFTG-ECG-TELNHGVAAVGYGTTLD-GTKYWIVRNSWGPEWGEKGYIRMQR 323
              F  Y  GV+    C  T LNHG+  +G+G T + G +Y+I++NS+G +WG KGYI + R
Sbjct:   263 SFMLYKSGVYKDPSCSSTILNHGILNIGFGVTPENGNEYYILKNSFGSKWGMKGYIYLSR 322

Query:   324 GISDKKGLCGIAM 336
               ++  G+  + +
Sbjct:   323 NFNNHCGISSVGI 335


>ZFIN|ZDB-GENE-050417-107
            symbol:zgc:110239 "zgc:110239" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-050417-107
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 MEROPS:I29.003 OrthoDB:EOG412M56 EMBL:BC092817
            IPI:IPI00503987 RefSeq:NP_001017633.1 UniGene:Dr.39081
            ProteinModelPortal:Q568K7 GeneID:550326 KEGG:dre:550326
            HOGENOM:HOG000007373 HOVERGEN:HBG105018 InParanoid:Q568K7
            NextBio:20879584 ArrayExpress:Q568K7 Uniprot:Q568K7
        Length = 546

 Score = 484 (175.4 bits), Expect = 3.8e-46, P = 3.8e-46
 Identities = 122/298 (40%), Positives = 160/298 (53%)

Query:    54 EKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTYAGSKIKH--HRMFQG 111
             E  +R + F  N+ +VH  N+    + L +N  AD +  E  S   G +  H  HR  Q 
Sbjct:   259 EHEEREHNFVHNIRYVHSMNRAGLSFSLSVNHLADRSQKEL-SMMRGCQRTHKVHRKAQ- 316

Query:   112 TRGNGTFMYGKVTSI--PPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNK 169
                   F   ++ SI  P SVDWR  G+VT VKDQ  CGSCW+F+T   +EG   + T +
Sbjct:   317 -----PFP-SEIRSIATPNSVDWRLYGAVTPVKDQAVCGSCWSFATTGTLEGALFLKTGQ 370

Query:   170 LVSLSEQELVDCDTD-QNQGCNGGLMELAFEFIKKKGGVTTEAKY-PYQANDGTCDVSKE 227
             L SLS+Q LVDC     N GC+GG    AFE+I K GG++T   Y  Y   +G C   K 
Sbjct:   371 LTSLSQQMLVDCTWGFGNNGCDGGEEWRAFEWIMKHGGISTAESYGAYMGMNGLCHYDK- 429

Query:   228 SSPAVSIDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDFQFYSEGVF-TGEC--G-T 282
             SS    + G+ NV +    AL  A+ K  PV+V+IDA    F FYS GV+   EC  G  
Sbjct:   430 SSMVAQLTGYTNVTSGDILALKAAIFKFGPVAVSIDAAHRSFAFYSNGVYYEPECKNGIN 489

Query:   283 ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASY 340
             +L+H V AVGYG  ++   YW+V+NSW   WG  GYI M    S K   CG+A +A Y
Sbjct:   490 DLDHAVLAVGYGI-MNNESYWLVKNSWSSYWGNDGYILM----SMKDNNCGVATDAIY 542


>WB|WBGene00019986
            symbol:R09F10.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            EMBL:FO081137 HSSP:P53634 PIR:D89588 RefSeq:NP_509408.1
            ProteinModelPortal:Q23030 SMR:Q23030 STRING:Q23030 MEROPS:C01.A44
            PaxDb:Q23030 EnsemblMetazoa:R09F10.1 GeneID:181087
            KEGG:cel:CELE_R09F10.1 UCSC:R09F10.1 CTD:181087 WormBase:R09F10.1
            InParanoid:Q23030 OMA:EYPYSAL NextBio:912346 Uniprot:Q23030
        Length = 383

 Score = 481 (174.4 bits), Expect = 7.9e-46, P = 7.9e-46
 Identities = 115/326 (35%), Positives = 175/326 (53%)

Query:    25 HEKE-LESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKL 83
             H+ E L+ E+   D   ++   +T   S++E   R+ +F +NV+      + +    L +
Sbjct:    71 HKMENLKHEQMFNDFILKFDRKYT---SVEEFEYRYQIFLRNVIEFEAEEERNLGLDLDV 127

Query:    84 NKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKD 143
             N+F D T+ E       +K   +  F   +  G+++   V   P S+DWR++G +T +K+
Sbjct:   128 NEFTDWTDEELQKMVQENKYTKYD-FDTPKFEGSYLETGVIR-PASIDWREQGKLTPIKN 185

Query:   144 QGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKK 203
             QGQCGSCWAF+T+A+VE  N I   KLVSLSEQE+VDCD  +N GC+GG    A +F+K+
Sbjct:   186 QGQCGSCWAFATVASVEAQNAIKKGKLVSLSEQEMVDCD-GRNNGCSGGYRPYAMKFVKE 244

Query:   204 KGGVTTEAKYPYQA-NDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAID 262
              G + +E +YPY A     C + KE+   V ID    +  N ED       K PV+  ++
Sbjct:   245 NG-LESEKEYPYSALKHDQCFL-KENDTRVFIDDFRMLSNNEEDIANWVGTKGPVTFGMN 302

Query:   263 AGSSDFQFYSEGVFTG---ECGTELN---HGVAAVGYGTTLDGTKYWIVRNSWGPEWGEK 316
                + +  Y  G+F     +C TE +   H +  +GYG   +   YWIV+NSWG  WG  
Sbjct:   303 VVKAMYS-YRSGIFNPSVEDC-TEKSMGAHALTIIGYGGEGESA-YWIVKNSWGTSWGAS 359

Query:   317 GYIRMQRGISDKKGLCGIAMEASYPI 342
             GY R+ RG++     CG+A     PI
Sbjct:   360 GYFRLARGVNS----CGLANTVVAPI 381


>FB|FBgn0034229
            symbol:CG4847 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0032504
            "multicellular organism reproduction" evidence=IEP] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0005615 "extracellular space"
            evidence=ISM;IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:AE013599 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 GO:GO:0032504 GeneTree:ENSGT00560000076599
            KO:K01371 EMBL:BT099507 RefSeq:NP_725686.1 UniGene:Dm.4677
            SMR:A1ZAU4 IntAct:A1ZAU4 MEROPS:C01.A28 EnsemblMetazoa:FBtr0086935
            GeneID:36973 KEGG:dme:Dmel_CG4847 UCSC:CG4847-RB
            FlyBase:FBgn0034229 InParanoid:A1ZAU4 OMA:GGFQEYA OrthoDB:EOG4J9KFC
            ChiTaRS:CG4847 GenomeRNAi:36973 NextBio:801302 Uniprot:A1ZAU4
        Length = 420

 Score = 479 (173.7 bits), Expect = 1.3e-45, P = 1.3e-45
 Identities = 111/271 (40%), Positives = 146/271 (53%)

Query:    79 YKLKLNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSV 138
             +K  +N FAD+T+ EF S   G K       +    +   +      IP + DWR+ G V
Sbjct:   157 FKQAVNAFADLTHSEFLSQLTGLKRSPEAKARAA-ASLKLVNLPAKPIPDAFDWREHGGV 215

Query:   139 TAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQN---QGCNGGLME 195
             T VK QG CGSCWAF+T  A+EG     T  L +LSEQ LVDC   ++    GC+GG  E
Sbjct:   216 TPVKFQGTCGSCWAFATTGAIEGHTFRKTGSLPNLSEQNLVDCGPVEDFGLNGCDGGFQE 275

Query:   196 LAFEFIKK-KGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAK 254
              AF FI + + GV+ E  YPY  N GTC      S A ++ G   +P   E+ L K VA 
Sbjct:   276 AAFCFIDEVQKGVSQEGAYPYIDNKGTCKYDGSKSGA-TLQGFAAIPPKDEEQLKKVVAT 334

Query:   255 Q-PVSVAIDAGSSDFQFYSEGVFTG-ECGT-ELNHGVAAVGYGTTLDGTKYWIVRNSWGP 311
               PV+ +++ G    + Y+ G++   EC   E NH +  VGYG+   G  YWIV+NSW  
Sbjct:   335 LGPVACSVN-GLETLKNYAGGIYNDDECNKGEPNHSILVVGYGSE-KGQDYWIVKNSWDD 392

Query:   312 EWGEKGYIRMQRGISDKKGLCGIAMEASYPI 342
              WGEKGY R+ RG    K  C IA E SYP+
Sbjct:   393 TWGEKGYFRLPRG----KNYCFIAEECSYPV 419


>DICTYBASE|DDB_G0274385
            symbol:DDB_G0274385 "Cysteine proteinase 1,
            mitochondrial" species:44689 "Dictyostelium discoideum" [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0274385 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 EMBL:AAFI02000012 RefSeq:XP_644301.1
            ProteinModelPortal:Q86KD4 EnsemblProtists:DDB0167535 GeneID:8619729
            KEGG:ddi:DDB_G0274385 InParanoid:Q86KD4 OMA:SICVDAS Uniprot:Q86KD4
        Length = 358

 Score = 478 (173.3 bits), Expect = 1.6e-45, P = 1.6e-45
 Identities = 120/343 (34%), Positives = 176/343 (51%)

Query:    20 EGFDFHEKELESEEGLWDLYERW-RSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDK- 77
             +G+  ++  + S+  + D +  W + H  + +   E   RF+ FK+N+    + N M   
Sbjct:    25 QGYHRNDGIIHSDSSMRDTFNHWAKKHSKIYKDSIEMENRFSNFKENMKKNIELNSMHAG 84

Query:    78 PYKLKLNKFADMTNHEFAS-----TYAG------SKIK-----HHRMFQGTRGNGTFMYG 121
               K + N F+D++  EF++      + G      + IK     HH +  G +       G
Sbjct:    85 KAKFESNGFSDLSEEEFSNFHLNKAFKGKPSHLRNSIKPQPTPHHSLINGYK---EMENG 141

Query:   122 KVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDC 181
              +  +  S+DWRKKG VT VKDQGQCGSC+ FS +  +E       NK + LSEQ+ VDC
Sbjct:   142 DLNELY-SIDWRKKGLVTPVKDQGQCGSCYIFSAVEQIETAWIKAGNKPILLSEQQAVDC 200

Query:   182 DTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVP 241
             D    Q C GG     +E+  + GGV+T A+YPY A DGTC     + P VS   H    
Sbjct:   201 DPYDGQ-CGGGDPYTVYEYFSQVGGVSTNAQYPYTATDGTCVNMSRAVPVVSY--HYVTQ 257

Query:   242 ANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFTGECGTELNHGVAAVGYGTTL--- 297
                E+ L+K +    PVS+ +DA  S +Q YS G+ T  CG  ++H V  VG        
Sbjct:   258 GGDENTLIKTIVNDGPVSICVDA--STWQSYSGGIITTGCGKNIDHCVQVVGLEVDKTDP 315

Query:   298 -DGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEAS 339
              +  +Y+I+RNSWG +WG  GYI +  G SD   LCGI  E++
Sbjct:   316 SNPVQYYIIRNSWGTDWGIDGYIYVATG-SD---LCGITYEST 354


>ZFIN|ZDB-GENE-040426-1583
            symbol:ctssa "cathepsin S, a" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-040426-1583
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            GeneTree:ENSGT00660000095458 EMBL:CR548627 IPI:IPI00491948
            UniGene:Dr.81560 SMR:Q1L8W8 Ensembl:ENSDART00000053638 OMA:RNTREER
            OrthoDB:EOG480HX9 Uniprot:Q1L8W8
        Length = 328

 Score = 475 (172.3 bits), Expect = 3.4e-45, P = 3.4e-45
 Identities = 116/316 (36%), Positives = 165/316 (52%)

Query:    35 LWDLYERWRSHHTVS-RSLDEKHKRFNVFKQNVMHVHQTNKMDK----PYKLKLNKFADM 89
             L + +  W+S H  + R+  E+  R +V+KQN+  +   N+        Y L LN+ +DM
Sbjct:    23 LTNQWTTWKSQHNKTYRNTREERLRRSVWKQNLQDILLHNEAAAVGLHSYTLGLNQLSDM 82

Query:    90 TNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGS 149
             T  E         ++    F     N TF    + ++P  V+W + G V+ V++QG CGS
Sbjct:    83 TADEVNDM--NGLLEED--FPDV--NATFSPPSLQTLPQRVNWTEHGMVSPVQNQGPCGS 136

Query:   150 CWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVT 208
             CWAFS + ++E      T  LV LS Q L+DC     N+GC GG +  AF ++ +  G+ 
Sbjct:   137 CWAFSAVGSLEAQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRAFLYVIQNRGID 196

Query:   209 TEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSD 267
             +   YPY+  +G C  S  S  A    G   VP ++E AL  AVA   PVSV I+A    
Sbjct:   197 SSTFYPYEHKEGVCRYSV-SGRAGYCTGFRIVPRHNEAALQSAVANIGPVSVGINAKLLS 255

Query:   268 FQFYSEGVFTG-ECGTEL-NHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGI 325
             F  Y  G++   +C + L NH V  VGYG+  +G  YW+V+NSWG  WGE GYIRM R  
Sbjct:   256 FHRYRSGIYNDPKCSSALINHAVLVVGYGSE-NGQDYWLVKNSWGTAWGENGYIRMARN- 313

Query:   326 SDKKGLCGIAMEASYP 341
                K +CGI+    YP
Sbjct:   314 ---KNMCGISSFGIYP 326


>UNIPROTKB|F1NT07
            symbol:LOC100857883 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 EMBL:AADN02044012
            EMBL:AADN02044013 EMBL:AADN02044014 IPI:IPI00577314
            Ensembl:ENSGALT00000000192 OMA:IYKHGPV Uniprot:F1NT07
        Length = 317

 Score = 474 (171.9 bits), Expect = 4.4e-45, P = 4.4e-45
 Identities = 117/304 (38%), Positives = 161/304 (52%)

Query:    51 SLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTYAGSKIKHHRMFQ 110
             S  E   R  +F  ++  VH  N+    Y L LN  AD T  E A+      ++  R   
Sbjct:    25 SAREMEHRQRIFAHHMRFVHSKNRAALSYSLALNHLADRTPQEMAA------LRGRRR-S 77

Query:   111 GTRGNGT-FMYGKVTSI--PPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMT 167
             G   +G  F     T I  P S+DWR  G+VT VKDQ  CGSCW+F+T  A+EG   + T
Sbjct:    78 GDPNHGLPFPAEHYTGIILPESLDWRMYGAVTPVKDQAVCGSCWSFATTGAMEGALFLKT 137

Query:   168 NKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGV-TTEA--KYPYQANDGTCD 223
               L  LS+Q L+DC   + N  C+GG    A  +IKK GG+ +TE+   +P    +G C 
Sbjct:   138 GVLTPLSQQVLIDCSWGKGNYACDGGEEWRAKGWIKKHGGIASTESPPSFPLVLQNGLCH 197

Query:   224 VSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVF-TGECG 281
              + +S     I G+ NV + +  A+  A+ K  PV+V+IDA    F FYS G++   +C 
Sbjct:   198 YN-QSEMLAKITGYVNVTSGNITAVKTAIYKHGPVAVSIDASHKTFSFYSNGIYYEPKCA 256

Query:   282 T---ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEA 338
                 +L+H V AVGYG  L G  YW+++NSW   WG  GYI M   + D    CG+A EA
Sbjct:   257 NKPGQLDHAVLAVGYGV-LQGETYWLIKNSWSTYWGNDGYILM--AMKDNN--CGVATEA 311

Query:   339 SYPI 342
             +YPI
Sbjct:   312 TYPI 315


>UNIPROTKB|F1RU48
            symbol:CTSF "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            OMA:LAPPEWD GeneTree:ENSGT00660000095458 EMBL:CU928034
            EMBL:FP565364 Ensembl:ENSSSCT00000014140 Ensembl:ENSSSCT00000014154
            Uniprot:F1RU48
        Length = 460

 Score = 469 (170.2 bits), Expect = 1.5e-44, P = 1.5e-44
 Identities = 114/302 (37%), Positives = 158/302 (52%)

Query:    47 TVSRSLDEKHK---RFNVFKQNVMHVHQTNKMDK-PYKLKLNKFADMTNHEFASTYAGSK 102
             T +R+ D K +   R +VF  N++   +   +D    +  + KF+D+T  EF + Y    
Sbjct:   169 TYNRTYDTKEEARWRMSVFANNMVRAQKIQALDTGTARYGVTKFSDLTEEEFRTIYLNP- 227

Query:   103 IKHHRMFQGTRGNGTFMYGKVTSIPPSV-DWRKKGSVTAVKDQGQCGSCWAFSTIAAVEG 161
                  + Q   G    +   V+S+PP   DWRKKG+VT VKDQG CGSCWAFS    VEG
Sbjct:   228 -----LLQEEPGRKMRLAKSVSSLPPPEWDWRKKGAVTKVKDQGMCGSCWAFSVTGNVEG 282

Query:   162 INHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGT 221
                +    L+SLSEQEL+DCD   ++GC GGL   A+  IK  GG+ TE  Y Y+ +  T
Sbjct:   283 QWFLKQGTLLSLSEQELLDCDK-VDKGCMGGLPSNAYSAIKTLGGLETEEDYSYRGHLQT 341

Query:   222 CDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGV---FTG 278
             C  + E +  V I+    +  N +        K P+SVAI+A     QFY  G+      
Sbjct:   342 CSFNAEKAK-VYINDSVELSQNEQKLAAWLAEKGPISVAINAFG--MQFYRHGISHPLRP 398

Query:   279 ECGTEL-NHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAME 337
              C   L +H V  VGYG     T +W ++NSWG +WGE+GY  + RG     G CG+ + 
Sbjct:   399 LCSPWLIDHAVLLVGYGNR-SATPFWAIKNSWGTDWGEEGYYYLYRG----SGACGVNIM 453

Query:   338 AS 339
             AS
Sbjct:   454 AS 455


>TAIR|locus:2082687
            symbol:AT3G54940 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002686 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HSSP:P53634
            OMA:GGGLMTN EMBL:AY070063 IPI:IPI00528988 RefSeq:NP_567010.5
            UniGene:At.28412 ProteinModelPortal:Q8VYS0 SMR:Q8VYS0 PRIDE:Q8VYS0
            EnsemblPlants:AT3G54940.2 GeneID:824659 KEGG:ath:AT3G54940
            TAIR:At3g54940 PhylomeDB:Q8VYS0 ProtClustDB:CLSN2718801
            ArrayExpress:Q8VYS0 Genevestigator:Q8VYS0 Uniprot:Q8VYS0
        Length = 367

 Score = 469 (170.2 bits), Expect = 1.5e-44, P = 1.5e-44
 Identities = 113/299 (37%), Positives = 154/299 (51%)

Query:    53 DEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTYAG-SKIKHHRMFQG 111
             +E   R  +F +NV+   +   MD      + +F+D+T  EF   Y G + +   R   G
Sbjct:    66 EEYIHRLGIFAKNVLKAAEHQMMDPSAVHGVTQFSDLTEEEFKRMYTGVADVGGSR--GG 123

Query:   112 TRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLV 171
             T G    M  +V  +P   DWR+KG VT VK+QG CGSCWAFST  A EG + + T KL+
Sbjct:   124 TVGAEAPMV-EVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLL 182

Query:   172 SLSEQELVDCDT-----DQ---NQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCD 223
             SLSEQ+LVDCD      D+   + GC GGLM  A+E++ + GG+  E  YPY    G C 
Sbjct:   183 SLSEQQLVDCDQACDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTGKRGHCK 242

Query:   224 VSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTGE-CGT 282
                E   AV +     +P +        V   P++V ++A     Q Y  GV     C  
Sbjct:   243 FDPEKV-AVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVF--MQTYIGGVSCPLICSK 299

Query:   283 E-LNHGVAAVGYGTT------LDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGI 334
               +NHGV  VGYG+       L    YWI++NSWG +WGE GY ++ RG      +CGI
Sbjct:   300 RNVNHGVLLVGYGSKGFSILRLSNKPYWIIKNSWGKKWGENGYYKLCRG----HDICGI 354


>TAIR|locus:2130180
            symbol:AT4G16190 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0042744 "hydrogen peroxide catabolic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005773
            EMBL:CP002687 HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 EMBL:Z97340 EMBL:AL161543 UniGene:At.25555
            EMBL:AY039556 EMBL:AY129473 EMBL:AY136316 EMBL:BT000733
            EMBL:AK226366 IPI:IPI00543588 PIR:D71428 RefSeq:NP_567489.1
            HSSP:P25779 ProteinModelPortal:Q9SUL1 SMR:Q9SUL1 STRING:Q9SUL1
            MEROPS:C01.A06 PRIDE:Q9SUL1 EnsemblPlants:AT4G16190.1 GeneID:827311
            KEGG:ath:AT4G16190 TAIR:At4g16190 InParanoid:Q9SUL1 OMA:NACGINK
            PhylomeDB:Q9SUL1 ProtClustDB:CLSN2917559 Genevestigator:Q9SUL1
            Uniprot:Q9SUL1
        Length = 373

 Score = 469 (170.2 bits), Expect = 1.5e-44, P = 1.5e-44
 Identities = 115/326 (35%), Positives = 171/326 (52%)

Query:    25 HEKELESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLN 84
             ++++L + E  + L++  +   T +  ++  H RF VFK N+    +   +D      + 
Sbjct:    44 NDEQLLNAEHHFTLFKS-KYEKTYATQVEHDH-RFRVFKANLRRARRNQLLDPSAVHGVT 101

Query:    85 KFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQ 144
             +F+D+T  EF   + G K +  R+   T+   T      + +P   DWR++G+VT VK+Q
Sbjct:   102 QFSDLTPKEFRRKFLGLKRRGFRLPTDTQ---TAPILPTSDLPTEFDWREQGAVTPVKNQ 158

Query:   145 GQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCD--TDQNQ------GCNGGLMEL 196
             G CGSCW+FS I A+EG + + T +LVSLSEQ+LVDCD   D  Q      GC+GGLM  
Sbjct:   159 GMCGSCWSFSAIGALEGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSGGLMNN 218

Query:   197 AFEFIKKKGGVTTEAKYPYQANDGT-CDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ 255
             AFE+  K GG+  E  YPY   D T C   K S    S+     V ++ +      V   
Sbjct:   219 AFEYALKAGGLMKEEDYPYTGRDHTACKFDK-SKIVASVSNFSVVSSDEDQIAANLVQHG 277

Query:   256 PVSVAIDAGSSDFQFYSEGVFTGE-CGTELNHGVAAVGYGTT------LDGTKYWIVRNS 308
             P+++AI+A     Q Y  GV     C    +HGV  VG+G++      L    YWI++NS
Sbjct:   278 PLAIAINA--MWMQTYIGGVSCPYVCSKSQDHGVLLVGFGSSGYAPIRLKEKPYWIIKNS 335

Query:   309 WGPEWGEKGYIRMQRGISDKKGLCGI 334
             WG  WGE GY ++ RG      +CG+
Sbjct:   336 WGAMWGEHGYYKICRG---PHNMCGM 358


>UNIPROTKB|Q9UBX1
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=TAS] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_6900 GO:GO:0019886 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0043202
            GO:GO:0004197 HOVERGEN:HBG011513 EMBL:AJ007331 EMBL:AF088886
            EMBL:AF132894 EMBL:AF136279 EMBL:AF071748 EMBL:AF071749
            EMBL:AK313657 EMBL:BC011682 EMBL:BC036451 EMBL:AL137742
            IPI:IPI00002816 RefSeq:NP_003784.2 UniGene:Hs.11590 PDB:1D5U
            PDB:1M6D PDBsum:1D5U PDBsum:1M6D ProteinModelPortal:Q9UBX1
            SMR:Q9UBX1 STRING:Q9UBX1 MEROPS:C01.018 PhosphoSite:Q9UBX1
            DMDM:12643325 PaxDb:Q9UBX1 PeptideAtlas:Q9UBX1 PRIDE:Q9UBX1
            DNASU:8722 Ensembl:ENST00000310325 GeneID:8722 KEGG:hsa:8722
            UCSC:uc001oip.3 CTD:8722 GeneCards:GC11M066332 HGNC:HGNC:2531
            HPA:CAB002141 MIM:603539 neXtProt:NX_Q9UBX1 PharmGKB:PA27031
            InParanoid:Q9UBX1 OMA:LAPPEWD OrthoDB:EOG4CC41T PhylomeDB:Q9UBX1
            BindingDB:Q9UBX1 ChEMBL:CHEMBL2517 ChiTaRS:CTSF
            EvolutionaryTrace:Q9UBX1 GenomeRNAi:8722 NextBio:32715
            ArrayExpress:Q9UBX1 Bgee:Q9UBX1 CleanEx:HS_CTSF
            Genevestigator:Q9UBX1 GermOnline:ENSG00000174080 Uniprot:Q9UBX1
        Length = 484

 Score = 464 (168.4 bits), Expect = 5.0e-44, P = 5.0e-44
 Identities = 112/302 (37%), Positives = 158/302 (52%)

Query:    44 SHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDK-PYKLKLNKFADMTNHEFASTYAGSK 102
             +++    S +E   R +VF  N++   +   +D+   +  + KF+D+T  EF + Y  + 
Sbjct:   193 TYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTL 252

Query:   103 IKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGI 162
             ++      G +       G +   PP  DWR KG+VT VKDQG CGSCWAFS    VEG 
Sbjct:   253 LRKE---PGNKMKQAKSVGDLA--PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQ 307

Query:   163 NHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTC 222
               +    L+SLSEQEL+DCD   ++ C GGL   A+  IK  GG+ TE  Y YQ +  +C
Sbjct:   308 WFLNQGTLLSLSEQELLDCDK-MDKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQSC 366

Query:   223 DVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGV---FTG 278
             + S E +     D  E   + +E  L   +AK+ P+SVAI+A     QFY  G+      
Sbjct:   367 NFSAEKAKVYINDSVEL--SQNEQKLAAWLAKRGPISVAINAFG--MQFYRHGISRPLRP 422

Query:   279 ECGTEL-NHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAME 337
              C   L +H V  VGYG   D   +W ++NSWG +WGEKGY  + RG     G CG+   
Sbjct:   423 LCSPWLIDHAVLLVGYGNRSD-VPFWAIKNSWGTDWGEKGYYYLHRG----SGACGVNTM 477

Query:   338 AS 339
             AS
Sbjct:   478 AS 479


>FB|FBgn0033874
            symbol:CG6347 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE013599 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 HSSP:P53634 EMBL:AY069609
            RefSeq:NP_610906.1 UniGene:Dm.608 SMR:Q7K0S6 MEROPS:C01.A29
            EnsemblMetazoa:FBtr0087637 GeneID:36531 KEGG:dme:Dmel_CG6347
            UCSC:CG6347-RA FlyBase:FBgn0033874 InParanoid:Q7K0S6 OMA:FEYIRDH
            OrthoDB:EOG4FQZ74 GenomeRNAi:36531 NextBio:799046 Uniprot:Q7K0S6
        Length = 352

 Score = 455 (165.2 bits), Expect = 4.5e-43, P = 4.5e-43
 Identities = 114/307 (37%), Positives = 165/307 (53%)

Query:    53 DEKHKRFNVFKQNVMHVHQTNK-MDKP---YKLKLNKFADMTNHEFASTYAGSKIKHHRM 108
             +E+  R ++F   +  +  +NK  D     ++L +N  ADMT  E A T  GSKI     
Sbjct:    52 EERVYRESIFAAKMSLITLSNKNADNGVSGFRLGVNTLADMTRKEIA-TLLGSKISEFGE 110

Query:   109 FQGTRGNGTFMYGK---VTSIPPSVDWRKKGSVTAVKDQGQ-CGSCWAFSTIAAVEGINH 164
              + T G+  F+  +     ++P   DWR+KG VT    QG  CG+CW+F+T  A+EG   
Sbjct:   111 -RYTNGHINFVTARNPASANLPEMFDWREKGGVTPPGFQGVGCGACWSFATTGALEGHLF 169

Query:   165 IMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCD 223
               T  L SLS+Q LVDC  D  N GC+GG  E  FE+I+  G VT   KYPY   +  C 
Sbjct:   170 RRTGVLASLSQQNLVDCADDYGNMGCDGGFQEYGFEYIRDHG-VTLANKYPYTQTEMQCR 228

Query:   224 VSKESS-PA----VSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFT 277
              ++ +  P     V I  +  +    E+ + + +A   P++ +++A +  F+ YS G++ 
Sbjct:   229 QNETAGRPPRESLVKIRDYATITPGDEEKMKEVIATLGPLACSMNADTISFEQYSGGIYE 288

Query:   278 GE-CGT-ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIA 335
              E C   ELNH V  VGYGT  +G  YWI++NS+   WGE G++R+ R      G CGIA
Sbjct:   289 DEECNQGELNHSVTVVGYGTE-NGRDYWIIKNSYSQNWGEGGFMRILRNAG---GFCGIA 344

Query:   336 MEASYPI 342
              E SYPI
Sbjct:   345 SECSYPI 351


>GENEDB_PFALCIPARUM|PF11_0162
            symbol:PF11_0162 "falcipain-3" species:5833
            "Plasmodium falciparum" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 HSSP:P43235 EMBL:AE014186 GO:GO:0020020
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347833.1
            ProteinModelPortal:Q8IIL0 SMR:Q8IIL0 MEROPS:C01.063
            EnsemblProtists:PF11_0162:mRNA GeneID:810709 KEGG:pfa:PF11_0162
            EuPathDB:PlasmoDB:PF3D7_1115400 OMA:ENDEDYW ChEMBL:CHEMBL1250373
            Uniprot:Q8IIL0
        Length = 492

 Score = 453 (164.5 bits), Expect = 7.3e-43, P = 7.3e-43
 Identities = 112/312 (35%), Positives = 159/312 (50%)

Query:    53 DEKHKRFNVFKQNVMHVHQTNKMDKP-YKLKLNKFADMTNHEFASTYAGSKIKHHRMFQG 111
             +E  KRF +F +N   +   NK     YK  +NKF D++  EF S Y    +K H  F+ 
Sbjct:   186 EEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFGDLSPEEFRSKYLN--LKTHGPFKT 243

Query:   112 TRGNGTFM--YGKVTS-IPPS--------VDWRKKGSVTAVKDQGQCGSCWAFSTIAAVE 160
                  ++   Y  V     P+         DWR  G VT VKDQ  CGSCWAFS++ +VE
Sbjct:   244 LSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGGVTPVKDQALCGSCWAFSSVGSVE 303

Query:   161 GINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQAN-D 219
                 I    L   SEQELVDC   +N GC GG +  AF+ +   GG+ ++  YPY +N  
Sbjct:   304 SQYAIRKKALFLFSEQELVDCSV-KNNGCYGGYITNAFDDMIDLGGLCSQDDYPYVSNLP 362

Query:   220 GTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTGE 279
              TC++ K  +   +I  + ++P +     L+ +   P+S++I A S DF FY  G + GE
Sbjct:   363 ETCNL-KRCNERYTIKSYVSIPDDKFKEALRYLG--PISISI-AASDDFAFYRGGFYDGE 418

Query:   280 CGTELNHGVAAVGYGTT---------LDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKG 330
             CG   NH V  VGYG           ++   Y+I++NSWG +WGE GYI ++   +  K 
Sbjct:   419 CGAAPNHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWGEGGYINLETDENGYKK 478

Query:   331 LCGIAMEASYPI 342
              C I  EA  P+
Sbjct:   479 TCSIGTEAYVPL 490


>UNIPROTKB|Q8IIL0
            symbol:PF11_0162 "Falcipain-3" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 HSSP:P43235 EMBL:AE014186 GO:GO:0020020
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347833.1
            ProteinModelPortal:Q8IIL0 SMR:Q8IIL0 MEROPS:C01.063
            EnsemblProtists:PF11_0162:mRNA GeneID:810709 KEGG:pfa:PF11_0162
            EuPathDB:PlasmoDB:PF3D7_1115400 OMA:ENDEDYW ChEMBL:CHEMBL1250373
            Uniprot:Q8IIL0
        Length = 492

 Score = 453 (164.5 bits), Expect = 7.3e-43, P = 7.3e-43
 Identities = 112/312 (35%), Positives = 159/312 (50%)

Query:    53 DEKHKRFNVFKQNVMHVHQTNKMDKP-YKLKLNKFADMTNHEFASTYAGSKIKHHRMFQG 111
             +E  KRF +F +N   +   NK     YK  +NKF D++  EF S Y    +K H  F+ 
Sbjct:   186 EEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFGDLSPEEFRSKYLN--LKTHGPFKT 243

Query:   112 TRGNGTFM--YGKVTS-IPPS--------VDWRKKGSVTAVKDQGQCGSCWAFSTIAAVE 160
                  ++   Y  V     P+         DWR  G VT VKDQ  CGSCWAFS++ +VE
Sbjct:   244 LSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGGVTPVKDQALCGSCWAFSSVGSVE 303

Query:   161 GINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQAN-D 219
                 I    L   SEQELVDC   +N GC GG +  AF+ +   GG+ ++  YPY +N  
Sbjct:   304 SQYAIRKKALFLFSEQELVDCSV-KNNGCYGGYITNAFDDMIDLGGLCSQDDYPYVSNLP 362

Query:   220 GTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTGE 279
              TC++ K  +   +I  + ++P +     L+ +   P+S++I A S DF FY  G + GE
Sbjct:   363 ETCNL-KRCNERYTIKSYVSIPDDKFKEALRYLG--PISISI-AASDDFAFYRGGFYDGE 418

Query:   280 CGTELNHGVAAVGYGTT---------LDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKG 330
             CG   NH V  VGYG           ++   Y+I++NSWG +WGE GYI ++   +  K 
Sbjct:   419 CGAAPNHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWGEGGYINLETDENGYKK 478

Query:   331 LCGIAMEASYPI 342
              C I  EA  P+
Sbjct:   479 TCSIGTEAYVPL 490


>RGD|1308181
            symbol:Ctsf "cathepsin F" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1308181 eggNOG:COG4870 HOGENOM:HOG000230774
            KO:K01373 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 MEROPS:C01.018
            CTD:8722 OMA:LAPPEWD OrthoDB:EOG4CC41T GeneTree:ENSGT00660000095458
            EMBL:CH473953 EMBL:BC099780 EMBL:EU253481 IPI:IPI00201100
            RefSeq:NP_001029282.1 UniGene:Rn.25087 SMR:Q499S6
            Ensembl:ENSRNOT00000026718 GeneID:361704 KEGG:rno:361704
            UCSC:RGD:1308181 InParanoid:Q499S6 NextBio:677325
            Genevestigator:Q499S6 Uniprot:Q499S6
        Length = 462

 Score = 451 (163.8 bits), Expect = 1.2e-42, P = 1.2e-42
 Identities = 110/303 (36%), Positives = 156/303 (51%)

Query:    44 SHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDK-PYKLKLNKFADMTNHEFASTYAGSK 102
             +++    S +E   R  VF +N++   +   +D+   +  + KF+D+T  EF + Y    
Sbjct:   171 TYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHTIYLNP- 229

Query:   103 IKHHRMFQGTRGNGTFMYGKVTSI-PPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEG 161
                  + Q   G    +   +  + PP  DWRKKG+VT VKDQG CGSCWAFS    VEG
Sbjct:   230 -----LLQKESGGKMSLAKSINDLAPPEWDWRKKGAVTEVKDQGMCGSCWAFSVTGNVEG 284

Query:   162 INHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGT 221
                +    L+SLSEQEL+DCD   ++ C GGL   A+  IK  GG+ TE  Y YQ +   
Sbjct:   285 QWFLNRGTLLSLSEQELLDCDK-MDKACMGGLPSNAYTAIKNLGGLETEDDYGYQGHVQA 343

Query:   222 CDVSKESSPAVSIDGHENVPANHEDALLKAVA-KQPVSVAIDAGSSDFQFYSEGV---FT 277
             C+ S + +     D  E   +  E+ +   +A K P+SVAI+A     QFY  G+   F 
Sbjct:   344 CNFSTQMAKVYINDSVEL--SRDENKIAAWLAQKGPISVAINAFG--MQFYRHGIAHPFR 399

Query:   278 GECGTE-LNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAM 336
               C    ++H V  VGYG       YW ++NSWG +WGE+GY  + RG     G CG+  
Sbjct:   400 PLCSPWFIDHAVLLVGYGNR-SNIPYWAIKNSWGRDWGEEGYYYLYRG----SGACGVNT 454

Query:   337 EAS 339
              AS
Sbjct:   455 MAS 457


>MGI|MGI:1861434
            symbol:Ctsf "cathepsin F" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=ISS]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISS] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:1861434 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513 MEROPS:C01.018
            CTD:8722 OMA:LAPPEWD OrthoDB:EOG4CC41T EMBL:AF136280 EMBL:AF217224
            EMBL:AJ131851 EMBL:AK075862 EMBL:BC058758 IPI:IPI00126769
            RefSeq:NP_063914.1 UniGene:Mm.29561 ProteinModelPortal:Q9R013
            SMR:Q9R013 STRING:Q9R013 PhosphoSite:Q9R013 PaxDb:Q9R013
            PRIDE:Q9R013 Ensembl:ENSMUST00000119694 GeneID:56464 KEGG:mmu:56464
            UCSC:uc008gbc.1 GeneTree:ENSGT00660000095458 InParanoid:Q9R013
            NextBio:312722 Bgee:Q9R013 CleanEx:MM_CTSF Genevestigator:Q9R013
            GermOnline:ENSMUSG00000006458 Uniprot:Q9R013
        Length = 462

 Score = 449 (163.1 bits), Expect = 1.9e-42, P = 1.9e-42
 Identities = 109/302 (36%), Positives = 153/302 (50%)

Query:    44 SHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDK-PYKLKLNKFADMTNHEFASTYAGSK 102
             +++    S +E   R  VF +N++   +   +D+   +  + KF+D+T  EF + Y    
Sbjct:   171 TYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHTIYLNP- 229

Query:   103 IKHHRMFQGTRGNGTFMYGKVTSI-PPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEG 161
                  + Q   G        +  + PP  DWRKKG+VT VK+QG CGSCWAFS    VEG
Sbjct:   230 -----LLQKESGRKMSPAKSINDLAPPEWDWRKKGAVTEVKNQGMCGSCWAFSVTGNVEG 284

Query:   162 INHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGT 221
                +    L+SLSEQEL+DCD   ++ C GGL   A+  IK  GG+ TE  Y YQ +  T
Sbjct:   285 QWFLNRGTLLSLSEQELLDCDK-VDKACLGGLPSNAYAAIKNLGGLETEDDYGYQGHVQT 343

Query:   222 CDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGV---FTG 278
             C+ S + +  V I+    +  N          K P+SVAI+A     QFY  G+   F  
Sbjct:   344 CNFSAQMAK-VYINDSVELSRNENKIAAWLAQKGPISVAINAFG--MQFYRHGIAHPFRP 400

Query:   279 ECGTE-LNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAME 337
              C    ++H V  VGYG       YW ++NSWG +WGE+GY  + RG     G CG+   
Sbjct:   401 LCSPWFIDHAVLLVGYGNR-SNIPYWAIKNSWGSDWGEEGYYYLYRG----SGACGVNTM 455

Query:   338 AS 339
             AS
Sbjct:   456 AS 457


>UNIPROTKB|Q0VCU3
            symbol:CTSF "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            HOVERGEN:HBG011513 MEROPS:C01.018 CTD:8722 OMA:LAPPEWD
            OrthoDB:EOG4CC41T GeneTree:ENSGT00660000095458 EMBL:DAAA02063594
            EMBL:BC120003 IPI:IPI00717812 RefSeq:NP_001068884.1 UniGene:Bt.7264
            SMR:Q0VCU3 Ensembl:ENSBTAT00000014587 GeneID:509715 KEGG:bta:509715
            InParanoid:Q0VCU3 NextBio:20869091 Uniprot:Q0VCU3
        Length = 460

 Score = 448 (162.8 bits), Expect = 2.5e-42, P = 2.5e-42
 Identities = 114/304 (37%), Positives = 156/304 (51%)

Query:    47 TVSRSLDEKHK---RFNVFKQNVMHVHQTNKMDK-PYKLKLNKFADMTNHEFASTYAGSK 102
             T +R+ D + +   R +VF  N++   +   +D+   +  + KF+D+T  EF + Y    
Sbjct:   169 TYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTARYGVTKFSDLTEEEFRTIYLNPL 228

Query:   103 IKHHRMFQGTRGNGTFMYGKVTSIPPSV-DWRKKGSVTAVKDQGQCGSCWAFSTIAAVEG 161
             +K         G        VT +PP   DWR KG+VT VKDQG CGSCWAFS    VEG
Sbjct:   229 LKD------APGRNMRPAQPVTDVPPPQWDWRNKGAVTNVKDQGMCGSCWAFSVTGNVEG 282

Query:   162 INHIMTNKLVSLSEQELVDCD-TDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDG 220
                +    L+SLSEQEL+DCD TD+   C GGL   A+  I+  GG+ TE  Y Y+    
Sbjct:   283 QWFLKRGTLLSLSEQELLDCDKTDK--ACLGGLPSNAYSAIRTLGGLETEDDYSYRGRLQ 340

Query:   221 TCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGV---F 276
             TC  S E +     D  E   + +E  L   +AK  PVS+AI+A     QFY  G+    
Sbjct:   341 TCSFSAEKAKVYINDSVEL--SKNEQKLAAWLAKNGPVSIAINAFG--MQFYRHGISHPL 396

Query:   277 TGECGTEL-NHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIA 335
                C   L +H V  VGYG       +W ++NSWG +WGE+GY  + RG     G CG+ 
Sbjct:   397 RPLCSPWLIDHAVLLVGYGNR-SAIPFWAIKNSWGTDWGEEGYYYLHRG----SGACGVN 451

Query:   336 MEAS 339
             + AS
Sbjct:   452 IMAS 455


>DICTYBASE|DDB_G0272742
            symbol:DDB_G0272742 species:44689 "Dictyostelium
            discoideum" [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0272742 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639 EMBL:AAFI02000008
            eggNOG:NOG331187 RefSeq:XP_644986.1 ProteinModelPortal:Q7KWP5
            PRIDE:Q7KWP5 EnsemblProtists:DDB0168242 GeneID:8618663
            KEGG:ddi:DDB_G0272742 InParanoid:Q7KWP5 OMA:ATESAHF Uniprot:Q7KWP5
        Length = 345

 Score = 445 (161.7 bits), Expect = 5.2e-42, P = 5.2e-42
 Identities = 111/322 (34%), Positives = 168/322 (52%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTY 98
             +  W + +  + +  E   R+N FK N+  ++Q N       L LN+FAD++N E+   Y
Sbjct:    29 FTAWMTSNQRTYASSEFTNRYNTFKSNLDFINQWNSKGSKTVLALNEFADISNEEYRKNY 88

Query:    99 AGSKIKHHRMFQGTRGNGTFMYGKVTSIPPS----VDWRKKGSVTAVKDQ-GQCGSCWAF 153
               +    +++      +      K +S   S    +DWRKKG+V +VK Q G CGS W  
Sbjct:    89 LRNDNNINKLSSLLINDKEDKEIKSSSSSGSGSSGIDWRKKGAVPSVKSQIGGCGS-WPI 147

Query:   154 STIAAVEGINHIMTNK--LVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEA 211
             + + A E  + +   K   +SLS Q L+DC ++ N+ C  G +  AF++I + GG+ +E 
Sbjct:   148 TAVGATESAHFLANPKDPFISLSMQNLIDC-SNLNKQCYQGTVNEAFQYIIENGGIDSEE 206

Query:   212 KYPYQAND-GTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQF 270
              Y +   + G C  +  +S A  I  +E V +  E +L  AV+ +PV+  IDA  S FQF
Sbjct:   207 SYKFSGGEPGKCKYNSSNSVA-KITSYEKVKSGSESSLESAVSLKPVAAYIDASLSSFQF 265

Query:   271 YSEGVF-TGECG-TELNHGVAAVGYG----TTLDGTK----YWIVRNSWGPEWGEKGYIR 320
             YS G++    C  T+LNH +  VG+     T  D  K    YWIV+NS+G  WGE GYI 
Sbjct:   266 YSSGIYYEPSCNSTDLNHSILIVGFSDFSTTPTDSLKHSSNYWIVQNSFGKNWGENGYIF 325

Query:   321 MQRGISDKKGLCGIAMEASYPI 342
             M +   D+   CGI+  ASY I
Sbjct:   326 MSK---DRDDNCGISKMASYVI 344


>FB|FBgn0032228
            symbol:CG5367 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE014134 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 HSSP:P80067
            RefSeq:NP_609387.1 UniGene:Dm.26782 ProteinModelPortal:Q9VKY4
            SMR:Q9VKY4 MEROPS:C01.A30 EnsemblMetazoa:FBtr0080055 GeneID:34401
            KEGG:dme:Dmel_CG5367 UCSC:CG5367-RA FlyBase:FBgn0032228
            InParanoid:Q9VKY4 OMA:QIVDCSV OrthoDB:EOG4THT8X PhylomeDB:Q9VKY4
            GenomeRNAi:34401 NextBio:788324 ArrayExpress:Q9VKY4 Bgee:Q9VKY4
            Uniprot:Q9VKY4
        Length = 338

 Score = 442 (160.7 bits), Expect = 1.1e-41, P = 1.1e-41
 Identities = 101/316 (31%), Positives = 171/316 (54%)

Query:    39 YERWRSHHTVS--RSLDEKHKRFNVFKQN--VMHVHQTNKMD--KPYKLKLNKFADMTNH 92
             +E++++++     R+ DE  + +  F++N  V+  H  N  +    ++LK N FADM+  
Sbjct:    36 FEKFKNNNNRKYLRTYDEM-RSYKAFEENFKVIEEHNQNYKEGQTSFRLKPNIFADMSTD 94

Query:    93 EFASTYAGSKIKHHRMFQGTRGNGTFMYGK--VTSIPPSVDWRKKGSVTAVKDQGQCGSC 150
              +   +    +K +   + +  N   + G   + ++P S+DWR KG +T   +Q  CGSC
Sbjct:    95 GYLKGFL-RLLKSN--IEDSADNMAEIVGSPLMANVPESLDWRSKGFITPPYNQLSCGSC 151

Query:   151 WAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTT 209
             +AFS   ++ G     T K++SLS+Q++VDC     NQGC GG +     +++  GG+  
Sbjct:   152 YAFSIAESIMGQVFKRTGKILSLSKQQIVDCSVSHGNQGCVGGSLRNTLSYLQSTGGIMR 211

Query:   210 EAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDF 268
             +  YPY A  G C    + S  V++     +P   E A+  AV    PV+++I+A    F
Sbjct:   212 DQDYPYVARKGKCQFVPDLS-VVNVTSWAILPVRDEQAIQAAVTHIGPVAISINASPKTF 270

Query:   269 QFYSEGVFTGE-CGT-ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGIS 326
             Q YS+G++    C +  +NH +  +G+G       YWI++N WG  WGE GYIR+++G++
Sbjct:   271 QLYSDGIYDDPLCSSASVNHAMVVIGFGKD-----YWILKNWWGQNWGENGYIRIRKGVN 325

Query:   327 DKKGLCGIAMEASYPI 342
                 +CGIA  A+Y I
Sbjct:   326 ----MCGIANYAAYAI 337


>UNIPROTKB|E2RR02
            symbol:CTSF "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            OMA:LAPPEWD GeneTree:ENSGT00660000095458 EMBL:AAEX03011628
            Ensembl:ENSCAFT00000019742 Uniprot:E2RR02
        Length = 460

 Score = 442 (160.7 bits), Expect = 1.1e-41, P = 1.1e-41
 Identities = 110/304 (36%), Positives = 158/304 (51%)

Query:    47 TVSRSLDEKHK---RFNVFKQNVMHVHQTNKMDK-PYKLKLNKFADMTNHEFASTYAGSK 102
             T +R+ + K +   R +VF  N++   +   +D+   +  + KF+D+T  EF + Y    
Sbjct:   168 TYNRTYETKEEAEWRMSVFSNNMVRAQKIQALDRGTAQYGITKFSDLTEEEFRTIYLNPL 227

Query:   103 IKHHRMFQGTRGNGTFMYGKVT--SIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVE 160
             ++ +R      G    +   ++  + PP  DWR KG+VT VKDQG CGSCWAFS    VE
Sbjct:   228 LRENR------GKKMRLAKSISDHAPPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVE 281

Query:   161 GINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDG 220
             G   +    L+SLSEQEL+DCD   ++ C GGL   A+  I   GG+ TE  Y YQ +  
Sbjct:   282 GQWFLKEGTLLSLSEQELLDCDK-VDKACLGGLPSNAYSAIMTLGGLETEDDYSYQGHLQ 340

Query:   221 TCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGV---F 276
              C  S + +     D  E   + +E  L   +AK+ P+SVAI+A     QFY  G+    
Sbjct:   341 ACSFSAKKARVYINDSMEL--SQNEQKLAAWLAKKGPISVAINAFG--MQFYRHGISHPL 396

Query:   277 TGECGTEL-NHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIA 335
                C   L +H V  VGYG    G  +W ++NSWG +WGE+GY  + RG     G CG+ 
Sbjct:   397 RPLCSPWLIDHAVLLVGYGNR-SGIPFWAIKNSWGTDWGEEGYYYLHRG----SGACGVN 451

Query:   336 MEAS 339
               AS
Sbjct:   452 TMAS 455


>ZFIN|ZDB-GENE-030131-9831
            symbol:ctsf "cathepsin F" species:7955 "Danio
            rerio" [GO:0004869 "cysteine-type endopeptidase inhibitor activity"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000010 InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00031 Pfam:PF00112 PRINTS:PR00705 SMART:SM00043
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-030131-9831
            GO:GO:0004869 eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 HOVERGEN:HBG011513 CTD:8722 OrthoDB:EOG4CC41T
            MEROPS:I25.006 EMBL:BC124243 IPI:IPI00503226 RefSeq:NP_001071036.1
            UniGene:Dr.81265 ProteinModelPortal:Q08CH0 SMR:Q08CH0 GeneID:565588
            KEGG:dre:565588 InParanoid:Q08CH0 NextBio:20885952
            ArrayExpress:Q08CH0 Uniprot:Q08CH0
        Length = 473

 Score = 430 (156.4 bits), Expect = 2.0e-40, P = 2.0e-40
 Identities = 103/316 (32%), Positives = 160/316 (50%)

Query:    25 HEKELESEEGLWDLYERWR-SHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDK-PYKLK 82
             H K ++    L  +++ +  +++    S +E  KR  +F+QN+        +++   +  
Sbjct:   161 HSKPMKESVELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYG 220

Query:    83 LNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVK 142
             + KF+D+T  EF   Y    +    + +  +             P + DWR  G+V+ VK
Sbjct:   221 ITKFSDLTEDEFRMMYLNPMLSQWSLKKEMKP----AIPASAPAPDTWDWRDHGAVSPVK 276

Query:   143 DQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIK 202
             +QG CGSCWAFS    +EG     T +L+SLSEQELVDCD   +Q C GGL   A+E I+
Sbjct:   277 NQGMCGSCWAFSVTGNIEGQWFKKTGQLLSLSEQELVDCDK-LDQACGGGLPSNAYEAIE 335

Query:   203 KKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAID 262
               GG+ TE  Y Y  +  +CD S     A  I+    +P + ++         PVS A++
Sbjct:   336 NLGGLETETDYSYTGHKQSCDFST-GKVAAYINSSVELPKDEKEIAAFLAENGPVSAALN 394

Query:   263 AGSSDFQFYSEGV---FTGECGT-ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGY 318
             A +   QFY +GV       C    ++H V  VG+G   +G  +W ++NSWG ++GE+GY
Sbjct:   395 AFA--MQFYRKGVSHPLKIFCNPWMIDHAVLLVGFGQR-NGVPFWAIKNSWGEDYGEQGY 451

Query:   319 IRMQRGISDKKGLCGI 334
               + RG     GLCGI
Sbjct:   452 YYLYRG----SGLCGI 463


>RGD|1564827
            symbol:RGD1564827 "similar to cathepsin M" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 IPI:IPI00192321
            Ensembl:ENSRNOT00000023990 ArrayExpress:D3ZY04 Uniprot:D3ZY04
        Length = 338

 Score = 423 (154.0 bits), Expect = 1.1e-39, P = 1.1e-39
 Identities = 84/210 (40%), Positives = 117/210 (55%)

Query:   138 VTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMEL 196
             V     QG+C SCWAF  + A+EG     T KL  LS Q LVDC   Q N+GC GG    
Sbjct:   133 VHTASTQGRCNSCWAFPVVGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYN 192

Query:   197 AFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQP 256
             AF+++ + GG+ +EA YPY+  +G C  +  SS  ++       P  +ED L+ AVA +P
Sbjct:   193 AFQYVLQNGGLESEATYPYEGKEGLCRYNPNSSAKITXICAP--PQKNEDVLMDAVATKP 250

Query:   257 VSVAIDAGSSDFQFYSEGVF-TGECGTELNHGVAAVGYG---TTLDGTKYWIVRNSWGPE 312
             V+  I    S  +FY +G++   +C   +NH V  VGYG      DG  YW+++NSWG  
Sbjct:   251 VAAGIHVVHSSLRFYKKGIYHEPKCNNYVNHAVLVVGYGFEGNETDGNNYWLIQNSWGER 310

Query:   313 WGEKGYIRMQRGISDKKGLCGIAMEASYPI 342
             WG  GY+++ +   D+   CGIA  A YPI
Sbjct:   311 WGLNGYMKIAK---DRNNHCGIATFAQYPI 337


>DICTYBASE|DDB_G0281077
            symbol:DDB_G0281077 species:44689 "Dictyostelium
            discoideum" [GO:0030246 "carbohydrate binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR008965 InterPro:IPR013128 InterPro:IPR019028
            Pfam:PF00112 Pfam:PF09478 PRINTS:PR00705 SMART:SM00645
            SMART:SM01063 InterPro:IPR000169 dictyBase:DDB_G0281077
            GO:GO:0030246 EMBL:AAFI02000040 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 SUPFAM:SSF49384 ProtClustDB:CLSZ2430562
            RefSeq:XP_640803.1 ProteinModelPortal:Q54UH3
            EnsemblProtists:DDB0203998 GeneID:8622857 KEGG:ddi:DDB_G0281077
            InParanoid:Q54UH3 OMA:LINDFNF Uniprot:Q54UH3
        Length = 662

 Score = 362 (132.5 bits), Expect = 2.4e-38, Sum P(2) = 2.4e-38
 Identities = 73/186 (39%), Positives = 109/186 (58%)

Query:   125 SIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTD 184
             S P S+DWR  G V+ VK+QG CGSC+AFST+ A+E   +   N++++LSEQ LVDC  +
Sbjct:   470 SRPISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALEAHYYRKNNRMLNLSEQNLVDCTRN 529

Query:   185 QNQG-CNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPAN 243
                G C+GG M   F +IK+ GG+  ++ YPY+   G C  +   + +  I  +  +  +
Sbjct:   530 YGNGECSGGWMHNCFRYIKENGGINLQSTYPYEGRVGLCRYNSGDAQS-RISNYVMIKQH 588

Query:   244 HEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFTGE-CGT-ELNHGVAAVGYGTTLDGT 300
              E+ L  AVA   PVSVA DA + +F +YS G++  + C      H V  VGYG   +G 
Sbjct:   589 DEEDLANAVASVGPVSVAYDASTREFMYYSSGIYNSDSCDKYRTTHAVVVVGYGIE-NGV 647

Query:   301 KYWIVR 306
              +WI++
Sbjct:   648 DFWIIK 653

 Score = 78 (32.5 bits), Expect = 2.4e-38, Sum P(2) = 2.4e-38
 Identities = 23/97 (23%), Positives = 45/97 (46%)

Query:    21 GFDFHEKELESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNK--MDKP 78
             G D  ++ELE +      + +W +    +   D+   ++  FK +   + Q  +   +  
Sbjct:   148 GKDCRKRELEYQNS----FIQWSNQFNRTYRADQFLLKYEAFKDSSRFIEQYKRENQNST 203

Query:    79 YKLKLNKFADMTNHEFASTYAGSKIKHHRMFQGTRGN 115
              +L L +F+DMT+ EF + Y  SK+    + + T  N
Sbjct:   204 MELGLTQFSDMTHDEFLNIYT-SKLYEFNLNETTPSN 239


>FB|FBgn0037396
            symbol:CG11459 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE014297 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 KO:K01365 HSSP:P07711 EMBL:AY060710
            RefSeq:NP_649608.1 UniGene:Dm.3894 SMR:Q9VNK6 MEROPS:C01.A31
            EnsemblMetazoa:FBtr0078623 GeneID:40741 KEGG:dme:Dmel_CG11459
            UCSC:CG11459-RA FlyBase:FBgn0037396 InParanoid:Q9VNK6 OMA:NYDEREL
            OrthoDB:EOG4MGQPX ChiTaRS:CG11459 GenomeRNAi:40741 NextBio:820359
            Uniprot:Q9VNK6
        Length = 336

 Score = 407 (148.3 bits), Expect = 5.5e-38, P = 5.5e-38
 Identities = 103/317 (32%), Positives = 161/317 (50%)

Query:    36 WDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKM----DKPYKLKLNKFADMTN 91
             WD Y+    ++   R+ D+ H+   +++Q V+ V   N++       +K+ LNKF+D T+
Sbjct:    30 WDQYKA--KYNKQYRNRDKYHRA--LYEQRVLAVESHNQLYLQGKVAFKMGLNKFSD-TD 84

Query:    92 HEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQG-QCGSC 150
                   Y  S I             T  Y +   I   +DWR+ G ++ V DQG +C SC
Sbjct:    85 QRILFNYRSS-IPAPLETSTNALTETVNYKRYDQITEGIDWRQYGYISPVGDQGTECLSC 143

Query:   151 WAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTE 210
             WAFST   +E         LV LS + LVDC    N GC+GG + +AF + +  G + T+
Sbjct:   144 WAFSTSGVLEAHMAKKYGNLVPLSPKHLVDCVPYPNNGCSGGWVSVAFNYTRDHG-IATK 202

Query:   211 AKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ--PVSVAIDAGSSDF 268
               YPY+   G C + K    A ++ G+  +  N+++  L  V     PV+V+ID    +F
Sbjct:   203 ESYPYEPVSGEC-LWKSDRSAGTLSGYVTL-GNYDERELAEVVYNIGPVAVSIDHLHEEF 260

Query:   269 QFYSEGVFT-GECGT---ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRG 324
               YS GV +   C +   +L H V  VG+GT      YWI++NS+G +WGE GY+++ R 
Sbjct:   261 DQYSGGVLSIPACRSKRQDLTHSVLLVGFGTHRKWGDYWIIKNSYGTDWGESGYLKLARN 320

Query:   325 ISDKKGLCGIAMEASYP 341
              ++   +CG+A    YP
Sbjct:   321 ANN---MCGVASLPQYP 334


>DICTYBASE|DDB_G0281079
            symbol:DDB_G0281079 species:44689 "Dictyostelium
            discoideum" [GO:0030246 "carbohydrate binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR008965 InterPro:IPR013128 InterPro:IPR019028
            Pfam:PF00112 Pfam:PF09478 PRINTS:PR00705 SMART:SM00645
            SMART:SM01063 InterPro:IPR000169 dictyBase:DDB_G0281079
            GO:GO:0030246 EMBL:AAFI02000040 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 SUPFAM:SSF49384 RefSeq:XP_640804.1
            ProteinModelPortal:Q54UH2 EnsemblProtists:DDB0204000 GeneID:8622858
            KEGG:ddi:DDB_G0281079 InParanoid:Q54UH2 OMA:ALESHYY
            ProtClustDB:CLSZ2430562 Uniprot:Q54UH2
        Length = 664

 Score = 346 (126.9 bits), Expect = 9.1e-37, Sum P(2) = 9.1e-37
 Identities = 72/188 (38%), Positives = 106/188 (56%)

Query:   125 SIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTD 184
             S P S+DWR  G V+ VK+QG CGSC+AFST+ A+E   +   N+++ LSEQ LVDC   
Sbjct:   469 SRPISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALESHYYRKNNRMLDLSEQNLVDCTAS 528

Query:   185 ---QNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVP 241
                +N GC+GG M   + +I++ GG+  E+ YPY+   G C  +   + +  I     + 
Sbjct:   529 NKYRNGGCSGGWMHNCYSYIQENGGINQESTYPYEGKFGQCRYNSGDAQS-RISKFVMIK 587

Query:   242 ANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFTGE-CGT-ELNHGVAAVGYGTTLD 298
              + E+ L   VA   PVSVA DA + +F +YS G++  + C      H V  VGY    +
Sbjct:   588 QHDEEDLADTVASVGPVSVAYDASTREFMYYSRGIYYSDNCNKYRTTHAVVVVGYDNE-N 646

Query:   299 GTKYWIVR 306
             G  YWI++
Sbjct:   647 GVDYWIIK 654

 Score = 80 (33.2 bits), Expect = 9.1e-37, Sum P(2) = 9.1e-37
 Identities = 23/99 (23%), Positives = 46/99 (46%)

Query:    21 GFDFHEKELESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNK--MDKP 78
             G D  ++ELE +      + +W +    +   D+   ++  FK +   + Q  +   +  
Sbjct:   147 GKDCRKRELEYQNS----FIQWSNQFNRTYRADQFLLKYEAFKDSSRFIEQYKRENQNST 202

Query:    79 YKLKLNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGT 117
              +L L +F+DMT+ EF + Y  SK+    + + T  N +
Sbjct:   203 MELGLTQFSDMTHDEFLNVYT-SKLYEFNLNETTPSNSS 240


>GENEDB_PFALCIPARUM|PF14_0553
            symbol:PF14_0553 "cysteine proteinase
            falcipain-1" species:5833 "Plasmodium falciparum" [GO:0042540
            "hemoglobin catabolic process" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014187 KO:K01376 HSSP:P07688 GO:GO:0042540
            RefSeq:XP_001348727.1 ProteinModelPortal:Q8I6V0 PRIDE:Q8I6V0
            EnsemblProtists:PF14_0553:mRNA GeneID:812135 KEGG:pfa:PF14_0553
            EuPathDB:PlasmoDB:PF3D7_1458000 HOGENOM:HOG000065906
            ProtClustDB:CLSZ2457715 ChEMBL:CHEMBL1250371 Uniprot:Q8I6V0
        Length = 569

 Score = 289 (106.8 bits), Expect = 2.2e-35, Sum P(2) = 2.2e-35
 Identities = 80/253 (31%), Positives = 130/253 (51%)

Query:    57 KRFNVFKQNVMHVHQTNKMDKPYKLKLNKF----ADMTNH--EFASTYAGSKIKHHRMFQ 110
             K  N   +N M+  + N+     + +L ++      + NH  E  S    + +K + +  
Sbjct:   257 KNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILIS 316

Query:   111 GTRGNGTFMYGKVTS-IPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNK 169
                 NG      + S +P  +D+R+KG V   KDQG CGSCWAF+++  +E +       
Sbjct:   317 EFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKN 376

Query:   170 LVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESS 229
             ++S SEQE+VDC  D N GC+GG    +F ++ +   +    +Y Y+A D    ++    
Sbjct:   377 ILSFSEQEVVDCSKD-NFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKDDMFCLNYRCK 434

Query:   230 PAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFTGECGTELNHGV 288
               VS+    ++ A  E+ L+ A+ +  P+SV +   ++DF  YSEGV+ G C  ELNH V
Sbjct:   435 RKVSLS---SIGAVKENQLILALNEVGPLSVNVGV-NNDFVAYSEGVYNGTCSEELNHSV 490

Query:   289 AAVGYGTTLDGTK 301
               VGYG  ++ TK
Sbjct:   491 LLVGYGQ-VEKTK 502

 Score = 123 (48.4 bits), Expect = 2.2e-35, Sum P(2) = 2.2e-35
 Identities = 19/41 (46%), Positives = 26/41 (63%)

Query:   302 YWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPI 342
             YWI++NSW  +WGE G++R+ R  +     CGI  E  YPI
Sbjct:   528 YWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPI 568

 Score = 100 (40.3 bits), Expect = 1.7e-10, Sum P(2) = 1.7e-10
 Identities = 18/53 (33%), Positives = 35/53 (66%)

Query:    43 RSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKP--YKLKLNKFADMTNHE 93
             + H+ V +++DE+ ++F +FK N + +   NK++K   YK K+N+F+D +  E
Sbjct:   230 KEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEE 282


>UNIPROTKB|Q8I6V0
            symbol:PF14_0553 "Cysteine proteinase falcipain-1"
            species:36329 "Plasmodium falciparum 3D7" [GO:0042540 "hemoglobin
            catabolic process" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014187 KO:K01376 HSSP:P07688 GO:GO:0042540
            RefSeq:XP_001348727.1 ProteinModelPortal:Q8I6V0 PRIDE:Q8I6V0
            EnsemblProtists:PF14_0553:mRNA GeneID:812135 KEGG:pfa:PF14_0553
            EuPathDB:PlasmoDB:PF3D7_1458000 HOGENOM:HOG000065906
            ProtClustDB:CLSZ2457715 ChEMBL:CHEMBL1250371 Uniprot:Q8I6V0
        Length = 569

 Score = 289 (106.8 bits), Expect = 2.2e-35, Sum P(2) = 2.2e-35
 Identities = 80/253 (31%), Positives = 130/253 (51%)

Query:    57 KRFNVFKQNVMHVHQTNKMDKPYKLKLNKF----ADMTNH--EFASTYAGSKIKHHRMFQ 110
             K  N   +N M+  + N+     + +L ++      + NH  E  S    + +K + +  
Sbjct:   257 KNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILIS 316

Query:   111 GTRGNGTFMYGKVTS-IPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNK 169
                 NG      + S +P  +D+R+KG V   KDQG CGSCWAF+++  +E +       
Sbjct:   317 EFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKN 376

Query:   170 LVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESS 229
             ++S SEQE+VDC  D N GC+GG    +F ++ +   +    +Y Y+A D    ++    
Sbjct:   377 ILSFSEQEVVDCSKD-NFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKDDMFCLNYRCK 434

Query:   230 PAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFTGECGTELNHGV 288
               VS+    ++ A  E+ L+ A+ +  P+SV +   ++DF  YSEGV+ G C  ELNH V
Sbjct:   435 RKVSLS---SIGAVKENQLILALNEVGPLSVNVGV-NNDFVAYSEGVYNGTCSEELNHSV 490

Query:   289 AAVGYGTTLDGTK 301
               VGYG  ++ TK
Sbjct:   491 LLVGYGQ-VEKTK 502

 Score = 123 (48.4 bits), Expect = 2.2e-35, Sum P(2) = 2.2e-35
 Identities = 19/41 (46%), Positives = 26/41 (63%)

Query:   302 YWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPI 342
             YWI++NSW  +WGE G++R+ R  +     CGI  E  YPI
Sbjct:   528 YWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPI 568

 Score = 100 (40.3 bits), Expect = 1.7e-10, Sum P(2) = 1.7e-10
 Identities = 18/53 (33%), Positives = 35/53 (66%)

Query:    43 RSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKP--YKLKLNKFADMTNHE 93
             + H+ V +++DE+ ++F +FK N + +   NK++K   YK K+N+F+D +  E
Sbjct:   230 KEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEE 282


>ZFIN|ZDB-GENE-080724-8
            symbol:ctso "cathepsin O" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            ZFIN:ZDB-GENE-080724-8 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 CTD:1519 KO:K01374 EMBL:CR931784
            IPI:IPI00513613 RefSeq:XP_695717.3 UniGene:Dr.88386
            Ensembl:ENSDART00000074786 GeneID:567333 KEGG:dre:567333
            NextBio:20888622 Uniprot:E7FA09
        Length = 334

 Score = 379 (138.5 bits), Expect = 5.1e-35, P = 5.1e-35
 Identities = 87/266 (32%), Positives = 142/266 (53%)

Query:    74 KMDKPYKLKLNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWR 133
             K ++  +  +N+F+ ++  +F   Y  ++ +    F  ++     +  K  + PP  DWR
Sbjct:    73 KSNQSAQYGVNQFSYLSQKQFKEQYLTARAEAAPKFDQSKSE---IKVKANN-PPRFDWR 128

Query:   134 KKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGL 193
               G V  V +QG CG CWAFS + A+E ++     KL  LS Q+++DC   QNQGCNGG 
Sbjct:   129 DHGVVGPVHNQGSCGGCWAFSIVEAIESVSAKGGEKLQQLSVQQVIDCSY-QNQGCNGGS 187

Query:   194 -MELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVP-ANHEDALLKA 251
              +E  +   + K  + +EA+YP++  DG C    ++   V++  +     +  E+ ++ A
Sbjct:   188 PVEALYWLTQSKLKLVSEAEYPFKGADGVCQFFPQAHAGVAVRNYSAYDFSGQEEVMMSA 247

Query:   252 VAK-QPVSVAIDAGSSDFQFYSEGVFTGECGT-ELNHGVAAVGYGTTLDGTKYWIVRNSW 309
             +    P+ V +DA S  +Q Y  G+    C + + NH V   GY TT +   YWIVRNSW
Sbjct:   248 LVDFGPLVVIVDAIS--WQDYLGGIIQHHCSSHKANHAVLITGYDTTGE-VPYWIVRNSW 304

Query:   310 GPEWGEKGYIRMQRGISDKKGLCGIA 335
             G  WG+ GY  ++ G +D   +CG+A
Sbjct:   305 GTSWGDDGYAYIKIG-ND---VCGVA 326


>RGD|1309354
            symbol:Ctsw "cathepsin W" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1309354 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 MEROPS:C01.037
            CTD:1521 eggNOG:NOG288820 HOVERGEN:HBG100117 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 EMBL:CH473953 EMBL:BC093401 IPI:IPI00371471
            RefSeq:NP_001019413.1 UniGene:Rn.34406 Ensembl:ENSRNOT00000037404
            GeneID:293676 KEGG:rno:293676 UCSC:RGD:1309354 InParanoid:Q561Q9
            NextBio:636716 Genevestigator:Q561Q9 Uniprot:Q561Q9
        Length = 371

 Score = 259 (96.2 bits), Expect = 1.3e-34, Sum P(2) = 1.3e-34
 Identities = 73/255 (28%), Positives = 120/255 (47%)

Query:    54 EKHKRFNVFKQNVMHVHQTNKMDK-PYKLKLNKFADMTNHEFASTYAGSKIKHHRMFQGT 112
             E  +R  +F  N+    +  + D    +     F+D+T  EF   Y G +    R+    
Sbjct:    56 EYTRRLGIFAHNLAQAQRLQEEDLGTAEFGQTPFSDLTEEEFGQLY-GHQRAPERILNMA 114

Query:   113 RGNGTFMYGKVTSIPPSVDWRK-KGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLV 171
             +   +  +G+  S+PP+ DWRK K  ++++K+QG C  CWA +    ++ +  I T + V
Sbjct:   115 KKVKSERWGE--SVPPTCDWRKVKNIISSIKNQGNCRCCWAIAAADNIQTLWRIKTQQFV 172

Query:   172 SLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGT--CDVSKESS 229
              +S QEL+DCD   N GCNGG +  A+  +    G+ +E  YP+Q +     C   K   
Sbjct:   173 DVSVQELLDCDRCGN-GCNGGFVWDAYITVLNNSGLASEEDYPFQGHQKPHRCLADKYRK 231

Query:   230 PAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFTGE---CGTEL- 284
              A   D    + +++E  +   +A   P++V I+      Q+Y +GV       C   L 
Sbjct:   232 VAWIQDF--TMLSSNEQVIAGYLAIHGPITVTINMKL--LQYYQKGVIKATPSTCDPHLV 287

Query:   285 NHGVAAVGYGTTLDG 299
             NH V  VG+G    G
Sbjct:   288 NHSVLLVGFGKEKGG 302

 Score = 132 (51.5 bits), Expect = 1.3e-34, Sum P(2) = 1.3e-34
 Identities = 26/50 (52%), Positives = 29/50 (58%)

Query:   300 TKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPIKKSATNP 349
             T YWI++NSWG EWGEKGY R+ RG       CGIA    YPI      P
Sbjct:   319 TPYWILKNSWGAEWGEKGYFRLYRG----NNTCGIA---KYPITARVDRP 361


>WB|WBGene00012747
            symbol:Y40H7A.10 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000230773 EMBL:AL033510
            HSSP:P80067 MEROPS:C01.A48 PIR:T26792 RefSeq:NP_502836.1
            ProteinModelPortal:Q9XWA4 SMR:Q9XWA4 STRING:Q9XWA4
            EnsemblMetazoa:Y40H7A.10 GeneID:189809 KEGG:cel:CELE_Y40H7A.10
            UCSC:Y40H7A.10 CTD:189809 WormBase:Y40H7A.10 eggNOG:NOG286423
            InParanoid:Q9XWA4 OMA:NGPMIVC NextBio:943702 Uniprot:Q9XWA4
        Length = 343

 Score = 370 (135.3 bits), Expect = 4.6e-34, P = 4.6e-34
 Identities = 101/288 (35%), Positives = 148/288 (51%)

Query:    57 KRFNVFKQNVMHVHQTNKMDK-PYKLKLNKFADMTNHEFASTYAGSKIKHHRMFQGTRGN 115
             KRF +F +N+  V + NK D      +LN F+D+T  E+       K  H       +  
Sbjct:    70 KRFTIFSRNLDLVERYNKEDAGKVTYELNDFSDLTEEEWKKYLMTPKPDHSEKSLKPK-- 127

Query:   116 GTFMYGKVTSIPPSVDWRK-KGS--VTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVS 172
              T +  K  ++P SVDWR   G+  VT +K QG CGSCWAF+T AA+E    I    L S
Sbjct:   128 -TLIDKK--NLPNSVDWRNVNGTNHVTGIKYQGPCGSCWAFATAAAIESAVSISGGGLQS 184

Query:   173 LSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAV 232
             LS Q+L+DC T  +  C GG    A ++ +  G +TT   YPY      C   +E+ P V
Sbjct:   185 LSSQQLLDC-TVVSDKCGGGEPVEALKYAQSHG-ITTAHNYPYYFWTTKC---RETVPTV 239

Query:   233 S-IDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFTG-ECGTELNHGVA 289
             + I     + A  ED + + VA   P+ V  +  ++  +FY  G+    +CGTE  H + 
Sbjct:   240 ARISSW--MKAESEDEMAQIVALNGPMIVCANFATNKNRFYHSGIAEDPDCGTEPTHALI 297

Query:   290 AVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAME 337
              +GYG       YWI++N++   WGEKGY+R++R ++     CGI  E
Sbjct:   298 VIGYGPD-----YWILKNTYSKVWGEKGYMRVKRDVN----WCGINTE 336


>UNIPROTKB|E1BPI9
            symbol:CTSO "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 CTD:1519 KO:K01374 OMA:SNVCGIA
            EMBL:DAAA02044933 IPI:IPI01004081 RefSeq:XP_002694471.2
            RefSeq:XP_874012.4 Ensembl:ENSBTAT00000014691 GeneID:616804
            KEGG:bta:616804 Uniprot:E1BPI9
        Length = 313

 Score = 368 (134.6 bits), Expect = 7.4e-34, P = 7.4e-34
 Identities = 92/263 (34%), Positives = 132/263 (50%)

Query:    83 LNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVT--SIPPSVDWRKKGSVTA 140
             +N+F+ +   EF + Y  S       F          Y  ++  S+P   DWR K  VT 
Sbjct:    61 INQFSYLFPEEFKAIYLRSSPSRFPRFPAEE------YTSISNLSLPLRFDWRDKHVVTQ 114

Query:   141 VKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEF 200
             V++Q  CG CWAFS + AVE +  I    L  LS Q+++DC    N GCNGG    A  +
Sbjct:   115 VRNQKTCGGCWAFSVVGAVESVCAIKGQPLEVLSVQQVIDCSYS-NYGCNGGSPLSALYW 173

Query:   201 IKK-KGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVP-ANHEDALLKAV-AKQPV 257
             + K +  +  +++YP+QA +G C    +S    SI G+     +  ED + +A+ A  P+
Sbjct:   174 LNKLQVKLVRDSEYPFQAQNGLCRYFSDSHSGSSIKGYSAYDFSGQEDKMAEALLALGPL 233

Query:   258 SVAIDAGSSDFQFYSEGVFTGECGT-ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEK 316
              V +DA S  +Q Y  G+    C + E NH V   G+  T     YWIVRNSWG  WG  
Sbjct:   234 IVVVDAMS--WQDYLGGIIQHHCSSGEANHAVLVTGFDKT-GSIPYWIVRNSWGTSWGID 290

Query:   317 GYIRMQRGISDKKGLCGIAMEAS 339
             GY+R++ G      +CGIA   S
Sbjct:   291 GYVRVKMG----GNVCGIADSVS 309


>UNIPROTKB|P43234
            symbol:CTSO "Cathepsin O" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 Reactome:REACT_6900
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0004197
            CleanEx:HS_CTSO EMBL:X77383 EMBL:BC049206 IPI:IPI00017257
            PIR:A55090 RefSeq:NP_001325.1 UniGene:Hs.75262
            ProteinModelPortal:P43234 SMR:P43234 IntAct:P43234 STRING:P43234
            MEROPS:C01.035 PhosphoSite:P43234 DMDM:1168795 PRIDE:P43234
            DNASU:1519 Ensembl:ENST00000433477 GeneID:1519 KEGG:hsa:1519
            UCSC:uc003ipg.3 CTD:1519 GeneCards:GC04M156845 HGNC:HGNC:2542
            HPA:HPA002041 MIM:600550 neXtProt:NX_P43234 PharmGKB:PA27040
            HOVERGEN:HBG105050 InParanoid:P43234 KO:K01374 OMA:SNVCGIA
            OrthoDB:EOG4V6ZH1 PhylomeDB:P43234 BindingDB:P43234
            ChEMBL:CHEMBL3035 GenomeRNAi:1519 NextBio:6287 Bgee:P43234
            Genevestigator:P43234 GermOnline:ENSG00000151792 Uniprot:P43234
        Length = 321

 Score = 366 (133.9 bits), Expect = 1.2e-33, P = 1.2e-33
 Identities = 91/261 (34%), Positives = 129/261 (49%)

Query:    83 LNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVK 142
             +N+F+ +   EF + Y  SK      +         M     S+P   DWR K  VT V+
Sbjct:    69 INQFSYLFPEEFKAIYLRSKPSKFPRYSAE----VHMSIPNVSLPLRFDWRDKQVVTQVR 124

Query:   143 DQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIK 202
             +Q  CG CWAFS + AVE    I    L  LS Q+++DC  + N GCNGG    A  ++ 
Sbjct:   125 NQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYN-NYGCNGGSTLNALNWLN 183

Query:   203 K-KGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVP-ANHEDALLKAVAK-QPVSV 259
             K +  +  +++YP++A +G C     S    SI G+     ++ ED + KA+    P+ V
Sbjct:   184 KMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVV 243

Query:   260 AIDAGSSDFQFYSEGVFTGECGT-ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGY 318
              +DA S  +Q Y  G+    C + E NH V   G+  T   T YWIVRNSWG  WG  GY
Sbjct:   244 IVDAVS--WQDYLGGIIQHHCSSGEANHAVLITGFDKT-GSTPYWIVRNSWGSSWGVDGY 300

Query:   319 IRMQRGISDKKGLCGIAMEAS 339
               ++ G      +CGIA   S
Sbjct:   301 AHVKMG----SNVCGIADSVS 317


>WB|WBGene00013076
            symbol:Y51A2D.8 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            EMBL:AL021497 HSSP:P53634 HOGENOM:HOG000019851 PIR:T27079
            RefSeq:NP_507627.1 ProteinModelPortal:Q9XXQ7 SMR:Q9XXQ7
            MEROPS:C01.A49 EnsemblMetazoa:Y51A2D.8 GeneID:180208
            KEGG:cel:CELE_Y51A2D.8 UCSC:Y51A2D.8 CTD:180208 WormBase:Y51A2D.8
            eggNOG:NOG307864 InParanoid:Q9XXQ7 OMA:VAVYFKV NextBio:908434
            Uniprot:Q9XXQ7
        Length = 386

 Score = 312 (114.9 bits), Expect = 4.9e-33, Sum P(2) = 4.9e-33
 Identities = 73/201 (36%), Positives = 105/201 (52%)

Query:   138 VTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELA 197
             V  +KDQGQC  CW F+  A VE +    + K  SLS+QE+ DC T+   GC GG + L 
Sbjct:   164 VGPIKDQGQCACCWGFAVTALVETVYAAHSGKFKSLSDQEVCDCGTEGTPGCKGGSLTLG 223

Query:   198 FEFIKKKGGVTTEAKYPY---QANDGT-CDVSKESS--PAVSIDGHENVPANHEDALLKA 251
              +++KK G ++ +  YPY   +AN G  C + +     PA + +     P   E+ +++ 
Sbjct:   224 VQYVKKYG-LSGDEDYPYDQNRANQGRRCRLRETDRIVPARAFNFAVINPRRAEEQIIQV 282

Query:   252 VA--KQPVSVAIDAGSSDFQFYSEGVFT-GECGTELN-HGVAAVGYGTTLDGT----KYW 303
             +   K PV+V    G   F+ Y EGV    +C      H  A VGY T  D       YW
Sbjct:   283 LTEWKVPVAVYFKVGDQ-FKEYKEGVIIEDDCRRATQWHAGAIVGYDTVEDSRGRSHDYW 341

Query:   304 IVRNSWGPEWGEKGYIRMQRG 324
             I++NSWG +W E GY+R+ RG
Sbjct:   342 IIKNSWGGDWAESGYVRVVRG 362

 Score = 64 (27.6 bits), Expect = 4.9e-33, Sum P(2) = 4.9e-33
 Identities = 20/79 (25%), Positives = 40/79 (50%)

Query:    23 DFHEKELESE--EGLWDLYERWRSHHTVS-RSLDEKHKRFNVFKQNVMHVHQTNKMDKP- 78
             +F E  ++ +  E L+  +E ++  +    +   E  +RFN F ++  +V + N   K  
Sbjct:    25 EFFEINIDRDHPEKLYKAFEDFKKKYNRKYKDESENQQRFNNFVKSYNNVDKLNAKSKAA 84

Query:    79 -YKLK--LNKFADMTNHEF 94
              Y  +  +NKF+D++  EF
Sbjct:    85 GYDTQFGINKFSDLSTAEF 103


>UNIPROTKB|F1RU23
            symbol:CTSW "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 KO:K08569 EMBL:CU928325
            RefSeq:XP_003122571.1 UniGene:Ssc.28940 Ensembl:ENSSSCT00000014177
            GeneID:100525853 KEGG:ssc:100525853 OMA:CWAMAAV Uniprot:F1RU23
        Length = 367

 Score = 358 (131.1 bits), Expect = 8.5e-33, P = 8.5e-33
 Identities = 104/339 (30%), Positives = 162/339 (47%)

Query:    34 GLWDLYERWRSHHTVSRSLDEKH-KRFNVFKQNVMHVHQTNKMDK-PYKLKLNKFADMTN 91
             GL +++  ++  +  S S   +H +R ++F QN+    +  + D    +  +  F+D+T 
Sbjct:    37 GLKEVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDLTE 96

Query:    92 HEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTS---IPPSVDWRKK-GSVTAVKDQGQC 147
              EF   +      HH    G   +     G   S   +P S DWRKK G ++A+K Q  C
Sbjct:    97 EEFGQLHG-----HH-WGAGKAPSMGIKVGSEESGETVPQSCDWRKKPGVISAIKHQKDC 150

Query:   148 GSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGV 207
               CWA + +  VE    I  ++ V LS Q+++DCD   N GCNGG +  AF  +    G+
Sbjct:   151 NCCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRCGN-GCNGGFVWDAFLTVLNTSGL 209

Query:   208 TTEAKYPYQANDGT--CDVSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAG 264
              +E  YPY+    T  C ++K+      I     +    E ++ + +A + P++V I+AG
Sbjct:   210 ASEQDYPYKGTVKTHRC-LAKQHRKVAWIQDFLMLQFC-EQSIARYLATEGPITVTINAG 267

Query:   265 SSDFQFYSEGVFTGE---CGTEL-NHGVAAVGYGTT--LDGTK--------YWIVRNSWG 310
                 Q Y  GV       C   L NH V  VG+G +  ++G +        YWI++NSWG
Sbjct:   268 L--LQQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGRRPRPGHSIPYWILKNSWG 325

Query:   311 PEWGEKGYIRMQRGISDKKGLCGIAMEASYPIKKSATNP 349
             P+WGE+GY R+ RG       CGI     YP+      P
Sbjct:   326 PDWGEEGYFRLHRG----SNTCGIT---KYPVTARVDKP 357


>UNIPROTKB|F1PGK4
            symbol:CTSO "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 OMA:SNVCGIA
            EMBL:AAEX03010073 Ensembl:ENSCAFT00000013638 Uniprot:F1PGK4
        Length = 316

 Score = 357 (130.7 bits), Expect = 1.1e-32, P = 1.1e-32
 Identities = 86/261 (32%), Positives = 130/261 (49%)

Query:    83 LNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVK 142
             +N+F+ ++  EF + Y  SK      +            +  S+P   DWR K  VT V+
Sbjct:    64 INQFSYLSPEEFKAIYLRSKPSRSPRYPAEVRTSI----RNVSLPLRFDWRDKRVVTQVR 119

Query:   143 DQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIK 202
             +Q  CG CWAFS + AVE    I    L  +S Q+++DC  + N GC+GG    A  ++ 
Sbjct:   120 NQQTCGGCWAFSVVGAVESAYAIKGKPLADISVQQVIDCSYN-NYGCSGGSTLNALNWLN 178

Query:   203 K-KGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVP-ANHEDALLKAVAK-QPVSV 259
             K +  +  +++YP++A +G C    +S    SI G+     ++ ED + K +    P+ V
Sbjct:   179 KTQVKLVRDSEYPFKAQNGLCHYFSDSYSGFSIRGYSAYDFSDQEDEMAKVLLTFGPLVV 238

Query:   260 AIDAGSSDFQFYSEGVFTGECGT-ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGY 318
              +DA S  +Q Y  G+    C + E NH V   G+   +  T YWIVRNSWG  WG  GY
Sbjct:   239 VVDAVS--WQDYLGGIIQHHCSSGEANHAVLITGFDK-IGSTPYWIVRNSWGSSWGVDGY 295

Query:   319 IRMQRGISDKKGLCGIAMEAS 339
               ++ G      +CGIA   S
Sbjct:   296 AHVKMG----GNICGIADSVS 312


>UNIPROTKB|H0YD65
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 EMBL:AP002748
            HGNC:HGNC:2531 ChiTaRS:CTSF Ensembl:ENST00000524994 Uniprot:H0YD65
        Length = 283

 Score = 349 (127.9 bits), Expect = 7.7e-32, P = 7.7e-32
 Identities = 86/233 (36%), Positives = 125/233 (53%)

Query:    47 TVSRSLDEKHKRF--NVFKQNVMHVHQTNKMDK-PYKLKLNKFADMTNHEFASTYAGSKI 103
             T +R+ + K  R+  +VF  N++   +   +D+   +  + KF+D+T  EF + Y  + +
Sbjct:    42 TYNRTYESKEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTIYLNTLL 101

Query:   104 KHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGIN 163
             +      G +       G +   PP  DWR KG+VT VKDQG CGSCWAFS    VEG  
Sbjct:   102 RKE---PGNKMKQAKSVGDLA--PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQW 156

Query:   164 HIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCD 223
              +    L+SLSEQEL+DCD   ++ C GGL   A+  IK  GG+ TE  Y YQ +  +C+
Sbjct:   157 FLNQGTLLSLSEQELLDCDK-MDKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCN 215

Query:   224 VSKESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGV 275
              S E +     D  E   + +E  L   +AK+ P+SVAI+A     QFY  G+
Sbjct:   216 FSAEKAKVYINDSVEL--SQNEQKLAAWLAKRGPISVAINAFG--MQFYRHGI 264


>UNIPROTKB|F1MHV4
            symbol:CTSW "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 OMA:GRCGDGC EMBL:DAAA02063574
            IPI:IPI00716321 Ensembl:ENSBTAT00000027681 Uniprot:F1MHV4
        Length = 375

 Score = 254 (94.5 bits), Expect = 7.8e-32, Sum P(2) = 7.8e-32
 Identities = 72/251 (28%), Positives = 121/251 (48%)

Query:    54 EKHKRFNVFKQNVMHVHQTNKMDK-PYKLKLNKFADMTNHEFASTYAGSKIKHHRMFQGT 112
             E  +R ++F QN+    +  + D    +  + +F+D+T  EF   Y GS++    +   +
Sbjct:    58 EYARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDLTEEEFVQLY-GSQVAGEALGV-S 115

Query:   113 RGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVS 172
             R  G+  +G+  S P + DWRK G+++ V+DQ  C  CWA +    +E +  I     V 
Sbjct:   116 RKVGSEEWGE--SEPQTCDWRKVGTISPVRDQRNCNCCWAMAAAGNIEALWAIKFRHFVE 173

Query:   173 LSEQ-ELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGT--CDVSKESS 229
             +S Q EL+DCD   N GC GG +  AF  +    G+ +E  YP+  +  T  C ++K+  
Sbjct:   174 VSVQPELLDCDRCGN-GCRGGFVWDAFLTVLNNSGLASEKDYPFNGSGKTHRC-LAKKYK 231

Query:   230 PAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTGE---CG-TELN 285
                 I     + A  +        + P++V I+   +  Q Y +GV       C  T+++
Sbjct:   232 KVAWIQDFIILQACEQSMARHLATEGPITVTINM--TLLQQYQKGVIKATPTTCDPTQVD 289

Query:   286 HGVAAVGYGTT 296
             H V  VG+G T
Sbjct:   290 HSVLLVGFGKT 300

 Score = 121 (47.7 bits), Expect = 7.8e-32, Sum P(2) = 7.8e-32
 Identities = 21/48 (43%), Positives = 28/48 (58%)

Query:   302 YWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPIKKSATNP 349
             YWI++NSWGP+WGE+GY R+ RG       CGI     +P+      P
Sbjct:   325 YWILKNSWGPQWGEEGYFRLHRG----SNTCGIT---KFPVTARVDKP 365


>UNIPROTKB|Q5T8F0
            symbol:CTSL1 "Cathepsin L1 light chain" species:9606 "Homo
            sapiens" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            EMBL:AL160279 UniGene:Hs.731507 UniGene:Hs.731952 HGNC:HGNC:2537
            ChiTaRS:CTSL1 IPI:IPI00640540 SMR:Q5T8F0 Ensembl:ENST00000342020
            ChEMBL:CHEMBL1293261 Uniprot:Q5T8F0
        Length = 225

 Score = 347 (127.2 bits), Expect = 1.3e-31, P = 1.3e-31
 Identities = 73/184 (39%), Positives = 108/184 (58%)

Query:    39 YERWRSHHTVSRSLDEKHKRFNVFKQNV----MHVHQTNKMDKPYKLKLNKFADMTNHEF 94
             + +W++ H     ++E+  R  V+++N+    +H  +  +    + + +N F DMT+ EF
Sbjct:    29 WTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEF 88

Query:    95 ASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFS 154
                  G + +  R     +G   F        P SVDWR+KG VT VK+QGQCGSCWAFS
Sbjct:    89 RQVMNGFQNRKPR-----KGK-VFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFS 142

Query:   155 TIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVTTEAKY 213
                A+EG     T +L+SLSEQ LVDC   Q N+GCNGGLM+ AF++++  GG+ +E  Y
Sbjct:   143 ATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESY 202

Query:   214 PYQA 217
             PY+A
Sbjct:   203 PYEA 206


>DICTYBASE|DDB_G0276111
            symbol:DDB_G0276111 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0276111 Pfam:PF00188
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            PROSITE:PS00139 EMBL:AAFI02000014 Gene3D:3.40.33.10
            InterPro:IPR001283 InterPro:IPR014044 PRINTS:PR00837 SMART:SM00198
            SUPFAM:SSF55797 ProtClustDB:CLSZ2429919 RefSeq:XP_643261.1
            ProteinModelPortal:Q75JH0 EnsemblProtists:DDB0169514 GeneID:8620304
            KEGG:ddi:DDB_G0276111 InParanoid:Q75JH0 OMA:GFVTSIK Uniprot:Q75JH0
        Length = 415

 Score = 341 (125.1 bits), Expect = 5.4e-31, P = 5.4e-31
 Identities = 77/219 (35%), Positives = 118/219 (53%)

Query:   124 TSIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEG---INHIMTNKLVSLSEQELVD 180
             TS    VDW+  G VT++K+QGQCG C++F+T AA+E    I + + N  + LSEQ  V 
Sbjct:   207 TSSTGDVDWKSLGFVTSIKNQGQCGGCYSFATCAALESAYLIKNNLPNTDIDLSEQNFVS 266

Query:   181 CDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENV 240
             C    N GC GG  +   + +K  G +  E  YPY+A  G+C    +S       G+ N+
Sbjct:   267 C---VNYGCGGGNGQSCLDKLKSTG-IMYETSYPYKAVTGSCPNVIQSPQPFKWTGYSNI 322

Query:   241 PANHEDALLKAVAKQPV--SVAIDAGSSDFQFYSEGVFTGECGTELNHGVAAVGYGTTLD 298
               N E A L A+   P+  S+ +D+G   FQ Y  G+++    +  NH +  VGY +  D
Sbjct:   323 QGNKE-AFLNALKSGPIYASLYVDSG---FQLYKSGIYSCSQSSTPNHAITIVGYSSA-D 377

Query:   299 GTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAME 337
              +  ++++NSWG  +GE GYIR++ G  +     GI  +
Sbjct:   378 NS--YLIKNSWGTIYGESGYIRLKEGSCNLYSFTGITTQ 414


>MGI|MGI:2139628
            symbol:Ctso "cathepsin O" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:2139628 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 GeneTree:ENSGT00560000076599 MEROPS:C01.035 CTD:1519
            HOVERGEN:HBG105050 KO:K01374 OMA:SNVCGIA OrthoDB:EOG4V6ZH1
            EMBL:AK034490 EMBL:AK049470 EMBL:AK165930 EMBL:AK166103
            EMBL:BC044664 IPI:IPI00453524 RefSeq:NP_808330.1 UniGene:Mm.254642
            ProteinModelPortal:Q8BM88 SMR:Q8BM88 STRING:Q8BM88
            PhosphoSite:Q8BM88 PRIDE:Q8BM88 Ensembl:ENSMUST00000029649
            GeneID:229445 KEGG:mmu:229445 UCSC:uc008pon.1 InParanoid:Q8BM88
            NextBio:379433 Bgee:Q8BM88 CleanEx:MM_CTSO Genevestigator:Q8BM88
            GermOnline:ENSMUSG00000028015 Uniprot:Q8BM88
        Length = 312

 Score = 340 (124.7 bits), Expect = 6.9e-31, P = 6.9e-31
 Identities = 92/298 (30%), Positives = 140/298 (46%)

Query:    46 HTVSRSLDEKHKRFNVFKQNVMHVHQ-TNKMDKPYKLK---LNKFADMTNHEFASTYAGS 101
             H V+ +    H+R     +  +H H+  N            +N+F+ +   EF + Y GS
Sbjct:    19 HGVAGTWSWSHQREAAALRESLHRHRYLNSFPHENSTAFYGVNQFSYLFPEEFKALYLGS 78

Query:   102 KIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEG 161
             K      +      G      V S+P   DWR K  V  V++Q  CG CWAFS ++A+E 
Sbjct:    79 KYAWAPRYPA---EGQRPIPNV-SLPLRFDWRDKHVVNPVRNQEMCGGCWAFSVVSAIES 134

Query:   162 INHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKG-GVTTEAKYPYQANDG 220
                I    L  LS Q+++DC  + N GC GG    A  ++ +    +  +++YP++A +G
Sbjct:   135 ARAIQGKSLDYLSVQQVIDCSFN-NSGCLGGSPLCALRWLNETQLKLVADSQYPFKAVNG 193

Query:   221 TCDVSKESSPAVSIDGHENVP-ANHEDALLKAVAK-QPVSVAIDAGSSDFQFYSEGVFTG 278
              C    +S   VS+           ED + +A+    P+ V +DA S  +Q Y  G+   
Sbjct:   194 QCRHFPQSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMS--WQDYLGGIIQH 251

Query:   279 ECGT-ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIA 335
              C + E NH V   G+  T   T YW+VRNSWG  WG +GY  ++ G      +CGIA
Sbjct:   252 HCSSGEANHAVLITGFDRT-GNTPYWMVRNSWGSSWGVEGYAHVKMG----GNVCGIA 304


>WB|WBGene00008861
            symbol:F15D4.4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 SMART:SM00848 PROSITE:PS00640
            GeneTree:ENSGT00560000076599 EMBL:Z80344 HSSP:P53634
            eggNOG:NOG310593 PIR:T20981 ProteinModelPortal:Q93512 SMR:Q93512
            MEROPS:C01.A45 EnsemblMetazoa:F15D4.4 KEGG:cel:CELE_F15D4.4
            UCSC:F15D4.4 CTD:184530 WormBase:F15D4.4 InParanoid:Q93512
            OMA:ITMEQNI NextBio:925068 Uniprot:Q93512
        Length = 608

 Score = 341 (125.1 bits), Expect = 2.4e-30, P = 2.4e-30
 Identities = 97/289 (33%), Positives = 138/289 (47%)

Query:    57 KRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNG 116
             KRFNV+ +    V + N M   Y+L ++ +   TN +F+    G                
Sbjct:   153 KRFNVYSKVKKEVDEHNIM---YELGMSSYKMSTN-QFSVALDGEVAPLTLNLDALTPTA 208

Query:   117 TFMYGKVTS-----IPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLV 171
             T +   ++S       P+VDWR    +  + DQ  CG CWAFS I+ +E    I      
Sbjct:   209 TVIPATISSRKKRDTEPTVDWRP--FLKPILDQSTCGGCWAFSMISMIESFFAIQGYNTS 266

Query:   172 SLSEQELVDCDT--DQ-----NQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDV 224
             SLS Q+L+ CDT  D      N GC GG  ++A  +++        +  P+   D +CD 
Sbjct:   267 SLSVQQLLTCDTKVDSTYGLANVGCKGGYFQIAGSYLEVSAA-RDASLIPFDLEDTSCDS 325

Query:   225 S--KESSPAVSI--DGH--ENVPANH----EDALLKAVAKQPVSVAIDAGSSDFQFYSEG 274
             S      P + +  DG+   N  A      E  +   V K P++V + AG  D   YSEG
Sbjct:   326 SFFPPVVPTILLFDDGYISGNFTAAQLITMEQNIEDKVRKGPIAVGMAAGP-DIYKYSEG 384

Query:   275 VFTGECGTELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQR 323
             V+ G+CGT +NH V  VG+  T D   YWI+RNSWG  WGE GY R++R
Sbjct:   385 VYDGDCGTIINHAVVIVGF--TDD---YWIIRNSWGASWGEAGYFRVKR 428


>WB|WBGene00011102
            symbol:R07E3.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            EMBL:Z49207 HSSP:P53634 PIR:T24030 RefSeq:NP_001041280.1
            ProteinModelPortal:Q21810 SMR:Q21810 STRING:Q21810 MEROPS:C01.A43
            PaxDb:Q21810 EnsemblMetazoa:R07E3.1a GeneID:181242
            KEGG:cel:CELE_R07E3.1 UCSC:R07E3.1a CTD:181242 WormBase:R07E3.1a
            HOGENOM:HOG000021028 InParanoid:Q21810 OMA:ACKNEVI NextBio:913066
            ArrayExpress:Q21810 Uniprot:Q21810
        Length = 402

 Score = 333 (122.3 bits), Expect = 3.8e-30, P = 3.8e-30
 Identities = 101/309 (32%), Positives = 146/309 (47%)

Query:    43 RSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTYAGSK 102
             +S+ T   SL   +  +N   +N+ + +  N+     +   N  +D T+ EF  T     
Sbjct:    99 KSYATSQESLKRLNAYYNT-DENIANWNIQNEHGSA-EYGHNDMSDWTDEEFEKTLLPKS 156

Query:   103 I-----KHHRMFQGTRGNGTFMYGKVTS-IPPSVDWRKKGSVTAVKDQGQCGSCWAFSTI 156
                   K     +    + T   G+ +S  P   DWR K  +T VK QGQCGSCWAF++ 
Sbjct:   157 FYKRLHKEAEFIEPIPESLTAKKGESSSPFPDFFDWRDKNVITPVKAQGQCGSCWAFAST 216

Query:   157 AAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQ 216
             A VE    I   +  +LSEQ L+DCD   N  C+GG  + AF +I + G +      PY 
Sbjct:   217 ATVEAAWAIAHGEKRNLSEQTLLDCDLVDN-ACDGGDEDKAFRYIHRNG-LANAVDLPYV 274

Query:   217 AN-DGTCDVSKESSPAVSIDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSDFQFYSEG 274
             A+    C V+   +    I     +  + ED+++  +    PV++ + A     + Y  G
Sbjct:   275 AHRQNGCAVNDHWN-TTRIKAAYFLH-HDEDSIINWLVNFGPVNIGM-AVIQPMRAYKGG 331

Query:   275 VFTGE---CGTELN--HGVAAVGYGTTLDGTKYWIVRNSWGPEWG-EKGYIRMQRGISDK 328
             VFT     C  E+   H +   GYGT+  G KYWIV+NSWG  WG E GYI   RGI+  
Sbjct:   332 VFTPSEYACKNEVIGLHALLITGYGTSKTGEKYWIVKNSWGNTWGVEHGYIYFARGIN-- 389

Query:   329 KGLCGIAME 337
                CGI  E
Sbjct:   390 --ACGIEDE 396


>MGI|MGI:1338045
            symbol:Ctsw "cathepsin W" species:10090 "Mus musculus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 MGI:MGI:1338045 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 MEROPS:C01.037
            CTD:1521 eggNOG:NOG288820 HOVERGEN:HBG100117 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 EMBL:AF014941 EMBL:AC122861 IPI:IPI00111727
            RefSeq:NP_034115.2 UniGene:Mm.113590 ProteinModelPortal:P56203
            SMR:P56203 PhosphoSite:P56203 PRIDE:P56203 DNASU:13041
            Ensembl:ENSMUST00000025844 GeneID:13041 KEGG:mmu:13041
            InParanoid:P56203 NextBio:282936 Bgee:P56203 CleanEx:MM_CTSW
            Genevestigator:P56203 GermOnline:ENSMUSG00000024910 Uniprot:P56203
        Length = 371

 Score = 249 (92.7 bits), Expect = 4.8e-30, Sum P(2) = 4.8e-30
 Identities = 71/255 (27%), Positives = 123/255 (48%)

Query:    54 EKHKRFNVFKQNVMHVHQTNKMDK-PYKLKLNKFADMTNHEFASTYAGSKIKHHRMFQGT 112
             E  +R ++F  N+    +  + D    +     F+D+T  EF   Y G +    R    T
Sbjct:    56 EYTRRLSIFAHNLAQAQRLQQEDLGTAEFGETPFSDLTEEEFGQLY-GQERSPERTPNMT 114

Query:   113 RGNGTFMYGKVTSIPPSVDWRK-KGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLV 171
             +   +  +G+  S+P + DWRK K  +++VK+QG C  CWA +    ++ +  I   + V
Sbjct:   115 KKVESNTWGE--SVPRTCDWRKAKNIISSVKNQGSCKCCWAMAAADNIQALWRIKHQQFV 172

Query:   172 SLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQAN--DGTCDVSKESS 229
              +S QEL+DC+   N GCNGG +  A+  +    G+ +E  YP+Q +     C ++K+  
Sbjct:   173 DVSVQELLDCERCGN-GCNGGFVWDAYLTVLNNSGLASEKDYPFQGDRKPHRC-LAKKYK 230

Query:   230 PAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFTG---ECGT-EL 284
                 I     + +N+E A+   +A   P++V I+      Q Y +GV       C   ++
Sbjct:   231 KVAWIQDFTML-SNNEQAIAHYLAVHGPITVTINMKL--LQHYQKGVIKATPSSCDPRQV 287

Query:   285 NHGVAAVGYGTTLDG 299
             +H V  VG+G   +G
Sbjct:   288 DHSVLLVGFGKEKEG 302

 Score = 118 (46.6 bits), Expect = 4.8e-30, Sum P(2) = 4.8e-30
 Identities = 22/52 (42%), Positives = 30/52 (57%)

Query:   302 YWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAM-----EASYPIKKSATN 348
             YWI++NSWG  WGEKGY R+ RG       CG+       +   P+KK+ T+
Sbjct:   321 YWILKNSWGAHWGEKGYFRLYRG----NNTCGVTKYPFTAQVDSPVKKARTS 368


>UNIPROTKB|O97578
            symbol:CTSC "Dipeptidyl peptidase 1" species:9615 "Canis
            lupus familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 EMBL:AF060171 RefSeq:NP_001182763.1
            UniGene:Cfa.28653 ProteinModelPortal:O97578 SMR:O97578
            MEROPS:C01.070 PRIDE:O97578 GeneID:403458 KEGG:cfa:403458
            InParanoid:O97578 NextBio:20816976 Uniprot:O97578
        Length = 435

 Score = 332 (121.9 bits), Expect = 4.9e-30, P = 4.9e-30
 Identities = 101/311 (32%), Positives = 154/311 (49%)

Query:    52 LDEKHKRFNVFKQNVMHVHQTNKMDKPYKL-KLNKFADMTNHEFASTYAGSKIKHHRMFQ 110
             L E +    ++K N   V   N + K +   +  ++  +T  +  +   G KI   +   
Sbjct:   134 LQENNSN-RLYKYNYEFVKAINTIQKSWTATRYIEYETLTLRDMMTRVGGRKIPRPKPTP 192

Query:   111 GTRGNGTFMYGKVTSIPPSVDWRK-KGS--VTAVKDQGQCGSCWAFSTIAAVEGINHIMT 167
              T      ++ +++ +P S DWR  +G+  V+ V++Q  CGSC+AF++ A +E    I+T
Sbjct:   193 LTAE----IHEEISRLPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTAMLEARIRILT 248

Query:   168 NKLVS--LSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTC--- 222
             N   +  LS QE+V C +   QGC GG   L      +  G+  EA +PY  +D  C   
Sbjct:   249 NNTQTPILSPQEIVSC-SQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYAGSDSPCKPN 307

Query:   223 DVSKE-SSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVF--TGE 279
             D  +  SS    + G      N     L+ V   P++VA +    DF  Y +G++  TG 
Sbjct:   308 DCFRYYSSEYYYVGGFYGA-CNEALMKLELVRHGPMAVAFEV-YDDFFHYQKGIYYHTGL 365

Query:   280 CGT----EL-NHGVAAVGYGT-TLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCG 333
                    EL NH V  VGYGT +  G  YWIV+NSWG  WGE GY R++RG +D+  +  
Sbjct:   366 RDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRG-TDECAIES 424

Query:   334 IAMEASYPIKK 344
             IA+ A+ PI K
Sbjct:   425 IAVAAT-PIPK 434


>UNIPROTKB|F1NWG2
            symbol:CTSC "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 InterPro:IPR014882 Pfam:PF08773
            OMA:YDDFLHY GO:GO:0001913 EMBL:AADN02004805 IPI:IPI00577371
            Ensembl:ENSGALT00000027869 Uniprot:F1NWG2
        Length = 463

 Score = 328 (120.5 bits), Expect = 1.3e-29, P = 1.3e-29
 Identities = 91/242 (37%), Positives = 133/242 (54%)

Query:   122 KVTSIPPSVDWRKKGSV---TAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVS--LSEQ 176
             KV+ +P S DWR    V   + V++Q  CGSC+AF+++  +E    I+TN       S Q
Sbjct:   227 KVSGLPESWDWRNVNGVNYVSPVRNQASCGSCYAFASMGMLEARIRILTNNTQKPVFSPQ 286

Query:   177 ELVDCDTDQNQGCNGGLMEL-AFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSID 235
             ++V C +  +QGC+GG   L A ++++  G V  E  +PY A D  C + K S       
Sbjct:   287 QVVSC-SQYSQGCDGGFPYLIAGKYVQDFG-VVEEDCFPYTAKDTPC-LFKRSCYHYYTS 343

Query:   236 GHENVPANH---EDALLKA--VAKQPVSVAIDAGSSDFQFYSEGVF--TG---ECGT-EL 284
              +  V   +    +AL+K   V   P++VA +   +DF FY EG++  TG   E    EL
Sbjct:   344 EYHYVGGFYGACNEALMKLELVLSGPMAVAFEV-YNDFMFYKEGIYHHTGLKDEFNPFEL 402

Query:   285 -NHGVAAVGYGTTLD-GTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPI 342
              NH V  VGYG   + G K+WIV+NSWG  WGE GY R++RG +D+  +  IA+ A+ PI
Sbjct:   403 TNHAVLLVGYGKDPESGEKFWIVKNSWGTSWGEDGYFRIRRG-TDECAIESIAVAAT-PI 460

Query:   343 KK 344
              K
Sbjct:   461 PK 462


>ZFIN|ZDB-GENE-030619-9
            symbol:ctsc "cathepsin C" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030619-9 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 MEROPS:C01.070 HSSP:P43235
            EMBL:BC064286 IPI:IPI00486570 RefSeq:NP_999887.1 UniGene:Dr.32463
            ProteinModelPortal:Q6P2V1 SMR:Q6P2V1 PRIDE:Q6P2V1 GeneID:368704
            KEGG:dre:368704 InParanoid:Q6P2V1 NextBio:20813127
            ArrayExpress:Q6P2V1 Bgee:Q6P2V1 Uniprot:Q6P2V1
        Length = 455

 Score = 327 (120.2 bits), Expect = 1.6e-29, P = 1.6e-29
 Identities = 96/302 (31%), Positives = 149/302 (49%)

Query:    62 FKQNVMHVHQTNKMDKPYKLKLNKFAD-MTNHEFASTYAGSKIKHHRMFQGTRGNGTFMY 120
             +  N+M V + N + K +      F + ++ HE      G      R+ +  R       
Sbjct:   161 YTNNMMFVDEINSVQKSWTATAYSFHETLSIHEMLRRSGGPA---SRIPRRVRPVTVAAD 217

Query:   121 GKVTS-IPPSVDWRKKGSV---TAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVS--LS 174
              K  S +P   DWR    V   + V++Q QCGSC++F+T+  +E    I TN       S
Sbjct:   218 SKAASGLPQHWDWRNVNGVNFVSPVRNQAQCGSCYSFATMGMLEARVRIQTNNTQQPVFS 277

Query:   175 EQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSI 234
              Q++V C +  +QGC+GG   L  ++I+  G +  E  +PY  +D  C++  + +   + 
Sbjct:   278 PQQVVSC-SQYSQGCDGGFPYLIGKYIQDFG-IVEEDCFPYTGSDSPCNLPAKCTKYYAS 335

Query:   235 DGHEN---VPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVF--TG--ECGT--EL 284
             D H          E A++  + K  P+ VA++    DF  Y EG++  TG  +     EL
Sbjct:   336 DYHYVGGFYGGCSESAMMLELVKNGPMGVALEV-YPDFMNYKEGIYHHTGLRDANNPFEL 394

Query:   285 -NHGVAAVGYGTT-LDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPI 342
              NH V  VGYG     G KYWIV+NSWG  WGE G+ R++RG +D+  +  IA+ A+ PI
Sbjct:   395 TNHAVLLVGYGQCHKTGEKYWIVKNSWGSGWGENGFFRIRRG-TDECAIESIAVAAT-PI 452

Query:   343 KK 344
              K
Sbjct:   453 PK 454


>UNIPROTKB|F1STR1
            symbol:CTSC "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0004252
            "serine-type endopeptidase activity" evidence=IEA] [GO:0001913 "T
            cell mediated cytotoxicity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 KO:K01275 InterPro:IPR014882
            Pfam:PF08773 OMA:YDDFLHY GO:GO:0001913 EMBL:CU855751
            RefSeq:XP_003129789.1 UniGene:Ssc.6155 Ensembl:ENSSSCT00000016280
            GeneID:100522387 KEGG:ssc:100522387 Uniprot:F1STR1
        Length = 463

 Score = 324 (119.1 bits), Expect = 3.4e-29, P = 3.4e-29
 Identities = 90/240 (37%), Positives = 126/240 (52%)

Query:   122 KVTSIPPSVDWRK-KGS--VTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVS--LSEQ 176
             K   +P S DWR  +G+  VT V++Q  CGSC++F+++  +E    I+TN   +  LS Q
Sbjct:   227 KSLHLPASWDWRNVRGTNFVTPVRNQASCGSCYSFASMGMMEARIRILTNNTQTPILSPQ 286

Query:   177 ELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDG 236
             E+V C +   QGC GG   L      +  G+  EA +PY   D  C V +      S + 
Sbjct:   287 EVVSC-SQYAQGCAGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCTVKEGCFRYYSSEY 345

Query:   237 HE--NVPANHEDALLKA--VAKQPVSVAIDAGSSDFQFYSEGVF--TGECGT----EL-N 285
             H          +AL+K   V   P++VA +    DF  Y +G++  TG        EL N
Sbjct:   346 HYVGGFYGGCNEALMKLELVHHGPMAVAFEV-YDDFLHYRKGIYHHTGLRDPFNPFELTN 404

Query:   286 HGVAAVGYGTTL-DGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPIKK 344
             H V  VGYGT L  G  YWIV+NSWG  WGE GY R++RG +D+  +  IA+ A+ PI K
Sbjct:   405 HAVLLVGYGTDLASGMDYWIVKNSWGTSWGEDGYFRIRRG-TDECAIESIAVAAT-PIPK 462


>UNIPROTKB|F1P0K2
            symbol:CTSO "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            OMA:SNVCGIA EMBL:AADN02016534 IPI:IPI00651180
            Ensembl:ENSGALT00000015270 Uniprot:F1P0K2
        Length = 320

 Score = 323 (118.8 bits), Expect = 4.4e-29, P = 4.4e-29
 Identities = 74/220 (33%), Positives = 114/220 (51%)

Query:   121 GKVTSIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVD 180
             G+   +P   DWR K  +  V++Q  CG CWAFS +  +E    I  + L  LS Q+++D
Sbjct:   102 GEEKPLPKKFDWRDKKVIAEVRNQQTCGGCWAFSVVGGIESAYAIKGHNLEELSVQQVID 161

Query:   181 CDTDQNQGCNGGLMELAFEFIKK-KGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHEN 239
             C    N GC+GG    A  ++ + K  +  +++Y ++A  G C     S   VSI G   
Sbjct:   162 CSYS-NYGCSGGSTITALSWLNQTKVKLVRDSEYTFKAQTGLCHYFPHSDFGVSITGFAA 220

Query:   240 VP-ANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFTGECGT-ELNHGVAAVGYGTT 296
                +  E+ +++ +    P++V +DA S  +Q Y  G+    C + + NH V   G+ TT
Sbjct:   221 YDFSGQEEEMMRVLVDWGPLAVTVDAVS--WQDYLGGIIQYHCSSGKANHAVLITGFDTT 278

Query:   297 LDGT-KYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIA 335
               G   YWIV+NSWG  WG  GY+R++ G      +CGIA
Sbjct:   279 --GIIPYWIVQNSWGRTWGIDGYVRVKIG----SNVCGIA 312


>UNIPROTKB|F1N455
            symbol:CTSC "Dipeptidyl peptidase 1 exclusion domain chain"
            species:9913 "Bos taurus" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 IPI:IPI00697314 UniGene:Bt.49573
            InterPro:IPR014882 Pfam:PF08773 OMA:YDDFLHY GO:GO:0001913
            EMBL:DAAA02062487 EMBL:DAAA02062488 Ensembl:ENSBTAT00000014735
            Uniprot:F1N455
        Length = 463

 Score = 322 (118.4 bits), Expect = 5.6e-29, P = 5.6e-29
 Identities = 88/240 (36%), Positives = 124/240 (51%)

Query:   122 KVTSIPPSVDWRKKGS---VTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVS--LSEQ 176
             K+  +P S DWR       VT V++QG CGSC++F+++  +E    I+TN   +  LS Q
Sbjct:   227 KILHLPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQ 286

Query:   177 ELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDG 236
             E+V C +   QGC GG   L      +  G+  E  +PY   D  C + +      S + 
Sbjct:   287 EVVSC-SQYAQGCEGGFPYLIAGKYAQDFGLVEEDCFPYTGTDSPCRLKEGCFRYYSSEY 345

Query:   237 HE--NVPANHEDALLKA--VAKQPVSVAIDAGSSDFQFYSEGVF--TGECGT----EL-N 285
             H          +AL+K   V + P++VA +    DF  Y +GV+  TG        EL N
Sbjct:   346 HYVGGFYGGCNEALMKLELVHQGPMAVAFEV-YDDFLHYRKGVYHHTGLRDPFNPFELTN 404

Query:   286 HGVAAVGYGT-TLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPIKK 344
             H V  VGYGT    G  YWIV+NSWG  WGE GY R++RG +D+  +  IA+ A+ PI K
Sbjct:   405 HAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRG-TDECAIESIALAAT-PIPK 462


>UNIPROTKB|Q3ZCJ8
            symbol:CTSC "Dipeptidyl peptidase 1" species:9913 "Bos
            taurus" [GO:0031638 "zymogen activation" evidence=IDA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 EMBL:BC102115 IPI:IPI00697314 RefSeq:NP_001028789.1
            UniGene:Bt.49573 ProteinModelPortal:Q3ZCJ8 SMR:Q3ZCJ8 STRING:Q3ZCJ8
            PRIDE:Q3ZCJ8 GeneID:352958 KEGG:bta:352958 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 InParanoid:Q3ZCJ8 KO:K01275
            OrthoDB:EOG4H19VZ BindingDB:Q3ZCJ8 ChEMBL:CHEMBL1075050
            NextBio:20812686 GO:GO:0031638 InterPro:IPR014882 Pfam:PF08773
            Uniprot:Q3ZCJ8
        Length = 463

 Score = 322 (118.4 bits), Expect = 5.6e-29, P = 5.6e-29
 Identities = 88/240 (36%), Positives = 124/240 (51%)

Query:   122 KVTSIPPSVDWRKKGS---VTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVS--LSEQ 176
             K+  +P S DWR       VT V++QG CGSC++F+++  +E    I+TN   +  LS Q
Sbjct:   227 KILHLPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQ 286

Query:   177 ELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDG 236
             E+V C +   QGC GG   L      +  G+  E  +PY   D  C + +      S + 
Sbjct:   287 EVVSC-SQYAQGCEGGFPYLIAGKYAQDFGLVEEDCFPYTGTDSPCRLKEGCFRYYSSEY 345

Query:   237 HE--NVPANHEDALLKA--VAKQPVSVAIDAGSSDFQFYSEGVF--TGECGT----EL-N 285
             H          +AL+K   V + P++VA +    DF  Y +GV+  TG        EL N
Sbjct:   346 HYVGGFYGGCNEALMKLELVHQGPMAVAFEV-YDDFLHYRKGVYHHTGLRDPFNPFELTN 404

Query:   286 HGVAAVGYGT-TLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPIKK 344
             H V  VGYGT    G  YWIV+NSWG  WGE GY R++RG +D+  +  IA+ A+ PI K
Sbjct:   405 HAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRG-TDECAIESIALAAT-PIPK 462


>UNIPROTKB|F1PSK8
            symbol:F1PSK8 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 InterPro:IPR014882
            Pfam:PF08773 EMBL:AAEX03012741 Ensembl:ENSCAFT00000007054
            Uniprot:F1PSK8
        Length = 405

 Score = 321 (118.1 bits), Expect = 7.1e-29, P = 7.1e-29
 Identities = 101/312 (32%), Positives = 154/312 (49%)

Query:    52 LDEKHKRFNVFKQNVMHVHQTNKMDKPYKL-KLNKFADMTNHEFASTYAGSKIKHHRMFQ 110
             L E +    ++K N   V   N + K +   +  ++  +T  +  +   G KI   +   
Sbjct:   103 LQENNSN-RLYKYNYEFVKAINTIQKSWTATRYIEYETLTLRDMMTRGGGRKIPRPKPTP 161

Query:   111 GTRGNGTFMYGKVTSIPPSVDWRK-KGS--VTAVKDQG-QCGSCWAFSTIAAVEGINHIM 166
              T      ++ +++ +P S DWR  +G+  V+ V++Q   CGSC+AF++ A +E    I+
Sbjct:   162 LTAE----IHEEISRLPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRIL 217

Query:   167 TNKLVS--LSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTC-- 222
             TN   +  LS QE+V C +   QGC GG   L      +  G+  EA +PY  +D  C  
Sbjct:   218 TNNTQTPILSPQEIVSC-SQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYAGSDSPCKP 276

Query:   223 -DVSKE-SSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVF--TG 278
              D  +  SS    + G      N     L+ V   P++VA +    DF  Y +G++  TG
Sbjct:   277 NDCFRYYSSEYYYVGGFYGA-CNEALMKLELVRHGPMAVAFEV-YDDFFHYQKGIYYHTG 334

Query:   279 ECGT----EL-NHGVAAVGYGT-TLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLC 332
                     EL NH V  VGYGT +  G  YWIV+NSWG  WGE GY R++RG +D+  + 
Sbjct:   335 LRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRG-TDECAIE 393

Query:   333 GIAMEASYPIKK 344
              IA+ A+ PI K
Sbjct:   394 SIAVAAT-PIPK 404


>UNIPROTKB|J9P219
            symbol:J9P219 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 InterPro:IPR014882
            Pfam:PF08773 OMA:YDDFLHY EMBL:AAEX03012741
            Ensembl:ENSCAFT00000050015 Uniprot:J9P219
        Length = 406

 Score = 319 (117.4 bits), Expect = 1.2e-28, P = 1.2e-28
 Identities = 100/303 (33%), Positives = 152/303 (50%)

Query:    61 VFKQNVMHVHQTNKMDKPYKL-KLNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFM 119
             ++K N   V   N + K +   +  ++  +T  +  +   G KI   R  + T      +
Sbjct:   111 LYKYNYEFVKAINTIQKSWTATRYIEYETLTLRDMMTRGGGRKIP--RKPKPTPLTAE-I 167

Query:   120 YGKVTSIPPSVDWRK-KGS--VTAVKDQG-QCGSCWAFSTIAAVEGINHIMTNKLVS--L 173
             + +++ +P S DWR  +G+  V+ V++Q   CGSC+AF++ A +E    I+TN   +  L
Sbjct:   168 HEEISRLPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNNTQTPIL 227

Query:   174 SEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTC---DVSKE-SS 229
             S QE+V C +   QGC GG   L      +  G+  EA +PY  +D  C   D  +  SS
Sbjct:   228 SPQEIVSC-SQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYAGSDSPCKPNDCFRYYSS 286

Query:   230 PAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVF--TGECGT----E 283
                 + G      N     L+ V   P++VA +    DF  Y +G++  TG        E
Sbjct:   287 EYYYVGGFYGA-CNEALMKLELVRHGPMAVAFEV-YDDFFHYQKGIYYHTGLRDPFNPFE 344

Query:   284 L-NHGVAAVGYGT-TLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYP 341
             L NH V  VGYGT +  G  YWIV+NSWG  WGE GY R++RG +D+  +  IA+ A+ P
Sbjct:   345 LTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRG-TDECAIESIAVAAT-P 402

Query:   342 IKK 344
             I K
Sbjct:   403 IPK 405


>UNIPROTKB|P53634
            symbol:CTSC "Dipeptidyl peptidase 1" species:9606 "Homo
            sapiens" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0005794
            "Golgi apparatus" evidence=IEA] [GO:0007568 "aging" evidence=IEA]
            [GO:0010033 "response to organic substance" evidence=IEA]
            [GO:0031404 "chloride ion binding" evidence=IEA] [GO:0042802
            "identical protein binding" evidence=IEA] [GO:0043621 "protein
            self-association" evidence=IEA] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0006508 "proteolysis" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0006955
            "immune response" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005783 GO:GO:0005794 Reactome:REACT_6900
            GO:GO:0006955 GO:GO:0007568 GO:GO:0010033 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004252 GO:GO:0005764 GO:GO:0031404 CTD:1075
            HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ InterPro:IPR014882
            Pfam:PF08773 MEROPS:C01.070 EMBL:X87212 EMBL:U79415 EMBL:AF234263
            EMBL:AF234264 EMBL:AF254757 EMBL:AF525032 EMBL:AF525033
            EMBL:AK292117 EMBL:AK311923 EMBL:AK223038 EMBL:BX537913
            EMBL:AC011088 EMBL:CH471185 EMBL:BC054028 EMBL:BC100891
            EMBL:BC100892 EMBL:BC100893 EMBL:BC100894 EMBL:BC109386
            EMBL:BC110071 EMBL:BC113850 EMBL:BC113897 IPI:IPI00022810
            IPI:IPI00171323 IPI:IPI00872258 PIR:S23941 PIR:S66504
            RefSeq:NP_001107645.1 RefSeq:NP_001805.3 RefSeq:NP_680475.1
            UniGene:Hs.128065 PDB:1K3B PDB:2DJF PDB:2DJG PDB:3PDF PDBsum:1K3B
            PDBsum:2DJF PDBsum:2DJG PDBsum:3PDF ProteinModelPortal:P53634
            SMR:P53634 IntAct:P53634 MINT:MINT-4655964 STRING:P53634
            PhosphoSite:P53634 DMDM:1705632 PaxDb:P53634 PRIDE:P53634
            DNASU:1075 Ensembl:ENST00000227266 Ensembl:ENST00000524463
            Ensembl:ENST00000529974 GeneID:1075 KEGG:hsa:1075 UCSC:uc001pck.4
            UCSC:uc001pcm.4 GeneCards:GC11M088026 HGNC:HGNC:2528 HPA:CAB025364
            MIM:170650 MIM:245000 MIM:245010 MIM:602365 neXtProt:NX_P53634
            Orphanet:2342 Orphanet:678 PharmGKB:PA27028 HOGENOM:HOG000127503
            InParanoid:P53634 OMA:YDDFLHY PhylomeDB:P53634
            BioCyc:MetaCyc:HS03265-MONOMER SABIO-RK:P53634 BindingDB:P53634
            ChEMBL:CHEMBL2252 EvolutionaryTrace:P53634 GenomeRNAi:1075
            NextBio:4488 PMAP-CutDB:P53634 ArrayExpress:P53634 Bgee:P53634
            Genevestigator:P53634 GermOnline:ENSG00000109861 GO:GO:0001913
            Uniprot:P53634
        Length = 463

 Score = 318 (117.0 bits), Expect = 1.5e-28, P = 1.5e-28
 Identities = 86/240 (35%), Positives = 125/240 (52%)

Query:   122 KVTSIPPSVDWRKKGS---VTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVS--LSEQ 176
             K+  +P S DWR       V+ V++Q  CGSC++F+++  +E    I+TN   +  LS Q
Sbjct:   227 KILHLPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQ 286

Query:   177 ELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDG 236
             E+V C +   QGC GG   L      +  G+  EA +PY   D  C + ++     S + 
Sbjct:   287 EVVSC-SQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCKMKEDCFRYYSSEY 345

Query:   237 HE--NVPANHEDALLKA--VAKQPVSVAIDAGSSDFQFYSEGVF--TGECGT----EL-N 285
             H          +AL+K   V   P++VA +    DF  Y +G++  TG        EL N
Sbjct:   346 HYVGGFYGGCNEALMKLELVHHGPMAVAFEV-YDDFLHYKKGIYHHTGLRDPFNPFELTN 404

Query:   286 HGVAAVGYGT-TLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPIKK 344
             H V  VGYGT +  G  YWIV+NSWG  WGE GY R++RG +D+  +  IA+ A+ PI K
Sbjct:   405 HAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRG-TDECAIESIAVAAT-PIPK 462


>RGD|2445
            symbol:Ctsc "cathepsin C" species:10116 "Rattus norvegicus"
          [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA;ISO]
          [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
          [GO:0004252 "serine-type endopeptidase activity" evidence=IEA;ISO]
          [GO:0005764 "lysosome" evidence=IDA;TAS] [GO:0005783 "endoplasmic
          reticulum" evidence=IDA] [GO:0005794 "Golgi apparatus" evidence=IDA]
          [GO:0006508 "proteolysis" evidence=IEP;ISO;TAS] [GO:0007568 "aging"
          evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
          evidence=ISO] [GO:0010033 "response to organic substance"
          evidence=IDA] [GO:0031404 "chloride ion binding" evidence=IDA]
          [GO:0042802 "identical protein binding" evidence=IDA] [GO:0043621
          "protein self-association" evidence=IDA] InterPro:IPR000668
          InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
          InterPro:IPR000169 RGD:2445 GO:GO:0005783 GO:GO:0005794 GO:GO:0007568
          GO:GO:0010033 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
          InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139
          PROSITE:PS00639 GO:GO:0004252 GO:GO:0005764 GO:GO:0043621
          GO:GO:0042802 GO:GO:0031404 GO:GO:0004197
          GeneTree:ENSGT00560000076599 CTD:1075 HOGENOM:HOG000068022
          HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ InterPro:IPR014882
          Pfam:PF08773 MEROPS:C01.070 OMA:YDDFLHY GO:GO:0001913 EMBL:D90404
          IPI:IPI00193765 PIR:A41158 RefSeq:NP_058793.1 UniGene:Rn.203177
          PDB:1JQP PDBsum:1JQP ProteinModelPortal:P80067 SMR:P80067
          STRING:P80067 PhosphoSite:P80067 PRIDE:P80067
          Ensembl:ENSRNOT00000022342 GeneID:25423 KEGG:rno:25423
          InParanoid:P80067 SABIO-RK:P80067 EvolutionaryTrace:P80067
          NextBio:606591 ArrayExpress:P80067 Genevestigator:P80067
          GermOnline:ENSRNOG00000016496 Uniprot:P80067
        Length = 462

 Score = 315 (115.9 bits), Expect = 3.2e-28, P = 3.2e-28
 Identities = 88/241 (36%), Positives = 125/241 (51%)

Query:   122 KVTSIPPSVDWRK-KGS--VTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVS--LSEQ 176
             ++ S+P S DWR  +G   V+ V++Q  CGSC++F+++  +E    I+TN   +  LS Q
Sbjct:   226 QILSLPESWDWRNVRGINFVSPVRNQESCGSCYSFASLGMLEARIRILTNNSQTPILSPQ 285

Query:   177 ELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKE-----SSPA 231
             E+V C +   QGC+GG   L      +  GV  E  +PY A D  C   +      SS  
Sbjct:   286 EVVSC-SPYAQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDAPCKPKENCLRYYSSEY 344

Query:   232 VSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVF--TGECGT----EL- 284
               + G      N     L+ V   P++VA +    DF  Y  G++  TG        EL 
Sbjct:   345 YYVGGFYG-GCNEALMKLELVKHGPMAVAFEV-HDDFLHYHSGIYHHTGLSDPFNPFELT 402

Query:   285 NHGVAAVGYGTT-LDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPIK 343
             NH V  VGYG   + G  YWIV+NSWG +WGE GY R++RG +D+  +  IAM A+ PI 
Sbjct:   403 NHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRIRRG-TDECAIESIAM-AAIPIP 460

Query:   344 K 344
             K
Sbjct:   461 K 461


>MGI|MGI:109553
            symbol:Ctsc "cathepsin C" species:10090 "Mus musculus"
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IGI]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IMP]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0005783 "endoplasmic
            reticulum" evidence=ISO] [GO:0005794 "Golgi apparatus"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=ISO;IMP]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0010033
            "response to organic substance" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0031404 "chloride ion
            binding" evidence=ISO] [GO:0042802 "identical protein binding"
            evidence=ISO] [GO:0043621 "protein self-association" evidence=ISO]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 MGI:MGI:109553 GO:GO:0005783
            GO:GO:0005794 GO:GO:0007568 GO:GO:0010033 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004252 GO:GO:0005764 GO:GO:0031404 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 MEROPS:C01.070 OMA:YDDFLHY
            GO:GO:0001913 EMBL:U89269 EMBL:U74683 EMBL:BC067063 IPI:IPI00130015
            RefSeq:NP_034112.3 UniGene:Mm.322945 ProteinModelPortal:P97821
            SMR:P97821 STRING:P97821 PhosphoSite:P97821 PaxDb:P97821
            PRIDE:P97821 Ensembl:ENSMUST00000032779 GeneID:13032 KEGG:mmu:13032
            InParanoid:P97821 BindingDB:P97821 ChEMBL:CHEMBL3454 ChiTaRS:CTSC
            NextBio:282904 Bgee:P97821 CleanEx:MM_CTSC Genevestigator:P97821
            Uniprot:P97821
        Length = 462

 Score = 310 (114.2 bits), Expect = 1.3e-27, P = 1.3e-27
 Identities = 85/240 (35%), Positives = 127/240 (52%)

Query:   122 KVTSIPPSVDWRK-KGS--VTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVS--LSEQ 176
             ++ ++P S DWR  +G   V+ V++Q  CGSC++F+++  +E    I+TN   +  LS Q
Sbjct:   226 QILNLPESWDWRNVQGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNNSQTPILSPQ 285

Query:   177 ELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDG 236
             E+V C +   QGC+GG   L      +  GV  E+ +PY A D  C   +      S D 
Sbjct:   286 EVVSC-SPYAQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDSPCKPRENCLRYYSSDY 344

Query:   237 HE--NVPANHEDALLKA--VAKQPVSVAIDAGSSDFQFYSEGVF--TGECGT----EL-N 285
             +          +AL+K   V   P++VA +    DF  Y  G++  TG        EL N
Sbjct:   345 YYVGGFYGGCNEALMKLELVKHGPMAVAFEV-HDDFLHYHSGIYHHTGLSDPFNPFELTN 403

Query:   286 HGVAAVGYGTT-LDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPIKK 344
             H V  VGYG   + G +YWI++NSWG  WGE GY R++RG +D+  +  IA+ A+ PI K
Sbjct:   404 HAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRG-TDECAIESIAV-AAIPIPK 461


>WB|WBGene00013764
            symbol:Y113G7B.15 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 GeneTree:ENSGT00560000076599
            EMBL:AL110477 HOGENOM:HOG000019851 RefSeq:NP_507904.2
            ProteinModelPortal:Q9U2X1 SMR:Q9U2X1 DIP:DIP-25339N IntAct:Q9U2X1
            MINT:MINT-1058673 STRING:Q9U2X1 MEROPS:C01.A47
            EnsemblMetazoa:Y113G7B.15 GeneID:190976 KEGG:cel:CELE_Y113G7B.15
            UCSC:Y113G7B.15 CTD:190976 WormBase:Y113G7B.15 eggNOG:NOG302449
            OMA:AEEDIME Uniprot:Q9U2X1
        Length = 362

 Score = 308 (113.5 bits), Expect = 1.7e-27, P = 1.7e-27
 Identities = 73/204 (35%), Positives = 108/204 (52%)

Query:   138 VTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDC-DTDQNQGCNGGLMEL 196
             V  VKDQ QCG CWAF+T A  E  N + +    SLS+QE+ DC D+    GC GG    
Sbjct:   148 VGPVKDQEQCGCCWAFATTAITEAANTLYSKSFTSLSDQEICDCADSGDTPGCVGGDPRN 207

Query:   197 AFEFIKKKGGVTTEAKYPYQ---AND-GTCDVSKESSPAVSIDGHENVPANHEDALLKAV 252
               + +  +G  +++  YPY+   AN  G C V  E S  +  +   NV    +D   + +
Sbjct:   208 GLKMVHLRGQ-SSDGDYPYEEYRANTTGNC-VGDEKSTVIQPETL-NVYRFDQDYAEEDI 264

Query:   253 AKQ------PVSVAIDAGSSDFQFYSEGVFTGECGTELN----HGVAAVGYGTTLDGTKY 302
              +       P +V    G + F++Y+ GV   E   ++     H VA VGYGT+ DG  Y
Sbjct:   265 MENLYLNHIPTAVYFRVGEN-FEWYTSGVLQSEDCYQMTPAEWHSVAIVGYGTSDDGVPY 323

Query:   303 WIVRNSWGPEWGEKGYIRMQRGIS 326
             W+VRNSW  +WG  GY++++RG++
Sbjct:   324 WLVRNSWNSDWGLHGYVKIRRGVN 347

 Score = 183 (69.5 bits), Expect = 1.0e-11, P = 1.0e-11
 Identities = 58/198 (29%), Positives = 89/198 (44%)

Query:    39 YERWRSHHTVS-RSLDEKHKRFNVFKQNVMHVHQTN----KMDKPYKLKLNKFADMTNHE 93
             +  +  HH    R+  EK +R   F +N   + + N    +  +      NKFAD    E
Sbjct:    30 FNNFTMHHKKHYRTPAEKDRRLAHFAKNHQKIQELNAKARREGRNVTFGWNKFADKNRQE 89

Query:    94 FASTYAGSKIKHH--------RMFQGTRGNGTFMYGKVTS-IPPSVDWRK---KGS--VT 139
              ++  +    K+H        R  +G+R +      + +  IP   D R     GS  V 
Sbjct:    90 LSARNSKIHPKNHTDLPIYKPRHPRGSRNHHNKRSKRQSGDIPDYFDLRDIYVDGSPVVG 149

Query:   140 AVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDC-DTDQNQGCNGGLMELAF 198
              VKDQ QCG CWAF+T A  E  N + +    SLS+QE+ DC D+    GC GG      
Sbjct:   150 PVKDQEQCGCCWAFATTAITEAANTLYSKSFTSLSDQEICDCADSGDTPGCVGGDPRNGL 209

Query:   199 EFIKKKGGVTTEAKYPYQ 216
             + +  +G  +++  YPY+
Sbjct:   210 KMVHLRGQ-SSDGDYPYE 226


>UNIPROTKB|F1RWA9
            symbol:CTSO "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 EMBL:CU855637
            Ensembl:ENSSSCT00000009707 OMA:WAFSIVG Uniprot:F1RWA9
        Length = 194

 Score = 305 (112.4 bits), Expect = 3.5e-27, P = 3.5e-27
 Identities = 74/198 (37%), Positives = 103/198 (52%)

Query:   146 QCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKK-K 204
             QCG CWAFS ++AVE    I    L  LS Q+++DC  + N GCNGG    A  ++ K +
Sbjct:     1 QCGGCWAFSVVSAVESAYAIKGQPLEVLSVQQVIDCSYN-NYGCNGGSTLNALYWLNKTQ 59

Query:   205 GGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVP-ANHEDALLKAVAKQ-PVSVAID 262
               V ++++YP++A +G C     S   VSI  +     +  ED + K +    P+ V +D
Sbjct:    60 VKVVSDSEYPFKAQNGLCHYFSCSHSGVSIKDYSAYDFSGQEDEMAKTLLTLGPLIVIVD 119

Query:   263 AGSSDFQFYSEGVFTGECGT-ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRM 321
             A S  +Q Y  G+    C + E NH V   G+  T   T YWIVRNSWG  WG  GY  +
Sbjct:   120 AVS--WQDYLGGIIQHHCSSGEANHAVLVTGFDKT-GSTPYWIVRNSWGSAWGIDGYALV 176

Query:   322 QRGISDKKGLCGIAMEAS 339
             + G      +CGIA   S
Sbjct:   177 KMG----GNICGIADSVS 190


>UNIPROTKB|J9NSE7
            symbol:CTSC "Dipeptidyl peptidase 1" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 InterPro:IPR014882 Pfam:PF08773
            EMBL:AAEX03017125 Ensembl:ENSCAFT00000014269 OMA:INGQICH
            Uniprot:J9NSE7
        Length = 458

 Score = 306 (112.8 bits), Expect = 3.6e-27, P = 3.6e-27
 Identities = 97/311 (31%), Positives = 149/311 (47%)

Query:    52 LDEKHKRFNVFKQNVMHVHQTNKMDKPYKL-KLNKFADMTNHEFASTYAGSKIKHHRMFQ 110
             L E +    ++K N   V   N + K +   +  ++  +T  +      G KI   +   
Sbjct:   157 LQENNSN-RLYKYNYEFVKAINTIQKSWTATRYIEYETLTLRDMMRRAGGRKIPRPKPTP 215

Query:   111 GTRGNGTFMYGKVTSIPPSVDWRK-KGS--VTAVKDQGQCGSCWAFSTIAAVEGINHIMT 167
              T      ++ +++ +P S DWR  +G+  V+ V++Q  CGSC+AF++   +E    I+T
Sbjct:   216 LTAE----IHEEISRLPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTVMLEARIRILT 271

Query:   168 NKLVS--LSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVS 225
             N   +  LS QE+V C +   QGC GG   L      +  G+  EA + Y  +D  C  +
Sbjct:   272 NNTQTPILSPQEIVSC-SQYAQGCEGGFPYLIAGKYAQDFGLVDEACFSYAGSDSPCKPN 330

Query:   226 K----ESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVF--TGE 279
                   SS    + G      N     L+ V   P++VA +    DF  Y +G++  TG 
Sbjct:   331 DCFHYYSSEYHYVGGFYGA-CNEALMKLELVRHGPMAVAFEV-YDDFFHYQKGIYYHTGL 388

Query:   280 CGT----EL-NHGVAAVGYGT-TLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCG 333
                    EL NH V  VGYGT +  G  YWIV+NSWG  WGE GY ++ RG +D+  +  
Sbjct:   389 RDPINPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFQICRG-TDECAIES 447

Query:   334 IAMEASYPIKK 344
             IA+ A+ PI K
Sbjct:   448 IAVAAT-PIPK 457


>WB|WBGene00019314
            symbol:K02E7.10 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 SMART:SM00645 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 HSSP:P07711
            EMBL:FO080411 PIR:T32392 RefSeq:NP_493904.1 UniGene:Cel.14828
            ProteinModelPortal:O17255 SMR:O17255 EnsemblMetazoa:K02E7.10
            GeneID:186889 KEGG:cel:CELE_K02E7.10 UCSC:K02E7.10 CTD:186889
            WormBase:K02E7.10 eggNOG:NOG331187 HOGENOM:HOG000114005
            InParanoid:O17255 OMA:GNANEAR NextBio:933344 Uniprot:O17255
        Length = 299

 Score = 303 (111.7 bits), Expect = 5.7e-27, P = 5.7e-27
 Identities = 82/222 (36%), Positives = 114/222 (51%)

Query:   130 VDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTN-KLVSLSEQELVDCDTDQNQG 188
             +DWR+KG V  VKDQG+C + +AF+ IAA+E +     N KL+S SEQ+++DC    N  
Sbjct:    84 LDWREKGIVGPVKDQGKCNASYAFAAIAAIESMYAKANNGKLLSFSEQQIIDCANFTNP- 142

Query:   189 CNGGLMELAFEFIKKKGGVTTEAKYPY--QANDGTCDVSKESSPAVSIDGHENVPANHED 246
             C   L  +      K+ GV TEA YPY  + N G C+   +SS       + +V  N E 
Sbjct:   143 CQENLENVLSNRFLKENGVGTEADYPYVGKENVGKCEY--DSSKMKLRPTYIDVYPNEEW 200

Query:   247 ALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTG---ECGTELN-HGVAAVGYGTTLDGT-K 301
             A             + +  S F  Y  G++     ECG       +A VGYG   DG  K
Sbjct:   201 ARAHITTFGTGYFRMRSPPSFFH-YKTGIYNPTKEECGNANEARSLAIVGYGK--DGAEK 257

Query:   302 YWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPIK 343
             YWIV+ S+G  WGE GY+++ R ++     CG+A   S PIK
Sbjct:   258 YWIVKGSFGTSWGEHGYMKLARNVN----ACGMAESISIPIK 295


>WB|WBGene00008231
            symbol:tag-329 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            eggNOG:NOG288820 EMBL:Z70750 HSSP:P53634 HOGENOM:HOG000019851
            PIR:T20110 RefSeq:NP_505458.1 ProteinModelPortal:Q18740 SMR:Q18740
            MEROPS:C01.A36 EnsemblMetazoa:C50F4.3 GeneID:183677
            KEGG:cel:CELE_C50F4.3 UCSC:C50F4.3 CTD:183677 WormBase:C50F4.3
            InParanoid:Q18740 OMA:WIFRNSW NextBio:921986 Uniprot:Q18740
        Length = 374

 Score = 302 (111.4 bits), Expect = 7.3e-27, P = 7.3e-27
 Identities = 93/302 (30%), Positives = 137/302 (45%)

Query:    53 DEKHKRFNVFKQNVMHVHQTNKMDKPYKLK-------LNKFADMTNHEFASTYAG-SKIK 104
             DE  K+F  F+Q V   ++  KM+K  K         +NKF+D++  E    Y+     K
Sbjct:    60 DEIEKKFR-FQQFVATHNRVGKMNKAAKKAGHDTKYGINKFSDLSKKEIHGMYSKFGPPK 118

Query:   105 HHRMFQGTRGNGTFMYGKVTSIPPSVDWRKK--GS---VTAVKDQGQCGSCWAFSTIAAV 159
             ++            +  ++  +P + D R K  G    +  +K Q  C  CW F+  A  
Sbjct:   119 NNTNVPKFNLKNLRVKRQMEGLPKTFDLRNKKVGGHYIIGPIKTQDSCACCWGFAATAVA 178

Query:   160 EGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQAND 219
             E    +   K ++LSEQE+ DC      GCNGG      E+IK+ G +T   +YP+  N 
Sbjct:   179 EAALTVHLKKAMNLSEQEVCDCAPKHGPGCNGGDPVDGLEYIKEMG-LTGGKEYPFNVNR 237

Query:   220 GT----CDVSK---ESSPAVSIDGHENVPANHEDALLKAV--AKQPVSVAIDAGSSDFQF 270
              T    C+  K   E +P + +D +   P N E  +   +     P+SVA   G+S    
Sbjct:   238 STQLGRCESEKYDRELNP-LELDYYAIDPFNAEYQMTHHLYLLNLPISVAFRTGAS-LSS 295

Query:   271 YSEGVFT-GECGTELN---HGVAAVGYGTTLDGT----KYWIVRNSWGPEWGEKGYIRMQ 322
             Y  G+    +C  E     H  A VGYGTT +       YWI RNSW  +WG+ GY R+ 
Sbjct:   296 YLSGILELADCDDEKGGHWHSGAIVGYGTTKNSAGRTVDYWIFRNSWWTDWGDDGYARIV 355

Query:   323 RG 324
             RG
Sbjct:   356 RG 357


>UNIPROTKB|Q5QP40
            symbol:CTSK "Cathepsin K" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015644 Pfam:PF00112
            InterPro:IPR000169 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 EMBL:AL355860 HOVERGEN:HBG011513
            PANTHER:PTHR12411:SF55 EMBL:AL356292 UniGene:Hs.632466
            HGNC:HGNC:2536 IPI:IPI00514633 SMR:Q5QP40 STRING:Q5QP40
            Ensembl:ENST00000443913 Uniprot:Q5QP40
        Length = 258

 Score = 297 (109.6 bits), Expect = 2.5e-26, P = 2.5e-26
 Identities = 71/189 (37%), Positives = 105/189 (55%)

Query:    29 LESEEGLWDLYERWRSHHT--VSRSLDEKHKRFNVFKQNVMHV--H--QTNKMDKPYKLK 82
             L  EE L   +E W+  H    +  +DE  +R  ++++N+ ++  H  + +     Y+L 
Sbjct:    75 LYPEEILDTHWELWKKTHRKQYNNKVDEISRRL-IWEKNLKYISIHNLEASLGVHTYELA 133

Query:    83 LNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTS-IPPSVDWRKKGSVTAV 141
             +N   DMT+ E      G K+        +R N T    +     P SVD+RKKG VT V
Sbjct:   134 MNHLGDMTSEEVVQKMTGLKVP----LSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTPV 189

Query:   142 KDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFI 201
             K+QGQCGSCWAFS++ A+EG     T KL++LS Q LVDC   +N GC GG M  AF+++
Sbjct:   190 KNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNAFQYV 248

Query:   202 KKKGGVTTE 210
             +K  G+ +E
Sbjct:   249 QKNRGIDSE 257


>UNIPROTKB|E2RPX3
            symbol:CTSW "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 CTD:1521 KO:K08569 OMA:GRCGDGC
            EMBL:AAEX03011632 RefSeq:XP_540846.2 Ensembl:ENSCAFT00000020910
            GeneID:483725 KEGG:cfa:483725 Uniprot:E2RPX3
        Length = 374

 Score = 227 (85.0 bits), Expect = 6.5e-26, Sum P(2) = 6.5e-26
 Identities = 65/250 (26%), Positives = 108/250 (43%)

Query:    53 DEKHKRFNVFKQNVMHVHQTNKMDK-PYKLKLNKFADMTNHEFASTYAGSKIKHHRMFQG 111
             +E  +R ++F  N+    Q    D    +  +  F+D+T  EF   Y   ++       G
Sbjct:    57 EEYARRLDIFAHNLAQAQQLEDEDLGTAEFGVTPFSDLTEEEFGQFYGHQRMAGEAPSVG 116

Query:   112 TRGNGTFMYGKVTSIPPSVDWRK-KGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKL 170
              +      +G+   +PP+ DWRK  G ++ +K QG C  CWA +    +E +  I  ++ 
Sbjct:   117 RKVESE-EWGE--PVPPTCDWRKLPGIISPIKQQGNCRCCWAMAAAGNIEALWGIRYHQP 173

Query:   171 VSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGT--CDVSKES 228
             V +S QEL+DC      GC GG    AF  +    G+ +   YP+  N     C ++K+ 
Sbjct:   174 VEVSVQELLDCGRC-GDGCKGGFTWDAFITVLNNSGLASAKDYPFLGNTKPHRC-LAKKY 231

Query:   229 SPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTGE---CGTE-L 284
                  I     +  N +        K P++V I+      Q Y +GV       C  + +
Sbjct:   232 KKVAWIQDFIMLQGNEQAIAWYLATKGPITVTINMKL--LQHYQKGVIQATHTTCDPQRV 289

Query:   285 NHGVAAVGYG 294
             +H V  VG+G
Sbjct:   290 DHSVLLVGFG 299

 Score = 117 (46.2 bits), Expect = 6.5e-26, Sum P(2) = 6.5e-26
 Identities = 21/41 (51%), Positives = 26/41 (63%)

Query:   302 YWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPI 342
             YWI++NSWG EWGE+GY R+ RG       CGI     YP+
Sbjct:   324 YWILKNSWGAEWGEEGYFRLHRG----NNTCGIT---KYPV 357


>UNIPROTKB|E9PKT6
            symbol:CTSH "Cathepsin H" species:9606 "Homo sapiens"
            [GO:0001520 "outer dense fiber" evidence=IEA] [GO:0001656
            "metanephros development" evidence=IEA] [GO:0001669 "acrosomal
            vesicle" evidence=IEA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0007283 "spermatogenesis" evidence=IEA] [GO:0008284 "positive
            regulation of cell proliferation" evidence=IEA] [GO:0010634
            "positive regulation of epithelial cell migration" evidence=IEA]
            [GO:0016505 "apoptotic protease activator activity" evidence=IEA]
            [GO:0030984 "kininogen binding" evidence=IEA] [GO:0031638 "zymogen
            activation" evidence=IEA] [GO:0031648 "protein destabilization"
            evidence=IEA] [GO:0032403 "protein complex binding" evidence=IEA]
            [GO:0032526 "response to retinoic acid" evidence=IEA] [GO:0033619
            "membrane protein proteolysis" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0043621 "protein
            self-association" evidence=IEA] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IEA] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            InterPro:IPR000169 GO:GO:0043066 GO:GO:0008284 PANTHER:PTHR12411
            PROSITE:PS00139 GO:GO:0045766 GO:GO:0004252 GO:GO:0032526
            GO:GO:0016505 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 GO:GO:0060448 GO:GO:0033619
            EMBL:AC011944 HGNC:HGNC:2535 IPI:IPI00375426
            ProteinModelPortal:E9PKT6 SMR:E9PKT6 PRIDE:E9PKT6
            Ensembl:ENST00000528741 ArrayExpress:E9PKT6 Bgee:E9PKT6
            Uniprot:E9PKT6
        Length = 134

 Score = 289 (106.8 bits), Expect = 1.8e-25, P = 1.8e-25
 Identities = 64/138 (46%), Positives = 84/138 (60%)

Query:    81 LKLNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGS-VT 139
             + LN+F+DM+  E    Y  S+ ++      T+ N  ++ G     PPSVDWRKKG+ V+
Sbjct:     1 MALNQFSDMSFAEIKHKYLWSEPQN---CSATKSN--YLRG-TGPYPPSVDWRKKGNFVS 54

Query:   140 AVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTD-QNQGCNGGLMELAF 198
              VK+QG CGSCW FST  A+E    I T K++SL+EQ+LVDC  D  N GC GGL   AF
Sbjct:    55 PVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAF 114

Query:   199 EFIKKKGGVTTEAKYPYQ 216
             E+I    G+  E  YPYQ
Sbjct:   115 EYILYNKGIMGEDTYPYQ 132


>UNIPROTKB|P56202
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 GO:GO:0006955 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AF013611
            EMBL:AF015954 EMBL:AF055903 EMBL:AP001201 EMBL:BC048255
            IPI:IPI00328978 RefSeq:NP_001326.2 UniGene:Hs.416848
            ProteinModelPortal:P56202 SMR:P56202 STRING:P56202 MEROPS:C01.037
            PhosphoSite:P56202 DMDM:259016196 PaxDb:P56202 PRIDE:P56202
            Ensembl:ENST00000307886 GeneID:1521 KEGG:hsa:1521 UCSC:uc001ogc.1
            CTD:1521 GeneCards:GC11P065647 HGNC:HGNC:2546 HPA:CAB016345
            MIM:602364 neXtProt:NX_P56202 PharmGKB:PA27042 eggNOG:NOG288820
            HOVERGEN:HBG100117 InParanoid:P56202 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 PhylomeDB:P56202 GenomeRNAi:1521 NextBio:6295
            ArrayExpress:P56202 Bgee:P56202 CleanEx:HS_CTSW
            Genevestigator:P56202 GermOnline:ENSG00000172543 Uniprot:P56202
        Length = 376

 Score = 222 (83.2 bits), Expect = 1.8e-25, Sum P(2) = 1.8e-25
 Identities = 72/262 (27%), Positives = 113/262 (43%)

Query:    54 EKHK-RFNVFKQNVMHVHQTNKMDK-PYKLKLNKFADMTNHEFASTYAGSKIKHHRMFQG 111
             E+H  R ++F  N+    +  + D    +  +  F+D+T  EF   Y      + R   G
Sbjct:    57 EEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQLYG-----YRRAAGG 111

Query:   112 TRGNGTFMYGKVT--SIPPSVDWRKKGS-VTAVKDQGQCGSCWAFSTIAAVEGINHIMTN 168
                 G  +  +    S+P S DWRK  S ++ +KDQ  C  CWA +    +E +  I   
Sbjct:   112 VPSMGREIRSEEPEESVPFSCDWRKVASAISPIKDQKNCNCCWAMAAAGNIETLWRISFW 171

Query:   169 KLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGT--CDVSK 226
               V +S QEL+DC      GC+GG +  AF  +    G+ +E  YP+Q       C   K
Sbjct:   172 DFVDVSVQELLDCGRC-GDGCHGGFVWDAFITVLNNSGLASEKDYPFQGKVRAHRCHPKK 230

Query:   227 ESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFTGE---CGT 282
                 A   D    +  N+E  + + +A   P++V I+      Q Y +GV       C  
Sbjct:   231 YQKVAWIQDFI--MLQNNEHRIAQYLATYGPITVTINM--KPLQLYRKGVIKATPTTCDP 286

Query:   283 EL-NHGVAAVGYGTTLDGTKYW 303
             +L +H V  VG+G+       W
Sbjct:   287 QLVDHSVLLVGFGSVKSEEGIW 308

 Score = 120 (47.3 bits), Expect = 1.8e-25, Sum P(2) = 1.8e-25
 Identities = 22/50 (44%), Positives = 28/50 (56%)

Query:   300 TKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPIKKSATNP 349
             T YWI++NSWG +WGEKGY R+ RG       CGI     +P+      P
Sbjct:   324 TPYWILKNSWGAQWGEKGYFRLHRG----SNTCGIT---KFPLTARVQKP 366


>DICTYBASE|DDB_G0286015
            symbol:gmsA species:44689 "Dictyostelium discoideum"
            [GO:0005615 "extracellular space" evidence=IDA] [GO:0019953 "sexual
            reproduction" evidence=IEP] [GO:0009897 "external side of plasma
            membrane" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA;ISS] [GO:0000747 "conjugation with cellular
            fusion" evidence=IMP] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005576 "extracellular
            region" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0286015 Pfam:PF00188 GO:GO:0005615
            GenomeReviews:CM000153_GR GO:GO:0009897 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AAFI02000085 GO:GO:0000747
            Gene3D:3.40.33.10 InterPro:IPR001283 InterPro:IPR014044
            SMART:SM00198 SUPFAM:SSF55797 HSSP:P07688 RefSeq:XP_637893.1
            ProteinModelPortal:Q54ME1 MEROPS:C01.A52 EnsemblProtists:DDB0191145
            GeneID:8625403 KEGG:ddi:DDB_G0286015 InParanoid:Q54ME1 OMA:PGIAYEK
            ProtClustDB:CLSZ2429919 Uniprot:Q54ME1
        Length = 448

 Score = 279 (103.3 bits), Expect = 4.2e-24, P = 4.2e-24
 Identities = 76/218 (34%), Positives = 105/218 (48%)

Query:   129 SVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEG---INHIMTNK-LVSLSEQELVDCDTD 184
             +VDW      T ++DQGQCGSCWAF++ AA+E    I +    K  + LS Q  V+C   
Sbjct:   243 TVDWTSYQ--TPIRDQGQCGSCWAFASSAALESRYLIKYGTAQKSTLQLSNQNAVNCIAS 300

Query:   185 QNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANH 244
                GCNGG     F F K  G +  E   PY+A  GT  ++  S        +       
Sbjct:   301 ---GCNGGWSGNYFNFFKTPG-IAYEKDDPYKAVTGTSCITTSSVARFKYTNY-GYTEKT 355

Query:   245 EDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTGECG-TELNHGVAAVGYGTTLDGTKYW 303
             + ALL  + K PV++A+   S+ FQ Y  G++      T +NH V  VGY    D  K  
Sbjct:   356 KAALLAELKKGPVTIAVYVDSA-FQNYKSGIYNSATKYTGINHLVLLVGYDQATDAYK-- 412

Query:   304 IVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYP 341
              ++NSWG  WGE GY+R+    +    L   A  + YP
Sbjct:   413 -IKNSWGSWWGESGYMRIT---ASNDNLAIFAYNSYYP 446


>UNIPROTKB|E2QV47
            symbol:CTSH "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0097208 "alveolar lamellar body"
            evidence=IEA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0070371 "ERK1 and ERK2 cascade"
            evidence=IEA] [GO:0070324 "thyroid hormone binding" evidence=IEA]
            [GO:0060448 "dichotomous subdivision of terminal units involved in
            lung branching" evidence=IEA] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IEA] [GO:0043129 "surfactant homeostasis"
            evidence=IEA] [GO:0043066 "negative regulation of apoptotic
            process" evidence=IEA] [GO:0033619 "membrane protein proteolysis"
            evidence=IEA] [GO:0032526 "response to retinoic acid" evidence=IEA]
            [GO:0031648 "protein destabilization" evidence=IEA] [GO:0031638
            "zymogen activation" evidence=IEA] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=IEA] [GO:0016505
            "apoptotic protease activator activity" evidence=IEA] [GO:0010815
            "bradykinin catabolic process" evidence=IEA] [GO:0010813
            "neuropeptide catabolic process" evidence=IEA] [GO:0010634
            "positive regulation of epithelial cell migration" evidence=IEA]
            [GO:0010628 "positive regulation of gene expression" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=IEA] [GO:0005829 "cytosol" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0004252 "serine-type endopeptidase activity"
            evidence=IEA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IEA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 GO:GO:0016505 GO:GO:0010634
            GO:GO:0004197 GO:GO:0042599 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 Ensembl:ENSCAFT00000036196 Uniprot:E2QV47
        Length = 136

 Score = 272 (100.8 bits), Expect = 1.1e-23, P = 1.1e-23
 Identities = 60/139 (43%), Positives = 83/139 (59%)

Query:   210 EAKYPYQANDGTCDVSKESSPAVS-IDGHENVPANHEDALLKAVAK-QPVSVAIDAGSSD 267
             E  YPY+  DG C    + S A++ +    N+  N E A+++AVA   PVS A +  +SD
Sbjct:     3 EDSYPYKGQDGDCKY--QPSKAIAFVKDVANITINDEQAMVEAVALYNPVSFAFEV-TSD 59

Query:   268 FQFYSEGVFTG-ECGT---ELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQR 323
             F  Y +G+++   C     ++NH V AVGYG   +G  YWIV+NSWGP+WG  GY  M+R
Sbjct:    60 FMMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEQ-NGIPYWIVKNSWGPQWGMNGYFLMER 118

Query:   324 GISDKKGLCGIAMEASYPI 342
             G    K +CG+A  ASYPI
Sbjct:   119 G----KNMCGLAACASYPI 133


>DICTYBASE|DDB_G0288221
            symbol:DDB_G0288221 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0288221 Pfam:PF00188 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 PANTHER:PTHR12411 EMBL:AAFI02000109 Gene3D:3.40.33.10
            InterPro:IPR001283 InterPro:IPR014044 SMART:SM00198 SUPFAM:SSF55797
            MEROPS:C01.A52 ProtClustDB:CLSZ2429919 RefSeq:XP_636852.1
            ProteinModelPortal:Q54J84 EnsemblProtists:DDB0187839 GeneID:8626520
            KEGG:ddi:DDB_G0288221 InParanoid:Q54J84 Uniprot:Q54J84
        Length = 395

 Score = 268 (99.4 bits), Expect = 2.9e-23, P = 2.9e-23
 Identities = 69/208 (33%), Positives = 109/208 (52%)

Query:   129 SVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGI----NHIMTNKLVSLSEQELVDCDTD 184
             SVDW      T V+DQG+C SCW F ++AA+E      N +     + LS Q  ++C T 
Sbjct:   191 SVDW--SDYQTPVRDQGECKSCWVFGSLAALESRYLIKNGVSEKSTLHLSAQNAMNCITS 248

Query:   185 QNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANH 244
                GC  G     F++ +  G +  E  YPY A  G+ D    SS      G+++V  N 
Sbjct:   249 ---GCESGWPANVFDYFESSG-IAFEKDYPYDAI-GS-DNCTSSSNKFEYSGYDSVE-NT 301

Query:   245 EDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTG-ECGTELNHGVAAVGYGTTLDGTKYW 303
             +D+L++ +   P+++A+ + ++ FQ Y+ G++   E   ++NH V  VGY    D    W
Sbjct:   302 KDSLIQELKNGPITIALYSDTA-FQSYAGGIYDSVEEYKDVNHIVLLVGYDKPTDS---W 357

Query:   304 IVRNSWGPEWGEKGYIRMQRGISDKKGL 331
              ++NS G +WGE GY R+    +DK G+
Sbjct:   358 KIKNSLGTKWGELGYARITAS-NDKLGI 384


>DICTYBASE|DDB_G0288563
            symbol:DDB_G0288563 species:44689 "Dictyostelium
            discoideum" [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0288563
            GO:GO:0005615 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197
            EMBL:AAFI02000117 PANTHER:PTHR12411:SF16 RefSeq:XP_636643.1
            MEROPS:C01.A58 PRIDE:Q54IS1 EnsemblProtists:DDB0187993
            GeneID:8626689 KEGG:ddi:DDB_G0288563 InParanoid:Q54IS1 OMA:AWEYMEL
            Uniprot:Q54IS1
        Length = 314

 Score = 267 (99.0 bits), Expect = 3.8e-23, P = 3.8e-23
 Identities = 73/221 (33%), Positives = 112/221 (50%)

Query:   115 NGTFMYGKV-TSIPPSVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVS- 172
             NG  + G + TS    V W     +  + +Q QCGSCWAFS+   +     I +N   + 
Sbjct:    80 NGEELKGSIPTSFDSRVQW--PDCIHPILNQEQCGSCWAFSSSEVLSDRLCIASNNKTNP 137

Query:   173 --LSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGT---CDVSKE 227
               LS Q LV CD   N GC+GG+ +LA+E+++ KG + T++  PY A +GT   C  S  
Sbjct:   138 GALSPQTLVACDVYGNDGCSGGIPQLAWEYMELKG-LPTDSCVPYTAGNGTVYSCQRSCS 196

Query:   228 SSPAVSIDGHENVPANHED-ALLKAVAKQPVSVAIDAGS----SDFQFYSEGVFTGECGT 282
              S   S+  +   P   +  + ++ + +  ++     G+     DF  YS GV+    G+
Sbjct:   197 DSEDYSL--YRAKPFTLKTCSSVQCIQENILAYGPIVGTMEVYEDFMSYSSGVYVMTPGS 254

Query:   283 EL--NHGVAAVGYGTTLDGTK---YWIVRNSWGPEWGEKGY 318
              L   H +  VG+G   D T    YWIV NSWG +WG++G+
Sbjct:   255 SLLGGHAIKIVGWG--FDQTSQLNYWIVANSWGADWGQQGF 293


>WB|WBGene00044760 [details] [associations]
            symbol:Y71H2AM.25 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0004181
            "metallocarboxypeptidase activity" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            GeneTree:ENSGT00560000076599 EMBL:FO081822 eggNOG:NOG331187
            HOGENOM:HOG000114005 RefSeq:NP_001040887.1
            ProteinModelPortal:Q2AAB9 SMR:Q2AAB9 EnsemblMetazoa:Y71H2AM.25
            GeneID:4363054 KEGG:cel:CELE_Y71H2AM.25 UCSC:Y71H2AM.25 CTD:4363054
            WormBase:Y71H2AM.25 InParanoid:Q2AAB9 NextBio:959635 Uniprot:Q2AAB9
        Length = 299

 Score = 262 (97.3 bits), Expect = 1.3e-22, P = 1.3e-22
 Identities = 69/205 (33%), Positives = 102/205 (49%)

Query:   130 VDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTN-KLVSLSEQELVDCDTDQNQG 188
             +DWR KG V  VKDQG+C +  AF+  +++E +    TN  L+S SEQ+L+DCD    +G
Sbjct:    86 LDWRDKGIVGPVKDQGKCNASHAFAISSSIESMYAKATNGSLLSFSEQQLIDCDDHGFKG 145

Query:   189 CNGG-LMELAFEFIKKKGGVTTEAKYPYQAND-GTCDVSKESSPAVSIDGHENVPANHED 246
             C     +     FI    G+ TEA YPY   + G C      S  + +   E V +N   
Sbjct:   146 CEEQPAINAVSYFIFH--GIETEADYPYAGKENGKCTFDSTKSK-IQLKDAEFVVSNETQ 202

Query:   247 ALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTG---EC-GTELNHGVAAVGYGTTLDGT-K 301
                      P    + A  S +  Y  G++     EC  T     +  VGYG  ++G  K
Sbjct:   203 GKELVTNYGPAFFTMRAPPSLYD-YKIGIYNPSIEECTSTHEIRSMVIVGYG--IEGVQK 259

Query:   302 YWIVRNSWGPEWGEKGYIRMQRGIS 326
             YWIV+ S+G  WGE+GY+++ R ++
Sbjct:   260 YWIVKGSFGTSWGEQGYMKLARDVN 284


>TAIR|locus:505006093 [details] [associations]
            symbol:AT1G02305 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0005829 "cytosol" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684 GO:GO:0005773
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 HOGENOM:HOG000241341
            KO:K01363 PANTHER:PTHR12411:SF16 OMA:CCGFLCG UniGene:At.23486
            UniGene:At.42610 UniGene:At.43952 EMBL:AY039887 EMBL:AF428337
            EMBL:BT002227 IPI:IPI00524601 RefSeq:NP_563648.1 HSSP:P07858
            ProteinModelPortal:Q93VC9 SMR:Q93VC9 IntAct:Q93VC9 STRING:Q93VC9
            MEROPS:C01.049 PRIDE:Q93VC9 ProMEX:Q93VC9 EnsemblPlants:AT1G02305.1
            GeneID:839538 KEGG:ath:AT1G02305 TAIR:At1g02305 InParanoid:Q93VC9
            PhylomeDB:Q93VC9 ProtClustDB:CLSN2687619 Genevestigator:Q93VC9
            Uniprot:Q93VC9
        Length = 362

 Score = 174 (66.3 bits), Expect = 1.7e-22, Sum P(2) = 1.7e-22
 Identities = 39/116 (33%), Positives = 58/116 (50%)

Query:   243 NHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFTGECGTELN-HGVAAVGYGTTLDGT 300
             +H D ++  V K  PV VA      DF  Y  GV+    GT +  H V  +G+GT+ DG 
Sbjct:   245 SHPDDIMAEVYKNGPVEVAFTV-YEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDGE 303

Query:   301 KYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGI--AMEASYPIKKSATNPTGPSD 354
              YW++ N W   WG+ GY +++RG ++    CGI   + A  P  ++       SD
Sbjct:   304 DYWLLANQWNRSWGDDGYFKIRRGTNE----CGIEHGVVAGLPSDRNVVKGITTSD 355

 Score = 151 (58.2 bits), Expect = 1.7e-22, Sum P(2) = 1.7e-22
 Identities = 36/90 (40%), Positives = 46/90 (51%)

Query:   132 WRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCN 190
             W +  S+  + DQG CGSCWAF  + ++     I  N  VSLS  +L+ C      QGCN
Sbjct:   116 WSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLLACCGFLCGQGCN 175

Query:   191 GGLMELAFEFIKKKGGVTTEAKYPYQANDG 220
             GG    A+ + K  G VT E   PY  N G
Sbjct:   176 GGYPIAAWRYFKHHGVVTEECD-PYFDNTG 204


>WB|WBGene00022189 [details] [associations]
            symbol:Y71H2AR.2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0004181
            "metallocarboxypeptidase activity" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] [GO:0008340 "determination of adult lifespan"
            evidence=IMP] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0008340 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 HSSP:P07711
            eggNOG:NOG331187 HOGENOM:HOG000114005 EMBL:FO081570
            RefSeq:NP_497627.1 UniGene:Cel.28419 ProteinModelPortal:Q9BL26
            SMR:Q9BL26 EnsemblMetazoa:Y71H2AR.2 GeneID:190615
            KEGG:cel:CELE_Y71H2AR.2 UCSC:Y71H2AR.2 CTD:190615
            WormBase:Y71H2AR.2 InParanoid:Q9BL26 OMA:CAMATTI NextBio:946382
            Uniprot:Q9BL26
        Length = 345

 Score = 258 (95.9 bits), Expect = 3.4e-22, P = 3.4e-22
 Identities = 69/205 (33%), Positives = 104/205 (50%)

Query:   130 VDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTN-KLVSLSEQELVDCDTDQNQG 188
             +DWR+KG V  VKDQG+C +  AF+  +++E +    TN  L+S SEQ+L+DC+    +G
Sbjct:    86 LDWREKGIVGPVKDQGKCNASHAFAITSSIESMYAKATNGTLLSFSEQQLIDCNDQGYKG 145

Query:   189 CNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVPANHEDAL 248
             C       A  ++   G + TEA YPY   D T +     S    I   + V A   + L
Sbjct:   146 CEEQFAMNAIGYLATHG-IETEADYPYV--DKTNEKCTFDSTKSKIHLKKGVVAEGNEVL 202

Query:   249 LKAVAKQ--PVSVAIDAGSSDFQFYSEGVFTG---EC-GTELNHGVAAVGYGTTLDGT-K 301
              K       P    + A  S +  Y  G++     EC  T     +  VGYG  ++G  K
Sbjct:   203 GKVYVTNYGPAFFTMRAPPSLYD-YKIGIYNPSIEECTSTHEIRSMVIVGYG--IEGEQK 259

Query:   302 YWIVRNSWGPEWGEKGYIRMQRGIS 326
             YWIV+ S+G  WGE+GY+++ R ++
Sbjct:   260 YWIVKGSFGTSWGEQGYMKLARDVN 284


>DICTYBASE|DDB_G0292462 [details] [associations]
            symbol:DDB_G0292462 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0292462 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            EMBL:AAFI02000190 RefSeq:XP_629634.1 MEROPS:C01.A56
            EnsemblProtists:DDB0184413 GeneID:8628698 KEGG:ddi:DDB_G0292462
            InParanoid:Q54D62 OMA:NTQVESH Uniprot:Q54D62
        Length = 323

 Score = 257 (95.5 bits), Expect = 4.3e-22, P = 4.3e-22
 Identities = 71/208 (34%), Positives = 108/208 (51%)

Query:   138 VTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVS--LSEQELVDCD----TDQ----NQ 187
             ++ V++Q  CGSCWA  T   +     I ++K +   LS Q L+DCD    +D     N 
Sbjct:    60 MSPVREQQSCGSCWAQVTSGILADRMCIESDKNIKMLLSPQYLMDCDGSCVSDGVSGCNN 119

Query:   188 GCNGGLMELAFEFIKKKGGVTTEAKYPYQAN-DGTCDVS-KESSPAVSIDGHENVPANH- 244
             GC GG + LA   +  +G V+ E    YQA+ D +C  +  + SP  +   ++       
Sbjct:   120 GCKGGFVGLALTRLINEGIVSDEC-LSYQASKDSSCPTTCDDGSPISNTTIYKATSCRAF 178

Query:   245 ---EDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTGECGTEL-NHGVAAVGYGTTLDGT 300
                +DA  + +   PV +A     SDF+ +   V+     T++ +H V  VG+GTT DG 
Sbjct:   179 PTVQDAQYEIMTNGPV-IATFMLYSDFKPHKWDVYIKSSNTQVESHAVRVVGWGTTSDGV 237

Query:   301 KYWIVRNSWGPEWGEKGYIRMQRGISDK 328
              YWI  NSWG  WG+KGY +++RG SD+
Sbjct:   238 DYWIAANSWGTGWGDKGYFKIRRG-SDE 264

 Score = 147 (56.8 bits), Expect = 1.1e-07, P = 1.1e-07
 Identities = 27/61 (44%), Positives = 38/61 (62%)

Query:   285 NHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISD---KKGLCGIAME-ASY 340
             +H V  VG+GTT DG  YWI  NSWG  WG+KGY +++RG  +   ++G   +  + AS 
Sbjct:   222 SHAVRVVGWGTTSDGVDYWIAANSWGTGWGDKGYFKIRRGSDEAAFEEGFITVTADTASV 281

Query:   341 P 341
             P
Sbjct:   282 P 282


>UNIPROTKB|E9PI30 [details] [associations]
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            EMBL:AP001201 HGNC:HGNC:2546 IPI:IPI00984532
            ProteinModelPortal:E9PI30 SMR:E9PI30 Ensembl:ENST00000528419
            ArrayExpress:E9PI30 Bgee:E9PI30 Uniprot:E9PI30
        Length = 364

 Score = 222 (83.2 bits), Expect = 5.9e-22, Sum P(2) = 5.9e-22
 Identities = 72/262 (27%), Positives = 113/262 (43%)

Query:    54 EKHK-RFNVFKQNVMHVHQTNKMDK-PYKLKLNKFADMTNHEFASTYAGSKIKHHRMFQG 111
             E+H  R ++F  N+    +  + D    +  +  F+D+T  EF   Y      + R   G
Sbjct:    57 EEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQLYG-----YRRAAGG 111

Query:   112 TRGNGTFMYGKVT--SIPPSVDWRKKGS-VTAVKDQGQCGSCWAFSTIAAVEGINHIMTN 168
                 G  +  +    S+P S DWRK  S ++ +KDQ  C  CWA +    +E +  I   
Sbjct:   112 VPSMGREIRSEEPEESVPFSCDWRKVASAISPIKDQKNCNCCWAMAAAGNIETLWRISFW 171

Query:   169 KLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGT--CDVSK 226
               V +S QEL+DC      GC+GG +  AF  +    G+ +E  YP+Q       C   K
Sbjct:   172 DFVDVSVQELLDCGRC-GDGCHGGFVWDAFITVLNNSGLASEKDYPFQGKVRAHRCHPKK 230

Query:   227 ESSPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFTGE---CGT 282
                 A   D    +  N+E  + + +A   P++V I+      Q Y +GV       C  
Sbjct:   231 YQKVAWIQDFI--MLQNNEHRIAQYLATYGPITVTINM--KPLQLYRKGVIKATPTTCDP 286

Query:   283 EL-NHGVAAVGYGTTLDGTKYW 303
             +L +H V  VG+G+       W
Sbjct:   287 QLVDHSVLLVGFGSVKSEEGIW 308

 Score = 85 (35.0 bits), Expect = 5.9e-22, Sum P(2) = 5.9e-22
 Identities = 17/33 (51%), Positives = 21/33 (63%)

Query:   300 TKYWIVRNSWGPEWGEK-GYIRMQRGISDKKGL 331
             T YWI++NSWG +WGEK   I   RG   + GL
Sbjct:   324 TPYWILKNSWGAQWGEKVSVIYWGRG-QGRTGL 355


>DICTYBASE|DDB_G0280187 [details] [associations]
            symbol:DDB_G0280187 "cathepsin Z-like protein"
            species:44689 "Dictyostelium discoideum" [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            dictyBase:DDB_G0280187 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            EMBL:AAFI02000035 KO:K08568 RefSeq:XP_641294.1
            ProteinModelPortal:Q54VR1 MEROPS:C01.A61 PRIDE:Q54VR1
            EnsemblProtists:DDB0233838 GeneID:8622427 KEGG:ddi:DDB_G0280187
            InParanoid:Q54VR1 OMA:VWKVGDY Uniprot:Q54VR1
        Length = 291

 Score = 175 (66.7 bits), Expect = 7.3e-21, Sum P(2) = 7.3e-21
 Identities = 35/91 (38%), Positives = 58/91 (63%)

Query:   238 ENVPANHEDALLKAV-AKQPVSVAIDAGSSDFQFYSEGVFTGECGT--ELNHGVAAVGYG 294
             E+   N   A+++ + A+ P++  ++   + F+ Y+ GVFT   G+  E+NH ++ +G+G
Sbjct:   185 EHGQVNGSVAMMQEIFARGPIACGMEVTDA-FESYTSGVFTSSVGSTGEINHEISIIGWG 243

Query:   295 TTLDGTKYWIVRNSWGPEWGEKGYIRMQRGI 325
             T  +G  YWI RNSWG  +GE G+ R+QRGI
Sbjct:   244 TE-NGVDYWIGRNSWGTYFGELGFFRIQRGI 273

 Score = 127 (49.8 bits), Expect = 7.3e-21, Sum P(2) = 7.3e-21
 Identities = 37/108 (34%), Positives = 54/108 (50%)

Query:   125 SIPPSVDWRK-KGS--VTAVKDQG---QCGSCWAFSTIAAVEG---INHIMTNKLVSLSE 175
             ++P   DWR   GS  +T  ++Q     CGSCWA  T +A+     I    T   V L+ 
Sbjct:    48 TLPTQYDWRNISGSSYITITRNQHLPQYCGSCWAHGTTSALGDRIKIGRKGTFPEVVLAP 107

Query:   176 QELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCD 223
             Q L++C    N  C+GG    A+ ++  KG +T E   PY+A D  C+
Sbjct:   108 QVLLNCAGPDNT-CDGGDPTEAYAYMAAKG-ITDETCAPYEAIDNECN 153


>TAIR|locus:2133402 [details] [associations]
            symbol:AT4G01610 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0005829 "cytosol" evidence=IDA] [GO:0000902 "cell
            morphogenesis" evidence=RCA] [GO:0006635 "fatty acid
            beta-oxidation" evidence=RCA] [GO:0010162 "seed dormancy process"
            evidence=RCA] [GO:0016049 "cell growth" evidence=RCA] [GO:0048193
            "Golgi vesicle transport" evidence=RCA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0005773 EMBL:CP002687
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 eggNOG:NOG315657
            HOGENOM:HOG000241341 KO:K01363 PANTHER:PTHR12411:SF16 OMA:DAIPDHF
            HSSP:P07858 ProtClustDB:CLSN2687619 EMBL:AF370193 EMBL:AY065167
            EMBL:AY114015 EMBL:AY086034 EMBL:AF083797 EMBL:BT001190
            EMBL:AK175280 EMBL:AK175481 EMBL:AK175539 EMBL:AK176165
            EMBL:AK176244 EMBL:AK176281 EMBL:AK176330 EMBL:AK176416
            EMBL:AK176433 EMBL:AK176487 EMBL:AK221398 EMBL:AK230235
            IPI:IPI00530811 RefSeq:NP_567215.1 UniGene:At.24471
            ProteinModelPortal:Q94K85 SMR:Q94K85 STRING:Q94K85 MEROPS:C01.144
            PaxDb:Q94K85 PRIDE:Q94K85 EnsemblPlants:AT4G01610.1 GeneID:826792
            KEGG:ath:AT4G01610 TAIR:At4g01610 InParanoid:Q94K85
            PhylomeDB:Q94K85 Genevestigator:Q94K85 Uniprot:Q94K85
        Length = 359

 Score = 174 (66.3 bits), Expect = 7.4e-21, Sum P(2) = 7.4e-21
 Identities = 39/121 (32%), Positives = 60/121 (49%)

Query:   240 VPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTGECGTELN-HGVAAVGYGTTLD 298
             V +N +D + +     PV V+      DF  Y  GV+    G+ +  H V  +G+GT+ +
Sbjct:   240 VKSNPQDIMAEVYKNGPVEVSFTV-YEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSE 298

Query:   299 GTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAME--ASYPIKKSATN-PTGPSDY 355
             G  YW++ N W   WG+ GY  ++RG ++    CGI  E  A  P  K+     TG +D 
Sbjct:   299 GEDYWLMANQWNRGWGDDGYFMIRRGTNE----CGIEDEPVAGLPSSKNVFRVDTGSNDL 354

Query:   356 P 356
             P
Sbjct:   355 P 355

 Score = 135 (52.6 bits), Expect = 7.4e-21, Sum P(2) = 7.4e-21
 Identities = 43/148 (29%), Positives = 66/148 (44%)

Query:    79 YKLKLN-KFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVD----WR 133
             +K  +N +F++ T  EF     G K    + F G        +     +P + D    W 
Sbjct:    59 WKAAINDRFSNATVAEF-KRLLGVKPTPKKHFLGVP---IVSHDPSLKLPKAFDARTAWP 114

Query:   134 KKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCDTDQ-NQGCNGG 192
             +  S+  + DQG CGSCWAF  + ++     I     +SLS  +L+ C   +   GC+GG
Sbjct:   115 QCTSIGNILDQGHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLACCGFRCGDGCDGG 174

Query:   193 LMELAFEFIKKKGGVTTEAKYPYQANDG 220
                 A+++    G VT E   PY  N G
Sbjct:   175 YPIAAWQYFSYSGVVTEECD-PYFDNTG 201


>UNIPROTKB|E2R6Q7 [details] [associations]
            symbol:CTSB "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730 "nucleolus"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790
            GO:GO:0005764 GO:GO:0004197 CTD:1508 GeneTree:ENSGT00560000076599
            KO:K01363 OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16
            EMBL:AAEX03014318 RefSeq:XP_543203.3 Ensembl:ENSCAFT00000012692
            GeneID:486077 KEGG:cfa:486077 NextBio:20859923 Uniprot:E2R6Q7
        Length = 339

 Score = 155 (59.6 bits), Expect = 8.1e-20, Sum P(2) = 8.1e-20
 Identities = 38/115 (33%), Positives = 55/115 (47%)

Query:   229 SPAVSIDGHE-----NVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTGECGTE 283
             SP+   D H      +V  N ++ + +     PV  A     SDF  Y  GV+    G  
Sbjct:   216 SPSYKEDKHYGCSSYSVSDNEKEIMAEIYKNGPVEAAFTV-YSDFLLYKSGVYQHVTGEM 274

Query:   284 LN-HGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAME 337
             +  H V  +G+G   DGT YW+V NSW  +WG+ G+ ++ RG    +  CGI  E
Sbjct:   275 MGGHAVRILGWGVE-DGTPYWLVGNSWNTDWGDNGFFKILRG----RDHCGIESE 324

 Score = 146 (56.5 bits), Expect = 8.1e-20, Sum P(2) = 8.1e-20
 Identities = 34/96 (35%), Positives = 51/96 (53%)

Query:   120 YGKVTSIPPSVD----WRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSL-- 173
             + K   +P S D    W    ++  ++DQG CGSCWAF  + A+     I TN  V++  
Sbjct:    74 FAKNLILPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEV 133

Query:   174 SEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVT 208
             S ++++ C  DQ   GCNGG    A+ F  K+G V+
Sbjct:   134 SAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQGLVS 169


>ZFIN|ZDB-GENE-040426-2650 [details] [associations]
            symbol:ctsba "cathepsin B, a" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0031101 "fin regeneration"
            evidence=IEP] [GO:0008233 "peptidase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 ZFIN:ZDB-GENE-040426-2650 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790
            GO:GO:0004197 GO:GO:0031101 MEROPS:C01.060 HOVERGEN:HBG003480
            PANTHER:PTHR12411:SF16 HSSP:P07688 EMBL:BC044517 IPI:IPI00485996
            UniGene:Dr.3374 ProteinModelPortal:Q803E4 SMR:Q803E4 STRING:Q803E4
            PRIDE:Q803E4 InParanoid:Q803E4 ArrayExpress:Q803E4 Bgee:Q803E4
            Uniprot:Q803E4
        Length = 330

 Score = 156 (60.0 bits), Expect = 1.4e-19, Sum P(2) = 1.4e-19
 Identities = 40/124 (32%), Positives = 56/124 (45%)

Query:   222 CDVSKES--SPAVSIDGH-----ENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEG 274
             CD+  E   SP    D H      +VP+N    + +     PV  A      DF  Y  G
Sbjct:   206 CDMKCEPGYSPLYKEDKHFGKTSYSVPSNQNGIMAELFKNGPVEAAFTV-YEDFLLYKSG 264

Query:   275 VFTGECGTELN-HGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCG 333
             V+    G+ L  H +  +G+G   +G  YW+  NSW  +WG+ GY ++ RG  D    CG
Sbjct:   265 VYQHMSGSALGGHAIKILGWGEE-NGVPYWLAANSWNTDWGDNGYFKILRG-EDH---CG 319

Query:   334 IAME 337
             I  E
Sbjct:   320 IESE 323

 Score = 142 (55.0 bits), Expect = 1.4e-19, Sum P(2) = 1.4e-19
 Identities = 33/95 (34%), Positives = 47/95 (49%)

Query:   120 YGKVTSIPPSVD----WRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVS--L 173
             Y +   +P + D    W    ++  ++DQG CGSCWAF    A+     I +N  VS  +
Sbjct:    73 YTEGLKLPKNFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIQSNAKVSVEI 132

Query:   174 SEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVT 208
             S Q+L+ C      GCNGG    A++F    G VT
Sbjct:   133 SSQDLLTCCDSCGMGCNGGYPSAAWDFWTTDGLVT 167


>WB|WBGene00000784 [details] [associations]
            symbol:cpr-4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39895 EMBL:L39926 EMBL:FO081381
            PIR:T37280 RefSeq:NP_504682.1 UniGene:Cel.5404
            ProteinModelPortal:P43508 SMR:P43508 DIP:DIP-25376N
            MINT:MINT-1069892 STRING:P43508 MEROPS:C01.A34 PaxDb:P43508
            EnsemblMetazoa:F44C4.3 GeneID:179053 KEGG:cel:CELE_F44C4.3
            UCSC:F44C4.3 CTD:179053 WormBase:F44C4.3 InParanoid:P43508
            OMA:CCGFLCG NextBio:903704 Uniprot:P43508
        Length = 335

 Score = 162 (62.1 bits), Expect = 1.9e-19, Sum P(2) = 1.9e-19
 Identities = 35/84 (41%), Positives = 45/84 (53%)

Query:   252 VAKQPVSVAIDAGSSDFQFYSEGVFTGECGTELN-HGVAAVGYGTTLDGTKYWIVRNSWG 310
             +A  PV  A      DF  Y  GV+    G EL  H +  +G+GT  +GT YW+V NSW 
Sbjct:   247 IAHGPVEAAFTV-YEDFYQYKTGVYVHTTGQELGGHAIRILGWGTD-NGTPYWLVANSWN 304

Query:   311 PEWGEKGYIRMQRGISDKKGLCGI 334
               WGE GY R+ RG ++    CGI
Sbjct:   305 VNWGENGYFRIIRGTNE----CGI 324

 Score = 134 (52.2 bits), Expect = 1.9e-19, Sum P(2) = 1.9e-19
 Identities = 39/130 (30%), Positives = 64/130 (49%)

Query:   125 SIPPSVD----WRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVS--LSEQEL 178
             +IP + D    W    S+  ++DQ  CGSCWAF+   A      I +N  V+  LS +++
Sbjct:    80 TIPATFDARTQWPNCMSINNIRDQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDV 139

Query:   179 VDCDTDQNQGCNGGLMELAFEFIKKKG---GVTTEAKY---PYQANDGTCDVSKESSPAV 232
             + C ++   GC GG    A++++ K G   G + EA++   PY        V   + P+ 
Sbjct:   140 LSCCSNCGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSC 199

Query:   233 SIDGHENVPA 242
               DG++  PA
Sbjct:   200 PDDGYDT-PA 208


>FB|FBgn0034709 [details] [associations]
            symbol:Swim "Secreted Wg-interacting molecule" species:7227
            "Drosophila melanogaster" [GO:0004197 "cysteine-type endopeptidase
            activity" evidence=ISS] [GO:0006955 "immune response" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044 "scavenger
            receptor activity" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0042600 "chorion" evidence=IDA]
            [GO:0035593 "positive regulation of Wnt receptor signaling pathway
            by establishment of Wnt protein localization to extracellular
            region" evidence=IMP] [GO:0030177 "positive regulation of Wnt
            receptor signaling pathway" evidence=IDA] [GO:0005615
            "extracellular space" evidence=IDA] [GO:0017147 "Wnt-protein
            binding" evidence=IDA] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR013128 Pfam:PF00112 PROSITE:PS50958 SMART:SM00201
            SMART:SM00645 EMBL:AE013599 GO:GO:0005615 GO:GO:0006955
            GO:GO:0030247 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GO:GO:0017147 GO:GO:0005044
            GeneTree:ENSGT00560000076599 GO:GO:0042600 eggNOG:NOG310046
            OMA:DNCNRCT HSSP:P80067 EMBL:AY113377 RefSeq:NP_611652.2
            RefSeq:NP_726176.1 UniGene:Dm.732 SMR:Q7JWQ7 IntAct:Q7JWQ7
            EnsemblMetazoa:FBtr0071784 EnsemblMetazoa:FBtr0071785 GeneID:37537
            KEGG:dme:Dmel_CG3074 UCSC:CG3074-RA FlyBase:FBgn0034709
            HOGENOM:HOG000264150 InParanoid:Q7JWQ7 OrthoDB:EOG48CZ9P
            GenomeRNAi:37537 NextBio:804155 GO:GO:0035593 Uniprot:Q7JWQ7
        Length = 431

 Score = 155 (59.6 bits), Expect = 2.3e-19, Sum P(2) = 2.3e-19
 Identities = 42/128 (32%), Positives = 68/128 (53%)

Query:   126 IPPSVDWRKKGS--VTAVKDQGQCGSCWAFSTIAAVEGINHIMTN--KLVSLSEQELVDC 181
             +P S +   K S  ++ V DQG CG+ W  ST +       I +   + V LS Q ++ C
Sbjct:   187 LPSSFNALDKWSSYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNILSC 246

Query:   182 DTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPAVSIDGHENVP 241
              T + QGC GG ++ A+ ++ KKG V  E  YPY  +  TC + + +S ++  +G +  P
Sbjct:   247 -TRRQQGCEGGHLDAAWRYLHKKG-VVDENCYPYTQHRDTCKI-RHNSRSLRANGCQK-P 302

Query:   242 ANHE-DAL 248
              N + D+L
Sbjct:   303 VNVDRDSL 310

 Score = 146 (56.5 bits), Expect = 2.3e-19, Sum P(2) = 2.3e-19
 Identities = 41/113 (36%), Positives = 53/113 (46%)

Query:   243 NHE-DALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTGECGTELN----HGVAAVGYGTTL 297
             N E D + +     PV   +   + DF  YS GV+             H V  VG+G   
Sbjct:   320 NREADIMAEIFHSGPVQATMRV-NRDFFAYSGGVYRETAANRKAPTGFHSVKLVGWGEEH 378

Query:   298 DGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIA--MEASYPIKKSATN 348
             +G KYWI  NSWG  WGE GY R+ RG ++    CGI   + AS+P   S  N
Sbjct:   379 NGEKYWIAANSWGSWWGEHGYFRILRGSNE----CGIEEYVLASWPYVYSYYN 427


>ZFIN|ZDB-GENE-070323-1 [details] [associations]
            symbol:ctsbb "cathepsin B, b" species:7955 "Danio
            rerio" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-070323-1 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197
            GeneTree:ENSGT00560000076599 PANTHER:PTHR12411:SF16 OMA:CCGFLCG
            EMBL:CU207296 EMBL:CABZ01037785 IPI:IPI00877452
            Ensembl:ENSDART00000097263 Bgee:F1QZT5 Uniprot:F1QZT5
        Length = 326

 Score = 162 (62.1 bits), Expect = 4.2e-19, Sum P(2) = 4.2e-19
 Identities = 34/100 (34%), Positives = 51/100 (51%)

Query:   239 NVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTGECGTELN-HGVAAVGYGTTL 297
             NVP++ +  + +     PV  A      DF  Y  GV+    G+ L  H V  +G+G   
Sbjct:   225 NVPSDQQQIMTELYTNGPVEAAFTV-YEDFPLYKSGVYQHLTGSALGGHAVKILGWGEE- 282

Query:   298 DGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAME 337
             +GT +W+V NSW  +WG+ GY ++ RG  +    CGI  E
Sbjct:   283 NGTPFWLVANSWNSDWGDNGYFKILRGHDE----CGIESE 318

 Score = 130 (50.8 bits), Expect = 4.2e-19, Sum P(2) = 4.2e-19
 Identities = 29/107 (27%), Positives = 51/107 (47%)

Query:   108 MFQGTRGNGTFMYGKVTSIPPSVD----WRKKGSVTAVKDQGQCGSCWAFSTIAAVEG-- 161
             + +G R   T  +     +P S D    W    ++  ++DQG CGSCWAF  + ++    
Sbjct:    57 VLKGPRLPHTVKHSTNVKLPDSFDLRDQWPNCKTLNQIRDQGSCGSCWAFGAVESISDRI 116

Query:   162 INHIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVT 208
               H    +   +S ++L+ C      GC+GG    A+++ ++ G VT
Sbjct:   117 CIHSKGKQSPEISAEDLLSCCDQCGFGCSGGFPAEAWDYWRRSGLVT 163


>WB|WBGene00010204 [details] [associations]
            symbol:F57F5.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0006898
            "receptor-mediated endocytosis" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0009792 GO:GO:0006898
            GO:GO:0040007 GO:GO:0002119 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0040011
            EMBL:Z75953 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            PANTHER:PTHR12411:SF16 RefSeq:NP_506011.2 ProteinModelPortal:Q20950
            SMR:Q20950 DIP:DIP-24447N IntAct:Q20950 MINT:MINT-211137
            STRING:Q20950 MEROPS:C01.A42 EnsemblMetazoa:F57F5.1 GeneID:179645
            KEGG:cel:CELE_F57F5.1 UCSC:F57F5.1 CTD:179645 WormBase:F57F5.1
            OMA:ADDINAC Uniprot:Q20950
        Length = 351

 Score = 165 (63.1 bits), Expect = 9.2e-19, Sum P(2) = 9.2e-19
 Identities = 34/80 (42%), Positives = 45/80 (56%)

Query:   256 PVSVAIDAGSSDFQFYSEGVFTGECGTELN-HGVAAVGYGTTLDGTKYWIVRNSWGPEWG 314
             PV VA      DF+ YS GV+    G  L  H V  +G+G   +GT YW+  NSW  +WG
Sbjct:   267 PVEVAFTV-YEDFEHYSGGVYVHTAGASLGGHAVKMLGWGVD-NGTPYWLCANSWNEDWG 324

Query:   315 EKGYIRMQRGISDKKGLCGI 334
             E GY R+ RG+++    CGI
Sbjct:   325 ENGYFRIIRGVNE----CGI 340

 Score = 125 (49.1 bits), Expect = 9.2e-19, Sum P(2) = 9.2e-19
 Identities = 40/151 (26%), Positives = 65/151 (43%)

Query:    69 VHQTNKMDKPYKLKLNKFADMTNHEFASTYAGSKI----KHHRMFQGTRGNGTFMYGKVT 124
             V   NK+   +K +L  +             G+K+    + +R+F+ T         +  
Sbjct:    41 VDYVNKVQTSFKAELGSYFSSYPDTIKKQLMGAKMVEIPEEYRVFEMTHPEV-----EDA 95

Query:   125 SIPPSVD----WRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNK--LVSLSEQEL 178
             ++P S D    W    S++ ++DQ  CGSCWA S    +     I +N   ++S+S  ++
Sbjct:    96 AVPDSFDSRTAWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNAKTILSISADDI 155

Query:   179 -VDCDTDQNQGCNGGLMELAFEFIKKKGGVT 208
                C      GCNGG    A+    KKG VT
Sbjct:   156 NACCGMVCGNGCNGGYPIEAWRHYVKKGYVT 186


>DICTYBASE|DDB_G0283401 [details] [associations]
            symbol:ctsZ "cathepsin Z precursor" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            dictyBase:DDB_G0283401 GO:GO:0005615 GenomeReviews:CM000153_GR
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 EMBL:AAFI02000055 KO:K08568 OMA:QCGTCTE
            eggNOG:NOG275763 RefSeq:XP_639036.1 ProteinModelPortal:Q54R55
            IntAct:Q54R55 MEROPS:C01.A60 PRIDE:Q54R55
            EnsemblProtists:DDB0233836 GeneID:8624061 KEGG:ddi:DDB_G0283401
            InParanoid:Q54R55 Uniprot:Q54R55
        Length = 296

 Score = 226 (84.6 bits), Expect = 1.0e-18, P = 1.0e-18
 Identities = 66/213 (30%), Positives = 108/213 (50%)

Query:   147 CGSCWAFSTIAAVEGINHIMTNKL---VSLSEQELVDCDTDQNQGCNGGLMELAFEFIKK 203
             CG CWAF++ +++     I        V+++ Q L+DC+      C+GG    AF FI +
Sbjct:    85 CGGCWAFASTSSISDRIKIQRKAAFPDVNVAPQHLIDCNGGGT--CDGGDPGDAFAFINE 142

Query:   204 KGGVTTEAKYPYQAND--GTCDVS-KESSP---AVSIDGHENVPANH-------EDALLK 250
              G V    K PYQA +    C  + K  +P     +I  H N+           +D + +
Sbjct:   143 NGIVDETCK-PYQAKNLPDECSPACKTCNPDGTCQAIPVHTNITVTEYGSVRGAKDMMAE 201

Query:   251 AVAKQPVSVAIDAGSSDFQFYSEGVFTGECGTEL-NHGVAAVGYGTTLDGTKYWIVRNSW 309
               A+ P++ +IDA +S  + Y+ G+F       L NH ++ +G+G   D T YWIVRNSW
Sbjct:   202 IYARGPIACSIDA-TSKLEAYTSGIFKEFKLDPLPNHIISVIGWGVQ-DSTPYWIVRNSW 259

Query:   310 GPEWGEKGYIRMQRGISDKKGLCGIAMEASYPI 342
             G  +GE G+  + +G S  + L GI ++ ++ +
Sbjct:   260 GSYYGEGGFFNIVQG-SLFENL-GIELDCNWAV 290

 Score = 119 (46.9 bits), Expect = 0.00014, P = 0.00014
 Identities = 38/115 (33%), Positives = 55/115 (47%)

Query:   126 IPPSVDWRKKGSV---TAVKDQG---QCGSCWAFSTIAAVEGINHIMTNKL---VSLSEQ 176
             +P S DWR    V   T  ++Q     CG CWAF++ +++     I        V+++ Q
Sbjct:    58 VPQSWDWRNVSGVNYLTMNRNQHIPQYCGGCWAFASTSSISDRIKIQRKAAFPDVNVAPQ 117

Query:   177 ELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESSPA 231
              L+DC+      C+GG    AF FI + G V    K PYQA +    +  E SPA
Sbjct:   118 HLIDCNGGGT--CDGGDPGDAFAFINENGIVDETCK-PYQAKN----LPDECSPA 165


>UNIPROTKB|P07688 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9913 "Bos taurus"
            [GO:0042470 "melanosome" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730
            "nucleolus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0042470
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 EMBL:L06075 EMBL:M64620
            EMBL:U16336 EMBL:U16337 EMBL:U16338 EMBL:U16339 EMBL:U16341
            EMBL:U16342 EMBL:U16343 EMBL:BC102997 IPI:IPI00692061 PIR:S38328
            RefSeq:NP_776456.1 UniGene:Bt.393 PDB:1ITO PDB:1QDQ PDB:1SP4
            PDB:2DC6 PDB:2DC7 PDB:2DC8 PDB:2DC9 PDB:2DCA PDB:2DCB PDB:2DCC
            PDB:2DCD PDBsum:1ITO PDBsum:1QDQ PDBsum:1SP4 PDBsum:2DC6
            PDBsum:2DC7 PDBsum:2DC8 PDBsum:2DC9 PDBsum:2DCA PDBsum:2DCB
            PDBsum:2DCC PDBsum:2DCD ProteinModelPortal:P07688 SMR:P07688
            STRING:P07688 MEROPS:C01.060 PRIDE:P07688
            Ensembl:ENSBTAT00000036795 GeneID:281105 KEGG:bta:281105 CTD:1508
            eggNOG:NOG315657 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 InParanoid:P07688 KO:K01363 OMA:GYPSGAW
            OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 BindingDB:P07688
            ChEMBL:CHEMBL2323 EvolutionaryTrace:P07688 NextBio:20805177
            ArrayExpress:P07688 GO:GO:0097067 PANTHER:PTHR12411:SF16
            Uniprot:P07688
        Length = 335

 Score = 152 (58.6 bits), Expect = 1.5e-18, Sum P(2) = 1.5e-18
 Identities = 41/123 (33%), Positives = 59/123 (47%)

Query:   221 TCDVSKESSPAVSIDGHENVP----ANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGV 275
             TC+     SP+   D H        AN+E  ++  + K  PV  A     SDF  Y  GV
Sbjct:   210 TCEPGY--SPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSV-YSDFLLYKSGV 266

Query:   276 FTGECGTELN-HGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGI 334
             +    G  +  H +  +G+G   +GT YW+V NSW  +WG+ G+ ++ RG  D    CGI
Sbjct:   267 YQHVSGEIMGGHAIRILGWGVE-NGTPYWLVGNSWNTDWGDNGFFKILRG-QDH---CGI 321

Query:   335 AME 337
               E
Sbjct:   322 ESE 324

 Score = 137 (53.3 bits), Expect = 1.5e-18, Sum P(2) = 1.5e-18
 Identities = 32/90 (35%), Positives = 46/90 (51%)

Query:   126 IPPSVD----WRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSL---SEQEL 178
             +P S D    W    ++  ++DQG CGSCWAF  + A+     I +N  V++   +E  L
Sbjct:    80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 139

Query:   179 VDCDTDQNQGCNGGLMELAFEFIKKKGGVT 208
               C  +   GCNGG    A+ F  KKG V+
Sbjct:   140 TCCGGECGDGCNGGFPSGAWNFWTKKGLVS 169


>TAIR|locus:2204873 [details] [associations]
            symbol:AT1G02300 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002684 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 KO:K01363
            PANTHER:PTHR12411:SF16 OMA:ADDINAC IPI:IPI00534431
            RefSeq:NP_563647.1 UniGene:At.43952 ProteinModelPortal:F4HVZ1
            SMR:F4HVZ1 MEROPS:C01.A10 EnsemblPlants:AT1G02300.1 GeneID:839576
            KEGG:ath:AT1G02300 ArrayExpress:F4HVZ1 Uniprot:F4HVZ1
        Length = 379

 Score = 172 (65.6 bits), Expect = 1.6e-18, Sum P(2) = 1.6e-18
 Identities = 38/122 (31%), Positives = 61/122 (50%)

Query:   236 GHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTGECGTELN-HGVAAVGYG 294
             G   +  + +D + +     PV VA      DF  Y  GV+    GT++  H V  +G+G
Sbjct:   256 GAYRINPDPQDIMAEVYKNGPVEVAFTV-YEDFAHYKSGVYKYITGTKIGGHAVKLIGWG 314

Query:   295 TTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGI--AMEASYPIKKSATNPTGP 352
             T+ DG  YW++ N W   WG+ GY +++RG ++    CGI  ++ A  P +K+       
Sbjct:   315 TSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNE----CGIEQSVVAGLPSEKNVFKGITT 370

Query:   353 SD 354
             SD
Sbjct:   371 SD 372

 Score = 116 (45.9 bits), Expect = 1.6e-18, Sum P(2) = 1.6e-18
 Identities = 30/77 (38%), Positives = 38/77 (49%)

Query:   145 GQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDC-DTDQNQGCNGGLMELAFEFIKK 203
             G CGSCWAF  + ++     I  N  VSLS  +++ C       GCNGG    A+ + K 
Sbjct:   146 GHCGSCWAFGAVESLSDRFCIKYNLNVSLSANDVIACCGLLCGFGCNGGFPMGAWLYFKY 205

Query:   204 KGGVTTEAKYPYQANDG 220
              G VT E   PY  N G
Sbjct:   206 HGVVTQECD-PYFDNTG 221


>MGI|MGI:88561 [details] [associations]
            symbol:Ctsb "cathepsin B" species:10090 "Mus musculus"
            [GO:0004175 "endopeptidase activity" evidence=ISO] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISO] [GO:0005576
            "extracellular region" evidence=ISO] [GO:0005615 "extracellular
            space" evidence=ISO] [GO:0005737 "cytoplasm" evidence=ISO]
            [GO:0005739 "mitochondrion" evidence=ISO;IDA] [GO:0005764
            "lysosome" evidence=ISO;IDA] [GO:0005901 "caveola" evidence=ISO]
            [GO:0006508 "proteolysis" evidence=ISO] [GO:0008233 "peptidase
            activity" evidence=ISO] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISO] [GO:0009897 "external side of plasma
            membrane" evidence=ISO] [GO:0009986 "cell surface" evidence=ISO]
            [GO:0016324 "apical plasma membrane" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0030984 "kininogen binding"
            evidence=ISO] [GO:0032403 "protein complex binding" evidence=ISO]
            [GO:0042277 "peptide binding" evidence=ISO] [GO:0042383
            "sarcolemma" evidence=ISO] [GO:0043621 "protein self-association"
            evidence=ISO] [GO:0048471 "perinuclear region of cytoplasm"
            evidence=ISO] [GO:0050790 "regulation of catalytic activity"
            evidence=IEA] [GO:0060548 "negative regulation of cell death"
            evidence=ISO] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 MGI:MGI:88561
            GO:GO:0005739 GO:GO:0042470 GO:GO:0048471 GO:GO:0005615
            GO:GO:0009612 GO:GO:0009611 GO:GO:0009897 GO:GO:0045471
            GO:GO:0016324 GO:GO:0009749 GO:GO:0006914 GO:GO:0043434
            eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0042383 GO:GO:0014070
            GO:GO:0042277 GO:GO:0060548 GO:GO:0005901 GO:GO:0014075
            GO:GO:0004197 GO:GO:0070670 GO:GO:0007519 MEROPS:C01.060 CTD:1508
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 KO:K01363 OMA:GYPSGAW OrthoDB:EOG4K6G4C
            BRENDA:3.4.22.1 GO:GO:0097067 PANTHER:PTHR12411:SF16 ChiTaRS:CTSB
            EMBL:M65270 EMBL:M65263 EMBL:M65264 EMBL:M65265 EMBL:M65266
            EMBL:M65267 EMBL:M65268 EMBL:M65269 EMBL:M14222 EMBL:X54966
            EMBL:S69034 EMBL:AK083393 EMBL:AK147192 EMBL:AK149884 EMBL:AK151790
            EMBL:AK167361 EMBL:BC006656 IPI:IPI00113517 PIR:A38458
            RefSeq:NP_031824.1 UniGene:Mm.236553 UniGene:Mm.489070
            ProteinModelPortal:P10605 SMR:P10605 IntAct:P10605 STRING:P10605
            PhosphoSite:P10605 SWISS-2DPAGE:P10605 PaxDb:P10605 PRIDE:P10605
            Ensembl:ENSMUST00000006235 GeneID:13030 KEGG:mmu:13030
            UCSC:uc007uhh.1 InParanoid:P10605 BioCyc:MetaCyc:MONOMER-14810
            BindingDB:P10605 ChEMBL:CHEMBL5187 NextBio:282900 Bgee:P10605
            CleanEx:MM_CTSB Genevestigator:P10605 GermOnline:ENSMUSG00000021939
            Uniprot:P10605
        Length = 339

 Score = 150 (57.9 bits), Expect = 1.8e-18, Sum P(2) = 1.8e-18
 Identities = 37/105 (35%), Positives = 56/105 (53%)

Query:   111 GTRGNGTFMYGKVTSIPPSVDWRKKGS----VTAVKDQGQCGSCWAFSTIAAVEGINHIM 166
             G +  G   +G+   +P + D R++ S    +  ++DQG CGSCWAF  + A+     I 
Sbjct:    65 GPKLPGRVAFGEDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIH 124

Query:   167 TNKLVSL--SEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVT 208
             TN  V++  S ++L+ C   Q   GCNGG    A+ F  KKG V+
Sbjct:   125 TNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVS 169

 Score = 139 (54.0 bits), Expect = 1.8e-18, Sum P(2) = 1.8e-18
 Identities = 38/124 (30%), Positives = 58/124 (46%)

Query:   222 CDVSKES--SPAVSIDGHENVPA----NHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEG 274
             C+ S E+  SP+   D H    +    N    ++  + K  PV  A     SDF  Y  G
Sbjct:   207 CNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTV-FSDFLTYKSG 265

Query:   275 VFTGECGTELN-HGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCG 333
             V+  E G  +  H +  +G+G   +G  YW+  NSW  +WG+ G+ ++ RG    +  CG
Sbjct:   266 VYKHEAGDMMGGHAIRILGWGVE-NGVPYWLAANSWNLDWGDNGFFKILRG----ENHCG 320

Query:   334 IAME 337
             I  E
Sbjct:   321 IESE 324


>WB|WBGene00021072 [details] [associations]
            symbol:W07B8.4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:FO081739 PIR:T31728 RefSeq:NP_503382.1
            HSSP:P53634 ProteinModelPortal:O16288 SMR:O16288 STRING:O16288
            MEROPS:C01.A39 PaxDb:O16288 EnsemblMetazoa:W07B8.4 GeneID:178611
            KEGG:cel:CELE_W07B8.4 UCSC:W07B8.4 CTD:178611 WormBase:W07B8.4
            InParanoid:O16288 OMA:ESQYGCK NextBio:901836 Uniprot:O16288
        Length = 335

 Score = 165 (63.1 bits), Expect = 1.8e-18, Sum P(2) = 1.8e-18
 Identities = 36/88 (40%), Positives = 46/88 (52%)

Query:   252 VAKQPVSVAIDAGSSDFQFYSEGVFTGECGTELN-HGVAAVGYGTTLDGTKYWIVRNSWG 310
             +A  PV V       DF  Y  G++T   G EL  H V  +G+G   +GT YW+  NSW 
Sbjct:   243 LAHGPVEVGFIV-YEDFYLYKTGIYTHVAGGELGGHAVKMLGWGVD-NGTPYWLAANSWN 300

Query:   311 PEWGEKGYIRMQRGISDKKGLCGIAMEA 338
               WGEKGY R+ RG+ +    CGI   A
Sbjct:   301 TVWGEKGYFRILRGVDE----CGIESAA 324

 Score = 121 (47.7 bits), Expect = 1.8e-18, Sum P(2) = 1.8e-18
 Identities = 33/93 (35%), Positives = 47/93 (50%)

Query:   125 SIPPSVD----WRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVS--LSEQEL 178
             SIP S D    W +  SV  ++DQ  CGSCWA +   A+     I +N  V+  LS +++
Sbjct:    72 SIPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVNTLLSAEDI 131

Query:   179 VDCDTDQ---NQGCNGGLMELAFEFIKKKGGVT 208
             + C T +     GC GG    A+ +  K G VT
Sbjct:   132 LTCCTGKFNCGDGCEGGYPIQAWRYWVKNGLVT 164


>UNIPROTKB|P07858 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9606 "Homo sapiens"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0005739 "mitochondrion" evidence=IEA] [GO:0042470 "melanosome"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042981 "regulation of apoptotic process" evidence=TAS]
            [GO:0006508 "proteolysis" evidence=IDA] [GO:0005764 "lysosome"
            evidence=IDA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEP] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IDA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IDA] [GO:0005622 "intracellular" evidence=TAS]
            [GO:0036021 "endolysosome lumen" evidence=TAS] [GO:0045087 "innate
            immune response" evidence=TAS] [GO:0008233 "peptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0005730 "nucleolus" evidence=IDA] [GO:0043231 "intracellular
            membrane-bounded organelle" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_118779 GO:GO:0005739
            GO:GO:0042470 GO:GO:0048471 Reactome:REACT_6900 GO:GO:0005615
            GO:GO:0009612 GO:GO:0009611 GO:GO:0005730 GO:GO:0042981
            GO:GO:0009897 GO:GO:0045471 GO:GO:0016324 GO:GO:0009749
            GO:GO:0006914 GO:GO:0043434 eggNOG:COG4870 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087
            GO:GO:0050790 GO:GO:0042383 GO:GO:0014070 GO:GO:0042277
            GO:GO:0060548 GO:GO:0005901 GO:GO:0014075 GO:GO:0004197
            GO:GO:0070670 EMBL:CH471157 GO:GO:0007519 MEROPS:C01.060 CTD:1508
            HOGENOM:HOG000241341 HOVERGEN:HBG003480 KO:K01363 OMA:GYPSGAW
            OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 GO:GO:0097067
            PANTHER:PTHR12411:SF16 EMBL:M14221 EMBL:L16510 EMBL:AK092070
            EMBL:AK075393 EMBL:BC010240 EMBL:BC095408 EMBL:M13230
            IPI:IPI00295741 PIR:A26498 RefSeq:NP_001899.1 RefSeq:NP_680090.1
            RefSeq:NP_680091.1 RefSeq:NP_680092.1 RefSeq:NP_680093.1
            UniGene:Hs.520898 PDB:1CSB PDB:1GMY PDB:1HUC PDB:1PBH PDB:2IPP
            PDB:2PBH PDB:3AI8 PDB:3CBJ PDB:3CBK PDB:3K9M PDB:3PBH PDBsum:1CSB
            PDBsum:1GMY PDBsum:1HUC PDBsum:1PBH PDBsum:2IPP PDBsum:2PBH
            PDBsum:3AI8 PDBsum:3CBJ PDBsum:3CBK PDBsum:3K9M PDBsum:3PBH
            ProteinModelPortal:P07858 SMR:P07858 DIP:DIP-42785N IntAct:P07858
            MINT:MINT-1397666 STRING:P07858 PhosphoSite:P07858 DMDM:68067549
            SWISS-2DPAGE:P07858 UCD-2DPAGE:P07858 PaxDb:P07858
            PeptideAtlas:P07858 PRIDE:P07858 DNASU:1508 Ensembl:ENST00000345125
            Ensembl:ENST00000353047 Ensembl:ENST00000434271
            Ensembl:ENST00000453527 Ensembl:ENST00000530640
            Ensembl:ENST00000531089 Ensembl:ENST00000533455
            Ensembl:ENST00000534510 GeneID:1508 KEGG:hsa:1508 UCSC:uc003wum.3
            GeneCards:GC08M011700 H-InvDB:HIX0007320 HGNC:HGNC:2527
            HPA:CAB000457 HPA:HPA018156 MIM:116810 neXtProt:NX_P07858
            PharmGKB:PA27027 InParanoid:P07858 PhylomeDB:P07858
            BindingDB:P07858 ChEMBL:CHEMBL4072 ChiTaRS:CTSB
            EvolutionaryTrace:P07858 GenomeRNAi:1508 NextBio:6235
            PMAP-CutDB:P07858 ArrayExpress:P07858 Bgee:P07858 CleanEx:HS_CTSB
            Genevestigator:P07858 GermOnline:ENSG00000164733 GO:GO:0036021
            Uniprot:P07858
        Length = 339

 Score = 147 (56.8 bits), Expect = 2.0e-18, Sum P(2) = 2.0e-18
 Identities = 34/98 (34%), Positives = 50/98 (51%)

Query:   242 ANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFTGECGTELN-HGVAAVGYGTTLDG 299
             +N E  ++  + K  PV  A     SDF  Y  GV+    G  +  H +  +G+G   +G
Sbjct:   233 SNSEKDIMAEIYKNGPVEGAFSV-YSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVE-NG 290

Query:   300 TKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAME 337
             T YW+V NSW  +WG+ G+ ++ RG  D    CGI  E
Sbjct:   291 TPYWLVANSWNTDWGDNGFFKILRG-QDH---CGIESE 324

 Score = 142 (55.0 bits), Expect = 2.0e-18, Sum P(2) = 2.0e-18
 Identities = 34/97 (35%), Positives = 51/97 (52%)

Query:   119 MYGKVTSIPPSVD----WRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSL- 173
             M+ +   +P S D    W +  ++  ++DQG CGSCWAF  + A+     I TN  VS+ 
Sbjct:    73 MFTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVE 132

Query:   174 -SEQELVDCDTDQ-NQGCNGGLMELAFEFIKKKGGVT 208
              S ++L+ C       GCNGG    A+ F  +KG V+
Sbjct:   133 VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVS 169


>WB|WBGene00000785 [details] [associations]
            symbol:cpr-5 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39896 EMBL:L39927 EMBL:FO081739
            PIR:T37277 RefSeq:NP_503383.1 UniGene:Cel.19730
            ProteinModelPortal:P43509 SMR:P43509 DIP:DIP-25329N IntAct:P43509
            MINT:MINT-1051285 STRING:P43509 MEROPS:C01.A35 PaxDb:P43509
            EnsemblMetazoa:W07B8.5 GeneID:178612 KEGG:cel:CELE_W07B8.5
            UCSC:W07B8.5.1 CTD:178612 WormBase:W07B8.5 InParanoid:P43509
            OMA:DAIPDHF NextBio:901840 Uniprot:P43509
        Length = 344

 Score = 163 (62.4 bits), Expect = 2.4e-18, Sum P(2) = 2.4e-18
 Identities = 35/84 (41%), Positives = 46/84 (54%)

Query:   256 PVSVAIDAGSSDFQFYSEGVFTGECGTELN-HGVAAVGYGTTLDGTKYWIVRNSWGPEWG 314
             P+ VA      DF  Y+ GV+    G  L  H V  +G+G   +GT YW+V NSW   WG
Sbjct:   256 PIEVAFTV-YEDFYQYTTGVYVHTAGASLGGHAVKILGWGVD-NGTPYWLVANSWNVAWG 313

Query:   315 EKGYIRMQRGISDKKGLCGIAMEA 338
             EKGY R+ RG+++    CGI   A
Sbjct:   314 EKGYFRIIRGLNE----CGIEHSA 333

 Score = 123 (48.4 bits), Expect = 2.4e-18, Sum P(2) = 2.4e-18
 Identities = 29/82 (35%), Positives = 42/82 (51%)

Query:   132 WRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVS--LSEQELVDCDTDQ---N 186
             W    S+  ++DQ  CGSCWAF+   A+     I +N  V+  LS ++L+ C T      
Sbjct:    92 WPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLLSCCTGMFSCG 151

Query:   187 QGCNGGLMELAFEFIKKKGGVT 208
              GC GG    A+++  K G VT
Sbjct:   152 NGCEGGYPIQAWKWWVKHGLVT 173

 Score = 39 (18.8 bits), Expect = 1.2e-09, Sum P(2) = 1.2e-09
 Identities = 10/25 (40%), Positives = 13/25 (52%)

Query:   228 SSPAVSIDGHENVPANHEDALLKAV 252
             ++ AV I GH   PA    AL+  V
Sbjct:    12 AASAVVIPGHREAPALTGQALIDYV 36


>UNIPROTKB|A1E295 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9823 "Sus scrofa"
            [GO:0005615 "extracellular space" evidence=IDA] [GO:0042470
            "melanosome" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730
            "nucleolus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0042470
            GO:GO:0005615 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 MEROPS:C01.060 CTD:1508
            GeneTree:ENSGT00560000076599 HOVERGEN:HBG003480 KO:K01363
            OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16 EMBL:EF095956
            RefSeq:NP_001090927.1 UniGene:Ssc.53773 ProteinModelPortal:A1E295
            SMR:A1E295 PRIDE:A1E295 Ensembl:ENSSSCT00000026923 GeneID:100037961
            KEGG:ssc:100037961 Uniprot:A1E295
        Length = 335

 Score = 142 (55.0 bits), Expect = 9.5e-18, Sum P(2) = 9.5e-18
 Identities = 32/90 (35%), Positives = 49/90 (54%)

Query:   126 IPPSVD----WRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSL--SEQELV 179
             +P S D    W    ++  ++DQG CGSCWAF  + A+     I +N  V++  S ++++
Sbjct:    80 LPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDML 139

Query:   180 DCDTDQ-NQGCNGGLMELAFEFIKKKGGVT 208
              C  D+   GCNGG    A+ F  KKG V+
Sbjct:   140 TCCGDECGDGCNGGFPSGAWNFWTKKGLVS 169

 Score = 141 (54.7 bits), Expect = 9.5e-18, Sum P(2) = 9.5e-18
 Identities = 32/100 (32%), Positives = 50/100 (50%)

Query:   239 NVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTGECGTELN-HGVAAVGYGTTL 297
             ++  N ++ + +     PV  A     SDF  Y  GV+    G  +  H +  +G+G   
Sbjct:   231 SISRNEKEIMAEIYKNGPVEGAFTV-YSDFLQYKSGVYQHVTGDLMGGHAIRILGWGVE- 288

Query:   298 DGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAME 337
             +GT YW+V NSW  +WG+ G+ ++ RG  D    CGI  E
Sbjct:   289 NGTPYWLVGNSWNTDWGDNGFFKILRG-QDH---CGIESE 324


>FB|FBgn0033873 [details] [associations]
            symbol:CG6337 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 EMBL:AE013599
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 HSSP:P80067 EMBL:AY084123
            RefSeq:NP_610905.1 UniGene:Dm.5230 SMR:Q7JYA0 IntAct:Q7JYA0
            EnsemblMetazoa:FBtr0087646 GeneID:36530 KEGG:dme:Dmel_CG6337
            UCSC:CG6337-RA FlyBase:FBgn0033873 eggNOG:NOG310593
            InParanoid:Q7JYA0 OMA:NRTTYRE OrthoDB:EOG4MCVFZ GenomeRNAi:36530
            NextBio:799041 Uniprot:Q7JYA0
        Length = 340

 Score = 230 (86.0 bits), Expect = 9.7e-18, P = 9.7e-18
 Identities = 88/321 (27%), Positives = 138/321 (42%)

Query:    18 IVEGFDF-HEKELESEEGLWDLYERWRSHHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMD 76
             I  G+ F H ++L   +   D + +  +  T +R+    +  +N   Q   H  Q ++  
Sbjct:    13 IDSGWAFNHGQDLVDFQTYEDNFNKTYAS-TSARNFANYYFIYNR-NQVAQHNAQADRNR 70

Query:    77 KPYKLKLNKFADMTNHEFASTYAGSKIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKG 136
               Y+  +N+F+D+   +FA+       K          +         S     D+   G
Sbjct:    71 TTYREAVNQFSDIRLIQFAALLP----KAVNTVTSAASDPPASQAASASFDIITDF---G 123

Query:   137 SVTAVKDQG-QCGSCWAFSTIAAVEGINHIMT-NKLVS-LSEQELVDCDTDQNQGCNGGL 193
                AV+DQG  C S WA++T  AVE +N + T N L S LS Q+L+DC      GC+   
Sbjct:   124 LTVAVEDQGVNCSSSWAYATAKAVEIMNAVQTANPLPSSLSAQQLLDC-AGMGTGCSTQT 182

Query:   194 MELAFEFIKK--KGGVTTEAKYPYQAN---DGTCDVSKESSPAVSIDGHENVPANHEDAL 248
                A  ++ +     +  E  YP   +    G C      S  V + G+  V  N + A+
Sbjct:   183 PLAALNYLTQLTDAYLYPEVDYPNNNSLKTPGMCQPPSSVSVGVKLAGYSTVADNDDAAV 242

Query:   249 LKAVAKQ-PVSVAIDAGSSDFQFYSEGVFTGECGTELN----HGVAAVGYGTTLDGT-KY 302
             ++ V+   PV V  +  +  F  YS GV+  E     N      +  VGY   +D    Y
Sbjct:   243 MRYVSNGFPVIVEYNPATFGFMQYSSGVYVQETRALTNPKSSQFLVVVGYDHDVDSNLDY 302

Query:   303 WIVRNSWGPEWGEKGYIRMQR 323
             W   NS+G  WGE+GYIR+ R
Sbjct:   303 WRCLNSFGDTWGEEGYIRIVR 323


>UNIPROTKB|Q9UBR2 [details] [associations]
            symbol:CTSZ "Cathepsin Z" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0060441 "epithelial tube
            branching involved in lung morphogenesis" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            Reactome:REACT_11123 Reactome:REACT_17015 InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 EMBL:CH471077 GO:GO:0005615 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AL109840 GO:GO:0060441 MEROPS:C01.013 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN
            BRENDA:3.4.18.1 EMBL:AF073890 EMBL:AF032906 EMBL:AF136273
            EMBL:AF136276 EMBL:AF136274 EMBL:AF136275 EMBL:AK314931
            EMBL:BC042168 EMBL:AF009923 IPI:IPI00002745 RefSeq:NP_001327.2
            UniGene:Hs.252549 PDB:1DEU PDB:1EF7 PDBsum:1DEU PDBsum:1EF7
            ProteinModelPortal:Q9UBR2 SMR:Q9UBR2 STRING:Q9UBR2 DMDM:12643324
            PaxDb:Q9UBR2 PeptideAtlas:Q9UBR2 PRIDE:Q9UBR2 DNASU:1522
            Ensembl:ENST00000217131 GeneID:1522 KEGG:hsa:1522 UCSC:uc002yai.2
            GeneCards:GC20M057570 HGNC:HGNC:2547 HPA:CAB025114 MIM:603169
            neXtProt:NX_Q9UBR2 PharmGKB:PA27043 InParanoid:Q9UBR2 OMA:QCGTCTE
            PhylomeDB:Q9UBR2 BindingDB:Q9UBR2 ChEMBL:CHEMBL4160 ChiTaRS:CTSZ
            EvolutionaryTrace:Q9UBR2 GenomeRNAi:1522 NextBio:6299 Bgee:Q9UBR2
            CleanEx:HS_CTSZ Genevestigator:Q9UBR2 GermOnline:ENSG00000101160
            Uniprot:Q9UBR2
        Length = 303

 Score = 220 (82.5 bits), Expect = 2.2e-17, P = 2.2e-17
 Identities = 74/222 (33%), Positives = 104/222 (46%)

Query:   129 SVDWRKKGSVTAVKDQGQ-CGSCWAF-STIAAVEGINHIMTNKLVS--LSEQELVDCDTD 184
             +VD     S+T  +   Q CGSCWA  ST A  + IN        S  LS Q ++DC   
Sbjct:    70 NVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTLLSVQNVIDCG-- 127

Query:   185 QNQG-CNGGLMELAFEFIKKKGGVTTEAKYPYQAND---------GTCDVSKESSPAVSI 234
              N G C GG  +L+      + G+  E    YQA D         GTC+  KE     + 
Sbjct:   128 -NAGSCEGG-NDLSVWDYAHQHGIPDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNY 185

Query:   235 D----GHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTG-ECGTELNHGVA 289
                  G     +  E  + +  A  P+S  I A +     Y+ G++   +  T +NH V+
Sbjct:   186 TLWRVGDYGSLSGREKMMAEIYANGPISCGIMA-TERLANYTGGIYAEYQDTTYINHVVS 244

Query:   290 AVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGI-SDKKG 330
               G+G + DGT+YWIVRNSWG  WGE+G++R+      D KG
Sbjct:   245 VAGWGIS-DGTEYWIVRNSWGEPWGERGWLRIVTSTYKDGKG 285


>UNIPROTKB|E1BTI7 [details] [associations]
            symbol:TINAG "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0005044 "scavenger receptor activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0006955 "immune
            response" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0030247 "polysaccharide binding"
            evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
            [GO:0007155 "cell adhesion" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GO:GO:0007155 GO:GO:0005604 GO:GO:0005044
            GeneTree:ENSGT00560000076599 CTD:27283 OMA:WGQLTSS
            EMBL:AADN02002720 EMBL:AADN02002721 IPI:IPI00581566
            RefSeq:XP_419905.3 UniGene:Gga.11215 Ensembl:ENSGALT00000026295
            GeneID:421888 KEGG:gga:421888 Uniprot:E1BTI7
        Length = 467

 Score = 153 (58.9 bits), Expect = 2.5e-17, Sum P(2) = 2.5e-17
 Identities = 42/123 (34%), Positives = 62/123 (50%)

Query:   211 AKYPYQANDGTCDVSKESSPAVSIDG-HENVPANHEDALLKAVAKQPVSVAIDAGSSDFQ 269
             ++Y     +G C  + E S  +   G H  V +   D + + +AK PV  AI     DF 
Sbjct:   332 SEYGKNHTNGPCPNALEDSNRLYRCGSHYRVSSKETDIMEEIMAKGPVQ-AIMKVYEDFF 390

Query:   270 FYSEGVF--TGECGTELN-HGVAAVGYGTTLDGT-----KYWIVRNSWGPEWGEKGYIRM 321
              Y EG++  + + G++   H V  +G+G+ L G      K+WI  NSWG  WGE GY R+
Sbjct:   391 LYKEGIYRHSYKAGSKWKTHSVKLLGWGS-LPGKNGQKQKFWIAANSWGKYWGENGYFRI 449

Query:   322 QRG 324
              RG
Sbjct:   450 LRG 452

 Score = 130 (50.8 bits), Expect = 2.5e-17, Sum P(2) = 2.5e-17
 Identities = 29/74 (39%), Positives = 43/74 (58%)

Query:   143 DQGQCGSCWAFSTIA-AVEGINHIMTNKLV-SLSEQELVDCDTDQNQGCNGGLMELAFEF 200
             DQ  CG+ WAFST + A + I      ++  +LS Q L+ CDT   +GCNGG ++ A+ +
Sbjct:   241 DQRNCGASWAFSTASVAADRITIHSDGQITDNLSVQNLISCDTGNQRGCNGGSIDGAWRY 300

Query:   201 IKKKGGVTTEAKYP 214
             +   G V + A YP
Sbjct:   301 LTTHG-VVSYACYP 313


>RGD|621509 [details] [associations]
            symbol:Ctsb "cathepsin B" species:10116 "Rattus norvegicus"
            [GO:0004175 "endopeptidase activity" evidence=IMP;IDA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA;ISO;IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005576
            "extracellular region" evidence=IDA] [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0005730 "nucleolus" evidence=IEA;ISO]
            [GO:0005737 "cytoplasm" evidence=IDA] [GO:0005739 "mitochondrion"
            evidence=IEA;ISO;IDA] [GO:0005764 "lysosome" evidence=IEA;ISO;IDA]
            [GO:0006508 "proteolysis" evidence=IEA;IEP;ISO;IMP;IDA;TAS]
            [GO:0006914 "autophagy" evidence=IEP] [GO:0006950 "response to
            stress" evidence=IEP] [GO:0007283 "spermatogenesis" evidence=IEP]
            [GO:0007519 "skeletal muscle tissue development" evidence=IEP]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0009611
            "response to wounding" evidence=IEP] [GO:0009612 "response to
            mechanical stimulus" evidence=IEP] [GO:0009749 "response to glucose
            stimulus" evidence=IEP] [GO:0009897 "external side of plasma
            membrane" evidence=IDA] [GO:0009986 "cell surface" evidence=IDA]
            [GO:0014070 "response to organic cyclic compound" evidence=IEP]
            [GO:0014075 "response to amine stimulus" evidence=IEP] [GO:0016324
            "apical plasma membrane" evidence=IDA] [GO:0030984 "kininogen
            binding" evidence=IPI] [GO:0032403 "protein complex binding"
            evidence=IPI] [GO:0034097 "response to cytokine stimulus"
            evidence=IEP] [GO:0042277 "peptide binding" evidence=IDA]
            [GO:0042383 "sarcolemma" evidence=IDA] [GO:0042470 "melanosome"
            evidence=IEA] [GO:0043231 "intracellular membrane-bounded
            organelle" evidence=ISO] [GO:0043434 "response to peptide hormone
            stimulus" evidence=IEP] [GO:0043621 "protein self-association"
            evidence=IDA] [GO:0045471 "response to ethanol" evidence=IEP]
            [GO:0048471 "perinuclear region of cytoplasm" evidence=ISO;IDA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0060548 "negative regulation of cell death" evidence=IMP]
            [GO:0070670 "response to interleukin-4" evidence=IEP] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA;ISO]
            [GO:0005901 "caveola" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:621509 GO:GO:0005739 GO:GO:0042470
            GO:GO:0005615 GO:GO:0009612 GO:GO:0009611 GO:GO:0009897
            GO:GO:0045471 GO:GO:0016324 GO:GO:0009749 GO:GO:0006914
            GO:GO:0043434 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0007283
            GO:GO:0005764 GO:GO:0042383 GO:GO:0043621 GO:GO:0014070
            GO:GO:0042277 GO:GO:0060548 GO:GO:0014075 GO:GO:0004197
            GO:GO:0070670 GO:GO:0007519 MEROPS:C01.060 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 GO:GO:0097067
            PANTHER:PTHR12411:SF16 EMBL:X82396 EMBL:M11305 IPI:IPI00212811
            PIR:S51041 UniGene:Rn.100909 PDB:1CPJ PDB:1CTE PDB:1MIR PDB:1THE
            PDBsum:1CPJ PDBsum:1CTE PDBsum:1MIR PDBsum:1THE
            ProteinModelPortal:P00787 SMR:P00787 STRING:P00787 PRIDE:P00787
            UCSC:RGD:621509 InParanoid:P00787 SABIO-RK:P00787 BindingDB:P00787
            ChEMBL:CHEMBL2602 EvolutionaryTrace:P00787 ArrayExpress:P00787
            Genevestigator:P00787 GermOnline:ENSRNOG00000010331 Uniprot:P00787
        Length = 339

 Score = 141 (54.7 bits), Expect = 2.7e-17, Sum P(2) = 2.7e-17
 Identities = 32/98 (32%), Positives = 50/98 (51%)

Query:   242 ANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFTGECGTELN-HGVAAVGYGTTLDG 299
             ++ E  ++  + K  PV  A     SDF  Y  GV+  E G  +  H +  +G+G   +G
Sbjct:   233 SDSEKEIMAEIYKNGPVEGAFTV-FSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIE-NG 290

Query:   300 TKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAME 337
               YW+V NSW  +WG+ G+ ++ RG    +  CGI  E
Sbjct:   291 VPYWLVANSWNVDWGDNGFFKILRG----ENHCGIESE 324

 Score = 138 (53.6 bits), Expect = 2.7e-17, Sum P(2) = 2.7e-17
 Identities = 34/91 (37%), Positives = 51/91 (56%)

Query:   125 SIPPSVDWRKKGS----VTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSL--SEQEL 178
             ++P S D R++ S    +  ++DQG CGSCWAF  + A+     I TN  V++  S ++L
Sbjct:    79 NLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDL 138

Query:   179 VDCDTDQ-NQGCNGGLMELAFEFIKKKGGVT 208
             + C   Q   GCNGG    A+ F  +KG V+
Sbjct:   139 LTCCGIQCGDGCNGGYPSGAWNFWTRKGLVS 169


>UNIPROTKB|Q6IN22 [details] [associations]
            symbol:Ctsb "Cathepsin B" species:10116 "Rattus norvegicus"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 RGD:621509 GO:GO:0005739
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 CTD:1508
            GeneTree:ENSGT00560000076599 HOVERGEN:HBG003480 KO:K01363
            OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16 UniGene:Rn.100909
            EMBL:CH474023 HSSP:P00785 EMBL:BC072490 IPI:IPI00562653
            RefSeq:NP_072119.2 SMR:Q6IN22 IntAct:Q6IN22 STRING:Q6IN22
            Ensembl:ENSRNOT00000014177 GeneID:64529 KEGG:rno:64529
            InParanoid:Q6IN22 NextBio:613362 Genevestigator:Q6IN22
            Uniprot:Q6IN22
        Length = 339

 Score = 141 (54.7 bits), Expect = 2.7e-17, Sum P(2) = 2.7e-17
 Identities = 32/98 (32%), Positives = 50/98 (51%)

Query:   242 ANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFTGECGTELN-HGVAAVGYGTTLDG 299
             ++ E  ++  + K  PV  A     SDF  Y  GV+  E G  +  H +  +G+G   +G
Sbjct:   233 SDSEKEIMAEIYKNGPVEGAFTV-FSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIE-NG 290

Query:   300 TKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAME 337
               YW+V NSW  +WG+ G+ ++ RG    +  CGI  E
Sbjct:   291 VPYWLVANSWNVDWGDNGFFKILRG----ENHCGIESE 324

 Score = 138 (53.6 bits), Expect = 2.7e-17, Sum P(2) = 2.7e-17
 Identities = 34/91 (37%), Positives = 51/91 (56%)

Query:   125 SIPPSVDWRKKGS----VTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSL--SEQEL 178
             ++P S D R++ S    +  ++DQG CGSCWAF  + A+     I TN  V++  S ++L
Sbjct:    79 NLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDL 138

Query:   179 VDCDTDQ-NQGCNGGLMELAFEFIKKKGGVT 208
             + C   Q   GCNGG    A+ F  +KG V+
Sbjct:   139 LTCCGIQCGDGCNGGYPSGAWNFWTRKGLVS 169


>WB|WBGene00000786 [details] [associations]
            symbol:cpr-6 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            PANTHER:PTHR12411:SF16 EMBL:L39894 EMBL:L39939 EMBL:FO080666
            PIR:T37274 RefSeq:NP_741818.1 UniGene:Cel.18138
            ProteinModelPortal:P43510 SMR:P43510 DIP:DIP-25139N
            MINT:MINT-1074025 STRING:P43510 MEROPS:C01.A51 PaxDb:P43510
            PRIDE:P43510 EnsemblMetazoa:C25B8.3a GeneID:180931
            KEGG:cel:CELE_C25B8.3 UCSC:C25B8.3a CTD:180931 WormBase:C25B8.3a
            InParanoid:P43510 OMA:KAKWGLM NextBio:911608 ArrayExpress:P43510
            Uniprot:P43510
        Length = 379

 Score = 141 (54.7 bits), Expect = 3.4e-17, Sum P(2) = 3.4e-17
 Identities = 31/81 (38%), Positives = 43/81 (53%)

Query:   256 PVSVAIDAGSSDFQFYSEGVFTGECGTELN--HGVAAVGYGTTLDGTKYWIVRNSWGPEW 313
             P+ +A +    DF  Y  GV+    G +L   H V  +G+G   DG  YW V NSW  +W
Sbjct:   275 PLEIAFEV-YEDFLNYDGGVYV-HTGGKLGGGHAVKLIGWGID-DGIPYWTVANSWNTDW 331

Query:   314 GEKGYIRMQRGISDKKGLCGI 334
             GE G+ R+ RG+ +    CGI
Sbjct:   332 GEDGFFRILRGVDE----CGI 348

 Score = 139 (54.0 bits), Expect = 3.4e-17, Sum P(2) = 3.4e-17
 Identities = 38/101 (37%), Positives = 52/101 (51%)

Query:   126 IPPSVD----WRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTN-KL-VSLSEQELV 179
             IP S D    W K  S+  ++DQ  CGSCWAF  + A+     I ++ +L V+LS  +L+
Sbjct:   105 IPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLL 164

Query:   180 DCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDG 220
              C      GCNGG    A+ +  K G VT      Y AN+G
Sbjct:   165 SCCKSCGFGCNGGDPLAAWRYWVKDGIVTGSN---YTANNG 202


>UNIPROTKB|F1N9D7 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9031 "Gallus gallus"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005730 "nucleolus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005739 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0005764
            GO:GO:0004197 GeneTree:ENSGT00560000076599 OMA:GYPSGAW
            GO:GO:0097067 PANTHER:PTHR12411:SF16 IPI:IPI00573387
            EMBL:AADN02018292 Ensembl:ENSGALT00000026896
            Ensembl:ENSGALT00000036723 Uniprot:F1N9D7
        Length = 340

 Score = 143 (55.4 bits), Expect = 4.2e-17, Sum P(2) = 4.2e-17
 Identities = 35/115 (30%), Positives = 54/115 (46%)

Query:   229 SPAVSIDGHEN-----VPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTGECGTE 283
             SP+   D H       VP + ++ + +     PV  A      DF  Y  GV+    G +
Sbjct:   217 SPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIV-YEDFLMYKSGVYQHVSGEQ 275

Query:   284 LN-HGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAME 337
             +  H +  +G+G   +GT YW+  NSW  +WG+ G+ ++ RG  D    CGI  E
Sbjct:   276 VGGHAIRILGWGVE-NGTPYWLAANSWNTDWGDNGFFKILRG-EDH---CGIESE 325

 Score = 134 (52.2 bits), Expect = 4.2e-17, Sum P(2) = 4.2e-17
 Identities = 30/90 (33%), Positives = 51/90 (56%)

Query:   126 IPPSVDWRKKG----SVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSL--SEQELV 179
             +P + D RK+     +++ ++DQG CGSCWAF  + A+     + TN  VS+  S ++L+
Sbjct:    80 LPDTFDSRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLL 139

Query:   180 DC-DTDQNQGCNGGLMELAFEFIKKKGGVT 208
              C   +   GCNGG    A+ +  ++G V+
Sbjct:   140 SCCGFECGMGCNGGYPSGAWRYWTERGLVS 169


>UNIPROTKB|Q9GZM7 [details] [associations]
            symbol:TINAGL1 "Tubulointerstitial nephritis antigen-like"
            species:9606 "Homo sapiens" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=IEA] [GO:0043236 "laminin binding" evidence=IEA]
            [GO:0016197 "endosomal transport" evidence=TAS] [GO:0005201
            "extracellular matrix structural constituent" evidence=NAS]
            [GO:0005576 "extracellular region" evidence=NAS] [GO:0031012
            "extracellular matrix" evidence=IDA] [GO:0005615 "extracellular
            space" evidence=ISS] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0005737 GO:GO:0005615
            GO:GO:0006955 GO:GO:0030247 EMBL:CH471059 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0016197 EMBL:AC114488 GO:GO:0005044 GO:GO:0005201
            eggNOG:NOG310046 HOGENOM:HOG000241342 HOVERGEN:HBG053961
            EMBL:AF236155 EMBL:AF236151 EMBL:AF236152 EMBL:AF236153
            EMBL:AF236154 EMBL:AF236150 EMBL:AF205436 EMBL:AB050716
            EMBL:AB050719 EMBL:AK074124 EMBL:AY358421 EMBL:AF289569
            EMBL:AK027839 EMBL:AK292770 EMBL:AK298382 EMBL:AK075398
            EMBL:BC009048 EMBL:BC064633 IPI:IPI00005563 IPI:IPI00439435
            IPI:IPI00910801 RefSeq:NP_001191343.1 RefSeq:NP_001191344.1
            RefSeq:NP_071447.1 UniGene:Hs.199368 ProteinModelPortal:Q9GZM7
            SMR:Q9GZM7 IntAct:Q9GZM7 MINT:MINT-253718 STRING:Q9GZM7
            MEROPS:C01.975 PhosphoSite:Q9GZM7 DMDM:61213628 PaxDb:Q9GZM7
            PRIDE:Q9GZM7 Ensembl:ENST00000271064 Ensembl:ENST00000457433
            GeneID:64129 KEGG:hsa:64129 UCSC:uc001bta.3 CTD:64129
            GeneCards:GC01P032042 HGNC:HGNC:19168 HPA:HPA048695
            neXtProt:NX_Q9GZM7 PharmGKB:PA38810 InParanoid:Q9GZM7 OMA:DNCNRCT
            OrthoDB:EOG4BG8W0 PhylomeDB:Q9GZM7 ChiTaRS:TINAGL1 GenomeRNAi:64129
            NextBio:66016 ArrayExpress:Q9GZM7 Bgee:Q9GZM7 CleanEx:HS_TINAGL1
            Genevestigator:Q9GZM7 GermOnline:ENSG00000142910 Uniprot:Q9GZM7
        Length = 467

 Score = 152 (58.6 bits), Expect = 5.3e-17, Sum P(2) = 5.3e-17
 Identities = 31/79 (39%), Positives = 48/79 (60%)

Query:   143 DQGQCGSCWAFSTIA-AVEGIN-HIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEF 200
             DQG C   WAFST A A + ++ H + +    LS Q L+ CDT Q QGC GG ++ A+ F
Sbjct:   222 DQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQQQGCRGGRLDGAWWF 281

Query:   201 IKKKGGVTTEAKYPYQAND 219
             ++++G V ++  YP+   +
Sbjct:   282 LRRRG-VVSDHCYPFSGRE 299

 Score = 128 (50.1 bits), Expect = 5.3e-17, Sum P(2) = 5.3e-17
 Identities = 32/99 (32%), Positives = 51/99 (51%)

Query:   242 ANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFT------GECGTELNHGVAAV---G 292
             +N ++ + + +   PV   ++    DF  Y  G+++      G       HG  +V   G
Sbjct:   348 SNDKEIMKELMENGPVQALMEV-HEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITG 406

Query:   293 YGT-TL-DGT--KYWIVRNSWGPEWGEKGYIRMQRGISD 327
             +G  TL DG   KYW   NSWGP WGE+G+ R+ RG+++
Sbjct:   407 WGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNE 445


>UNIPROTKB|F1SVA2 [details] [associations]
            symbol:TINAGL1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005615 "extracellular space" evidence=IDA] [GO:0043236
            "laminin binding" evidence=IEA] [GO:0031012 "extracellular matrix"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044
            "scavenger receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005737
            GO:GO:0005615 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0031012 GO:GO:0005044 GeneTree:ENSGT00560000076599
            OMA:DNCNRCT EMBL:CU856262 Ensembl:ENSSSCT00000003995 Uniprot:F1SVA2
        Length = 467

 Score = 148 (57.2 bits), Expect = 9.7e-17, Sum P(2) = 9.7e-17
 Identities = 30/79 (37%), Positives = 48/79 (60%)

Query:   143 DQGQCGSCWAFSTIA-AVEGIN-HIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEF 200
             DQG C   WAFST A A + ++ H + +    LS Q L+ CDT   QGC GG ++ A+ F
Sbjct:   222 DQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHNQQGCQGGRLDGAWWF 281

Query:   201 IKKKGGVTTEAKYPYQAND 219
             ++++G V ++  YP+  ++
Sbjct:   282 LRRRG-VVSDHCYPFSGHE 299

 Score = 130 (50.8 bits), Expect = 9.7e-17, Sum P(2) = 9.7e-17
 Identities = 33/99 (33%), Positives = 50/99 (50%)

Query:   242 ANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFT------GECGTELNHGVAAV---G 292
             +N +D + + +   PV   ++    DF  Y  G+++      G       HG  +V   G
Sbjct:   348 SNEKDIMKELMENGPVQALMEV-HEDFFLYQSGIYSHTPVSHGRPERYRRHGTHSVKITG 406

Query:   293 YGT-TL-DGT--KYWIVRNSWGPEWGEKGYIRMQRGISD 327
             +G  TL DG   KYW   NSWGP WGE+G+ R+ RG ++
Sbjct:   407 WGEETLPDGRMLKYWTAANSWGPGWGERGHFRIVRGANE 445


>DICTYBASE|DDB_G0283921 [details] [associations]
            symbol:ctsB "cathepsin B precursor" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0283921 GO:GO:0005615
            GenomeReviews:CM000153_GR GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:AAFI02000058
            eggNOG:NOG315657 PANTHER:PTHR12411:SF16 OMA:CSLSCQS
            RefSeq:XP_638805.1 HSSP:P07688 MEROPS:C01.A59
            EnsemblProtists:DDB0233997 GeneID:8624329 KEGG:ddi:DDB_G0283921
            Uniprot:Q54QD9
        Length = 311

 Score = 148 (57.2 bits), Expect = 1.2e-16, Sum P(2) = 1.2e-16
 Identities = 33/93 (35%), Positives = 54/93 (58%)

Query:   124 TSIPPSVDWRKKGSVTAVKDQGQCGSCWAF-STIAAVEGINHIMTNKLVSLSEQELVDCD 182
             TS     +W    +++ +++Q +CGSCWAF +T +A + +  I  N+ V LS  ++V CD
Sbjct:    81 TSFNAQTNWPNCTTISQIQNQARCGSCWAFGATESATDRLC-IHNNENVQLSFMDMVTCD 139

Query:   183 TDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPY 215
                N GC GG    A+ +++K+G V+ E   PY
Sbjct:   140 ETDN-GCEGGDAFSAWNWLRKQGAVSEEC-LPY 170

 Score = 122 (48.0 bits), Expect = 1.2e-16, Sum P(2) = 1.2e-16
 Identities = 35/115 (30%), Positives = 52/115 (45%)

Query:   221 TCDVSKESSPAVSIDGHENVPA---NHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVF 276
             T +    SS   S D H+       + ++A+++ +    PV         DF  Y  GV+
Sbjct:   192 TKECQSNSSLIYSQDKHKMAKIYSFDSDEAIMQEIVTNGPVEACFTV-FEDFLAYKSGVY 250

Query:   277 TGECGTELN-HGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRG---ISD 327
                 G +L  H V  VG+GT L+G  Y+   N W   WG+ G   ++RG   ISD
Sbjct:   251 VHTTGKDLGGHCVKLVGFGT-LNGVDYYAANNQWTTSWGDNGTFLIKRGDCGISD 304


>RGD|708479 [details] [associations]
            symbol:Ctsz "cathepsin Z" species:10116 "Rattus norvegicus"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=TAS]
            [GO:0005615 "extracellular space" evidence=IEA;ISO] [GO:0005783
            "endoplasmic reticulum" evidence=IEA;ISO] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0060441 "epithelial tube branching involved in
            lung morphogenesis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:708479 GO:GO:0005576 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004197 MEROPS:C01.013 CTD:1522 HOVERGEN:HBG004456 KO:K08568
            EMBL:AB023781 EMBL:BC091110 IPI:IPI00207663 RefSeq:NP_899159.1
            UniGene:Rn.1475 ProteinModelPortal:Q9R1T3 SMR:Q9R1T3 PRIDE:Q9R1T3
            GeneID:252929 KEGG:rno:252929 BindingDB:Q9R1T3 NextBio:624097
            Genevestigator:Q9R1T3 Uniprot:Q9R1T3
        Length = 306

 Score = 217 (81.4 bits), Expect = 1.3e-16, P = 1.3e-16
 Identities = 65/193 (33%), Positives = 91/193 (47%)

Query:   147 CGSCWAF-STIAAVEGINHIMTNKLVS--LSEQELVDCDTDQNQG-CNGGLMELAFEFIK 202
             CGSCWA  ST A  + IN        S  LS Q ++DC    N G C GG     +E+  
Sbjct:    91 CGSCWAHGSTSALADRINIKRKGAWPSTLLSVQNVIDCG---NAGSCEGGNDLPVWEYAH 147

Query:   203 KKGGVTTEAKYPYQAND---------GTCDVSKESSPAVSID----GHENVPANHEDALL 249
             K G +  E    YQA D         GTC   KE     +      G     +  E  + 
Sbjct:   148 KHG-IPDETCNNYQAKDQECDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSLSGREKMMA 206

Query:   250 KAVAKQPVSVAIDAGSSDFQFYSEGVFTG-ECGTELNHGVAAVGYGTTLDGTKYWIVRNS 308
             +  A  P+S  I A +     Y+ G++T  +    +NH ++  G+G + DG +YWIVRNS
Sbjct:   207 EIYANGPISCGIMA-TERMSNYTGGIYTEYQNQAIINHIISVAGWGVSNDGIEYWIVRNS 265

Query:   309 WGPEWGEKGYIRM 321
             WG  WGE+G++R+
Sbjct:   266 WGEPWGERGWMRI 278


>UNIPROTKB|P43233 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9031 "Gallus gallus"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 MEROPS:C01.060 CTD:1508
            HOGENOM:HOG000241341 HOVERGEN:HBG003480 KO:K01363 OrthoDB:EOG4K6G4C
            PANTHER:PTHR12411:SF16 EMBL:U18083 IPI:IPI00573387 PIR:S58770
            RefSeq:NP_990702.1 UniGene:Gga.3854 ProteinModelPortal:P43233
            SMR:P43233 STRING:P43233 PRIDE:P43233 GeneID:396329 KEGG:gga:396329
            InParanoid:P43233 NextBio:20816377 Uniprot:P43233
        Length = 340

 Score = 137 (53.3 bits), Expect = 1.7e-16, Sum P(2) = 1.7e-16
 Identities = 35/115 (30%), Positives = 53/115 (46%)

Query:   229 SPAVSIDGHEN-----VPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTGECGTE 283
             SP+   D H       VP + ++ + +     PV  A      DF  Y  GV+    G +
Sbjct:   217 SPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIV-YEDFLMYKSGVYQHVSGEQ 275

Query:   284 LN-HGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAME 337
             +  H +  +G+G   +GT YW+  NSW  +WG  G+ ++ RG  D    CGI  E
Sbjct:   276 VGGHAIRILGWGVE-NGTPYWLAANSWNTDWGITGFFKILRG-EDH---CGIESE 325

 Score = 135 (52.6 bits), Expect = 1.7e-16, Sum P(2) = 1.7e-16
 Identities = 30/90 (33%), Positives = 51/90 (56%)

Query:   126 IPPSVDWRKKG----SVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSL--SEQELV 179
             +P + D RK+     +++ ++DQG CGSCWAF  + A+     + TN  VS+  S ++L+
Sbjct:    80 LPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLL 139

Query:   180 DC-DTDQNQGCNGGLMELAFEFIKKKGGVT 208
              C   +   GCNGG    A+ +  ++G V+
Sbjct:   140 SCCGFECGMGCNGGYPSGAWRYWTERGLVS 169


>UNIPROTKB|F1PIF2 [details] [associations]
            symbol:CTSZ "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0060441 "epithelial tube branching involved
            in lung morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0005783 GO:GO:0005615 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GO:GO:0060441 GeneTree:ENSGT00560000076599 OMA:QCGTCTE
            EMBL:AAEX03014054 Ensembl:ENSCAFT00000019357 Uniprot:F1PIF2
        Length = 261

 Score = 207 (77.9 bits), Expect = 1.7e-16, P = 1.7e-16
 Identities = 70/211 (33%), Positives = 97/211 (45%)

Query:   147 CGSCWAF-STIAAVEGINHIMTNKLVS--LSEQELVDCDTDQNQG-CNGGLMELAFEFIK 202
             CGSCWA  ST A  + IN        S  LS Q ++DC    N G C GG  +L      
Sbjct:    47 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVLDC---ANAGSCEGG-NDLPVWSYA 102

Query:   203 KKGGVTTEAKYPYQAND---------GTCDVSKESSPAVSID----GHENVPANHEDALL 249
              + G+  E    YQA D         GTC   KE     +      G     +  E  + 
Sbjct:   103 HEHGIPDETCNNYQAKDQECNKFNQCGTCTEFKECHAIQNYTLWRVGDYGSLSGREKMMA 162

Query:   250 KAVAKQPVSVAIDAGSSDFQFYSEGVFTG-ECGTELNHGVAAVGYGTTLDGTKYWIVRNS 308
             +  A  P+S  I A +     Y+ G+    +    +NH ++ VG+G + DGT+YWIVRNS
Sbjct:   163 EIYANGPISCGIMA-TEKMVNYTGGIHAEYQEQAYINHVISVVGWGVS-DGTEYWIVRNS 220

Query:   309 WGPEWGEKGYIRMQRGI-SDKKGLC-GIAME 337
             WG  WGE+G++R+      D KG    +A+E
Sbjct:   221 WGEPWGERGWMRIVTSTYKDGKGASYNLAVE 251


>TAIR|locus:2060420 [details] [associations]
            symbol:AT2G22160 "AT2G22160" species:3702 "Arabidopsis
            thaliana" [GO:0005575 "cellular_component" evidence=ND] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] EMBL:CP002685
            GenomeReviews:CT485783_GR InterPro:IPR013201 Pfam:PF08246
            SMART:SM00848 EMBL:AC007168 IPI:IPI00544896 PIR:F84609
            RefSeq:NP_179806.1 UniGene:At.66231 HSSP:P25774
            ProteinModelPortal:Q9SIE8 SMR:Q9SIE8 EnsemblPlants:AT2G22160.1
            GeneID:816750 KEGG:ath:AT2G22160 TAIR:At2g22160 eggNOG:NOG297278
            InParanoid:Q9SIE8 OMA:HRCITLA PhylomeDB:Q9SIE8 ArrayExpress:Q9SIE8
            Genevestigator:Q9SIE8 Uniprot:Q9SIE8
        Length = 105

 Score = 206 (77.6 bits), Expect = 2.3e-16, P = 2.3e-16
 Identities = 42/101 (41%), Positives = 63/101 (62%)

Query:    45 HHTVSRSLDEKHKRFNVFKQNVMHVHQTNKMDKPYKLKLNKFADMTNHEFASTYAGSKIK 104
             H+ V   + +    F+VFK+N  ++ +TNK  KPYKLKLNKFA++T+ EF + +    + 
Sbjct:     3 HYLVP--IHQTESSFDVFKKNAEYIVKTNKERKPYKLKLNKFANLTDVEFVNAHTCFDMS 60

Query:   105 HHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVTAVKDQG 145
              H+    ++    F Y  +T  P S+DWR+KG+VT VKDQG
Sbjct:    61 DHKKILDSK---PFFYENMTQAPDSLDWREKGAVTNVKDQG 98


>RGD|70956 [details] [associations]
            symbol:Tinagl1 "tubulointerstitial nephritis antigen-like 1"
            species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA;ISO] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            [GO:0031012 "extracellular matrix" evidence=IEA;ISO] [GO:0043236
            "laminin binding" evidence=IEA;ISO] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 RGD:70956 GO:GO:0005737
            GO:GO:0005576 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0031012 GO:GO:0005044 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OrthoDB:EOG4BG8W0
            EMBL:AB050717 IPI:IPI00190428 RefSeq:NP_446034.1 UniGene:Rn.1256
            ProteinModelPortal:Q9EQT5 PRIDE:Q9EQT5 GeneID:94174 KEGG:rno:94174
            UCSC:RGD:70956 InParanoid:Q9EQT5 NextBio:617830 ArrayExpress:Q9EQT5
            Genevestigator:Q9EQT5 GermOnline:ENSRNOG00000013179 Uniprot:Q9EQT5
        Length = 467

 Score = 144 (55.7 bits), Expect = 2.8e-16, Sum P(2) = 2.8e-16
 Identities = 30/85 (35%), Positives = 49/85 (57%)

Query:   143 DQGQCGSCWAFSTIA-AVEGIN-HIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEF 200
             DQG C   WAFST A A + ++ H + +    LS Q L+ CDT   +GC GG ++ A+ F
Sbjct:   221 DQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSCDTHHQKGCRGGRLDGAWWF 280

Query:   201 IKKKGGVTTEAKYPYQANDGTCDVS 225
             ++++G V ++  YP+   +   + S
Sbjct:   281 LRRRG-VVSDNCYPFSGREQNDEAS 304

 Score = 130 (50.8 bits), Expect = 2.8e-16, Sum P(2) = 2.8e-16
 Identities = 35/100 (35%), Positives = 52/100 (52%)

Query:   242 ANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFT------GECGTELNHGVAAV--- 291
             A+ E  ++K + +  PV   ++    DF  Y  G+++      G       HG  +V   
Sbjct:   347 ASDEKEIMKELMENGPVQALMEV-HEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKIT 405

Query:   292 GYGT-TL-DGT--KYWIVRNSWGPEWGEKGYIRMQRGISD 327
             G+G  TL DG   KYW   NSWGP WGE+G+ R+ RGI++
Sbjct:   406 GWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGINE 445


>UNIPROTKB|Q9EQT5 [details] [associations]
            symbol:Tinagl1 "Tubulointerstitial nephritis antigen-like"
            species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 RGD:70956 GO:GO:0005737
            GO:GO:0005576 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0031012 GO:GO:0005044 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OrthoDB:EOG4BG8W0
            EMBL:AB050717 IPI:IPI00190428 RefSeq:NP_446034.1 UniGene:Rn.1256
            ProteinModelPortal:Q9EQT5 PRIDE:Q9EQT5 GeneID:94174 KEGG:rno:94174
            UCSC:RGD:70956 InParanoid:Q9EQT5 NextBio:617830 ArrayExpress:Q9EQT5
            Genevestigator:Q9EQT5 GermOnline:ENSRNOG00000013179 Uniprot:Q9EQT5
        Length = 467

 Score = 144 (55.7 bits), Expect = 2.8e-16, Sum P(2) = 2.8e-16
 Identities = 30/85 (35%), Positives = 49/85 (57%)

Query:   143 DQGQCGSCWAFSTIA-AVEGIN-HIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEF 200
             DQG C   WAFST A A + ++ H + +    LS Q L+ CDT   +GC GG ++ A+ F
Sbjct:   221 DQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSCDTHHQKGCRGGRLDGAWWF 280

Query:   201 IKKKGGVTTEAKYPYQANDGTCDVS 225
             ++++G V ++  YP+   +   + S
Sbjct:   281 LRRRG-VVSDNCYPFSGREQNDEAS 304

 Score = 130 (50.8 bits), Expect = 2.8e-16, Sum P(2) = 2.8e-16
 Identities = 35/100 (35%), Positives = 52/100 (52%)

Query:   242 ANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFT------GECGTELNHGVAAV--- 291
             A+ E  ++K + +  PV   ++    DF  Y  G+++      G       HG  +V   
Sbjct:   347 ASDEKEIMKELMENGPVQALMEV-HEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKIT 405

Query:   292 GYGT-TL-DGT--KYWIVRNSWGPEWGEKGYIRMQRGISD 327
             G+G  TL DG   KYW   NSWGP WGE+G+ R+ RGI++
Sbjct:   406 GWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGINE 445


>MGI|MGI:1891190 [details] [associations]
            symbol:Ctsz "cathepsin Z" species:10090 "Mus musculus"
            [GO:0005615 "extracellular space" evidence=ISO] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1891190 GO:GO:0005783 GO:GO:0005615 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0060441 GeneTree:ENSGT00560000076599 MEROPS:C01.013 CTD:1522
            HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN OMA:QCGTCTE
            ChiTaRS:CTSZ EMBL:AJ242663 EMBL:AF136277 EMBL:AF136278
            EMBL:BC008619 IPI:IPI00986833 RefSeq:NP_071720.1 UniGene:Mm.156919
            ProteinModelPortal:Q9WUU7 SMR:Q9WUU7 IntAct:Q9WUU7 STRING:Q9WUU7
            PaxDb:Q9WUU7 PRIDE:Q9WUU7 Ensembl:ENSMUST00000016400 GeneID:64138
            KEGG:mmu:64138 InParanoid:Q9WUU7 NextBio:319927 Bgee:Q9WUU7
            CleanEx:MM_CTSZ Genevestigator:Q9WUU7 GermOnline:ENSMUSG00000016256
            Uniprot:Q9WUU7
        Length = 306

 Score = 215 (80.7 bits), Expect = 2.9e-16, P = 2.9e-16
 Identities = 65/193 (33%), Positives = 90/193 (46%)

Query:   147 CGSCWAF-STIAAVEGINHIMTNKLVS--LSEQELVDCDTDQNQG-CNGGLMELAFEFIK 202
             CGSCWA  ST A  + IN        S  LS Q ++DC    N G C GG     +E+  
Sbjct:    91 CGSCWAHGSTSAMADRINIKRKGAWPSILLSVQNVIDCG---NAGSCEGGNDLPVWEYAH 147

Query:   203 KKGGVTTEAKYPYQAND---------GTCDVSKESSPAVSID----GHENVPANHEDALL 249
             K G +  E    YQA D         GTC   KE     +      G     +  E  + 
Sbjct:   148 KHG-IPDETCNNYQAKDQDCDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSLSGREKMMA 206

Query:   250 KAVAKQPVSVAIDAGSSDFQFYSEGVFTG-ECGTELNHGVAAVGYGTTLDGTKYWIVRNS 308
             +  A  P+S  I A +     Y+ G++   +    +NH ++  G+G + DG +YWIVRNS
Sbjct:   207 EIYANGPISCGIMA-TEMMSNYTGGIYAEHQDQAVINHIISVAGWGVSNDGIEYWIVRNS 265

Query:   309 WGPEWGEKGYIRM 321
             WG  WGEKG++R+
Sbjct:   266 WGEPWGEKGWMRI 278


>FB|FBgn0030521 [details] [associations]
            symbol:CtsB1 "Cathepsin B1" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS] [GO:0035071 "salivary gland cell autophagic cell
            death" evidence=IEP] [GO:0048102 "autophagic cell death"
            evidence=IEP] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:AE014298 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0035071
            GO:GO:0004197 MEROPS:C01.060 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 KO:K01363 PANTHER:PTHR12411:SF16
            HSSP:P07688 EMBL:AY060640 RefSeq:NP_572920.1 UniGene:Dm.3926
            SMR:Q9VY87 IntAct:Q9VY87 MINT:MINT-932864 STRING:Q9VY87
            EnsemblMetazoa:FBtr0073838 GeneID:32341 KEGG:dme:Dmel_CG10992
            UCSC:CG10992-RA FlyBase:FBgn0030521 InParanoid:Q9VY87 OMA:TEGHIRR
            OrthoDB:EOG48W9HM ChiTaRS:CG10992 GenomeRNAi:32341 NextBio:778020
            Uniprot:Q9VY87
        Length = 340

 Score = 136 (52.9 bits), Expect = 3.6e-16, Sum P(2) = 3.6e-16
 Identities = 36/111 (32%), Positives = 52/111 (46%)

Query:   116 GTFMYGKVTSIPPSVDWRKKG----SVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLV 171
             G      V  +P   D RK+     ++  ++DQG CGSCWAF  + A+     I +   V
Sbjct:    77 GDLYVNSVDELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKV 136

Query:   172 SL--SEQELVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPYQANDG 220
             +   S  +LV C      GCNGG    A+ +  +KG V+     PY +N G
Sbjct:   137 NFHFSADDLVSCCHTCGFGCNGGFPGAAWSYWTRKGIVSGG---PYGSNQG 184

 Score = 133 (51.9 bits), Expect = 3.6e-16, Sum P(2) = 3.6e-16
 Identities = 42/143 (29%), Positives = 63/143 (44%)

Query:   205 GGVTTEAKYPYQANDGTCDVSKESS-PAVSIDGHENVPANHEDALLKAVAKQPVSVAIDA 263
             GG T +  +  Q+   T D +K+    + S     NV    E+ +       PV  A   
Sbjct:   207 GGRTPKCSHVCQSGY-TVDYAKDKHFGSKSYSVRRNVREIQEEIMTNG----PVEGAFTV 261

Query:   264 GSSDFQFYSEGVFTGECGTELN-HGVAAVGYGTT-LDGTKYWIVRNSWGPEWGEKGYIRM 321
                D   Y +GV+  E G EL  H +  +G+G    +   YW++ NSW  +WG+ G+ R+
Sbjct:   262 -YEDLILYKDGVYQHEHGKELGGHAIRILGWGVWGEEKIPYWLIGNSWNTDWGDHGFFRI 320

Query:   322 QRGISDKKGLCGIAMEASYPIKK 344
              RG  D    CGI    S  + K
Sbjct:   321 LRG-QDH---CGIESSISAGLPK 339


>WB|WBGene00000789 [details] [associations]
            symbol:cpz-2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GeneTree:ENSGT00560000076599 KO:K08568 EMBL:Z81103
            HSSP:P80067 PIR:T23720 RefSeq:NP_506318.1 ProteinModelPortal:P92005
            SMR:P92005 STRING:P92005 MEROPS:C01.A41 PaxDb:P92005
            EnsemblMetazoa:M04G12.2 GeneID:179818 KEGG:cel:CELE_M04G12.2
            UCSC:M04G12.2 CTD:179818 WormBase:M04G12.2 eggNOG:NOG275763
            InParanoid:P92005 OMA:VEYWIAR NextBio:906990 Uniprot:P92005
        Length = 467

 Score = 223 (83.6 bits), Expect = 3.9e-16, P = 3.9e-16
 Identities = 58/191 (30%), Positives = 93/191 (48%)

Query:   147 CGSCWAFSTIAAV-EGINHIMTNK--LVSLSEQELVDCDTDQNQGCNGGLMELAFEFIKK 203
             CGSCW F T  A+ +  N     +  +  LS QE++DC+   N  C GG +    E  K 
Sbjct:   248 CGSCWVFGTTGALNDRFNVARKGRWPMTQLSPQEIIDCNGKGN--CQGGEIGNVLEHAKI 305

Query:   204 KGGVTTEAKYPYQANDGTCDV-----SKESSPAVSIDGHENV------PANHEDALLKAV 252
             +G +  E    Y+A +G C+      S   +   S+  +              D ++  +
Sbjct:   306 QG-LVEEGCNVYRATNGECNPYHRCGSCWPNECFSLTNYTRYYVKDYGQVQGRDKIMSEI 364

Query:   253 AKQ-PVSVAIDAGSSDFQF-YSEGVFTGECGTELNHGVAAVGYGTTLDGTKYWIVRNSWG 310
              K  P++ AI A +  F++ Y +GV++ +   E NH ++  G+G   +G +YWI RNSWG
Sbjct:   365 KKGGPIACAIGA-TKKFEYEYVKGVYSEKSDLESNHIISLTGWGVDENGVEYWIARNSWG 423

Query:   311 PEWGEKGYIRM 321
               WGE G+ R+
Sbjct:   424 EAWGELGWFRV 434


>WB|WBGene00000788 [details] [associations]
            symbol:cpz-1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] [GO:0009792 "embryo development ending in
            birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0010171 "body morphogenesis" evidence=IMP]
            [GO:0018996 "molting cycle, collagen and cuticulin-based cuticle"
            evidence=IMP] [GO:0031012 "extracellular matrix" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0009792 GO:GO:0040007 GO:GO:0002119
            GO:GO:0018996 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0010171 GO:GO:0031012
            GeneTree:ENSGT00560000076599 KO:K08568 OMA:QCGTCTE EMBL:FO081275
            EMBL:BK001409 PIR:T29872 RefSeq:NP_491023.2 HSSP:Q9UBR2
            ProteinModelPortal:G5EGP8 SMR:G5EGP8 IntAct:G5EGP8 MEROPS:C01.A38
            EnsemblMetazoa:F32B5.8 GeneID:171829 KEGG:cel:CELE_F32B5.8
            CTD:171829 WormBase:F32B5.8 NextBio:872879 Uniprot:G5EGP8
        Length = 306

 Score = 214 (80.4 bits), Expect = 4.0e-16, P = 4.0e-16
 Identities = 74/266 (27%), Positives = 116/266 (43%)

Query:    82 KLNKFADMTNHEFASTYAGS-KIKHHRMFQGTRGNGTFMYGKVTSIPPSVDWRKKGSVT- 139
             K+ K+++   +     Y  + ++  H+ +        F       +P + DWR    +  
Sbjct:    23 KVRKYSNRNRYNLKGCYKQTGRVFEHKRYDRIYETEDF---DSEDLPKTWDWRDANGINY 79

Query:   140 AVKDQGQ-----CGSCWAF-STIAAVEGINHIMTNKLVS--LSEQELVDCD---TDQNQG 188
             A  D+ Q     CGSCWAF +T A  + IN    N      LS QE++DC    T    G
Sbjct:    80 ASADRNQHIPQYCGSCWAFGATSALADRINIKRKNAWPQAYLSVQEVIDCSGAGTCVMGG 139

Query:   189 CNGGLMELAFEFIKKKGGVTTEAKYPYQANDGTCDVSKESS---PA--VSIDGHENVPAN 243
               GG+ + A E      G+  E    YQA DG CD         P    SI  +     +
Sbjct:   140 EPGGVYKYAHEH-----GIPHETCNNYQARDGKCDPYNRCGSCWPGECFSIKNYTLYKVS 194

Query:   244 -----HEDALLKAVA--KQPVSVAIDAGSSDFQFYSEGVFTGECGTELNHGVAAVGYGTT 296
                  H    +KA    K P++  I A +  F+ Y+ G++      +++H ++  G+G  
Sbjct:   195 EYGTVHGYEKMKAEIYHKGPIACGI-AATKAFETYAGGIYKEVTDEDIDHIISVHGWGVD 253

Query:   297 LD-GTKYWIVRNSWGPEWGEKGYIRM 321
              + G +YWI RNSWG  WGE G+ ++
Sbjct:   254 HESGVEYWIGRNSWGEPWGEHGWFKI 279


>UNIPROTKB|E1B9H1 [details] [associations]
            symbol:TINAGL1 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0043236 "laminin binding" evidence=IEA] [GO:0031012
            "extracellular matrix" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0005044 "scavenger receptor
            activity" evidence=IEA] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0005737 GO:GO:0006955
            GO:GO:0030247 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00639 GO:GO:0031012 GO:GO:0005044
            GeneTree:ENSGT00560000076599 OMA:DNCNRCT EMBL:DAAA02006255
            IPI:IPI00732137 Ensembl:ENSBTAT00000038022 Uniprot:E1B9H1
        Length = 469

 Score = 145 (56.1 bits), Expect = 4.5e-16, Sum P(2) = 4.5e-16
 Identities = 30/78 (38%), Positives = 47/78 (60%)

Query:   143 DQGQCGSCWAFSTIA-AVEGIN-HIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEF 200
             DQG C   WAFST A A + ++ H + +    LS Q L+ CDT   QGC GG ++ A+ F
Sbjct:   224 DQGNCAGSWAFSTAAVASDRVSIHSLGHMSPVLSPQNLLSCDTHNQQGCRGGRLDGAWWF 283

Query:   201 IKKKGGVTTEAKYPYQAN 218
             ++++G V ++  YP+  +
Sbjct:   284 LRRRG-VVSDHCYPFSGH 300

 Score = 127 (49.8 bits), Expect = 4.5e-16, Sum P(2) = 4.5e-16
 Identities = 32/99 (32%), Positives = 50/99 (50%)

Query:   242 ANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFT------GECGTELNHGVAAV---G 292
             +N ++ + + +   PV   ++    DF  Y  G+++      G       HG  +V   G
Sbjct:   350 SNEKEIMKELMENGPVQALMEV-HEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITG 408

Query:   293 YGT-TL-DGT--KYWIVRNSWGPEWGEKGYIRMQRGISD 327
             +G  TL DG   KYW   NSWGP WGE+G+ R+ RG ++
Sbjct:   409 WGEETLPDGRTIKYWTAANSWGPAWGERGHFRIVRGANE 447


>WB|WBGene00000782 [details] [associations]
            symbol:cpr-2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 eggNOG:NOG315657 GeneTree:ENSGT00560000076599
            HOGENOM:HOG000241341 PANTHER:PTHR12411:SF16 EMBL:Z81531
            RefSeq:NP_507186.3 ProteinModelPortal:O45466 SMR:O45466
            MEROPS:C01.A40 PaxDb:O45466 EnsemblMetazoa:F36D3.9 GeneID:185355
            KEGG:cel:CELE_F36D3.9 CTD:185355 WormBase:F36D3.9 OMA:FDARLRW
            Uniprot:O45466
        Length = 326

 Score = 144 (55.7 bits), Expect = 4.5e-16, Sum P(2) = 4.5e-16
 Identities = 28/69 (40%), Positives = 40/69 (57%)

Query:   267 DFQFYSEGVFTGECG-TELNHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGI 325
             DF+ Y  G++    G ++  H V  +G+GT   GT YW+  NSWG +WGE G  R+ RG+
Sbjct:   253 DFEKYKSGIYRHIAGRSKGGHAVKLIGWGTER-GTPYWLAVNSWGSQWGESGTFRILRGV 311

Query:   326 SDKKGLCGI 334
              +    CGI
Sbjct:   312 DE----CGI 316

 Score = 122 (48.0 bits), Expect = 4.5e-16, Sum P(2) = 4.5e-16
 Identities = 34/110 (30%), Positives = 54/110 (49%)

Query:   132 WRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVS--LSEQELVDC-DTDQNQG 188
             W +  S+  +++Q  CGSCWAFST   +     I +N      +S  +L+ C      +G
Sbjct:    93 WPQCKSMKLIREQSNCGSCWAFSTAEVISDRTCIASNGTQQPIISPTDLLTCCGMSCGEG 152

Query:   189 CNGGLMELAFEFIKKKGGVT------TEAK-YPYQ-ANDGTCDVSKESSP 230
             C+GG    AF++  ++G VT      T  K YP +  N   C V+ ++ P
Sbjct:   153 CDGGFPYRAFQWWARRGVVTGGDYLGTGCKPYPIRPCNSDNC-VNLQTPP 201


>WB|WBGene00009158 [details] [associations]
            symbol:F26E4.3 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005576
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005044
            GeneTree:ENSGT00560000076599 HSSP:P07711 EMBL:Z81070
            eggNOG:NOG310046 HOGENOM:HOG000241342 OMA:DNCNRCT PIR:T21421
            RefSeq:NP_492593.2 ProteinModelPortal:P90850 SMR:P90850
            PaxDb:P90850 EnsemblMetazoa:F26E4.3.1 EnsemblMetazoa:F26E4.3.2
            GeneID:172827 KEGG:cel:CELE_F26E4.3 UCSC:F26E4.3.1 CTD:172827
            WormBase:F26E4.3 InParanoid:P90850 NextBio:877161 Uniprot:P90850
        Length = 452

 Score = 144 (55.7 bits), Expect = 6.5e-16, Sum P(2) = 6.5e-16
 Identities = 39/113 (34%), Positives = 61/113 (53%)

Query:   122 KVTSIPPSVDWRKKGS--VTAVKDQGQCGSCWAFSTIA-AVEGINHIMTNKLVS-LSEQE 177
             K   +P   D R K    +  V DQG CGS W+ ST A + + +  I   ++ S LS Q+
Sbjct:   180 KPRELPEHFDARDKWGPLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQ 239

Query:   178 LVDCDTDQNQGCNGGLMELAFEFIKKKGGVTTEAKYPY---QAND-GTCDVSK 226
             L+ C+  + +GC GG ++ A+ +I+K G V  +  YPY   Q+ + G C + K
Sbjct:   240 LLSCNQHRQKGCEGGYLDRAWWYIRKLG-VVGDHCYPYVSGQSREPGHCLIPK 291

 Score = 126 (49.4 bits), Expect = 6.5e-16, Sum P(2) = 6.5e-16
 Identities = 37/126 (29%), Positives = 52/126 (41%)

Query:   212 KYPYQANDGT-CDVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQF 270
             K  Y    G  C    + S A  +     V +  ED   + +   PV         DF  
Sbjct:   291 KRDYTNRQGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVV-HEDFFM 349

Query:   271 YSEGVF-----TGECGT----ELNHGVAAVGYG---TTLDGTKYWIVRNSWGPEWGEKGY 318
             Y+ GV+       + G     E  H V  +G+G   +T    KYW+  NSWG +WGE GY
Sbjct:   350 YAGGVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGY 409

Query:   319 IRMQRG 324
              ++ RG
Sbjct:   410 FKVLRG 415


>UNIPROTKB|E2QXH3 [details] [associations]
            symbol:TINAGL1 "Uncharacterized protein" species:9615
            "Canis lupus familiaris" [GO:0043236 "laminin binding"
            evidence=IEA] [GO:0031012 "extracellular matrix" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006955 "immune response" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044 "scavenger
            receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005737
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639 GO:GO:0031012
            GO:GO:0005044 GeneTree:ENSGT00560000076599 CTD:64129 OMA:DNCNRCT
            EMBL:AAEX03001668 RefSeq:XP_535330.3 Ensembl:ENSCAFT00000035659
            GeneID:478155 KEGG:cfa:478155 NextBio:20853523 Uniprot:E2QXH3
        Length = 467

 Score = 145 (56.1 bits), Expect = 9.1e-16, Sum P(2) = 9.1e-16
 Identities = 30/75 (40%), Positives = 46/75 (61%)

Query:   143 DQGQCGSCWAFSTIA-AVEGIN-HIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEF 200
             DQG C   WAFST A A + ++ H + +    LS Q L+ CDT   QGC GG ++ A+ F
Sbjct:   222 DQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHNQQGCRGGRLDGAWWF 281

Query:   201 IKKKGGVTTEAKYPY 215
             ++++G V ++  YP+
Sbjct:   282 LRRRG-VVSDHCYPF 295

 Score = 124 (48.7 bits), Expect = 9.1e-16, Sum P(2) = 9.1e-16
 Identities = 32/98 (32%), Positives = 49/98 (50%)

Query:   243 NHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFT------GECGTELNHGVAAV---GY 293
             N ++ + + +   PV   ++    DF  Y  G+++      G       HG  +V   G+
Sbjct:   349 NEKEIMKELMENGPVQALMEV-HEDFFLYQGGIYSHTPVSLGRPERYRRHGTHSVKITGW 407

Query:   294 GT-TL-DGT--KYWIVRNSWGPEWGEKGYIRMQRGISD 327
             G  TL DG   KYW   NSWGP WGE+G+ R+ RG ++
Sbjct:   408 GEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANE 445


>MGI|MGI:2137617 [details] [associations]
            symbol:Tinagl1 "tubulointerstitial nephritis antigen-like 1"
            species:10090 "Mus musculus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            [GO:0043236 "laminin binding" evidence=IPI] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 MGI:MGI:2137617
            GO:GO:0005737 GO:GO:0005576 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GO:GO:0031012 CleanEx:MM_ARG1 GO:GO:0005044
            GeneTree:ENSGT00560000076599 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OMA:DNCNRCT
            OrthoDB:EOG4BG8W0 EMBL:AB047402 EMBL:AB050626 EMBL:BC005738
            EMBL:BC018539 IPI:IPI00115458 RefSeq:NP_001161805.1
            RefSeq:NP_075965.2 UniGene:Mm.15801 ProteinModelPortal:Q99JR5
            SMR:Q99JR5 STRING:Q99JR5 PhosphoSite:Q99JR5 PaxDb:Q99JR5
            PRIDE:Q99JR5 Ensembl:ENSMUST00000030560 Ensembl:ENSMUST00000105998
            Ensembl:ENSMUST00000105999 GeneID:94242 KEGG:mmu:94242
            InParanoid:Q99JR5 NextBio:352247 Bgee:Q99JR5 Genevestigator:Q99JR5
            GermOnline:ENSMUSG00000028776 Uniprot:Q99JR5
        Length = 466

 Score = 147 (56.8 bits), Expect = 1.1e-15, Sum P(2) = 1.1e-15
 Identities = 30/79 (37%), Positives = 47/79 (59%)

Query:   143 DQGQCGSCWAFSTIA-AVEGIN-HIMTNKLVSLSEQELVDCDTDQNQGCNGGLMELAFEF 200
             DQG C   WAFST A A + ++ H + +    LS Q L+ CDT   QGC GG ++ A+ F
Sbjct:   221 DQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSCDTHHQQGCRGGRLDGAWWF 280

Query:   201 IKKKGGVTTEAKYPYQAND 219
             ++++G V ++  YP+   +
Sbjct:   281 LRRRG-VVSDNCYPFSGRE 298

 Score = 121 (47.7 bits), Expect = 1.1e-15, Sum P(2) = 1.1e-15
 Identities = 33/97 (34%), Positives = 49/97 (50%)

Query:   245 EDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFT------GECGTELNHGVAAV---GYG 294
             E  ++K + +  PV   ++    DF  Y  G+++      G       HG  +V   G+G
Sbjct:   349 EKEIMKELMENGPVQALMEV-HEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWG 407

Query:   295 T-TL-DGT--KYWIVRNSWGPEWGEKGYIRMQRGISD 327
               TL DG   KYW   NSWGP WGE+G+ R+ RG ++
Sbjct:   408 EETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNE 444


>ZFIN|ZDB-GENE-060503-240 [details] [associations]
            symbol:tinagl1 "tubulointerstitial nephritis
            antigen-like 1" species:7955 "Danio rerio" [GO:0005044 "scavenger
            receptor activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0030414 "peptidase inhibitor activity" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0002040 "sprouting
            angiogenesis" evidence=IMP] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR008037 InterPro:IPR013128 Pfam:PF00112 Pfam:PF05375
            PROSITE:PS00524 PROSITE:PS50958 SMART:SM00201 SMART:SM00645
            ZFIN:ZDB-GENE-060503-240 GO:GO:0006955 GO:GO:0030247 GO:GO:0030414
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 GO:GO:0002040
            GO:GO:0005044 GeneTree:ENSGT00560000076599 GO:GO:0010466
            SUPFAM:SSF57283 HOVERGEN:HBG053961 MEROPS:C01.975 OMA:DNCNRCT
            EMBL:BX950864 IPI:IPI00609339 UniGene:Dr.103937
            Ensembl:ENSDART00000087096 Ensembl:ENSDART00000126228
            InParanoid:Q1LUC6 Uniprot:Q1LUC6
        Length = 471

 Score = 144 (55.7 bits), Expect = 1.2e-15, Sum P(2) = 1.2e-15
 Identities = 35/90 (38%), Positives = 51/90 (56%)

Query:   129 SVD-WRKKGSVTAVKDQGQCGSCWAFSTIA-AVEGIN-HIMTNKLVSLSEQELVDCDTDQ 185
             +VD W   G +    DQG C + WAFST A A + I+   M +    LS Q L+ CDT  
Sbjct:   206 AVDKW--PGKIHEPLDQGNCNASWAFSTAAVASDRISIQSMGHMTPQLSPQNLISCDTRH 263

Query:   186 NQGCNGGLMELAFEFIKKKGGVTTEAKYPY 215
               GC GG ++ A+ F++++G VT +  YP+
Sbjct:   264 QDGCAGGRIDGAWWFMRRRGVVTQDC-YPF 292

 Score = 124 (48.7 bits), Expect = 1.2e-15, Sum P(2) = 1.2e-15
 Identities = 30/100 (30%), Positives = 46/100 (46%)

Query:   242 ANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFT---------GECGTELNHGVAAV 291
             + +E+ ++K +    PV   ++    DF  Y  G+F           +      H V   
Sbjct:   343 STNENEIMKEIMDNGPVQAIMEV-HEDFFVYKSGIFRHTDVNYHKPSQYRKHATHSVRIT 401

Query:   292 GYGTTLDGT----KYWIVRNSWGPEWGEKGYIRMQRGISD 327
             G+G   D +    KYWI  NSWG  WGE GY R+ RG+++
Sbjct:   402 GWGEERDYSGRTRKYWIGANSWGKNWGEDGYFRIARGVNE 441


>UNIPROTKB|A5GFX7 [details] [associations]
            symbol:CTSZ "Cathepsin Z" species:9823 "Sus scrofa"
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GO:GO:0060441 GeneTree:ENSGT00560000076599 MEROPS:C01.013 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN
            OMA:QCGTCTE EMBL:CR956646 RefSeq:NP_001116576.1 UniGene:Ssc.16769
            ProteinModelPortal:A5GFX7 SMR:A5GFX7 STRING:A5GFX7
            Ensembl:ENSSSCT00000008249 GeneID:100141405 KEGG:ssc:100141405
            ArrayExpress:A5GFX7 Uniprot:A5GFX7
        Length = 304

 Score = 201 (75.8 bits), Expect = 2.6e-14, P = 2.6e-14
 Identities = 64/193 (33%), Positives = 89/193 (46%)

Query:   147 CGSCWAF-STIAAVEGINHIMTNKLVS--LSEQELVDCDTDQNQG-CNGGLMELAFEFIK 202
             CGSCWA  ST A  + IN        S  LS Q ++DC    N G C GG  +L      
Sbjct:    90 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCG---NAGSCEGG-DDLPVWAYA 145

Query:   203 KKGGVTTEAKYPYQAND---------GTCDVSKESSPAVSID----GHENVPANHEDALL 249
              + G+  E    YQA D         GTC   KE     +      G     +  E  + 
Sbjct:   146 HRHGIPDETCNNYQAKDQVCDKFNQCGTCTEFKECHVIQNYTLWKVGDYGSVSGREKMMA 205

Query:   250 KAVAKQPVSVAIDAGSSDFQFYSEGVFTG-ECGTELNHGVAAVGYGTTLDGTKYWIVRNS 308
             +  A  P+S  I A +     Y+ G++   +    +NH V+  G+G +  GT+YWIVRNS
Sbjct:   206 EIYANGPISCGIMA-TEKMSNYTGGIYAEYKDQAYINHIVSVAGWGVS-GGTEYWIVRNS 263

Query:   309 WGPEWGEKGYIRM 321
             WG  WGE+G++R+
Sbjct:   264 WGEPWGERGWMRI 276


>UNIPROTKB|P05689 [details] [associations]
            symbol:CTSZ "Cathepsin Z" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:BC122603
            EMBL:X01809 IPI:IPI00708474 PIR:A29172 RefSeq:NP_001071303.1
            UniGene:Bt.4902 ProteinModelPortal:P05689 SMR:P05689 MEROPS:C01.013
            PRIDE:P05689 GeneID:404187 KEGG:bta:404187 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 InParanoid:P05689 KO:K08568
            OrthoDB:EOG42Z4QN BRENDA:3.4.18.1 NextBio:20817615 Uniprot:P05689
        Length = 304

 Score = 198 (74.8 bits), Expect = 6.5e-14, P = 6.5e-14
 Identities = 70/219 (31%), Positives = 99/219 (45%)

Query:   147 CGSCWAF-STIAAVEGINHIMTNKLVS--LSEQELVDCDTDQNQGCNGGLMELAFEFIKK 203
             CGSCWA  ST A  + IN        S  LS Q ++DC  D    C GG     +E+  +
Sbjct:    90 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCG-DAGS-CEGGNDLPVWEYAHR 147

Query:   204 KGGVTTEAKYPYQAND---------GTCDVSKESSPAVSID----GHENVPANHEDALLK 250
              G +  E    YQA D         GTC   KE     +      G     +  E  + +
Sbjct:   148 HG-IPDETCNNYQAKDQECDKFNQCGTCTEFKECHVIKNYTLWKVGDYGSLSGREKMMAE 206

Query:   251 AVAKQPVSVAIDAGSSDFQFYSEGVFTGECGTE--LNHGVAAVGYGTTLDGTKYWIVRNS 308
                  P+S  I A +     Y+ G+++ E   +  +NH V+  G+G + DG +YWIVRNS
Sbjct:   207 IYTNGPISCGIMA-TEKMSNYTGGIYS-EYNDQAFINHIVSVAGWGVS-DGMEYWIVRNS 263

Query:   309 WGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPIKKSAT 347
             WG  WGE G++R+    S  KG  G     +  I++S T
Sbjct:   264 WGEPWGEHGWMRIVT--STYKG--GEGARYNLAIEESCT 298


>DICTYBASE|DDB_G0286055 [details] [associations]
            symbol:DDB_G0286055 "peptidase C1A family protein"
            species:44689 "Dictyostelium discoideum" [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 dictyBase:DDB_G0286055 Pfam:PF00188 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411 EMBL:AAFI02000085
            Gene3D:3.40.33.10 InterPro:IPR001283 InterPro:IPR014044
            PRINTS:PR00837 SMART:SM00198 SUPFAM:SSF55797
            ProtClustDB:CLSZ2429919 RefSeq:XP_637918.1
            ProteinModelPortal:Q54MB6 EnsemblProtists:DDB0186794 GeneID:8625429
            KEGG:ddi:DDB_G0286055 InParanoid:Q54MB6 OMA:GENGFAR Uniprot:Q54MB6
        Length = 435

 Score = 202 (76.2 bits), Expect = 1.1e-13, P = 1.1e-13
 Identities = 72/242 (29%), Positives = 110/242 (45%)

Query:   125 SIPP--SVDWRKKGSVTAVKDQGQCGSCWAFSTIAAVEGINHIMTNKLVSLSEQELVDCD 182
             S+P   S DWR  G V   KD   C S WAF+     E  + + T      S Q+L+DC 
Sbjct:   205 SVPTDGSFDWRDNGVVGFPKDSSNCASGWAFTAAGIFESRSAMRTRHRYDYSAQQLIDCI 264

Query:   183 -------TDQNQG----CN--GGLMELAFEFIKKKGGVTTEAKYPYQ-ANDGTCDVSKES 228
                    ++ + G    C+   G +  A  + +  G   T   YPY  A+   C  + +S
Sbjct:   265 NVCIIIFSNFSIGNYTKCSRFSGELNKALMYAQAYGLQATST-YPYVGASSIGCSYN-QS 322

Query:   229 SPAVSIDGHENVPANHEDALLKAVAKQ-PVSVAIDAGSSDFQFYSEGVFTGECGTEL--- 284
             S AV   G         D++++   KQ PV V I   +++F +Y+ G+F  EC   L   
Sbjct:   323 SIAVE-GGDVEYSQVGRDSIVEKCRKQGPVGVGIYV-TNEFLYYAGGIF--ECNNTLIDN 378

Query:   285 ---NHGVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGIAMEASYP 341
                NH V  VGY    +   Y+I++N++G  WGE G+ R+   ++     C IA   +Y 
Sbjct:   379 ANINHNVLLVGYN---EKDNYYIIKNNFGRTWGENGFARITADVNKD---CLIAKNPAYS 432

Query:   342 IK 343
             I+
Sbjct:   433 IQ 434


>UNIPROTKB|F1MW68 [details] [associations]
            symbol:CTSZ "Cathepsin Z" species:9913 "Bos taurus"
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0060441
            GeneTree:ENSGT00560000076599 IPI:IPI00708474 UniGene:Bt.4902
            OMA:QCGTCTE EMBL:DAAA02036315 PRIDE:F1MW68
            Ensembl:ENSBTAT00000025007 Uniprot:F1MW68
        Length = 304

 Score = 196 (74.1 bits), Expect = 1.2e-13, P = 1.2e-13
 Identities = 70/219 (31%), Positives = 99/219 (45%)

Query:   147 CGSCWAF-STIAAVEGINHIMTNKLVS--LSEQELVDCDTDQNQGCNGGLMELAFEFIKK 203
             CGSCWA  ST A  + IN        S  LS Q ++DC  D    C GG     +E+  +
Sbjct:    90 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVLDCG-DAGS-CEGGNDLPVWEYAHR 147

Query:   204 KGGVTTEAKYPYQAND---------GTCDVSKESSPAVSID----GHENVPANHEDALLK 250
              G +  E    YQA D         GTC   KE     +      G     +  E  + +
Sbjct:   148 HG-IPDETCNNYQAKDQECDKFNQCGTCTEFKECHVIKNYTLWKVGDYGSLSGREKMMAE 206

Query:   251 AVAKQPVSVAIDAGSSDFQFYSEGVFTGECGTE--LNHGVAAVGYGTTLDGTKYWIVRNS 308
                  P+S  I A +     Y+ G+++ E   +  +NH V+  G+G + DG +YWIVRNS
Sbjct:   207 IYTNGPISCGIMA-TEKMSNYTGGIYS-EYNDQAFINHIVSVAGWGVS-DGMEYWIVRNS 263

Query:   309 WGPEWGEKGYIRMQRGISDKKGLCGIAMEASYPIKKSAT 347
             WG  WGE G++R+    S  KG  G     +  I++S T
Sbjct:   264 WGEPWGEHGWMRIVT--STYKG--GEGARYNLAIEESCT 298


>RGD|1359482 [details] [associations]
            symbol:Tinag "tubulointerstitial nephritis antigen"
            species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0005604 "basement membrane"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=IEA] [GO:0006955
            "immune response" evidence=IEA] [GO:0007155 "cell adhesion"
            evidence=ISO] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR001212 InterPro:IPR013128
            Pfam:PF00112 Pfam:PF01033 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 RGD:1359482 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GO:GO:0007155 EMBL:CH473954 GO:GO:0005604
            GO:GO:0005044 MEROPS:C01.973 CTD:27283 eggNOG:NOG310046
            HOGENOM:HOG000241342 HOVERGEN:HBG053961 OMA:WGQLTSS
            OrthoDB:EOG47PX5P EMBL:BC081887 IPI:IPI00370427
            RefSeq:NP_001005549.1 UniGene:Rn.43851 STRING:Q66HF6
            Ensembl:ENSRNOT00000041567 GeneID:300846 KEGG:rno:300846
            UCSC:RGD:1359482 InParanoid:Q66HF6 NextBio:647630
            Genevestigator:Q66HF6 Uniprot:Q66HF6
        Length = 475

 Score = 128 (50.1 bits), Expect = 1.8e-13, Sum P(2) = 1.8e-13
 Identities = 32/102 (31%), Positives = 47/102 (46%)

Query:   240 VPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTGECGTE---------LNHGVAA 290
             + +N  + + + +   PV  AI     DF +Y  G++     T            H V  
Sbjct:   356 ISSNETEIMREIIQNGPVQ-AIMQVHEDFFYYKTGIYRHVVSTNEEPEKYRKLRTHAVKL 414

Query:   291 VGYGTTLDGT-----KYWIVRNSWGPEWGEKGYIRMQRGISD 327
              G+GT L G      K+WI  NSWG  WGE GY R+ RG+++
Sbjct:   415 TGWGT-LRGAQGKKEKFWIAANSWGKSWGENGYFRILRGVNE 455

 Score = 121 (47.7 bits), Expect = 1.8e-13, Sum P(2) = 1.8e-13
 Identities = 30/93 (32%), Positives = 48/93 (51%)

Query:   143 DQGQCGSCWAFSTIAAVEGINHIMTNK---LVSLSEQELVDCDTDQNQGCNGGLMELAFE 199
             DQ  C + WAFST A+V      + +K     +LS Q L+ C      GCN G ++ A+ 
Sbjct:   235 DQKNCAASWAFST-ASVAADRIAIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRAWW 293

Query:   200 FIKKKGGVTTEAKYPY----QANDGTCDVSKES 228
             F++K+G + + A YP       N+ +C ++  S
Sbjct:   294 FLRKRG-LVSHACYPLFKEQSTNNNSCAMASRS 325


>WB|WBGene00021070 [details] [associations]
            symbol:W07B8.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:FO081739 HSSP:P07688 PIR:T31730
            RefSeq:NP_503384.1 ProteinModelPortal:O16289 SMR:O16289
            EnsemblMetazoa:W07B8.1 GeneID:178613 KEGG:cel:CELE_W07B8.1
            UCSC:W07B8.1 CTD:178613 WormBase:W07B8.1 eggNOG:NOG245289
            InParanoid:O16289 OMA:TTGIYVH NextBio:901844 Uniprot:O16289
        Length = 335

 Score = 141 (54.7 bits), Expect = 4.6e-13, Sum P(2) = 4.6e-13
 Identities = 32/113 (28%), Positives = 53/113 (46%)

Query:   223 DVSKESSPAVSIDGHENVPANHEDALLKAVAKQPVSVAIDAGSSDFQFYSEGVFTGECGT 282
             D+ K+    VS+D    +P +  +     +   P+    +    DF  Y+ G++    G 
Sbjct:   220 DIDKDRHYGVSVD---QLPNSQIEIQSDVMLNGPIQATFEV-YDDFLQYTTGIYVHLTGN 275

Query:   283 ELNH-GVAAVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKKGLCGI 334
             +  H  V  +G+G    G  YW+  NSWG +WGE G  R+ RG ++    CG+
Sbjct:   276 KQGHLSVRIIGWGVW-QGVPYWLCANSWGRQWGENGTFRVLRGTNE----CGL 323

 Score = 97 (39.2 bits), Expect = 4.6e-13, Sum P(2) = 4.6e-13
 Identities = 32/105 (30%), Positives = 50/105 (47%)

Query:   124 TSIPPSVD----WRKKGSVTAVKDQGQCGSCWAFSTIAAVEGIN-HIMTN----KLVSLS 174
             + + PS D    W +  S+  + D  +C + WAF   AA E ++  +  N    K   LS
Sbjct:    74 SDLSPSFDARERWPECMSIPQINDISECKTSWAF---AAAESMSDRLCINSGGFKNTILS 130

Query:   175 EQELVDCDTDQ---NQGCNGGLMELAFEFIKKKGGVTTEAKYPYQ 216
              +EL+ C T      +GC GG    A+++I+K G + T   Y  Q
Sbjct:   131 AEELLSCCTGMFSCGEGCEGGNPFKAWQYIQKHG-IPTGGSYESQ 174

WARNING:  HSPs involving 47 database sequences were not reported due to the
          limiting value of parameter B = 250.


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.315   0.132   0.405    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      360       349   0.00099  116 3  11 23  0.44    34
                                                     34  0.45    37
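
Note on reading the scores: the bit scores printed next to each raw score in the alignments above appear to follow from the gapped Lambda and K values in the "Q=9,R=2" row, using the standard Karlin-Altschul conversion bits = (Lambda * S - ln K) / ln 2. The sketch below is illustrative only and is not part of the BLAST output; the function names are hypothetical, and the P-value helper assumes the usual Poisson relation P = 1 - exp(-E).

```python
import math

# Gapped Karlin-Altschul parameters copied from the "Q=9,R=2" row above.
LAMBDA_GAPPED = 0.244
K_GAPPED = 0.0300


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def p_from_expect(expect):
    """Poisson relation between an E-value and a P-value: P = 1 - exp(-E)."""
    # expm1 keeps precision for the very small E-values seen in this report.
    return -math.expm1(-expect)


if __name__ == "__main__":
    # Raw scores taken from three of the hits reported above.
    for raw in (215, 136, 223):
        print(raw, round(bit_score(raw), 1))
```

For very small E-values such as those in this report, P and E are numerically almost identical, which is consistent with each hit listing the same value for Expect and P.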


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  297
  No. of states in DFA:  622 (66 KB)
  Total size of DFA:  265 KB (2140 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  29.71u 0.17s 29.88t   Elapsed:  00:00:01
  Total cpu time:  29.76u 0.17s 29.93t   Elapsed:  00:00:01
  Start:  Tue May 21 05:14:36 2013   End:  Tue May 21 05:14:37 2013
WARNINGS ISSUED:  2
